recollapse


Namerecollapse JSON
Version 1.0.0 PyPI version JSON
download
home_pagehttps://github.com/0xacb/recollapse
SummaryREcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications
upload_time2025-08-07 19:22:31
maintainerNone
docs_urlNone
author0xacb
requires_python>=3.7
licenseMIT
keywords security fuzzing regex bypass normalization waf
VCS
bugtrack_url
requirements unidecode prettytable
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # REcollapse

REcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications.

It can also be helpful to bypass WAFs and weak vulnerability mitigations. For more information, take a look at the [REcollapse blog post](https://0xacb.com/2022/11/21/recollapse/).

The goal of this tool is to generate payloads for testing. Actual fuzzing shall be done with other tools like [Burp](https://portswigger.net/burp) (intruder), [Caido](https://caido.io) (automate), [ffuf](https://github.com/ffuf/ffuf), or similar.

---------------

### Installation

**Requirements**: Python 3

`python3 setup.py install` or `pip3 install .`

**Docker**

`docker build -t recollapse .` or `docker pull 0xacb/recollapse`

---------------

### Usage
```
$ recollapse -h
usage: recollapse [-h] [-m MODES] [-e {1,2,3,4}] [-r RANGE] [-s SIZE] [-f FILE] [-an] [-mn MAXNORM] [-mt MAXTRUNC] [-nt] [-tt] [-ct]
                  [--html] [--version]
                  [input]

REcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications

positional arguments:
  input                 original input

options:
  -h, --help            show this help message and exit
  -m, --modes, -p, --positions MODES
                        variation modes. Example: 1,2,3,4,5,6,7 (default). 1: starting, 2: separator, 3: normalization, 4:
                        termination, 5: regex metacharacters, 6: case folding/upper/lower, 7: byte truncation
  -e, --encoding {1,2,3,4}
                        1: URL-encoded format (default), 2: Unicode format, 3: Raw format, 4: Double URL-encoded format
  -r, --range RANGE     range of bytes for fuzzing. Example: 0,0xff (default)
  -s, --size SIZE       number of fuzzing bytes (default: 1)
  -f, --file FILE       read input from file
  -an, --alphanum       include alphanumeric bytes in fuzzing range
  -mn, --maxnorm MAXNORM
                        maximum number of normalizations (default: 3)
  -mt, --maxtrunc MAXTRUNC
                        maximum number of truncations (default: 3)
  -nt, --normtable      print normalization table
  -tt, --trunctable     print truncation table
  -ct, --casetable      print case table
  --html                output tables in HTML format
  --version             show recollapse version
```

---------------

### Detailed options explanation

Let's consider `this_is.an_example` as the input.

**Modes**

1. Fuzz the beginning of the input: `$this_is.an_example`
2. Fuzz the before and after special characters: `this$_$is$.$an$_$example`
3. Fuzz normalization positions: replace all possible bytes according to the [normalization table](https://0xacb.com/normalization_table)
4. Fuzz the end of the input: `this_is.an_example$`
5. Fuzz regex metacharacters: replace all possible regex metacharacters: `.^$*+-?()[]{}\|`
6. Fuzz case folding/upper/lower: replace all possible bytes according to the [case table](https://0xacb.com/case_table)
7. Fuzz byte truncation: replace all possible bytes according to the [truncation table](https://0xacb.com/truncation_table)

**Encoding**

1. URL-encoded format to be used with `application/x-www-form-urlencoded` or query/body parameters: `%22this_is.an_example`
2. Unicode format to be used with `application/json`: `\u0022this_is.an_example`
3. Raw format to be used with `multipart/form-data`: `"this_is.an_example`
4. Double URL-encoded format

**Range**

Specify a range of bytes for fuzzing: `-r 1-127`. This will exclude alphanumeric characters unless the `-an` option is provided.

**Size**

Specify the size of fuzzing for modes `1`, `2` and `4`. The default approach is to fuzz all possible values for one byte. Increasing the size will consume more resources and generate many more inputs, but it can lead to finding new bypasses.

**File**

Input can be provided as a positional argument, stdin, or a file through the `-f` option.

**Alphanumeric**

By default, alphanumeric characters will be excluded from output generation, which is usually not interesting in terms of responses. You can allow this with the `-an` option.

**Maximum number or normalizations**

Not all normalization libraries have the same behavior. By default, three possibilities for normalizations are generated for each input index, which is usually enough. Use the `-mn` option to go further.

**Tables**

Use the `-nt` option to show the normalization table, the `-ct` option to show the case table, and the `-tt` option to show the truncation table. You can also use the `--html` option to output tables in HTML format.

```bash
$ recollapse -nt --html > normalization_table.html
$ recollapse -tt --html > truncation_table.html
$ recollapse -ct --html > case_table.html
```

---------------

### Examples

Using Recollapse as a command-line tool:

```bash
$ recollapse -e 1 -m 1,2,4 -r 10-11 https://legit.example.com
%0ahttps://legit.example.com
%0bhttps://legit.example.com
https%0a://legit.example.com
https%0b://legit.example.com
...

$ echo "a@b.com" | recollapse 
%00a@b.com
%01a@b.com
...

$ echo "<svg/onload=alert(1)>" | recollapse | ffuf -w - -u "https://example.com/?param=FUZZ" -mc 200,403,500
```

Using Recollapse as a library:

```python
from recollapse import Recollapse

recollapse = Recollapse(modes=Recollapse.DEFAULT_MODES,
                        encoding=Recollapse.ENCODING_RAW)
variants = recollapse.generate("<script")
for variant in variants:
    print(variant)
```

---------------

### Resources

This technique was originally presented on [BSidesLisbon 2022](https://bsideslisbon.org/)

**Blog post**: https://0xacb.com/2022/11/21/recollapse/

**Slides**:

- [nahamcon_2022_eu_till_recollapse.pdf](https://github.com/0xacb/recollapse/blob/main/slides/nahamcon_2022_eu_till_recollapse.pdf)
- [bsideslisbon_2022_till_recollapse.pdf](https://github.com/0xacb/recollapse/blob/main/slides/bsideslisbon_2022_till_recollapse.pdf)

**Videos**:

- [NahamCon 2022 EU](https://www.youtube.com/watch?v=1eLTMKWciic)
- [BSidesLisbon 2022](https://www.youtube.com/watch?v=nb91qhj5cOE)

**Tables**:

- **Normalization table**: https://0xacb.com/normalization_table
- **Case table**: https://0xacb.com/case_table
- **Truncation table**: https://0xacb.com/truncation_table

---------------

**Thanks**

- [@regala_](https://x.com/regala_)
- [@0xz3z4d45](https://x.com/0xz3z4d45)
- [@jllis](https://x.com/jllis)
- [@samwcyo](https://x.com/samwcyo)
- [@yassineaboukir](https://x.com/yassineaboukir)
- [@0xteknogeek](https://x.com/0xteknogeek)
- [@vgpinho](https://github.com/vgpinho)
- [@ryancbarnett](https://x.com/ryancbarnett)
- **BBAC**

and

- [@ethiack](https://x.com/ethiack) team
- [@0xdisturbance](https://x.com/0xdisturbance) team
- [@hacker0x01](https://x.com/hacker0x01) team

---------------

### ⚠ Legal Disclaimer ⚠

This project is made for educational and ethical testing purposes only. Usage of this tool for attacking targets without prior mutual consent is illegal. Developers assume no liability and are not responsible for any misuse or damage caused by this tool.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/0xacb/recollapse",
    "name": "recollapse",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": null,
    "keywords": "security, fuzzing, regex, bypass, normalization, waf",
    "author": "0xacb",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/ea/c5/1570bba70e0a16af55dfe56eaf6948e069ee82000c74f64428e995fefb03/recollapse-1.0.0.tar.gz",
    "platform": null,
    "description": "# REcollapse\n\nREcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications.\n\nIt can also be helpful to bypass WAFs and weak vulnerability mitigations. For more information, take a look at the [REcollapse blog post](https://0xacb.com/2022/11/21/recollapse/).\n\nThe goal of this tool is to generate payloads for testing. Actual fuzzing shall be done with other tools like [Burp](https://portswigger.net/burp) (intruder), [Caido](https://caido.io) (automate), [ffuf](https://github.com/ffuf/ffuf), or similar.\n\n---------------\n\n### Installation\n\n**Requirements**: Python 3\n\n`python3 setup.py install` or `pip3 install .`\n\n**Docker**\n\n`docker build -t recollapse .` or `docker pull 0xacb/recollapse`\n\n---------------\n\n### Usage\n```\n$ recollapse -h\nusage: recollapse [-h] [-m MODES] [-e {1,2,3,4}] [-r RANGE] [-s SIZE] [-f FILE] [-an] [-mn MAXNORM] [-mt MAXTRUNC] [-nt] [-tt] [-ct]\n                  [--html] [--version]\n                  [input]\n\nREcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications\n\npositional arguments:\n  input                 original input\n\noptions:\n  -h, --help            show this help message and exit\n  -m, --modes, -p, --positions MODES\n                        variation modes. Example: 1,2,3,4,5,6,7 (default). 1: starting, 2: separator, 3: normalization, 4:\n                        termination, 5: regex metacharacters, 6: case folding/upper/lower, 7: byte truncation\n  -e, --encoding {1,2,3,4}\n                        1: URL-encoded format (default), 2: Unicode format, 3: Raw format, 4: Double URL-encoded format\n  -r, --range RANGE     range of bytes for fuzzing. Example: 0,0xff (default)\n  -s, --size SIZE       number of fuzzing bytes (default: 1)\n  -f, --file FILE       read input from file\n  -an, --alphanum       include alphanumeric bytes in fuzzing range\n  -mn, --maxnorm MAXNORM\n                        maximum number of normalizations (default: 3)\n  -mt, --maxtrunc MAXTRUNC\n                        maximum number of truncations (default: 3)\n  -nt, --normtable      print normalization table\n  -tt, --trunctable     print truncation table\n  -ct, --casetable      print case table\n  --html                output tables in HTML format\n  --version             show recollapse version\n```\n\n---------------\n\n### Detailed options explanation\n\nLet's consider `this_is.an_example` as the input.\n\n**Modes**\n\n1. Fuzz the beginning of the input: `$this_is.an_example`\n2. Fuzz the before and after special characters: `this$_$is$.$an$_$example`\n3. Fuzz normalization positions: replace all possible bytes according to the [normalization table](https://0xacb.com/normalization_table)\n4. Fuzz the end of the input: `this_is.an_example$`\n5. Fuzz regex metacharacters: replace all possible regex metacharacters: `.^$*+-?()[]{}\\|`\n6. Fuzz case folding/upper/lower: replace all possible bytes according to the [case table](https://0xacb.com/case_table)\n7. Fuzz byte truncation: replace all possible bytes according to the [truncation table](https://0xacb.com/truncation_table)\n\n**Encoding**\n\n1. URL-encoded format to be used with `application/x-www-form-urlencoded` or query/body parameters: `%22this_is.an_example`\n2. Unicode format to be used with `application/json`: `\\u0022this_is.an_example`\n3. Raw format to be used with `multipart/form-data`: `\"this_is.an_example`\n4. Double URL-encoded format\n\n**Range**\n\nSpecify a range of bytes for fuzzing: `-r 1-127`. This will exclude alphanumeric characters unless the `-an` option is provided.\n\n**Size**\n\nSpecify the size of fuzzing for modes `1`, `2` and `4`. The default approach is to fuzz all possible values for one byte. Increasing the size will consume more resources and generate many more inputs, but it can lead to finding new bypasses.\n\n**File**\n\nInput can be provided as a positional argument, stdin, or a file through the `-f` option.\n\n**Alphanumeric**\n\nBy default, alphanumeric characters will be excluded from output generation, which is usually not interesting in terms of responses. You can allow this with the `-an` option.\n\n**Maximum number or normalizations**\n\nNot all normalization libraries have the same behavior. By default, three possibilities for normalizations are generated for each input index, which is usually enough. Use the `-mn` option to go further.\n\n**Tables**\n\nUse the `-nt` option to show the normalization table, the `-ct` option to show the case table, and the `-tt` option to show the truncation table. You can also use the `--html` option to output tables in HTML format.\n\n```bash\n$ recollapse -nt --html > normalization_table.html\n$ recollapse -tt --html > truncation_table.html\n$ recollapse -ct --html > case_table.html\n```\n\n---------------\n\n### Examples\n\nUsing Recollapse as a command-line tool:\n\n```bash\n$ recollapse -e 1 -m 1,2,4 -r 10-11 https://legit.example.com\n%0ahttps://legit.example.com\n%0bhttps://legit.example.com\nhttps%0a://legit.example.com\nhttps%0b://legit.example.com\n...\n\n$ echo \"a@b.com\" | recollapse \n%00a@b.com\n%01a@b.com\n...\n\n$ echo \"<svg/onload=alert(1)>\" | recollapse | ffuf -w - -u \"https://example.com/?param=FUZZ\" -mc 200,403,500\n```\n\nUsing Recollapse as a library:\n\n```python\nfrom recollapse import Recollapse\n\nrecollapse = Recollapse(modes=Recollapse.DEFAULT_MODES,\n                        encoding=Recollapse.ENCODING_RAW)\nvariants = recollapse.generate(\"<script\")\nfor variant in variants:\n    print(variant)\n```\n\n---------------\n\n### Resources\n\nThis technique was originally presented on [BSidesLisbon 2022](https://bsideslisbon.org/)\n\n**Blog post**: https://0xacb.com/2022/11/21/recollapse/\n\n**Slides**:\n\n- [nahamcon_2022_eu_till_recollapse.pdf](https://github.com/0xacb/recollapse/blob/main/slides/nahamcon_2022_eu_till_recollapse.pdf)\n- [bsideslisbon_2022_till_recollapse.pdf](https://github.com/0xacb/recollapse/blob/main/slides/bsideslisbon_2022_till_recollapse.pdf)\n\n**Videos**:\n\n- [NahamCon 2022 EU](https://www.youtube.com/watch?v=1eLTMKWciic)\n- [BSidesLisbon 2022](https://www.youtube.com/watch?v=nb91qhj5cOE)\n\n**Tables**:\n\n- **Normalization table**: https://0xacb.com/normalization_table\n- **Case table**: https://0xacb.com/case_table\n- **Truncation table**: https://0xacb.com/truncation_table\n\n---------------\n\n**Thanks**\n\n- [@regala_](https://x.com/regala_)\n- [@0xz3z4d45](https://x.com/0xz3z4d45)\n- [@jllis](https://x.com/jllis)\n- [@samwcyo](https://x.com/samwcyo)\n- [@yassineaboukir](https://x.com/yassineaboukir)\n- [@0xteknogeek](https://x.com/0xteknogeek)\n- [@vgpinho](https://github.com/vgpinho)\n- [@ryancbarnett](https://x.com/ryancbarnett)\n- **BBAC**\n\nand\n\n- [@ethiack](https://x.com/ethiack) team\n- [@0xdisturbance](https://x.com/0xdisturbance) team\n- [@hacker0x01](https://x.com/hacker0x01) team\n\n---------------\n\n### \u26a0 Legal Disclaimer \u26a0\n\nThis project is made for educational and ethical testing purposes only. Usage of this tool for attacking targets without prior mutual consent is illegal. Developers assume no liability and are not responsible for any misuse or damage caused by this tool.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "REcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications",
    "version": "1.0.0",
    "project_urls": {
        "Bug Reports": "https://github.com/0xacb/recollapse/issues",
        "Documentation": "https://github.com/0xacb/recollapse#readme",
        "Homepage": "https://github.com/0xacb/recollapse",
        "Source": "https://github.com/0xacb/recollapse"
    },
    "split_keywords": [
        "security",
        " fuzzing",
        " regex",
        " bypass",
        " normalization",
        " waf"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "c46b269ad63aaa3833f30d6e25673cc134628c2172eb867b2a179a3ec720dff1",
                "md5": "8fc605527a4bcce80c59a514408fb94e",
                "sha256": "6e978f4bf77fa6fb57ff86d7460a52fe12115487553df23b5725625a65a5ef7d"
            },
            "downloads": -1,
            "filename": "recollapse-1.0.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "8fc605527a4bcce80c59a514408fb94e",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.7",
            "size": 9710,
            "upload_time": "2025-08-07T19:22:29",
            "upload_time_iso_8601": "2025-08-07T19:22:29.658990Z",
            "url": "https://files.pythonhosted.org/packages/c4/6b/269ad63aaa3833f30d6e25673cc134628c2172eb867b2a179a3ec720dff1/recollapse-1.0.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "eac51570bba70e0a16af55dfe56eaf6948e069ee82000c74f64428e995fefb03",
                "md5": "e29106fc1a41d2364a9917475daa42ac",
                "sha256": "4098735d86ae97e5e73922f1ee5cf0914ef25c840e1aa5619dee413975e9fc94"
            },
            "downloads": -1,
            "filename": "recollapse-1.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "e29106fc1a41d2364a9917475daa42ac",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 15752,
            "upload_time": "2025-08-07T19:22:31",
            "upload_time_iso_8601": "2025-08-07T19:22:31.254179Z",
            "url": "https://files.pythonhosted.org/packages/ea/c5/1570bba70e0a16af55dfe56eaf6948e069ee82000c74f64428e995fefb03/recollapse-1.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-07 19:22:31",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "0xacb",
    "github_project": "recollapse",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "unidecode",
            "specs": []
        },
        {
            "name": "prettytable",
            "specs": []
        }
    ],
    "lcname": "recollapse"
}
        
Elapsed time: 1.10166s