<div align="center">
<h1> ReCaptaBreaker</h1>
<p><i>Auto Solve Google's Recaptcha Image Challenge with near human performance!</i></p>
<img src="https://img.shields.io/github/license/Hackear2041/ReCaptchaBreaker?style=for-the-badge">
<a href="https://pypi.org/project/recaptchabreaker"><img src="https://img.shields.io/pypi/v/recaptchabreaker?style=for-the-badge"></a>
<!-- <a href="https://github.com/QIN2DIM/recaptcha-challenger/"><img src="https://img.shields.io/github/stars/Hackear2041/ReCaptchaBreaker?style=for-the-badge"></a> -->
<br>
<br>
</div>
<!-- # ReCaptaBreaker - Auto Solve Google's Recaptcha Image Challenge with near human performance! -->
### This is one of the first library that breaks google recaptcha image challenge using autmoated system right in your device, with near human accuracy!
## Installing
You can use pypi distribution:
```shell
pip install recaptchabreaker
```
or install locally from source:
```shell
git clone https://github.com/Hackear2041/ReCaptchaBreaker.git
python setup.py install
```
Models will be downloaded on first use.
## Usage
1. You can use the solver in selenium based drivers
```python
from recaptchabreaker import ReCaptchaBreaker
breaker = RecaptchBreaker()
# Load the page, where recaptcha needs to be solved.
# The method will automatically find the captcha and solve it.
breaker.solve_captcha(driver)
```
2. You can directly classify on images as well
```python
from recaptchabreaker import ReCaptchaBreaker
breaker = RecaptchBreaker()
# Load the list of images
image_list : List[PIL.Image] = ...
for pred in breaker.solve_images():
print(pred) # Prints dictionary of label:score
```
## Hardware Requirements
Any device with python support and atleast 1GB RAM and 500MB free space should be supported. Expected classification time is ~1sec for 4*4 recaptcha Grid, and is dependent on number of CPU cores available. Faster Models Coming Soon!
## How it Works?
I train [Clip](https://huggingface.co/docs/transformers/model_doc/clip) model for classification and [Diffusion](https://huggingface.co/docs/diffusers/index) models for synthetic dataset generation. Further, using an iterative procedure, new annotated data points are collected by applying a week classifier to a real recaptcha system. The classifier model is further trained on the collected and augmented dataset resulting in a better classifier. The method is easily extendible to other forms of image captcha tasks such as hCaptcha.
## Citing
If you find this repo useful, please consider citing the following paper:
```bibtex
@software{Anonymous_ReCaptchaBreaker_Breaking_Google_s_2022,
author = {Anonymous},
month = {12},
title = {{ReCaptchaBreaker: Breaking Google's Recaptcha Image Challenge with near human performance}},
url = {https://github.com/Hackear2041/ReCaptchaBreaker},
version = {1.0.0},
year = {2022}
}
```
## Disclaimer⚠️
This repo is for educational and research purposes only. Any actions and/or activities related to the material contained on this repo is solely your responsibility. The misuse of the information in this repo can result in criminal charges brought against the persons in question. The author will not be held responsible in the event any criminal charges be brought against any individuals misusing the information in this repo to break the law. However, if you think this repo violates any of your terms of usage or policies, feel free to contact me.
Raw data
{
"_id": null,
"home_page": "https://github.com/Hackear2041/ReCaptchaBreaker",
"name": "ReCaptchaBreaker",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "python recaptcha google captcha bot captchabreak",
"author": "Hackear",
"author_email": "",
"download_url": "https://files.pythonhosted.org/packages/e0/69/79eaeafbf7ce677abdd2c8534aefd17ff546adc61c71c0f60578afd1ab4f/ReCaptchaBreaker-0.2.0.tar.gz",
"platform": null,
"description": "<div align=\"center\">\n <h1> ReCaptaBreaker</h1>\n <p><i>Auto Solve Google's Recaptcha Image Challenge with near human performance!</i></p>\n <img src=\"https://img.shields.io/github/license/Hackear2041/ReCaptchaBreaker?style=for-the-badge\">\n <a href=\"https://pypi.org/project/recaptchabreaker\"><img src=\"https://img.shields.io/pypi/v/recaptchabreaker?style=for-the-badge\"></a>\n <!-- <a href=\"https://github.com/QIN2DIM/recaptcha-challenger/\"><img src=\"https://img.shields.io/github/stars/Hackear2041/ReCaptchaBreaker?style=for-the-badge\"></a> -->\n\t<br>\n\t<br>\n</div>\n\n<!-- # ReCaptaBreaker - Auto Solve Google's Recaptcha Image Challenge with near human performance! -->\n\n### This is one of the first library that breaks google recaptcha image challenge using autmoated system right in your device, with near human accuracy!\n\n\n## Installing\n\nYou can use pypi distribution:\n```shell\npip install recaptchabreaker\n```\nor install locally from source:\n```shell\ngit clone https://github.com/Hackear2041/ReCaptchaBreaker.git\npython setup.py install\n```\nModels will be downloaded on first use.\n\n## Usage\n\n1. You can use the solver in selenium based drivers \n\n```python\nfrom recaptchabreaker import ReCaptchaBreaker\nbreaker = RecaptchBreaker() \n\n# Load the page, where recaptcha needs to be solved.\n# The method will automatically find the captcha and solve it.\nbreaker.solve_captcha(driver)\n```\n\n2. You can directly classify on images as well\n\n```python\nfrom recaptchabreaker import ReCaptchaBreaker\nbreaker = RecaptchBreaker() \n\n# Load the list of images\nimage_list : List[PIL.Image] = ...\n\nfor pred in breaker.solve_images():\n print(pred) # Prints dictionary of label:score\n```\n\n## Hardware Requirements\nAny device with python support and atleast 1GB RAM and 500MB free space should be supported. Expected classification time is ~1sec for 4*4 recaptcha Grid, and is dependent on number of CPU cores available. Faster Models Coming Soon!\n\n## How it Works?\n\nI train [Clip](https://huggingface.co/docs/transformers/model_doc/clip) model for classification and [Diffusion](https://huggingface.co/docs/diffusers/index) models for synthetic dataset generation. Further, using an iterative procedure, new annotated data points are collected by applying a week classifier to a real recaptcha system. The classifier model is further trained on the collected and augmented dataset resulting in a better classifier. The method is easily extendible to other forms of image captcha tasks such as hCaptcha. \n\n## Citing\n\nIf you find this repo useful, please consider citing the following paper:\n\n```bibtex\n@software{Anonymous_ReCaptchaBreaker_Breaking_Google_s_2022,\n\tauthor = {Anonymous},\n\tmonth = {12},\n\ttitle = {{ReCaptchaBreaker: Breaking Google's Recaptcha Image Challenge with near human performance}},\n\turl = {https://github.com/Hackear2041/ReCaptchaBreaker},\n\tversion = {1.0.0},\n\tyear = {2022}\n}\n``` \n\n## Disclaimer\u26a0\ufe0f\n This repo is for educational and research purposes only. Any actions and/or activities related to the material contained on this repo is solely your responsibility. The misuse of the information in this repo can result in criminal charges brought against the persons in question. The author will not be held responsible in the event any criminal charges be brought against any individuals misusing the information in this repo to break the law. However, if you think this repo violates any of your terms of usage or policies, feel free to contact me.\n\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "An easy tool to automatically crack Google's ReCaptcha, with near-human performance.",
"version": "0.2.0",
"split_keywords": [
"python",
"recaptcha",
"google",
"captcha",
"bot",
"captchabreak"
],
"urls": [
{
"comment_text": "",
"digests": {
"md5": "e3cc79a687a24a5222981f9732b83d44",
"sha256": "6aa9cf030c9ed65baa23d00f53ed82811e91804e3c27cd5958432d0b94c1eafd"
},
"downloads": -1,
"filename": "ReCaptchaBreaker-0.2.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "e3cc79a687a24a5222981f9732b83d44",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 8518,
"upload_time": "2022-12-30T09:53:48",
"upload_time_iso_8601": "2022-12-30T09:53:48.662602Z",
"url": "https://files.pythonhosted.org/packages/b7/d4/7fcc7ec2657196325b6281828088ccf43bf11cbd5a9d93fc624c5e043580/ReCaptchaBreaker-0.2.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"md5": "5485fb991a0f2b3ca6e6dbbed9742cbd",
"sha256": "abcf468c8fd2990038ba7463573e8fc1fd553d76c8d1edc9d6abc750eaad7532"
},
"downloads": -1,
"filename": "ReCaptchaBreaker-0.2.0.tar.gz",
"has_sig": false,
"md5_digest": "5485fb991a0f2b3ca6e6dbbed9742cbd",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 7354,
"upload_time": "2022-12-30T09:53:50",
"upload_time_iso_8601": "2022-12-30T09:53:50.465620Z",
"url": "https://files.pythonhosted.org/packages/e0/69/79eaeafbf7ce677abdd2c8534aefd17ff546adc61c71c0f60578afd1ab4f/ReCaptchaBreaker-0.2.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2022-12-30 09:53:50",
"github": true,
"gitlab": false,
"bitbucket": false,
"github_user": "Hackear2041",
"github_project": "ReCaptchaBreaker",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"requirements": [
{
"name": "torch",
"specs": []
},
{
"name": "numpy",
"specs": []
},
{
"name": "transformers",
"specs": []
},
{
"name": "tqdm",
"specs": []
}
],
"lcname": "recaptchabreaker"
}