# Extract Emails
![Image](https://github.com/dmitriiweb/extract-emails/blob/docs_improvements/images/email.png?raw=true)
[![PyPI version](https://badge.fury.io/py/extract-emails.svg)](https://badge.fury.io/py/extract-emails)
Extract emails and linkedins profiles from a given website
**Support the project with BTC**: *bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l*
[Documentation](https://dmitriiweb.github.io/extract-emails/)
## Requirements
- Python >= 3.9
## Installation
```bash
pip install extract_emails[all]
# or
pip install extract_emails[requests]
# or
pip install extract_emails[selenium]
```
## Simple Usage
### As library
```python
from pathlib import Path
from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails.data_savers import CsvSaver
websites = [
"website1.com",
"website2.com",
]
browser = Browser()
data_saver = CsvSaver(save_mode="a", output_path=Path("output.csv"))
for website in websites:
factory = Factory(
website_url=website, browser=browser, depth=5, max_links_from_page=1
)
worker = DefaultWorker(factory)
data = worker.get_data()
data_saver.save(data)
```
### As CLI tool
```bash
$ extract-emails --help
$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
```
Raw data
{
"_id": null,
"home_page": "https://github.com/dmitriiweb/extract-emails",
"name": "extract-emails",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.9,<3.13",
"maintainer_email": "",
"keywords": "parser,email,linkedin",
"author": "Dmitrii Kurlov",
"author_email": "dmitriik@tutanota.com",
"download_url": "https://files.pythonhosted.org/packages/1f/0a/aab737bb136575d6fde8cac0007ebde46b2055e94c043c9d3803e4ed64e8/extract_emails-5.3.2.tar.gz",
"platform": null,
"description": "# Extract Emails\n\n![Image](https://github.com/dmitriiweb/extract-emails/blob/docs_improvements/images/email.png?raw=true)\n\n[![PyPI version](https://badge.fury.io/py/extract-emails.svg)](https://badge.fury.io/py/extract-emails)\n\nExtract emails and linkedins profiles from a given website\n\n**Support the project with BTC**: *bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l*\n\n[Documentation](https://dmitriiweb.github.io/extract-emails/)\n\n## Requirements\n- Python >= 3.9\n\n## Installation\n```bash\npip install extract_emails[all]\n# or\npip install extract_emails[requests]\n# or\npip install extract_emails[selenium]\n```\n\n## Simple Usage\n### As library\n```python\nfrom pathlib import Path\n\nfrom extract_emails import DefaultFilterAndEmailFactory as Factory\nfrom extract_emails import DefaultWorker\nfrom extract_emails.browsers.requests_browser import RequestsBrowser as Browser\nfrom extract_emails.data_savers import CsvSaver\n\n\nwebsites = [\n \"website1.com\",\n \"website2.com\",\n]\n\nbrowser = Browser()\ndata_saver = CsvSaver(save_mode=\"a\", output_path=Path(\"output.csv\"))\n\nfor website in websites:\n factory = Factory(\n website_url=website, browser=browser, depth=5, max_links_from_page=1\n )\n worker = DefaultWorker(factory)\n data = worker.get_data()\n data_saver.save(data)\n```\n### As CLI tool\n```bash\n$ extract-emails --help\n\n$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1\n$ cat output.csv\nemail,page,website\nbob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email\n```\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Extract email addresses and linkedin profiles from given URL.",
"version": "5.3.2",
"project_urls": {
"Documentation": "https://dmitriiweb.github.io/extract-emails",
"Homepage": "https://github.com/dmitriiweb/extract-emails",
"Repository": "https://github.com/dmitriiweb/extract-emails"
},
"split_keywords": [
"parser",
"email",
"linkedin"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "f49572300860af0d2104bb49994cec1c5cab3ccf37effbb822c2a3770f081fae",
"md5": "23c56f47d05b947f425d6074e2535cc6",
"sha256": "d61384343e8b02f88937828758a14cbcc68c951d6008d0deec57cb8ed4d1d075"
},
"downloads": -1,
"filename": "extract_emails-5.3.2-py3-none-any.whl",
"has_sig": false,
"md5_digest": "23c56f47d05b947f425d6074e2535cc6",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.9,<3.13",
"size": 30826,
"upload_time": "2023-12-29T19:24:18",
"upload_time_iso_8601": "2023-12-29T19:24:18.580134Z",
"url": "https://files.pythonhosted.org/packages/f4/95/72300860af0d2104bb49994cec1c5cab3ccf37effbb822c2a3770f081fae/extract_emails-5.3.2-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "1f0aaab737bb136575d6fde8cac0007ebde46b2055e94c043c9d3803e4ed64e8",
"md5": "625808babaca0dea737e1fa06227197f",
"sha256": "84391a421e9999972d55b3a2c8f1db89597fb4c1cb2a1fd9843185b2ecdd764a"
},
"downloads": -1,
"filename": "extract_emails-5.3.2.tar.gz",
"has_sig": false,
"md5_digest": "625808babaca0dea737e1fa06227197f",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.9,<3.13",
"size": 19636,
"upload_time": "2023-12-29T19:24:20",
"upload_time_iso_8601": "2023-12-29T19:24:20.981384Z",
"url": "https://files.pythonhosted.org/packages/1f/0a/aab737bb136575d6fde8cac0007ebde46b2055e94c043c9d3803e4ed64e8/extract_emails-5.3.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-12-29 19:24:20",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "dmitriiweb",
"github_project": "extract-emails",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"tox": true,
"lcname": "extract-emails"
}