extract-emails


Nameextract-emails JSON
Version 5.3.2 PyPI version JSON
download
home_pagehttps://github.com/dmitriiweb/extract-emails
SummaryExtract email addresses and linkedin profiles from given URL.
upload_time2023-12-29 19:24:20
maintainer
docs_urlNone
authorDmitrii Kurlov
requires_python>=3.9,<3.13
licenseMIT
keywords parser email linkedin
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Extract Emails

![Image](https://github.com/dmitriiweb/extract-emails/blob/docs_improvements/images/email.png?raw=true)

[![PyPI version](https://badge.fury.io/py/extract-emails.svg)](https://badge.fury.io/py/extract-emails)

Extract emails and linkedins profiles from a given website

**Support the project with BTC**: *bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l*

[Documentation](https://dmitriiweb.github.io/extract-emails/)

## Requirements
- Python >= 3.9

## Installation
```bash
pip install extract_emails[all]
# or
pip install extract_emails[requests]
# or
pip install extract_emails[selenium]
```

## Simple Usage
### As library
```python
from pathlib import Path

from extract_emails import DefaultFilterAndEmailFactory as Factory
from extract_emails import DefaultWorker
from extract_emails.browsers.requests_browser import RequestsBrowser as Browser
from extract_emails.data_savers import CsvSaver


websites = [
    "website1.com",
    "website2.com",
]

browser = Browser()
data_saver = CsvSaver(save_mode="a", output_path=Path("output.csv"))

for website in websites:
    factory = Factory(
        website_url=website, browser=browser, depth=5, max_links_from_page=1
    )
    worker = DefaultWorker(factory)
    data = worker.get_data()
    data_saver.save(data)
```
### As CLI tool
```bash
$ extract-emails --help

$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1
$ cat output.csv
email,page,website
bob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email
```

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/dmitriiweb/extract-emails",
    "name": "extract-emails",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.9,<3.13",
    "maintainer_email": "",
    "keywords": "parser,email,linkedin",
    "author": "Dmitrii Kurlov",
    "author_email": "dmitriik@tutanota.com",
    "download_url": "https://files.pythonhosted.org/packages/1f/0a/aab737bb136575d6fde8cac0007ebde46b2055e94c043c9d3803e4ed64e8/extract_emails-5.3.2.tar.gz",
    "platform": null,
    "description": "# Extract Emails\n\n![Image](https://github.com/dmitriiweb/extract-emails/blob/docs_improvements/images/email.png?raw=true)\n\n[![PyPI version](https://badge.fury.io/py/extract-emails.svg)](https://badge.fury.io/py/extract-emails)\n\nExtract emails and linkedins profiles from a given website\n\n**Support the project with BTC**: *bc1q0cxl5j3se0ufhr96h8x0zs8nz4t7h6krrxkd6l*\n\n[Documentation](https://dmitriiweb.github.io/extract-emails/)\n\n## Requirements\n- Python >= 3.9\n\n## Installation\n```bash\npip install extract_emails[all]\n# or\npip install extract_emails[requests]\n# or\npip install extract_emails[selenium]\n```\n\n## Simple Usage\n### As library\n```python\nfrom pathlib import Path\n\nfrom extract_emails import DefaultFilterAndEmailFactory as Factory\nfrom extract_emails import DefaultWorker\nfrom extract_emails.browsers.requests_browser import RequestsBrowser as Browser\nfrom extract_emails.data_savers import CsvSaver\n\n\nwebsites = [\n    \"website1.com\",\n    \"website2.com\",\n]\n\nbrowser = Browser()\ndata_saver = CsvSaver(save_mode=\"a\", output_path=Path(\"output.csv\"))\n\nfor website in websites:\n    factory = Factory(\n        website_url=website, browser=browser, depth=5, max_links_from_page=1\n    )\n    worker = DefaultWorker(factory)\n    data = worker.get_data()\n    data_saver.save(data)\n```\n### As CLI tool\n```bash\n$ extract-emails --help\n\n$ extract-emails --url https://en.wikipedia.org/wiki/Email -of output.csv -d 1\n$ cat output.csv\nemail,page,website\nbob@b.org,https://en.wikipedia.org/wiki/Email,https://en.wikipedia.org/wiki/Email\n```\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Extract email addresses and linkedin profiles from given URL.",
    "version": "5.3.2",
    "project_urls": {
        "Documentation": "https://dmitriiweb.github.io/extract-emails",
        "Homepage": "https://github.com/dmitriiweb/extract-emails",
        "Repository": "https://github.com/dmitriiweb/extract-emails"
    },
    "split_keywords": [
        "parser",
        "email",
        "linkedin"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f49572300860af0d2104bb49994cec1c5cab3ccf37effbb822c2a3770f081fae",
                "md5": "23c56f47d05b947f425d6074e2535cc6",
                "sha256": "d61384343e8b02f88937828758a14cbcc68c951d6008d0deec57cb8ed4d1d075"
            },
            "downloads": -1,
            "filename": "extract_emails-5.3.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "23c56f47d05b947f425d6074e2535cc6",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9,<3.13",
            "size": 30826,
            "upload_time": "2023-12-29T19:24:18",
            "upload_time_iso_8601": "2023-12-29T19:24:18.580134Z",
            "url": "https://files.pythonhosted.org/packages/f4/95/72300860af0d2104bb49994cec1c5cab3ccf37effbb822c2a3770f081fae/extract_emails-5.3.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "1f0aaab737bb136575d6fde8cac0007ebde46b2055e94c043c9d3803e4ed64e8",
                "md5": "625808babaca0dea737e1fa06227197f",
                "sha256": "84391a421e9999972d55b3a2c8f1db89597fb4c1cb2a1fd9843185b2ecdd764a"
            },
            "downloads": -1,
            "filename": "extract_emails-5.3.2.tar.gz",
            "has_sig": false,
            "md5_digest": "625808babaca0dea737e1fa06227197f",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9,<3.13",
            "size": 19636,
            "upload_time": "2023-12-29T19:24:20",
            "upload_time_iso_8601": "2023-12-29T19:24:20.981384Z",
            "url": "https://files.pythonhosted.org/packages/1f/0a/aab737bb136575d6fde8cac0007ebde46b2055e94c043c9d3803e4ed64e8/extract_emails-5.3.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-12-29 19:24:20",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "dmitriiweb",
    "github_project": "extract-emails",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "tox": true,
    "lcname": "extract-emails"
}
        
Elapsed time: 0.16968s