PyScrappy


- Name: PyScrappy
- Version: 0.1.1
- Home page: https://github.com/mldsveda/PyScrappy
- Summary: Powerful web scraping tool.
- Upload time: 2022-02-26 18:03:55
- Docs URL: None
- Author: Vedant Tibrewal, Vedaant Singh
- Requires Python: >=3.6
- Keywords: pyscrappy, scraping, e-commerce, wikipedia, image scrapper, youtube, scrapy, twitter, social media, web scraping, news, stocks, songs, food, instagram, movies
- Requirements: No requirements were recorded.
- Travis-CI: No Travis.
- Coveralls test coverage: No coveralls.

<div align = "center">
  <img src="https://raw.githubusercontent.com/mldsveda/PyScrappy/main/PyScrappy.png">
  <hr>
  <br/>
</div>

## PyScrappy: powerful Python data scraping toolkit

[![forthebadge made-with-python](http://ForTheBadge.com/images/badges/made-with-python.svg)](https://www.python.org/)

[![Python 3.6](https://img.shields.io/badge/python-3.6-blue.svg)](https://www.python.org/downloads/release/python-360/)
[![PyPI Latest Release](https://img.shields.io/pypi/v/PyScrappy.svg)](https://pypi.org/project/PyScrappy/)

[![Package Status](https://img.shields.io/pypi/status/PyScrappy.svg)](https://pypi.org/project/PyScrappy/)
[![License](https://img.shields.io/pypi/l/PyScrappy.svg)](https://github.com/mldsveda/PyScrappy/blob/main/LICENSE)
![](https://img.shields.io/pypi/dm/PyScrappy)

![](https://komarev.com/ghpvc/?username=mldsveda&style=flat-square)
![stars](https://img.shields.io/github/stars/mldsveda/PyScrappy?style=social)
![forks](https://img.shields.io/github/forks/mldsveda/PyScrappy?style=social)

[![](https://img.shields.io/badge/pyscrappy-official%20documentation-blue)](https://pyscrappy.netlify.app/)

## What is it?

**PyScrappy** is a Python package that provides a fast, flexible, and exhaustive way to scrape data from a variety of sources. Designed as an easy and intuitive library, it aims to be the fundamental high-level building block for scraping **data** in Python. Additionally, it has the broader goal of becoming **the most powerful and flexible open source data scraping tool available**.

## Main Features

Here are just a few of the things that PyScrappy does well:

- Easy scraping of [**Data**](https://medium.com/analytics-vidhya/web-scraping-in-python-using-the-all-new-pyscrappy-5c136ed6906b) available on the internet.
- Returns a [**DataFrame**](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) for further analysis and research purposes.
- Automatic [**Data Scraping**](https://medium.com/analytics-vidhya/web-scraping-in-python-using-the-all-new-pyscrappy-5c136ed6906b): other than a few user input parameters, the whole process of scraping the data is automatic.
- Powerful and flexible.

## Where to get it

The source code is currently hosted on GitHub at:
https://github.com/mldsveda/PyScrappy

Binary installers for the latest released version are available at the [Python
Package Index (PyPI)](https://pypi.org/project/PyScrappy/).

```sh
pip install PyScrappy
```
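
To confirm that the install succeeded, the installed version can be read back from the environment. The following is a minimal sketch, not part of PyScrappy itself: the helper names `installed_version` and `python_is_supported` are hypothetical, and it relies on `importlib.metadata`, which is in the standard library only from Python 3.8 onward (the package itself declares `>=3.6`).

```python
import sys
from importlib import metadata  # standard library from Python 3.8 onward


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def python_is_supported(minimum=(3, 6)):
    """Compare the running interpreter with the package's requires_python (>=3.6)."""
    return sys.version_info[:2] >= minimum


if __name__ == "__main__":
    version = installed_version("PyScrappy")
    if version is None:
        print("PyScrappy is not installed; run: pip install PyScrappy")
    else:
        print(f"PyScrappy {version} is installed")
    print("Python version supported:", python_is_supported())
```

If the script reports that the package is missing, check that `pip` and the interpreter running the script belong to the same environment.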

## Dependencies

- [selenium](https://www.selenium.dev/) - Selenium is a free (open-source) automated testing framework used to validate web applications across different browsers and platforms.
- [webdriver-manager](https://pypi.org/project/webdriver-manager/) - webdriver-manager automates the handling of the driver executables (such as chromedriver.exe and geckodriver.exe) required by the Selenium WebDriver API, so the driver path does not have to be set by hand for each browser.
- [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/doc/) - Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
- [pandas](https://pandas.pydata.org/) - Pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.
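
The dependencies above suggest a common flow: a browser driver fetches a page, an HTML parser extracts fields, and the result is tabulated. The sketch below illustrates only the parse-then-tabulate step and is not PyScrappy's implementation. To stay dependency-free it uses the standard library's `html.parser` as a stand-in for Beautiful Soup and a list of dicts as a stand-in for a DataFrame; the HTML snippet, the `ItemParser` class, and `parse_items` are all hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical HTML standing in for a page fetched by a browser driver.
SAMPLE_HTML = """
<ul>
  <li class="item"><span class="name">Alpha</span><span class="price">10</span></li>
  <li class="item"><span class="name">Beta</span><span class="price">25</span></li>
</ul>
"""


class ItemParser(HTMLParser):
    """Collect the text of each <span class="..."> inside <li class="item">."""

    def __init__(self):
        super().__init__()
        self.rows = []      # one dict per <li class="item">
        self._field = None  # class of the <span> currently open, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "li" and attrs.get("class") == "item":
            self.rows.append({})
        elif tag == "span" and self.rows:
            self._field = attrs.get("class")

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None

    def handle_data(self, data):
        # Store text only while inside a named <span>; skip layout whitespace.
        if self._field and data.strip():
            self.rows[-1][self._field] = data.strip()


def parse_items(html):
    """Return one dict per item, keyed by the span's class name."""
    parser = ItemParser()
    parser.feed(html)
    parser.close()
    return parser.rows


rows = parse_items(SAMPLE_HTML)
print(rows)
```

With pandas installed, `pandas.DataFrame(rows)` would turn such a list of dicts into a DataFrame with one column per field.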

## License

[MIT](https://github.com/mldsveda/PyScrappy/blob/main/LICENSE)

## Getting Help

For usage questions, the best place to go is [Stack Overflow](https://stackoverflow.com/questions/tagged/pyscrappy).
General questions and discussions can also take place on GitHub in this [repository](https://github.com/mldsveda/PyScrappy).

## Discussion and Development

Most development discussions take place on GitHub in this [repository](https://github.com/mldsveda/PyScrappy).

Also visit the official documentation of [PyScrappy](https://pyscrappy.netlify.app/) for more information.

## Contributing to PyScrappy

All contributions, bug reports, bug fixes, documentation improvements, enhancements, and ideas are welcome.

If you are simply looking to start working with the PyScrappy codebase, navigate to the GitHub ["issues"](https://github.com/mldsveda/PyScrappy/issues) tab and start looking through interesting issues.

## End Notes

_Learn more about this package on [Medium](https://medium.com/analytics-vidhya/web-scraping-in-python-using-the-all-new-pyscrappy-5c136ed6906b)._

### **_This package is solely made for educational and research purposes._**



            

Raw data

{
    "_id": null,
    "home_page": "https://github.com/mldsveda/PyScrappy",
    "name": "PyScrappy",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": "",
    "keywords": "PyScrappy,Scraping,E-Commerce,Wikipedia,Image Scrapper,YouTube,Scrapy,Twitter,Social Media,Web Scraping,News,Stocks,Songs,Food,Instagram,Movies",
    "author": "Vedant Tibrewal, Vedaant Singh",
    "author_email": "mlds93363@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/73/1b/e8dac9b9fa3b7a1d7fcf336064c2a11ae8cfc86200a5ede8035559f12782/PyScrappy-0.1.1.tar.gz",
    "platform": "",
    "bugtrack_url": null,
    "license": "",
    "summary": "Powerful web scraping tool.",
    "version": "0.1.1",
    "split_keywords": [
        "pyscrappy",
        "scraping",
        "e-commerce",
        "wikipedia",
        "image scrapper",
        "youtube",
        "scrapy",
        "twitter",
        "social media",
        "web scraping",
        "news",
        "stocks",
        "songs",
        "food",
        "instagram",
        "movies"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "md5": "da73df65f44d3b4e2729454e94ef66ed",
                "sha256": "e60b3d62e1301a33d96d0c56892b03d4fcd103744163b2685e0086d9fb0bb939"
            },
            "downloads": -1,
            "filename": "PyScrappy-0.1.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "da73df65f44d3b4e2729454e94ef66ed",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 25950,
            "upload_time": "2022-02-26T18:03:53",
            "upload_time_iso_8601": "2022-02-26T18:03:53.971368Z",
            "url": "https://files.pythonhosted.org/packages/31/86/8e9e57ff50c3f03849cd72d6ff5a9450f2d02eac83e852b365a40514751b/PyScrappy-0.1.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "md5": "0da96ec7ccaeb4fe9fecae331f06be0c",
                "sha256": "c2b681e80079ea644c95541ab59e331717b2fe54ec842899a1024489720c397b"
            },
            "downloads": -1,
            "filename": "PyScrappy-0.1.1.tar.gz",
            "has_sig": false,
            "md5_digest": "0da96ec7ccaeb4fe9fecae331f06be0c",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6",
            "size": 17845,
            "upload_time": "2022-02-26T18:03:55",
            "upload_time_iso_8601": "2022-02-26T18:03:55.497115Z",
            "url": "https://files.pythonhosted.org/packages/73/1b/e8dac9b9fa3b7a1d7fcf336064c2a11ae8cfc86200a5ede8035559f12782/PyScrappy-0.1.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2022-02-26 18:03:55",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "github_user": "mldsveda",
    "github_project": "PyScrappy",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "pyscrappy"
}
        