:Name: trustpilot-scraper
:Version: 0.10
:Home page: https://github.com/irfanalidv/trustpilot_scraper
:Summary: A Python library for scraping Trustpilot reviews.
:Upload time: 2024-04-16 20:35:08
:Author: Md Irfan Ali
:License: MIT
:Requirements: No requirements were recorded.
            
trustpilot-scraper
===================

trustpilot-scraper is a Python package for scraping Trustpilot reviews. Given the URL of a Trustpilot page, it retrieves the reviews published there.

Installation
------------

You can install trustpilot-scraper via pip:

.. code-block:: bash

    pip install trustpilot-scraper

Usage
-----

To use trustpilot-scraper, import the ``scrape_trustpilot_reviews`` function from the ``trustpilot_scraper.scraper`` module and pass it the base URL of the Trustpilot page whose reviews you want to scrape.

.. code-block:: python

    from trustpilot_scraper.scraper import scrape_trustpilot_reviews

    base_url = 'https://www.trustpilot.com/review/example.com'

    reviews = scrape_trustpilot_reviews(base_url)

    for review in reviews:
        print(review)

Output
------

The ``scrape_trustpilot_reviews`` function returns a list of dictionaries, one per Trustpilot review. Each dictionary contains the keys ``'Date'``, ``'Author'``, ``'Body'``, ``'Heading'``, ``'Rating'``, and ``'Location'``.
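
Because every review is a plain dictionary with the same six keys, the list can be written to CSV with the standard library alone. The sketch below is illustrative only: the ``reviews_to_csv`` helper is hypothetical and not part of trustpilot-scraper, and the sample review contains invented placeholder values rather than real output (the actual value types are not documented here).

.. code-block:: python

    import csv
    import io

    # Keys documented for each review dictionary.
    FIELDS = ['Date', 'Author', 'Body', 'Heading', 'Rating', 'Location']


    def reviews_to_csv(reviews, stream):
        """Write review dictionaries to an open text stream as CSV."""
        # extrasaction='ignore' skips undocumented keys; restval fills missing ones.
        writer = csv.DictWriter(stream, fieldnames=FIELDS,
                                extrasaction='ignore', restval='')
        writer.writeheader()
        writer.writerows(reviews)


    # Placeholder data shaped like the documented output (values are made up).
    sample_reviews = [
        {'Date': '2024-01-01', 'Author': 'Jane', 'Body': 'Fast delivery.',
         'Heading': 'Great', 'Rating': '5', 'Location': 'GB'},
    ]

    buffer = io.StringIO()
    reviews_to_csv(sample_reviews, buffer)
    csv_text = buffer.getvalue()
    print(csv_text)

In real use, ``sample_reviews`` would be replaced by the list returned from ``scrape_trustpilot_reviews``, and ``stream`` by a file opened with ``newline=''``.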

Features
--------

- Scrapes Trustpilot reviews from the provided base URL.
- Retrieves review data including date, author, body, heading, rating, and location.
- Handles pagination automatically to scrape all available reviews.
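
This README does not describe how pagination is implemented. One common approach, sketched below, is to request successive pages until one comes back empty. Everything in the sketch is an assumption for illustration: the ``page`` query parameter, the ``page_url`` and ``iter_all_reviews`` helpers, and the injected ``fetch_page`` callable are hypothetical and do not reflect the library's actual code.

.. code-block:: python

    from urllib.parse import urlencode


    def page_url(base_url, page):
        """Build the URL for one page (assumes a 'page' query parameter)."""
        return f"{base_url}?{urlencode({'page': page})}"


    def iter_all_reviews(base_url, fetch_page, max_pages=100):
        """Yield reviews page by page until an empty page is returned.

        fetch_page is any callable that takes a URL and returns a list of
        review dictionaries; in real use it would download and parse the page.
        """
        for page in range(1, max_pages + 1):
            reviews = fetch_page(page_url(base_url, page))
            if not reviews:
                break  # an empty page means there is nothing left to scrape
            yield from reviews


    # Stand-in for a real fetcher so the sketch runs without network access.
    base = 'https://www.trustpilot.com/review/example.com'
    fake_site = {
        page_url(base, 1): [{'Heading': 'first'}, {'Heading': 'second'}],
        page_url(base, 2): [{'Heading': 'third'}],
    }

    all_reviews = list(iter_all_reviews(base, lambda url: fake_site.get(url, [])))
    print(len(all_reviews))

The ``max_pages`` cap is a safety limit so that a fetcher that never returns an empty page cannot loop forever.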

Dependencies
------------

- ``requests``: For making HTTP requests.
- ``beautifulsoup4``: For parsing HTML content.
- ``pandas``: For data manipulation.

For more information and documentation, please visit the GitHub repository: https://github.com/irfanalidv/trustpilot_scraper.

    

            

Raw data

{
    "_id": null,
    "home_page": "https://github.com/irfanalidv/trustpilot_scraper",
    "name": "trustpilot-scraper",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": null,
    "author": "Md Irfan Ali",
    "author_email": "irfanali29@hotmail.com",
    "download_url": null,
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "A Python library for scraping Trustpilot reviews.",
    "version": "0.10",
    "project_urls": {
        "Homepage": "https://github.com/irfanalidv/trustpilot_scraper"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "01cf5e203da30fb9ae6de81ff03c34190d208e4a778e4cb027b253598a9c7531",
                "md5": "8d5f88819d4776f328c5628792e16aa7",
                "sha256": "f916303b99a675e250d4fbfc1cfa704c2c0b51dc3a5d7f3ef35ce5581c9ad168"
            },
            "downloads": -1,
            "filename": "trustpilot_scraper-0.10-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "8d5f88819d4776f328c5628792e16aa7",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 2760,
            "upload_time": "2024-04-16T20:35:08",
            "upload_time_iso_8601": "2024-04-16T20:35:08.301448Z",
            "url": "https://files.pythonhosted.org/packages/01/cf/5e203da30fb9ae6de81ff03c34190d208e4a778e4cb027b253598a9c7531/trustpilot_scraper-0.10-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-16 20:35:08",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "irfanalidv",
    "github_project": "trustpilot_scraper",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "trustpilot-scraper"
}
        