llama-index-readers-wordpress


Namellama-index-readers-wordpress JSON
Version 0.2.2 PyPI version JSON
download
home_pageNone
Summaryllama-index readers wordpress integration
upload_time2024-09-20 18:51:20
maintainerbbornsztein
docs_urlNone
authorYour Name
requires_python<4.0,>=3.9
licenseMIT
keywords blog wordpress
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Wordpress Loader

```bash
pip install llama-index-readers-wordpress
```

This loader fetches the text from Wordpress blog posts using the Wordpress API. It also uses the BeautifulSoup library to parse the HTML and extract the text from the articles.

## Usage

To use this loader, you need to pass base url of the Wordpress installation
(e.g. `https://www.mysite.com`) and optionally a username, and an application
password for the user (more about application passwords
[here](https://www.paidmembershipspro.com/create-application-password-wordpress/))

```python
from llama_index.readers.wordpress import WordpressReader

loader = WordpressReader(
    url="https://www.mysite.com",
    username="my_username",
    password="my_password",
)
documents = loader.load_data()
```

This loader is designed to be used as a way to load data into
[LlamaIndex](https://github.com/run-llama/llama_index/).

## Pages and Posts

Be default, the loader retrieves both Wordpress _pages_ (static content) and
_posts_ (blog entries) from the target site. This behavior can be configured
by setting `get_pages=False` or `get_posts=False` when initializing the
`WordpressReader` object.

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "llama-index-readers-wordpress",
    "maintainer": "bbornsztein",
    "docs_url": null,
    "requires_python": "<4.0,>=3.9",
    "maintainer_email": null,
    "keywords": "blog, wordpress",
    "author": "Your Name",
    "author_email": "you@example.com",
    "download_url": "https://files.pythonhosted.org/packages/57/ad/593eb7705e5d6bbcdf4a811972bc6c726f0f3fec77759188b85881fbb3c3/llama_index_readers_wordpress-0.2.2.tar.gz",
    "platform": null,
    "description": "# Wordpress Loader\n\n```bash\npip install llama-index-readers-wordpress\n```\n\nThis loader fetches the text from Wordpress blog posts using the Wordpress API. It also uses the BeautifulSoup library to parse the HTML and extract the text from the articles.\n\n## Usage\n\nTo use this loader, you need to pass base url of the Wordpress installation\n(e.g. `https://www.mysite.com`) and optionally a username, and an application\npassword for the user (more about application passwords\n[here](https://www.paidmembershipspro.com/create-application-password-wordpress/))\n\n```python\nfrom llama_index.readers.wordpress import WordpressReader\n\nloader = WordpressReader(\n    url=\"https://www.mysite.com\",\n    username=\"my_username\",\n    password=\"my_password\",\n)\ndocuments = loader.load_data()\n```\n\nThis loader is designed to be used as a way to load data into\n[LlamaIndex](https://github.com/run-llama/llama_index/).\n\n## Pages and Posts\n\nBe default, the loader retrieves both Wordpress _pages_ (static content) and\n_posts_ (blog entries) from the target site. This behavior can be configured\nby setting `get_pages=False` or `get_posts=False` when initializing the\n`WordpressReader` object.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "llama-index readers wordpress integration",
    "version": "0.2.2",
    "project_urls": null,
    "split_keywords": [
        "blog",
        " wordpress"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "97fadaf662882e18f61ce07c60627ad05112dcf673b8c4f8a9a60a484d81436a",
                "md5": "bdaa047a24c6cab8df67bbd0ae6401ba",
                "sha256": "38b91aceacf526eb37a0b0d99008fd159e65c0d122f167a1fe458e2ca98b79d2"
            },
            "downloads": -1,
            "filename": "llama_index_readers_wordpress-0.2.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "bdaa047a24c6cab8df67bbd0ae6401ba",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.9",
            "size": 3296,
            "upload_time": "2024-09-20T18:51:19",
            "upload_time_iso_8601": "2024-09-20T18:51:19.882854Z",
            "url": "https://files.pythonhosted.org/packages/97/fa/daf662882e18f61ce07c60627ad05112dcf673b8c4f8a9a60a484d81436a/llama_index_readers_wordpress-0.2.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "57ad593eb7705e5d6bbcdf4a811972bc6c726f0f3fec77759188b85881fbb3c3",
                "md5": "2099ef5ca2e5552176ecfc516b7e45ba",
                "sha256": "18753154994f317557e154c50be58863fbf4b7c86f548a9afb401374dd0262e0"
            },
            "downloads": -1,
            "filename": "llama_index_readers_wordpress-0.2.2.tar.gz",
            "has_sig": false,
            "md5_digest": "2099ef5ca2e5552176ecfc516b7e45ba",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.9",
            "size": 2998,
            "upload_time": "2024-09-20T18:51:20",
            "upload_time_iso_8601": "2024-09-20T18:51:20.728675Z",
            "url": "https://files.pythonhosted.org/packages/57/ad/593eb7705e5d6bbcdf4a811972bc6c726f0f3fec77759188b85881fbb3c3/llama_index_readers_wordpress-0.2.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-09-20 18:51:20",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "llama-index-readers-wordpress"
}
        
Elapsed time: 0.73122s