# WordPress Loader
```bash
pip install llama-index-readers-wordpress
```
This loader fetches the text of WordPress posts and pages through the WordPress REST API, then uses the BeautifulSoup library to parse the returned HTML and extract the text.
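
The HTML-to-text step described above can be illustrated with a minimal sketch. It is not the loader's actual code: it swaps BeautifulSoup for Python's standard-library `html.parser` so that it runs without extra dependencies, and the names `TextExtractor` and `html_to_text` are hypothetical.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect the text nodes of an HTML fragment, ignoring the tags.

    Hypothetical stand-in for the BeautifulSoup-based extraction.
    """

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Called for every run of text found between tags.
        stripped = data.strip()
        if stripped:
            self.chunks.append(stripped)


def html_to_text(html):
    """Return the visible text of an HTML string, joined by single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Rendered post content, as a WordPress site might return it.
sample = "<h2>Hello</h2><p>First <em>post</em> body.</p>"
print(html_to_text(sample))
```

With BeautifulSoup installed, a comparable result is usually obtained with `BeautifulSoup(html, "html.parser").get_text()`.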
## Usage
To use this loader, pass the base URL of the WordPress installation
(e.g. `https://www.mysite.com`) and, optionally, a username and an application
password for that user (more about application passwords
[here](https://www.paidmembershipspro.com/create-application-password-wordpress/)).
```python
from llama_index.readers.wordpress import WordpressReader
loader = WordpressReader(
    url="https://www.mysite.com",
    username="my_username",
    password="my_password",
)
documents = loader.load_data()
```
This loader is designed to be used as a way to load data into
[LlamaIndex](https://github.com/run-llama/llama_index/).
## Pages and Posts
By default, the loader retrieves both WordPress _pages_ (static content) and
_posts_ (blog entries) from the target site. This behavior can be configured
by setting `get_pages=False` or `get_posts=False` when initializing the
`WordpressReader` object.
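
As a sketch of that configuration, the snippet below builds the keyword arguments for a posts-only loader. The `url`, `get_pages`, and `get_posts` parameters come from this README; the helpers `reader_options` and `build_reader` are hypothetical and exist only for illustration.

```python
def reader_options(base_url, include_pages=True, include_posts=True):
    """Build keyword arguments for WordpressReader (hypothetical helper)."""
    return {
        "url": base_url,
        "get_pages": include_pages,
        "get_posts": include_posts,
    }


def build_reader(options):
    """Create the reader from the options built above (hypothetical helper)."""
    # Imported lazily so reader_options works without the package installed.
    from llama_index.readers.wordpress import WordpressReader

    return WordpressReader(**options)


# Posts only: skip static pages and keep blog entries.
posts_only = reader_options("https://www.mysite.com", include_pages=False)
print(posts_only)
```

Calling `build_reader(posts_only).load_data()` should then return documents for blog posts only, assuming the site is reachable.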