Name | nsloader JSON |
Version |
1.0.0
JSON |
| download |
home_page | https://github.com/new-village/nsloader |
Summary | This script collects articles from Wall Street Journal and returns it in dict format. |
upload_time | 2023-05-03 23:25:06 |
maintainer | |
docs_url | None |
author | new-village |
requires_python | |
license | |
keywords |
|
VCS |
 |
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# nsloader
[](https://github.com/new-village/nsloader/actions/workflows/unittest.yaml) [](https://badge.fury.io/py/nsloader)
This script collects articles from [Wall Street Journal](https://www.wsj.com/) and returns it in dict format.
### Installing nsloader and Supported Versions
----------------------
nsloader is not registered on the pypi. You have to download from github directly.
```
$ python -m pip install nsloader
```
nsloader is tested by Python `3.10`.
### Dependencies
----------------------
- [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/doc/#)
- [selenium](https://www.selenium.dev/)
- [webdriver_manager](https://github.com/SergeyPirogov/webdriver_manager)
- [chromedriver_binary](https://github.com/danielkaiser/python-chromedriver-binary)
Additionaly, you have to install `google-chrome-stable` in your execution environment.
### Usage
----------------------
To load the [Wall Street Journal](https://www.wsj.com/) articles and parse to dictionay format.
#### Wall Street Journal
NOTE: You have to set 2 enviroment valiables name as `WSJ_USERNAME` and `WSJ_PASSWORD` before execution.
```python
>>> from nsloader import wsj
>>> article = wsj.Article()
>>> article.load('https://www.wsj.com/articles/...')
>>> print(article.to_dict())
{"url": "https://www.wsj.com/articles/...", "title": "The Fed ...", "sub_title": "As expected ...", ... }
```
Raw data
{
"_id": null,
"home_page": "https://github.com/new-village/nsloader",
"name": "nsloader",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "",
"author": "new-village",
"author_email": "",
"download_url": "https://files.pythonhosted.org/packages/e0/6c/1984e899356fde7c370dc40d2274948a26e56a7d7cf1ba90ca16caa3a3e3/nsloader-1.0.0.tar.gz",
"platform": null,
"description": "# nsloader \n[](https://github.com/new-village/nsloader/actions/workflows/unittest.yaml) [](https://badge.fury.io/py/nsloader) \nThis script collects articles from [Wall Street Journal](https://www.wsj.com/) and returns it in dict format. \n \n### Installing nsloader and Supported Versions\n----------------------\nnsloader is not registered on the pypi. You have to download from github directly.\n```\n$ python -m pip install nsloader\n```\nnsloader is tested by Python `3.10`.\n \n \n### Dependencies\n----------------------\n- [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/doc/#)\n- [selenium](https://www.selenium.dev/)\n- [webdriver_manager](https://github.com/SergeyPirogov/webdriver_manager)\n- [chromedriver_binary](https://github.com/danielkaiser/python-chromedriver-binary)\n \nAdditionaly, you have to install `google-chrome-stable` in your execution environment. \n \n \n### Usage\n----------------------\nTo load the [Wall Street Journal](https://www.wsj.com/) articles and parse to dictionay format.\n\n#### Wall Street Journal \nNOTE: You have to set 2 enviroment valiables name as `WSJ_USERNAME` and `WSJ_PASSWORD` before execution.\n\n```python\n>>> from nsloader import wsj\n>>> article = wsj.Article()\n>>> article.load('https://www.wsj.com/articles/...')\n>>> print(article.to_dict())\n{\"url\": \"https://www.wsj.com/articles/...\", \"title\": \"The Fed ...\", \"sub_title\": \"As expected ...\", ... }\n```\n",
"bugtrack_url": null,
"license": "",
"summary": "This script collects articles from Wall Street Journal and returns it in dict format.",
"version": "1.0.0",
"project_urls": {
"Homepage": "https://github.com/new-village/nsloader"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "d8bc37386f8f2c1f81c5dd18e2f5d38081053c348658bbf831549d56423e8543",
"md5": "cda2003c95e13fc12d9b136b7190c062",
"sha256": "0fb3d9d57106e8eaa2bf56edcb36b8382ec3b145a6965ff2596c9bfaaf1798c9"
},
"downloads": -1,
"filename": "nsloader-1.0.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "cda2003c95e13fc12d9b136b7190c062",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 5810,
"upload_time": "2023-05-03T23:25:05",
"upload_time_iso_8601": "2023-05-03T23:25:05.392521Z",
"url": "https://files.pythonhosted.org/packages/d8/bc/37386f8f2c1f81c5dd18e2f5d38081053c348658bbf831549d56423e8543/nsloader-1.0.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "e06c1984e899356fde7c370dc40d2274948a26e56a7d7cf1ba90ca16caa3a3e3",
"md5": "951821ed85f4852fb5666bc6c2a475f8",
"sha256": "ef87620dfd95a07c16601ecb089a172e922fda264aa56115a5ad0b22a56bd37e"
},
"downloads": -1,
"filename": "nsloader-1.0.0.tar.gz",
"has_sig": false,
"md5_digest": "951821ed85f4852fb5666bc6c2a475f8",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 5021,
"upload_time": "2023-05-03T23:25:06",
"upload_time_iso_8601": "2023-05-03T23:25:06.633753Z",
"url": "https://files.pythonhosted.org/packages/e0/6c/1984e899356fde7c370dc40d2274948a26e56a7d7cf1ba90ca16caa3a3e3/nsloader-1.0.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-05-03 23:25:06",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "new-village",
"github_project": "nsloader",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "nsloader"
}