nsloader


Name: nsloader
Version: 1.0.0
Home page: https://github.com/new-village/nsloader
Summary: This script collects articles from Wall Street Journal and returns it in dict format.
Upload time: 2023-05-03 23:25:06
Author: new-village
Docs URL: None
Requirements: No requirements were recorded.
Travis-CI: No Travis.
Coveralls test coverage: No coveralls.

# nsloader  
[![Test](https://github.com/new-village/nsloader/actions/workflows/test.yaml/badge.svg?branch=main)](https://github.com/new-village/nsloader/actions/workflows/unittest.yaml) [![PyPI version](https://badge.fury.io/py/nsloader.svg)](https://badge.fury.io/py/nsloader)  
This script collects articles from the [Wall Street Journal](https://www.wsj.com/) and returns them in dict format.  
  
### Installing nsloader and Supported Versions
----------------------
nsloader is published on PyPI and can be installed with pip.
```
$ python -m pip install nsloader
```
nsloader is tested with Python `3.10`.
  
  
### Dependencies
----------------------
- [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/doc/#)
- [selenium](https://www.selenium.dev/)
- [webdriver_manager](https://github.com/SergeyPirogov/webdriver_manager)
- [chromedriver_binary](https://github.com/danielkaiser/python-chromedriver-binary)
  
Additionally, you have to install `google-chrome-stable` in your execution environment.  
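
Before running nsloader, it can help to confirm that the dependencies and the Chrome binary are actually present. The following is a minimal sketch, not part of nsloader: the helper names `missing_modules` and `chrome_path` are hypothetical, and the import names (`bs4`, `selenium`, `webdriver_manager`, `chromedriver_binary`) are assumed to match the packages listed above.

```python
import importlib.util
import shutil

# Assumed import names for the packages in the dependency list.
MODULES = ["bs4", "selenium", "webdriver_manager", "chromedriver_binary"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def chrome_path(binary="google-chrome-stable"):
    """Return the full path of the Chrome binary, or None if it is not on PATH."""
    return shutil.which(binary)


if __name__ == "__main__":
    print("Missing modules:", missing_modules(MODULES) or "none")
    print("Chrome binary:", chrome_path() or "not found on PATH")
```

An empty list from `missing_modules(MODULES)` and a non-`None` result from `chrome_path()` suggest the environment is ready. The check only looks at `PATH`, so a Chrome install in a non-standard location may still be reported as missing.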
  
  
### Usage
----------------------
Load [Wall Street Journal](https://www.wsj.com/) articles and parse them into dictionary format.

#### Wall Street Journal  
NOTE: You have to set two environment variables, named `WSJ_USERNAME` and `WSJ_PASSWORD`, before execution.

```python
>>> from nsloader import wsj
>>> article = wsj.Article()
>>> article.load('https://www.wsj.com/articles/...')
>>> print(article.to_dict())
{"url": "https://www.wsj.com/articles/...", "title": "The Fed ...", "sub_title": "As expected ...",  ... }
```
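
Because the credentials come from environment variables, a small pre-flight check can fail early with a clear message instead of partway through a browser session. This is only a sketch under assumptions: `missing_credentials` and `require_credentials` are hypothetical helpers, not part of nsloader, and how nsloader itself reads or validates these variables is not documented here.

```python
import os

# Variable names taken from the NOTE above.
REQUIRED_VARS = ("WSJ_USERNAME", "WSJ_PASSWORD")


def missing_credentials(environ=os.environ):
    """Return the required variable names that are unset or empty."""
    return [name for name in REQUIRED_VARS if not environ.get(name)]


def require_credentials(environ=os.environ):
    """Raise RuntimeError if any required variable is unset or empty."""
    missing = missing_credentials(environ)
    if missing:
        raise RuntimeError(
            "Set these environment variables first: " + ", ".join(missing)
        )
```

Calling `require_credentials()` before `wsj.Article()` would stop the script immediately when either variable is absent. Passing a plain dict as `environ` makes the helpers easy to exercise without touching the real environment.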

            

Raw data

{
    "_id": null,
    "home_page": "https://github.com/new-village/nsloader",
    "name": "nsloader",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "",
    "author": "new-village",
    "author_email": "",
    "download_url": "https://files.pythonhosted.org/packages/e0/6c/1984e899356fde7c370dc40d2274948a26e56a7d7cf1ba90ca16caa3a3e3/nsloader-1.0.0.tar.gz",
    "platform": null,
    "description": "# nsloader  \n[![Test](https://github.com/new-village/nsloader/actions/workflows/test.yaml/badge.svg?branch=main)](https://github.com/new-village/nsloader/actions/workflows/unittest.yaml) [![PyPI version](https://badge.fury.io/py/nsloader.svg)](https://badge.fury.io/py/nsloader)  \nThis script collects articles from [Wall Street Journal](https://www.wsj.com/) and returns it in dict format.  \n  \n### Installing nsloader and Supported Versions\n----------------------\nnsloader is not registered on the pypi. You have to download from github directly.\n```\n$ python -m pip install nsloader\n```\nnsloader is tested by Python `3.10`.\n  \n  \n### Dependencies\n----------------------\n- [beautifulsoup4](https://www.crummy.com/software/BeautifulSoup/bs4/doc/#)\n- [selenium](https://www.selenium.dev/)\n- [webdriver_manager](https://github.com/SergeyPirogov/webdriver_manager)\n- [chromedriver_binary](https://github.com/danielkaiser/python-chromedriver-binary)\n  \nAdditionaly, you have to install `google-chrome-stable` in your execution environment.  \n  \n  \n### Usage\n----------------------\nTo load the [Wall Street Journal](https://www.wsj.com/) articles and parse to dictionay format.\n\n#### Wall Street Journal  \nNOTE: You have to set 2 enviroment valiables name as `WSJ_USERNAME` and `WSJ_PASSWORD` before execution.\n\n```python\n>>> from nsloader import wsj\n>>> article = wsj.Article()\n>>> article.load('https://www.wsj.com/articles/...')\n>>> print(article.to_dict())\n{\"url\": \"https://www.wsj.com/articles/...\", \"title\": \"The Fed ...\", \"sub_title\": \"As expected ...\",  ... }\n```\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "This script collects articles from Wall Street Journal and returns it in dict format.",
    "version": "1.0.0",
    "project_urls": {
        "Homepage": "https://github.com/new-village/nsloader"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d8bc37386f8f2c1f81c5dd18e2f5d38081053c348658bbf831549d56423e8543",
                "md5": "cda2003c95e13fc12d9b136b7190c062",
                "sha256": "0fb3d9d57106e8eaa2bf56edcb36b8382ec3b145a6965ff2596c9bfaaf1798c9"
            },
            "downloads": -1,
            "filename": "nsloader-1.0.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "cda2003c95e13fc12d9b136b7190c062",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 5810,
            "upload_time": "2023-05-03T23:25:05",
            "upload_time_iso_8601": "2023-05-03T23:25:05.392521Z",
            "url": "https://files.pythonhosted.org/packages/d8/bc/37386f8f2c1f81c5dd18e2f5d38081053c348658bbf831549d56423e8543/nsloader-1.0.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e06c1984e899356fde7c370dc40d2274948a26e56a7d7cf1ba90ca16caa3a3e3",
                "md5": "951821ed85f4852fb5666bc6c2a475f8",
                "sha256": "ef87620dfd95a07c16601ecb089a172e922fda264aa56115a5ad0b22a56bd37e"
            },
            "downloads": -1,
            "filename": "nsloader-1.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "951821ed85f4852fb5666bc6c2a475f8",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 5021,
            "upload_time": "2023-05-03T23:25:06",
            "upload_time_iso_8601": "2023-05-03T23:25:06.633753Z",
            "url": "https://files.pythonhosted.org/packages/e0/6c/1984e899356fde7c370dc40d2274948a26e56a7d7cf1ba90ca16caa3a3e3/nsloader-1.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-05-03 23:25:06",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "new-village",
    "github_project": "nsloader",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "nsloader"
}
        