rss-scrap


Namerss-scrap JSON
Version 0.3.0 PyPI version JSON
download
home_pagehttps://git.goral.net.pl/rss-scrap.git/about
SummaryWeb scrapper which converts sites to RSS feeds.
upload_time2024-01-08 11:17:35
maintainer
docs_urlNone
authorMichal Goral
requires_python>=3.8,<4.0
licenseGPL-3.0-only
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # RSS Scrap

rss-scrap is a command line utility which scraps contents of web pages and
converts them to RSS feeds. Specific web scrapers must be implemented for
each page.

rss-scrap works asynchronously, meaning that many web pages can be scraped
simultaneously.

## Implemented scrappers

- `economist`: The Economist, World This Week section: Politics this week,
  Business this week, Kal's Cartoon (3 separate feeds)
- `wiki_current_events`: Wikipedia Current Events
    - parameter `date`: day (ISO format) from which events should be fetched
      (e.g. `-f wiki_current_events,date=2020-10-27`)
- `gov_pl_gis`: Warnings of Główny Inspektorat Sanitarny (Polish Government Agency)

            

Raw data

            {
    "_id": null,
    "home_page": "https://git.goral.net.pl/rss-scrap.git/about",
    "name": "rss-scrap",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.8,<4.0",
    "maintainer_email": "",
    "keywords": "",
    "author": "Michal Goral",
    "author_email": "dev@goral.net.pl",
    "download_url": "https://files.pythonhosted.org/packages/70/31/735f190acb4ea02a2acd267734dd5a0e774f77894c1f62fc37536028f0b9/rss_scrap-0.3.0.tar.gz",
    "platform": null,
    "description": "# RSS Scrap\n\nrss-scrap is a command line utility which scraps contents of web pages and\nconverts them to RSS feeds. Specific web scrapers must be implemented for\neach page.\n\nrss-scrap works asynchronously, meaning that many web pages can be scraped\nsimultaneously.\n\n## Implemented scrappers\n\n- `economist`: The Economist, World This Week section: Politics this week,\n  Business this week, Kal's Cartoon (3 separate feeds)\n- `wiki_current_events`: Wikipedia Current Events\n    - parameter `date`: day (ISO format) from which events should be fetched\n      (e.g. `-f wiki_current_events,date=2020-10-27`)\n- `gov_pl_gis`: Warnings of G\u0142\u00f3wny Inspektorat Sanitarny (Polish Government Agency)\n",
    "bugtrack_url": null,
    "license": "GPL-3.0-only",
    "summary": "Web scrapper which converts sites to RSS feeds.",
    "version": "0.3.0",
    "project_urls": {
        "Homepage": "https://git.goral.net.pl/rss-scrap.git/about",
        "Repository": "https://git.goral.net.pl/rss-scrap.git"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "397a902c804a4ce2b57dfeb389cd7bc289f0b1df1a8f414f3baaa69fce05cc2f",
                "md5": "4ac2d67aae1729ec235af9aaa355b96c",
                "sha256": "2c85d5ff4be9f4d0a41ce908681b229e4dd06d1413a3059c07dc059dda6a4be7"
            },
            "downloads": -1,
            "filename": "rss_scrap-0.3.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "4ac2d67aae1729ec235af9aaa355b96c",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8,<4.0",
            "size": 9278,
            "upload_time": "2024-01-08T11:17:32",
            "upload_time_iso_8601": "2024-01-08T11:17:32.486087Z",
            "url": "https://files.pythonhosted.org/packages/39/7a/902c804a4ce2b57dfeb389cd7bc289f0b1df1a8f414f3baaa69fce05cc2f/rss_scrap-0.3.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7031735f190acb4ea02a2acd267734dd5a0e774f77894c1f62fc37536028f0b9",
                "md5": "ec5c5070d266a85b350880b3091e615b",
                "sha256": "871666304011a4fd716f653678229841ae7328bae0a45dcadad3ec77447c1c2e"
            },
            "downloads": -1,
            "filename": "rss_scrap-0.3.0.tar.gz",
            "has_sig": false,
            "md5_digest": "ec5c5070d266a85b350880b3091e615b",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8,<4.0",
            "size": 7378,
            "upload_time": "2024-01-08T11:17:35",
            "upload_time_iso_8601": "2024-01-08T11:17:35.739710Z",
            "url": "https://files.pythonhosted.org/packages/70/31/735f190acb4ea02a2acd267734dd5a0e774f77894c1f62fc37536028f0b9/rss_scrap-0.3.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-01-08 11:17:35",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "rss-scrap"
}
        
Elapsed time: 0.16948s