yawbdl


Nameyawbdl JSON
Version 1.0.1 PyPI version JSON
download
home_pagehttps://github.com/BGforgeNet/yawbdl
SummaryA tool to download pages from Internet Archive.
upload_time2025-01-04 06:06:09
maintainerNone
docs_urlNone
authorBGforge
requires_python<4.0,>=3.9
licenseNone
keywords internet archive web.archive.org
VCS
bugtrack_url
requirements certifi charset-normalizer idna requests urllib3
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ## Yet Another WayBack DownLoader

YAWBDL is a tool to download archived pages from Internet Archive, which I wrote because none of other tools worked good enough (or at all) in my use cases.

### Usage

```bash
pipx install yawbdl

yawbdl
usage: yawbdl [-h] [-d DOMAIN] [-o DST_DIR] [--from FROM_DATE] [--to TO_DATE]
              [--timeout TIMEOUT] [-n] [--delay DELAY] [--retries RETRIES]
              [--no-fail]
              [--skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]]

Download a website from Internet Archive

options:
  -h, --help            show this help message and exit
  -d DOMAIN             domain to download (default: None)
  -o DST_DIR            output directory (default: None)
  --from FROM_DATE      from date, up to 14 digits: yyyyMMddhhmmss (default:
                        None)
  --to TO_DATE          to date (default: None)
  --timeout TIMEOUT     request timeout (default: 10)
  -n                    dry run (default: False)
  --delay DELAY         delay between requests (default: 1)
  --retries RETRIES     max number of retries (default: 0)
  --no-fail             if retries are exceeded, and the file still couldn't
                        have been downloaded, proceed to the next file instead
                        of aborting the run (default: False)
  --skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]
                        skip snapshots with these timestamps (sometimes Internet
                        Archive just fails to serve a specific snapshot)
                        (default: None)
```

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/BGforgeNet/yawbdl",
    "name": "yawbdl",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.9",
    "maintainer_email": null,
    "keywords": "Internet Archive, web.archive.org",
    "author": "BGforge",
    "author_email": "dev@bgforge.net",
    "download_url": "https://files.pythonhosted.org/packages/e6/6d/ef08a021f479ced30e3443ef77787c3c9c0a1c308881b766c5525f3e548f/yawbdl-1.0.1.tar.gz",
    "platform": null,
    "description": "## Yet Another WayBack DownLoader\n\nYAWBDL is a tool to download archived pages from Internet Archive, which I wrote because none of other tools worked good enough (or at all) in my use cases.\n\n### Usage\n\n```bash\npipx install yawbdl\n\nyawbdl\nusage: yawbdl [-h] [-d DOMAIN] [-o DST_DIR] [--from FROM_DATE] [--to TO_DATE]\n              [--timeout TIMEOUT] [-n] [--delay DELAY] [--retries RETRIES]\n              [--no-fail]\n              [--skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]]\n\nDownload a website from Internet Archive\n\noptions:\n  -h, --help            show this help message and exit\n  -d DOMAIN             domain to download (default: None)\n  -o DST_DIR            output directory (default: None)\n  --from FROM_DATE      from date, up to 14 digits: yyyyMMddhhmmss (default:\n                        None)\n  --to TO_DATE          to date (default: None)\n  --timeout TIMEOUT     request timeout (default: 10)\n  -n                    dry run (default: False)\n  --delay DELAY         delay between requests (default: 1)\n  --retries RETRIES     max number of retries (default: 0)\n  --no-fail             if retries are exceeded, and the file still couldn't\n                        have been downloaded, proceed to the next file instead\n                        of aborting the run (default: False)\n  --skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]\n                        skip snapshots with these timestamps (sometimes Internet\n                        Archive just fails to serve a specific snapshot)\n                        (default: None)\n```\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A tool to download pages from Internet Archive.",
    "version": "1.0.1",
    "project_urls": {
        "Homepage": "https://github.com/BGforgeNet/yawbdl",
        "Repository": "https://github.com/BGforgeNet/yawbdl"
    },
    "split_keywords": [
        "internet archive",
        " web.archive.org"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "65b8957106ff69ab0bf261674667776f65db20b632c194cbe716b0fa66ca1e65",
                "md5": "884297a7aa5bd922508c0ad35ed1b4c9",
                "sha256": "3e7e83f07e12dcbd6be5814cb7d0c093d2a1bdb99c43b410de042674c5983f3c"
            },
            "downloads": -1,
            "filename": "yawbdl-1.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "884297a7aa5bd922508c0ad35ed1b4c9",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.9",
            "size": 4929,
            "upload_time": "2025-01-04T06:06:08",
            "upload_time_iso_8601": "2025-01-04T06:06:08.200714Z",
            "url": "https://files.pythonhosted.org/packages/65/b8/957106ff69ab0bf261674667776f65db20b632c194cbe716b0fa66ca1e65/yawbdl-1.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e66def08a021f479ced30e3443ef77787c3c9c0a1c308881b766c5525f3e548f",
                "md5": "a7d22652a601d445cd81b7898341f63a",
                "sha256": "cedabb267e2c273dc44c425b4c4b219ffba94a4a9b07a0707956f320d3c59f5f"
            },
            "downloads": -1,
            "filename": "yawbdl-1.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "a7d22652a601d445cd81b7898341f63a",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.9",
            "size": 4199,
            "upload_time": "2025-01-04T06:06:09",
            "upload_time_iso_8601": "2025-01-04T06:06:09.217459Z",
            "url": "https://files.pythonhosted.org/packages/e6/6d/ef08a021f479ced30e3443ef77787c3c9c0a1c308881b766c5525f3e548f/yawbdl-1.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-01-04 06:06:09",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "BGforgeNet",
    "github_project": "yawbdl",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "certifi",
            "specs": [
                [
                    "==",
                    "2024.12.14"
                ]
            ]
        },
        {
            "name": "charset-normalizer",
            "specs": [
                [
                    "==",
                    "3.4.1"
                ]
            ]
        },
        {
            "name": "idna",
            "specs": [
                [
                    "==",
                    "3.10"
                ]
            ]
        },
        {
            "name": "requests",
            "specs": [
                [
                    "==",
                    "2.32.3"
                ]
            ]
        },
        {
            "name": "urllib3",
            "specs": [
                [
                    "==",
                    "2.3.0"
                ]
            ]
        }
    ],
    "lcname": "yawbdl"
}
        
Elapsed time: 0.72284s