yawbdl


Nameyawbdl JSON
Version 1.1.1 PyPI version JSON
download
home_pageNone
SummaryA tool to download pages from Internet Archive.
upload_time2025-07-14 07:59:46
maintainerNone
docs_urlNone
authorNone
requires_python>=3.12
licenseNone
keywords internet archive web.archive.org
VCS
bugtrack_url
requirements certifi charset-normalizer colorama idna loguru requests urllib3 win32-setctime
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ## Yet Another WayBack DownLoader

YAWBDL is a tool to download archived pages from Internet Archive, which I wrote because none of other tools worked good enough (or at all) in my use cases.

### Usage

```bash
pipx install yawbdl

usage: yawbdl.py [-h] [-d DOMAIN] [-o DST_DIR] [--from FROM_DATE] [--to TO_DATE]
                 [--timeout TIMEOUT] [-n] [--delay DELAY] [--retries RETRIES]
                 [--no-fail]
                 [--skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]]
                 [--latest-only]

Download a website from Internet Archive

options:
  -h, --help            show this help message and exit
  -d DOMAIN             domain to download (default: None)
  -o DST_DIR            output directory (default: None)
  --from FROM_DATE      from date, up to 14 digits: yyyyMMddhhmmss (default:
                        None)
  --to TO_DATE          to date (default: None)
  --timeout TIMEOUT     request timeout (default: 10)
  -n                    dry run (default: False)
  --delay DELAY         delay between requests (default: 1)
  --retries RETRIES     max number of retries (default: 0)
  --no-fail             if retries are exceeded, and the file still couldn't
                        have been downloaded, proceed to the next file instead
                        of aborting the run (default: False)
  --skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]
                        skip snapshots with these timestamps (sometimes Internet
                        Archive just fails to serve a specific snapshot)
                        (default: None)
  --latest-only         download only the latest version of each URL (default:
                        False)
```

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "yawbdl",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.12",
    "maintainer_email": null,
    "keywords": "Internet Archive, web.archive.org",
    "author": null,
    "author_email": "BGforge <dev@bgforge.net>",
    "download_url": "https://files.pythonhosted.org/packages/93/9e/5984110f3b46752b91debf0a94f251fe251c48e3cd6be60ec962bc723f98/yawbdl-1.1.1.tar.gz",
    "platform": null,
    "description": "## Yet Another WayBack DownLoader\n\nYAWBDL is a tool to download archived pages from Internet Archive, which I wrote because none of other tools worked good enough (or at all) in my use cases.\n\n### Usage\n\n```bash\npipx install yawbdl\n\nusage: yawbdl.py [-h] [-d DOMAIN] [-o DST_DIR] [--from FROM_DATE] [--to TO_DATE]\n                 [--timeout TIMEOUT] [-n] [--delay DELAY] [--retries RETRIES]\n                 [--no-fail]\n                 [--skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]]\n                 [--latest-only]\n\nDownload a website from Internet Archive\n\noptions:\n  -h, --help            show this help message and exit\n  -d DOMAIN             domain to download (default: None)\n  -o DST_DIR            output directory (default: None)\n  --from FROM_DATE      from date, up to 14 digits: yyyyMMddhhmmss (default:\n                        None)\n  --to TO_DATE          to date (default: None)\n  --timeout TIMEOUT     request timeout (default: 10)\n  -n                    dry run (default: False)\n  --delay DELAY         delay between requests (default: 1)\n  --retries RETRIES     max number of retries (default: 0)\n  --no-fail             if retries are exceeded, and the file still couldn't\n                        have been downloaded, proceed to the next file instead\n                        of aborting the run (default: False)\n  --skip-timestamps SKIP_TIMESTAMPS [SKIP_TIMESTAMPS ...]\n                        skip snapshots with these timestamps (sometimes Internet\n                        Archive just fails to serve a specific snapshot)\n                        (default: None)\n  --latest-only         download only the latest version of each URL (default:\n                        False)\n```\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A tool to download pages from Internet Archive.",
    "version": "1.1.1",
    "project_urls": {
        "Homepage": "https://github.com/BGforgeNet/yawbdl",
        "Repository": "https://github.com/BGforgeNet/yawbdl"
    },
    "split_keywords": [
        "internet archive",
        " web.archive.org"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "a65fc67b2edbc1ebc4b206cbe7a78689fff2d00ae0dd61723978cc8b671ecfc9",
                "md5": "282a2b146f80f4c4aef16a1f18ecfe62",
                "sha256": "3ce2613bad7af7c41ec9371cdb595a7e558cdca5c137372797feffab0bf9ca5b"
            },
            "downloads": -1,
            "filename": "yawbdl-1.1.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "282a2b146f80f4c4aef16a1f18ecfe62",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.12",
            "size": 8141,
            "upload_time": "2025-07-14T07:59:45",
            "upload_time_iso_8601": "2025-07-14T07:59:45.771942Z",
            "url": "https://files.pythonhosted.org/packages/a6/5f/c67b2edbc1ebc4b206cbe7a78689fff2d00ae0dd61723978cc8b671ecfc9/yawbdl-1.1.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "939e5984110f3b46752b91debf0a94f251fe251c48e3cd6be60ec962bc723f98",
                "md5": "95770112ec6213bcaf242c22b8f55616",
                "sha256": "c346b91109eee7e7ebd9d2cdfe695b66c72f68a6c18770039154dbb0ee9c4f90"
            },
            "downloads": -1,
            "filename": "yawbdl-1.1.1.tar.gz",
            "has_sig": false,
            "md5_digest": "95770112ec6213bcaf242c22b8f55616",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.12",
            "size": 7787,
            "upload_time": "2025-07-14T07:59:46",
            "upload_time_iso_8601": "2025-07-14T07:59:46.820579Z",
            "url": "https://files.pythonhosted.org/packages/93/9e/5984110f3b46752b91debf0a94f251fe251c48e3cd6be60ec962bc723f98/yawbdl-1.1.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-07-14 07:59:46",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "BGforgeNet",
    "github_project": "yawbdl",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "certifi",
            "specs": [
                [
                    "==",
                    "2025.7.14"
                ]
            ]
        },
        {
            "name": "charset-normalizer",
            "specs": [
                [
                    "==",
                    "3.4.2"
                ]
            ]
        },
        {
            "name": "colorama",
            "specs": [
                [
                    "==",
                    "0.4.6"
                ]
            ]
        },
        {
            "name": "idna",
            "specs": [
                [
                    "==",
                    "3.10"
                ]
            ]
        },
        {
            "name": "loguru",
            "specs": [
                [
                    "==",
                    "0.7.3"
                ]
            ]
        },
        {
            "name": "requests",
            "specs": [
                [
                    "==",
                    "2.32.4"
                ]
            ]
        },
        {
            "name": "urllib3",
            "specs": [
                [
                    "==",
                    "2.5.0"
                ]
            ]
        },
        {
            "name": "win32-setctime",
            "specs": [
                [
                    "==",
                    "1.2.0"
                ]
            ]
        }
    ],
    "lcname": "yawbdl"
}
        
Elapsed time: 1.07447s