wayback


Namewayback JSON
Version 0.4.5 PyPI version JSON
download
home_pagehttps://github.com/edgi-govdata-archiving/wayback
SummaryPython API to Internet Archive Wayback Machine
upload_time2024-02-01 19:17:07
maintainer
docs_urlNone
authorEnvironmental Data Governance Initiative
requires_python>=3.6
licenseBSD (3-clause)
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage
            ===============================
wayback
===============================

.. image:: https://circleci.com/gh/edgi-govdata-archiving/wayback/tree/main.svg?style=shield
        :target: https://circleci.com/gh/edgi-govdata-archiving/wayback
        :alt: Build Status

.. image:: https://img.shields.io/pypi/v/wayback.svg
        :target: https://pypi.python.org/pypi/wayback
        :alt: Download Latest Version from PyPI

.. image:: https://img.shields.io/badge/%E2%9D%A4-code%20of%20conduct-blue.svg?style=flat
        :target: https://github.com/edgi-govdata-archiving/overview/blob/main/CONDUCT.md
        :alt: Code of Conduct

.. image:: https://readthedocs.org/projects/wayback/badge/?version=stable
        :target: https://wayback.readthedocs.io/en/stable/?badge=stable
        :alt: Documentation Status


*Wayback* is A Python API to the `Internet Archive’s Wayback Machine <https://web.archive.org/>`_. It gives you tools to search for and load mementos (historical copies of web pages).

The Internet Archive maintains an official `“internetarchive” <https://archive.org/services/docs/api/internetarchive/>`_ Python package, but it does not focus on the Wayback Machine. Instead, it is mainly concerned with the APIs and tools that manage the Internet Archive as a whole: managing items and collections. These are how e-books, audio recordings, movies, and other content in the Internet Archive are managed. It doesn’t, however, provide particularly good tools for finding or loading historical captures of specific URLs (i.e. the part of the Internet Archive called the “Wayback Machine”). That’s what this package does.

* Documentation:
    * Current Release: https://wayback.readthedocs.io/en/stable/
    * Development: https://wayback.readthedocs.io/en/latest/


Installation & Basic Usage
--------------------------

Install via pip on the command line::

    $ pip install wayback

Then, in a Python script, import it and create a client:

.. code-block:: python

    import wayback
    client = wayback.WaybackClient()

Finally, search for all the mementos of ``nasa.gov`` before 1999 and download them:

.. code-block:: python

    for record in client.search('http://nasa.gov', to_date=date(1999, 1, 1)):
        memento = client.get_memento(record)

Read the `full documentation <https://wayback.readthedocs.io/>`_ for a more in-depth tutorial and complete API reference documentation at https://wayback.readthedocs.io/


Code of Conduct
---------------

This repository falls under EDGI’s `Code of Conduct <https://github.com/edgi-govdata-archiving/overview/blob/main/CONDUCT.md>`_. Please take a moment to review it before commenting on or creating issues and pull requests.


Contributors
------------

Thanks to the following people for their contributions and help on this package! See our `contributing guidelines <https://github.com/edgi-govdata-archiving/wayback/blob/main/CONTRIBUTING.rst>`_ to find out how you can help.

- `Dan Allan <https://github.com/danielballan>`_ (Code, Tests, Documentation, Reviews)
- `Rob Brackett <https://github.com/Mr0grog>`_ (Code, Tests, Documentation, Reviews)
- `David Gilman <https://github.com/dgilman>`_ (Documentation)
- `Will Sackfield <https://github.com/8W9aG>`_ (Code, Tests)
- `Ed Summers <https://github.com/edsu>`_ (Code, Tests)
- `Lion Szlagowski <https://github.com/LionSzl>`_ (Code, Tests)


License & Copyright
-------------------

Copyright (C) 2019-2023 Environmental Data and Governance Initiative (EDGI)

This program is free software: you can redistribute it and/or modify it under the terms of the 3-Clause BSD License. See the `LICENSE <https://github.com/edgi-govdata-archiving/wayback/blob/master/LICENSE>`_ file for details.



            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/edgi-govdata-archiving/wayback",
    "name": "wayback",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": "",
    "keywords": "",
    "author": "Environmental Data Governance Initiative",
    "author_email": "EnviroDGI@protonmail.com",
    "download_url": "https://files.pythonhosted.org/packages/5e/2c/50cf23834aea1de728967a535ce94f65b432ba95b36110aee3b1dd3204f2/wayback-0.4.5.tar.gz",
    "platform": null,
    "description": "===============================\nwayback\n===============================\n\n.. image:: https://circleci.com/gh/edgi-govdata-archiving/wayback/tree/main.svg?style=shield\n        :target: https://circleci.com/gh/edgi-govdata-archiving/wayback\n        :alt: Build Status\n\n.. image:: https://img.shields.io/pypi/v/wayback.svg\n        :target: https://pypi.python.org/pypi/wayback\n        :alt: Download Latest Version from PyPI\n\n.. image:: https://img.shields.io/badge/%E2%9D%A4-code%20of%20conduct-blue.svg?style=flat\n        :target: https://github.com/edgi-govdata-archiving/overview/blob/main/CONDUCT.md\n        :alt: Code of Conduct\n\n.. image:: https://readthedocs.org/projects/wayback/badge/?version=stable\n        :target: https://wayback.readthedocs.io/en/stable/?badge=stable\n        :alt: Documentation Status\n\n\n*Wayback* is A Python API to the `Internet Archive\u2019s Wayback Machine <https://web.archive.org/>`_. It gives you tools to search for and load mementos (historical copies of web pages).\n\nThe Internet Archive maintains an official `\u201cinternetarchive\u201d <https://archive.org/services/docs/api/internetarchive/>`_ Python package, but it does not focus on the Wayback Machine. Instead, it is mainly concerned with the APIs and tools that manage the Internet Archive as a whole: managing items and collections. These are how e-books, audio recordings, movies, and other content in the Internet Archive are managed. It doesn\u2019t, however, provide particularly good tools for finding or loading historical captures of specific URLs (i.e. the part of the Internet Archive called the \u201cWayback Machine\u201d). That\u2019s what this package does.\n\n* Documentation:\n    * Current Release: https://wayback.readthedocs.io/en/stable/\n    * Development: https://wayback.readthedocs.io/en/latest/\n\n\nInstallation & Basic Usage\n--------------------------\n\nInstall via pip on the command line::\n\n    $ pip install wayback\n\nThen, in a Python script, import it and create a client:\n\n.. code-block:: python\n\n    import wayback\n    client = wayback.WaybackClient()\n\nFinally, search for all the mementos of ``nasa.gov`` before 1999 and download them:\n\n.. code-block:: python\n\n    for record in client.search('http://nasa.gov', to_date=date(1999, 1, 1)):\n        memento = client.get_memento(record)\n\nRead the `full documentation <https://wayback.readthedocs.io/>`_ for a more in-depth tutorial and complete API reference documentation at https://wayback.readthedocs.io/\n\n\nCode of Conduct\n---------------\n\nThis repository falls under EDGI\u2019s `Code of Conduct <https://github.com/edgi-govdata-archiving/overview/blob/main/CONDUCT.md>`_. Please take a moment to review it before commenting on or creating issues and pull requests.\n\n\nContributors\n------------\n\nThanks to the following people for their contributions and help on this package! See our `contributing guidelines <https://github.com/edgi-govdata-archiving/wayback/blob/main/CONTRIBUTING.rst>`_ to find out how you can help.\n\n- `Dan Allan <https://github.com/danielballan>`_ (Code, Tests, Documentation, Reviews)\n- `Rob Brackett <https://github.com/Mr0grog>`_ (Code, Tests, Documentation, Reviews)\n- `David Gilman <https://github.com/dgilman>`_ (Documentation)\n- `Will Sackfield <https://github.com/8W9aG>`_ (Code, Tests)\n- `Ed Summers <https://github.com/edsu>`_ (Code, Tests)\n- `Lion Szlagowski <https://github.com/LionSzl>`_ (Code, Tests)\n\n\nLicense & Copyright\n-------------------\n\nCopyright (C) 2019-2023 Environmental Data and Governance Initiative (EDGI)\n\nThis program is free software: you can redistribute it and/or modify it under the terms of the 3-Clause BSD License. See the `LICENSE <https://github.com/edgi-govdata-archiving/wayback/blob/master/LICENSE>`_ file for details.\n\n\n",
    "bugtrack_url": null,
    "license": "BSD (3-clause)",
    "summary": "Python API to Internet Archive Wayback Machine",
    "version": "0.4.5",
    "project_urls": {
        "Changelog": "https://wayback.readthedocs.io/en/stable/release-history.html",
        "Documentation": "https://wayback.readthedocs.io/en/stable/",
        "Homepage": "https://github.com/edgi-govdata-archiving/wayback",
        "Issues": "https://github.com/edgi-govdata-archiving/wayback/issues",
        "Source code": "https://github.com/edgi-govdata-archiving/wayback"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "040c3ac76773d81beb57696097eefe98d90137d263e57bb6a6406b9238b273a8",
                "md5": "355a58184e08df99baefccbd0a0910b6",
                "sha256": "4de2a65c99045ea99255f37ef2deeca445d94c612cf6fb7e78d46c9bb6b36e55"
            },
            "downloads": -1,
            "filename": "wayback-0.4.5-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "355a58184e08df99baefccbd0a0910b6",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 41107,
            "upload_time": "2024-02-01T19:17:06",
            "upload_time_iso_8601": "2024-02-01T19:17:06.379833Z",
            "url": "https://files.pythonhosted.org/packages/04/0c/3ac76773d81beb57696097eefe98d90137d263e57bb6a6406b9238b273a8/wayback-0.4.5-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "5e2c50cf23834aea1de728967a535ce94f65b432ba95b36110aee3b1dd3204f2",
                "md5": "30033b560c0ccd94ae12bf0aec623080",
                "sha256": "8f260a38ad8e317d9089fde86049151705a1bb4674e85b3da3a9f0204c5d4f55"
            },
            "downloads": -1,
            "filename": "wayback-0.4.5.tar.gz",
            "has_sig": false,
            "md5_digest": "30033b560c0ccd94ae12bf0aec623080",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6",
            "size": 72216,
            "upload_time": "2024-02-01T19:17:07",
            "upload_time_iso_8601": "2024-02-01T19:17:07.803041Z",
            "url": "https://files.pythonhosted.org/packages/5e/2c/50cf23834aea1de728967a535ce94f65b432ba95b36110aee3b1dd3204f2/wayback-0.4.5.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-02-01 19:17:07",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "edgi-govdata-archiving",
    "github_project": "wayback",
    "travis_ci": false,
    "coveralls": true,
    "github_actions": true,
    "circle": true,
    "lcname": "wayback"
}
        
Elapsed time: 0.22223s