<h1 align="center">Auto Archiver</h1>
[](https://badge.fury.io/py/auto-archiver)
[](https://hub.docker.com/r/bellingcat/auto-archiver)
[](https://github.com/bellingcat/auto-archiver/actions/workflows/tests-core.yaml)
[](https://github.com/bellingcat/auto-archiver/actions/workflows/tests-download.yaml)
<!--  -->
<!-- [](https://pypi.python.org/pypi/auto-archiver/) -->
<!-- [](https://vk-url-scraper.readthedocs.io/en/latest/?badge=latest) -->
Auto Archiver is a Python tool to automatically archive content on the web in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, command line etc.) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report, or if using Google Sheets – back to the original sheet.
<div class="hidden_rtd">
**[See the Auto Archiver documentation for more information.](https://auto-archiver.readthedocs.io/en/latest/)**
</div>
Read the [article about Auto Archiver on bellingcat.com](https://www.bellingcat.com/resources/2022/09/22/preserve-vital-online-content-with-bellingcats-auto-archiver-tool/).
## Installation
View the [Installation Guide](installation/installation.md) for full instructions
To get started quickly using Docker:
`docker pull bellingcat/auto-archiver && docker run`
Or pip:
`pip install auto-archiver && auto-archiver --help`
## Contributing
We welcome contributions to the Auto Archiver project! See the [Contributing Guide](https://auto-archiver.readthedocs.io/en/latest/contributing.html) for how to get involved!
Raw data
{
"_id": null,
"home_page": "https://github.com/bellingcat/auto-archiver",
"name": "auto-archiver",
"maintainer": null,
"docs_url": null,
"requires_python": "<3.13,>=3.10",
"maintainer_email": null,
"keywords": "archive, oosi, osint, scraping",
"author": "Bellingcat",
"author_email": "tech@bellingcat.com",
"download_url": "https://files.pythonhosted.org/packages/58/20/a902fa17f08e4851ee5f3240c2d077a8be51b2f107b5e32ce3860e21230d/auto_archiver-0.13.3.tar.gz",
"platform": null,
"description": "<h1 align=\"center\">Auto Archiver</h1>\n\n[](https://badge.fury.io/py/auto-archiver)\n[](https://hub.docker.com/r/bellingcat/auto-archiver)\n[](https://github.com/bellingcat/auto-archiver/actions/workflows/tests-core.yaml)\n[](https://github.com/bellingcat/auto-archiver/actions/workflows/tests-download.yaml)\n<!--  -->\n<!-- [](https://pypi.python.org/pypi/auto-archiver/) -->\n<!-- [](https://vk-url-scraper.readthedocs.io/en/latest/?badge=latest) -->\n\n\n\nAuto Archiver is a Python tool to automatically archive content on the web in a secure and verifiable way. It takes URLs from different sources (e.g. a CSV file, Google Sheets, command line etc.) and archives the content of each one. It can archive social media posts, videos, images and webpages. Content can enriched, then saved either locally or remotely (S3 bucket, Google Drive). The status of the archiving process can be appended to a CSV report, or if using Google Sheets \u2013 back to the original sheet.\n\n<div class=\"hidden_rtd\">\n \n**[See the Auto Archiver documentation for more information.](https://auto-archiver.readthedocs.io/en/latest/)**\n\n</div>\n\nRead the [article about Auto Archiver on bellingcat.com](https://www.bellingcat.com/resources/2022/09/22/preserve-vital-online-content-with-bellingcats-auto-archiver-tool/).\n\n\n## Installation\n\nView the [Installation Guide](installation/installation.md) for full instructions\n\nTo get started quickly using Docker:\n\n`docker pull bellingcat/auto-archiver && docker run`\n\nOr pip:\n\n`pip install auto-archiver && auto-archiver --help`\n\n## Contributing\n\nWe welcome contributions to the Auto Archiver project! See the [Contributing Guide](https://auto-archiver.readthedocs.io/en/latest/contributing.html) for how to get involved!\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Automatically archive links to videos, images, and social media content from Google Sheets (and more).",
"version": "0.13.3",
"project_urls": {
"Documentation": "https://github.com/bellingcat/auto-archiver",
"Homepage": "https://github.com/bellingcat/auto-archiver",
"Repository": "https://github.com/bellingcat/auto-archiver"
},
"split_keywords": [
"archive",
" oosi",
" osint",
" scraping"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "6c7055f41670752d1c89408edde85be9da40a165b68ff99d45929c7dee418be4",
"md5": "bb284f3a582188bbee8a91ad120ed8e8",
"sha256": "8d4cdb8d62d0b7e94e3774ac4eb4596f1390690a81708566285c0d86c67595dc"
},
"downloads": -1,
"filename": "auto_archiver-0.13.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "bb284f3a582188bbee8a91ad120ed8e8",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<3.13,>=3.10",
"size": 140188,
"upload_time": "2025-02-18T19:13:48",
"upload_time_iso_8601": "2025-02-18T19:13:48.661069Z",
"url": "https://files.pythonhosted.org/packages/6c/70/55f41670752d1c89408edde85be9da40a165b68ff99d45929c7dee418be4/auto_archiver-0.13.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "5820a902fa17f08e4851ee5f3240c2d077a8be51b2f107b5e32ce3860e21230d",
"md5": "63bc76c529e7c586edc0ee9bc0f17799",
"sha256": "e6863edc54d240148f215449f21d0d3591b6b0a494c847e2b2d8f5bedd7d209a"
},
"downloads": -1,
"filename": "auto_archiver-0.13.3.tar.gz",
"has_sig": false,
"md5_digest": "63bc76c529e7c586edc0ee9bc0f17799",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<3.13,>=3.10",
"size": 90684,
"upload_time": "2025-02-18T19:13:50",
"upload_time_iso_8601": "2025-02-18T19:13:50.054522Z",
"url": "https://files.pythonhosted.org/packages/58/20/a902fa17f08e4851ee5f3240c2d077a8be51b2f107b5e32ce3860e21230d/auto_archiver-0.13.3.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-02-18 19:13:50",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "bellingcat",
"github_project": "auto-archiver",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "auto-archiver"
}