# Wayback Tweets
[![PyPI](https://img.shields.io/pypi/v/waybacktweets)](https://pypi.org/project/waybacktweets) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.12528447.svg)](https://doi.org/10.5281/zenodo.12528447) [![Streamlit App](https://static.streamlit.io/badges/streamlit_badge_black_white.svg)](https://waybacktweets.streamlit.app) [![Open In Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1tnaM3rMWpoSHBZ4P_6iHFPjraWRQ3OGe?usp=sharing)
Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML, for easy viewing of the tweets using the iframe tags, CSV, and JSON formats.
## Installation
```shell
pip install waybacktweets
```
## Quickstart
### Using Wayback Tweets as a standalone command line tool
waybacktweets [OPTIONS] USERNAME
```shell
waybacktweets --from 20150101 --to 20191231 --limit 250 jack
```
### Using Wayback Tweets as a Web App
[Open the application](https://waybacktweets.streamlit.app), a prototype written in Python with the Streamlit framework and hosted on Streamlit Cloud.
### Using Wayback Tweets as a Python Module
```python
from waybacktweets import WaybackTweets, TweetsParser, TweetsExporter
USERNAME = "jack"
api = WaybackTweets(USERNAME)
archived_tweets = api.get()
if archived_tweets:
field_options = [
"archived_timestamp",
"original_tweet_url",
"archived_tweet_url",
"archived_statuscode",
]
parser = TweetsParser(archived_tweets, USERNAME, field_options)
parsed_tweets = parser.parse()
exporter = TweetsExporter(parsed_tweets, USERNAME, field_options)
exporter.save_to_csv()
```
## Documentation
- [Wayback Tweets documentation](https://claromes.github.io/waybacktweets)
- [Wayback CDX Server API (Beta) documentation](https://archive.org/developers/wayback-cdx-server.html)
## Acknowledgements
- Tristan Lee (Bellingcat's Data Scientist) for the idea of the application.
- Jessica Smith (Snowflake's Community Growth Specialist) and Streamlit/Snowflake team for the additional server resources on Streamlit Cloud.
- OSINT Community for recommending the application.
> [!NOTE]
> If the Streamlit application is down, please check the [Streamlit Cloud Status](https://www.streamlitstatus.com/).
Raw data
{
"_id": null,
"home_page": "https://github.com/claromes/waybacktweets",
"name": "waybacktweets",
"maintainer": null,
"docs_url": null,
"requires_python": "<4.0,>=3.10",
"maintainer_email": null,
"keywords": "twitter, tweet, internet-archive, wayback-machine, osint-tools, osint, command-line",
"author": "Claromes",
"author_email": "support@claromes.com",
"download_url": "https://files.pythonhosted.org/packages/ab/14/dcbb0783397374fbe0eb593f0da8b69211e88267c6a3bd789548a2ac7700/waybacktweets-1.0rc0.tar.gz",
"platform": null,
"description": "# Wayback Tweets\n\n[![PyPI](https://img.shields.io/pypi/v/waybacktweets)](https://pypi.org/project/waybacktweets) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.12528447.svg)](https://doi.org/10.5281/zenodo.12528447) [![Streamlit App](https://static.streamlit.io/badges/streamlit_badge_black_white.svg)](https://waybacktweets.streamlit.app) [![Open In Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1tnaM3rMWpoSHBZ4P_6iHFPjraWRQ3OGe?usp=sharing)\n\n\nRetrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML, for easy viewing of the tweets using the iframe tags, CSV, and JSON formats.\n\n## Installation\n\n```shell\npip install waybacktweets\n```\n\n## Quickstart\n\n### Using Wayback Tweets as a standalone command line tool\n\nwaybacktweets [OPTIONS] USERNAME\n\n```shell\nwaybacktweets --from 20150101 --to 20191231 --limit 250 jack\n```\n\n### Using Wayback Tweets as a Web App\n\n[Open the application](https://waybacktweets.streamlit.app), a prototype written in Python with the Streamlit framework and hosted on Streamlit Cloud.\n\n### Using Wayback Tweets as a Python Module\n\n```python\nfrom waybacktweets import WaybackTweets, TweetsParser, TweetsExporter\n\nUSERNAME = \"jack\"\n\napi = WaybackTweets(USERNAME)\narchived_tweets = api.get()\n\nif archived_tweets:\n field_options = [\n \"archived_timestamp\",\n \"original_tweet_url\",\n \"archived_tweet_url\",\n \"archived_statuscode\",\n ]\n\n parser = TweetsParser(archived_tweets, USERNAME, field_options)\n parsed_tweets = parser.parse()\n\n exporter = TweetsExporter(parsed_tweets, USERNAME, field_options)\n exporter.save_to_csv()\n```\n\n## Documentation\n\n- [Wayback Tweets documentation](https://claromes.github.io/waybacktweets)\n- [Wayback CDX Server API (Beta) documentation](https://archive.org/developers/wayback-cdx-server.html)\n\n## Acknowledgements\n\n- Tristan Lee (Bellingcat's Data Scientist) for the idea of the application.\n- Jessica Smith (Snowflake's Community Growth Specialist) and Streamlit/Snowflake team for the additional server resources on Streamlit Cloud.\n- OSINT Community for recommending the application.\n\n> [!NOTE]\n> If the Streamlit application is down, please check the [Streamlit Cloud Status](https://www.streamlitstatus.com/).\n",
"bugtrack_url": null,
"license": "GPLv3",
"summary": "Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing, and saves the data.",
"version": "1.0rc0",
"project_urls": {
"Documentation": "https://claromes.github.io/waybacktweets/",
"Homepage": "https://claromes.github.io/waybacktweets/",
"Issue Tracker": "https://github.com/claromes/waybacktweets/issues",
"Repository": "https://github.com/claromes/waybacktweets"
},
"split_keywords": [
"twitter",
" tweet",
" internet-archive",
" wayback-machine",
" osint-tools",
" osint",
" command-line"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "7406283550f1b768283cec7982bf1b612644d7a383da34f057af924a966fb0bf",
"md5": "61ff3062fc9bc6317d298dce0098033b",
"sha256": "17a079b7a4d2d826bca051a3be5f1f51e4b99b8f605005b8a2997d28cb4e4253"
},
"downloads": -1,
"filename": "waybacktweets-1.0rc0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "61ff3062fc9bc6317d298dce0098033b",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<4.0,>=3.10",
"size": 30469,
"upload_time": "2024-07-18T23:31:59",
"upload_time_iso_8601": "2024-07-18T23:31:59.154584Z",
"url": "https://files.pythonhosted.org/packages/74/06/283550f1b768283cec7982bf1b612644d7a383da34f057af924a966fb0bf/waybacktweets-1.0rc0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "ab14dcbb0783397374fbe0eb593f0da8b69211e88267c6a3bd789548a2ac7700",
"md5": "917cfc182b20a81f2654f52edb01e788",
"sha256": "a6c35822fa7b5c0d569cc9a2f47107589a9e044ea4fdaaaaa651da1d0cd022e4"
},
"downloads": -1,
"filename": "waybacktweets-1.0rc0.tar.gz",
"has_sig": false,
"md5_digest": "917cfc182b20a81f2654f52edb01e788",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<4.0,>=3.10",
"size": 26731,
"upload_time": "2024-07-18T23:32:00",
"upload_time_iso_8601": "2024-07-18T23:32:00.688874Z",
"url": "https://files.pythonhosted.org/packages/ab/14/dcbb0783397374fbe0eb593f0da8b69211e88267c6a3bd789548a2ac7700/waybacktweets-1.0rc0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-07-18 23:32:00",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "claromes",
"github_project": "waybacktweets",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "waybacktweets"
}