waybacktweets


Namewaybacktweets JSON
Version 1.0rc0 PyPI version JSON
download
home_pagehttps://github.com/claromes/waybacktweets
SummaryRetrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing, and saves the data.
upload_time2024-07-18 23:32:00
maintainerNone
docs_urlNone
authorClaromes
requires_python<4.0,>=3.10
licenseGPLv3
keywords twitter tweet internet-archive wayback-machine osint-tools osint command-line
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Wayback Tweets

[![PyPI](https://img.shields.io/pypi/v/waybacktweets)](https://pypi.org/project/waybacktweets) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.12528447.svg)](https://doi.org/10.5281/zenodo.12528447) [![Streamlit App](https://static.streamlit.io/badges/streamlit_badge_black_white.svg)](https://waybacktweets.streamlit.app) [![Open In Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1tnaM3rMWpoSHBZ4P_6iHFPjraWRQ3OGe?usp=sharing)


Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML, for easy viewing of the tweets using the iframe tags, CSV, and JSON formats.

## Installation

```shell
pip install waybacktweets
```

## Quickstart

### Using Wayback Tweets as a standalone command line tool

waybacktweets [OPTIONS] USERNAME

```shell
waybacktweets --from 20150101 --to 20191231 --limit 250 jack
```

### Using Wayback Tweets as a Web App

[Open the application](https://waybacktweets.streamlit.app), a prototype written in Python with the Streamlit framework and hosted on Streamlit Cloud.

### Using Wayback Tweets as a Python Module

```python
from waybacktweets import WaybackTweets, TweetsParser, TweetsExporter

USERNAME = "jack"

api = WaybackTweets(USERNAME)
archived_tweets = api.get()

if archived_tweets:
    field_options = [
        "archived_timestamp",
        "original_tweet_url",
        "archived_tweet_url",
        "archived_statuscode",
    ]

    parser = TweetsParser(archived_tweets, USERNAME, field_options)
    parsed_tweets = parser.parse()

    exporter = TweetsExporter(parsed_tweets, USERNAME, field_options)
    exporter.save_to_csv()
```

## Documentation

- [Wayback Tweets documentation](https://claromes.github.io/waybacktweets)
- [Wayback CDX Server API (Beta) documentation](https://archive.org/developers/wayback-cdx-server.html)

## Acknowledgements

- Tristan Lee (Bellingcat's Data Scientist) for the idea of the application.
- Jessica Smith (Snowflake's Community Growth Specialist) and Streamlit/Snowflake team for the additional server resources on Streamlit Cloud.
- OSINT Community for recommending the application.

> [!NOTE]
> If the Streamlit application is down, please check the [Streamlit Cloud Status](https://www.streamlitstatus.com/).

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/claromes/waybacktweets",
    "name": "waybacktweets",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.10",
    "maintainer_email": null,
    "keywords": "twitter, tweet, internet-archive, wayback-machine, osint-tools, osint, command-line",
    "author": "Claromes",
    "author_email": "support@claromes.com",
    "download_url": "https://files.pythonhosted.org/packages/ab/14/dcbb0783397374fbe0eb593f0da8b69211e88267c6a3bd789548a2ac7700/waybacktweets-1.0rc0.tar.gz",
    "platform": null,
    "description": "# Wayback Tweets\n\n[![PyPI](https://img.shields.io/pypi/v/waybacktweets)](https://pypi.org/project/waybacktweets) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.12528447.svg)](https://doi.org/10.5281/zenodo.12528447) [![Streamlit App](https://static.streamlit.io/badges/streamlit_badge_black_white.svg)](https://waybacktweets.streamlit.app) [![Open In Collab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1tnaM3rMWpoSHBZ4P_6iHFPjraWRQ3OGe?usp=sharing)\n\n\nRetrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing (see [Field Options](https://claromes.github.io/waybacktweets/field_options.html)), and saves the data in HTML, for easy viewing of the tweets using the iframe tags, CSV, and JSON formats.\n\n## Installation\n\n```shell\npip install waybacktweets\n```\n\n## Quickstart\n\n### Using Wayback Tweets as a standalone command line tool\n\nwaybacktweets [OPTIONS] USERNAME\n\n```shell\nwaybacktweets --from 20150101 --to 20191231 --limit 250 jack\n```\n\n### Using Wayback Tweets as a Web App\n\n[Open the application](https://waybacktweets.streamlit.app), a prototype written in Python with the Streamlit framework and hosted on Streamlit Cloud.\n\n### Using Wayback Tweets as a Python Module\n\n```python\nfrom waybacktweets import WaybackTweets, TweetsParser, TweetsExporter\n\nUSERNAME = \"jack\"\n\napi = WaybackTweets(USERNAME)\narchived_tweets = api.get()\n\nif archived_tweets:\n    field_options = [\n        \"archived_timestamp\",\n        \"original_tweet_url\",\n        \"archived_tweet_url\",\n        \"archived_statuscode\",\n    ]\n\n    parser = TweetsParser(archived_tweets, USERNAME, field_options)\n    parsed_tweets = parser.parse()\n\n    exporter = TweetsExporter(parsed_tweets, USERNAME, field_options)\n    exporter.save_to_csv()\n```\n\n## Documentation\n\n- [Wayback Tweets documentation](https://claromes.github.io/waybacktweets)\n- [Wayback CDX Server API (Beta) documentation](https://archive.org/developers/wayback-cdx-server.html)\n\n## Acknowledgements\n\n- Tristan Lee (Bellingcat's Data Scientist) for the idea of the application.\n- Jessica Smith (Snowflake's Community Growth Specialist) and Streamlit/Snowflake team for the additional server resources on Streamlit Cloud.\n- OSINT Community for recommending the application.\n\n> [!NOTE]\n> If the Streamlit application is down, please check the [Streamlit Cloud Status](https://www.streamlitstatus.com/).\n",
    "bugtrack_url": null,
    "license": "GPLv3",
    "summary": "Retrieves archived tweets CDX data from the Wayback Machine, performs necessary parsing, and saves the data.",
    "version": "1.0rc0",
    "project_urls": {
        "Documentation": "https://claromes.github.io/waybacktweets/",
        "Homepage": "https://claromes.github.io/waybacktweets/",
        "Issue Tracker": "https://github.com/claromes/waybacktweets/issues",
        "Repository": "https://github.com/claromes/waybacktweets"
    },
    "split_keywords": [
        "twitter",
        " tweet",
        " internet-archive",
        " wayback-machine",
        " osint-tools",
        " osint",
        " command-line"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7406283550f1b768283cec7982bf1b612644d7a383da34f057af924a966fb0bf",
                "md5": "61ff3062fc9bc6317d298dce0098033b",
                "sha256": "17a079b7a4d2d826bca051a3be5f1f51e4b99b8f605005b8a2997d28cb4e4253"
            },
            "downloads": -1,
            "filename": "waybacktweets-1.0rc0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "61ff3062fc9bc6317d298dce0098033b",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.10",
            "size": 30469,
            "upload_time": "2024-07-18T23:31:59",
            "upload_time_iso_8601": "2024-07-18T23:31:59.154584Z",
            "url": "https://files.pythonhosted.org/packages/74/06/283550f1b768283cec7982bf1b612644d7a383da34f057af924a966fb0bf/waybacktweets-1.0rc0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ab14dcbb0783397374fbe0eb593f0da8b69211e88267c6a3bd789548a2ac7700",
                "md5": "917cfc182b20a81f2654f52edb01e788",
                "sha256": "a6c35822fa7b5c0d569cc9a2f47107589a9e044ea4fdaaaaa651da1d0cd022e4"
            },
            "downloads": -1,
            "filename": "waybacktweets-1.0rc0.tar.gz",
            "has_sig": false,
            "md5_digest": "917cfc182b20a81f2654f52edb01e788",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.10",
            "size": 26731,
            "upload_time": "2024-07-18T23:32:00",
            "upload_time_iso_8601": "2024-07-18T23:32:00.688874Z",
            "url": "https://files.pythonhosted.org/packages/ab/14/dcbb0783397374fbe0eb593f0da8b69211e88267c6a3bd789548a2ac7700/waybacktweets-1.0rc0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-07-18 23:32:00",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "claromes",
    "github_project": "waybacktweets",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "waybacktweets"
}
        
Elapsed time: 0.94283s