webarchiver


Namewebarchiver JSON
Version 0.32.7 PyPI version JSON
download
home_pagehttps://github.com/Knuckles-Team/webarchiver
SummaryPython tool that allows you to take multiple full page screenshots of web pages without ads.
upload_time2024-05-09 04:22:43
maintainerNone
docs_urlNone
authorAudel Rouhi
requires_pythonNone
licenseMIT
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Webarchiver

![PyPI - Version](https://img.shields.io/pypi/v/webarchiver)
![PyPI - Downloads](https://img.shields.io/pypi/dd/webarchiver)
![GitHub Repo stars](https://img.shields.io/github/stars/Knuckles-Team/webarchiver)
![GitHub forks](https://img.shields.io/github/forks/Knuckles-Team/webarchiver)
![GitHub contributors](https://img.shields.io/github/contributors/Knuckles-Team/webarchiver)
![PyPI - License](https://img.shields.io/pypi/l/webarchiver)
![GitHub](https://img.shields.io/github/license/Knuckles-Team/webarchiver)

![GitHub last commit (by committer)](https://img.shields.io/github/last-commit/Knuckles-Team/webarchiver)
![GitHub pull requests](https://img.shields.io/github/issues-pr/Knuckles-Team/webarchiver)
![GitHub closed pull requests](https://img.shields.io/github/issues-pr-closed/Knuckles-Team/webarchiver)
![GitHub issues](https://img.shields.io/github/issues/Knuckles-Team/webarchiver)

![GitHub top language](https://img.shields.io/github/languages/top/Knuckles-Team/webarchiver)
![GitHub language count](https://img.shields.io/github/languages/count/Knuckles-Team/webarchiver)
![GitHub repo size](https://img.shields.io/github/repo-size/Knuckles-Team/webarchiver)
![GitHub repo file count (file type)](https://img.shields.io/github/directory-file-count/Knuckles-Team/webarchiver)
![PyPI - Wheel](https://img.shields.io/pypi/wheel/webarchiver)
![PyPI - Implementation](https://img.shields.io/pypi/implementation/webarchiver)

*Version: 0.32.7*

Python tool that allows you to take full page screenshots of pages without ads

Supports batching by adding multiple links in a text file, or my adding links to command line separated by commas.

This repository is actively maintained - Contributions are welcome!

#### Requirements:

One of the following browsers:

- Chrome/Chromium browser
- Firefox
- Selenoid Server

<details>
  <summary><b>Usage:</b></summary>

| Short Flag | Long Flag    | Description                                                |
|------------|--------------|------------------------------------------------------------|
| -h         | --help       | See Usage                                                  |
| -b         | --browser    | Specify browser: Chrome / Firefox / Selenoid               |
| -c         | --clean      | Convert mobile sites to regular site                       |
| -d         | --directory  | Location where the images will be saved                    |
|            | --dpi        | DPI for the image                                          |
| -e         | --executor   | Execution environment: Local / Selenoid Host\|Selenoid URL |
| -f         | --file       | Text file to read the URL(s) from                          |
| -l         | --links      | Comma separated URL(s)                                     |
| -i         | --image-type | Save images as PNG or JPEG                                 |
| -p         | --processes  | Number of processes to run concurrently                    |
| -s         | --scrape     | Scrape URL(s) by Downloading                               |
| -u         | --url-filter | Filter URL(s) that contain this string                     |
| -z         | --zoom       | The zoom to use on the browser                             |

</details>

<details>
  <summary><b>Example:</b></summary>

```bash
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'jpeg' -d "~/Downloads" -z 100 --dpi 1 --browser "Firefox"
```

```bash
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'png' -d "~/Downloads" -z 100 --dpi 1 --executor "selenoid|http://selenoid.com/wd/hub" --browser "Chrome"
```

```bash
webarchiver -s -f <links_file.txt> -l "<URL1,URL2,URL3>"
```

</details>

<details>
  <summary><b>Installation Instructions:</b></summary>

Install Python Package

```bash
python -m pip install webarchiver
```

</details>


## Geniusbot Application

Use with a GUI through Geniusbot

Visit our [GitHub](https://github.com/Knuckles-Team/geniusbot) for more information

<details>
  <summary><b>Installation Instructions with Geniusbot:</b></summary>

Install Python Package

```bash
python -m pip install geniusbot
```

</details>

<details>
  <summary><b>Repository Owners:</b></summary>


<img width="100%" height="180em" src="https://github-readme-stats.vercel.app/api?username=Knucklessg1&show_icons=true&hide_border=true&&count_private=true&include_all_commits=true" />

![GitHub followers](https://img.shields.io/github/followers/Knucklessg1)
![GitHub User's stars](https://img.shields.io/github/stars/Knucklessg1)
</details>



            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/Knuckles-Team/webarchiver",
    "name": "webarchiver",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": null,
    "author": "Audel Rouhi",
    "author_email": "knucklessg1@gmail.com",
    "download_url": null,
    "platform": null,
    "description": "# Webarchiver\n\n![PyPI - Version](https://img.shields.io/pypi/v/webarchiver)\n![PyPI - Downloads](https://img.shields.io/pypi/dd/webarchiver)\n![GitHub Repo stars](https://img.shields.io/github/stars/Knuckles-Team/webarchiver)\n![GitHub forks](https://img.shields.io/github/forks/Knuckles-Team/webarchiver)\n![GitHub contributors](https://img.shields.io/github/contributors/Knuckles-Team/webarchiver)\n![PyPI - License](https://img.shields.io/pypi/l/webarchiver)\n![GitHub](https://img.shields.io/github/license/Knuckles-Team/webarchiver)\n\n![GitHub last commit (by committer)](https://img.shields.io/github/last-commit/Knuckles-Team/webarchiver)\n![GitHub pull requests](https://img.shields.io/github/issues-pr/Knuckles-Team/webarchiver)\n![GitHub closed pull requests](https://img.shields.io/github/issues-pr-closed/Knuckles-Team/webarchiver)\n![GitHub issues](https://img.shields.io/github/issues/Knuckles-Team/webarchiver)\n\n![GitHub top language](https://img.shields.io/github/languages/top/Knuckles-Team/webarchiver)\n![GitHub language count](https://img.shields.io/github/languages/count/Knuckles-Team/webarchiver)\n![GitHub repo size](https://img.shields.io/github/repo-size/Knuckles-Team/webarchiver)\n![GitHub repo file count (file type)](https://img.shields.io/github/directory-file-count/Knuckles-Team/webarchiver)\n![PyPI - Wheel](https://img.shields.io/pypi/wheel/webarchiver)\n![PyPI - Implementation](https://img.shields.io/pypi/implementation/webarchiver)\n\n*Version: 0.32.7*\n\nPython tool that allows you to take full page screenshots of pages without ads\n\nSupports batching by adding multiple links in a text file, or my adding links to command line separated by commas.\n\nThis repository is actively maintained - Contributions are welcome!\n\n#### Requirements:\n\nOne of the following browsers:\n\n- Chrome/Chromium browser\n- Firefox\n- Selenoid Server\n\n<details>\n  <summary><b>Usage:</b></summary>\n\n| Short Flag | Long Flag    | Description                                                |\n|------------|--------------|------------------------------------------------------------|\n| -h         | --help       | See Usage                                                  |\n| -b         | --browser    | Specify browser: Chrome / Firefox / Selenoid               |\n| -c         | --clean      | Convert mobile sites to regular site                       |\n| -d         | --directory  | Location where the images will be saved                    |\n|            | --dpi        | DPI for the image                                          |\n| -e         | --executor   | Execution environment: Local / Selenoid Host\\|Selenoid URL |\n| -f         | --file       | Text file to read the URL(s) from                          |\n| -l         | --links      | Comma separated URL(s)                                     |\n| -i         | --image-type | Save images as PNG or JPEG                                 |\n| -p         | --processes  | Number of processes to run concurrently                    |\n| -s         | --scrape     | Scrape URL(s) by Downloading                               |\n| -u         | --url-filter | Filter URL(s) that contain this string                     |\n| -z         | --zoom       | The zoom to use on the browser                             |\n\n</details>\n\n<details>\n  <summary><b>Example:</b></summary>\n\n```bash\nwebarchiver -c -f <links_file.txt> -l \"<URL1,URL2,URL3>\" -i 'jpeg' -d \"~/Downloads\" -z 100 --dpi 1 --browser \"Firefox\"\n```\n\n```bash\nwebarchiver -c -f <links_file.txt> -l \"<URL1,URL2,URL3>\" -i 'png' -d \"~/Downloads\" -z 100 --dpi 1 --executor \"selenoid|http://selenoid.com/wd/hub\" --browser \"Chrome\"\n```\n\n```bash\nwebarchiver -s -f <links_file.txt> -l \"<URL1,URL2,URL3>\"\n```\n\n</details>\n\n<details>\n  <summary><b>Installation Instructions:</b></summary>\n\nInstall Python Package\n\n```bash\npython -m pip install webarchiver\n```\n\n</details>\n\n\n## Geniusbot Application\n\nUse with a GUI through Geniusbot\n\nVisit our [GitHub](https://github.com/Knuckles-Team/geniusbot) for more information\n\n<details>\n  <summary><b>Installation Instructions with Geniusbot:</b></summary>\n\nInstall Python Package\n\n```bash\npython -m pip install geniusbot\n```\n\n</details>\n\n<details>\n  <summary><b>Repository Owners:</b></summary>\n\n\n<img width=\"100%\" height=\"180em\" src=\"https://github-readme-stats.vercel.app/api?username=Knucklessg1&show_icons=true&hide_border=true&&count_private=true&include_all_commits=true\" />\n\n![GitHub followers](https://img.shields.io/github/followers/Knucklessg1)\n![GitHub User's stars](https://img.shields.io/github/stars/Knucklessg1)\n</details>\n\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Python tool that allows you to take multiple full page screenshots of web pages without ads.",
    "version": "0.32.7",
    "project_urls": {
        "Homepage": "https://github.com/Knuckles-Team/webarchiver"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9f8ef82b200da4a91b44baa9971991f0b3a1c2b41f7d78ffb6d0f885c524b689",
                "md5": "a427ce8388972354f7b1c41bfb95f326",
                "sha256": "c5f6ed2ec683a741fc8a2d5f65fb4c54ea411fcd09466fd07f2d72ae84513b27"
            },
            "downloads": -1,
            "filename": "webarchiver-0.32.7-py2.py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "a427ce8388972354f7b1c41bfb95f326",
            "packagetype": "bdist_wheel",
            "python_version": "py2.py3",
            "requires_python": null,
            "size": 5913572,
            "upload_time": "2024-05-09T04:22:43",
            "upload_time_iso_8601": "2024-05-09T04:22:43.617825Z",
            "url": "https://files.pythonhosted.org/packages/9f/8e/f82b200da4a91b44baa9971991f0b3a1c2b41f7d78ffb6d0f885c524b689/webarchiver-0.32.7-py2.py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-05-09 04:22:43",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "Knuckles-Team",
    "github_project": "webarchiver",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "webarchiver"
}
        
Elapsed time: 0.33912s