# Webarchiver
![PyPI - Version](https://img.shields.io/pypi/v/webarchiver)
![PyPI - Downloads](https://img.shields.io/pypi/dd/webarchiver)
![GitHub Repo stars](https://img.shields.io/github/stars/Knuckles-Team/webarchiver)
![GitHub forks](https://img.shields.io/github/forks/Knuckles-Team/webarchiver)
![GitHub contributors](https://img.shields.io/github/contributors/Knuckles-Team/webarchiver)
![PyPI - License](https://img.shields.io/pypi/l/webarchiver)
![GitHub](https://img.shields.io/github/license/Knuckles-Team/webarchiver)
![GitHub last commit (by committer)](https://img.shields.io/github/last-commit/Knuckles-Team/webarchiver)
![GitHub pull requests](https://img.shields.io/github/issues-pr/Knuckles-Team/webarchiver)
![GitHub closed pull requests](https://img.shields.io/github/issues-pr-closed/Knuckles-Team/webarchiver)
![GitHub issues](https://img.shields.io/github/issues/Knuckles-Team/webarchiver)
![GitHub top language](https://img.shields.io/github/languages/top/Knuckles-Team/webarchiver)
![GitHub language count](https://img.shields.io/github/languages/count/Knuckles-Team/webarchiver)
![GitHub repo size](https://img.shields.io/github/repo-size/Knuckles-Team/webarchiver)
![GitHub repo file count (file type)](https://img.shields.io/github/directory-file-count/Knuckles-Team/webarchiver)
![PyPI - Wheel](https://img.shields.io/pypi/wheel/webarchiver)
![PyPI - Implementation](https://img.shields.io/pypi/implementation/webarchiver)
*Version: 0.32.7*
Python tool that allows you to take full page screenshots of pages without ads
Supports batching by adding multiple links in a text file, or my adding links to command line separated by commas.
This repository is actively maintained - Contributions are welcome!
#### Requirements:
One of the following browsers:
- Chrome/Chromium browser
- Firefox
- Selenoid Server
<details>
<summary><b>Usage:</b></summary>
| Short Flag | Long Flag | Description |
|------------|--------------|------------------------------------------------------------|
| -h | --help | See Usage |
| -b | --browser | Specify browser: Chrome / Firefox / Selenoid |
| -c | --clean | Convert mobile sites to regular site |
| -d | --directory | Location where the images will be saved |
| | --dpi | DPI for the image |
| -e | --executor | Execution environment: Local / Selenoid Host\|Selenoid URL |
| -f | --file | Text file to read the URL(s) from |
| -l | --links | Comma separated URL(s) |
| -i | --image-type | Save images as PNG or JPEG |
| -p | --processes | Number of processes to run concurrently |
| -s | --scrape | Scrape URL(s) by Downloading |
| -u | --url-filter | Filter URL(s) that contain this string |
| -z | --zoom | The zoom to use on the browser |
</details>
<details>
<summary><b>Example:</b></summary>
```bash
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'jpeg' -d "~/Downloads" -z 100 --dpi 1 --browser "Firefox"
```
```bash
webarchiver -c -f <links_file.txt> -l "<URL1,URL2,URL3>" -i 'png' -d "~/Downloads" -z 100 --dpi 1 --executor "selenoid|http://selenoid.com/wd/hub" --browser "Chrome"
```
```bash
webarchiver -s -f <links_file.txt> -l "<URL1,URL2,URL3>"
```
</details>
<details>
<summary><b>Installation Instructions:</b></summary>
Install Python Package
```bash
python -m pip install webarchiver
```
</details>
## Geniusbot Application
Use with a GUI through Geniusbot
Visit our [GitHub](https://github.com/Knuckles-Team/geniusbot) for more information
<details>
<summary><b>Installation Instructions with Geniusbot:</b></summary>
Install Python Package
```bash
python -m pip install geniusbot
```
</details>
<details>
<summary><b>Repository Owners:</b></summary>
<img width="100%" height="180em" src="https://github-readme-stats.vercel.app/api?username=Knucklessg1&show_icons=true&hide_border=true&&count_private=true&include_all_commits=true" />
![GitHub followers](https://img.shields.io/github/followers/Knucklessg1)
![GitHub User's stars](https://img.shields.io/github/stars/Knucklessg1)
</details>
Raw data
{
"_id": null,
"home_page": "https://github.com/Knuckles-Team/webarchiver",
"name": "webarchiver",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": null,
"author": "Audel Rouhi",
"author_email": "knucklessg1@gmail.com",
"download_url": null,
"platform": null,
"description": "# Webarchiver\n\n![PyPI - Version](https://img.shields.io/pypi/v/webarchiver)\n![PyPI - Downloads](https://img.shields.io/pypi/dd/webarchiver)\n![GitHub Repo stars](https://img.shields.io/github/stars/Knuckles-Team/webarchiver)\n![GitHub forks](https://img.shields.io/github/forks/Knuckles-Team/webarchiver)\n![GitHub contributors](https://img.shields.io/github/contributors/Knuckles-Team/webarchiver)\n![PyPI - License](https://img.shields.io/pypi/l/webarchiver)\n![GitHub](https://img.shields.io/github/license/Knuckles-Team/webarchiver)\n\n![GitHub last commit (by committer)](https://img.shields.io/github/last-commit/Knuckles-Team/webarchiver)\n![GitHub pull requests](https://img.shields.io/github/issues-pr/Knuckles-Team/webarchiver)\n![GitHub closed pull requests](https://img.shields.io/github/issues-pr-closed/Knuckles-Team/webarchiver)\n![GitHub issues](https://img.shields.io/github/issues/Knuckles-Team/webarchiver)\n\n![GitHub top language](https://img.shields.io/github/languages/top/Knuckles-Team/webarchiver)\n![GitHub language count](https://img.shields.io/github/languages/count/Knuckles-Team/webarchiver)\n![GitHub repo size](https://img.shields.io/github/repo-size/Knuckles-Team/webarchiver)\n![GitHub repo file count (file type)](https://img.shields.io/github/directory-file-count/Knuckles-Team/webarchiver)\n![PyPI - Wheel](https://img.shields.io/pypi/wheel/webarchiver)\n![PyPI - Implementation](https://img.shields.io/pypi/implementation/webarchiver)\n\n*Version: 0.32.7*\n\nPython tool that allows you to take full page screenshots of pages without ads\n\nSupports batching by adding multiple links in a text file, or my adding links to command line separated by commas.\n\nThis repository is actively maintained - Contributions are welcome!\n\n#### Requirements:\n\nOne of the following browsers:\n\n- Chrome/Chromium browser\n- Firefox\n- Selenoid Server\n\n<details>\n <summary><b>Usage:</b></summary>\n\n| Short Flag | Long Flag | Description |\n|------------|--------------|------------------------------------------------------------|\n| -h | --help | See Usage |\n| -b | --browser | Specify browser: Chrome / Firefox / Selenoid |\n| -c | --clean | Convert mobile sites to regular site |\n| -d | --directory | Location where the images will be saved |\n| | --dpi | DPI for the image |\n| -e | --executor | Execution environment: Local / Selenoid Host\\|Selenoid URL |\n| -f | --file | Text file to read the URL(s) from |\n| -l | --links | Comma separated URL(s) |\n| -i | --image-type | Save images as PNG or JPEG |\n| -p | --processes | Number of processes to run concurrently |\n| -s | --scrape | Scrape URL(s) by Downloading |\n| -u | --url-filter | Filter URL(s) that contain this string |\n| -z | --zoom | The zoom to use on the browser |\n\n</details>\n\n<details>\n <summary><b>Example:</b></summary>\n\n```bash\nwebarchiver -c -f <links_file.txt> -l \"<URL1,URL2,URL3>\" -i 'jpeg' -d \"~/Downloads\" -z 100 --dpi 1 --browser \"Firefox\"\n```\n\n```bash\nwebarchiver -c -f <links_file.txt> -l \"<URL1,URL2,URL3>\" -i 'png' -d \"~/Downloads\" -z 100 --dpi 1 --executor \"selenoid|http://selenoid.com/wd/hub\" --browser \"Chrome\"\n```\n\n```bash\nwebarchiver -s -f <links_file.txt> -l \"<URL1,URL2,URL3>\"\n```\n\n</details>\n\n<details>\n <summary><b>Installation Instructions:</b></summary>\n\nInstall Python Package\n\n```bash\npython -m pip install webarchiver\n```\n\n</details>\n\n\n## Geniusbot Application\n\nUse with a GUI through Geniusbot\n\nVisit our [GitHub](https://github.com/Knuckles-Team/geniusbot) for more information\n\n<details>\n <summary><b>Installation Instructions with Geniusbot:</b></summary>\n\nInstall Python Package\n\n```bash\npython -m pip install geniusbot\n```\n\n</details>\n\n<details>\n <summary><b>Repository Owners:</b></summary>\n\n\n<img width=\"100%\" height=\"180em\" src=\"https://github-readme-stats.vercel.app/api?username=Knucklessg1&show_icons=true&hide_border=true&&count_private=true&include_all_commits=true\" />\n\n![GitHub followers](https://img.shields.io/github/followers/Knucklessg1)\n![GitHub User's stars](https://img.shields.io/github/stars/Knucklessg1)\n</details>\n\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Python tool that allows you to take multiple full page screenshots of web pages without ads.",
"version": "0.32.7",
"project_urls": {
"Homepage": "https://github.com/Knuckles-Team/webarchiver"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "9f8ef82b200da4a91b44baa9971991f0b3a1c2b41f7d78ffb6d0f885c524b689",
"md5": "a427ce8388972354f7b1c41bfb95f326",
"sha256": "c5f6ed2ec683a741fc8a2d5f65fb4c54ea411fcd09466fd07f2d72ae84513b27"
},
"downloads": -1,
"filename": "webarchiver-0.32.7-py2.py3-none-any.whl",
"has_sig": false,
"md5_digest": "a427ce8388972354f7b1c41bfb95f326",
"packagetype": "bdist_wheel",
"python_version": "py2.py3",
"requires_python": null,
"size": 5913572,
"upload_time": "2024-05-09T04:22:43",
"upload_time_iso_8601": "2024-05-09T04:22:43.617825Z",
"url": "https://files.pythonhosted.org/packages/9f/8e/f82b200da4a91b44baa9971991f0b3a1c2b41f7d78ffb6d0f885c524b689/webarchiver-0.32.7-py2.py3-none-any.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-05-09 04:22:43",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "Knuckles-Team",
"github_project": "webarchiver",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [],
"lcname": "webarchiver"
}