much


Namemuch JSON
Version 0.0.7 PyPI version JSON
download
home_pagehttps://github.com/zeionara/much
SummaryA simple utility for crawling text from 2ch
upload_time2023-11-10 11:49:31
maintainer
docs_urlNone
authorZeio Nara
requires_python
licenseApache 2.0
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # much

<p align="center">
    <img src="https://i.ibb.co/zZYT9hg/logo.png"/>
    <!--<img src="assets/logo.png"/>-->
</p>

A simple utility for crawling text from 2ch

## Usage

The command `pull` requires two attributes - url of the web page to fetch and path to output file with `json` or `txt` extension depending on required output file format. For example:

```sh
python -m much pull https://2ch.hk/b/arch/2018-08-22/res/181770037.html assets/stories.txt
```

To fetch archived threads on `17`th page:

```sh
python -m much fetch 17
```

To list top `10` fetched threads by size (cumulative number of characters in messages longer than 100 symbols):

```sh
python -m much top 10
```

To star a thread (copy it to folder `assets/starred` with a given name):

```sh
python -m much star 263473351 discussion
```

## Installation

To install through pip:

```sh
pip install much
```

To install dependencies and create conda environment:

```sh
conda env create -f environment.yml
```

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/zeionara/much",
    "name": "much",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "",
    "author": "Zeio Nara",
    "author_email": "zeionara@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/ec/be/4d365446d8d1f0564e141fa9aa4b5f595450a15b3a8860ffd6fa7bcf82e0/much-0.0.7.tar.gz",
    "platform": null,
    "description": "# much\n\n<p align=\"center\">\n    <img src=\"https://i.ibb.co/zZYT9hg/logo.png\"/>\n    <!--<img src=\"assets/logo.png\"/>-->\n</p>\n\nA simple utility for crawling text from 2ch\n\n## Usage\n\nThe command `pull` requires two attributes - url of the web page to fetch and path to output file with `json` or `txt` extension depending on required output file format. For example:\n\n```sh\npython -m much pull https://2ch.hk/b/arch/2018-08-22/res/181770037.html assets/stories.txt\n```\n\nTo fetch archived threads on `17`th page:\n\n```sh\npython -m much fetch 17\n```\n\nTo list top `10` fetched threads by size (cumulative number of characters in messages longer than 100 symbols):\n\n```sh\npython -m much top 10\n```\n\nTo star a thread (copy it to folder `assets/starred` with a given name):\n\n```sh\npython -m much star 263473351 discussion\n```\n\n## Installation\n\nTo install through pip:\n\n```sh\npip install much\n```\n\nTo install dependencies and create conda environment:\n\n```sh\nconda env create -f environment.yml\n```\n",
    "bugtrack_url": null,
    "license": "Apache 2.0",
    "summary": "A simple utility for crawling text from 2ch",
    "version": "0.0.7",
    "project_urls": {
        "Bug Reports": "https://github.com/zeionara/much/issues",
        "Documentation": "https://github.com/zeionara/much#readme",
        "Homepage": "https://github.com/zeionara/much",
        "Source Code": "https://github.com/zeionara/much"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ecbe4d365446d8d1f0564e141fa9aa4b5f595450a15b3a8860ffd6fa7bcf82e0",
                "md5": "5eb1559b4ca84ee5d260f117e607e6ab",
                "sha256": "b20fce3387addb839dfe205dc4ee0f4aa2a76329304715adb81a07bb5b0694d1"
            },
            "downloads": -1,
            "filename": "much-0.0.7.tar.gz",
            "has_sig": false,
            "md5_digest": "5eb1559b4ca84ee5d260f117e607e6ab",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 10344,
            "upload_time": "2023-11-10T11:49:31",
            "upload_time_iso_8601": "2023-11-10T11:49:31.381124Z",
            "url": "https://files.pythonhosted.org/packages/ec/be/4d365446d8d1f0564e141fa9aa4b5f595450a15b3a8860ffd6fa7bcf82e0/much-0.0.7.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-11-10 11:49:31",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "zeionara",
    "github_project": "much",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "much"
}
        
Elapsed time: 0.23140s