zimscan


Namezimscan JSON
Version 0.2.0 PyPI version JSON
download
home_pagehttps://github.com/jojolebarjos/zimscan
SummaryZIM file iterator
upload_time2023-09-18 14:40:51
maintainer
docs_urlNone
authorJohan Berdat
requires_python>=3.6,<4.0
licenseMIT
keywords zim iterator wikipedia gutenberg kiwix
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # ZIM Scan

Minimal ZIM file reader, designed for article streaming.


## Getting Started

Install using pip:

```
pip install zimscan
```

Or from Git repository, for latest version:

```
pip install -U git+https://github.com/jojolebarjos/zimscan.git
```

Iterate over a records, which are binary file-like objects:

```python
from zimscan import Reader

path = "wikipedia_en_all_nopic_2019-10.zim"
with Reader(open(path, "rb"), skip_metadata=True) as reader:
    for record in reader:
        data = record.read()
        ...
```


## Links

 * [ZIM file format](https://openzim.org/wiki/ZIM_file_format), official documentation
 * [Kiwix ZIM repository](http://download.kiwix.org/zim/), to download official ZIM files
 * [Wikipedia ZIM dumps](https://dumps.wikimedia.org/other/kiwix/zim/wikipedia/), to download Wikipedia ZIM files
 * [ZIMply](https://github.com/kimbauters/ZIMply), a ZIM file reader in the browser, in Python
 * [libzim](https://github.com/openzim/libzim), the reference implementation, in C++
 * [pyzim](https://github.com/pediapress/pyzim), Python wrapper for libzim
 * [pyzim](https://framagit.org/mgautierfr/pyzim), another Python wrapper for libzim
 * [Internet In A Box](https://github.com/iiab/internet-in-a-box), a project to bundle open knowledge locally

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/jojolebarjos/zimscan",
    "name": "zimscan",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6,<4.0",
    "maintainer_email": "",
    "keywords": "zim,iterator,wikipedia,gutenberg,kiwix",
    "author": "Johan Berdat",
    "author_email": "jojolebarjos@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/f5/05/9dff0428a4e51d3b63b321f07ccea20dec8ba5c9345458de5a9425fe2ef8/zimscan-0.2.0.tar.gz",
    "platform": null,
    "description": "# ZIM Scan\n\nMinimal ZIM file reader, designed for article streaming.\n\n\n## Getting Started\n\nInstall using pip:\n\n```\npip install zimscan\n```\n\nOr from Git repository, for latest version:\n\n```\npip install -U git+https://github.com/jojolebarjos/zimscan.git\n```\n\nIterate over a records, which are binary file-like objects:\n\n```python\nfrom zimscan import Reader\n\npath = \"wikipedia_en_all_nopic_2019-10.zim\"\nwith Reader(open(path, \"rb\"), skip_metadata=True) as reader:\n    for record in reader:\n        data = record.read()\n        ...\n```\n\n\n## Links\n\n * [ZIM file format](https://openzim.org/wiki/ZIM_file_format), official documentation\n * [Kiwix ZIM repository](http://download.kiwix.org/zim/), to download official ZIM files\n * [Wikipedia ZIM dumps](https://dumps.wikimedia.org/other/kiwix/zim/wikipedia/), to download Wikipedia ZIM files\n * [ZIMply](https://github.com/kimbauters/ZIMply), a ZIM file reader in the browser, in Python\n * [libzim](https://github.com/openzim/libzim), the reference implementation, in C++\n * [pyzim](https://github.com/pediapress/pyzim), Python wrapper for libzim\n * [pyzim](https://framagit.org/mgautierfr/pyzim), another Python wrapper for libzim\n * [Internet In A Box](https://github.com/iiab/internet-in-a-box), a project to bundle open knowledge locally\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "ZIM file iterator",
    "version": "0.2.0",
    "project_urls": {
        "Homepage": "https://github.com/jojolebarjos/zimscan",
        "Repository": "https://github.com/jojolebarjos/zimscan"
    },
    "split_keywords": [
        "zim",
        "iterator",
        "wikipedia",
        "gutenberg",
        "kiwix"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b0a9258ab47a503e312aff7138fca28852eadde9e6410e28ceba1907fae4fc97",
                "md5": "360d21d8caa57552b85840cc421de756",
                "sha256": "0511fd14048cb595795d967c1549ed03dc5f36d73885bca71135ad72a2e9de3a"
            },
            "downloads": -1,
            "filename": "zimscan-0.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "360d21d8caa57552b85840cc421de756",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6,<4.0",
            "size": 6922,
            "upload_time": "2023-09-18T14:40:49",
            "upload_time_iso_8601": "2023-09-18T14:40:49.187539Z",
            "url": "https://files.pythonhosted.org/packages/b0/a9/258ab47a503e312aff7138fca28852eadde9e6410e28ceba1907fae4fc97/zimscan-0.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f5059dff0428a4e51d3b63b321f07ccea20dec8ba5c9345458de5a9425fe2ef8",
                "md5": "5715ad61c8410a3fc05040bf525198aa",
                "sha256": "3e2b8c60532dcb9834c2b35a2ef1743d97d8074a95df1c36d921d9a1d007645d"
            },
            "downloads": -1,
            "filename": "zimscan-0.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "5715ad61c8410a3fc05040bf525198aa",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6,<4.0",
            "size": 6040,
            "upload_time": "2023-09-18T14:40:51",
            "upload_time_iso_8601": "2023-09-18T14:40:51.713204Z",
            "url": "https://files.pythonhosted.org/packages/f5/05/9dff0428a4e51d3b63b321f07ccea20dec8ba5c9345458de5a9425fe2ef8/zimscan-0.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-09-18 14:40:51",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "jojolebarjos",
    "github_project": "zimscan",
    "github_not_found": true,
    "lcname": "zimscan"
}
        
Elapsed time: 1.92209s