oscar-corpus-downloader


Nameoscar-corpus-downloader JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://github.com/jtourille/oscar-corpus-downloader
SummaryOSCAR Corpus Download Tool
upload_time2023-11-09 10:43:59
maintainer
docs_urlNone
authorJulien Tourille
requires_python>=3.9
licenseMIT
keywords oscar corpus
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # OSCAR Corpus Downloader

Simple tool to download the OSCAR corpus.

## 1. Installation

Installation can be done using [pypi](https://pypi.org/project/oscar-corpus-downloader/):

```shell
$ pip install oscar-corpus-downloader
```

## 2. Usage

Submit an OSCAR access request following the procedure described on the [project page](https://oscar-project.org/).

Once you have received your credentials, you can use the command line interface to download an OSCAR corpus part.

```shell
$ export OSCAR_USERNAME=username
$ export OSCAR_PASSWORD=password
$ oscar download --help
Usage: oscar download [OPTIONS]

Options:
  -u, --url TEXT         OSCAR corpus url  [required]
  -o, --output-dir TEXT  Output directory  [required]
  --resume               Resume download
  --help                 Show this message and exit.

$ oscar download \
  --url https://oscar-prive.huma-num.fr/2301/fr_meta \
  -o ./oscar-fr
```


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/jtourille/oscar-corpus-downloader",
    "name": "oscar-corpus-downloader",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": "",
    "keywords": "oscar,corpus",
    "author": "Julien Tourille",
    "author_email": "julien.tourille@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/84/84/3908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b/oscar_corpus_downloader-0.1.0.tar.gz",
    "platform": null,
    "description": "# OSCAR Corpus Downloader\n\nSimple tool to download the OSCAR corpus.\n\n## 1. Installation\n\nInstallation can be done using [pypi](https://pypi.org/project/oscar-corpus-downloader/):\n\n```shell\n$ pip install oscar-corpus-downloader\n```\n\n## 2. Usage\n\nSubmit an OSCAR access request following the procedure described on the [project page](https://oscar-project.org/).\n\nOnce you have received your credentials, you can use the command line interface to download an OSCAR corpus part.\n\n```shell\n$ export OSCAR_USERNAME=username\n$ export OSCAR_PASSWORD=password\n$ oscar download --help\nUsage: oscar download [OPTIONS]\n\nOptions:\n  -u, --url TEXT         OSCAR corpus url  [required]\n  -o, --output-dir TEXT  Output directory  [required]\n  --resume               Resume download\n  --help                 Show this message and exit.\n\n$ oscar download \\\n  --url https://oscar-prive.huma-num.fr/2301/fr_meta \\\n  -o ./oscar-fr\n```\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "OSCAR Corpus Download Tool",
    "version": "0.1.0",
    "project_urls": {
        "Documentation": "https://github.com/jtourille/oscar-corpus-downloader",
        "Homepage": "https://github.com/jtourille/oscar-corpus-downloader",
        "Repository": "https://github.com/jtourille/oscar-corpus-downloader"
    },
    "split_keywords": [
        "oscar",
        "corpus"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "590166fc217f3fba3fbf343264e7fdb64bb80ec4bcd806549306a247dfca4ffa",
                "md5": "0f917de2429b2d8d717d403bda67da3b",
                "sha256": "2e5b4169839cbd09b84cc7a57568957a338be3856714701c6ca35cd6263a8e75"
            },
            "downloads": -1,
            "filename": "oscar_corpus_downloader-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "0f917de2429b2d8d717d403bda67da3b",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9",
            "size": 6225,
            "upload_time": "2023-11-09T10:43:58",
            "upload_time_iso_8601": "2023-11-09T10:43:58.173340Z",
            "url": "https://files.pythonhosted.org/packages/59/01/66fc217f3fba3fbf343264e7fdb64bb80ec4bcd806549306a247dfca4ffa/oscar_corpus_downloader-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "84843908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b",
                "md5": "591d719ba6d291800d7a967fb7ef2157",
                "sha256": "34beddcbd4b43cd52a31aab28ef02b1afea39e2c4bbd63436d15f3997851049e"
            },
            "downloads": -1,
            "filename": "oscar_corpus_downloader-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "591d719ba6d291800d7a967fb7ef2157",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 5346,
            "upload_time": "2023-11-09T10:43:59",
            "upload_time_iso_8601": "2023-11-09T10:43:59.438247Z",
            "url": "https://files.pythonhosted.org/packages/84/84/3908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b/oscar_corpus_downloader-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-11-09 10:43:59",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "jtourille",
    "github_project": "oscar-corpus-downloader",
    "github_not_found": true,
    "lcname": "oscar-corpus-downloader"
}
        
Elapsed time: 0.35864s