# OSCAR Corpus Downloader
A simple command-line tool for downloading the OSCAR corpus.
## 1. Installation
Install the package from [PyPI](https://pypi.org/project/oscar-corpus-downloader/):
```shell
$ pip install oscar-corpus-downloader
```
## 2. Usage
First, submit an OSCAR access request by following the procedure described on the [project page](https://oscar-project.org/).
Once you have received your credentials, export them as environment variables and use the command-line interface to download a part of the OSCAR corpus.
```shell
$ export OSCAR_USERNAME=username
$ export OSCAR_PASSWORD=password
$ oscar download --help
Usage: oscar download [OPTIONS]

Options:
  -u, --url TEXT         OSCAR corpus url  [required]
  -o, --output-dir TEXT  Output directory  [required]
  --resume               Resume download
  --help                 Show this message and exit.

$ oscar download \
    --url https://oscar-prive.huma-num.fr/2301/fr_meta \
    -o ./oscar-fr
```
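Large corpus parts can take a long time to fetch, so it may help to wrap the command in a small retry loop. The sketch below is a hypothetical bash helper, not part of the `oscar` CLI: the function name `download_with_resume` and the variables `OSCAR_CMD`, `MAX_TRIES`, and `RETRY_DELAY` are assumptions made for illustration. It also assumes that `--resume` continues an interrupted download, as its help text suggests, and that the command exits with a non-zero status on failure.

```shell
# Hypothetical helper (bash): retry a download, adding --resume after a failure.
# Usage: download_with_resume <url> <output-dir>
download_with_resume() {
  local url="$1" out_dir="$2"
  local max_tries="${MAX_TRIES:-3}"   # total number of attempts
  local cmd="${OSCAR_CMD:-oscar}"     # set OSCAR_CMD="echo oscar" for a dry run
  local try=1

  # First attempt: a plain download, exactly as in the example above.
  if $cmd download --url "$url" -o "$out_dir"; then
    return 0
  fi

  # Later attempts: pass --resume so the interrupted download is continued.
  while [ "$try" -lt "$max_tries" ]; do
    try=$((try + 1))
    echo "Attempt $try of $max_tries (with --resume)" >&2
    sleep "${RETRY_DELAY:-5}"
    if $cmd download --url "$url" -o "$out_dir" --resume; then
      return 0
    fi
  done

  # Every attempt failed.
  return 1
}
```

With `OSCAR_USERNAME` and `OSCAR_PASSWORD` exported, the helper could be called as `download_with_resume https://oscar-prive.huma-num.fr/2301/fr_meta ./oscar-fr`. Setting `OSCAR_CMD="echo oscar"` beforehand prints the command instead of running it, which is a cheap way to check the arguments.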
Raw data
{
"_id": null,
"home_page": "https://github.com/jtourille/oscar-corpus-downloader",
"name": "oscar-corpus-downloader",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.9",
"maintainer_email": "",
"keywords": "oscar,corpus",
"author": "Julien Tourille",
"author_email": "julien.tourille@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/84/84/3908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b/oscar_corpus_downloader-0.1.0.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": "MIT",
"summary": "OSCAR Corpus Download Tool",
"version": "0.1.0",
"project_urls": {
"Documentation": "https://github.com/jtourille/oscar-corpus-downloader",
"Homepage": "https://github.com/jtourille/oscar-corpus-downloader",
"Repository": "https://github.com/jtourille/oscar-corpus-downloader"
},
"split_keywords": [
"oscar",
"corpus"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "590166fc217f3fba3fbf343264e7fdb64bb80ec4bcd806549306a247dfca4ffa",
"md5": "0f917de2429b2d8d717d403bda67da3b",
"sha256": "2e5b4169839cbd09b84cc7a57568957a338be3856714701c6ca35cd6263a8e75"
},
"downloads": -1,
"filename": "oscar_corpus_downloader-0.1.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "0f917de2429b2d8d717d403bda67da3b",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.9",
"size": 6225,
"upload_time": "2023-11-09T10:43:58",
"upload_time_iso_8601": "2023-11-09T10:43:58.173340Z",
"url": "https://files.pythonhosted.org/packages/59/01/66fc217f3fba3fbf343264e7fdb64bb80ec4bcd806549306a247dfca4ffa/oscar_corpus_downloader-0.1.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "84843908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b",
"md5": "591d719ba6d291800d7a967fb7ef2157",
"sha256": "34beddcbd4b43cd52a31aab28ef02b1afea39e2c4bbd63436d15f3997851049e"
},
"downloads": -1,
"filename": "oscar_corpus_downloader-0.1.0.tar.gz",
"has_sig": false,
"md5_digest": "591d719ba6d291800d7a967fb7ef2157",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.9",
"size": 5346,
"upload_time": "2023-11-09T10:43:59",
"upload_time_iso_8601": "2023-11-09T10:43:59.438247Z",
"url": "https://files.pythonhosted.org/packages/84/84/3908f271ab6a6fc949ac79a1b6f52fa8ded9160f36a237d11c562740e95b/oscar_corpus_downloader-0.1.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-11-09 10:43:59",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "jtourille",
"github_project": "oscar-corpus-downloader",
"github_not_found": true,
"lcname": "oscar-corpus-downloader"
}