# Efficiently run Augustus in multi-core environments
`parallel_augustus` cuts the input genome in chunks to feed to Augustus. The number of chunks and Augustus processes launched in parallel are configurable.
# Installation
```
pip install --user parallel_augustus
```
# Requirements
- Python >= 3.8
- Augustus
# Execution
Before launching `parallel_augustus`, please make sure that Augustus is available in your path. If you run `augustus -h` and do not encounter an error, you are good to go.
`parallel_augustus` first divides the genome (`-g` argument) in the desired number of chunks (`-c` argument, it can be any number > 1). After that it launches `-p` processes of Augustus in parallel until there is no chunks left. You can pass parameters to Augustus via the `--extra` flag. At least a `--species=thing` is required by Augustus.
_IMPORTANT_: the `--extra` argument should be the last one in the command line due to shortcomings in the `argparse` python module.
With all that in mind, a typical `parallel_augustus` command line will look like this:
```bash
parallel_augustus -g genome.fasta \
-o augustus_results \
-c 500 \
-p 20 \
--extra '--species=human'
```
This command creates the output directory `augustus_results`, divides the genome into 500 chunks and launches 20 processes of Augustus in parallel. At the end, results are concatenated into `augustus_results/augustus.gff`.
Raw data
{
"_id": null,
"home_page": "https://github.com/bistace/parallel_augustus",
"name": "parallel-augustus",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "bioinformatics,genomics,genome,annotation,augustus",
"author": "['Benjamin Istace']",
"author_email": "",
"download_url": "https://files.pythonhosted.org/packages/6f/bd/744ff21645472eab07f94aca64247b95554dedafbb189c0aa726d4d9fed1/parallel_augustus-1.0.2.tar.gz",
"platform": null,
"description": "# Efficiently run Augustus in multi-core environments\n`parallel_augustus` cuts the input genome in chunks to feed to Augustus. The number of chunks and Augustus processes launched in parallel are configurable.\n\n# Installation\n```\npip install --user parallel_augustus\n```\n\n# Requirements\n - Python >= 3.8\n - Augustus\n\n# Execution\nBefore launching `parallel_augustus`, please make sure that Augustus is available in your path. If you run `augustus -h` and do not encounter an error, you are good to go.\n\n`parallel_augustus` first divides the genome (`-g` argument) in the desired number of chunks (`-c` argument, it can be any number > 1). After that it launches `-p` processes of Augustus in parallel until there is no chunks left. You can pass parameters to Augustus via the `--extra` flag. At least a `--species=thing` is required by Augustus. \n\n_IMPORTANT_: the `--extra` argument should be the last one in the command line due to shortcomings in the `argparse` python module.\n\nWith all that in mind, a typical `parallel_augustus` command line will look like this:\n```bash\nparallel_augustus -g genome.fasta \\\n -o augustus_results \\\n -c 500 \\\n -p 20 \\\n --extra '--species=human'\n```\nThis command creates the output directory `augustus_results`, divides the genome into 500 chunks and launches 20 processes of Augustus in parallel. At the end, results are concatenated into `augustus_results/augustus.gff`.\n",
"bugtrack_url": null,
"license": "CeCILL",
"summary": "Simple wrapper around Augustus to bring faster restitution times",
"version": "1.0.2",
"project_urls": {
"Download": "https://github.com/bistace/parallel_augustus",
"Homepage": "https://github.com/bistace/parallel_augustus"
},
"split_keywords": [
"bioinformatics",
"genomics",
"genome",
"annotation",
"augustus"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "eaeb15f09a3d99ffbcd1a7287f952e40839cbc1543ded5e977249f2ada8fcaa6",
"md5": "6104c0b0f38aa27fe19b367f3a6c5bb4",
"sha256": "93f85d7c567f7396abe238c2ad64ba6d2f74f6bffef1bcaa22aa70341cf08161"
},
"downloads": -1,
"filename": "parallel_augustus-1.0.2-py3-none-any.whl",
"has_sig": false,
"md5_digest": "6104c0b0f38aa27fe19b367f3a6c5bb4",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 13185,
"upload_time": "2023-10-19T10:07:57",
"upload_time_iso_8601": "2023-10-19T10:07:57.854123Z",
"url": "https://files.pythonhosted.org/packages/ea/eb/15f09a3d99ffbcd1a7287f952e40839cbc1543ded5e977249f2ada8fcaa6/parallel_augustus-1.0.2-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "6fbd744ff21645472eab07f94aca64247b95554dedafbb189c0aa726d4d9fed1",
"md5": "24bdc08ebeee97f6a6ddb1e4f3229096",
"sha256": "dbede448a3c3141bacfa7903064e7d69c3dbd763c69000a6bc764bbbcdb3409a"
},
"downloads": -1,
"filename": "parallel_augustus-1.0.2.tar.gz",
"has_sig": false,
"md5_digest": "24bdc08ebeee97f6a6ddb1e4f3229096",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 12202,
"upload_time": "2023-10-19T10:07:59",
"upload_time_iso_8601": "2023-10-19T10:07:59.540277Z",
"url": "https://files.pythonhosted.org/packages/6f/bd/744ff21645472eab07f94aca64247b95554dedafbb189c0aa726d4d9fed1/parallel_augustus-1.0.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-10-19 10:07:59",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "bistace",
"github_project": "parallel_augustus",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "parallel-augustus"
}