Name | buscolite JSON |
Version |
24.11.3
JSON |
| download |
home_page | None |
Summary | busco analysis for gene predictions |
upload_time | 2024-11-04 06:26:38 |
maintainer | None |
docs_url | None |
author | None |
requires_python | >=3.6.0 |
license | BSD 2-Clause License Copyright (c) 2022, Jonathan M. Palmer All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: * Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. * Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
keywords |
annotation
bioinformatics
completeness
genome
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
[![Latest Github release](https://img.shields.io/github/release/nextgenusfs/buscolite.svg)](https://github.com/nextgenusfs/buscolite/releases/latest)
![Conda](https://img.shields.io/conda/dn/bioconda/buscolite)
# BUSCOlite: simplified BUSCO analysis for gene prediction
BUSCOlite can run the miniprot/Augustus mediated genome predictions as well as the [pyhmmer](https://pyhmmer.readthedocs.io/en/stable/index.html) HMM predictions using the BUSCO v9 or v10 databases. It also provides a python API to run busco analysis from within python, ie to be used inside the eukaryotic gene prediction pipeline Funannotate.
This tool is not meant to be a replacment of BUSCO, for most general use cases you should continue to use [BUSCOv5](https://busco.ezlab.org)
BUSCO models/lineages can be downloaded from the BUSCO site: [v5](https://busco-data.ezlab.org/v5/data/lineages/), [v4](https://busco-data.ezlab.org/v4/data/lineages/). BUSCOlite does not provide an internal method to do this, as it is trivial to download the lineage you need from your organism(s) by following these links.
##### There are limited dependencies with BUSCOlite:
* [augustus](https://github.com/Gaius-Augustus/Augustus) (note: many versions on conda have non-functional PPX/--proteinprofile mode)
* [miniprot](https://github.com/lh3/miniprot)
* [pyhmmer](https://pyhmmer.readthedocs.io/en/stable/index.html)
* [pyfastx](https://github.com/lmdu/pyfastx)
* [natsort](https://pypi.org/project/natsort/)
#### Why?
[Funannotate](https://github.com/nextgenusfs/funannotate) uses BUSCO to find core conserved marker genes that it uses as a basis to train several ab-initio gene predictors. When BUSCO v2 came out it was python3 only and at that time funannotate was still python2, so I modified the BUSCOv2 source code to be compatible with python2 so it could be run within funannotate. Now BUSCOv5 is the current release, that has numerous bells and whistles that funannotate does not need (no knock against bells and whistles) but the real problem is that due to the large number of dependencies associated with these extra tools is that I cannot build a conda image that includes funannotate and BUSCOv5. So I re-wrote BUSCOv2 here so that it has limited dependencies and will make it easier to incorporate as a dependency of funannotate. A side note is that the `metaeuk` method that BUSCOv5 now uses as default does not produce complete gene models, in fact the protein sequences it outputs have lowercase sequences that are actually not found in your genome at all. So for training ab-initio predictors, the `metaeuk` method is not useful -- however, it is faster to get your simple stats on "how complete is my genome assembly".
To install release versions use the pip package manager, like so:
```
python -m pip install buscolite
```
To install the most updated code in master you can run:
```
python -m pip install git+https://github.com/nextgenusfs/buscolite.git
```
Raw data
{
"_id": null,
"home_page": null,
"name": "buscolite",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.6.0",
"maintainer_email": null,
"keywords": "annotation, bioinformatics, completeness, genome",
"author": null,
"author_email": "Jon Palmer <nextgenusfs@gmail.com>",
"download_url": "https://files.pythonhosted.org/packages/43/4e/fca67ec4bb2df8eabdf74f6c21ae27c5570e7eb3821363615435f8d5f155/buscolite-24.11.3.tar.gz",
"platform": null,
"description": "[![Latest Github release](https://img.shields.io/github/release/nextgenusfs/buscolite.svg)](https://github.com/nextgenusfs/buscolite/releases/latest)\n![Conda](https://img.shields.io/conda/dn/bioconda/buscolite)\n\n# BUSCOlite: simplified BUSCO analysis for gene prediction\n\nBUSCOlite can run the miniprot/Augustus mediated genome predictions as well as the [pyhmmer](https://pyhmmer.readthedocs.io/en/stable/index.html) HMM predictions using the BUSCO v9 or v10 databases. It also provides a python API to run busco analysis from within python, ie to be used inside the eukaryotic gene prediction pipeline Funannotate.\n\nThis tool is not meant to be a replacment of BUSCO, for most general use cases you should continue to use [BUSCOv5](https://busco.ezlab.org)\n\nBUSCO models/lineages can be downloaded from the BUSCO site: [v5](https://busco-data.ezlab.org/v5/data/lineages/), [v4](https://busco-data.ezlab.org/v4/data/lineages/). BUSCOlite does not provide an internal method to do this, as it is trivial to download the lineage you need from your organism(s) by following these links.\n\n##### There are limited dependencies with BUSCOlite:\n* [augustus](https://github.com/Gaius-Augustus/Augustus) (note: many versions on conda have non-functional PPX/--proteinprofile mode)\n* [miniprot](https://github.com/lh3/miniprot)\n* [pyhmmer](https://pyhmmer.readthedocs.io/en/stable/index.html)\n* [pyfastx](https://github.com/lmdu/pyfastx)\n* [natsort](https://pypi.org/project/natsort/)\n\n#### Why?\n\n[Funannotate](https://github.com/nextgenusfs/funannotate) uses BUSCO to find core conserved marker genes that it uses as a basis to train several ab-initio gene predictors. When BUSCO v2 came out it was python3 only and at that time funannotate was still python2, so I modified the BUSCOv2 source code to be compatible with python2 so it could be run within funannotate. Now BUSCOv5 is the current release, that has numerous bells and whistles that funannotate does not need (no knock against bells and whistles) but the real problem is that due to the large number of dependencies associated with these extra tools is that I cannot build a conda image that includes funannotate and BUSCOv5. So I re-wrote BUSCOv2 here so that it has limited dependencies and will make it easier to incorporate as a dependency of funannotate. A side note is that the `metaeuk` method that BUSCOv5 now uses as default does not produce complete gene models, in fact the protein sequences it outputs have lowercase sequences that are actually not found in your genome at all. So for training ab-initio predictors, the `metaeuk` method is not useful -- however, it is faster to get your simple stats on \"how complete is my genome assembly\".\n\n\nTo install release versions use the pip package manager, like so:\n```\npython -m pip install buscolite\n```\n\nTo install the most updated code in master you can run:\n```\npython -m pip install git+https://github.com/nextgenusfs/buscolite.git\n```\n",
"bugtrack_url": null,
"license": "BSD 2-Clause License Copyright (c) 2022, Jonathan M. Palmer All rights reserved. Redistribution and use in source and binary forms, with or without modification, are permitted provided that the following conditions are met: * Redistributions of source code must retain the above copyright notice, this list of conditions and the following disclaimer. * Redistributions in binary form must reproduce the above copyright notice, this list of conditions and the following disclaimer in the documentation and/or other materials provided with the distribution. THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS \"AS IS\" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT HOLDER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.",
"summary": "busco analysis for gene predictions",
"version": "24.11.3",
"project_urls": {
"Homepage": "https://github.com/nextgenusfs/buscolite",
"Repository": "https://github.com/nextgenusfs/buscolitegit"
},
"split_keywords": [
"annotation",
" bioinformatics",
" completeness",
" genome"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "792be5d15a71b3758a42cc27fe393c5873435e33a3eb1897ff32833f2c064991",
"md5": "42b397e7272531bce00047973cf32605",
"sha256": "270c2cf199a05f4348b0dc57c773bc8d8e7254518c8a36314a818ff8713d3e21"
},
"downloads": -1,
"filename": "buscolite-24.11.3-py3-none-any.whl",
"has_sig": false,
"md5_digest": "42b397e7272531bce00047973cf32605",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6.0",
"size": 142786,
"upload_time": "2024-11-04T06:26:36",
"upload_time_iso_8601": "2024-11-04T06:26:36.599890Z",
"url": "https://files.pythonhosted.org/packages/79/2b/e5d15a71b3758a42cc27fe393c5873435e33a3eb1897ff32833f2c064991/buscolite-24.11.3-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "434efca67ec4bb2df8eabdf74f6c21ae27c5570e7eb3821363615435f8d5f155",
"md5": "02bc61fba0d780d91ee91cf62033fd35",
"sha256": "f7ef8151b2dd16848ec8bf7e14658a81db3fbe7898e6377a2bb0553cf065f47b"
},
"downloads": -1,
"filename": "buscolite-24.11.3.tar.gz",
"has_sig": false,
"md5_digest": "02bc61fba0d780d91ee91cf62033fd35",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.6.0",
"size": 132241,
"upload_time": "2024-11-04T06:26:38",
"upload_time_iso_8601": "2024-11-04T06:26:38.591447Z",
"url": "https://files.pythonhosted.org/packages/43/4e/fca67ec4bb2df8eabdf74f6c21ae27c5570e7eb3821363615435f8d5f155/buscolite-24.11.3.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-11-04 06:26:38",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "nextgenusfs",
"github_project": "buscolite",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "buscolite"
}