# ConoDictor
*A fast and accurate prediction and classification tool for conopeptides*
[![PyPI](https://img.shields.io/pypi/v/conodictor.svg)](https://pypi.org/project/conodictor)
[![Wheel](https://img.shields.io/pypi/wheel/conodictor.svg)](https://pypi.org/project/conodictor)
[![Language](https://img.shields.io/pypi/implementation/conodictor)](https://pypi.org/project/conodictor)
[![Pyver](https://img.shields.io/pypi/pyversions/conodictor.svg)](https://pypi.org/project/conodictor)
[![Downloads](https://img.shields.io/pypi/dm/conodictor)](https://pypi.org/project/conodictor)
[![Docker](https://img.shields.io/docker/pulls/ebedthan/conodictor.svg)]()
[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)
## πΊοΈ Overview
### Unlocking the Potential of Cone Snail Venom
Cone snails are a treasure trove of natural peptides with immense pharmacological and therapeutic potential. The advent of affordable RNA sequencing (RNAseq) has revolutionized the mining of novel bioactive conopeptides from venom gland transcriptomes. However, the complexity of bioinformatic analyses often impedes the discovery process.
### Introducing ConoDictor 2
ConoDictor 2 is a standalone, user-friendly command-line tool designed to streamline the discovery of conopeptides. Building on a decade-old web server, we have significantly upgraded ConoDictor with modern tools and algorithms, and enhanced our classification models using new, high-quality sequences. The result is a program that is more accurate, faster, and compatible across multiple platforms.
### Key Features
* **Enhanced Accuracy and Speed**: ConoDictor 2 processes entire venom gland transcriptomes, whether from raw reads or assembled contigs, in record time.
* **Ease of Use**: The program requires only the assembled transcriptome or raw reads file, in either DNA or amino acid format. ConoDictor 2 automatically recognizes the alphabet used.
* **Advanced Prediction Capabilities**: It runs predictions directly on the submitted or dynamically generated proteins file, aiming to identify the longest conopeptide precursor-like sequences.
### Simplified Bioinformatics for Breakthrough Discoveries
With ConoDictor 2, researchers can bypass the intricate bioinformatic challenges and focus on uncovering the next generation of bioactive peptides from cone snail venom. Its robust performance and user-centric design make it an indispensable tool in venom research and drug discovery.
## Installing
### Install from Pip
You will first have to install ~~[HMMER 3](https://hmmer.org) and~~ [Pftools](https://github.com/sib-swiss/pftools3) to be able to run conodictor (**as of version 2.4, conodictor does not need hmmer anymore as it use the wonderful [pyhmmer](https://github.com/althonos/pyhmmer) library**).
```bash
pip install conodictor
```
### Using containers
### Docker
Accessible at https://hub.docker.com/u/ebedthan or on [BioContainers](https://github.com/BioContainers/containers/tree/master/conodictor/2.2.2).
```bash
docker pull ebedthan/conodictor:latest
docker run ebedthan/conodictor:latest conodictor -h
```
Example of a run
```bash
docker run --rm=True -v $PWD:/data -u $(id -u):$(id -g) ebedthan/conodictor:latest conodictor --out /data/outdir /data/input.fa.gz
```
See https://staph-b.github.io/docker-builds/run_containers/ for more informations on how to properly run a docker container.
### Singularity
The singularity container does not need admin privileges making it
suitable for university clusters and HPC.
```bash
singularity build conodictor.sif docker://ebedthan/conodictor:latest
singularity exec conodictor.sif conodictor -h
```
### Install from source
```bash
# Download ConoDictor development version
git clone https://github.com/koualab/conodictor.git conodictor
# Navigate to directory
cd conodictor
# Install with poetry: see https://python-poetry.org
poetry install --no-dev
# Enter the Python virtual environment with
poetry shell
# Test conodictor is correctly installed
conodictor -h
```
If you do not want to go into the virtual environment just do:
```bash
poetry run conodictor -h
```
## π‘ Example
```bash
conodictor file.fa.gz
conodictor --out outfolder --cpus 4 --mlen 51 file.fa
```
## Output files
The comma separeted-values file summary.csv can be easily viewed with any office suite,
or text editor.
```csv
sequence,hmm_pred,pssm_pred definitive_pred
SEQ_ID_1,A,A,A
SEQ_ID_2,B,D,CONFLICT B and D
SEQ_ID_3,O1,O1,O1
...
```
## π Feedback
### Issue tracker
Found a bug ? Have an enhancement request ? Head over to the [GitHub issue
tracker](https://github.com/koualab/conodictor/issues) if you need to report
or ask something. If you are filing in on a bug, please include as much
information as you can about the issue, and try to recreate the same bug
in a simple, easily reproducible situation.
## βοΈ License
[GPL v3](https://github.com/koualab/conodictor/blob/main/LICENSE).
For commercial uses please contact Dominique Koua at dominique.koua@inphb.ci.
## π Citation
ConoDictor is a scientifc software, with a [published paper](https://doi.org/10.1093/bioadv/vbab011) in the [Bioinformatics Advances](https://academic.oup.com/bioinformaticsadvances) journal. Please cite this article if you are using it in an academic work, for instance as:
Koua, D., Ebou, A., & Dutertre, S. (2021). Improved prediction of conopeptide superfamilies with ConoDictor 2.0. Bioinformatics Advances, 1(1), vbab011. https://doi.org/10.1093/bioadv/vbab011
## Dependencies
* [**Pftools**](https://github.com/sib-swiss/pftools3)
Used for PSSM prediction.
*Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129*
## π References
* [**HMMER 3**](https://hmmer.org)
Used for HMM profile prediction.
*Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195*
* [**Pftools**](https://github.com/sib-swiss/pftools3)
Used for PSSM prediction.
*Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129*
## Authors
* [Anicet Ebou](https://orcid.org/0000-0003-4005-177X)
* [Dominique Koua](https://www.researchgate.net/profile/Dominique_Koua)
Raw data
{
"_id": null,
"home_page": "https://github.com/koualab/conodictor",
"name": "conodictor",
"maintainer": "Anicet Ebou",
"docs_url": null,
"requires_python": "<4.0,>=3.8",
"maintainer_email": "anicet.ebou@gmail.com",
"keywords": "bioinformatics, classification, prediction, conopeptides",
"author": "Anicet Ebou",
"author_email": "anicet.ebou@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/dd/85/bb956ad7e80a68110dff16c0e4f127d21153ecc7f25cbd22e410f4ee6c6c/conodictor-2.4.1.tar.gz",
"platform": null,
"description": "# ConoDictor\n\n*A fast and accurate prediction and classification tool for conopeptides*\n\n[![PyPI](https://img.shields.io/pypi/v/conodictor.svg)](https://pypi.org/project/conodictor)\n[![Wheel](https://img.shields.io/pypi/wheel/conodictor.svg)](https://pypi.org/project/conodictor)\n[![Language](https://img.shields.io/pypi/implementation/conodictor)](https://pypi.org/project/conodictor)\n[![Pyver](https://img.shields.io/pypi/pyversions/conodictor.svg)](https://pypi.org/project/conodictor)\n[![Downloads](https://img.shields.io/pypi/dm/conodictor)](https://pypi.org/project/conodictor)\n[![Docker](https://img.shields.io/docker/pulls/ebedthan/conodictor.svg)]()\n[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)\n\n\n## \ud83d\uddfa\ufe0f Overview\n### Unlocking the Potential of Cone Snail Venom\nCone snails are a treasure trove of natural peptides with immense pharmacological and therapeutic potential. The advent of affordable RNA sequencing (RNAseq) has revolutionized the mining of novel bioactive conopeptides from venom gland transcriptomes. However, the complexity of bioinformatic analyses often impedes the discovery process.\n\n### Introducing ConoDictor 2\nConoDictor 2 is a standalone, user-friendly command-line tool designed to streamline the discovery of conopeptides. Building on a decade-old web server, we have significantly upgraded ConoDictor with modern tools and algorithms, and enhanced our classification models using new, high-quality sequences. The result is a program that is more accurate, faster, and compatible across multiple platforms.\n\n### Key Features\n* **Enhanced Accuracy and Speed**: ConoDictor 2 processes entire venom gland transcriptomes, whether from raw reads or assembled contigs, in record time.\n* **Ease of Use**: The program requires only the assembled transcriptome or raw reads file, in either DNA or amino acid format. ConoDictor 2 automatically recognizes the alphabet used.\n* **Advanced Prediction Capabilities**: It runs predictions directly on the submitted or dynamically generated proteins file, aiming to identify the longest conopeptide precursor-like sequences.\n\n### Simplified Bioinformatics for Breakthrough Discoveries\nWith ConoDictor 2, researchers can bypass the intricate bioinformatic challenges and focus on uncovering the next generation of bioactive peptides from cone snail venom. Its robust performance and user-centric design make it an indispensable tool in venom research and drug discovery.\n\n## Installing\n\n### Install from Pip\n\nYou will first have to install ~~[HMMER 3](https://hmmer.org) and~~ [Pftools](https://github.com/sib-swiss/pftools3) to be able to run conodictor (**as of version 2.4, conodictor does not need hmmer anymore as it use the wonderful [pyhmmer](https://github.com/althonos/pyhmmer) library**).\n\n```bash\npip install conodictor\n```\n\n### Using containers\n\n### Docker\n\nAccessible at https://hub.docker.com/u/ebedthan or on [BioContainers](https://github.com/BioContainers/containers/tree/master/conodictor/2.2.2).\n\n\n```bash\ndocker pull ebedthan/conodictor:latest\ndocker run ebedthan/conodictor:latest conodictor -h\n```\n\nExample of a run\n\n```bash\ndocker run --rm=True -v $PWD:/data -u $(id -u):$(id -g) ebedthan/conodictor:latest conodictor --out /data/outdir /data/input.fa.gz\n```\n\nSee https://staph-b.github.io/docker-builds/run_containers/ for more informations on how to properly run a docker container.\n\n\n### Singularity\n\nThe singularity container does not need admin privileges making it\nsuitable for university clusters and HPC.\n\n```bash\nsingularity build conodictor.sif docker://ebedthan/conodictor:latest\nsingularity exec conodictor.sif conodictor -h\n```\n\n\n### Install from source\n\n```bash\n# Download ConoDictor development version\ngit clone https://github.com/koualab/conodictor.git conodictor\n\n# Navigate to directory\ncd conodictor\n\n# Install with poetry: see https://python-poetry.org\npoetry install --no-dev\n\n# Enter the Python virtual environment with\npoetry shell\n\n# Test conodictor is correctly installed\nconodictor -h\n```\n\nIf you do not want to go into the virtual environment just do:\n\n```bash\npoetry run conodictor -h\n```\n\n\n## \ud83d\udca1 Example\n\n```bash\nconodictor file.fa.gz\nconodictor --out outfolder --cpus 4 --mlen 51 file.fa\n```\n\n\n## Output files\n\nThe comma separeted-values file summary.csv can be easily viewed with any office suite,\nor text editor.\n\n```csv\nsequence,hmm_pred,pssm_pred definitive_pred\nSEQ_ID_1,A,A,A\nSEQ_ID_2,B,D,CONFLICT B and D\nSEQ_ID_3,O1,O1,O1\n...\n\n```\n\n## \ud83d\udcad Feedback\n\n### Issue tracker\n\nFound a bug ? Have an enhancement request ? Head over to the [GitHub issue\ntracker](https://github.com/koualab/conodictor/issues) if you need to report\nor ask something. If you are filing in on a bug, please include as much\ninformation as you can about the issue, and try to recreate the same bug\nin a simple, easily reproducible situation.\n\n## \u2696\ufe0f License\n\n[GPL v3](https://github.com/koualab/conodictor/blob/main/LICENSE).\n\nFor commercial uses please contact Dominique Koua at dominique.koua@inphb.ci.\n\n## \ud83d\udd16 Citation\n\nConoDictor is a scientifc software, with a [published paper](https://doi.org/10.1093/bioadv/vbab011) in the [Bioinformatics Advances](https://academic.oup.com/bioinformaticsadvances) journal. Please cite this article if you are using it in an academic work, for instance as: \nKoua, D., Ebou, A., & Dutertre, S. (2021). Improved prediction of conopeptide superfamilies with ConoDictor 2.0. Bioinformatics Advances, 1(1), vbab011. https://doi.org/10.1093/bioadv/vbab011\n\n\n## Dependencies\n\n* [**Pftools**](https://github.com/sib-swiss/pftools3) \n Used for PSSM prediction. \n *Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129*\n\n\n## \ud83d\udcda References\n\n* [**HMMER 3**](https://hmmer.org) \n Used for HMM profile prediction. \n *Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195*\n\n* [**Pftools**](https://github.com/sib-swiss/pftools3) \n Used for PSSM prediction. \n *Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129*\n\n\n## Authors\n\n* [Anicet Ebou](https://orcid.org/0000-0003-4005-177X)\n* [Dominique Koua](https://www.researchgate.net/profile/Dominique_Koua)",
"bugtrack_url": null,
"license": "GPL-3.0-only",
"summary": "Prediction and classification of conopeptides",
"version": "2.4.1",
"project_urls": {
"Documentation": "https://github.com/koualab/conodictor",
"Homepage": "https://github.com/koualab/conodictor",
"Repository": "https://github.com/koualab/conodictor"
},
"split_keywords": [
"bioinformatics",
" classification",
" prediction",
" conopeptides"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "2bbb123622fbb072a35a559ba25bb092983df758d2cfc051e6a72b12c2b2297a",
"md5": "5a3fe54e2f121d7075ea6b721e44911a",
"sha256": "bbfbdc9420b4bee4df6608ff173b5246c08363ae33d0cbe642b0c47ddb7e10fd"
},
"downloads": -1,
"filename": "conodictor-2.4.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "5a3fe54e2f121d7075ea6b721e44911a",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": "<4.0,>=3.8",
"size": 274268,
"upload_time": "2024-07-07T00:17:21",
"upload_time_iso_8601": "2024-07-07T00:17:21.800978Z",
"url": "https://files.pythonhosted.org/packages/2b/bb/123622fbb072a35a559ba25bb092983df758d2cfc051e6a72b12c2b2297a/conodictor-2.4.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "dd85bb956ad7e80a68110dff16c0e4f127d21153ecc7f25cbd22e410f4ee6c6c",
"md5": "1f70341811e0adbc778c450f186986bc",
"sha256": "e73de035a1ae8c593be53efa818c749b54da00bde1c9243ac7214fabc4032061"
},
"downloads": -1,
"filename": "conodictor-2.4.1.tar.gz",
"has_sig": false,
"md5_digest": "1f70341811e0adbc778c450f186986bc",
"packagetype": "sdist",
"python_version": "source",
"requires_python": "<4.0,>=3.8",
"size": 271871,
"upload_time": "2024-07-07T00:17:25",
"upload_time_iso_8601": "2024-07-07T00:17:25.567338Z",
"url": "https://files.pythonhosted.org/packages/dd/85/bb956ad7e80a68110dff16c0e4f127d21153ecc7f25cbd22e410f4ee6c6c/conodictor-2.4.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-07-07 00:17:25",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "koualab",
"github_project": "conodictor",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "conodictor"
}