conodictor


* Name: conodictor
* Version: 2.3.6
* Home page: https://github.com/koualab/conodictor
* Summary: Prediction and classification of conopeptides
* Upload time: 2024-02-23 20:40:13
* Maintainer: Anicet Ebou
* Author: Anicet Ebou
* Requires Python: >=3.8,<4.0
* License: GPL-3.0-only
* Keywords: bioinformatics, classification, prediction, conopeptides

[![PyPI](https://img.shields.io/pypi/v/conodictor.svg)](https://pypi.org/project/conodictor)
[![Wheel](https://img.shields.io/pypi/wheel/conodictor.svg)](https://pypi.org/project/conodictor)
[![Language](https://img.shields.io/pypi/implementation/conodictor)](https://pypi.org/project/conodictor)
[![Pyver](https://img.shields.io/pypi/pyversions/conodictor.svg)](https://pypi.org/project/conodictor)
[![Downloads](https://img.shields.io/pypi/dm/conodictor)](https://pypi.org/project/conodictor)
[![Docker](https://img.shields.io/docker/pulls/ebedthan/conodictor.svg)]()
[![License: GPL v3](https://img.shields.io/badge/License-GPLv3-blue.svg)](https://www.gnu.org/licenses/gpl-3.0)


## ConoDictor: A fast and accurate prediction and classification tool for conopeptides


### Important
If you are using conodictor and run into a problem like the [CONODB issue](https://github.com/koualab/conodictor/issues/18), please update to v2.3.6, which provides a fix.

conodictor v2.3.6 introduces the `-d` option to specify the path to the db folder containing the HMM and PSSM files used for classification.

This is a temporary solution while I am working on the next big release. Thanks.

### Introduction

Cone snails are among the richest sources of natural peptides with promising pharmacological and therapeutic applications. With the reduced costs of RNAseq, scientists now heavily rely on venom gland transcriptomes for the mining of novel bioactive conopeptides, but the bioinformatic analyses often hamper the discovery process.

ConoDictor 2 is a standalone and user-friendly command-line program. We have updated the program, originally published as a web server 10 years ago, with newer tools and algorithms, and improved our classification models with new and higher-quality sequences. ConoDictor 2 is now more accurate, faster, multiplatform, and able to deal with a whole cone snail venom gland transcriptome (raw reads or contigs) in a very short time.

The only input ConoDictor 2 requires is the assembled transcriptome or the raw reads file, as either DNA or amino acid sequences; the alphabet is recognized automatically. ConoDictor 2 runs predictions directly on the protein file (submitted or dynamically generated) and tries to report the longest conopeptide precursor-like sequence.
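
To illustrate what automatic alphabet recognition can look like, here is a minimal sketch of one common heuristic: treat a sequence as DNA when almost all of its letters are nucleotide codes. This is not ConoDictor's actual implementation; the function name `guess_alphabet` and the 0.9 threshold are assumptions made for the example.

```python
def guess_alphabet(sequence: str, threshold: float = 0.9) -> str:
    """Return "dna" if at least `threshold` of the letters are nucleotide codes,
    otherwise "protein". Illustrative heuristic only."""
    letters = [c for c in sequence.upper() if c.isalpha()]
    if not letters:
        raise ValueError("sequence contains no letters")
    # A, C, G, T/U plus the ambiguity code N are counted as nucleotide letters.
    nucleotide = sum(c in "ACGTUN" for c in letters)
    return "dna" if nucleotide / len(letters) >= threshold else "protein"


if __name__ == "__main__":
    print(guess_alphabet("ATGCGTNNACGT"))
    print(guess_alphabet("MKLTCVVIVAVLLLTACQLITA"))
```

A frequency threshold rather than a strict character check keeps the guess robust to occasional ambiguity codes in real reads.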

### Installation

#### Install from Pip

You must first install [HMMER 3](https://hmmer.org) and [Pftools](https://github.com/sib-swiss/pftools3) to be able to run conodictor.

```bash
pip install conodictor
```
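
Before running conodictor, it can help to confirm that both external dependencies are reachable on `PATH`. The sketch below is not part of ConoDictor. It assumes the relevant executables are named `hmmsearch` (shipped with HMMER 3) and `pfscanV3` (shipped with pftools3); adjust the names if your installation differs. `missing_tools` is a hypothetical helper.

```python
import shutil
from typing import Callable, Iterable, List, Optional

# Assumed executable names: `hmmsearch` ships with HMMER 3 and `pfscanV3`
# with pftools3; change them if your installation uses different names.
REQUIRED_TOOLS = ("hmmsearch", "pfscanV3")


def missing_tools(tools: Iterable[str] = REQUIRED_TOOLS,
                  which: Callable[[str], Optional[str]] = shutil.which) -> List[str]:
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("HMMER 3 and pftools executables found.")
```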

#### Using containers

#### Docker

Accessible at https://hub.docker.com/u/ebedthan or on [BioContainers](https://github.com/BioContainers/containers/tree/master/conodictor/2.2.2).


```bash
docker pull ebedthan/conodictor:latest
docker run ebedthan/conodictor:latest conodictor -h
```

Example of a run:

```bash
# Mount the current directory as /data and run as the current user
docker run --rm -v "$PWD":/data -u "$(id -u):$(id -g)" ebedthan/conodictor:latest conodictor --out /data/outdir /data/input.fa.gz
```

See https://staph-b.github.io/docker-builds/run_containers/ for more information on how to properly run a Docker container.


#### Singularity

The Singularity container does not need admin privileges, making it
suitable for university clusters and HPC.

```bash
singularity build conodictor.sif docker://ebedthan/conodictor:latest
singularity exec conodictor.sif conodictor -h
```


#### Install from source

```bash
# Download ConoDictor development version
git clone https://github.com/koualab/conodictor.git conodictor

# Navigate to directory
cd conodictor

# Install with poetry: see https://python-poetry.org
# (--no-dev is deprecated since Poetry 1.2; newer versions use --without dev)
poetry install --no-dev

# Enter the Python virtual environment with
poetry shell

# Test conodictor is correctly installed
conodictor -h
```

If you do not want to go into the virtual environment just do:

```bash
poetry run conodictor -h
```

## Test

* Type `conodictor -h` and it should output something like:

```
usage: conodictor [options] <FILE>

optional arguments:
  -o DIR, --out DIR   output result to DIR [ConoDictor]
  --mlen INT          minimum length of sequences to be considered [off]
  --ndup INT          minimum occurence sequences to be considered [off]
  --faa               dump a fasta file of matched sequences [false]
  --filter            only keep sequences matching sig, pro and mat regions [false]
  -a, --all           add unclassified sequences in result [false]
  -j INT, --cpus INT  number of threads [1]
  --force             re-use output directory [false]
  -q, --quiet         decrease program verbosity
  -v, --version       show program's version number and exit
  -h, --help          show this help message and exit

Citation: Koua et al., 2021, Bioinformatics Advances
```


## Invoking conodictor

```bash
conodictor file.fa.gz
conodictor --out outfolder --cpus 4 --mlen 51 file.fa
```
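
When several input files need processing, the invocations above can be scripted. The following is a minimal sketch, not part of ConoDictor: `build_command` and `run_all` are hypothetical helpers that only assemble argument lists from options documented in this README (`--out`, `--cpus`, `--mlen`, and the `-d` option introduced in v2.3.6). The one-folder-per-input layout and all file names are assumptions.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_command(fasta: str, out_dir: str, cpus: int = 1,
                  mlen: Optional[int] = None,
                  db_dir: Optional[str] = None) -> List[str]:
    """Assemble a conodictor argument list from documented options only."""
    cmd = ["conodictor", "--out", out_dir, "--cpus", str(cpus)]
    if mlen is not None:
        cmd += ["--mlen", str(mlen)]
    if db_dir is not None:
        cmd += ["-d", db_dir]
    cmd.append(fasta)
    return cmd


def run_all(fasta_files: List[str], out_root: str, **options) -> None:
    """Run conodictor once per input file, each into its own output folder."""
    Path(out_root).mkdir(parents=True, exist_ok=True)
    for fasta in fasta_files:
        # Assumed layout: one sub-folder per input, named after the file stem.
        out_dir = str(Path(out_root) / Path(fasta).name.split(".")[0])
        subprocess.run(build_command(fasta, out_dir, **options), check=True)


if __name__ == "__main__":
    print(build_command("file.fa", "outfolder", cpus=4, mlen=51))
```

Passing the command as a list avoids shell quoting problems with paths that contain spaces, and `check=True` stops the batch at the first failing run.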
  

## Output files

The comma-separated values file `summary.csv` can be viewed with any office suite
or text editor.

```csv
sequence,hmm_pred,pssm_pred,definitive_pred
SEQ_ID_1,A,A,A
SEQ_ID_2,B,D,CONFLICT B and D
SEQ_ID_3,O1,O1,O1
...

```
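
The summary file can also be post-processed programmatically. The sketch below uses only the Python standard library and assumes the four columns are named `sequence`, `hmm_pred`, `pssm_pred` and `definitive_pred`, and that disagreements are reported with a `CONFLICT` prefix as in the sample rows above. `summarize` is a hypothetical helper, and `SAMPLE` simply repeats the example rows.

```python
import csv
import io
from collections import Counter
from typing import Dict, List, Tuple


def summarize(handle) -> Tuple[Counter, List[Dict[str, str]]]:
    """Count definitive predictions and collect rows flagged as CONFLICT."""
    counts: Counter = Counter()
    conflicts = []
    for row in csv.DictReader(handle):
        prediction = row["definitive_pred"]
        if prediction.startswith("CONFLICT"):
            conflicts.append(row)
        else:
            counts[prediction] += 1
    return counts, conflicts


SAMPLE = """sequence,hmm_pred,pssm_pred,definitive_pred
SEQ_ID_1,A,A,A
SEQ_ID_2,B,D,CONFLICT B and D
SEQ_ID_3,O1,O1,O1
"""

if __name__ == "__main__":
    counts, conflicts = summarize(io.StringIO(SAMPLE))
    print(dict(counts))
    print([row["sequence"] for row in conflicts])
```

For a real run, open the file with `open(path, newline="")` and pass the handle to `summarize`; the path depends on the output folder chosen with `--out`.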

## Citation

When using ConoDictor 2 in your work, please cite:

Dominique Koua, Anicet Ebou, Sébastien Dutertre, Improved prediction of conopeptide superfamilies with ConoDictor 2.0, Bioinformatics Advances, Volume 1, Issue 1, 2021, vbab011, https://doi.org/10.1093/bioadv/vbab011.
  
## Bugs

Submit problems or requests to the [Issue Tracker](https://github.com/koualab/conodictor/issues).


## Dependencies

### Mandatory

* [**HMMER 3**](https://hmmer.org)  
  Used for HMM profile prediction.   
  *Eddy SR, Accelerated Profile HMM Searches. PLOS Computational Biology 2011, 10.1371/journal.pcbi.1002195*

* [**Pftools**](https://github.com/sib-swiss/pftools3)  
  Used for PSSM prediction.    
  *Schuepbach P et al. pfsearchV3: a code acceleration and heuristic to search PROSITE profiles. Bioinformatics 2013, 10.1093/bioinformatics/btt129*


## Licence

[GPL v3](https://github.com/koualab/conodictor/blob/main/LICENSE).

For commercial uses please contact Dominique Koua at dominique.koua@inphb.ci.

## Authors

* [Anicet Ebou](https://orcid.org/0000-0003-4005-177X)
* [Dominique Koua](https://www.researchgate.net/profile/Dominique_Koua)
            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/koualab/conodictor",
    "name": "conodictor",
    "maintainer": "Anicet Ebou",
    "docs_url": null,
    "requires_python": ">=3.8,<4.0",
    "maintainer_email": "anicet.ebou@gmail.com",
    "keywords": "bioinformatics,classification,prediction,conopeptides",
    "author": "Anicet Ebou",
    "author_email": "anicet.ebou@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/6a/ba/90afe0488c950af6d2930c89eb8c0d3364eeb51e3ec0e954832efeea9eaa/conodictor-2.3.6.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "GPL-3.0-only",
    "summary": "Prediction and classification of conopeptides",
    "version": "2.3.6",
    "project_urls": {
        "Documentation": "https://github.com/koualab/conodictor",
        "Homepage": "https://github.com/koualab/conodictor",
        "Repository": "https://github.com/koualab/conodictor"
    },
    "split_keywords": [
        "bioinformatics",
        "classification",
        "prediction",
        "conopeptides"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "23f5b5cc924ef651118967f4193dec2e633033274a08feabbced8a680588b0ef",
                "md5": "bfafe8e2e9bbf63fb3f73a62ec7cd24f",
                "sha256": "4d399ceef5250c35f70f52db63c448770f73cb15c532c1b1822037d5c60d7074"
            },
            "downloads": -1,
            "filename": "conodictor-2.3.6-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "bfafe8e2e9bbf63fb3f73a62ec7cd24f",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8,<4.0",
            "size": 273539,
            "upload_time": "2024-02-23T20:40:10",
            "upload_time_iso_8601": "2024-02-23T20:40:10.357169Z",
            "url": "https://files.pythonhosted.org/packages/23/f5/b5cc924ef651118967f4193dec2e633033274a08feabbced8a680588b0ef/conodictor-2.3.6-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "6aba90afe0488c950af6d2930c89eb8c0d3364eeb51e3ec0e954832efeea9eaa",
                "md5": "a55cca4705e51126ac590f90be7c7365",
                "sha256": "c26a99f2263ca2c6c97b1eb74b5bcdd5e5cbad696ff8332e3307f76f35b3a726"
            },
            "downloads": -1,
            "filename": "conodictor-2.3.6.tar.gz",
            "has_sig": false,
            "md5_digest": "a55cca4705e51126ac590f90be7c7365",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8,<4.0",
            "size": 270183,
            "upload_time": "2024-02-23T20:40:13",
            "upload_time_iso_8601": "2024-02-23T20:40:13.416175Z",
            "url": "https://files.pythonhosted.org/packages/6a/ba/90afe0488c950af6d2930c89eb8c0d3364eeb51e3ec0e954832efeea9eaa/conodictor-2.3.6.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-02-23 20:40:13",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "koualab",
    "github_project": "conodictor",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "conodictor"
}
        