panno

Name	panno JSON
Version	0.3.1 JSON
	download
home_page	https://github.com/PreMedKB/PAnno
Summary	PAnno is a Pharmacogenomics Annotation tool for clinical genomic testing.
upload_time	2022-12-30 14:51:57
maintainer
docs_url	None
author	Yaqing Liu
requires_python	>=3.7
license
keywords	pharmacogenomics pharmacology drug responses genomics bioinformatics
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI
coveralls test coverage	No coveralls.

            <p align="left" margin-bottom="-2rem"> <img src="https://raw.githubusercontent.com/premedkb/panno/main/docs/images/panno_logo.png" width="40%"/> </p>

## PAnno: A Pharmacogenomics Annotation Tool for Clinical Genomic Testing

![PyPI](https://img.shields.io/pypi/v/panno?color=pink)  ![Conda](https://img.shields.io/conda/v/lyaqing/panno?color=blue&label=conda) ![AppVeyor](https://img.shields.io/appveyor/build/PreMedKB/PAnno)

PAnno reports **prescribing recommendations** and **drug response phenotypes** by parsing the germline variant call format (VCF) file from NGS and the population to which the individual belongs.

## Installation

*Prerequisite: To ensure smooth installation and usage, [Python >= 3.7](https://docs.conda.io/en/latest/miniconda.html#system-requirements) (#1 and #3 below), or [Miniconda/Anaconda](https://docs.conda.io/en/latest/miniconda.html#system-requirements) (#2 below) are required.*

1. You can install PAnno from [PyPI](https://pypi.org/project/panno/) using pip as follows:
```Shell
pip install panno==0.3.1
```

2. Alternatively, you can create a environment using [Conda](https://anaconda.org/lyaqing/panno).
```Shell
conda create -n PAnno panno=0.3.1 -c lyaqing -c conda-forge -c bioconda
conda activate PAnno
```

3. If you would like the development version instead, the command is:
```Shell
pip install --upgrade --force-reinstall git+https://github.com/PreMedKB/PAnno.git
# Or download first and install later
git clone https://github.com/PreMedKB/PAnno.git; pip install PAnno
```

## Usage
Once installed, you can use PAnno by navigating to your VCF file and entering the corresponding three-letter abbreviation of the population:

```Shell
panno -s sample_id -i germline_vcf -p population -o outdir
```

* Required arguments
```Shell
-s, --sample_id TEXT            Sample ID that will be displayed in the PAnno report.

-i, --germline_vcf TEXT         Unannotated VCF file, preferably germline variant.

-p, --population [AAC|AME|EAS|EUR|LAT|NEA|OCE|SAS|SSA]
                                The three-letter abbreviation for biogeographic groups:
                                AAC (African American/Afro-Caribbean), AME (American),
                                EAS (East Asian), EUR (European), LAT (Latino),
                                NEA (Near Eastern), OCE (Oceanian),
                                SAS (Central/South Asian), SSA (Sub-Saharan African).

-o, --outdir TEXT               Create report in the specified output path.
```

### Input data
#### 1. Germline VCF file

PAnno directly uses the NGS-derived germline VCF file as input and assumes it has undergone quality control. Therefore, if the VCF file is of poor quality, inaccurate diplotypes and inappropriate clinical recommendations may be reported.

PAnno requires the VCF file aligned to the GRCh38 reference genome given the increasing generality and the built-in diplotype definition dependency version.


#### 2. Population
There are nine biogeographic groups supported by PAnno. Please use the ***three-letter abbreviation*** as input. This is to prevent errors caused by special symbols such as spaces.

**AAC** (African American/Afro-Caribbean), **AME** (American), **EAS** (East Asian), **EUR** (European), **LAT** (Latino), **NEA** (Near Eastern), **OCE** (Oceanian), **SAS** (Central/South Asian), **SSA** (Sub-Saharan African).

More information is available at https://www.pharmgkb.org/page/biogeographicalGroups.

### Output data

The report is created in `${sample_id}.html` at the `outdir` by default.

For more detailed instructions, run `panno -h`.

## Examples

The `demo` directory contains the VCF files and PAnno reports of four Coriell samples: NA10859 (European), NA19147 (African American/Afro-Caribbean), NA19785 (Latino), and HG00436 (East Asian).

In addition, we analyzed the germline variants of 88 samples which have been characterized in the GeT-RM PGx studies.

* The VCF files are available at https://github.com/PreMedKB/PAnno-analysis/tree/main/vcf.
* The PAnno reports are available at https://github.com/PreMedKB/PAnno-analysis/tree/main/report.

Here is a snapshot from the PAnno report:
<p align="center">
<img src="https://raw.githubusercontent.com/premedkb/panno/main/docs/images/panno_report.png" width="100%" />
</p>

## Core Components
A ranking model dedicated to inferring diplotypes, developed based on the **allele (haplotype) definition** and **population frequency**, was introduced in PAnno. The predictive performance was validated in comparison with four similar tools using the consensus diplotype data of the Genetic Testing Reference Materials Coordination Program (GeT-RM) as ground truth.

An annotation method was proposed to summarize prescriptions and classify drugs into **avoid use**, **use with caution**, and **routine use**, following the recommendations of the Clinical Pharmacogenetics Implementation Consortium (CPIC), etc. It further predicts phenotypes of specific drugs in terms of toxicity, dosage, efficacy, and metabolism by integrating the high-confidence clinical annotations in the Pharmacogenomics Knowledgebase (PharmGKB).

<p align="center">
<img src="https://raw.githubusercontent.com/premedkb/panno/main/docs/images/architecture.png" width="70%" />
</p>

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/PreMedKB/PAnno",
    "name": "panno",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": "",
    "keywords": "pharmacogenomics,pharmacology,drug responses,genomics,bioinformatics",
    "author": "Yaqing Liu",
    "author_email": "yaqing.liu@outlook.com",
    "download_url": "https://files.pythonhosted.org/packages/f9/f5/3fe575624954a4302a5d2b38fea3c16661b65a930b93de4479055f1d718e/panno-0.3.1.tar.gz",
    "platform": null,
    "description": "<p align=\"left\" margin-bottom=\"-2rem\"> <img src=\"https://raw.githubusercontent.com/premedkb/panno/main/docs/images/panno_logo.png\" width=\"40%\"/> </p>\n\n## PAnno: A Pharmacogenomics Annotation Tool for Clinical Genomic Testing\n\n![PyPI](https://img.shields.io/pypi/v/panno?color=pink)  ![Conda](https://img.shields.io/conda/v/lyaqing/panno?color=blue&label=conda) ![AppVeyor](https://img.shields.io/appveyor/build/PreMedKB/PAnno)\n\nPAnno reports **prescribing recommendations** and **drug response phenotypes** by parsing the germline variant call format (VCF) file from NGS and the population to which the individual belongs.\n\n## Installation\n\n*Prerequisite: To ensure smooth installation and usage, [Python >= 3.7](https://docs.conda.io/en/latest/miniconda.html#system-requirements) (#1 and #3 below), or [Miniconda/Anaconda](https://docs.conda.io/en/latest/miniconda.html#system-requirements) (#2 below) are required.*\n\n1. You can install PAnno from [PyPI](https://pypi.org/project/panno/) using pip as follows:\n```Shell\npip install panno==0.3.1\n```\n\n2. Alternatively, you can create a environment using [Conda](https://anaconda.org/lyaqing/panno).\n```Shell\nconda create -n PAnno panno=0.3.1 -c lyaqing -c conda-forge -c bioconda\nconda activate PAnno\n```\n\n3. If you would like the development version instead, the command is:\n```Shell\npip install --upgrade --force-reinstall git+https://github.com/PreMedKB/PAnno.git\n# Or download first and install later\ngit clone https://github.com/PreMedKB/PAnno.git; pip install PAnno\n```\n\n## Usage\nOnce installed, you can use PAnno by navigating to your VCF file and entering the corresponding three-letter abbreviation of the population:\n\n```Shell\npanno -s sample_id -i germline_vcf -p population -o outdir\n```\n\n* Required arguments\n```Shell\n-s, --sample_id TEXT            Sample ID that will be displayed in the PAnno report.\n\n-i, --germline_vcf TEXT         Unannotated VCF file, preferably germline variant.\n\n-p, --population [AAC|AME|EAS|EUR|LAT|NEA|OCE|SAS|SSA]\n                                The three-letter abbreviation for biogeographic groups:\n                                AAC (African American/Afro-Caribbean), AME (American),\n                                EAS (East Asian), EUR (European), LAT (Latino),\n                                NEA (Near Eastern), OCE (Oceanian),\n                                SAS (Central/South Asian), SSA (Sub-Saharan African).\n\n-o, --outdir TEXT               Create report in the specified output path.\n```\n\n### Input data\n#### 1. Germline VCF file\n\nPAnno directly uses the NGS-derived germline VCF file as input and assumes it has undergone quality control. Therefore, if the VCF file is of poor quality, inaccurate diplotypes and inappropriate clinical recommendations may be reported.\n\nPAnno requires the VCF file aligned to the GRCh38 reference genome given the increasing generality and the built-in diplotype definition dependency version.\n\n\n#### 2. Population\nThere are nine biogeographic groups supported by PAnno. Please use the ***three-letter abbreviation*** as input. This is to prevent errors caused by special symbols such as spaces.\n\n**AAC** (African American/Afro-Caribbean), **AME** (American), **EAS** (East Asian), **EUR** (European), **LAT** (Latino), **NEA** (Near Eastern), **OCE** (Oceanian), **SAS** (Central/South Asian), **SSA** (Sub-Saharan African).\n\nMore information is available at https://www.pharmgkb.org/page/biogeographicalGroups.\n\n### Output data\n\nThe report is created in `${sample_id}.html` at the `outdir` by default.\n\nFor more detailed instructions, run `panno -h`.\n\n## Examples\n\nThe `demo` directory contains the VCF files and PAnno reports of four Coriell samples: NA10859 (European), NA19147 (African American/Afro-Caribbean), NA19785 (Latino), and HG00436 (East Asian).\n\nIn addition, we analyzed the germline variants of 88 samples which have been characterized in the GeT-RM PGx studies.\n\n* The VCF files are available at https://github.com/PreMedKB/PAnno-analysis/tree/main/vcf.\n* The PAnno reports are available at https://github.com/PreMedKB/PAnno-analysis/tree/main/report.\n\nHere is a snapshot from the PAnno report:\n<p align=\"center\">\n<img src=\"https://raw.githubusercontent.com/premedkb/panno/main/docs/images/panno_report.png\" width=\"100%\" />\n</p>\n\n## Core Components\nA ranking model dedicated to inferring diplotypes, developed based on the **allele (haplotype) definition** and **population frequency**, was introduced in PAnno. The predictive performance was validated in comparison with four similar tools using the consensus diplotype data of the Genetic Testing Reference Materials Coordination Program (GeT-RM) as ground truth.\n\nAn annotation method was proposed to summarize prescriptions and classify drugs into **avoid use**, **use with caution**, and **routine use**, following the recommendations of the Clinical Pharmacogenetics Implementation Consortium (CPIC), etc. It further predicts phenotypes of specific drugs in terms of toxicity, dosage, efficacy, and metabolism by integrating the high-confidence clinical annotations in the Pharmacogenomics Knowledgebase (PharmGKB).\n\n<p align=\"center\">\n<img src=\"https://raw.githubusercontent.com/premedkb/panno/main/docs/images/architecture.png\" width=\"70%\" />\n</p>\n\n\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "PAnno is a Pharmacogenomics Annotation tool for clinical genomic testing.",
    "version": "0.3.1",
    "split_keywords": [
        "pharmacogenomics",
        "pharmacology",
        "drug responses",
        "genomics",
        "bioinformatics"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "md5": "f6cddfc4e9ae26158c99620d3c16a208",
                "sha256": "c35d4e8b525a6c444bac0f7b427868bb434f4c034d5a352e00f8caabeb9ced81"
            },
            "downloads": -1,
            "filename": "panno-0.3.1.tar.gz",
            "has_sig": false,
            "md5_digest": "f6cddfc4e9ae26158c99620d3c16a208",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 9534138,
            "upload_time": "2022-12-30T14:51:57",
            "upload_time_iso_8601": "2022-12-30T14:51:57.641058Z",
            "url": "https://files.pythonhosted.org/packages/f9/f5/3fe575624954a4302a5d2b38fea3c16661b65a930b93de4479055f1d718e/panno-0.3.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2022-12-30 14:51:57",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "github_user": "PreMedKB",
    "github_project": "PAnno",
    "travis_ci": true,
    "coveralls": false,
    "github_actions": false,
    "requirements": [],
    "tox": true,
    "lcname": "panno"
}

Yaqing Liu