vrs-anvil-toolkit

Name	vrs-anvil-toolkit JSON
Version	0.3.1 JSON
	download
home_page	None
Summary	Useful tools and methods for applying GA4GH GKS and VRS models on the NHGRI AnVIL platform.
upload_time	2025-03-03 21:11:05
maintainer	None
docs_url	None
author	GKS-AnVIL
requires_python	>=3.10
license	MIT License Copyright (c) 2024 Oregon Health and Science University Computational Biology Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
keywords
VCS
bugtrack_url
requirements	ga4gh.vrs diskcache biocommons.seqrepo glom click pyyaml google requests boto3 tqdm google-cloud-storage psutil setuptools firecloud ga4gh.va_spec pandas vrsix
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

# VRS AnVIL Toolkit

## Project Overview

This Python package is designed to process Variant Call Format (VCF) files and perform lookup operations on Genomic Variation Representation Service (GA4GH VRS) identifiers. The GA4GH VRS identifiers provide a standardized way to represent genomic variations, making it easier to exchange and share genomic information.

In addition, this project facilitates the retrieval of evidence associated with genomic alleles by leveraging the Genomic Data Representation and Knowledge Base (GA4GH MetaKB) service. GA4GH MetaKB provides a comprehensive knowledge base that links genomic variants to relevant clinical variant interpretations.

## Features

1. **VCF File Processing:**
- Streamlines reading and parsing of VCF files, to extract relevant genomic information.

2. **GA4GH VRS Identifier Lookup:**
- Utilizes the GA4GH VRS API to perform lookups for each genomic variation mentioned in the VCF file.
- Retrieves standardized identifiers for the alleles, enhancing interoperability with GA4GH-compliant systems.
- GA4GH MetaKB Service Integration: Utilizes the GA4GH MetaKB retrieve evidence associated with specified genomic alleles.

3. **Output Generation:**
- Generates summary metrics about throughput, errors, evidence, and hits.
- Presents the retrieved evidence in a structured format, providing access to information about studies, publications, and other relevant details.
- Create cohort allele frequency objects

4. **Additional Features**
- Provides configurable options like threading and caching for processing VCFs.
- Implements robust error handling to address issues like invalid input files, invalid variants, and more.

## Getting Started

### Prerequisites

- Python 3.10 or later
- Internet connectivity for data dependency setup (seqrepo)

### Installation

1. Get the repository either by...
1. Source code
```bash
git clone https://github.com/gks-anvil/vrs_anvil_toolkit
cd vrs_anvil_toolkit
```
2. PyPi
```bash
pip install vrs_anvil_toolkit
```

2. Install dependencies either...
1. for local use
```bash
# install postgresql@14 (required for vrs-python)
brew install postgresql@14
bash scripts/setup.sh
```
2. for use on Terra
```bash
bash terra/setup.sh
```

### Usage
**General**
All usage has the following general steps...

1. Create a manifest to configure your VCF processing run
1. Use the `vrs_bulk` CLI to create a metrics file of related evidence
1. Use the metrics files for downstream analysis

The follow steps are explained in detail below, with some additional info on using vrs-python to directly annotate VCFs with VRS IDs.

**Manifest**

The configuration of each VCF processing run run is controlled by a `manifest.yaml` file. Most importantly, this file specifies the...
- input VCF file(s) to process
- working directories
- performance and strictness configurations

Use this commented [sample manifest](tests/fixtures/manifest.yaml) as a starting point on the specific variables you can specify per run.

**CLI**

Below are a list of command line utilities that may be useful
```bash
# activate the environment
source venv/bin/activate

# run the vrs_bulk command in the foreground
vrs_bulk annotate

# run the vrs_bulk command in parallel, one process per VCF file
vrs_bulk annotate --scatter

# run the vrs_bulk command in parallel in the background
nohup vrs_bulk annotate --scatter & # press enter to continue

# get the status of the processes for the most recent scatter run
vrs_bulk ps
```

The command line utility supports Google Cloud URIs and running commands in the background to interop with Terra out-of-the-box. This is described in the CLI usage above. For an example notebook, see `vrs-anvil-demo.ipynb` on the `vrs-anvil` workspace.

## Cohort Allele Frequency Generation

### Description
Given a variant and an optional phenotype of interest, get aggregated allele frequency info for a cohort of interest as a cohort allele frequency object ([CAF](https://va-ga4gh.readthedocs.io/en/1.0.0-ballot.2024-11/base-profiles/study-result-profiles.html#cohort-allele-frequency-study-result)).

### General Prerequisites
- Variant of interest
- Valid VRS-annotated joint VCF
- genotyping laid out per-sample
- Precomputed VRS-VCF index (created using [vrsix](https://github.com/gks-anvil/vrsix))
- this enables efficient retrieval of VCF row by VRS ID
- [Optional] Phenotype of interest to specify subcohort
- [Optional] Plugins for project-specific transformations (see [here](README.md#plugins-for-unique-data-inputs) for more info)

### Use Cases
1. Given a variant ID and VCF path, get the CAF for the entire cohort
- Get VCF row corresponding to variant ID using a variant -> VCF row index
- Get phenotypes corresponding to each participants using the phenotypes by patient table
- Aggregate counts for participants using their genotypes
- Create CAF object using counts

2. Given a variant ID, VCF path, **and participant list**, get the CAF for a subset of participants (subcohort)
- Same as 1 with subcohort defined by the participant list

3. Given a variant ID, VCF path, **and phenotype**, get the CAF for a subcohort with a specified phenotype
- Same as 1, but for a subcohort of samples with the phenotype

## Plugins for Unique Data Inputs

### Description
Given the broad variety of data representation used by data generators, we want to be able to generate CAFs for any data consortium. One way this is possible is through the use of a plugin architecture.

A plugin architecture allows users to customize the aggregation of cohort data specified to their data model. This addresses the problem where...
1. a user has a project-specific phenotype data model
1. Example: sample-level rare disease data is stored in an unaggregated Terra data table
2. a user is interested in a subset of the cohort based on particular filters
1. Example: samples must have phenotype A and a minimum read depth to be included in the subcohort
3. each sample's genotype must be calculated uniquely depending on particular traits
1. Example: sex is not represented within the VCF, so a user needs to integrate sample-level phenotype data to get accurate counts for chrX variants

These three problems above map to three different methods necessary in implementing a `Plugin`:
1. `__init__`: Given any set of parameters, create a phenotype index that maps each sample to its list of phenotypes.
2. `include_sample`: given a sample's variant-level or phenotype data, determine whether to include the sample in the allele count
1. This takes a `pysam.VariantRecord` as input to represent a particular variant record in a VCF
2. For more details, consult the pysam [VariantRecord docs](https://pysam.readthedocs.io/en/latest/api.html#pysam.VariantRecord).
3. `process_sample_genotype`: determine how to sum the alleles of a sample's genotype using variant-level or phenotype data
1. This also makes use of a `pysam.VariantRecord` as input
2. An `alt_index` is also passed in as an input, which is the index representing the allele of interest within the VCF row. The alt index matches the genotype according to [VCF specification](https://samtools.github.io/hts-specs/VCFv4.2.pdf). For instance, a sample with the 2nd alt might have a genotype containing a 2, ie `(2,1)`, `(2,0)`, `(2,2)`, etc.

To implement your own plugin....

### Getting Started

There are two types of user stories for plugins: one will be implementing the project-specific plugin, while the other is using an already-built plugin. The former user will be called a plugin author, while the latter will be called a plugin user.

**Plugin Author**
1. Read through the default implementations defined in the [`BasePlugin`](src/plugin_system/plugins/base_plugin.py).
2. Copy [`simple_plugin.py`](src/plugin_system/plugins/gregor_plugin.py) to your working directory. This has to be in the **top-level directory** where you will do your CAF generation.
3. Rename the plugin class and name (eg `MyProjectPlugin` and `my_project_plugin.py`). The file name must end in `_plugin.py` to be a valid module.
4. Customize any of the methods described by the [`BasePlugin`](src/plugin_system/plugins/base_plugin.py) by implementing them in your own plugin class. For reference material...
1. [`BasePlugin`](src/plugin_system/plugins/base_plugin.py) is by default the parent class, so any methods that aren't defined in your plugin will inherit these methods. You can also define the `BasePlugin`'s implementations by calling `super().<method_to_invoke>` if you want to do additional work after using the default methods.
2. [`GregorPlugin`](src/plugin_system/plugins/gregor_plugin.py) is a worked example of specific real-world implementation, refer to that for alternative ways to customize allele frequency generation.
3. Plugin [utilities](src/plugin_system/utils.py) are a set of data transformation util methods that might be useful. For example, if your phenotypical data lives in a Terra data table, you could use `terra_data_table_to_dataframe` to rapidly retrieve the columns most relevant in creating a phenotype index.

**Plugin User**
1. Confirm that you have the variant of interest, [optional] phenotype of interest, VCF path of interest at your disposal, and name of implemented plugin class
2. See the `test_plugin_worked_example` function in [test_plugin.py](tests/unit/test_plugin.py) for a worked example on how to use plugin. Generally, the two main components related to using the plugin are...
1. For your `MyProjectPlugin` plugin, instantiate it with the `PluginManager` and any input parameters specified.
2. Call `get_cohort_allele_frequency` passing in the instantiated `MyProjectPlugin` object with the "plugin" parameter.

## Processing VCF Files ([vrs-python](https://github.com/ga4gh/vrs-python))

vrs-python is a GA4GH GKS package centered around creating Variant Representation specification (VRS) IDs: consistent, globally unique identifiers for variation. Some of its functionality includes variant ID translation and VCF annotation. Used as a dependency in `vrs_bulk`, it can also be used as a standalone package.

For Python usage, see [vrs_vcf_annotator.py](scripts/vrs_vcf_annotator.py) for an example.

For CLI usage:
```bash
python3 -m ga4gh.vrs.extras.vcf_annotation --vcf_in tests/fixtures/1kGP.chr1.1000.vcf --vcf_out annotated_output.vcf.gz --vrs_pickle_out allele_dicts.pkl --seqrepo_root_dir ~/seqrepo/latest
```

The above is an example using an example vcf. Replace the `--vcf_out` and `vrs_pickle_out` here with your desired output file path, where the output vcf can be BCF (`vcf.gz`) or VCF (`vcf`)

Also, see the [VRS Annotator](https://dockstore.org/workflows/github.com/gks-anvil/vrs-annotator/VRSAnnotator:main?tab=info) workflow on Dockstore for a way to do this on Terra.

## GREGoR-specific Details

### Work in Progress
- For chromosomes with ploidy of 1 (mitochondrial calling or sex chromosomes), focus allele counts (AC) and locus allele counts (AN) can have a maximum value of 1. Focus allele counts are 1 when the genotype has at least a single allele match (0/1, 1/1, or 1) otherwise it is none.

## Contributing

This project is open to contributions from the research community. If you are interested in contributing to the project, please contact the project team.
See the [contributing guide](CONTRIBUTING.md) for more information on how to contribute to the project.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE.md) file for details.

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "vrs-anvil-toolkit",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": null,
    "author": "GKS-AnVIL",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/1a/c0/c35eb889d2411a317506a1cddabc741baa9f8f05b62969740c31620a7db8/vrs_anvil_toolkit-0.3.1.tar.gz",
    "platform": null,
    "description": "<img width=\"100\" alt=\"image\" src=\"https://github.com/user-attachments/assets/b3d5bb78-2794-4182-a7b9-11ab6969f666\">\n\n# VRS AnVIL Toolkit\n\n## Project Overview\n\nThis Python package is designed to process Variant Call Format (VCF) files and perform lookup operations on Genomic Variation Representation Service (GA4GH VRS) identifiers. The GA4GH VRS identifiers provide a standardized way to represent genomic variations, making it easier to exchange and share genomic information.\n\nIn addition, this project facilitates the retrieval of evidence associated with genomic alleles by leveraging the Genomic Data Representation and Knowledge Base (GA4GH MetaKB) service. GA4GH MetaKB provides a comprehensive knowledge base that links genomic variants to relevant clinical variant interpretations.\n\n## Features\n\n1. **VCF File Processing:**\n   - Streamlines reading and parsing of VCF files, to extract relevant genomic information.\n\n2. **GA4GH VRS Identifier Lookup:**\n   - Utilizes the GA4GH VRS API to perform lookups for each genomic variation mentioned in the VCF file.\n   - Retrieves standardized identifiers for the alleles, enhancing interoperability with GA4GH-compliant systems.\n   - GA4GH MetaKB Service Integration:  Utilizes the GA4GH MetaKB retrieve evidence associated with specified genomic alleles.\n\n3. **Output Generation:**\n   - Generates summary metrics about throughput, errors, evidence, and hits.\n   - Presents the retrieved evidence in a structured format, providing access to information about studies, publications, and other relevant details.\n   - Create cohort allele frequency objects\n\n4. **Additional Features**\n   - Provides configurable options like threading and caching for processing VCFs.\n   - Implements robust error handling to address issues like invalid input files, invalid variants, and more.\n\n## Getting Started\n\n### Prerequisites\n\n- Python 3.10 or later\n- Internet connectivity for data dependency setup (seqrepo)\n\n### Installation\n\n1. Get the repository either by...\n   1. Source code\n   ```bash\n   git clone https://github.com/gks-anvil/vrs_anvil_toolkit\n   cd vrs_anvil_toolkit\n   ```\n   2. PyPi\n   ```bash\n   pip install vrs_anvil_toolkit\n   ```\n\n2. Install dependencies either...\n   1. for local use\n   ```bash\n   # install postgresql@14 (required for vrs-python)\n   brew install postgresql@14\n   bash scripts/setup.sh\n   ```\n   2. for use on Terra\n   ```bash\n   bash terra/setup.sh\n   ```\n\n### Usage\n**General**\nAll usage has the following general steps...\n\n1. Create a manifest to configure your VCF processing run\n1. Use the `vrs_bulk` CLI to create a metrics file of related evidence\n1. Use the metrics files for downstream analysis\n\nThe follow steps are explained in detail below, with some additional info on using vrs-python to directly annotate VCFs with VRS IDs.\n\n**Manifest**\n\nThe configuration of each VCF processing run run is controlled by a `manifest.yaml` file. Most importantly, this file specifies the...\n- input VCF file(s) to process\n- working directories\n- performance and strictness configurations\n\nUse this commented [sample manifest](tests/fixtures/manifest.yaml) as a starting point on the specific variables you can specify per run.\n\n**CLI**\n\nBelow are a list of command line utilities that may be useful\n```bash\n# activate the environment\nsource venv/bin/activate\n\n# run the vrs_bulk command in the foreground\nvrs_bulk annotate\n\n# run the vrs_bulk command in parallel, one process per VCF file\nvrs_bulk annotate --scatter\n\n# run the vrs_bulk command in parallel in the background\nnohup vrs_bulk annotate --scatter & # press enter to continue\n\n# get the status of the processes for the most recent scatter run\nvrs_bulk ps\n```\n\nThe command line utility supports Google Cloud URIs and running commands in the background to interop with Terra out-of-the-box. This is described in the CLI usage above. For an example notebook, see `vrs-anvil-demo.ipynb` on the `vrs-anvil` workspace.\n\n## Cohort Allele Frequency Generation\n\n### Description\nGiven a variant and an optional phenotype of interest, get aggregated allele frequency info for a cohort of interest as a cohort allele frequency object ([CAF](https://va-ga4gh.readthedocs.io/en/1.0.0-ballot.2024-11/base-profiles/study-result-profiles.html#cohort-allele-frequency-study-result)).\n\n### General Prerequisites\n- Variant of interest\n- Valid VRS-annotated joint VCF\n  - genotyping laid out per-sample\n- Precomputed VRS-VCF index (created using [vrsix](https://github.com/gks-anvil/vrsix))\n  - this enables efficient retrieval of VCF row by VRS ID\n- [Optional] Phenotype of interest to specify subcohort\n- [Optional] Plugins for project-specific transformations (see [here](README.md#plugins-for-unique-data-inputs) for more info)\n\n### Use Cases\n1. Given a variant ID and VCF path, get the CAF for the entire cohort\n   - Get VCF row corresponding to variant ID using a variant -> VCF row index\n   - Get phenotypes corresponding to each participants using the phenotypes by patient table\n   - Aggregate counts for participants using their genotypes\n   - Create CAF object using counts\n\n2. Given a variant ID, VCF path, **and participant list**, get the CAF for a subset of participants (subcohort)\n   - Same as 1 with subcohort defined by the participant list\n\n3. Given a variant ID, VCF path, **and phenotype**, get the CAF for a subcohort with a specified phenotype\n   - Same as 1, but for a subcohort of samples with the phenotype\n\n\n## Plugins for Unique Data Inputs\n\n### Description\nGiven the broad variety of data representation used by data generators, we want to be able to generate CAFs for any data consortium. One way this is possible is through the use of a plugin architecture.\n\nA plugin architecture allows users to customize the aggregation of cohort data specified to their data model. This addresses the problem where...\n1. a user has a project-specific phenotype data model\n   1. Example: sample-level rare disease data is stored in an unaggregated Terra data table\n2. a user is interested in a subset of the cohort based on particular filters\n   1. Example: samples must have phenotype A and a minimum read depth to be included in the subcohort\n3. each sample's genotype must be calculated uniquely depending on particular traits\n   1. Example: sex is not represented within the VCF, so a user needs to integrate sample-level phenotype data to get accurate counts for chrX variants\n\nThese three problems above map to three different methods necessary in implementing a `Plugin`:\n1. `__init__`: Given any set of parameters, create a phenotype index that maps each sample to its list of phenotypes.\n2. `include_sample`: given a sample's variant-level or phenotype data, determine whether to include the sample in the allele count\n   1. This takes a `pysam.VariantRecord` as input to represent a particular variant record in a VCF\n   2. For more details, consult the pysam [VariantRecord docs](https://pysam.readthedocs.io/en/latest/api.html#pysam.VariantRecord).\n3. `process_sample_genotype`: determine how to sum the alleles of a sample's genotype using variant-level or phenotype data\n   1. This also makes use of a `pysam.VariantRecord` as input\n   2. An `alt_index` is also passed in as an input, which is the index representing the allele of interest within the VCF row. The alt index matches the genotype according to [VCF specification](https://samtools.github.io/hts-specs/VCFv4.2.pdf). For instance, a sample with the 2nd alt might have a genotype containing a 2, ie `(2,1)`, `(2,0)`, `(2,2)`, etc.\n\nTo implement your own plugin....\n\n### Getting Started\n\nThere are two types of user stories for plugins: one will be implementing the project-specific plugin, while the other is using an already-built plugin. The former user will be called a plugin author, while the latter will be called a plugin user.\n\n**Plugin Author**\n1. Read through the default implementations defined in the [`BasePlugin`](src/plugin_system/plugins/base_plugin.py).\n2. Copy [`simple_plugin.py`](src/plugin_system/plugins/gregor_plugin.py) to your working directory. This has to be in the **top-level directory** where you will do your CAF generation.\n3. Rename the plugin class and name (eg `MyProjectPlugin` and `my_project_plugin.py`). The file name must end in `_plugin.py` to be a valid module.\n4. Customize any of the methods described by the [`BasePlugin`](src/plugin_system/plugins/base_plugin.py) by implementing them in your own plugin class. For reference material...\n   1. [`BasePlugin`](src/plugin_system/plugins/base_plugin.py) is by default the parent class, so any methods that aren't defined in your plugin will inherit these methods. You can also define the `BasePlugin`'s implementations by calling `super().<method_to_invoke>` if you want to do additional work after using the default methods.\n   2. [`GregorPlugin`](src/plugin_system/plugins/gregor_plugin.py) is a worked example of specific real-world implementation, refer to that for alternative ways to customize allele frequency generation.\n   3. Plugin [utilities](src/plugin_system/utils.py) are a set of data transformation util methods that might be useful. For example, if your phenotypical data lives in a Terra data table, you could use `terra_data_table_to_dataframe` to rapidly retrieve the columns most relevant in creating a phenotype index.\n\n**Plugin User**\n1. Confirm that you have the variant of interest, [optional] phenotype of interest, VCF path of interest at your disposal, and name of implemented plugin class\n2. See the `test_plugin_worked_example` function in [test_plugin.py](tests/unit/test_plugin.py) for a worked example on how to use plugin. Generally, the two main components related to using the plugin are...\n   1. For your `MyProjectPlugin` plugin, instantiate it with the `PluginManager` and any input parameters specified.\n   2. Call `get_cohort_allele_frequency` passing in the instantiated `MyProjectPlugin` object with the \"plugin\" parameter.\n\n\n## Processing VCF Files ([vrs-python](https://github.com/ga4gh/vrs-python))\n\nvrs-python is a GA4GH GKS package centered around creating Variant Representation specification (VRS) IDs: consistent, globally unique identifiers for variation. Some of its functionality includes variant ID translation and VCF annotation. Used as a dependency in `vrs_bulk`, it can also be used as a standalone package.\n\nFor Python usage, see [vrs_vcf_annotator.py](scripts/vrs_vcf_annotator.py) for an example.\n\nFor CLI usage:\n```bash\npython3 -m ga4gh.vrs.extras.vcf_annotation --vcf_in tests/fixtures/1kGP.chr1.1000.vcf --vcf_out annotated_output.vcf.gz --vrs_pickle_out allele_dicts.pkl --seqrepo_root_dir ~/seqrepo/latest\n```\n\nThe above is an example using an example vcf. Replace the `--vcf_out` and `vrs_pickle_out` here with your desired output file path, where the output vcf can be BCF (`vcf.gz`) or VCF (`vcf`)\n\nAlso, see the [VRS Annotator](https://dockstore.org/workflows/github.com/gks-anvil/vrs-annotator/VRSAnnotator:main?tab=info) workflow on Dockstore for a way to do this on Terra.\n\n## GREGoR-specific Details\n\n### Work in Progress\n- For chromosomes with ploidy of 1 (mitochondrial calling or sex chromosomes), focus allele counts (AC) and locus allele counts (AN) can have a maximum value of 1. Focus allele counts are 1 when the genotype has at least a single allele match (0/1, 1/1, or 1) otherwise it is none.\n\n## Contributing\n\nThis project is open to contributions from the research community. If you are interested in contributing to the project, please contact the project team.\nSee the [contributing guide](CONTRIBUTING.md) for more information on how to contribute to the project.\n\n## License\n\nThis project is licensed under the MIT License - see the [LICENSE](LICENSE.md) file for details.\n",
    "bugtrack_url": null,
    "license": "MIT License\n        \n        Copyright (c) 2024 Oregon Health and Science University Computational Biology\n        \n        Permission is hereby granted, free of charge, to any person obtaining a copy\n        of this software and associated documentation files (the \"Software\"), to deal\n        in the Software without restriction, including without limitation the rights\n        to use, copy, modify, merge, publish, distribute, sublicense, and/or sell\n        copies of the Software, and to permit persons to whom the Software is\n        furnished to do so, subject to the following conditions:\n        \n        The above copyright notice and this permission notice shall be included in all\n        copies or substantial portions of the Software.\n        \n        THE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\n        IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\n        FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\n        AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\n        LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\n        OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\n        SOFTWARE.\n        ",
    "summary": "Useful tools and methods for applying GA4GH GKS and VRS models on the NHGRI AnVIL platform.",
    "version": "0.3.1",
    "project_urls": {
        "Bug Tracker": "https://github.com/gks-anvil/vrs_anvil_toolkit/issues",
        "Changelog": "https://github.com/gks-anvil/vrs_anvil_toolkit/releases",
        "Documentation": "https://github.com/gks-anvil/vrs_anvil_toolkit",
        "Homepage": "https://github.com/gks-anvil/vrs_anvil_toolkit",
        "Source": "https://github.com/gks-anvil/vrs_anvil_toolkit"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "f576395c0702904b0a9784ba1ee0b1de85a51b5380be3f336749929c76ebfd60",
                "md5": "7c3923ca10bac037e9439db7436df76a",
                "sha256": "2c21ffb2f231534c79bc431cda3c69d6dec0d538de2006886ba1e585bfe62be4"
            },
            "downloads": -1,
            "filename": "vrs_anvil_toolkit-0.3.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "7c3923ca10bac037e9439db7436df76a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 30282,
            "upload_time": "2025-03-03T21:11:02",
            "upload_time_iso_8601": "2025-03-03T21:11:02.066467Z",
            "url": "https://files.pythonhosted.org/packages/f5/76/395c0702904b0a9784ba1ee0b1de85a51b5380be3f336749929c76ebfd60/vrs_anvil_toolkit-0.3.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "1ac0c35eb889d2411a317506a1cddabc741baa9f8f05b62969740c31620a7db8",
                "md5": "531324ff9012a17eae900bd46404edbd",
                "sha256": "e4dde8d99aa35932bf4428fb7ab843ecbac34aceb0af4fa4b364800b9907e60a"
            },
            "downloads": -1,
            "filename": "vrs_anvil_toolkit-0.3.1.tar.gz",
            "has_sig": false,
            "md5_digest": "531324ff9012a17eae900bd46404edbd",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 1353375,
            "upload_time": "2025-03-03T21:11:05",
            "upload_time_iso_8601": "2025-03-03T21:11:05.119548Z",
            "url": "https://files.pythonhosted.org/packages/1a/c0/c35eb889d2411a317506a1cddabc741baa9f8f05b62969740c31620a7db8/vrs_anvil_toolkit-0.3.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-03-03 21:11:05",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "gks-anvil",
    "github_project": "vrs_anvil_toolkit",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "ga4gh.vrs",
            "specs": [
                [
                    "==",
                    "2.0.0a13"
                ]
            ]
        },
        {
            "name": "diskcache",
            "specs": []
        },
        {
            "name": "biocommons.seqrepo",
            "specs": []
        },
        {
            "name": "glom",
            "specs": []
        },
        {
            "name": "click",
            "specs": []
        },
        {
            "name": "pyyaml",
            "specs": []
        },
        {
            "name": "google",
            "specs": []
        },
        {
            "name": "requests",
            "specs": []
        },
        {
            "name": "boto3",
            "specs": []
        },
        {
            "name": "tqdm",
            "specs": []
        },
        {
            "name": "google-cloud-storage",
            "specs": []
        },
        {
            "name": "psutil",
            "specs": []
        },
        {
            "name": "setuptools",
            "specs": []
        },
        {
            "name": "firecloud",
            "specs": []
        },
        {
            "name": "ga4gh.va_spec",
            "specs": [
                [
                    "~=",
                    "0.2.0a0"
                ]
            ]
        },
        {
            "name": "pandas",
            "specs": []
        },
        {
            "name": "vrsix",
            "specs": [
                [
                    "==",
                    "0.1.1"
                ]
            ]
        }
    ],
    "lcname": "vrs-anvil-toolkit"
}

GKS-AnVIL