vcf2maf-lite


Namevcf2maf-lite JSON
Version 0.1.1 PyPI version JSON
download
home_pagehttps://github.com/genome-nexus/vcf2maf-lite
SummaryVCF to MAF format converter
upload_time2024-07-30 21:38:59
maintainerRamya Madupuri
docs_urlNone
authorAngelica Ochoa
requires_python>=3.7
licenseMIT
keywords vcf2maf vcftomaf vcf maf lite
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # vcf2maf_lite

vcf2maf_lite is a lightweight Python adaptation of the [vcf2maf Perl tool](https://github.com/mskcc/vcf2maf), designed to convert the VCF format to MAF format without adding variant annotations. For annotating a MAF file, [Genome Nexus](https://github.com/genome-nexus/genome-nexus-annotation-pipeline) can be utilized.

### Installation using pip

```
pip3 install vcf2maf_lite
```

Usage:

```
vcf2maf_lite --help

Usage: vcf2maf_lite [OPTIONS]

Options:
  -i, --input-data TEXT        A list of .vcf files or input data directories,
                               separated by commas  [required]
  -o, --output-directory TEXT  output data directory [optional]
  -c, --center TEXT            name of center (standard MAF field = 'Center')
                               [optional]
  -s, --sequence-source TEXT   Sequencing source (standard MAF field =
                               'Sequencing_Source'), e.g., WXS or WGS
                               [optional]
  -t, --tumor-id TEXT          The ID of the tumor sample utilized in the
                               genotype columns of the VCF file. [optional]
  -n, --normal-id TEXT         The ID of the normal sample utilized in the
                               genotype columns of the VCF file. [optional]
  -a, --retain-info TEXT       Comma-delimited names of INFO fields to retain
                               as extra columns in MAF [optional]
  -f, --retain-fmt TEXT        Comma-delimited names of FORMAT fields to
                               retain as extra columns in MAF [optional]
  --help                       Show this message and exit.
```

Example Usage:
```
vcf2maf_lite --input-data /data/vcf --output-directory /data/maf/ --center CTR --sequence-source WGS --tumor-id Tumor --normal-id Normal --retain-info Custom_filters,AC,AF,AC_nfe_seu,AC_afr,AF_afr --retain-fmt alt_count_raw,ref_count_raw,depth_raw
```

This command converts the VCF files in /vcf folder to MAF format. 
- The `--input-data` option is used to specify either a single VCF file or a directory containing multiple VCF files (separated by commas). This option supports passing multiple input files or directories at once.
- The `--output-directory` option allows you to specify the directory where the MAF files will be saved. If no output path is provided, the default output directory `vcf2maf_output` will be used in the current working directory. 
- The `--tumor-id` option allows you to specify the ID of the tumor sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the tumor ID from either the `tumor_sample` keyword in the meta data lines or the sample columns from VCF header.
- The `--normal-id` option allows you to specify the ID of the normal sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the normal ID from either the `normal_sample` keyword in the meta data lines or the sample columns from VCF header.
- The `--retain-info` option allows you to specify the INFO fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.
- The `--retain-fmt` option allows you to specify the FORMAT fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.


Importing to Python Scripts:
```
from vcf2maf_lite.vcf2maf_lite import main
from click.testing import CliRunner

runner = CliRunner()
runner.invoke(main, ['--input-data','test_vcf.vcf'])
```



            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/genome-nexus/vcf2maf-lite",
    "name": "vcf2maf-lite",
    "maintainer": "Ramya Madupuri",
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": null,
    "keywords": "vcf2maf, vcftomaf, vcf, maf, lite",
    "author": "Angelica Ochoa",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/68/bc/a46dd228ced766a492713370a0415ee6293edf0919c0c65b26efda95c6cf/vcf2maf_lite-0.1.1.tar.gz",
    "platform": null,
    "description": "# vcf2maf_lite\n\nvcf2maf_lite is a lightweight Python adaptation of the [vcf2maf Perl tool](https://github.com/mskcc/vcf2maf), designed to convert the VCF format to MAF format without adding variant annotations. For annotating a MAF file, [Genome Nexus](https://github.com/genome-nexus/genome-nexus-annotation-pipeline) can be utilized.\n\n### Installation using pip\n\n```\npip3 install vcf2maf_lite\n```\n\nUsage:\n\n```\nvcf2maf_lite --help\n\nUsage: vcf2maf_lite [OPTIONS]\n\nOptions:\n  -i, --input-data TEXT        A list of .vcf files or input data directories,\n                               separated by commas  [required]\n  -o, --output-directory TEXT  output data directory [optional]\n  -c, --center TEXT            name of center (standard MAF field = 'Center')\n                               [optional]\n  -s, --sequence-source TEXT   Sequencing source (standard MAF field =\n                               'Sequencing_Source'), e.g., WXS or WGS\n                               [optional]\n  -t, --tumor-id TEXT          The ID of the tumor sample utilized in the\n                               genotype columns of the VCF file. [optional]\n  -n, --normal-id TEXT         The ID of the normal sample utilized in the\n                               genotype columns of the VCF file. [optional]\n  -a, --retain-info TEXT       Comma-delimited names of INFO fields to retain\n                               as extra columns in MAF [optional]\n  -f, --retain-fmt TEXT        Comma-delimited names of FORMAT fields to\n                               retain as extra columns in MAF [optional]\n  --help                       Show this message and exit.\n```\n\nExample Usage:\n```\nvcf2maf_lite --input-data /data/vcf --output-directory /data/maf/ --center CTR --sequence-source WGS --tumor-id Tumor --normal-id Normal --retain-info Custom_filters,AC,AF,AC_nfe_seu,AC_afr,AF_afr --retain-fmt alt_count_raw,ref_count_raw,depth_raw\n```\n\nThis command converts the VCF files in /vcf folder to MAF format. \n- The `--input-data` option is used to specify either a single VCF file or a directory containing multiple VCF files (separated by commas). This option supports passing multiple input files or directories at once.\n- The `--output-directory` option allows you to specify the directory where the MAF files will be saved. If no output path is provided, the default output directory `vcf2maf_output` will be used in the current working directory. \n- The `--tumor-id` option allows you to specify the ID of the tumor sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the tumor ID from either the `tumor_sample` keyword in the meta data lines or the sample columns from VCF header.\n- The `--normal-id` option allows you to specify the ID of the normal sample used in the genotype columns of the VCF file. If the option is not used, the script will automatically identify the normal ID from either the `normal_sample` keyword in the meta data lines or the sample columns from VCF header.\n- The `--retain-info` option allows you to specify the INFO fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.\n- The `--retain-fmt` option allows you to specify the FORMAT fields to be retained as additional columns in the MAF. If the option is not used, standard MAF columns are included by default.\n\n\nImporting to Python Scripts:\n```\nfrom vcf2maf_lite.vcf2maf_lite import main\nfrom click.testing import CliRunner\n\nrunner = CliRunner()\nrunner.invoke(main, ['--input-data','test_vcf.vcf'])\n```\n\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "VCF to MAF format converter",
    "version": "0.1.1",
    "project_urls": {
        "Bug Tracker": "https://github.com/genome-nexus/vcf2maf-lite/issues",
        "Homepage": "https://github.com/genome-nexus/vcf2maf-lite",
        "Repository": "https://github.com/genome-nexus/vcf2maf-lite"
    },
    "split_keywords": [
        "vcf2maf",
        " vcftomaf",
        " vcf",
        " maf",
        " lite"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f6afbdb1bd7b0b6f7e586c8feed7f5099317438eaf8a31f137a2e5a091448895",
                "md5": "e9025a7de227c553e16451ba3824f761",
                "sha256": "79ed99e0285dda9fe039a3b5a7572ff4d3987576a75fe7c6a4dd2ea41e2f8e38"
            },
            "downloads": -1,
            "filename": "vcf2maf_lite-0.1.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "e9025a7de227c553e16451ba3824f761",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.7",
            "size": 18178,
            "upload_time": "2024-07-30T21:38:58",
            "upload_time_iso_8601": "2024-07-30T21:38:58.144089Z",
            "url": "https://files.pythonhosted.org/packages/f6/af/bdb1bd7b0b6f7e586c8feed7f5099317438eaf8a31f137a2e5a091448895/vcf2maf_lite-0.1.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "68bca46dd228ced766a492713370a0415ee6293edf0919c0c65b26efda95c6cf",
                "md5": "ab7eb1315de735e292d221e8c6859955",
                "sha256": "105aa4fa45e3585f72e92cf6a3d905c06b34d3744bace407ab870869badc045a"
            },
            "downloads": -1,
            "filename": "vcf2maf_lite-0.1.1.tar.gz",
            "has_sig": false,
            "md5_digest": "ab7eb1315de735e292d221e8c6859955",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 18283,
            "upload_time": "2024-07-30T21:38:59",
            "upload_time_iso_8601": "2024-07-30T21:38:59.130699Z",
            "url": "https://files.pythonhosted.org/packages/68/bc/a46dd228ced766a492713370a0415ee6293edf0919c0c65b26efda95c6cf/vcf2maf_lite-0.1.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-07-30 21:38:59",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "genome-nexus",
    "github_project": "vcf2maf-lite",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "vcf2maf-lite"
}
        
Elapsed time: 0.88779s