biofile


Namebiofile JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://github.com/Tiezhengyuan/bio_file
SummaryProcess various file format for RNA-Seq data analysis
upload_time2024-04-01 22:23:42
maintainerNone
docs_urlNone
authorTiezheng Yuan
requires_pythonNone
licenseNone
keywords pypi cicd python
VCS
bugtrack_url
requirements Bio biosequtils ddt numpy pandas
Travis-CI No Travis.
coveralls test coverage No coveralls.
            \n# Bioinformatics Tool: bioFile

## Introduction
Retrieve data from various file formats used in RNA-Seq data analysis. The tool currently support:
- GTF file: genomic annotations
- GFF file: genomic annoations

quick installation
```
pip install biofile
```


## Development

```
git clone git@github.com:Tiezhengyuan/bio_file.git
cd bio_file
source venv/bin/activate
```

Run unit testing:
```
pytest tests/unittests
```

## Quick tour


### Process GFF:
Retrieve annotations by features from <gff_file>. Multiple json files would be stored in <out_dir>
```
from biofile import GFF
g = GFF(gff_file, out_dir)
g.split_by_features()
```

Given an attribute, retrieve annotations from <gff_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.
```
from biofile import GFF
g = GFF(gff_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')
```

### Process GTF:
Retrieve annotations by features from <gtf_file>. Multiple json files would be stored in <out_dir>
```
from biofile import GTF
g = GTF(gtf_file, out_dir)
g.split_by_features()
```

Given an attribute, retrieve annotations from <gtf_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.
```
from biofile import GTF
g = GTF(gtf_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')
```




            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/Tiezhengyuan/bio_file",
    "name": "biofile",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "pypi, cicd, python",
    "author": "Tiezheng Yuan",
    "author_email": "tiezhengyuan@hotmail.com",
    "download_url": "https://files.pythonhosted.org/packages/c9/b6/fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5/biofile-0.1.0.tar.gz",
    "platform": null,
    "description": "\\n# Bioinformatics Tool: bioFile\n\n## Introduction\nRetrieve data from various file formats used in RNA-Seq data analysis. The tool currently support:\n- GTF file: genomic annotations\n- GFF file: genomic annoations\n\nquick installation\n```\npip install biofile\n```\n\n\n## Development\n\n```\ngit clone git@github.com:Tiezhengyuan/bio_file.git\ncd bio_file\nsource venv/bin/activate\n```\n\nRun unit testing:\n```\npytest tests/unittests\n```\n\n## Quick tour\n\n\n### Process GFF:\nRetrieve annotations by features from <gff_file>. Multiple json files would be stored in <out_dir>\n```\nfrom biofile import GFF\ng = GFF(gff_file, out_dir)\ng.split_by_features()\n```\n\nGiven an attribute, retrieve annotations from <gff_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.\n```\nfrom biofile import GFF\ng = GFF(gff_file, out_dir)\ng.parse_attributes('transcript_id', 'mRNA')\n```\n\n### Process GTF:\nRetrieve annotations by features from <gtf_file>. Multiple json files would be stored in <out_dir>\n```\nfrom biofile import GTF\ng = GTF(gtf_file, out_dir)\ng.split_by_features()\n```\n\nGiven an attribute, retrieve annotations from <gtf_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.\n```\nfrom biofile import GTF\ng = GTF(gtf_file, out_dir)\ng.parse_attributes('transcript_id', 'mRNA')\n```\n\n\n\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Process various file format for RNA-Seq data analysis",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/Tiezhengyuan/bio_file"
    },
    "split_keywords": [
        "pypi",
        " cicd",
        " python"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d35c21620078b8fd47d66491faa3cb33b12a058bca7b9c3f9c387e3ccb4d5f62",
                "md5": "8c8015ae2ff05f714f19445604412377",
                "sha256": "857356795aa6d57ea374ded9c9f18abbb3a4996c88ff0e958cd3fb8a8dd1b031"
            },
            "downloads": -1,
            "filename": "biofile-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "8c8015ae2ff05f714f19445604412377",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 16606,
            "upload_time": "2024-04-01T22:23:41",
            "upload_time_iso_8601": "2024-04-01T22:23:41.094320Z",
            "url": "https://files.pythonhosted.org/packages/d3/5c/21620078b8fd47d66491faa3cb33b12a058bca7b9c3f9c387e3ccb4d5f62/biofile-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c9b6fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5",
                "md5": "df591be83314eced171e43c00cde736a",
                "sha256": "a009d31b3c3d523e656e3d8d5ce49aee8ce0f9954e21dbc04730ecfff4a96b0c"
            },
            "downloads": -1,
            "filename": "biofile-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "df591be83314eced171e43c00cde736a",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 12787,
            "upload_time": "2024-04-01T22:23:42",
            "upload_time_iso_8601": "2024-04-01T22:23:42.748160Z",
            "url": "https://files.pythonhosted.org/packages/c9/b6/fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5/biofile-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-01 22:23:42",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "Tiezhengyuan",
    "github_project": "bio_file",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "Bio",
            "specs": []
        },
        {
            "name": "biosequtils",
            "specs": []
        },
        {
            "name": "ddt",
            "specs": []
        },
        {
            "name": "numpy",
            "specs": []
        },
        {
            "name": "pandas",
            "specs": []
        }
    ],
    "lcname": "biofile"
}
        
Elapsed time: 0.20556s