\n# Bioinformatics Tool: bioFile
## Introduction
Retrieve data from various file formats used in RNA-Seq data analysis. The tool currently support:
- GTF file: genomic annotations
- GFF file: genomic annoations
quick installation
```
pip install biofile
```
## Development
```
git clone git@github.com:Tiezhengyuan/bio_file.git
cd bio_file
source venv/bin/activate
```
Run unit testing:
```
pytest tests/unittests
```
## Quick tour
### Process GFF:
Retrieve annotations by features from <gff_file>. Multiple json files would be stored in <out_dir>
```
from biofile import GFF
g = GFF(gff_file, out_dir)
g.split_by_features()
```
Given an attribute, retrieve annotations from <gff_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.
```
from biofile import GFF
g = GFF(gff_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')
```
### Process GTF:
Retrieve annotations by features from <gtf_file>. Multiple json files would be stored in <out_dir>
```
from biofile import GTF
g = GTF(gtf_file, out_dir)
g.split_by_features()
```
Given an attribute, retrieve annotations from <gtf_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.
```
from biofile import GTF
g = GTF(gtf_file, out_dir)
g.parse_attributes('transcript_id', 'mRNA')
```
Raw data
{
"_id": null,
"home_page": "https://github.com/Tiezhengyuan/bio_file",
"name": "biofile",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "pypi, cicd, python",
"author": "Tiezheng Yuan",
"author_email": "tiezhengyuan@hotmail.com",
"download_url": "https://files.pythonhosted.org/packages/c9/b6/fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5/biofile-0.1.0.tar.gz",
"platform": null,
"description": "\\n# Bioinformatics Tool: bioFile\n\n## Introduction\nRetrieve data from various file formats used in RNA-Seq data analysis. The tool currently support:\n- GTF file: genomic annotations\n- GFF file: genomic annoations\n\nquick installation\n```\npip install biofile\n```\n\n\n## Development\n\n```\ngit clone git@github.com:Tiezhengyuan/bio_file.git\ncd bio_file\nsource venv/bin/activate\n```\n\nRun unit testing:\n```\npytest tests/unittests\n```\n\n## Quick tour\n\n\n### Process GFF:\nRetrieve annotations by features from <gff_file>. Multiple json files would be stored in <out_dir>\n```\nfrom biofile import GFF\ng = GFF(gff_file, out_dir)\ng.split_by_features()\n```\n\nGiven an attribute, retrieve annotations from <gff_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.\n```\nfrom biofile import GFF\ng = GFF(gff_file, out_dir)\ng.parse_attributes('transcript_id', 'mRNA')\n```\n\n### Process GTF:\nRetrieve annotations by features from <gtf_file>. Multiple json files would be stored in <out_dir>\n```\nfrom biofile import GTF\ng = GTF(gtf_file, out_dir)\ng.split_by_features()\n```\n\nGiven an attribute, retrieve annotations from <gtf_file>. and save dataframe in <out_dir>. Here, search all mRNA according to transcript_id. All related annotations are included. The output is transcript_id_mRNA.txt.\n```\nfrom biofile import GTF\ng = GTF(gtf_file, out_dir)\ng.parse_attributes('transcript_id', 'mRNA')\n```\n\n\n\n",
"bugtrack_url": null,
"license": null,
"summary": "Process various file format for RNA-Seq data analysis",
"version": "0.1.0",
"project_urls": {
"Homepage": "https://github.com/Tiezhengyuan/bio_file"
},
"split_keywords": [
"pypi",
" cicd",
" python"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "d35c21620078b8fd47d66491faa3cb33b12a058bca7b9c3f9c387e3ccb4d5f62",
"md5": "8c8015ae2ff05f714f19445604412377",
"sha256": "857356795aa6d57ea374ded9c9f18abbb3a4996c88ff0e958cd3fb8a8dd1b031"
},
"downloads": -1,
"filename": "biofile-0.1.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "8c8015ae2ff05f714f19445604412377",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 16606,
"upload_time": "2024-04-01T22:23:41",
"upload_time_iso_8601": "2024-04-01T22:23:41.094320Z",
"url": "https://files.pythonhosted.org/packages/d3/5c/21620078b8fd47d66491faa3cb33b12a058bca7b9c3f9c387e3ccb4d5f62/biofile-0.1.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "c9b6fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5",
"md5": "df591be83314eced171e43c00cde736a",
"sha256": "a009d31b3c3d523e656e3d8d5ce49aee8ce0f9954e21dbc04730ecfff4a96b0c"
},
"downloads": -1,
"filename": "biofile-0.1.0.tar.gz",
"has_sig": false,
"md5_digest": "df591be83314eced171e43c00cde736a",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 12787,
"upload_time": "2024-04-01T22:23:42",
"upload_time_iso_8601": "2024-04-01T22:23:42.748160Z",
"url": "https://files.pythonhosted.org/packages/c9/b6/fc83ab385e97ca08f0ff4cd3aa3fcf682401fe620989a8f974dbc9a69ee5/biofile-0.1.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-04-01 22:23:42",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "Tiezhengyuan",
"github_project": "bio_file",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [
{
"name": "Bio",
"specs": []
},
{
"name": "biosequtils",
"specs": []
},
{
"name": "ddt",
"specs": []
},
{
"name": "numpy",
"specs": []
},
{
"name": "pandas",
"specs": []
}
],
"lcname": "biofile"
}