bioomics


Namebioomics JSON
Version 0.2.9 PyPI version JSON
download
home_pagehttps://github.com/Tiezhengyuan/bio_omics
SummaryDownload, retrieve and process omics data for further bioinformatics
upload_time2024-11-13 21:57:06
maintainerNone
docs_urlNone
authorTiezheng Yuan
requires_pythonNone
licenseNone
keywords pypi cicd python
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            \n# bio_omics
Download, retrieve and process omics data, or biological informatics data from public database

Comprehensive Databases
- NCBI: genome database, 
- UniProt: protein database, https://www.expasy.org/resources/uniprotkb-swiss-prot

Sepecific Databases
- miRBase: mircoRNA database, https://www.mirbase.org/
- RNACentral: non-coding RNA squence database, https://rnacentral.org/
- IEDB: immune epitope database, https://www.iedb.org/


See the help documents of example coding at https://www.fbridges.com/pipeline/bio_omics.

https://www.iedb.org/downloader.php?file_name=doc/epitope_full_v3.zip


## data model
ETL data processing is composed of some steps including downloads, retrieval, organization, combination, integration, enrichment, formation. This packages focus on downloads, retrieval, and combination of omics data.
It is suggested that the data model would be consistent. Data are organized by entity namely protein, or antigen. An example of data is showed as the below. Here the pair 'key' defines unique identifier of this entity. "ID" is automatically created. Retrieved data are pushed as one key-value. 
Note:
- Abundant data are possible and to be allowed.
- The key-value is defined by this corresponding database source.
- Used for Integration rather than enrichment. Therefore, data combination or aggregation is not recommended.
- Data from various source could be different or invalid. Those would be validated in the afterwards step rather than this step.
```
{
    "ID": "79541",
    "key": "H0YED9",
    "UniProt_SwissProt": {
        ....
    },
    "NCBI": {
        ....
    },
    "PDB": {
        ....
    },
    ....    
}
```

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/Tiezhengyuan/bio_omics",
    "name": "bioomics",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "pypi, cicd, python",
    "author": "Tiezheng Yuan",
    "author_email": "tiezhengyuan@hotmail.com",
    "download_url": "https://files.pythonhosted.org/packages/e5/75/8f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf/bioomics-0.2.9.tar.gz",
    "platform": null,
    "description": "\\n# bio_omics\nDownload, retrieve and process omics data, or biological informatics data from public database\n\nComprehensive Databases\n- NCBI: genome database, \n- UniProt: protein database, https://www.expasy.org/resources/uniprotkb-swiss-prot\n\nSepecific Databases\n- miRBase: mircoRNA database, https://www.mirbase.org/\n- RNACentral: non-coding RNA squence database, https://rnacentral.org/\n- IEDB: immune epitope database, https://www.iedb.org/\n\n\nSee the help documents of example coding at https://www.fbridges.com/pipeline/bio_omics.\n\nhttps://www.iedb.org/downloader.php?file_name=doc/epitope_full_v3.zip\n\n\n## data model\nETL data processing is composed of some steps including downloads, retrieval, organization, combination, integration, enrichment, formation. This packages focus on downloads, retrieval, and combination of omics data.\nIt is suggested that the data model would be consistent. Data are organized by entity namely protein, or antigen. An example of data is showed as the below. Here the pair 'key' defines unique identifier of this entity. \"ID\" is automatically created. Retrieved data are pushed as one key-value. \nNote:\n- Abundant data are possible and to be allowed.\n- The key-value is defined by this corresponding database source.\n- Used for Integration rather than enrichment. Therefore, data combination or aggregation is not recommended.\n- Data from various source could be different or invalid. Those would be validated in the afterwards step rather than this step.\n```\n{\n    \"ID\": \"79541\",\n    \"key\": \"H0YED9\",\n    \"UniProt_SwissProt\": {\n        ....\n    },\n    \"NCBI\": {\n        ....\n    },\n    \"PDB\": {\n        ....\n    },\n    ....    \n}\n```\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Download, retrieve and process omics data for further bioinformatics",
    "version": "0.2.9",
    "project_urls": {
        "Homepage": "https://github.com/Tiezhengyuan/bio_omics"
    },
    "split_keywords": [
        "pypi",
        " cicd",
        " python"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "29d8bab7009df5b5b08242683307b2855568e366aa500c4b0212f9762b32a29b",
                "md5": "d86bdfbbcc1d796742933203a4b9bf26",
                "sha256": "333c09172d756d630a575fe8577668519a7152e641e6b4e1ab69fbcbab7d79cc"
            },
            "downloads": -1,
            "filename": "bioomics-0.2.9-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d86bdfbbcc1d796742933203a4b9bf26",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 35885,
            "upload_time": "2024-11-13T21:57:04",
            "upload_time_iso_8601": "2024-11-13T21:57:04.424208Z",
            "url": "https://files.pythonhosted.org/packages/29/d8/bab7009df5b5b08242683307b2855568e366aa500c4b0212f9762b32a29b/bioomics-0.2.9-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e5758f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf",
                "md5": "979003f6b5d9121923fad82ea061db12",
                "sha256": "55b55d955738d46b1c7aa44764a45c0422d629c325236896d6ae31aff86706ca"
            },
            "downloads": -1,
            "filename": "bioomics-0.2.9.tar.gz",
            "has_sig": false,
            "md5_digest": "979003f6b5d9121923fad82ea061db12",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 31341,
            "upload_time": "2024-11-13T21:57:06",
            "upload_time_iso_8601": "2024-11-13T21:57:06.278043Z",
            "url": "https://files.pythonhosted.org/packages/e5/75/8f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf/bioomics-0.2.9.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-11-13 21:57:06",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "Tiezhengyuan",
    "github_project": "bio_omics",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "bioomics"
}
        
Elapsed time: 0.58763s