bioomics

Name	bioomics
Version	0.2.9
home_page	https://github.com/Tiezhengyuan/bio_omics
Summary	Download, retrieve and process omics data for further bioinformatics
upload_time	2024-11-13 21:57:06
maintainer	None
docs_url	None
author	Tiezheng Yuan
requires_python	None
license	None
keywords	pypi cicd python
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

# bio_omics
Download, retrieve, and process omics data and other bioinformatics data from public databases.

Comprehensive Databases
- NCBI: genome database
- UniProt: protein database, https://www.expasy.org/resources/uniprotkb-swiss-prot

Specific Databases
- miRBase: microRNA database, https://www.mirbase.org/
- RNACentral: non-coding RNA sequence database, https://rnacentral.org/
- IEDB: immune epitope database, https://www.iedb.org/


See the help documentation and example code at https://www.fbridges.com/pipeline/bio_omics.

https://www.iedb.org/downloader.php?file_name=doc/epitope_full_v3.zip


## data model
ETL data processing consists of several steps: download, retrieval, organization, combination, integration, enrichment, and formatting. This package focuses on the download, retrieval, and combination of omics data.
The data model should be kept consistent. Data are organized by entity, such as a protein or an antigen. An example record is shown below. The "key" field holds the unique identifier of the entity, and "ID" is created automatically. Data retrieved from each source are stored as one key-value pair.
Note:
- Redundant data may occur and are allowed.
- Each key-value pair is defined by its corresponding source database.
- The record is intended for integration rather than enrichment, so combining or aggregating data across sources is not recommended.
- Data from different sources may disagree or be invalid. They are validated in a later step, not in this one.
```
{
    "ID": "79541",
    "key": "H0YED9",
    "UniProt_SwissProt": {
        ....
    },
    "NCBI": {
        ....
    },
    "PDB": {
        ....
    },
    ....    
}
```
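
The record layout above can be illustrated with a minimal Python sketch. It is not the bioomics API: the helper names `new_entity` and `push_source`, the counter-based "ID", and the placeholder payloads are all assumptions made for illustration. It only shows the idea that each source contributes one key-value pair and that sources are stored side by side without merging.

```python
from itertools import count

# Hypothetical helpers for illustration only; not part of the bioomics API.
_id_counter = count(1)  # assumption: "ID" is a simple running number


def new_entity(key: str) -> dict:
    """Create a record with an auto-generated ID and the entity's unique key."""
    return {"ID": str(next(_id_counter)), "key": key}


def push_source(entity: dict, source: str, data: dict) -> dict:
    """Store data retrieved from one source under that source's name.

    The payload is kept as delivered: no merging, aggregation, or validation,
    since those belong to later steps.
    """
    if source in ("ID", "key"):
        raise ValueError(f"reserved field name: {source}")
    entity[source] = data
    return entity


protein = new_entity("H0YED9")
# Placeholder payloads; real content is defined by each source database.
push_source(protein, "UniProt_SwissProt", {"placeholder": "UniProt payload"})
push_source(protein, "NCBI", {"placeholder": "NCBI payload"})
```

Keeping every source under its own key means redundant or conflicting values can coexist in the record until a later validation step decides how to handle them.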

Raw data

{
    "_id": null,
    "home_page": "https://github.com/Tiezhengyuan/bio_omics",
    "name": "bioomics",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "pypi, cicd, python",
    "author": "Tiezheng Yuan",
    "author_email": "tiezhengyuan@hotmail.com",
    "download_url": "https://files.pythonhosted.org/packages/e5/75/8f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf/bioomics-0.2.9.tar.gz",
    "platform": null,
    "description": "\\n# bio_omics\nDownload, retrieve and process omics data, or biological informatics data from public database\n\nComprehensive Databases\n- NCBI: genome database, \n- UniProt: protein database, https://www.expasy.org/resources/uniprotkb-swiss-prot\n\nSepecific Databases\n- miRBase: mircoRNA database, https://www.mirbase.org/\n- RNACentral: non-coding RNA squence database, https://rnacentral.org/\n- IEDB: immune epitope database, https://www.iedb.org/\n\n\nSee the help documents of example coding at https://www.fbridges.com/pipeline/bio_omics.\n\nhttps://www.iedb.org/downloader.php?file_name=doc/epitope_full_v3.zip\n\n\n## data model\nETL data processing is composed of some steps including downloads, retrieval, organization, combination, integration, enrichment, formation. This packages focus on downloads, retrieval, and combination of omics data.\nIt is suggested that the data model would be consistent. Data are organized by entity namely protein, or antigen. An example of data is showed as the below. Here the pair 'key' defines unique identifier of this entity. \"ID\" is automatically created. Retrieved data are pushed as one key-value. \nNote:\n- Abundant data are possible and to be allowed.\n- The key-value is defined by this corresponding database source.\n- Used for Integration rather than enrichment. Therefore, data combination or aggregation is not recommended.\n- Data from various source could be different or invalid. Those would be validated in the afterwards step rather than this step.\n```\n{\n    \"ID\": \"79541\",\n    \"key\": \"H0YED9\",\n    \"UniProt_SwissProt\": {\n        ....\n    },\n    \"NCBI\": {\n        ....\n    },\n    \"PDB\": {\n        ....\n    },\n    ....    \n}\n```\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Download, retrieve and process omics data for further bioinformatics",
    "version": "0.2.9",
    "project_urls": {
        "Homepage": "https://github.com/Tiezhengyuan/bio_omics"
    },
    "split_keywords": [
        "pypi",
        " cicd",
        " python"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "29d8bab7009df5b5b08242683307b2855568e366aa500c4b0212f9762b32a29b",
                "md5": "d86bdfbbcc1d796742933203a4b9bf26",
                "sha256": "333c09172d756d630a575fe8577668519a7152e641e6b4e1ab69fbcbab7d79cc"
            },
            "downloads": -1,
            "filename": "bioomics-0.2.9-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d86bdfbbcc1d796742933203a4b9bf26",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 35885,
            "upload_time": "2024-11-13T21:57:04",
            "upload_time_iso_8601": "2024-11-13T21:57:04.424208Z",
            "url": "https://files.pythonhosted.org/packages/29/d8/bab7009df5b5b08242683307b2855568e366aa500c4b0212f9762b32a29b/bioomics-0.2.9-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e5758f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf",
                "md5": "979003f6b5d9121923fad82ea061db12",
                "sha256": "55b55d955738d46b1c7aa44764a45c0422d629c325236896d6ae31aff86706ca"
            },
            "downloads": -1,
            "filename": "bioomics-0.2.9.tar.gz",
            "has_sig": false,
            "md5_digest": "979003f6b5d9121923fad82ea061db12",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 31341,
            "upload_time": "2024-11-13T21:57:06",
            "upload_time_iso_8601": "2024-11-13T21:57:06.278043Z",
            "url": "https://files.pythonhosted.org/packages/e5/75/8f8a352a09fe12ae65aedec4d8129b569302884fa420f326f1c3e3bb8caf/bioomics-0.2.9.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-11-13 21:57:06",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "Tiezhengyuan",
    "github_project": "bio_omics",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "bioomics"
}
