Name | cumulus-library-umls JSON |
Version |
0.3.2
JSON |
| download |
home_page | None |
Summary | A Unified Medical Language System® Metathesaurus study for the Cumulus project |
upload_time | 2025-07-10 14:31:49 |
maintainer | None |
docs_url | None |
author | None |
requires_python | >=3.11 |
license | None |
keywords |
fhir
sql
health informatics
|
VCS |
 |
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# Cumulus Library UMLS
An installation of the Unified Medical Language System® Metathesaurus®. Part of the [SMART on FHIR Cumulus Project](https://smarthealthit.org/cumulus)
For more information, [browse the documentation](https://docs.smarthealthit.org/cumulus/library).
## Usage
In order to use the Metathesaurus, you'll need to get an API key for access from the National Library of Medicine, which you can sign up for [here](https://uts.nlm.nih.gov/uts/signup-login).
You can then install this module by running `pip install cumulus-library-umls`.
This will add a `umls` target to `cumulus-library`. You'll need to pass your
API key via the `--umls-key` CLI flag, or set the `UMLS_API_KEY` environment variable
to the key you received from NIH.
This ends up being a fairly intensive operation - we download a large file,
extract it, create parquet files from Athena, and then upload it. It usually
takes a half hour to run. We try to preserve some of those artifacts along
the way to make rebuilds faster. If you need to force recreation from scratch, the
`--force-upload` CLI flag will handle this.
Note: This study is explicitly namespaced in its own schema, `umls`. Make sure your
database is not using this schema for another use. Do not create tables inside this
schema by another means.
## Additional custom tables
The following tables are a derived from the primary tables, and are included here as a
convenience to avoid having to compute these on a repeated basis
- **mrrel_drug_is_a** a subset of the relationships in mrrel, including only those that define
that concept A is a member of concept B (i.e. is a child, or is explicitly marked as
being a tradename/member belonging to the parent concept), for drugs and drug-related topics.
- **mrconso_drugs** a subset of the entity list in mrconso, limited to vocabularies
specifically dealing with drug identifiers (i.e. SNOMED, RxNorm, etc.)
- **mrconso_icd10cm**/**mrrel__icd10cm** are slices of the respective main tables,
only containing records from the ICD10 coding system
- **icd10_(type)** are slices of a given coding system at the relevant level of the
ICD10 hierarchy (category,block,chapter,subcategory[1-3], extension)
- **icd10_tree** provides a relation-navigable code hierarchy of the individual levels
in the ICD10 hierarchy
- **icd10_hierarchy** provides a extracted tablular representation of the full ICD10
code system
## Licensing details
The `cumulus-library-umls` study is provided as a convenience to install the
UMLS Metathesaurus, but is not shipped with the Metathesaurus dataset. It will
require an API key to download the data from NIH directly.
As a reminder, the
[License Agreement for Use of the UMLS® Metathesaurus®](https://uts.nlm.nih.gov/uts/assets/LicenseAgreement.pdf)
provides several restrictions on this usage of this data (including distributing
the dataset). When you sign up for a UMLS key, you are assuming responsibility
for complying with these terms, or an alternate licensing agreement with the
owner of the Metathesaus data if you are provided with one.
## Citations
Bodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70. doi: 10.1093/nar/gkh061. PubMed PMID: 14681409; PubMed Central PMCID: PMC308795.
Raw data
{
"_id": null,
"home_page": null,
"name": "cumulus-library-umls",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.11",
"maintainer_email": null,
"keywords": "FHIR, SQL, Health Informatics",
"author": null,
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/19/2f/d95d46faa8bc611b1f611976784452cff28bcd2da1db44a2a184b2111c94/cumulus_library_umls-0.3.2.tar.gz",
"platform": null,
"description": "# Cumulus Library UMLS\n\nAn installation of the Unified Medical Language System\u00ae Metathesaurus\u00ae. Part of the [SMART on FHIR Cumulus Project](https://smarthealthit.org/cumulus)\n\nFor more information, [browse the documentation](https://docs.smarthealthit.org/cumulus/library).\n## Usage\n\nIn order to use the Metathesaurus, you'll need to get an API key for access from the National Library of Medicine, which you can sign up for [here](https://uts.nlm.nih.gov/uts/signup-login).\n\nYou can then install this module by running `pip install cumulus-library-umls`.\n\nThis will add a `umls` target to `cumulus-library`. You'll need to pass your\nAPI key via the `--umls-key` CLI flag, or set the `UMLS_API_KEY` environment variable\nto the key you received from NIH.\n\nThis ends up being a fairly intensive operation - we download a large file,\nextract it, create parquet files from Athena, and then upload it. It usually\ntakes a half hour to run. We try to preserve some of those artifacts along\nthe way to make rebuilds faster. If you need to force recreation from scratch, the\n`--force-upload` CLI flag will handle this.\n\nNote: This study is explicitly namespaced in its own schema, `umls`. Make sure your\ndatabase is not using this schema for another use. Do not create tables inside this\nschema by another means.\n\n## Additional custom tables\n\nThe following tables are a derived from the primary tables, and are included here as a \nconvenience to avoid having to compute these on a repeated basis\n\n- **mrrel_drug_is_a** a subset of the relationships in mrrel, including only those that define\nthat concept A is a member of concept B (i.e. is a child, or is explicitly marked as\nbeing a tradename/member belonging to the parent concept), for drugs and drug-related topics.\n- **mrconso_drugs** a subset of the entity list in mrconso, limited to vocabularies\nspecifically dealing with drug identifiers (i.e. SNOMED, RxNorm, etc.)\n- **mrconso_icd10cm**/**mrrel__icd10cm** are slices of the respective main tables,\nonly containing records from the ICD10 coding system\n- **icd10_(type)** are slices of a given coding system at the relevant level of the\nICD10 hierarchy (category,block,chapter,subcategory[1-3], extension)\n- **icd10_tree** provides a relation-navigable code hierarchy of the individual levels\nin the ICD10 hierarchy\n- **icd10_hierarchy** provides a extracted tablular representation of the full ICD10\ncode system\n\n## Licensing details\n\nThe `cumulus-library-umls` study is provided as a convenience to install the\nUMLS Metathesaurus, but is not shipped with the Metathesaurus dataset. It will\nrequire an API key to download the data from NIH directly.\n\nAs a reminder, the \n[License Agreement for Use of the UMLS\u00ae Metathesaurus\u00ae](https://uts.nlm.nih.gov/uts/assets/LicenseAgreement.pdf)\nprovides several restrictions on this usage of this data (including distributing\nthe dataset). When you sign up for a UMLS key, you are assuming responsibility\nfor complying with these terms, or an alternate licensing agreement with the\nowner of the Metathesaus data if you are provided with one.\n\n\n## Citations\n\nBodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70. doi: 10.1093/nar/gkh061. PubMed PMID: 14681409; PubMed Central PMCID: PMC308795.\n",
"bugtrack_url": null,
"license": null,
"summary": "A Unified Medical Language System\u00ae Metathesaurus study for the Cumulus project",
"version": "0.3.2",
"project_urls": {
"Documentation": "https://docs.smarthealthit.org/cumulus/",
"Home": "https://smarthealthit.org/cumulus",
"Source": "https://github.com/smart-on-fhir/cumulus-library-umls"
},
"split_keywords": [
"fhir",
" sql",
" health informatics"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "2597c1cd54bc27cf307be53c498fc291b97cb57495d9771a4d37d2bee6602897",
"md5": "e5107ddd4424f44c64aa2d104597c248",
"sha256": "9b750080b15a7926e9454bf610a3d956fbbcd6145faa03d0ccd72b3d8bf49f6c"
},
"downloads": -1,
"filename": "cumulus_library_umls-0.3.2-py3-none-any.whl",
"has_sig": false,
"md5_digest": "e5107ddd4424f44c64aa2d104597c248",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.11",
"size": 31954,
"upload_time": "2025-07-10T14:31:48",
"upload_time_iso_8601": "2025-07-10T14:31:48.498978Z",
"url": "https://files.pythonhosted.org/packages/25/97/c1cd54bc27cf307be53c498fc291b97cb57495d9771a4d37d2bee6602897/cumulus_library_umls-0.3.2-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "192fd95d46faa8bc611b1f611976784452cff28bcd2da1db44a2a184b2111c94",
"md5": "1ac63e42838b3762782ccad4451a8344",
"sha256": "549f2ef0dd395b34feebcdd8ac44d0dac8753d4f5fdde95f319abb3ede3fb43f"
},
"downloads": -1,
"filename": "cumulus_library_umls-0.3.2.tar.gz",
"has_sig": false,
"md5_digest": "1ac63e42838b3762782ccad4451a8344",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.11",
"size": 29923,
"upload_time": "2025-07-10T14:31:49",
"upload_time_iso_8601": "2025-07-10T14:31:49.607946Z",
"url": "https://files.pythonhosted.org/packages/19/2f/d95d46faa8bc611b1f611976784452cff28bcd2da1db44a2a184b2111c94/cumulus_library_umls-0.3.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-07-10 14:31:49",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "smart-on-fhir",
"github_project": "cumulus-library-umls",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "cumulus-library-umls"
}