Name | aind-data-schema JSON |
Version |
1.3.0
JSON |
| download |
home_page | None |
Summary | A library that defines AIND data schema and validates JSON files. |
upload_time | 2025-01-07 00:49:29 |
maintainer | None |
docs_url | None |
author | None |
requires_python | >=3.8 |
license | MIT |
keywords |
|
VCS |
|
bugtrack_url |
|
requirements |
No requirements were recorded.
|
Travis-CI |
No Travis.
|
coveralls test coverage |
No coveralls.
|
# aind-data-schema
[](LICENSE)

[](https://aind-data-schema.readthedocs.io/en/latest/?badge=latest)
A library that defines [AIND](https://alleninstitute.org/what-we-do/brain-science/research/allen-institute-neural-dynamics/) data schema and validates JSON files.
User documentation available on [readthedocs](https://aind-data-schema.readthedocs.io/en/latest/).
## Overview
This repository contains the schemas needed to ingest and validate metadata that are essential to ensuring [AIND](https://alleninstitute.org/what-we-do/brain-science/research/allen-institute-neural-dynamics/) data collection is completely reproducible. Our general approach is to semantically version core schema classes and include those version numbers in serialized metadata so that we can flexibly evolve the schemas over time without requiring difficult data migrations. In the future, we will provide a browsable list of these classes rendered to [JSON Schema](https://json-schema.org/), including all historic versions.
Be aware that this package is still under heavy preliminary development. Expect breaking changes regularly, although we will communicate these through semantic versioning.
A simple example:
```python
import datetime
from aind_data_schema.core.subject import BreedingInfo, Housing, Subject
from aind_data_schema_models.organizations import Organization
from aind_data_schema_models.species import Species
t = datetime.datetime(2022, 11, 22, 8, 43, 00)
s = Subject(
species=Species.MUS_MUSCULUS,
subject_id="12345",
sex="Male",
date_of_birth=t.date(),
genotype="Emx1-IRES-Cre;Camk2a-tTA;Ai93(TITL-GCaMP6f)",
housing=Housing(home_cage_enrichment=["Running wheel"], cage_id="123"),
background_strain="C57BL/6J",
source=Organization.AI,
breeding_info=BreedingInfo(
breeding_group="Emx1-IRES-Cre(ND)",
maternal_id="546543",
maternal_genotype="Emx1-IRES-Cre/wt; Camk2a-tTa/Camk2a-tTA",
paternal_id="232323",
paternal_genotype="Ai93(TITL-GCaMP6f)/wt",
),
)
s.write_standard_file() # writes subject.json
```
```json
{
"describedBy": "https://raw.githubusercontent.com/AllenNeuralDynamics/aind-data-schema/main/src/aind_data_schema/core/subject.py",
"schema_version": "0.5.6",
"subject_id": "12345",
"sex": "Male",
"date_of_birth": "2022-11-22",
"genotype": "Emx1-IRES-Cre;Camk2a-tTA;Ai93(TITL-GCaMP6f)",
"species": {
"name": "Mus musculus",
"abbreviation": null,
"registry": {
"name": "National Center for Biotechnology Information",
"abbreviation": "NCBI"
},
"registry_identifier": "10090"
},
"alleles": [],
"background_strain": "C57BL/6J",
"breeding_info": {
"breeding_group": "Emx1-IRES-Cre(ND)",
"maternal_id": "546543",
"maternal_genotype": "Emx1-IRES-Cre/wt; Camk2a-tTa/Camk2a-tTA",
"paternal_id": "232323",
"paternal_genotype": "Ai93(TITL-GCaMP6f)/wt"
},
"source": {
"name": "Allen Institute",
"abbreviation": "AI",
"registry": {
"name": "Research Organization Registry",
"abbreviation": "ROR"
},
"registry_identifier": "03cpe7c52"
},
"rrid": null,
"restrictions": null,
"wellness_reports": [],
"housing": {
"cage_id": "123",
"room_id": null,
"light_cycle": null,
"home_cage_enrichment": [
"Running wheel"
],
"cohoused_subjects": []
},
"notes": null
}
```
## Installing and Upgrading
To install the latest version:
```
pip install aind-data-schema
```
Every merge to the `main` branch is automatically tagged with a new major/minor/patch version and uploaded to PyPI. To upgrade to the latest version:
```
pip install aind-data-schema --upgrade
```
## Issues and Discussions
If you've found a bug in the schemas or would like to make a minor change, open an [issue](https://github.com/AllenNeuralDynamics/aind-data-schema/issues) and please use the provided [templates](https://github.com/AllenNeuralDynamics/aind-metadata-mapper/issues/new/choose).
If you'd like to propose a large change or addition, or generally have a question about how things work, head start a new [Discussion](https://github.com/AllenNeuralDynamics/aind-data-schema/discussions)!
## Controlled Vocabularies
Controlled vocabularies and other enumerated lists are maintained in a separate repository: [aind-data-schema-models](https://github.com/AllenNeuralDynamics/aind-data-schema-models). This allows us to specify these lists without changing aind-data-schema. Controlled vocabularies include lists of organizations, manufacturers, species, modalities, platforms, units, harp devices, and registries.
To upgrade to the latest data models version:
```
pip install aind-data-schema-models --upgrade
```
## Contributing
Contributions are more than welcome for this project! If you'd like to develop the code, please follow the standards outlined in the [contribution guide](https://github.com/AllenNeuralDynamics/aind-data-schema/blob/main/CONTRIBUTING.md).
### Documentation
To generate the rst files source files for documentation, run:
```
sphinx-apidoc -o docs/source/ src
```
Then to create the documentation html files, run:
```
sphinx-build -b html docs/source/ docs/build/html
```
More info on sphinx installation can be found here: https://www.sphinx-doc.org/en/master/usage/installation.html
Raw data
{
"_id": null,
"home_page": null,
"name": "aind-data-schema",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.8",
"maintainer_email": null,
"keywords": null,
"author": null,
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/58/84/51128de3d7d4a26e00c233d02d898124c0fdb86da085d48d8e821e2f6a00/aind_data_schema-1.3.0.tar.gz",
"platform": null,
"description": "# aind-data-schema\n\n[](LICENSE)\n\n[](https://aind-data-schema.readthedocs.io/en/latest/?badge=latest)\n\nA library that defines [AIND](https://alleninstitute.org/what-we-do/brain-science/research/allen-institute-neural-dynamics/) data schema and validates JSON files. \n\nUser documentation available on [readthedocs](https://aind-data-schema.readthedocs.io/en/latest/).\n\n## Overview\n\nThis repository contains the schemas needed to ingest and validate metadata that are essential to ensuring [AIND](https://alleninstitute.org/what-we-do/brain-science/research/allen-institute-neural-dynamics/) data collection is completely reproducible. Our general approach is to semantically version core schema classes and include those version numbers in serialized metadata so that we can flexibly evolve the schemas over time without requiring difficult data migrations. In the future, we will provide a browsable list of these classes rendered to [JSON Schema](https://json-schema.org/), including all historic versions.\n\nBe aware that this package is still under heavy preliminary development. Expect breaking changes regularly, although we will communicate these through semantic versioning.\n\nA simple example:\n\n```python\nimport datetime\n\nfrom aind_data_schema.core.subject import BreedingInfo, Housing, Subject\nfrom aind_data_schema_models.organizations import Organization\nfrom aind_data_schema_models.species import Species\n\nt = datetime.datetime(2022, 11, 22, 8, 43, 00)\n\ns = Subject(\n species=Species.MUS_MUSCULUS,\n subject_id=\"12345\",\n sex=\"Male\",\n date_of_birth=t.date(),\n genotype=\"Emx1-IRES-Cre;Camk2a-tTA;Ai93(TITL-GCaMP6f)\",\n housing=Housing(home_cage_enrichment=[\"Running wheel\"], cage_id=\"123\"),\n background_strain=\"C57BL/6J\",\n source=Organization.AI,\n breeding_info=BreedingInfo(\n breeding_group=\"Emx1-IRES-Cre(ND)\",\n maternal_id=\"546543\",\n maternal_genotype=\"Emx1-IRES-Cre/wt; Camk2a-tTa/Camk2a-tTA\",\n paternal_id=\"232323\",\n paternal_genotype=\"Ai93(TITL-GCaMP6f)/wt\",\n ),\n)\n\ns.write_standard_file() # writes subject.json\n```\n\n```json\n{\n \"describedBy\": \"https://raw.githubusercontent.com/AllenNeuralDynamics/aind-data-schema/main/src/aind_data_schema/core/subject.py\",\n \"schema_version\": \"0.5.6\",\n \"subject_id\": \"12345\",\n \"sex\": \"Male\",\n \"date_of_birth\": \"2022-11-22\",\n \"genotype\": \"Emx1-IRES-Cre;Camk2a-tTA;Ai93(TITL-GCaMP6f)\",\n \"species\": {\n \"name\": \"Mus musculus\",\n \"abbreviation\": null,\n \"registry\": {\n \"name\": \"National Center for Biotechnology Information\",\n \"abbreviation\": \"NCBI\"\n },\n \"registry_identifier\": \"10090\"\n },\n \"alleles\": [],\n \"background_strain\": \"C57BL/6J\",\n \"breeding_info\": {\n \"breeding_group\": \"Emx1-IRES-Cre(ND)\",\n \"maternal_id\": \"546543\",\n \"maternal_genotype\": \"Emx1-IRES-Cre/wt; Camk2a-tTa/Camk2a-tTA\",\n \"paternal_id\": \"232323\",\n \"paternal_genotype\": \"Ai93(TITL-GCaMP6f)/wt\"\n },\n \"source\": {\n \"name\": \"Allen Institute\",\n \"abbreviation\": \"AI\",\n \"registry\": {\n \"name\": \"Research Organization Registry\",\n \"abbreviation\": \"ROR\"\n },\n \"registry_identifier\": \"03cpe7c52\"\n },\n \"rrid\": null,\n \"restrictions\": null,\n \"wellness_reports\": [],\n \"housing\": {\n \"cage_id\": \"123\",\n \"room_id\": null,\n \"light_cycle\": null,\n \"home_cage_enrichment\": [\n \"Running wheel\"\n ],\n \"cohoused_subjects\": []\n },\n \"notes\": null\n}\n```\n\n## Installing and Upgrading\n\nTo install the latest version:\n```\npip install aind-data-schema\n```\n\nEvery merge to the `main` branch is automatically tagged with a new major/minor/patch version and uploaded to PyPI. To upgrade to the latest version:\n```\npip install aind-data-schema --upgrade\n```\n\n## Issues and Discussions\nIf you've found a bug in the schemas or would like to make a minor change, open an [issue](https://github.com/AllenNeuralDynamics/aind-data-schema/issues) and please use the provided [templates](https://github.com/AllenNeuralDynamics/aind-metadata-mapper/issues/new/choose).\nIf you'd like to propose a large change or addition, or generally have a question about how things work, head start a new [Discussion](https://github.com/AllenNeuralDynamics/aind-data-schema/discussions)!\n\n## Controlled Vocabularies\n\nControlled vocabularies and other enumerated lists are maintained in a separate repository: [aind-data-schema-models](https://github.com/AllenNeuralDynamics/aind-data-schema-models). This allows us to specify these lists without changing aind-data-schema. Controlled vocabularies include lists of organizations, manufacturers, species, modalities, platforms, units, harp devices, and registries.\n\nTo upgrade to the latest data models version:\n```\npip install aind-data-schema-models --upgrade\n```\n\n## Contributing\nContributions are more than welcome for this project! If you'd like to develop the code, please follow the standards outlined in the [contribution guide](https://github.com/AllenNeuralDynamics/aind-data-schema/blob/main/CONTRIBUTING.md).\n\n\n### Documentation\n\nTo generate the rst files source files for documentation, run:\n\n```\nsphinx-apidoc -o docs/source/ src\n```\n\nThen to create the documentation html files, run:\n```\nsphinx-build -b html docs/source/ docs/build/html\n```\n\nMore info on sphinx installation can be found here: https://www.sphinx-doc.org/en/master/usage/installation.html\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "A library that defines AIND data schema and validates JSON files.",
"version": "1.3.0",
"project_urls": null,
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "a23b75700474073a8d15dc968b59c539253788dd8193fa8dbc9ab5cfeeb4545f",
"md5": "0125ea2d3c888f6933b9c2a408a90dd3",
"sha256": "ef6c999c15d87bf4bbd783c8e71d39ef960d9c41f54520f9b2d2624adc0a5eb0"
},
"downloads": -1,
"filename": "aind_data_schema-1.3.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "0125ea2d3c888f6933b9c2a408a90dd3",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.8",
"size": 64624,
"upload_time": "2025-01-07T00:49:27",
"upload_time_iso_8601": "2025-01-07T00:49:27.088778Z",
"url": "https://files.pythonhosted.org/packages/a2/3b/75700474073a8d15dc968b59c539253788dd8193fa8dbc9ab5cfeeb4545f/aind_data_schema-1.3.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "588451128de3d7d4a26e00c233d02d898124c0fdb86da085d48d8e821e2f6a00",
"md5": "29f51bf80a7ddc8048f30aa0cbbfae27",
"sha256": "554590ff4ea2fc3b1c146cec5bb068e134575d7acfcb8cc34c3fefdced2c4e28"
},
"downloads": -1,
"filename": "aind_data_schema-1.3.0.tar.gz",
"has_sig": false,
"md5_digest": "29f51bf80a7ddc8048f30aa0cbbfae27",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8",
"size": 857427,
"upload_time": "2025-01-07T00:49:29",
"upload_time_iso_8601": "2025-01-07T00:49:29.013674Z",
"url": "https://files.pythonhosted.org/packages/58/84/51128de3d7d4a26e00c233d02d898124c0fdb86da085d48d8e821e2f6a00/aind_data_schema-1.3.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-01-07 00:49:29",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "aind-data-schema"
}