broad-babel


Namebroad-babel JSON
Version 0.1.15 PyPI version JSON
download
home_pageNone
SummaryA translator of Broad and JUMP ids to more conventional names.
upload_time2024-05-03 20:44:58
maintainerNone
docs_urlNone
authorAlan Munoz
requires_python<4.0,>=3.9
licenseNone
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Broad_Babel

Minimal name translator of [JUMP](https://jump-cellpainting.broadinstitute.org/) consortium.

## Installation

```bash
pip install broad-babel
```

## Broad sample to standard 
You can fetch a single value
```python
from broad_sample.query import sample_to_standard

broad_to_standard("BRD-K18895904-001-16-1") 
# -> 'KVWDHTXUZHCGIO-UHFFFAOYSA-N'
```
If you provide multiple strings it will return dictionary.

```python
broad_to_standard(("BRD-K36461289-001-05-8", "ccsbBroad304_16164")) 
# {'BRD-K36461289-001-05-8': 'SCIMP', 'ccsbBroad304_16164': 'PIMZUZSSNYHVCU-KBLUICEQSA-N'}
```

## Export database as csv
```python
from broad_sample.query import export_csv

export_csv("./output.csv")
```

## Custom querying
The available fields are:
- perturbation: Dataset of origin for a given entry
- JCP2022: Identifier from the JUMP dataset
- standard_key: Gene Entrez id for gene-related perturbations, and InChIKey for compound perturbations
- broad_sample: Internal Broad ID
- pert_type: Type of perturbation, options are trt (treatment), control, negcon (Negative Control), poscon_cp (Positive Control, Compound Probe), poscon_diverse, poscon_orf, and poscon (Positive Control).
- NCBI_Gene_ID: NCBI identifier, only applicable to ORF and CRISPR

You can fetch any field using another (note that the output is a list of tuples)

```python
run_query(query="JCP2022_915119", input_column="JCP2022", output_column="broad_sample")
# [('ccsbBroad304_16164',)]
```

Note that there are some duplicates that arise from both between orf and crispr perturbations, but also within orf standard_keys.

```python
run_query("ccsbBroad304_00900", input_column = "broad_sample", output_column = "*")

# [('crispr', 'JCP2022_803621', 'KCNN1', 'ccsbBroad304_00900', 'trt', None),
#  ('orf', 'JCP2022_900842', 'KCNN1', 'ccsbBroad304_00900', 'trt', None),
#  ('Target1_orf', None, 'KCNN1', 'ccsbBroad304_00900', 'trt', None)]
```

It is also possible to use fuzzy querying by changing the operator argument and adding "%" to out key.

```python
    run_query(
        "BRD-K21728777%",
        input_column="broad_sample",
        output_column="*",
        operator="LIKE",
    )

# [('compound',
#   'JCP2022_037716',
#   'IVUGFMLRJOCGAS-UHFFFAOYSA-N',
#   'BRD-K21728777-001-02-3',
#   'control',
#   'poscon_cp'),
#  ('Target2_compound',
#   None,
#   'IVUGFMLRJOCGAS-UHFFFAOYSA-N',
#   'BRD-K21728777-001-02-3',
#   'control',
#   'poscon_cp')]
```

## Additional documentation
Metadata sources and additional documentation is available [here](./docs). 

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "broad-babel",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.9",
    "maintainer_email": null,
    "keywords": null,
    "author": "Alan Munoz",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/aa/15/0e6b50177f9754baf68dc95a7ae9abbc37ec2a7896c3282e9bfc01f3c64e/broad_babel-0.1.15.tar.gz",
    "platform": null,
    "description": "# Broad_Babel\n\nMinimal name translator of [JUMP](https://jump-cellpainting.broadinstitute.org/) consortium.\n\n## Installation\n\n```bash\npip install broad-babel\n```\n\n## Broad sample to standard \nYou can fetch a single value\n```python\nfrom broad_sample.query import sample_to_standard\n\nbroad_to_standard(\"BRD-K18895904-001-16-1\") \n# -> 'KVWDHTXUZHCGIO-UHFFFAOYSA-N'\n```\nIf you provide multiple strings it will return dictionary.\n\n```python\nbroad_to_standard((\"BRD-K36461289-001-05-8\", \"ccsbBroad304_16164\")) \n# {'BRD-K36461289-001-05-8': 'SCIMP', 'ccsbBroad304_16164': 'PIMZUZSSNYHVCU-KBLUICEQSA-N'}\n```\n\n## Export database as csv\n```python\nfrom broad_sample.query import export_csv\n\nexport_csv(\"./output.csv\")\n```\n\n## Custom querying\nThe available fields are:\n- perturbation: Dataset of origin for a given entry\n- JCP2022: Identifier from the JUMP dataset\n- standard_key: Gene Entrez id for gene-related perturbations, and InChIKey for compound perturbations\n- broad_sample: Internal Broad ID\n- pert_type: Type of perturbation, options are trt (treatment), control, negcon (Negative Control), poscon_cp (Positive Control, Compound Probe), poscon_diverse, poscon_orf, and poscon (Positive Control).\n- NCBI_Gene_ID: NCBI identifier, only applicable to ORF and CRISPR\n\nYou can fetch any field using another (note that the output is a list of tuples)\n\n```python\nrun_query(query=\"JCP2022_915119\", input_column=\"JCP2022\", output_column=\"broad_sample\")\n# [('ccsbBroad304_16164',)]\n```\n\nNote that there are some duplicates that arise from both between orf and crispr perturbations, but also within orf standard_keys.\n\n```python\nrun_query(\"ccsbBroad304_00900\", input_column = \"broad_sample\", output_column = \"*\")\n\n# [('crispr', 'JCP2022_803621', 'KCNN1', 'ccsbBroad304_00900', 'trt', None),\n#  ('orf', 'JCP2022_900842', 'KCNN1', 'ccsbBroad304_00900', 'trt', None),\n#  ('Target1_orf', None, 'KCNN1', 'ccsbBroad304_00900', 'trt', None)]\n```\n\nIt is also possible to use fuzzy querying by changing the operator argument and adding \"%\" to out key.\n\n```python\n    run_query(\n        \"BRD-K21728777%\",\n        input_column=\"broad_sample\",\n        output_column=\"*\",\n        operator=\"LIKE\",\n    )\n\n# [('compound',\n#   'JCP2022_037716',\n#   'IVUGFMLRJOCGAS-UHFFFAOYSA-N',\n#   'BRD-K21728777-001-02-3',\n#   'control',\n#   'poscon_cp'),\n#  ('Target2_compound',\n#   None,\n#   'IVUGFMLRJOCGAS-UHFFFAOYSA-N',\n#   'BRD-K21728777-001-02-3',\n#   'control',\n#   'poscon_cp')]\n```\n\n## Additional documentation\nMetadata sources and additional documentation is available [here](./docs). \n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A translator of Broad and JUMP ids to more conventional names.",
    "version": "0.1.15",
    "project_urls": null,
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e72e0557c39bd6f5186356cc00b5552385100aa4f7a4d4f3e3a48c7f3b0b2b60",
                "md5": "b632ddb6232fbea25ecf33e23db0b082",
                "sha256": "5cb0de0f399e71b25b2d35cfec3ede873a4d879f1176537eb30efb4a7624d18b"
            },
            "downloads": -1,
            "filename": "broad_babel-0.1.15-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "b632ddb6232fbea25ecf33e23db0b082",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.9",
            "size": 5058,
            "upload_time": "2024-05-03T20:44:57",
            "upload_time_iso_8601": "2024-05-03T20:44:57.158210Z",
            "url": "https://files.pythonhosted.org/packages/e7/2e/0557c39bd6f5186356cc00b5552385100aa4f7a4d4f3e3a48c7f3b0b2b60/broad_babel-0.1.15-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "aa150e6b50177f9754baf68dc95a7ae9abbc37ec2a7896c3282e9bfc01f3c64e",
                "md5": "7217ff2ebba5b63db14467ce3b8ca4e8",
                "sha256": "5293b066132eaf99f2e6c3ee9467506dfc4c733eaea158da373a81ef1c9054fd"
            },
            "downloads": -1,
            "filename": "broad_babel-0.1.15.tar.gz",
            "has_sig": false,
            "md5_digest": "7217ff2ebba5b63db14467ce3b8ca4e8",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.9",
            "size": 4473,
            "upload_time": "2024-05-03T20:44:58",
            "upload_time_iso_8601": "2024-05-03T20:44:58.880805Z",
            "url": "https://files.pythonhosted.org/packages/aa/15/0e6b50177f9754baf68dc95a7ae9abbc37ec2a7896c3282e9bfc01f3c64e/broad_babel-0.1.15.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-05-03 20:44:58",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "broad-babel"
}
        
Elapsed time: 0.24952s