# Protmapper
The Protmapper maps references to protein sites to the human reference
sequence based on UniProt, PhosphoSitePlus, and manual curation.
## Installation
### Python package
The Protmapper is a Python package that is available on PyPI and can be
installed as:
```
pip install protmapper
```
### Docker container
Alternatively, the Protmapper Docker container can be run to expose it as
a REST API as:
```
docker run -d -p 8008:8008 labsyspharm/protmapper:latest
```
## Command line interface
In addition to supporting usage via a Python API and a REST service,
Protmapper also provides a command line interface that can be used as follows.
```bash
Run Protmapper on a list of proteins with residues and sites provided in a
text file.
positional arguments:
input Path to an input file. The input file is a text file
in which each row consists of four comma separated
values, with the first element being a protein ID, the
second, the namespace in which that ID is valid
(uniprot or hgnc),third, an amino acid represented as
a single capital letter, and fourth, a site position
on the protein.
output Path to the output file to be generated. Each line of
the output file corresponds to a line in the input
file. Each linerepresents a mapped site produced by
Protmapper.
optional arguments:
-h, --help show this help message and exit
--peptide If given, the third element of each row of the input
file is a peptide (amino acid sequence) rather than a
single amino acid residue. In this case, peptide-
oriented mappings are applied. In this mode the
following boolean arguments are ignored.
--no_methionine_offset
If given, will not check for off-by-one errors in site
position (possibly) attributable to site numbering
from mature proteins after cleavage of the initial
methionine.
--no_orthology_mapping
If given, will not check sequence positions for known
modification sites in mouse or rat sequences (based on
PhosphoSitePlus data).
--no_isoform_mapping If given, will not check sequence positions for known
modifications in other human isoforms of the protein
(based on PhosphoSitePlus data).
```
## Documentation
For a detailed documentation of the Protmapper, visit http://protmapper.readthedocs.io
## Funding
The development of protmapper is funded under the DARPA Automated Scientific Discovery Framework project (ARO grant W911NF018-1-0124).
## Citation
```bibtex
@article{bachman2019protmapper,
author = {Bachman, John A and Gyori, Benjamin M and Sorger, Peter K},
doi = {10.1101/822668},
journal = {bioRxiv},
publisher = {Cold Spring Harbor Laboratory},
title = {{Assembling a phosphoproteomic knowledge base using ProtMapper to normalize phosphosite information from databases and text mining}},
url = {https://www.biorxiv.org/content/early/2019/11/06/822668.1},
year = {2019}
}
```
Raw data
{
"_id": null,
"home_page": "https://github.com/indralab/protmapper",
"name": "protmapper",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "protein,proteomics,sequence,alignment,assembly,post-translational,modification",
"author": "John A. Bachman",
"author_email": "john_bachman@hms.harvard.edu",
"download_url": "https://files.pythonhosted.org/packages/1d/b8/767041ed4ba4fa07ab812b4641ce4a88aa36b5ed3a4ae150089a3e2c7619/protmapper-0.0.29.tar.gz",
"platform": null,
"description": "# Protmapper\nThe Protmapper maps references to protein sites to the human reference\nsequence based on UniProt, PhosphoSitePlus, and manual curation.\n\n\n## Installation\n\n### Python package\nThe Protmapper is a Python package that is available on PyPI and can be\ninstalled as:\n\n```\npip install protmapper\n```\n\n### Docker container\nAlternatively, the Protmapper Docker container can be run to expose it as\na REST API as:\n\n```\ndocker run -d -p 8008:8008 labsyspharm/protmapper:latest\n```\n\n## Command line interface\nIn addition to supporting usage via a Python API and a REST service,\nProtmapper also provides a command line interface that can be used as follows.\n\n```bash\nRun Protmapper on a list of proteins with residues and sites provided in a\ntext file.\n\npositional arguments:\n input Path to an input file. The input file is a text file\n in which each row consists of four comma separated\n values, with the first element being a protein ID, the\n second, the namespace in which that ID is valid\n (uniprot or hgnc),third, an amino acid represented as\n a single capital letter, and fourth, a site position\n on the protein.\n output Path to the output file to be generated. Each line of\n the output file corresponds to a line in the input\n file. Each linerepresents a mapped site produced by\n Protmapper.\n\noptional arguments:\n -h, --help show this help message and exit\n --peptide If given, the third element of each row of the input\n file is a peptide (amino acid sequence) rather than a\n single amino acid residue. In this case, peptide-\n oriented mappings are applied. In this mode the\n following boolean arguments are ignored.\n --no_methionine_offset\n If given, will not check for off-by-one errors in site\n position (possibly) attributable to site numbering\n from mature proteins after cleavage of the initial\n methionine.\n --no_orthology_mapping\n If given, will not check sequence positions for known\n modification sites in mouse or rat sequences (based on\n PhosphoSitePlus data).\n --no_isoform_mapping If given, will not check sequence positions for known\n modifications in other human isoforms of the protein\n (based on PhosphoSitePlus data).\n\n```\n\n## Documentation\nFor a detailed documentation of the Protmapper, visit http://protmapper.readthedocs.io\n\n## Funding\nThe development of protmapper is funded under the DARPA Automated Scientific Discovery Framework project (ARO grant W911NF018-1-0124).\n\n## Citation\n\n```bibtex\n@article{bachman2019protmapper,\n author = {Bachman, John A and Gyori, Benjamin M and Sorger, Peter K},\n doi = {10.1101/822668},\n journal = {bioRxiv},\n publisher = {Cold Spring Harbor Laboratory},\n title = {{Assembling a phosphoproteomic knowledge base using ProtMapper to normalize phosphosite information from databases and text mining}},\n url = {https://www.biorxiv.org/content/early/2019/11/06/822668.1},\n year = {2019}\n}\n```\n",
"bugtrack_url": null,
"license": "",
"summary": "Map protein sites to human reference sequence.",
"version": "0.0.29",
"project_urls": {
"Homepage": "https://github.com/indralab/protmapper"
},
"split_keywords": [
"protein",
"proteomics",
"sequence",
"alignment",
"assembly",
"post-translational",
"modification"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "3cbccdf85d1a6e4fc5287eb4e492e50f1c8a3a7a5a9b6660e364f894f64d1b0b",
"md5": "8af5abd8a04ac144455cbc213d8ad7ac",
"sha256": "411597039c0ea07e2c43973409ca4c5cb3da16f5eea48e1b9e6c54b0f00a9552"
},
"downloads": -1,
"filename": "protmapper-0.0.29-py3-none-any.whl",
"has_sig": false,
"md5_digest": "8af5abd8a04ac144455cbc213d8ad7ac",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 35892,
"upload_time": "2024-01-19T04:49:12",
"upload_time_iso_8601": "2024-01-19T04:49:12.651282Z",
"url": "https://files.pythonhosted.org/packages/3c/bc/cdf85d1a6e4fc5287eb4e492e50f1c8a3a7a5a9b6660e364f894f64d1b0b/protmapper-0.0.29-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "1db8767041ed4ba4fa07ab812b4641ce4a88aa36b5ed3a4ae150089a3e2c7619",
"md5": "6556c798765a1494f775a910cf31c947",
"sha256": "d0d10198fb000401d03ba6c271869970caf8e1c8b46db686d2eed153fa8c2257"
},
"downloads": -1,
"filename": "protmapper-0.0.29.tar.gz",
"has_sig": false,
"md5_digest": "6556c798765a1494f775a910cf31c947",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 34397,
"upload_time": "2024-01-19T04:49:14",
"upload_time_iso_8601": "2024-01-19T04:49:14.471464Z",
"url": "https://files.pythonhosted.org/packages/1d/b8/767041ed4ba4fa07ab812b4641ce4a88aa36b5ed3a4ae150089a3e2c7619/protmapper-0.0.29.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-01-19 04:49:14",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "indralab",
"github_project": "protmapper",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"lcname": "protmapper"
}