# PepSift
## Summary
***Identify peptides and their derivatives from small molecule datasets.***
## Installation
```commandline
pip install pepsift
```
## Usage
`PepSift` relies on multiple criteria defining different types od amino acids and polymers thereof.
There are currently 5 different levels available from most to least stringent:
| level | description | comment |
|--------------------------------------------|-------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `SiftLevel.NaturalLAminoAcids` | natural L-amino acids and peptides thereof | e.g. identify L-Alanine or the sequence `ACDEFGHIKLMNPQRSTVWY` <br/><img src="images/L-Ala.png" alt="L-Ala" width=200> |
| `SiftLevel.NaturalLDAminoAcids` | natural L- and D-amino acid and peptides thereof | e.g. identify L-Alanine or the sequences `D-A L-W`, `L-H D-Q`, `D-M D-K` <br/><img src="images/D-Ala.png" alt="D-Ala" width=200> |
| `SiftLevel.NaturalAminoAcidDerivatives` | derivatives of natural L- and D-amino acid and peptides thereof | i.e. any compound containing a canonical amino acid/peptide backbone <br/><img src="images/AA.png" alt="AA" width=200> |
| `SiftLevel.NonNaturalAminoAcidDerivatives` | non-natural amino acid derivatives and peptides thereof | e.g. identify beta-homo-alanine or alpha-methyl-Tyr <br/><img src="images/beta-homo-Ala.png" alt="BH-Ala" width=200> <img src="images/alpha-methyl-Tyr.png" alt="AM-Tyr" width=200> |
| `SiftLevel.AllAmineAndAcid` | compounds containing amine and carboxylic acid moieties | e.g. 3-[3-(2-Aminoethyl)cyclohexyl]propionic acid <br/><img src="images/5N1NAPHT.png" alt="5N1NAPHT" width=200> |
These levels allow for granular selection of different types amino acids/peptides.
<br/>
The decreasing stringency of `SiftLevel` criteria is exemplified below.
```python
from pepsift import PepSift, SiftLevel
from rdkit import Chem
ps1 = PepSift(SiftLevel.NaturalLAminoAcids)
ps2 = PepSift(SiftLevel.NaturalLDAminoAcids)
ps3 = PepSift(SiftLevel.NaturalAminoAcidDerivatives)
ps4 = PepSift(SiftLevel.NonNaturalAminoAcidDerivatives)
ps5 = PepSift(SiftLevel.AllAmineAndAcid)
mols = [Chem.MolFromSmiles('C[C@@H](C(=O)O)N'), # L-Ala
Chem.MolFromSmiles('C[C@H](C(=O)O)N'), # D-Ala
Chem.MolFromSmiles('C[C@@H](CN)C(=O)O'), # Beta-homo-Ala
Chem.MolFromSmiles('CC(C)(C(=O)O)N'), # Alpha-methyl-Ala
Chem.MolFromSmiles('NCCCCCCCCCCCCCCCC(=O)O'), # Amino-hexadecanoic acid
Chem.MolFromSmiles('c1ccccc1'), # Benzene
]
for mol in mols:
print((ps1.is_peptide(mol),
ps2.is_peptide(mol),
ps3.is_peptide(mol),
ps4.is_peptide(mol),
ps5.is_peptide(mol)
)
)
# L-Ala
# (True, True, True, True, True)
# D-Ala
# (False, True, True, True, True)
# Beta-homo-Ala
# (False, False, True, True, True)
# Alpha-methyl-Ala
# (False, False, False, True, True)
# Amino-hexadecanoic acid
# (False, False, False, False, True)
# Benzene
# (False, False, False, False, False)
```
:warning: Any peptide containing a natural amino acid is considered a derivative of natural amino acids (even if it also contains non natural amino acids)
Raw data
{
"_id": null,
"home_page": "https://github.com/OlivierBeq/pepsift",
"name": "pepsift",
"maintainer": "Olivier J. M. B\u00e9quignon",
"docs_url": null,
"requires_python": "",
"maintainer_email": "\"olivier.bequignon.maintainer@gmail.com\"",
"keywords": "peptides,cheminformatics,filtering",
"author": "Olivier J. M. B\u00e9quignon",
"author_email": "\"olivier.bequignon.maintainer@gmail.com\"",
"download_url": "https://files.pythonhosted.org/packages/2c/cb/b4a20257dc444b828112b1402ecd8299603a19f2ed05b8a8d82931ea0916/pepsift-0.0.2.post1.tar.gz",
"platform": null,
"description": "# PepSift\r\n\r\n\r\n## Summary\r\n\r\n***Identify peptides and their derivatives from small molecule datasets.***\r\n\r\n## Installation\r\n\r\n```commandline\r\npip install pepsift\r\n```\r\n\r\n## Usage\r\n\r\n`PepSift` relies on multiple criteria defining different types od amino acids and polymers thereof.\r\n\r\nThere are currently 5 different levels available from most to least stringent:\r\n\r\n| level | description | comment |\r\n|--------------------------------------------|-------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\r\n| `SiftLevel.NaturalLAminoAcids` | natural L-amino acids and peptides thereof | e.g. identify L-Alanine or the sequence `ACDEFGHIKLMNPQRSTVWY` <br/><img src=\"images/L-Ala.png\" alt=\"L-Ala\" width=200> |\r\n| `SiftLevel.NaturalLDAminoAcids` | natural L- and D-amino acid and peptides thereof | e.g. identify L-Alanine or the sequences `D-A L-W`, `L-H D-Q`, `D-M D-K` <br/><img src=\"images/D-Ala.png\" alt=\"D-Ala\" width=200> |\r\n| `SiftLevel.NaturalAminoAcidDerivatives` | derivatives of natural L- and D-amino acid and peptides thereof | i.e. any compound containing a canonical amino acid/peptide backbone <br/><img src=\"images/AA.png\" alt=\"AA\" width=200> |\r\n| `SiftLevel.NonNaturalAminoAcidDerivatives` | non-natural amino acid derivatives and peptides thereof | e.g. identify beta-homo-alanine or alpha-methyl-Tyr <br/><img src=\"images/beta-homo-Ala.png\" alt=\"BH-Ala\" width=200> <img src=\"images/alpha-methyl-Tyr.png\" alt=\"AM-Tyr\" width=200> |\r\n| `SiftLevel.AllAmineAndAcid` | compounds containing amine and carboxylic acid moieties | e.g. 3-[3-(2-Aminoethyl)cyclohexyl]propionic acid <br/><img src=\"images/5N1NAPHT.png\" alt=\"5N1NAPHT\" width=200> |\r\n\r\nThese levels allow for granular selection of different types amino acids/peptides.\r\n\r\n<br/>\r\nThe decreasing stringency of `SiftLevel` criteria is exemplified below.\r\n\r\n```python\r\nfrom pepsift import PepSift, SiftLevel\r\n\r\nfrom rdkit import Chem\r\n\r\nps1 = PepSift(SiftLevel.NaturalLAminoAcids)\r\nps2 = PepSift(SiftLevel.NaturalLDAminoAcids)\r\nps3 = PepSift(SiftLevel.NaturalAminoAcidDerivatives)\r\nps4 = PepSift(SiftLevel.NonNaturalAminoAcidDerivatives)\r\nps5 = PepSift(SiftLevel.AllAmineAndAcid)\r\n\r\nmols = [Chem.MolFromSmiles('C[C@@H](C(=O)O)N'), # L-Ala\r\n Chem.MolFromSmiles('C[C@H](C(=O)O)N'), # D-Ala\r\n Chem.MolFromSmiles('C[C@@H](CN)C(=O)O'), # Beta-homo-Ala\r\n Chem.MolFromSmiles('CC(C)(C(=O)O)N'), # Alpha-methyl-Ala\r\n Chem.MolFromSmiles('NCCCCCCCCCCCCCCCC(=O)O'), # Amino-hexadecanoic acid\r\n Chem.MolFromSmiles('c1ccccc1'), # Benzene\r\n ]\r\n\r\nfor mol in mols:\r\n print((ps1.is_peptide(mol),\r\n ps2.is_peptide(mol),\r\n ps3.is_peptide(mol),\r\n ps4.is_peptide(mol),\r\n ps5.is_peptide(mol)\r\n )\r\n )\r\n\r\n# L-Ala\r\n# (True, True, True, True, True)\r\n# D-Ala\r\n# (False, True, True, True, True)\r\n# Beta-homo-Ala\r\n# (False, False, True, True, True)\r\n# Alpha-methyl-Ala\r\n# (False, False, False, True, True)\r\n# Amino-hexadecanoic acid\r\n# (False, False, False, False, True)\r\n# Benzene\r\n# (False, False, False, False, False)\r\n```\r\n\r\n\r\n\r\n:warning: Any peptide containing a natural amino acid is considered a derivative of natural amino acids (even if it also contains non natural amino acids)\r\n",
"bugtrack_url": null,
"license": "",
"summary": "Identify peptides and derivatives from small molecule datasets",
"version": "0.0.2.post1",
"project_urls": {
"Homepage": "https://github.com/OlivierBeq/pepsift"
},
"split_keywords": [
"peptides",
"cheminformatics",
"filtering"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "ad53fd541e503cd5e136b30a71b3618afa7893fd59b919776dbab723f02793a3",
"md5": "9cd1b8b39d9bca77a2f4ec921766ba6b",
"sha256": "5ce30d63b4552609d87fc117bbd7a5be0c1cbe1d6549f0160d5b9ee247f23aaa"
},
"downloads": -1,
"filename": "pepsift-0.0.2.post1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "9cd1b8b39d9bca77a2f4ec921766ba6b",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 7724,
"upload_time": "2023-08-04T14:14:18",
"upload_time_iso_8601": "2023-08-04T14:14:18.808131Z",
"url": "https://files.pythonhosted.org/packages/ad/53/fd541e503cd5e136b30a71b3618afa7893fd59b919776dbab723f02793a3/pepsift-0.0.2.post1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "2ccbb4a20257dc444b828112b1402ecd8299603a19f2ed05b8a8d82931ea0916",
"md5": "a409a5b90832688bb181ea12f12941b9",
"sha256": "2ca7d78ae9d14f4b259819518e16e8d8830d1c57d309b7992dc4a91a25658ed2"
},
"downloads": -1,
"filename": "pepsift-0.0.2.post1.tar.gz",
"has_sig": false,
"md5_digest": "a409a5b90832688bb181ea12f12941b9",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 9383,
"upload_time": "2023-08-04T14:14:19",
"upload_time_iso_8601": "2023-08-04T14:14:19.753752Z",
"url": "https://files.pythonhosted.org/packages/2c/cb/b4a20257dc444b828112b1402ecd8299603a19f2ed05b8a8d82931ea0916/pepsift-0.0.2.post1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-08-04 14:14:19",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "OlivierBeq",
"github_project": "pepsift",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"tox": true,
"lcname": "pepsift"
}