pepsift


Namepepsift JSON
Version 0.0.2.post1 PyPI version JSON
download
home_pagehttps://github.com/OlivierBeq/pepsift
SummaryIdentify peptides and derivatives from small molecule datasets
upload_time2023-08-04 14:14:19
maintainerOlivier J. M. Béquignon
docs_urlNone
authorOlivier J. M. Béquignon
requires_python
license
keywords peptides cheminformatics filtering
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # PepSift


## Summary

***Identify peptides and their derivatives from small molecule datasets.***

## Installation

```commandline
pip install pepsift
```

## Usage

`PepSift` relies on multiple criteria defining different types od amino acids and polymers thereof.

There are currently 5 different levels available from most to least stringent:

| level                                      | description                                                       | comment                                                                                                                                                                             |
|--------------------------------------------|-------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| `SiftLevel.NaturalLAminoAcids`             | natural L-amino acids and peptides thereof                        | e.g. identify L-Alanine or the sequence `ACDEFGHIKLMNPQRSTVWY` <br/><img src="images/L-Ala.png" alt="L-Ala" width=200>                                                              |
| `SiftLevel.NaturalLDAminoAcids`            | natural L- and D-amino acid and peptides thereof                  | e.g. identify L-Alanine or the sequences `D-A L-W`, `L-H D-Q`,   `D-M D-K` <br/><img src="images/D-Ala.png" alt="D-Ala" width=200>                                                  |
| `SiftLevel.NaturalAminoAcidDerivatives`    | derivatives of natural L- and D-amino acid and peptides   thereof | i.e. any compound containing a canonical amino acid/peptide backbone <br/><img src="images/AA.png" alt="AA" width=200>                                                              |
| `SiftLevel.NonNaturalAminoAcidDerivatives` | non-natural amino acid derivatives and peptides thereof           | e.g. identify beta-homo-alanine or alpha-methyl-Tyr <br/><img src="images/beta-homo-Ala.png" alt="BH-Ala" width=200> <img src="images/alpha-methyl-Tyr.png" alt="AM-Tyr" width=200> |
| `SiftLevel.AllAmineAndAcid`                | compounds containing amine and carboxylic acid moieties           | e.g. 3-[3-(2-Aminoethyl)cyclohexyl]propionic acid <br/><img src="images/5N1NAPHT.png" alt="5N1NAPHT" width=200>                                                                     |

These levels allow for granular selection of different types amino acids/peptides.

<br/>
The decreasing stringency of `SiftLevel` criteria is exemplified below.

```python
from pepsift import PepSift, SiftLevel

from rdkit import Chem

ps1 = PepSift(SiftLevel.NaturalLAminoAcids)
ps2 = PepSift(SiftLevel.NaturalLDAminoAcids)
ps3 = PepSift(SiftLevel.NaturalAminoAcidDerivatives)
ps4 = PepSift(SiftLevel.NonNaturalAminoAcidDerivatives)
ps5 = PepSift(SiftLevel.AllAmineAndAcid)

mols = [Chem.MolFromSmiles('C[C@@H](C(=O)O)N'),  # L-Ala
        Chem.MolFromSmiles('C[C@H](C(=O)O)N'),   # D-Ala
        Chem.MolFromSmiles('C[C@@H](CN)C(=O)O'), # Beta-homo-Ala
        Chem.MolFromSmiles('CC(C)(C(=O)O)N'),    # Alpha-methyl-Ala
        Chem.MolFromSmiles('NCCCCCCCCCCCCCCCC(=O)O'),   # Amino-hexadecanoic acid
        Chem.MolFromSmiles('c1ccccc1'),           # Benzene
       ]

for mol in mols:
    print((ps1.is_peptide(mol),
           ps2.is_peptide(mol),
           ps3.is_peptide(mol),
           ps4.is_peptide(mol),
           ps5.is_peptide(mol)
           )
          )

# L-Ala
# (True, True, True, True, True)
# D-Ala
# (False, True, True, True, True)
# Beta-homo-Ala
# (False, False, True, True, True)
# Alpha-methyl-Ala
# (False, False, False, True, True)
# Amino-hexadecanoic acid
# (False, False, False, False, True)
# Benzene
# (False, False, False, False, False)
```



:warning: Any peptide containing a natural amino acid is considered a derivative of natural amino acids (even if it also contains non natural amino acids)

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/OlivierBeq/pepsift",
    "name": "pepsift",
    "maintainer": "Olivier J. M. B\u00e9quignon",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "\"olivier.bequignon.maintainer@gmail.com\"",
    "keywords": "peptides,cheminformatics,filtering",
    "author": "Olivier J. M. B\u00e9quignon",
    "author_email": "\"olivier.bequignon.maintainer@gmail.com\"",
    "download_url": "https://files.pythonhosted.org/packages/2c/cb/b4a20257dc444b828112b1402ecd8299603a19f2ed05b8a8d82931ea0916/pepsift-0.0.2.post1.tar.gz",
    "platform": null,
    "description": "# PepSift\r\n\r\n\r\n## Summary\r\n\r\n***Identify peptides and their derivatives from small molecule datasets.***\r\n\r\n## Installation\r\n\r\n```commandline\r\npip install pepsift\r\n```\r\n\r\n## Usage\r\n\r\n`PepSift` relies on multiple criteria defining different types od amino acids and polymers thereof.\r\n\r\nThere are currently 5 different levels available from most to least stringent:\r\n\r\n| level                                      | description                                                       | comment                                                                                                                                                                             |\r\n|--------------------------------------------|-------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\r\n| `SiftLevel.NaturalLAminoAcids`             | natural L-amino acids and peptides thereof                        | e.g. identify L-Alanine or the sequence `ACDEFGHIKLMNPQRSTVWY` <br/><img src=\"images/L-Ala.png\" alt=\"L-Ala\" width=200>                                                              |\r\n| `SiftLevel.NaturalLDAminoAcids`            | natural L- and D-amino acid and peptides thereof                  | e.g. identify L-Alanine or the sequences `D-A L-W`, `L-H D-Q`,   `D-M D-K` <br/><img src=\"images/D-Ala.png\" alt=\"D-Ala\" width=200>                                                  |\r\n| `SiftLevel.NaturalAminoAcidDerivatives`    | derivatives of natural L- and D-amino acid and peptides   thereof | i.e. any compound containing a canonical amino acid/peptide backbone <br/><img src=\"images/AA.png\" alt=\"AA\" width=200>                                                              |\r\n| `SiftLevel.NonNaturalAminoAcidDerivatives` | non-natural amino acid derivatives and peptides thereof           | e.g. identify beta-homo-alanine or alpha-methyl-Tyr <br/><img src=\"images/beta-homo-Ala.png\" alt=\"BH-Ala\" width=200> <img src=\"images/alpha-methyl-Tyr.png\" alt=\"AM-Tyr\" width=200> |\r\n| `SiftLevel.AllAmineAndAcid`                | compounds containing amine and carboxylic acid moieties           | e.g. 3-[3-(2-Aminoethyl)cyclohexyl]propionic acid <br/><img src=\"images/5N1NAPHT.png\" alt=\"5N1NAPHT\" width=200>                                                                     |\r\n\r\nThese levels allow for granular selection of different types amino acids/peptides.\r\n\r\n<br/>\r\nThe decreasing stringency of `SiftLevel` criteria is exemplified below.\r\n\r\n```python\r\nfrom pepsift import PepSift, SiftLevel\r\n\r\nfrom rdkit import Chem\r\n\r\nps1 = PepSift(SiftLevel.NaturalLAminoAcids)\r\nps2 = PepSift(SiftLevel.NaturalLDAminoAcids)\r\nps3 = PepSift(SiftLevel.NaturalAminoAcidDerivatives)\r\nps4 = PepSift(SiftLevel.NonNaturalAminoAcidDerivatives)\r\nps5 = PepSift(SiftLevel.AllAmineAndAcid)\r\n\r\nmols = [Chem.MolFromSmiles('C[C@@H](C(=O)O)N'),  # L-Ala\r\n        Chem.MolFromSmiles('C[C@H](C(=O)O)N'),   # D-Ala\r\n        Chem.MolFromSmiles('C[C@@H](CN)C(=O)O'), # Beta-homo-Ala\r\n        Chem.MolFromSmiles('CC(C)(C(=O)O)N'),    # Alpha-methyl-Ala\r\n        Chem.MolFromSmiles('NCCCCCCCCCCCCCCCC(=O)O'),   # Amino-hexadecanoic acid\r\n        Chem.MolFromSmiles('c1ccccc1'),           # Benzene\r\n       ]\r\n\r\nfor mol in mols:\r\n    print((ps1.is_peptide(mol),\r\n           ps2.is_peptide(mol),\r\n           ps3.is_peptide(mol),\r\n           ps4.is_peptide(mol),\r\n           ps5.is_peptide(mol)\r\n           )\r\n          )\r\n\r\n# L-Ala\r\n# (True, True, True, True, True)\r\n# D-Ala\r\n# (False, True, True, True, True)\r\n# Beta-homo-Ala\r\n# (False, False, True, True, True)\r\n# Alpha-methyl-Ala\r\n# (False, False, False, True, True)\r\n# Amino-hexadecanoic acid\r\n# (False, False, False, False, True)\r\n# Benzene\r\n# (False, False, False, False, False)\r\n```\r\n\r\n\r\n\r\n:warning: Any peptide containing a natural amino acid is considered a derivative of natural amino acids (even if it also contains non natural amino acids)\r\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "Identify peptides and derivatives from small molecule datasets",
    "version": "0.0.2.post1",
    "project_urls": {
        "Homepage": "https://github.com/OlivierBeq/pepsift"
    },
    "split_keywords": [
        "peptides",
        "cheminformatics",
        "filtering"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ad53fd541e503cd5e136b30a71b3618afa7893fd59b919776dbab723f02793a3",
                "md5": "9cd1b8b39d9bca77a2f4ec921766ba6b",
                "sha256": "5ce30d63b4552609d87fc117bbd7a5be0c1cbe1d6549f0160d5b9ee247f23aaa"
            },
            "downloads": -1,
            "filename": "pepsift-0.0.2.post1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "9cd1b8b39d9bca77a2f4ec921766ba6b",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 7724,
            "upload_time": "2023-08-04T14:14:18",
            "upload_time_iso_8601": "2023-08-04T14:14:18.808131Z",
            "url": "https://files.pythonhosted.org/packages/ad/53/fd541e503cd5e136b30a71b3618afa7893fd59b919776dbab723f02793a3/pepsift-0.0.2.post1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "2ccbb4a20257dc444b828112b1402ecd8299603a19f2ed05b8a8d82931ea0916",
                "md5": "a409a5b90832688bb181ea12f12941b9",
                "sha256": "2ca7d78ae9d14f4b259819518e16e8d8830d1c57d309b7992dc4a91a25658ed2"
            },
            "downloads": -1,
            "filename": "pepsift-0.0.2.post1.tar.gz",
            "has_sig": false,
            "md5_digest": "a409a5b90832688bb181ea12f12941b9",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 9383,
            "upload_time": "2023-08-04T14:14:19",
            "upload_time_iso_8601": "2023-08-04T14:14:19.753752Z",
            "url": "https://files.pythonhosted.org/packages/2c/cb/b4a20257dc444b828112b1402ecd8299603a19f2ed05b8a8d82931ea0916/pepsift-0.0.2.post1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-08-04 14:14:19",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "OlivierBeq",
    "github_project": "pepsift",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "tox": true,
    "lcname": "pepsift"
}
        
Elapsed time: 0.10504s