nsdpy


- **Name:** nsdpy
- **Version:** 1.0.0
- **Home page:** https://github.com/RaphaelHebert/nsdpy
- **Summary:** Automatize the download of DNA sequences from NCBI, sort them according to their taxonomy and filter them with a gene name (provided as a regular expression)
- **Upload time:** 2023-10-15 13:58:55
- **Docs URL:** None
- **Author:** RaphaelHebert
- **Requires Python:** >=3.8,<4.0
- **License:** MIT
- **Keywords:** ncbi, taxonomy, dna
- **Requirements:** certifi, chardet, idna, requests, urllib3
- **Travis-CI:** No Travis.
- **Coveralls test coverage:** No coveralls.

# nsdpy

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![pypi](https://img.shields.io/pypi/v/nsdpy)](https://pypi.org/project/nsdpy/)
[![Python 3.8](https://upload.wikimedia.org/wikipedia/commons/a/a5/Blue_Python_3.8_Shield_Badge.svg)](https://www.python.org/)
[![Documentation Status](https://readthedocs.org/projects/nsdpy/badge/?version=latest)](https://nsdpy.readthedocs.io/en/latest/?badge=latest)
[![Coverage Status](https://coveralls.io/repos/github/RaphaelHebert/nsdpy/badge.svg?branch=fix/longSpeciesList)](https://coveralls.io/github/RaphaelHebert/nsdpy?branch=fix/longSpeciesList)

<div align="center" style="text-align: center">
Your support means the world to me and helps me continue developing and maintaining this repository to benefit the entire community. Thank you for making a difference in the open-source world! </div>
<br/>
<div align="center" style="text-align: center;" markdown="1"><a href="https://www.buymeacoffee.com/joe010" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/default-orange.png" alt="Buy Me A Coffee" height="41" width="174"></a>
</div>

##

- [Introduction](#introduction)
- [Quick start](#quick-start)
- [Workflow](#workflow)
- [Usage](#usage)
  - [Google Colab](#google-colab)
  - [Command line](#command-line)
- [Authors and acknowledgment](#authors-and-acknowledgment)
- [Support](#support)
- [License](#license)
- [More Documentation](#more-documentation)

## Introduction

nsdpy (nucleotide or NCBI sequence downloader) makes it easier to download and sort large batches of DNA sequences from the NCBI database.
It can also be used to filter the sequences based on their annotations.
Using nsdpy the user can:

- **Search** the NCBI nucleotide database
- **Download** the fasta files or the cds_fasta files corresponding to the result of the search
- **Sort** the sequences based on their taxonomy
- **Select** coding sequences from cds files based on the gene names using one or more regular expressions.
  This can help the user retrieve some sequences for which the gene name is annotated in another field.
- **Retrieve** the taxonomic information and add it to the output sequences.
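
To illustrate the **Select** step, here is a minimal sketch of regular-expression filtering on a cds_fasta file. It is not nsdpy's own code: the function names (`read_fasta`, `select_by_gene`), the accession numbers, and the sequences are made up for the example. It assumes headers carry a `[gene=...]` annotation, as NCBI cds_fasta headers usually do, and it only inspects that one field.

```python
import re
from typing import Iterator, List, Tuple

# Header annotation such as "[gene=COX1]" found in NCBI cds_fasta headers.
GENE_FIELD = re.compile(r"\[gene=([^\]]+)\]")


def read_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def select_by_gene(text: str, patterns: List[str]) -> List[Tuple[str, str]]:
    """Keep records whose [gene=...] value matches any pattern (case-insensitive)."""
    compiled = [re.compile(p, re.IGNORECASE) for p in patterns]
    selected = []
    for header, seq in read_fasta(text):
        match = GENE_FIELD.search(header)
        if match and any(rx.search(match.group(1)) for rx in compiled):
            selected.append((header, seq))
    return selected


# Hypothetical records, written by hand for this example.
EXAMPLE = """\
>lcl|AB000001.1_cds_1 [gene=COX1] [protein=cytochrome c oxidase subunit I]
ATGTTCGCC
GACCGT
>lcl|AB000001.1_cds_2 [gene=ND2] [protein=NADH dehydrogenase subunit 2]
ATGAATCCC
>lcl|AB000002.1_cds_1 [gene=coi] [protein=cytochrome c oxidase subunit I]
ATGTTTGCA
"""

# One pattern covers two common spellings of the same gene: COX1 and COI.
hits = select_by_gene(EXAMPLE, [r"^cox?[1i]$"])
for header, seq in hits:
    print(header, len(seq))
```

A single pattern such as `^cox?[1i]$` catches several spellings of one gene name, which is the main reason to filter with regular expressions rather than exact strings.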

## Quick start

- Clone the repository from GitHub:
```bash
git clone https://github.com/RaphaelHebert/nsdpy.git
```
- Install with pip (use `pip3` instead if that is the command for Python 3 in your environment):
```bash
pip install nsdpy
```
*Minimum Python version for nsdpy: 3.8.2*

- Google Colab: save a copy of [this notebook](https://colab.research.google.com/drive/1UmxzRc_k5sNeQ2RPGe29nWR_1_0FRPkq?usp=sharing) in your drive.

## Workflow

<img src="https://docs.google.com/drawings/d/e/2PACX-1vRD4h7l0S57op_4j-5xsz8iv1j1XBliw-jEdtnWOIq-JAU2l8kSV6d1NmkHd5Q4zhUmZCA3SHUSuHJw/pub?w=801&amp;h=744" width="600" />

## Usage
### Google Colab

[nsdpy colab notebook](https://colab.research.google.com/drive/1UmxzRc_k5sNeQ2RPGe29nWR_1_0FRPkq?usp=sharing)

### Command line

```bash
nsdpy -r "USER'S REQUEST" [OPTIONS]
```
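
When nsdpy is called from a script rather than typed in a terminal, quoting the request correctly is the part most likely to go wrong. The sketch below builds the argument list for the `-r` form shown above and uses no other option. The helper name `build_nsdpy_command` and the Entrez-style query are assumptions made for this example, not part of nsdpy.

```python
import shlex
from typing import List


def build_nsdpy_command(request: str) -> List[str]:
    """Return the argument list for `nsdpy -r REQUEST` (no extra options)."""
    if not request.strip():
        raise ValueError("the search request must not be empty")
    return ["nsdpy", "-r", request]


# Illustrative Entrez-style query; adapt it to your own search.
command = build_nsdpy_command("COX1[Title] AND Chordata[Organism]")

# Shell-safe rendering of the same command, handy for logging.
print(shlex.join(command))

# To actually run it (requires nsdpy to be installed and network access):
# import subprocess
# subprocess.run(command, check=True)
```

Passing the command as a list keeps the whole request as one argument, so spaces and brackets in the query need no manual escaping.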

## Authors and acknowledgment

- [Raphael Hebert](https://github.com/RaphaelHebert)
- [Emese Meglecz](https://github.com/meglecz)


## Support

## License

Code and documentation copyright 2021 the nsdpy authors. Code released under the MIT License.

## More Documentation

Official documentation:
[Readthedocs](https://nsdpy.readthedocs.io/en/latest/main.html#overview)

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/RaphaelHebert/nsdpy",
    "name": "nsdpy",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.8,<4.0",
    "maintainer_email": "",
    "keywords": "NCBI,Taxonomy,DNA",
    "author": "RaphaelHebert",
    "author_email": "raphaelhebert18@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/bc/f9/3a298deb1e2211aa128a8eb7bf8084e6da26e4fc1a5cbc21fcf3a1aca916/nsdpy-1.0.0.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Automatize the download of DNA sequences from NCBI, sort them according to their taxonomy and filter them with a gene name (provided as a regular expression)",
    "version": "1.0.0",
    "project_urls": {
        "Homepage": "https://github.com/RaphaelHebert/nsdpy",
        "Repository": "https://github.com/RaphaelHebert/nsdpy"
    },
    "split_keywords": [
        "ncbi",
        "taxonomy",
        "dna"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d25f2a86a241e8e1b569ff622da610f9d3320be5e213c0d68d18eae8dc928213",
                "md5": "89f9aeec8b87e2eda94305ce38b215fa",
                "sha256": "8f25e7dd738eccbac1c05d6b8e8592fa1d7e77308dd6b07d64b87060d97b0bb6"
            },
            "downloads": -1,
            "filename": "nsdpy-1.0.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "89f9aeec8b87e2eda94305ce38b215fa",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8,<4.0",
            "size": 18084,
            "upload_time": "2023-10-15T13:58:54",
            "upload_time_iso_8601": "2023-10-15T13:58:54.183714Z",
            "url": "https://files.pythonhosted.org/packages/d2/5f/2a86a241e8e1b569ff622da610f9d3320be5e213c0d68d18eae8dc928213/nsdpy-1.0.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "bcf93a298deb1e2211aa128a8eb7bf8084e6da26e4fc1a5cbc21fcf3a1aca916",
                "md5": "a950e186fa45dce809b19a18789416a6",
                "sha256": "da5c37f99f77f9a2f6cbfa82ea1aa7958b83a721318137f427655c1a8224ac7a"
            },
            "downloads": -1,
            "filename": "nsdpy-1.0.0.tar.gz",
            "has_sig": false,
            "md5_digest": "a950e186fa45dce809b19a18789416a6",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8,<4.0",
            "size": 18159,
            "upload_time": "2023-10-15T13:58:55",
            "upload_time_iso_8601": "2023-10-15T13:58:55.723279Z",
            "url": "https://files.pythonhosted.org/packages/bc/f9/3a298deb1e2211aa128a8eb7bf8084e6da26e4fc1a5cbc21fcf3a1aca916/nsdpy-1.0.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-10-15 13:58:55",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "RaphaelHebert",
    "github_project": "nsdpy",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "certifi",
            "specs": [
                [
                    ">=",
                    "2020.12.5"
                ]
            ]
        },
        {
            "name": "chardet",
            "specs": [
                [
                    "==",
                    "4.0.0"
                ]
            ]
        },
        {
            "name": "idna",
            "specs": [
                [
                    "==",
                    "2.10"
                ]
            ]
        },
        {
            "name": "requests",
            "specs": [
                [
                    "==",
                    "2.25.1"
                ]
            ]
        },
        {
            "name": "urllib3",
            "specs": [
                [
                    "==",
                    "1.26.2"
                ]
            ]
        }
    ],
    "lcname": "nsdpy"
}