bioscience


Namebioscience JSON
Version 0.1.3 PyPI version JSON
download
home_pagehttps://github.com/aureliolfdez/bioscience
SummarybioScience: A new Python science library for High-Performance Computing Bioinformatics Analytics
upload_time2024-07-27 21:04:14
maintainerAurelio Lopez-Fernandez
docs_urlNone
authorNone
requires_python>=3.10
licenseBSD License
keywords python data-science data-mining bioinformatics high-performance-computing data-analysis
VCS
bugtrack_url
requirements numpy pandas scikit-learn numba seaborn matplotlib
Travis-CI No Travis.
coveralls test coverage No coveralls.
            bioScience: A new Python science library for High-Performance Computing Bioinformatics Analytics
=================================================================================================

**Deployment & Documentation & Stats**

.. image:: https://img.shields.io/badge/pypi-v0.1.3-brightgreen
   :target: https://pypi.org/project/bioscience/
   :alt: PyPI version


.. image:: https://readthedocs.org/projects/bioscience/badge/?version=latest
   :target: https://bioscience.readthedocs.io/en/latest/?badge=latest
   :alt: Documentation Status


.. image:: https://img.shields.io/github/stars/aureliolfdez/bioscience.svg
   :target: https://github.com/aureliolfdez/bioscience/stargazers
   :alt: GitHub stars


.. image:: https://img.shields.io/github/forks/aureliolfdez/bioscience.svg?color=blue
   :target: https://github.com/aureliolfdez/bioscience/network
   :alt: GitHub forks


.. image:: https://img.shields.io/badge/License-BSD_3--Clause-blue.svg
   :target: https://github.com/aureliolfdez/bioscience/blob/main/LICENSE
   :alt: License

----


BioScience is an advanced Python library designed to satisfy the growing data analysis needs in the field of bioinformatics by leveraging High-Performance Computing (HPC). This library encompasses a vast multitude of functionalities, from loading specialised gene expression datasets (microarrays, RNA-Seq, etc.) to pre-processing techniques and data mining algorithms suitable for this type of datasets. BioScience is distinguished by its capacity to manage large amounts of biological data, providing users with efficient and scalable tools for the analysis of genomic and transcriptomic data through the use of parallel architectures for clusters composed of CPUs and GPUs.


**BioScience** is featured for:

* **Unified APIs, detailed documentation, and interactive examples** available to the community.
* **Complete coverage** for generate biological results from gene co-expression datasets.
* **Optimized models** to generate results in the shortest possible time.
* **Optimization of a High-Performance Computing (HPC) and Big Data ecosystem**.

----

Installation
============

It is recommended to use **pip** for installation. Please make sure
**the latest version** is installed, as bioScience is updated frequently:

.. code-block:: bash

   pip install bioscience            # normal install
   pip install --upgrade bioscience  # or update if needed
   pip install --pre bioscience      # or include pre-release version for new features

Alternatively, you could clone and run setup.py file:

.. code-block:: bash

   git clone https://github.com/aureliolfdez/bioscience.git
   pip install .

**Required Dependencies**\ :

* **Python**>=3.10
* **numpy**>=2.0.1
* **pandas**>=2.2.2
* **scikit-learn**>=1.5.1
* **numba**>=0.60.0
* **seaborn**>=0.13.2
* **matplotlib**>=3.9.1

----

API demo
========

.. code-block:: python


      import bioscience as bs

      if __name__ == "__main__":
         
         # RNA-Seq dataset load
         dataset = load(path="datasets/rnaseq.txt", index_gene=0, index_lengths=1 ,naFilter=True, head = 0)

         # RNA-Seq preprocessing
         bs.tpm(dataset)

         # Binary preprocessing
         bs.binarize(dataset)

         # Data mining phase
         listModels = bs.bibit(dataset, cMnr=2, cMnc=2, mode=3, deviceCount=1, debug = True)

         # Save results
         bs.saveGenes(path="/path/", models=listModels, data=dataset)

**Citing bioScience**\ :

`bioScience <https://www.sciencedirect.com/science/article/pii/S2352711024000372>`_ is published in
`SoftwareX <https://www.sciencedirect.com/science/article/pii/S2352711024000372>`_.
If you use bioScience in a scientific publication, we would appreciate citations to the following paper::

   López-Fernández, A., Gómez-Vela, F. A., Gonzalez-Dominguez, J., & Bidare-Divakarachari, P. (2024). bioScience: A new python science library for high-performance computing bioinformatics analytics. SoftwareX, 26, 101666.

**Key Links and Resources**\ :

* `View the latest codes on Github <https://github.com/aureliolfdez/bioscience>`_
* `View the documentation & API <https://bioscience.readthedocs.io/>`_
* `View all examples <https://github.com/aureliolfdez/bioscience/tree/main/tests/test_integration>`_

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/aureliolfdez/bioscience",
    "name": "bioscience",
    "maintainer": "Aurelio Lopez-Fernandez",
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": "alopfer1@upo.es",
    "keywords": "python, data-science, data-mining, bioinformatics, high-performance-computing, data-analysis",
    "author": null,
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/95/bf/4021dc8dc8b5074758d4cc12ea10cf09a8673d2c9076ac8b78952e40452f/bioscience-0.1.3.tar.gz",
    "platform": null,
    "description": "bioScience: A new Python science library for High-Performance Computing Bioinformatics Analytics\n=================================================================================================\n\n**Deployment & Documentation & Stats**\n\n.. image:: https://img.shields.io/badge/pypi-v0.1.3-brightgreen\n   :target: https://pypi.org/project/bioscience/\n   :alt: PyPI version\n\n\n.. image:: https://readthedocs.org/projects/bioscience/badge/?version=latest\n   :target: https://bioscience.readthedocs.io/en/latest/?badge=latest\n   :alt: Documentation Status\n\n\n.. image:: https://img.shields.io/github/stars/aureliolfdez/bioscience.svg\n   :target: https://github.com/aureliolfdez/bioscience/stargazers\n   :alt: GitHub stars\n\n\n.. image:: https://img.shields.io/github/forks/aureliolfdez/bioscience.svg?color=blue\n   :target: https://github.com/aureliolfdez/bioscience/network\n   :alt: GitHub forks\n\n\n.. image:: https://img.shields.io/badge/License-BSD_3--Clause-blue.svg\n   :target: https://github.com/aureliolfdez/bioscience/blob/main/LICENSE\n   :alt: License\n\n----\n\n\nBioScience is an advanced Python library designed to satisfy the growing data analysis needs in the field of bioinformatics by leveraging High-Performance Computing (HPC). This library encompasses a vast multitude of functionalities, from loading specialised gene expression datasets (microarrays, RNA-Seq, etc.) to pre-processing techniques and data mining algorithms suitable for this type of datasets. BioScience is distinguished by its capacity to manage large amounts of biological data, providing users with efficient and scalable tools for the analysis of genomic and transcriptomic data through the use of parallel architectures for clusters composed of CPUs and GPUs.\n\n\n**BioScience** is featured for:\n\n* **Unified APIs, detailed documentation, and interactive examples** available to the community.\n* **Complete coverage** for generate biological results from gene co-expression datasets.\n* **Optimized models** to generate results in the shortest possible time.\n* **Optimization of a High-Performance Computing (HPC) and Big Data ecosystem**.\n\n----\n\nInstallation\n============\n\nIt is recommended to use **pip** for installation. Please make sure\n**the latest version** is installed, as bioScience is updated frequently:\n\n.. code-block:: bash\n\n   pip install bioscience            # normal install\n   pip install --upgrade bioscience  # or update if needed\n   pip install --pre bioscience      # or include pre-release version for new features\n\nAlternatively, you could clone and run setup.py file:\n\n.. code-block:: bash\n\n   git clone https://github.com/aureliolfdez/bioscience.git\n   pip install .\n\n**Required Dependencies**\\ :\n\n* **Python**>=3.10\n* **numpy**>=2.0.1\n* **pandas**>=2.2.2\n* **scikit-learn**>=1.5.1\n* **numba**>=0.60.0\n* **seaborn**>=0.13.2\n* **matplotlib**>=3.9.1\n\n----\n\nAPI demo\n========\n\n.. code-block:: python\n\n\n      import bioscience as bs\n\n      if __name__ == \"__main__\":\n         \n         # RNA-Seq dataset load\n         dataset = load(path=\"datasets/rnaseq.txt\", index_gene=0, index_lengths=1 ,naFilter=True, head = 0)\n\n         # RNA-Seq preprocessing\n         bs.tpm(dataset)\n\n         # Binary preprocessing\n         bs.binarize(dataset)\n\n         # Data mining phase\n         listModels = bs.bibit(dataset, cMnr=2, cMnc=2, mode=3, deviceCount=1, debug = True)\n\n         # Save results\n         bs.saveGenes(path=\"/path/\", models=listModels, data=dataset)\n\n**Citing bioScience**\\ :\n\n`bioScience <https://www.sciencedirect.com/science/article/pii/S2352711024000372>`_ is published in\n`SoftwareX <https://www.sciencedirect.com/science/article/pii/S2352711024000372>`_.\nIf you use bioScience in a scientific publication, we would appreciate citations to the following paper::\n\n   L\u00f3pez-Fern\u00e1ndez, A., G\u00f3mez-Vela, F. A., Gonzalez-Dominguez, J., & Bidare-Divakarachari, P. (2024). bioScience: A new python science library for high-performance computing bioinformatics analytics. SoftwareX, 26, 101666.\n\n**Key Links and Resources**\\ :\n\n* `View the latest codes on Github <https://github.com/aureliolfdez/bioscience>`_\n* `View the documentation & API <https://bioscience.readthedocs.io/>`_\n* `View all examples <https://github.com/aureliolfdez/bioscience/tree/main/tests/test_integration>`_\n",
    "bugtrack_url": null,
    "license": "BSD License",
    "summary": "bioScience: A new Python science library for High-Performance Computing Bioinformatics Analytics",
    "version": "0.1.3",
    "project_urls": {
        "Bug Tracker": "https://github.com/aureliolfdez/bioscience/issues",
        "Documentation": "https://bioscience.readthedocs.io/en/latest/",
        "Download": "https://pypi.org/project/bioscience/#files",
        "Homepage": "https://github.com/aureliolfdez/bioscience",
        "Source Code": "https://github.com/aureliolfdez/bioscience"
    },
    "split_keywords": [
        "python",
        " data-science",
        " data-mining",
        " bioinformatics",
        " high-performance-computing",
        " data-analysis"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f8ca71c13f9890f2b63fc4e2167b3b55aa01bf4de90ffb9b33debff944b89555",
                "md5": "ec56e84d5d4af7b61d213559a3eb805c",
                "sha256": "1de1d4928b44aef67a754e6ec658504fd35096cdd31ba8fed96c124a9854730a"
            },
            "downloads": -1,
            "filename": "bioscience-0.1.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "ec56e84d5d4af7b61d213559a3eb805c",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 22447,
            "upload_time": "2024-07-27T21:04:12",
            "upload_time_iso_8601": "2024-07-27T21:04:12.526083Z",
            "url": "https://files.pythonhosted.org/packages/f8/ca/71c13f9890f2b63fc4e2167b3b55aa01bf4de90ffb9b33debff944b89555/bioscience-0.1.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "95bf4021dc8dc8b5074758d4cc12ea10cf09a8673d2c9076ac8b78952e40452f",
                "md5": "2b78698f2501da34d669797494d112a6",
                "sha256": "82b811e7286fdc5962271c0b82ca2f31be0eccd34fa07b3f8873e1aa587cdcb4"
            },
            "downloads": -1,
            "filename": "bioscience-0.1.3.tar.gz",
            "has_sig": false,
            "md5_digest": "2b78698f2501da34d669797494d112a6",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 21699,
            "upload_time": "2024-07-27T21:04:14",
            "upload_time_iso_8601": "2024-07-27T21:04:14.112470Z",
            "url": "https://files.pythonhosted.org/packages/95/bf/4021dc8dc8b5074758d4cc12ea10cf09a8673d2c9076ac8b78952e40452f/bioscience-0.1.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-07-27 21:04:14",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "aureliolfdez",
    "github_project": "bioscience",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "numpy",
            "specs": [
                [
                    ">=",
                    "2.0.1"
                ]
            ]
        },
        {
            "name": "pandas",
            "specs": [
                [
                    ">=",
                    "2.2.2"
                ]
            ]
        },
        {
            "name": "scikit-learn",
            "specs": [
                [
                    ">=",
                    "1.5.1"
                ]
            ]
        },
        {
            "name": "numba",
            "specs": [
                [
                    ">=",
                    "0.60.0"
                ]
            ]
        },
        {
            "name": "seaborn",
            "specs": [
                [
                    ">=",
                    "0.13.2"
                ]
            ]
        },
        {
            "name": "matplotlib",
            "specs": [
                [
                    ">=",
                    "3.9.1"
                ]
            ]
        }
    ],
    "lcname": "bioscience"
}
        
Elapsed time: 0.32603s