GRATIOSA


NameGRATIOSA JSON
Version 1.0.2 PyPI version JSON
download
home_pagehttps://github.com/sammeyer2017/GRATIOSA.git
SummaryNone
upload_time2025-01-14 13:36:16
maintainerNone
docs_urlNone
authorPineau Maïwenn, Forquet Raphaël, Meyer Sam
requires_pythonNone
licenseNone
keywords
VCS
bugtrack_url
requirements matplotlib numpy pandas pysam scipy statsmodels
Travis-CI No Travis.
coveralls test coverage No coveralls.
            GRATIOSA
========

Genome Regulation Analysis Tool Incorporating Organization and Spatial Architecture

GRATIOSA facilitates the integration, comparison and combined statistical analyses of diffent data types such as Chip-Seq data, RNA-Seq data and genomic annotations. It includes (1) the integration of various data types and formats in a unified framework, allowing direct comparison; (2) an implementation of spatial statistical analyses of the relationship between transcriptional regulation and chromosome organization, greatly reducing coding time; and (3) an improvement in reproducibility, in particular for common regulatory mechanisms across different species.

GRATIOSA is written in Python, and is targeted to computational biologists with some experience in Python. It is designed for UNIX architectures (Linux, MacOS), and typically accessed from Jupyter notebooks to carry out specific analyses. 

The automated import of data and standardized statistical tests and procedures are combined with the flexibility of Python for developing custom subsequent analyses, depending on the particular regulatory mechanism under investigation. GRATIOSA is primarily designed to treat expression data (RNA-Seq or microarrays), and ChIP-Seq data, but can be used with any type of continuous signals along the genome (other binding signals, mutation rates, binding prediction profiles…) or lists of discrete features (annotated or predicted protein binding sites, Hi-C topological domain borders, …). 

How to install the package? 
---------------------------
To install this package, the recommended method is to execute the following command in your terminal 
`pip install GRATIOSA`
For simple testing, you can just download this package manually (using Code / Download ZIP) and work locally. 

Dependencies
------------
GRATIOSA requires Python3 with the standard following libraries: NumPy, Pandas, Scipy, MatPlotLib, StatModels. Additionally, for the specific handling of .bam files, the library pySam is required. See version details in the requirements.txt file.

General presentation 
--------------------
We describe here the recommended installation procedure. For a local use (for simple testing), please follow instructions proposed in the provided tutorials (Jupyter notebooks). 

Before using the package, the user needs to prepare and organize their data into predefined directories: see examples in the provided data/ directory and their use in tutorials. Each new organism corresponds to a directory with the associated name.  This directory must contain the reference sequence in fasta format and an annotation file in gff format in the "annotation" subdirectory. This can be created automatically from the NCBI database (see documentation of the Genome class). The pre-processed and formatted experimental data, provided by the user, should be placed in appropriate directories named according to the data type (see examples in tutorials). These experimental data files should usually be accompanied by an "info" file that the user needs to complete with information about the file organization, following the provided template.

How to use GRATIOSA?
--------------------
The user must define the location of the database (containing all datafiles for the investigated species) as an environment variable GRATIOSA_DB_PATH (e.g. export GRATIOSA_DB_PATH = /home/usr/documents/GRATIOSA/). Thus, the files must be in GRATIOSA_DB_PATH + data/organisms. The most convenient way to run GRATIOSA is probably to install a virtual environment. See tutorials for examples of use. 

Data analysis using the package is performed through Python commands, typically in a Jupyter notebook, with three major steps. Firstly, as the package is an object-oriented framework, the objects (Genome, Transcriptome, ChIP-Seq, etc.) need to be initialized, and then the data can be loaded as attributes. 

The second step involves data processing. During this step, attributes associated with genomic positions can be scaled to the gene level for further analysis. Additionally, the loaded continuous signals can be binned, smoothed, or averaged. A verification and graphical exploration of the signals can be performed at the end of the first or second step using a graphical function that plots the signals on the annotated genome. To prepare for statistical analysis, quantitative data can also be classified.

The last step is the statistical analysis with enrichment or proportion tests (for qualitative attributes) and Student/Wilcoxon-Mann-Whitney tests (for quantitative comparisons). These functions can also be used to handle custom data imported manually by the user. Results are saved as tables (in csv format) and can be visualized as annotated bar plots created with graphical functions included in the package. 


Documentation
-------------
https://gratiosa.readthedocs.io/en/latest/Presentation.html

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/sammeyer2017/GRATIOSA.git",
    "name": "GRATIOSA",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": null,
    "author": "Pineau Ma\u00efwenn, Forquet Rapha\u00ebl, Meyer Sam",
    "author_email": "sam.meyer@insa-lyon.fr",
    "download_url": "https://files.pythonhosted.org/packages/9f/74/443bedaa581dd8dcccf6abc3d1bf18457d1b0170237e209bdd7c237cc380/gratiosa-1.0.2.tar.gz",
    "platform": null,
    "description": "GRATIOSA\n========\n\nGenome Regulation Analysis Tool Incorporating Organization and Spatial Architecture\n\nGRATIOSA facilitates the integration, comparison and combined statistical analyses of diffent data types such as Chip-Seq data, RNA-Seq data and genomic annotations. It includes (1) the integration of various data types and formats in a unified framework, allowing direct comparison; (2) an implementation of spatial statistical analyses of the relationship between transcriptional regulation and chromosome organization, greatly reducing coding time; and (3) an improvement in reproducibility, in particular for common regulatory mechanisms across different species.\n\nGRATIOSA is written in Python, and is targeted to computational biologists with some experience in Python. It is designed for UNIX architectures (Linux, MacOS), and typically accessed from Jupyter notebooks to carry out specific analyses. \n\nThe automated import of data and standardized statistical tests and procedures are combined with the flexibility of Python for developing custom subsequent analyses, depending on the particular regulatory mechanism under investigation. GRATIOSA is primarily designed to treat expression data (RNA-Seq or microarrays), and ChIP-Seq data, but can be used with any type of continuous signals along the genome (other binding signals, mutation rates, binding prediction profiles\u2026) or lists of discrete features (annotated or predicted protein binding sites, Hi-C topological domain borders, \u2026). \n\nHow to install the package? \n---------------------------\nTo install this package, the recommended method is to execute the following command in your terminal \n`pip install GRATIOSA`\nFor simple testing, you can just download this package manually (using Code / Download ZIP) and work locally. \n\nDependencies\n------------\nGRATIOSA requires Python3 with the standard following libraries: NumPy, Pandas, Scipy, MatPlotLib, StatModels. Additionally, for the specific handling of .bam files, the library pySam is required. See version details in the requirements.txt file.\n\nGeneral presentation \n--------------------\nWe describe here the recommended installation procedure. For a local use (for simple testing), please follow instructions proposed in the provided tutorials (Jupyter notebooks). \n\nBefore using the package, the user needs to prepare and organize their data into predefined directories: see examples in the provided data/ directory and their use in tutorials. Each new organism corresponds to a directory with the associated name.  This directory must contain the reference sequence in fasta format and an annotation file in gff format in the \"annotation\" subdirectory. This can be created automatically from the NCBI database (see documentation of the Genome class). The pre-processed and formatted experimental data, provided by the user, should be placed in appropriate directories named according to the data type (see examples in tutorials). These experimental data files should usually be accompanied by an \"info\" file that the user needs to complete with information about the file organization, following the provided template.\n\nHow to use GRATIOSA?\n--------------------\nThe user must define the location of the database (containing all datafiles for the investigated species) as an environment variable GRATIOSA_DB_PATH (e.g. export GRATIOSA_DB_PATH = /home/usr/documents/GRATIOSA/). Thus, the files must be in GRATIOSA_DB_PATH + data/organisms. The most convenient way to run GRATIOSA is probably to install a virtual environment. See tutorials for examples of use. \n\nData analysis using the package is performed through Python commands, typically in a Jupyter notebook, with three major steps. Firstly, as the package is an object-oriented framework, the objects (Genome, Transcriptome, ChIP-Seq, etc.) need to be initialized, and then the data can be loaded as attributes. \n\nThe second step involves data processing. During this step, attributes associated with genomic positions can be scaled to the gene level for further analysis. Additionally, the loaded continuous signals can be binned, smoothed, or averaged. A verification and graphical exploration of the signals can be performed at the end of the first or second step using a graphical function that plots the signals on the annotated genome. To prepare for statistical analysis, quantitative data can also be classified.\n\nThe last step is the statistical analysis with enrichment or proportion tests (for qualitative attributes) and Student/Wilcoxon-Mann-Whitney tests (for quantitative comparisons). These functions can also be used to handle custom data imported manually by the user. Results are saved as tables (in csv format) and can be visualized as annotated bar plots created with graphical functions included in the package. \n\n\nDocumentation\n-------------\nhttps://gratiosa.readthedocs.io/en/latest/Presentation.html\n",
    "bugtrack_url": null,
    "license": null,
    "summary": null,
    "version": "1.0.2",
    "project_urls": {
        "Homepage": "https://github.com/sammeyer2017/GRATIOSA.git"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "6fdd351883c0e70c5fbff2ef11ac33c94c1c3470ac0c4cfe1fbf6b3ce7184c99",
                "md5": "d81a1a760de425d8e403f9f123448b91",
                "sha256": "8ecd0b5c303c8ecbb751621d74971eaa0e50ea632ea4765176d8a712ab06fbac"
            },
            "downloads": -1,
            "filename": "GRATIOSA-1.0.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d81a1a760de425d8e403f9f123448b91",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 73043,
            "upload_time": "2025-01-14T13:36:14",
            "upload_time_iso_8601": "2025-01-14T13:36:14.346106Z",
            "url": "https://files.pythonhosted.org/packages/6f/dd/351883c0e70c5fbff2ef11ac33c94c1c3470ac0c4cfe1fbf6b3ce7184c99/GRATIOSA-1.0.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9f74443bedaa581dd8dcccf6abc3d1bf18457d1b0170237e209bdd7c237cc380",
                "md5": "d923c4522a994aeec7214b3fed960f55",
                "sha256": "73ac3c624ae23addff4f2e4664ef5cb0d8a6850253d1cf23fd11f17e72b5d7a6"
            },
            "downloads": -1,
            "filename": "gratiosa-1.0.2.tar.gz",
            "has_sig": false,
            "md5_digest": "d923c4522a994aeec7214b3fed960f55",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 63632,
            "upload_time": "2025-01-14T13:36:16",
            "upload_time_iso_8601": "2025-01-14T13:36:16.727929Z",
            "url": "https://files.pythonhosted.org/packages/9f/74/443bedaa581dd8dcccf6abc3d1bf18457d1b0170237e209bdd7c237cc380/gratiosa-1.0.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-01-14 13:36:16",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "sammeyer2017",
    "github_project": "GRATIOSA",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [
        {
            "name": "matplotlib",
            "specs": [
                [
                    "==",
                    "3.4.3"
                ]
            ]
        },
        {
            "name": "numpy",
            "specs": [
                [
                    "==",
                    "1.22.2"
                ]
            ]
        },
        {
            "name": "pandas",
            "specs": [
                [
                    "==",
                    "1.5.2"
                ]
            ]
        },
        {
            "name": "pysam",
            "specs": [
                [
                    "==",
                    "0.19.1"
                ]
            ]
        },
        {
            "name": "scipy",
            "specs": [
                [
                    "==",
                    "1.7.1"
                ]
            ]
        },
        {
            "name": "statsmodels",
            "specs": [
                [
                    "==",
                    "0.13.2"
                ]
            ]
        }
    ],
    "lcname": "gratiosa"
}
        
Elapsed time: 0.43242s