step-kit


Namestep-kit JSON
Version 0.2.2 PyPI version JSON
download
home_pagehttps://github.com/SGGb0nd/step
SummarySTEP, an acronym for Spatial Transcriptomics Embedding Procedure, is a deep learning-based tool for the analysis of single-cell RNA (scRNA-seq) and spatially resolved transcriptomics (SRT) data. STEP introduces a unified approach to process and analyze multiple samples of scRNA-seq data as well as align several sections of SRT data, disregarding location relationships. Furthermore, STEP conducts integrative analysis across different modalities like scRNA-seq and SRT.
upload_time2024-04-25 17:18:19
maintainerNone
docs_urlNone
authorSGGb0nd
requires_python<4.0,>=3.10
licenseApache-2.0
keywords spatial transcriptomics single-cell rna-seq deep learning scrna-seq srt step step
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # STEP: Spatial Transcriptomics Embedding Procedure
[![Docs](https://github.com/SGGb0nd/step/actions/workflows/mkdocs.yaml/badge.svg)](https://github.com/SGGb0nd/step/actions/workflows/mkdocs.yaml)
[![Pages](https://github.com/SGGb0nd/step/actions/workflows/pages/pages-build-deployment/badge.svg)](https://github.com/SGGb0nd/step/actions/workflows/pages/pages-build-deployment)
[![PyPI version](https://badge.fury.io/py/step-kit.svg)](https://badge.fury.io/py/step-kit)
[![DOI](http://img.shields.io/badge/DOI-10.1101/2024.04.15.589470-B31B1B.svg)](https://doi.org/10.1101/2024.04.15.589470)

![image](http://docs.3s540lab.cloud/images/STEP_fig_1a.webp)
![image](http://docs.3s540lab.cloud/images/STEP_fig_1b.webp)

## Introduction
STEP, an acronym for Spatial Transcriptomics Embedding Procedure, is a foundation deep learning/AI architecture for the analysis of spatially resolved transcriptomics (SRT) data, and is also compatible with scRNA-seq data. STEP roots on the precise captures of three major varitions occured in the SRT (and scRNA-seq) data: **Transcriptional Variations**, **Batch Variations** and **Spatial Variations** with the correponding modular designs: **Backbone model**: a Transformer based model togther with gene module seqeunce mapping; **Batch-effect model**: A pair of inverse transformations utilizing the *batch-embedding* conception for the decoupled batch-effect elimination; **Spatial model**: a GCN-based spatial filter/smoother working on the extracted embedding from the Backbone model, different from the usage of GCN in other methods as a feature extractor. Thus, with the proper combinations of these models, STEP introduces a unified approach to systematically process and analyze single or multiple samples of SRT data, disregarding location relationships between sections (meaning both contiguous and non-contiguous sections), to reveal multi-scale bilogical heterogeneities (cell types and spatial domains) in multi-resolution SRT data. Furthermore, STEP can also conduct integrative analysis on scRNA-seq and SRT data.

## Key Features

-  Integration of multiple scRNA-seq and single-cell resolution SRT samples to reveal cell-type level heterogeneities.
-  Alignment of various SRT data sections contiguous or non-contiguous to identify spatial domains across sections.
-  Scalable to different data resolutions, i.e., wild range of technologies and platforms of SRT data, including **Visium HD**, **Visum**, **MERFISH**, **STARmap**, **Stereo-seq**, **ST**, etc.
-  Scalable to large datasets with a high number of cells and spatial locations.
-  Performance of integrative analysis across modalities (scRNA-seq and SRT) and cell-type deconvolution for the non-single-cell resolution SRT data.

## Other Capabilities
-  Capability to produce not only the batch-corrected embeddings but also batch-corrected gene expression profiles for scRNA-seq data.
-  Capability to perform spatial mapping of reference scRNA-seq data points to the spatial locations of SRT data based on the learned co-embeddings and kNN.
-  Comprehensive `adata` processing by specifically desinged `BaseDataset` class and its view version `MaskedDataset` class for the SRT data, which can be easily integrated with the PyTorch `DataLoader` class for training and validation.
-  Low computational cost and high efficiency in processing large-scale SRT data: 4-8 GB GPU memory is sufficient for processing a dataset with 100,000 spatial locations with 2000+ sample-size in fast mode, and consumes less than 6 minutes for training in 2000 iterations (tested on NVIDIA RTX 3090).

## System Requiremtns
-  **Software Requirements**: Python 3.10+, CUDA 11.6+.
-  **Hardware Requirements**: NVIDIA GPU with CUDA support (recommended), 8GB+ GPU memory. As possible as high RAM and CPU cores for storing and processing large-scale data (this is not required by STEP, but by the data itself).

## Installation
```
pip install step-kit
```

Documentation and tutorials are available at [https://sggb0nd.github.io/step/](https://sggb0nd.github.io/step/).


## Contribution

We welcome contributions! Please see [`CONTRIBUTING.md`](./CONTRIBUTING.md) for more details!

## License

step is licensed under [LICENSE](./LICENSE)

## Contact

If you have any questions, please feel free to contact us at [here](mailto:lilounan1997@gmail.com), or feel free to open an issue on this repository.

## Citation
The preprint of STEP is available at [bioRxiv](https://www.biorxiv.org/content/early/2024/04/20/2024.04.15.589470.full.pdf). If you use STEP in your research, please cite:

```bibtex
@article{Li2024.04.15.589470,
  title = {{{STEP}}: {{Spatial}} Transcriptomics Embedding Procedure for Multi-Scale Biological Heterogeneities Revelation in Multiple Samples},
  author = {Li, Lounan and Li, Zhong and Li, Yuanyuan and Yin, Xiao-ming and Xu, Xiaojiang},
  year = {2024},
  journal = {bioRxiv : the preprint server for biology},
  eprint = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.15.589470.full.pdf},
  publisher = {Cold Spring Harbor Laboratory},
  doi = {10.1101/2024.04.15.589470},
}
```
            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/SGGb0nd/step",
    "name": "step-kit",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.10",
    "maintainer_email": null,
    "keywords": "spatial transcriptomics, single-cell RNA-seq, deep learning, scRNA-seq, SRT, step, STEP",
    "author": "SGGb0nd",
    "author_email": "lilounan1997@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/5e/f2/c912487504859f2634dfac7b3e5b772e63822263e7932758fcb0acd8e8f9/step_kit-0.2.2.tar.gz",
    "platform": null,
    "description": "# STEP: Spatial Transcriptomics Embedding Procedure\n[![Docs](https://github.com/SGGb0nd/step/actions/workflows/mkdocs.yaml/badge.svg)](https://github.com/SGGb0nd/step/actions/workflows/mkdocs.yaml)\n[![Pages](https://github.com/SGGb0nd/step/actions/workflows/pages/pages-build-deployment/badge.svg)](https://github.com/SGGb0nd/step/actions/workflows/pages/pages-build-deployment)\n[![PyPI version](https://badge.fury.io/py/step-kit.svg)](https://badge.fury.io/py/step-kit)\n[![DOI](http://img.shields.io/badge/DOI-10.1101/2024.04.15.589470-B31B1B.svg)](https://doi.org/10.1101/2024.04.15.589470)\n\n![image](http://docs.3s540lab.cloud/images/STEP_fig_1a.webp)\n![image](http://docs.3s540lab.cloud/images/STEP_fig_1b.webp)\n\n## Introduction\nSTEP, an acronym for Spatial Transcriptomics Embedding Procedure, is a foundation deep learning/AI architecture for the analysis of spatially resolved transcriptomics (SRT) data, and is also compatible with scRNA-seq data. STEP roots on the precise captures of three major varitions occured in the SRT (and scRNA-seq) data: **Transcriptional Variations**, **Batch Variations** and **Spatial Variations** with the correponding modular designs: **Backbone model**: a Transformer based model togther with gene module seqeunce mapping; **Batch-effect model**: A pair of inverse transformations utilizing the *batch-embedding* conception for the decoupled batch-effect elimination; **Spatial model**: a GCN-based spatial filter/smoother working on the extracted embedding from the Backbone model, different from the usage of GCN in other methods as a feature extractor. Thus, with the proper combinations of these models, STEP introduces a unified approach to systematically process and analyze single or multiple samples of SRT data, disregarding location relationships between sections (meaning both contiguous and non-contiguous sections), to reveal multi-scale bilogical heterogeneities (cell types and spatial domains) in multi-resolution SRT data. Furthermore, STEP can also conduct integrative analysis on scRNA-seq and SRT data.\n\n## Key Features\n\n-  Integration of multiple scRNA-seq and single-cell resolution SRT samples to reveal cell-type level heterogeneities.\n-  Alignment of various SRT data sections contiguous or non-contiguous to identify spatial domains across sections.\n-  Scalable to different data resolutions, i.e., wild range of technologies and platforms of SRT data, including **Visium HD**, **Visum**, **MERFISH**, **STARmap**, **Stereo-seq**, **ST**, etc.\n-  Scalable to large datasets with a high number of cells and spatial locations.\n-  Performance of integrative analysis across modalities (scRNA-seq and SRT) and cell-type deconvolution for the non-single-cell resolution SRT data.\n\n## Other Capabilities\n-  Capability to produce not only the batch-corrected embeddings but also batch-corrected gene expression profiles for scRNA-seq data.\n-  Capability to perform spatial mapping of reference scRNA-seq data points to the spatial locations of SRT data based on the learned co-embeddings and kNN.\n-  Comprehensive `adata` processing by specifically desinged `BaseDataset` class and its view version `MaskedDataset` class for the SRT data, which can be easily integrated with the PyTorch `DataLoader` class for training and validation.\n-  Low computational cost and high efficiency in processing large-scale SRT data: 4-8 GB GPU memory is sufficient for processing a dataset with 100,000 spatial locations with 2000+ sample-size in fast mode, and consumes less than 6 minutes for training in 2000 iterations (tested on NVIDIA RTX 3090).\n\n## System Requiremtns\n-  **Software Requirements**: Python 3.10+, CUDA 11.6+.\n-  **Hardware Requirements**: NVIDIA GPU with CUDA support (recommended), 8GB+ GPU memory. As possible as high RAM and CPU cores for storing and processing large-scale data (this is not required by STEP, but by the data itself).\n\n## Installation\n```\npip install step-kit\n```\n\nDocumentation and tutorials are available at [https://sggb0nd.github.io/step/](https://sggb0nd.github.io/step/).\n\n\n## Contribution\n\nWe welcome contributions! Please see [`CONTRIBUTING.md`](./CONTRIBUTING.md) for more details!\n\n## License\n\nstep is licensed under [LICENSE](./LICENSE)\n\n## Contact\n\nIf you have any questions, please feel free to contact us at [here](mailto:lilounan1997@gmail.com), or feel free to open an issue on this repository.\n\n## Citation\nThe preprint of STEP is available at [bioRxiv](https://www.biorxiv.org/content/early/2024/04/20/2024.04.15.589470.full.pdf). If you use STEP in your research, please cite:\n\n```bibtex\n@article{Li2024.04.15.589470,\n  title = {{{STEP}}: {{Spatial}} Transcriptomics Embedding Procedure for Multi-Scale Biological Heterogeneities Revelation in Multiple Samples},\n  author = {Li, Lounan and Li, Zhong and Li, Yuanyuan and Yin, Xiao-ming and Xu, Xiaojiang},\n  year = {2024},\n  journal = {bioRxiv : the preprint server for biology},\n  eprint = {https://www.biorxiv.org/content/early/2024/04/20/2024.04.15.589470.full.pdf},\n  publisher = {Cold Spring Harbor Laboratory},\n  doi = {10.1101/2024.04.15.589470},\n}\n```",
    "bugtrack_url": null,
    "license": "Apache-2.0",
    "summary": "STEP, an acronym for Spatial Transcriptomics Embedding Procedure, is a deep learning-based tool for the analysis of single-cell RNA (scRNA-seq) and spatially resolved transcriptomics (SRT) data. STEP introduces a unified approach to process and analyze multiple samples of scRNA-seq data as well as align several sections of SRT data, disregarding location relationships. Furthermore, STEP conducts integrative analysis across different modalities like scRNA-seq and SRT.",
    "version": "0.2.2",
    "project_urls": {
        "Documentation": "https://sggb0nd.github.io/step/",
        "Homepage": "https://github.com/SGGb0nd/step",
        "Repository": "https://github.com/SGGb0nd/step"
    },
    "split_keywords": [
        "spatial transcriptomics",
        " single-cell rna-seq",
        " deep learning",
        " scrna-seq",
        " srt",
        " step",
        " step"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "aa53a2b5698bc4a95ba7a0f811c81675ad31b3978877e7f742f45d1c0180201e",
                "md5": "8f95b6b27bf52f21fa2648019a4de7e4",
                "sha256": "48f40b03e9a53700c33a93d8c2b170b98f0a605898475fbbc1316065e636111a"
            },
            "downloads": -1,
            "filename": "step_kit-0.2.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "8f95b6b27bf52f21fa2648019a4de7e4",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.10",
            "size": 96116,
            "upload_time": "2024-04-25T17:18:17",
            "upload_time_iso_8601": "2024-04-25T17:18:17.157452Z",
            "url": "https://files.pythonhosted.org/packages/aa/53/a2b5698bc4a95ba7a0f811c81675ad31b3978877e7f742f45d1c0180201e/step_kit-0.2.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "5ef2c912487504859f2634dfac7b3e5b772e63822263e7932758fcb0acd8e8f9",
                "md5": "8d6f9c01a7c1c15693dec37f6965db10",
                "sha256": "bf33ba0664f02824427544fc03aafab9e3d5aad9e8e223712cabeda1332e3080"
            },
            "downloads": -1,
            "filename": "step_kit-0.2.2.tar.gz",
            "has_sig": false,
            "md5_digest": "8d6f9c01a7c1c15693dec37f6965db10",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.10",
            "size": 82333,
            "upload_time": "2024-04-25T17:18:19",
            "upload_time_iso_8601": "2024-04-25T17:18:19.259644Z",
            "url": "https://files.pythonhosted.org/packages/5e/f2/c912487504859f2634dfac7b3e5b772e63822263e7932758fcb0acd8e8f9/step_kit-0.2.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-25 17:18:19",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "SGGb0nd",
    "github_project": "step",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "step-kit"
}
        
Elapsed time: 0.23623s