ragulate-bio


Nameragulate-bio JSON
Version 0.1.1 PyPI version JSON
download
home_pageNone
SummaryRetrieval augmented validation for TF inference in single cell biology
upload_time2025-08-19 21:26:39
maintainerNone
docs_urlNone
authorMehrdad Zandigohar
requires_python>=3.9
licenseMIT
keywords single-cell regulatory networks rag bioinformatics
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # RAGulate<img src="assets/RAGulate_logo.png" alt="RAGulate logo" align="right" width="200" />
**R**etrieval-**A**ugmented **G**eneration for Post-hoc Literature-Grounded **Regulatory** Validation

[![GitHub issues](https://img.shields.io/github/issues/YDaiLab/RAGulate)](https://github.com/YDaiLab/RAGulate/issues)
[![PyPI - Project](https://img.shields.io/pypi/v/ragulate-bio)](https://pypi.org/project/ragulate-bio/)
[![Conda](https://img.shields.io/conda/v/zandigohar/RAGulate?label=conda)](https://anaconda.org/zandigohar/RAGulate)
[![Docs](https://img.shields.io/badge/docs-GitHub%20Pages-blue)](https://ydailab.github.io/RAGulate/)


## Introduction
**RAGulate** is a **R**etrieval-**A**ugmented **G**eneration (RAG) pipeline that integrates domain-specific large language models (LLM) with curated literature to identify, score, and validate inferred **regulatory** (transcription factor-target gene) interactions in their biological context.

For further information and example tutorials, please check our documentation:

If you have any questions or concerns, feel free to [open an issue](https://github.com/YDaiLab/RAGulate/issues).

## Requirements
RAGulate is implemented in the LlamaIndex framework. Running RAGulate on `CUDA` is highly recommended if available.


Before installing and running RAGulate, ensure you have the following libraries installed:
- **PyTorch** (version 2.0 or higher)  
  Install with the exact command from the [PyTorch “Get Started” page](https://pytorch.org/get-started/locally/) for your OS, Python version and (optionally) CUDA toolkit.
- **NumPy** (version 1.23 or higher)

You can install these dependencies using `pip`:

```bash
pip install torch numpy
```

## Installation

**Option 1 (Coming soon):**  
You can install **RAGulate** via pip for a lightweight installation:

```bash
pip install ragulate-bio
```

**Option 2 (Coming soon):**  
Alternatively, if you want the latest, unreleased version, you can install it directly from the source on GitHub:

```bash
pip install git+https://github.com/YDaiLab/RAGulate.git
```

**Import**

```python
import ragulate_bio as ragulate  # recommended alias
```

Note: The PyPI distribution is named ragulate-bio to avoid a name conflict with an unrelated project called ragulate.
Always import ragulate_bio in Python (you may alias it to ragulate for convenience)

**Option 3 (Coming soon):**  
For users who prefer Conda or Mamba for environment management, you can install **RAGulate** along with extra dependencies:

**Conda:**
```bash
conda install -c zandigohar RAGulate
```

**Mamba:**
```bash
mamba create -n RAGulate -c zandigohar RAGulate
```

## FAQ

**Q1: Do I need a GPU to run RAGulate?**  
No, a GPU is not required. However, using a CUDA-enabled GPU is strongly recommended for faster runs, especially with large queries.

**Q2: How do I know if I can use a GPU with RAGulate?**  
There are two quick checks:

1. **System check**  
   In your terminal, run `nvidia-smi`. If you see your GPU listed (model, memory, driver version), your machine has an NVIDIA GPU with the driver installed.

2. **Python check**  
   In a Python shell, run:
   ```python
   import torch
   print(torch.cuda.is_available())  # True means PyTorch can see your GPU
   print(torch.cuda.device_count())  # How many GPUs are usable
   ```

**Q3: Can I use RAGulate with R-based tools?**  
RAGulate is written in Python and works directly with `Numpy` objects. 

**Q4: What if I also have another package called ragulate installed?**
RAGulate will warn you if it detects a conflicting installation. We recommend using a clean virtual environment to avoid import clashes.

**Q5: How do I cite RAGulate?**  
See the [Citation](#citation) section below for the latest reference and preprint link.

**Q6: How can I reproduce the paper’s results?**  
See our [Reproducibility Guide](https://github.com/YDaiLab/RAGulate/blob/main/notebooks/XYZ.ipynb) for step-by-step instructions. Then run RAGulate.

## Citation

⚠️ Preprint coming soon (bioRxiv, 2025)  
**This repository is under active development.** Please cite as:  
Zandigohar M, Rehman J, Dai Y. RAGulate: RAGulate: Retrieval-Augmented Generation for Post-hoc Literature-Grounded Regulatory Validation. 2025.


## Development & Contact
RAGulate was developed and is actively maintained by Mehrdad Zandigohar as part of his PhD research at the University of Illinois Chicago (UIC), in the lab of Dr. Yang Dai.

📬 For private questions, please email: mzandi2@uic.edu

🤝 For collaboration inquiries, please contact PI: Dr. Yang Dai (yangdai@uic.edu)

Contributions, feature suggestions, and feedback are always welcome!

## License

The code in **RAGulate** is licensed under the [MIT License](https://opensource.org/licenses/MIT), which permits academic and commercial use, modification, and distribution. 

Please note that any third-party dependencies bundled with **RAGulate** may have their own respective licenses.


            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "ragulate-bio",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": null,
    "keywords": "single-cell, regulatory networks, RAG, bioinformatics",
    "author": "Mehrdad Zandigohar",
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/38/5f/1a40998b0039a0ba11b09291c867a460b2c8dde45ae21fcade6258d5aee2/ragulate_bio-0.1.1.tar.gz",
    "platform": null,
    "description": "# RAGulate<img src=\"assets/RAGulate_logo.png\" alt=\"RAGulate logo\" align=\"right\" width=\"200\" />\n**R**etrieval-**A**ugmented **G**eneration for Post-hoc Literature-Grounded **Regulatory** Validation\n\n[![GitHub issues](https://img.shields.io/github/issues/YDaiLab/RAGulate)](https://github.com/YDaiLab/RAGulate/issues)\n[![PyPI - Project](https://img.shields.io/pypi/v/ragulate-bio)](https://pypi.org/project/ragulate-bio/)\n[![Conda](https://img.shields.io/conda/v/zandigohar/RAGulate?label=conda)](https://anaconda.org/zandigohar/RAGulate)\n[![Docs](https://img.shields.io/badge/docs-GitHub%20Pages-blue)](https://ydailab.github.io/RAGulate/)\n\n\n## Introduction\n**RAGulate** is a **R**etrieval-**A**ugmented **G**eneration (RAG) pipeline that integrates domain-specific large language models (LLM) with curated literature to identify, score, and validate inferred **regulatory** (transcription factor-target gene) interactions in their biological context.\n\nFor further information and example tutorials, please check our documentation:\n\nIf you have any questions or concerns, feel free to [open an issue](https://github.com/YDaiLab/RAGulate/issues).\n\n## Requirements\nRAGulate is implemented in the LlamaIndex framework. Running RAGulate on `CUDA` is highly recommended if available.\n\n\nBefore installing and running RAGulate, ensure you have the following libraries installed:\n- **PyTorch** (version 2.0 or higher)  \n  Install with the exact command from the [PyTorch \u201cGet Started\u201d page](https://pytorch.org/get-started/locally/) for your OS, Python version and (optionally) CUDA toolkit.\n- **NumPy** (version 1.23 or higher)\n\nYou can install these dependencies using `pip`:\n\n```bash\npip install torch numpy\n```\n\n## Installation\n\n**Option 1 (Coming soon):**  \nYou can install **RAGulate** via pip for a lightweight installation:\n\n```bash\npip install ragulate-bio\n```\n\n**Option 2 (Coming soon):**  \nAlternatively, if you want the latest, unreleased version, you can install it directly from the source on GitHub:\n\n```bash\npip install git+https://github.com/YDaiLab/RAGulate.git\n```\n\n**Import**\n\n```python\nimport ragulate_bio as ragulate  # recommended alias\n```\n\nNote: The PyPI distribution is named ragulate-bio to avoid a name conflict with an unrelated project called ragulate.\nAlways import ragulate_bio in Python (you may alias it to ragulate for convenience)\n\n**Option 3 (Coming soon):**  \nFor users who prefer Conda or Mamba for environment management, you can install **RAGulate** along with extra dependencies:\n\n**Conda:**\n```bash\nconda install -c zandigohar RAGulate\n```\n\n**Mamba:**\n```bash\nmamba create -n RAGulate -c zandigohar RAGulate\n```\n\n## FAQ\n\n**Q1: Do I need a GPU to run RAGulate?**  \nNo, a GPU is not required. However, using a CUDA-enabled GPU is strongly recommended for faster runs, especially with large queries.\n\n**Q2: How do I know if I can use a GPU with RAGulate?**  \nThere are two quick checks:\n\n1. **System check**  \n   In your terminal, run `nvidia-smi`. If you see your GPU listed (model, memory, driver version), your machine has an NVIDIA GPU with the driver installed.\n\n2. **Python check**  \n   In a Python shell, run:\n   ```python\n   import torch\n   print(torch.cuda.is_available())  # True means PyTorch can see your GPU\n   print(torch.cuda.device_count())  # How many GPUs are usable\n   ```\n\n**Q3: Can I use RAGulate with R-based tools?**  \nRAGulate is written in Python and works directly with `Numpy` objects. \n\n**Q4: What if I also have another package called ragulate installed?**\nRAGulate will warn you if it detects a conflicting installation. We recommend using a clean virtual environment to avoid import clashes.\n\n**Q5: How do I cite RAGulate?**  \nSee the [Citation](#citation) section below for the latest reference and preprint link.\n\n**Q6: How can I reproduce the paper\u2019s results?**  \nSee our [Reproducibility Guide](https://github.com/YDaiLab/RAGulate/blob/main/notebooks/XYZ.ipynb) for step-by-step instructions. Then run RAGulate.\n\n## Citation\n\n\u26a0\ufe0f Preprint coming soon (bioRxiv, 2025)  \n**This repository is under active development.** Please cite as:  \nZandigohar M, Rehman J, Dai Y. RAGulate: RAGulate: Retrieval-Augmented Generation for Post-hoc Literature-Grounded Regulatory Validation. 2025.\n\n\n## Development & Contact\nRAGulate was developed and is actively maintained by Mehrdad Zandigohar as part of his PhD research at the University of Illinois Chicago (UIC), in the lab of Dr. Yang Dai.\n\n\ud83d\udcec For private questions, please email: mzandi2@uic.edu\n\n\ud83e\udd1d For collaboration inquiries, please contact PI: Dr. Yang Dai (yangdai@uic.edu)\n\nContributions, feature suggestions, and feedback are always welcome!\n\n## License\n\nThe code in **RAGulate** is licensed under the [MIT License](https://opensource.org/licenses/MIT), which permits academic and commercial use, modification, and distribution. \n\nPlease note that any third-party dependencies bundled with **RAGulate** may have their own respective licenses.\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Retrieval augmented validation for TF inference in single cell biology",
    "version": "0.1.1",
    "project_urls": null,
    "split_keywords": [
        "single-cell",
        " regulatory networks",
        " rag",
        " bioinformatics"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "c54079139c6892b864884719a9f39f30d8c8665b7dbdb0d800c715fa2561ed25",
                "md5": "18751d82baddbe5b3fcd8bd63dd6e5de",
                "sha256": "91c10536dd112893029c09507687dd216a3fa4f537e088e9472d634415205627"
            },
            "downloads": -1,
            "filename": "ragulate_bio-0.1.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "18751d82baddbe5b3fcd8bd63dd6e5de",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9",
            "size": 5487,
            "upload_time": "2025-08-19T21:26:38",
            "upload_time_iso_8601": "2025-08-19T21:26:38.004975Z",
            "url": "https://files.pythonhosted.org/packages/c5/40/79139c6892b864884719a9f39f30d8c8665b7dbdb0d800c715fa2561ed25/ragulate_bio-0.1.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "385f1a40998b0039a0ba11b09291c867a460b2c8dde45ae21fcade6258d5aee2",
                "md5": "d4a1e2e2df1af3062d205d20c1938010",
                "sha256": "68433734f42ec0fb9462fdb89011c1f0d631dc4c2719e71b68c326f142984c88"
            },
            "downloads": -1,
            "filename": "ragulate_bio-0.1.1.tar.gz",
            "has_sig": false,
            "md5_digest": "d4a1e2e2df1af3062d205d20c1938010",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 360093,
            "upload_time": "2025-08-19T21:26:39",
            "upload_time_iso_8601": "2025-08-19T21:26:39.713692Z",
            "url": "https://files.pythonhosted.org/packages/38/5f/1a40998b0039a0ba11b09291c867a460b2c8dde45ae21fcade6258d5aee2/ragulate_bio-0.1.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-19 21:26:39",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "ragulate-bio"
}
        
Elapsed time: 1.86938s