genegist

- Name: genegist
- Version: 0.1.5
- Home page: https://github.com/aion-labs/genegist
- Summary: Create summaries of groups of genes.
- Upload time: 2024-01-18 14:44:53
- Author: JJ Ben-Joseph
- Requires Python: >3.10
- License: Apache
- Requirements: No requirements were recorded.

# 👓 GeneGist

Researchers often struggle to decipher the complex interactions and functions of systems of genes. This complexity arises from the vast array of gene interactions, regulatory mechanisms, and the multifaceted roles genes play in biological processes. GeneGist addresses this problem by generating detailed summaries of gene behaviors and interactions, as well as their roles in biological pathways and systems.

GeneGist first scrapes academic articles and analyzes them with Large Language Models (LLMs); the supported models are OpenAI's `gpt-3.5-turbo-1106` and `gpt-4-1106-preview`, selectable with `--llm`. From this distilled knowledge it produces biological process summaries.

GeneGist can also create Gene References Into Function (GeneRIFs) directly from scientific literature. GeneRIFs are concise, sentence-like annotations, typically written by a human, that describe the function of a gene. GeneGist instead drafts them with LLM-based generative AI.
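
For concreteness, a GeneRIF can be pictured as a short record tying a gene to one or more publications. The sketch below assumes the five-column, tab-delimited layout used by NCBI's bulk GeneRIF file (taxonomy ID, gene ID, PubMed ID list, timestamp, text). The `GeneRIF` class, the `parse_generif_line` helper, and the sample line are hypothetical illustrations, not GeneGist's internal representation and not real data.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class GeneRIF:
    """One GeneRIF annotation (hypothetical container, not GeneGist's own type)."""
    tax_id: int
    gene_id: int
    pmids: List[int]
    updated: str
    text: str


def parse_generif_line(line: str) -> GeneRIF:
    """Parse one tab-delimited line: tax ID, gene ID, PMIDs, timestamp, text."""
    # Split on the first four tabs only, so the free-text column stays intact.
    tax_id, gene_id, pmids, updated, text = line.rstrip("\n").split("\t", 4)
    return GeneRIF(
        tax_id=int(tax_id),
        gene_id=int(gene_id),
        # Assumption: multiple PMIDs are separated by commas.
        pmids=[int(p) for p in pmids.split(",") if p],
        updated=updated,
        text=text,
    )


# Placeholder line for illustration only; the IDs and text are made up.
sample = "9606\t1234\t111,222\t2024-01-01 00:00\tExample annotation text."
rif = parse_generif_line(sample)
print(rif.gene_id, rif.pmids, rif.text)
```

Keeping the annotation text as the last column means tabs are never needed inside it, which is why a bounded split is enough here.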

## License
Apache License

## Installation

GeneGist's package metadata declares `requires_python >3.10`, so use a Python release newer than 3.10.0. Install it with pip:

```bash
pip install genegist
```
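
After installing, a quick check from Python can confirm that the interpreter satisfies the declared `requires_python` of `>3.10` and that the distribution is visible. This is a minimal sketch using only the standard library; the helper names `python_is_supported` and `installed_version` are made up for this example and are not part of GeneGist.

```python
import sys
from importlib import metadata
from typing import Optional, Sequence


def python_is_supported(version_info: Sequence[int] = sys.version_info) -> bool:
    """Return True when the interpreter is newer than 3.10.0.

    Assumption: '>3.10' is read here as 'strictly newer than 3.10.0'.
    """
    return tuple(version_info[:3]) > (3, 10, 0)


def installed_version(dist_name: str = "genegist") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("Python supported:", python_is_supported())
    print("genegist version:", installed_version())
```

If the second line prints `None`, pip most likely installed the package into a different environment than the one running the script.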

## Development 

### Installing Poetry
Poetry is required to handle dependencies and package management. To install Poetry, run:

```bash
curl -sSL https://install.python-poetry.org | python3 -
```

### Setting Up genegist
1. Clone the repository:

   ```bash
   git clone https://github.com/aion-labs/genegist.git
   cd genegist
   ```

2. Install the dependencies using Poetry:

   ```bash
   poetry install
   ```

## Usage

To use genegist, run the following command:

```bash
poetry run genegist [options]
```

### Options

- `-g GENE`, `--gene GENE`: 
  Look up GeneRIFs for a given gene.

- `-s GENESET`, `--geneset GENESET`: 
  Look up GeneRIFs for a given gene set.

- `-f GENESET_FILE`, `--geneset-file GENESET_FILE`: 
  Look up GeneRIFs for a file containing a list of genes.

- `-p PROCESS`, `--process PROCESS`: 
  Find a biological process for the input gene set.

- `-d CREATE_DRY_RUN`, `--create-dry-run CREATE_DRY_RUN`: 
  Don't actually run the biological process finder, but save the gene summaries to a file.

- `-a`, `--abstracts`: 
  Also look up abstracts.

- `-r LOAD_DRY_RUN`, `--load-dry-run LOAD_DRY_RUN`: 
  Load the gene summaries from a file instead of running the LLM on them explicitly.

- `--llm {gpt-3.5-turbo-1106,gpt-4-1106-preview}`: 
  Specify the LLM to use.

- `-m ARTICLE`, `--article ARTICLE`: 
  Get the summary for a given PMID.

- `-t`, `--tasks`: 
  Run a given custom task. Currently only E3 ligase analysis is supported.

- `-y`, `--synthetic-generifs`: 
  Create synthetic GeneRIFs and save them to a tab-delimited file.

- `-i`, `--build-index`:
  Build an embedding index for all the GeneRIFs.
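
The `--create-dry-run` and `--load-dry-run` options suggest a two-step workflow: save the gene summaries once, then reuse them without regenerating them. The sketch below only assembles the two command lines from flags listed above. The helper names and file names are hypothetical, and pairing `--create-dry-run` with `--geneset-file` is an assumption, since the option list does not say which flags each step requires.

```python
import shlex
from typing import List, Optional


def create_dry_run(geneset_file: str, summaries_file: str) -> List[str]:
    """Step 1: collect gene summaries and save them, skipping the process finder.

    Assumption: the gene set is supplied with --geneset-file.
    """
    return ["genegist", "--geneset-file", geneset_file,
            "--create-dry-run", summaries_file]


def load_dry_run(summaries_file: str, llm: Optional[str] = None) -> List[str]:
    """Step 2: reuse the saved summaries instead of regenerating them."""
    cmd = ["genegist", "--load-dry-run", summaries_file]
    if llm is not None:
        # Only the two model names documented for --llm are known to be valid.
        cmd += ["--llm", llm]
    return cmd


# File names are placeholders chosen for this example.
for cmd in (create_dry_run("genes.txt", "summaries.out"),
            load_dry_run("summaries.out", llm="gpt-4-1106-preview")):
    print(shlex.join(cmd))
```

Inside a development checkout, each printed command would be prefixed with `poetry run`. Building the arguments as lists rather than strings keeps them safe to pass to `subprocess.run` without shell quoting issues.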


## Development

### Running Tests
To run tests, use:

```bash
poetry run pytest
```
            
