verba-rag


Nameverba-rag JSON
Version 0.2.2 PyPI version JSON
download
home_pagehttps://github.com/weaviate/Verba
SummaryWelcome to Verba: The Golden RAGtriever, an open-source initiative designed to offer a streamlined, user-friendly interface for Retrieval-Augmented Generation (RAG) applications. In just a few easy steps, dive into your data and make meaningful interactions!
upload_time2023-09-01 02:09:49
maintainer
docs_urlNone
authorWeaviate
requires_python
license
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Verba 
## πŸ• The Golden RAGtriever

Welcome to Verba: The Golden RAGtriever, an open-source initiative designed to offer a streamlined, user-friendly interface for Retrieval-Augmented Generation (RAG) applications. In just a few easy steps, dive into your data and make meaningful interactions!

```pip install verba_rag```

[![Weaviate](https://img.shields.io/static/v1?label=powered%20by&message=Weaviate%20%E2%9D%A4&color=green&style=flat-square)](https://weaviate.io/) 
[![PyPi downloads](https://static.pepy.tech/personalized-badge/verba_rag?period=total&units=international_system&left_color=grey&right_color=orange&left_text=pip%20downloads)](https://pypi.org/project/verba_rag/) [![Docker support](https://img.shields.io/badge/Docker_support-%E2%9C%93-4c1?style=flat-square&logo=docker&logoColor=white)](https://docs.docker.com/get-started/) [![Demo](https://img.shields.io/badge/Check%20out%20the%20demo!-yellow?&style=flat-square&logo=react&logoColor=white)](https://verba.weaviate.io/)

![Demo of Verba](https://github.com/weaviate/Verba/blob/main/img/verba.gif)

## 🎯 What Is Verba?
Verba is more than just a toolβ€”it's a personal assistant for querying and interacting with your data. Have questions about your documents? Need to cross-reference multiple data points? Want to gain insights from your existing knowledge base? Verba makes it all possible through the power of Weaviate and Large Language Models (LLMs).

## βš™οΈ Under the Hood
Built on top of Weaviate's state-of-the-art Generative Search technology, Verba fetches relevant portions of documents to answer your queries. It leverages the computational strength of LLMs to offer comprehensive, contextually relevant answers. All of this is conveniently accessible through Verba's intuitive user interface.

## πŸ’‘ Effortless Data Import with Weaviate
Verba offers seamless data import functionality, supporting a diverse range of file types including .txt, .md, and more. Before feeding your data into Weaviate, our system handles chunking and vectorization to optimize it for search and retrieval.

> πŸ”§ Work in Progress: We are actively developing a data cleaning pipeline for custom datasets. Until then, please ensure your data is clean and well-structured before importing it into Weaviate.

## πŸ’₯ Advanced Query Resolution with Hybrid and Generative Search
Harness the power of Weaviate's generate module and hybrid search features when using Verba. These advanced search techniques sift through your documents to identify contextually relevant fragments, which are then used by Large Language Models to formulate comprehensive answers to your queries.

## πŸ”₯ Accelerate Queries with Semantic Cache
Verba utilizes Weaviate's Semantic Cache to embed both the generated results and queries, making future searches incredibly efficient. When you ask a question, Verba will first check the Semantic Cache to see if a semantically identical query has already been processed.

# ✨ Getting Started with Verba

This section outlines various methods to set up and deploy Verba, so you can choose the one that fits you best:

- Deploy with `pip`
- Build from Source
- Use Docker for Deployment

**Prerequisites**: If you're not using Docker, ensure that you have Python >=3.9.0 installed on your system.

**πŸ”‘ API Key Requirement**: Regardless of the deployment method, you'll need an OpenAI API key to enable data ingestion and querying features. You can specify this by either creating a .env file when cloning the project, or by storing the API key in your system environment variables.

## πŸš€ Quickstart: Deploy with pip

1. **Initialize a new Python Environment**
- ```python3 -m virutalenv venv```

2. **Install Verba**
- ```pip install verba_rag```

3. **Launch Verba**
- ```verba start```

## πŸ› οΈ Quickstart: Build from Source

1. **Clone the Verba repos**
- ```git clone https://github.com/weaviate/Verba.git```

2. **Initialize a new Python Environment**
- ```python3 -m virutalenv venv```

3. **Install Verba**
- ```pip install -e .```

4. **Launch Verba**
- ```verba start```

## 🐳 Quickstart: Deploy with Docker
If you're unfamiliar with Docker, you can learn more about it [here](https://docker-curriculum.com/)

0. **Clone the Verba repos**
- ```git clone https://github.com/weaviate/Verba.git```

2. **Deploy using Docker**
- ```docker-compose up```

## 🌐 Selecting the Optimal Weaviate Deployment for Verba

Verba provides flexibility in connecting to Weaviate instances based on your needs. By default, Verba opts for [Weaviate Embedded](https://weaviate.io/developers/weaviate/installation/embedded) if it doesn't detect the `VERBA_URL` and `VERBA_API_KEY` environment variables. This local deployment is the most straightforward way to launch your Weaviate database for prototyping and testing.

However, you have other compelling options to consider:

**🌩️ Weaviate Cloud Service (WCS)**

If you prefer a cloud-based solution, Weaviate Cloud Service (WCS) offers a scalable, managed environment. Learn how to set up a cloud cluster by following the [Weaviate Cluster Setup Guide](https://weaviate.io/developers/wcs/guides/create-instance).

**🐳 Docker Deployment**
Another robust local alternative is deploying Weaviate using Docker. For more details, consult the [Weaviate Docker Guide](https://weaviate.io/developers/weaviate/installation/docker-compose).

**🌿 Environment Variable Configuration**
Regardless of your chosen deployment method, you'll need to specify the following environment variables. These can either be added to a .env file in your project directory or set as global environment variables on your system:

- ```VERBA_URL=http://your-weaviate-server:8080```
- ```VERBA_API_KEY=your-weaviate-database-key```

# πŸ“¦ Data Import Guide

Verba offers straightforward commands to import your data for further interaction. Before you proceed, please be aware that importing data will incur costs based on your configured OpenAI access key.

> **Important Notes:**
> Supported file types are currently limited to .txt, .md, and .mdx. Additional formats are in development.
> Basic CRUD operations and UI interactions are also in the pipeline.

``` 
verba start             # Initiates Verba application
verba import --path "Path to your dir or file" --model "gpt-3.5-turbo" --clear True # Imports data into Verba
verba clear             # Deletes all data within Verba
verba clear_cache       # Removes cached data in Verba
```

If you've cloned the repository, you can get a quick start with sample datasets in the `./data` directory. Use `verba import --path ./data` to import these samples. You can also populate Verba with predefined suggestions using a JSON list via `verba import --path suggestions.json`. An example is provided in the `./data` directory.

## πŸ’° Large Language Model (LLM) Costs

Verba exclusively utilizes OpenAI models. Be advised that the usage costs for these models will be billed to the API access key you provide. Primarily, costs are incurred during data embedding and answer generation processes. The default vectorization engine for this project is `Ada v2`.

## πŸ› οΈ Project Architecture
Verba is built on three primary components:

- Weaviate Database: You have the option to host on Weaviate Cloud Service (WCS) or run it locally.
- FastAPI Endpoint: Acts as the communication bridge between the Large Language Model provider and the Weaviate database.
- React Frontend (Static served through FastAPI): Offers an interactive UI to display and interact with your data.
Development 

>Note: If you're planning to modify the frontend, ensure you have Node.js version >=18.16.0 installed. For more details on setting up the frontend, check out the Frontend README.

## πŸ’– Open Source Contribution

Your contributions are always welcome! Feel free to contribute ideas, feedback, or create issues and bug reports if you find any! Visit our [Weaviate Community Forum](https://forum.weaviate.io/) if you need any help!


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/weaviate/Verba",
    "name": "verba-rag",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "",
    "author": "Weaviate",
    "author_email": "edward@weaviate.io",
    "download_url": "https://files.pythonhosted.org/packages/79/96/34eb6c9fe6f32538ef1ab52d1bf0911eb5b561c7c5a39e2c645e0fc40a45/verba_rag-0.2.2.tar.gz",
    "platform": null,
    "description": "# Verba \n## \ud83d\udc15 The Golden RAGtriever\n\nWelcome to Verba: The Golden RAGtriever, an open-source initiative designed to offer a streamlined, user-friendly interface for Retrieval-Augmented Generation (RAG) applications. In just a few easy steps, dive into your data and make meaningful interactions!\n\n```pip install verba_rag```\n\n[![Weaviate](https://img.shields.io/static/v1?label=powered%20by&message=Weaviate%20%E2%9D%A4&color=green&style=flat-square)](https://weaviate.io/) \n[![PyPi downloads](https://static.pepy.tech/personalized-badge/verba_rag?period=total&units=international_system&left_color=grey&right_color=orange&left_text=pip%20downloads)](https://pypi.org/project/verba_rag/) [![Docker support](https://img.shields.io/badge/Docker_support-%E2%9C%93-4c1?style=flat-square&logo=docker&logoColor=white)](https://docs.docker.com/get-started/) [![Demo](https://img.shields.io/badge/Check%20out%20the%20demo!-yellow?&style=flat-square&logo=react&logoColor=white)](https://verba.weaviate.io/)\n\n![Demo of Verba](https://github.com/weaviate/Verba/blob/main/img/verba.gif)\n\n## \ud83c\udfaf What Is Verba?\nVerba is more than just a tool\u2014it's a personal assistant for querying and interacting with your data. Have questions about your documents? Need to cross-reference multiple data points? Want to gain insights from your existing knowledge base? Verba makes it all possible through the power of Weaviate and Large Language Models (LLMs).\n\n## \u2699\ufe0f Under the Hood\nBuilt on top of Weaviate's state-of-the-art Generative Search technology, Verba fetches relevant portions of documents to answer your queries. It leverages the computational strength of LLMs to offer comprehensive, contextually relevant answers. All of this is conveniently accessible through Verba's intuitive user interface.\n\n## \ud83d\udca1 Effortless Data Import with Weaviate\nVerba offers seamless data import functionality, supporting a diverse range of file types including .txt, .md, and more. Before feeding your data into Weaviate, our system handles chunking and vectorization to optimize it for search and retrieval.\n\n> \ud83d\udd27 Work in Progress: We are actively developing a data cleaning pipeline for custom datasets. Until then, please ensure your data is clean and well-structured before importing it into Weaviate.\n\n## \ud83d\udca5 Advanced Query Resolution with Hybrid and Generative Search\nHarness the power of Weaviate's generate module and hybrid search features when using Verba. These advanced search techniques sift through your documents to identify contextually relevant fragments, which are then used by Large Language Models to formulate comprehensive answers to your queries.\n\n## \ud83d\udd25 Accelerate Queries with Semantic Cache\nVerba utilizes Weaviate's Semantic Cache to embed both the generated results and queries, making future searches incredibly efficient. When you ask a question, Verba will first check the Semantic Cache to see if a semantically identical query has already been processed.\n\n# \u2728 Getting Started with Verba\n\nThis section outlines various methods to set up and deploy Verba, so you can choose the one that fits you best:\n\n- Deploy with `pip`\n- Build from Source\n- Use Docker for Deployment\n\n**Prerequisites**: If you're not using Docker, ensure that you have Python >=3.9.0 installed on your system.\n\n**\ud83d\udd11 API Key Requirement**: Regardless of the deployment method, you'll need an OpenAI API key to enable data ingestion and querying features. You can specify this by either creating a .env file when cloning the project, or by storing the API key in your system environment variables.\n\n## \ud83d\ude80 Quickstart: Deploy with pip\n\n1. **Initialize a new Python Environment**\n- ```python3 -m virutalenv venv```\n\n2. **Install Verba**\n- ```pip install verba_rag```\n\n3. **Launch Verba**\n- ```verba start```\n\n## \ud83d\udee0\ufe0f Quickstart: Build from Source\n\n1. **Clone the Verba repos**\n- ```git clone https://github.com/weaviate/Verba.git```\n\n2. **Initialize a new Python Environment**\n- ```python3 -m virutalenv venv```\n\n3. **Install Verba**\n- ```pip install -e .```\n\n4. **Launch Verba**\n- ```verba start```\n\n## \ud83d\udc33 Quickstart: Deploy with Docker\nIf you're unfamiliar with Docker, you can learn more about it [here](https://docker-curriculum.com/)\n\n0. **Clone the Verba repos**\n- ```git clone https://github.com/weaviate/Verba.git```\n\n2. **Deploy using Docker**\n- ```docker-compose up```\n\n## \ud83c\udf10 Selecting the Optimal Weaviate Deployment for Verba\n\nVerba provides flexibility in connecting to Weaviate instances based on your needs. By default, Verba opts for [Weaviate Embedded](https://weaviate.io/developers/weaviate/installation/embedded) if it doesn't detect the `VERBA_URL` and `VERBA_API_KEY` environment variables. This local deployment is the most straightforward way to launch your Weaviate database for prototyping and testing.\n\nHowever, you have other compelling options to consider:\n\n**\ud83c\udf29\ufe0f Weaviate Cloud Service (WCS)**\n\nIf you prefer a cloud-based solution, Weaviate Cloud Service (WCS) offers a scalable, managed environment. Learn how to set up a cloud cluster by following the [Weaviate Cluster Setup Guide](https://weaviate.io/developers/wcs/guides/create-instance).\n\n**\ud83d\udc33 Docker Deployment**\nAnother robust local alternative is deploying Weaviate using Docker. For more details, consult the [Weaviate Docker Guide](https://weaviate.io/developers/weaviate/installation/docker-compose).\n\n**\ud83c\udf3f Environment Variable Configuration**\nRegardless of your chosen deployment method, you'll need to specify the following environment variables. These can either be added to a .env file in your project directory or set as global environment variables on your system:\n\n- ```VERBA_URL=http://your-weaviate-server:8080```\n- ```VERBA_API_KEY=your-weaviate-database-key```\n\n# \ud83d\udce6 Data Import Guide\n\nVerba offers straightforward commands to import your data for further interaction. Before you proceed, please be aware that importing data will incur costs based on your configured OpenAI access key.\n\n> **Important Notes:**\n> Supported file types are currently limited to .txt, .md, and .mdx. Additional formats are in development.\n> Basic CRUD operations and UI interactions are also in the pipeline.\n\n``` \nverba start             # Initiates Verba application\nverba import --path \"Path to your dir or file\" --model \"gpt-3.5-turbo\" --clear True # Imports data into Verba\nverba clear             # Deletes all data within Verba\nverba clear_cache       # Removes cached data in Verba\n```\n\nIf you've cloned the repository, you can get a quick start with sample datasets in the `./data` directory. Use `verba import --path ./data` to import these samples. You can also populate Verba with predefined suggestions using a JSON list via `verba import --path suggestions.json`. An example is provided in the `./data` directory.\n\n## \ud83d\udcb0 Large Language Model (LLM) Costs\n\nVerba exclusively utilizes OpenAI models. Be advised that the usage costs for these models will be billed to the API access key you provide. Primarily, costs are incurred during data embedding and answer generation processes. The default vectorization engine for this project is `Ada v2`.\n\n## \ud83d\udee0\ufe0f Project Architecture\nVerba is built on three primary components:\n\n- Weaviate Database: You have the option to host on Weaviate Cloud Service (WCS) or run it locally.\n- FastAPI Endpoint: Acts as the communication bridge between the Large Language Model provider and the Weaviate database.\n- React Frontend (Static served through FastAPI): Offers an interactive UI to display and interact with your data.\nDevelopment \n\n>Note: If you're planning to modify the frontend, ensure you have Node.js version >=18.16.0 installed. For more details on setting up the frontend, check out the Frontend README.\n\n## \ud83d\udc96 Open Source Contribution\n\nYour contributions are always welcome! Feel free to contribute ideas, feedback, or create issues and bug reports if you find any! Visit our [Weaviate Community Forum](https://forum.weaviate.io/) if you need any help!\n\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "Welcome to Verba: The Golden RAGtriever, an open-source initiative designed to offer a streamlined, user-friendly interface for Retrieval-Augmented Generation (RAG) applications. In just a few easy steps, dive into your data and make meaningful interactions!",
    "version": "0.2.2",
    "project_urls": {
        "Homepage": "https://github.com/weaviate/Verba"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "19571885a49e255a3b301de7b44e65539b9518704d449654040ee04f56338128",
                "md5": "8b66b2af9e30240bc1f7631d7826401b",
                "sha256": "e8b3f788b985438a86b845d7ecf9b2d9f70bc2bdcba4029edc4de119e34315da"
            },
            "downloads": -1,
            "filename": "verba_rag-0.2.2-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "8b66b2af9e30240bc1f7631d7826401b",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 1897363,
            "upload_time": "2023-09-01T02:09:45",
            "upload_time_iso_8601": "2023-09-01T02:09:45.819205Z",
            "url": "https://files.pythonhosted.org/packages/19/57/1885a49e255a3b301de7b44e65539b9518704d449654040ee04f56338128/verba_rag-0.2.2-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "799634eb6c9fe6f32538ef1ab52d1bf0911eb5b561c7c5a39e2c645e0fc40a45",
                "md5": "781ab48243e194391a54c8927c1272a3",
                "sha256": "77efbf1675874ccc80e3ec9d0439ad3ed3719a9646ca5f72db5ccbb2637ab8b2"
            },
            "downloads": -1,
            "filename": "verba_rag-0.2.2.tar.gz",
            "has_sig": false,
            "md5_digest": "781ab48243e194391a54c8927c1272a3",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 1773164,
            "upload_time": "2023-09-01T02:09:49",
            "upload_time_iso_8601": "2023-09-01T02:09:49.395092Z",
            "url": "https://files.pythonhosted.org/packages/79/96/34eb6c9fe6f32538ef1ab52d1bf0911eb5b561c7c5a39e2c645e0fc40a45/verba_rag-0.2.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-09-01 02:09:49",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "weaviate",
    "github_project": "Verba",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [],
    "lcname": "verba-rag"
}
        
Elapsed time: 0.13507s