gptdb


- **Name**: gptdb
- **Version**: 0.5.9
- **Home page**: https://github.com/khulnasoft/GPT-DB
- **Summary**: GPT-DB is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.
- **Upload time**: 2024-07-14 21:57:37
- **Author**: khulnasoft
- **Requires Python**: >=3.10
- **License**: https://opensource.org/license/mit/

# GPT-DB: Revolutionizing Database Interactions with Private LLM Technology
 
<p align="left">
  <img src="./assets/LOGO.png" width="100%" />
</p>

<div align="center">
  <p>
    <a href="https://github.com/khulnasoft/GPT-DB">
        <img alt="stars" src="https://img.shields.io/github/stars/khulnasoft/db-gpt?style=social" />
    </a>
    <a href="https://github.com/khulnasoft/GPT-DB">
        <img alt="forks" src="https://img.shields.io/github/forks/khulnasoft/db-gpt?style=social" />
    </a>
    <a href="https://opensource.org/licenses/MIT">
      <img alt="License: MIT" src="https://img.shields.io/badge/License-MIT-yellow.svg" />
    </a>
     <a href="https://github.com/khulnasoft/GPT-DB/releases">
      <img alt="Release Notes" src="https://img.shields.io/github/release/khulnasoft/GPT-DB" />
    </a>
    <a href="https://github.com/khulnasoft/GPT-DB/issues">
      <img alt="Open Issues" src="https://img.shields.io/github/issues-raw/khulnasoft/GPT-DB" />
    </a>
    <a href="https://discord.gg/7uQnPuveTY">
      <img alt="Discord" src="https://dcbadge.vercel.app/api/server/7uQnPuveTY?compact=true&style=flat" />
    </a>
    <a href="https://join.slack.com/t/slack-inu2564/shared_invite/zt-29rcnyw2b-N~ubOD9kFc7b7MDOAM1otA">
      <img alt="Slack" src="https://badgen.net/badge/Slack/Join%20GPT-DB/0abd59?icon=slack" />
    </a>
    <a href="https://codespaces.new/khulnasoft/GPT-DB">
      <img alt="Open in GitHub Codespaces" src="https://github.com/codespaces/badge.svg" />
    </a>
  </p>


[**简体中文**](README.zh.md) | [**日本語**](README.ja.md) | [**Discord**](https://discord.gg/7uQnPuveTY) | [**Documents**](https://docs.gptdb.site) | [**微信**](https://github.com/khulnasoft/GPT-DB/blob/main/README.zh.md#%E8%81%94%E7%B3%BB%E6%88%91%E4%BB%AC) | [**Community**](https://github.com/khulnasoft/community) | [**Paper**](https://arxiv.org/pdf/2312.17449.pdf)

</div>

## What is GPT-DB?

🤖 **GPT-DB is an open-source, AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents**.

The goal is to build infrastructure for large-model applications by developing capabilities such as multi-model management (SMMF), Text2SQL optimization, a RAG framework and its optimization, multi-agent collaboration, and AWEL (agent workflow orchestration). Together, these make building large-model applications on top of data simpler and more convenient.


🚀 **In the Data 3.0 era, based on models and databases, enterprises and developers can build their own bespoke applications with less code.**

### AI-Native Data App 
---
- 🔥🔥🔥 [Released V0.5.0 | Develop native data applications through workflows and agents](https://docs.gptdb.site/docs/changelog/Released_V0.5.0)
---

![Data-awels](https://github.com/khulnasoft/GPT-DB/assets/17919400/37d116fc-d9dd-4efa-b4df-9ab02b22541c)

![Data-Apps](https://github.com/khulnasoft/GPT-DB/assets/17919400/a7bf6d65-92d1-4f0e-aaf0-259ccdde22fd)

![dashboard-images](https://github.com/khulnasoft/GPT-DB/assets/17919400/1849a79a-f7fd-40cf-bc9c-b117a041dd6a)


## Contents
- [Introduction](#introduction)
- [Install](#install)
- [Features](#features)
- [Contribution](#contribution)
- [Contact](#contact-information)

## Introduction 
The architecture of GPT-DB is shown in the following figure:

<p align="center">
  <img src="./assets/gptdb.png" width="800" />
</p>

The core capabilities include the following parts:

- **RAG (Retrieval Augmented Generation)**: RAG is currently the most widely implemented and most urgently needed application area. GPT-DB already provides a RAG framework that lets users build knowledge-based applications on its RAG capabilities.

- **GBI (Generative Business Intelligence)**: Generative BI is one of the core capabilities of the GPT-DB project, providing the foundational data intelligence technology to build enterprise report analysis and business insights.

- **Fine-tuning Framework**: Model fine-tuning is indispensable for any enterprise applying models in vertical and niche domains. GPT-DB provides a complete fine-tuning framework that integrates seamlessly with the rest of the project. In recent fine-tuning work, it reached 82.5% accuracy on the Spider dataset.

- **Data-Driven Multi-Agents Framework**: GPT-DB offers a data-driven, self-evolving multi-agent framework that aims to make decisions and act on them continuously, based on data.

- **Data Factory**: The Data Factory is mainly about cleaning and processing trustworthy knowledge and data in the era of large models.

- **Data Sources**: GPT-DB integrates various data sources so that production business data connects seamlessly to its core capabilities.
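
To make the RAG capability in the list above more concrete, here is a minimal, library-free sketch of the retrieve-then-prompt pattern. It is not GPT-DB's API: the function names (`tokenize`, `cosine`, `retrieve`, `build_prompt`), the bag-of-words scoring, and the sample documents are all assumptions made for illustration. A real deployment would typically use an embedding model and a vector store instead.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> Counter:
    """Lowercase bag-of-words term counts (a stand-in for a real embedding)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(question: str, documents: list[str], top_k: int = 2) -> list[str]:
    """Return the top_k documents most similar to the question."""
    q = tokenize(question)
    ranked = sorted(documents, key=lambda d: cosine(q, tokenize(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(question: str, context: list[str]) -> str:
    """Assemble the retrieved context and the question into one prompt."""
    joined = "\n".join(f"- {c}" for c in context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {question}"


# Hypothetical knowledge base, invented for this example.
DOCS = [
    "Orders are stored in the orders table with a customer_id column.",
    "The cafeteria opens at nine in the morning.",
    "Customer records live in the customers table keyed by id.",
]

if __name__ == "__main__":
    question = "Which table stores customer orders?"
    print(build_prompt(question, retrieve(question, DOCS, top_k=2)))
```

The prompt returned by `build_prompt` would then be sent to whichever model is configured; that step is omitted here because it depends on the deployment.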

### SubModule
- [GPT-DB-Hub](https://github.com/khulnasoft-lab/GPT-DB-Hub): a high-performance Text-to-SQL workflow built by applying supervised fine-tuning (SFT) to large language models (LLMs).

- [gptdbs](https://github.com/khulnasoft-lab/gptdbs): the official repository of data apps, AWEL operators, AWEL workflow templates, and agents built on GPT-DB.

#### Text2SQL Finetune
- Supported LLMs
  - [x] LLaMA
  - [x] LLaMA-2
  - [x] BLOOM
  - [x] BLOOMZ
  - [x] Falcon
  - [x] Baichuan
  - [x] Baichuan2
  - [x] InternLM
  - [x] Qwen
  - [x] XVERSE
  - [x] ChatGLM2

- SFT accuracy

  As of October 10, 2023, by fine-tuning an open-source model with 13 billion parameters using this project, we achieved execution accuracy on the Spider dataset that surpasses even GPT-4!

[More Information about Text2SQL finetune](https://github.com/khulnasoft-lab/GPT-DB-Hub)

- [GPT-DB-Plugins](https://github.com/khulnasoft-lab/GPT-DB-Plugins): GPT-DB plugins, which can run Auto-GPT plugins directly.
- [GPT-Vis](https://github.com/khulnasoft-lab/GPT-Vis) Visualization protocol
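
The Spider results mentioned above are reported as execution accuracy: a predicted query counts as correct when running it returns the same rows as the reference query. The sketch below illustrates that idea with `sqlite3` from the Python standard library. It is a simplified illustration, not the official Spider evaluator and not GPT-DB-Hub's code; the schema, the query pairs, and the helper names (`run_query`, `execution_accuracy`, `make_demo_db`) are invented for the example, and rows are compared as order-insensitive multisets.

```python
import sqlite3
from collections import Counter


def run_query(conn: sqlite3.Connection, sql: str):
    """Execute sql and return its rows as a multiset, or None if it fails."""
    try:
        return Counter(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_accuracy(conn: sqlite3.Connection, pairs) -> float:
    """Fraction of (predicted, gold) SQL pairs whose results match."""
    if not pairs:
        return 0.0
    correct = 0
    for predicted, gold in pairs:
        expected = run_query(conn, gold)
        actual = run_query(conn, predicted)
        # A prediction that raises an error never counts as correct.
        if actual is not None and actual == expected:
            correct += 1
    return correct / len(pairs)


def make_demo_db() -> sqlite3.Connection:
    """Build a tiny in-memory database (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE singer (id INTEGER, name TEXT, age INTEGER)")
    conn.executemany(
        "INSERT INTO singer VALUES (?, ?, ?)",
        [(1, "Ana", 31), (2, "Bo", 24), (3, "Cy", 45)],
    )
    return conn


# (predicted SQL, gold SQL) pairs, invented for illustration.
PAIRS = [
    ("SELECT name FROM singer WHERE age > 30",
     "SELECT name FROM singer WHERE age >= 31"),
    ("SELECT COUNT(*) FROM singer", "SELECT COUNT(id) FROM singer"),
    ("SELECT name FROM singer", "SELECT name FROM singer WHERE age < 30"),
    ("SELECT nme FROM singer", "SELECT name FROM singer"),
]

if __name__ == "__main__":
    print(execution_accuracy(make_demo_db(), PAIRS))
```

Comparing results rather than SQL text means that differently written but equivalent queries, such as the first two pairs, both count as correct.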

## Install 
![Docker](https://img.shields.io/badge/docker-%230db7ed.svg?style=for-the-badge&logo=docker&logoColor=white)
![Linux](https://img.shields.io/badge/Linux-FCC624?style=for-the-badge&logo=linux&logoColor=black)
![macOS](https://img.shields.io/badge/mac%20os-000000?style=for-the-badge&logo=macos&logoColor=F0F0F0)
![Windows](https://img.shields.io/badge/Windows-0078D6?style=for-the-badge&logo=windows&logoColor=white)

[**Usage Tutorial**](http://docs.gptdb.site/docs/overview)
- [**Install**](http://docs.gptdb.site/docs/installation)
  - [Docker](https://docs.gptdb.site/docs/installation/docker)
  - [Source Code](https://docs.gptdb.site/docs/installation/sourcecode)
- [**Quickstart**](http://docs.gptdb.site/docs/quickstart)
- [**Application**](http://docs.gptdb.site/docs/operation_manual)
  - [App Usage](https://docs.gptdb.site/docs/application/app_usage)
  - [AWEL Flow Usage](https://docs.gptdb.site/docs/application/awel_flow_usage)
- [**Debugging**](http://docs.gptdb.site/docs/operation_manual/advanced_tutorial/debugging)
- [**Advanced Usage**](https://docs.gptdb.site/docs/application/advanced_tutorial/cli)
  - [SMMF](https://docs.gptdb.site/docs/application/advanced_tutorial/smmf)
  - [Finetune](https://docs.gptdb.site/docs/application/fine_tuning_manual/gptdb_hub)
  - [AWEL](https://docs.gptdb.site/docs/latest/awel/tutorial)


## Features

At present, we have introduced several key features to showcase our current capabilities:
- **Private Domain Q&A & Data Processing**

  The GPT-DB project offers a range of functionalities designed to improve knowledge base construction and enable efficient storage and retrieval of both structured and unstructured data. These functionalities include built-in support for uploading multiple file formats, the ability to integrate custom data extraction plug-ins, and unified vector storage and retrieval capabilities for effectively managing large volumes of information.

- **Multi-Data Source & GBI (Generative Business Intelligence)**

  The GPT-DB project facilitates seamless natural language interaction with diverse data sources, including Excel, databases, and data warehouses. It simplifies the process of querying and retrieving information from these sources, empowering users to engage in intuitive conversations and gain insights. Moreover, GPT-DB supports the generation of analytical reports, providing users with valuable data summaries and interpretations.

- **Multi-Agents & Plugins**

  GPT-DB supports custom plug-ins for various tasks and natively integrates the Auto-GPT plug-in model. The agents protocol adheres to the Agent Protocol standard.

- **Automated Text2SQL Fine-tuning**

  We've also developed a lightweight automated fine-tuning framework centred on large language models (LLMs), Text2SQL datasets, and fine-tuning methods such as LoRA, QLoRA, and P-tuning. This framework simplifies Text-to-SQL fine-tuning, making it as straightforward as an assembly-line process. See [GPT-DB-Hub](https://github.com/khulnasoft-lab/GPT-DB-Hub).

- **SMMF (Service-oriented Multi-model Management Framework)**

  We offer extensive model support, including dozens of large language models (LLMs) from both open-source and API agents, such as LLaMA/LLaMA2, Baichuan, ChatGLM, Wenxin, Tongyi, Zhipu, and many more. 

  - News
    - 🔥🔥🔥  [gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it)
    - 🔥🔥🔥  [gemma-2-9b-it](https://huggingface.co/google/gemma-2-9b-it)
    - 🔥🔥🔥  [DeepSeek-Coder-V2-Instruct](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Instruct)
    - 🔥🔥🔥  [DeepSeek-Coder-V2-Lite-Instruct](https://huggingface.co/deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct)
    - 🔥🔥🔥  [Qwen2-57B-A14B-Instruct](https://huggingface.co/Qwen/Qwen2-57B-A14B-Instruct)
    - 🔥🔥🔥  [Qwen2-72B-Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct)
    - 🔥🔥🔥  [Qwen2-7B-Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct)
    - 🔥🔥🔥  [Qwen2-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2-1.5B-Instruct)
    - 🔥🔥🔥  [Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct)
    - 🔥🔥🔥  [glm-4-9b-chat](https://huggingface.co/THUDM/glm-4-9b-chat)
    - 🔥🔥🔥  [Phi-3](https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3)
    - 🔥🔥🔥  [Yi-1.5-34B-Chat](https://huggingface.co/01-ai/Yi-1.5-34B-Chat)
    - 🔥🔥🔥  [Yi-1.5-9B-Chat](https://huggingface.co/01-ai/Yi-1.5-9B-Chat)
    - 🔥🔥🔥  [Yi-1.5-6B-Chat](https://huggingface.co/01-ai/Yi-1.5-6B-Chat)
    - 🔥🔥🔥  [Qwen1.5-110B-Chat](https://huggingface.co/Qwen/Qwen1.5-110B-Chat)
    - 🔥🔥🔥  [Qwen1.5-MoE-A2.7B-Chat](https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B-Chat)
    - 🔥🔥🔥  [Meta-Llama-3-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct)
    - 🔥🔥🔥  [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
    - 🔥🔥🔥  [CodeQwen1.5-7B-Chat](https://huggingface.co/Qwen/CodeQwen1.5-7B-Chat)
    - 🔥🔥🔥  [Qwen1.5-32B-Chat](https://huggingface.co/Qwen/Qwen1.5-32B-Chat)
    - 🔥🔥🔥  [Starling-LM-7B-beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)
    - 🔥🔥🔥  [gemma-7b-it](https://huggingface.co/google/gemma-7b-it)
    - 🔥🔥🔥  [gemma-2b-it](https://huggingface.co/google/gemma-2b-it)
    - 🔥🔥🔥  [SOLAR-10.7B](https://huggingface.co/upstage/SOLAR-10.7B-Instruct-v1.0)
    - 🔥🔥🔥  [Mixtral-8x7B](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1)
    - 🔥🔥🔥  [Qwen-72B-Chat](https://huggingface.co/Qwen/Qwen-72B-Chat)
    - 🔥🔥🔥  [Yi-34B-Chat](https://huggingface.co/01-ai/Yi-34B-Chat)
  - [More Supported LLMs](http://docs.gptdb.site/docs/modules/smmf)

- **Privacy and Security**
  
  We ensure the privacy and security of data through the implementation of various technologies, including privatized large models and proxy desensitization.

- Supported data sources
  - [Datasources](http://docs.gptdb.site/docs/modules/connections)
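
As one way to picture the "proxy desensitization" mentioned under Privacy and Security, the sketch below masks sensitive values before a prompt leaves the local environment and restores them in the model's reply. This illustrates the general technique under stated assumptions and is not GPT-DB's implementation: the regular expressions, the placeholder format, and the function names (`desensitize`, `restore`) are hypothetical.

```python
import re

# Illustrative patterns only; real deployments need far more careful rules.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}


def desensitize(text: str) -> tuple[str, dict[str, str]]:
    """Replace sensitive values with placeholders; return text and mapping."""
    mapping: dict[str, str] = {}

    for label, pattern in PATTERNS.items():
        # Bind label as a default argument so each pass uses its own label.
        def substitute(match: re.Match, label: str = label) -> str:
            placeholder = f"<{label}_{len(mapping)}>"
            mapping[placeholder] = match.group(0)
            return placeholder

        text = pattern.sub(substitute, text)
    return text, mapping


def restore(text: str, mapping: dict[str, str]) -> str:
    """Put the original values back into a model response."""
    for placeholder, original in mapping.items():
        text = text.replace(placeholder, original)
    return text


if __name__ == "__main__":
    masked, mapping = desensitize("Contact ana@example.com or 555-123-4567.")
    print(masked)
    print(restore(masked, mapping))
```

The mapping stays on the local side, so only the masked text would be sent to a remote model.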



## Image
🌐 [AutoDL Image](https://www.codewithgpu.com/i/khulnasoft/GPT-DB/gptdb)


### Language Switching
In the `.env` configuration file, set the `LANGUAGE` parameter to switch languages. The default is English (`en`); Chinese (`zh`) is also supported, and other languages will be added later.
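
A minimal sketch of how such a setting might be read, using only the Python standard library. The `LANGUAGE` key and the `zh`/`en` values come from the text above; the parser, the fallback to English for unknown values, and the function names (`parse_env`, `resolve_language`) are assumptions and not GPT-DB's actual loader.

```python
SUPPORTED_LANGUAGES = {"en", "zh"}  # values named in this README
DEFAULT_LANGUAGE = "en"


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blanks and # comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip("\"'")
    return values


def resolve_language(env_text: str) -> str:
    """Return the configured LANGUAGE, falling back to English."""
    language = parse_env(env_text).get("LANGUAGE", DEFAULT_LANGUAGE).lower()
    return language if language in SUPPORTED_LANGUAGES else DEFAULT_LANGUAGE


if __name__ == "__main__":
    # Example .env content that switches the interface to Chinese.
    print(resolve_language("# UI language\nLANGUAGE=zh\n"))
```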

## Contribution

- For detailed contribution guidelines, please refer to [how to contribute](https://github.com/khulnasoft/GPT-DB/blob/main/CONTRIBUTING.md).

### Contributors Wall
<a href="https://github.com/khulnasoft/GPT-DB/graphs/contributors">
  <img src="https://contrib.rocks/image?repo=khulnasoft/GPT-DB&max=200" />
</a>



## Licence
The MIT License (MIT)

## Citation
If you find `GPT-DB` useful for your research or development, please cite the following <a href="https://arxiv.org/abs/2312.17449" target="_blank">paper</a>:

```bibtex
@article{xue2023gptdb,
      title={GPT-DB: Empowering Database Interactions with Private Large Language Models}, 
      author={Siqiao Xue and Caigao Jiang and Wenhui Shi and Fangyin Cheng and Keting Chen and Hongjun Yang and Zhiping Zhang and Jianshan He and Hongyang Zhang and Ganglin Wei and Wang Zhao and Fan Zhou and Danrui Qi and Hong Yi and Shaodong Liu and Faqiang Chen},
      year={2023},
      journal={arXiv preprint arXiv:2312.17449},
      url={https://arxiv.org/abs/2312.17449}
}
```

## Contact Information
We are working on building a community. If you have any ideas for building it, feel free to contact us.
[![](https://dcbadge.vercel.app/api/server/7uQnPuveTY?compact=true&style=flat)](https://discord.gg/7uQnPuveTY)

[![Star History Chart](https://api.star-history.com/svg?repos=khulnasoft/GPT-DB&type=Date)](https://star-history.com/#khulnasoft/GPT-DB)

            

Raw data

{
    "_id": null,
    "home_page": "https://github.com/khulnasoft/GPT-DB",
    "name": "gptdb",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": null,
    "author": "khulnasoft",
    "author_email": "info@khulnasoft.com",
    "download_url": "https://files.pythonhosted.org/packages/df/f9/dd00b79abbab52178f6f88c1ad290e48fe4473bf4069c89bc656194eeabf/gptdb-0.5.9.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "https://opensource.org/license/mit/",
    "summary": "GPT-DB is an experimental open-source project that uses localized GPT large models to interact with your data and environment. With this solution, you can be assured that there is no risk of data leakage, and your data is 100% private and secure.",
    "version": "0.5.9",
    "project_urls": {
        "Homepage": "https://github.com/khulnasoft/GPT-DB"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "aaced197ce1f03d5e43960f669a33c7c90960de75950d73b9a9530f523a5796c",
                "md5": "0a83336a309151fe638d54903bab9640",
                "sha256": "d48c8b6d7695d79c193868bf686238f49bf8f9e4a4c9873f9c359368c640c487"
            },
            "downloads": -1,
            "filename": "gptdb-0.5.9-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "0a83336a309151fe638d54903bab9640",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 923291,
            "upload_time": "2024-07-14T21:57:34",
            "upload_time_iso_8601": "2024-07-14T21:57:34.636451Z",
            "url": "https://files.pythonhosted.org/packages/aa/ce/d197ce1f03d5e43960f669a33c7c90960de75950d73b9a9530f523a5796c/gptdb-0.5.9-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "dff9dd00b79abbab52178f6f88c1ad290e48fe4473bf4069c89bc656194eeabf",
                "md5": "1fe8c17610d7166ca2ef8cde112aaf61",
                "sha256": "d9ea23297b135666f69db9d36be7a4508f270b2cd8116054ec8b42c6f13b0bb3"
            },
            "downloads": -1,
            "filename": "gptdb-0.5.9.tar.gz",
            "has_sig": false,
            "md5_digest": "1fe8c17610d7166ca2ef8cde112aaf61",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 678526,
            "upload_time": "2024-07-14T21:57:37",
            "upload_time_iso_8601": "2024-07-14T21:57:37.629177Z",
            "url": "https://files.pythonhosted.org/packages/df/f9/dd00b79abbab52178f6f88c1ad290e48fe4473bf4069c89bc656194eeabf/gptdb-0.5.9.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-07-14 21:57:37",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "khulnasoft",
    "github_project": "GPT-DB",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "gptdb"
}
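
The `sha256` digests recorded above can be used to check a downloaded file before installing it. The following is a minimal sketch using `hashlib` from the standard library; the helper names (`sha256_of`, `matches_digest`) are assumptions made for this example, and the file is assumed to have been downloaded to the current directory.

```python
import hashlib
import hmac


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Stream a file from disk and return its SHA-256 hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: str, expected_hex: str) -> bool:
    """Compare the file's digest with the expected one (case-insensitive)."""
    return hmac.compare_digest(sha256_of(path), expected_hex.lower())
```

For the wheel listed above, the call would be `matches_digest("gptdb-0.5.9-py3-none-any.whl", "d48c8b6d7695d79c193868bf686238f49bf8f9e4a4c9873f9c359368c640c487")`.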
        