# easy_rag_llm
## CAUTION
- The easy-rag-llm==1.0.* releases are test versions and are generally not usable.
## Introduction
- easy_rag_llm is a lightweight RAG (Retrieval-Augmented Generation) service that supports both OpenAI and DeepSeek models. It is designed to make it easy to integrate RAG-based LLM functionality into your own service.
- As of 2025-01-15 (v1.1.0), the supported resource format for training is PDF.
## Usage
#### Install (https://pypi.org/project/easy-rag-llm/)
```bash
pip install easy_rag_llm
```
#### How to integrate it into your service
```python
from easy_rag import RagService

# Example 1: DeepSeek chat model for responses.
rs = RagService(
    embedding_model="text-embedding-3-small",  # fixed to an OpenAI embedding model
    response_model="deepseek-chat",  # or an OpenAI chat model such as "gpt-3.5-turbo"
    open_api_key="your_openai_api_key_here",
    deepseek_api_key="your_deepseek_api_key_here",
    deepseek_base_url="https://api.deepseek.com",
)

# Example 2: OpenAI chat model for responses.
rs2 = RagService(
    embedding_model="text-embedding-3-small",
    response_model="gpt-3.5-turbo",
    open_api_key="your_openai_api_key_here",
)

# Learn from all files under ./rscFiles.
# Defaults: force_update=False, chunkers=10, embedders=10,
# ef_construction=200, ef_search=100, M=48; all of these can be tuned.
resource = rs.rsc("./rscFiles")  # default worker count is 10

query = "Explain what is taught in the third week's lecture."
response, top_evidence = rs.generate_response(resource, query, evidence_num=5)  # default evidence_num is 3
print(response)
```
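The default worker count of 10 refers to thread pools used for chunking and embedding. The fan-out can be illustrated with a minimal stdlib sketch (illustrative only; `embed_one` is a hypothetical stand-in for the real embedding API call, not part of easy_rag_llm):

```python
from concurrent.futures import ThreadPoolExecutor

def embed_one(chunk: str) -> list[float]:
    # Hypothetical stand-in for the real embedding API call.
    return [float(len(chunk))]

def embed_all(chunks: list[str], embed_worker: int = 10) -> list[list[float]]:
    # embed_worker threads run embed_one concurrently; API rate limits
    # and CPU core count bound how high this can usefully go.
    with ThreadPoolExecutor(max_workers=embed_worker) as pool:
        return list(pool.map(embed_one, chunks))
```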
### Note
- Give your PDFs clear titles. The PDF title is extracted into the metadata and is useful when printing the evidence behind an answer.
- Running `rs.rsc("./folder")` generates `faiss_index.bin` and `metadata.json`. Subsequent calls reuse these existing .bin and .json files to generate responses. If you add or remove files in the folder, set `force_update=True` to force a rebuild of the index.
- `max_worker` is the number of concurrent workers for splitting PDFs, and `embed_worker` is the number of concurrent workers for embedding. Both default to 10; useful values are bounded by CPU core count and API rate limits, so tune them accordingly.
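The caching behavior described above amounts to a reuse-or-rebuild decision, sketched here as a simplified stdlib illustration (not the package's actual code):

```python
from pathlib import Path

def load_or_build(folder: str, force_update: bool = False) -> str:
    """Reuse faiss_index.bin / metadata.json if both exist, unless forced."""
    index_file = Path(folder) / "faiss_index.bin"
    meta_file = Path(folder) / "metadata.json"
    if not force_update and index_file.exists() and meta_file.exists():
        return "reused"  # answer from the already-built index
    # ...chunk PDFs, embed, and write faiss_index.bin + metadata.json here...
    return "rebuilt"
```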
### Release versions
- 1.0.12: Supported, but the embedding model and chat model are fixed to OpenAI's text-embedding-3-small and DeepSeek's deepseek-chat, respectively. The thread pool is fixed at 10 workers, which may cause errors in some environments.
- 1.1.0: LTS version.
### TODO
- Reuse an existing embedding under index/ if one is already present.
- Replace ThreadPool with asyncio (v1.2.* ~)
- Support input formats other than PDF (v1.4.* ~)
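The planned ThreadPool-to-asyncio migration could take a shape like the following sketch (an assumption about the future design, not the package's code; `embed_one` is a hypothetical stand-in for an async embedding call):

```python
import asyncio

async def embed_one(chunk: str) -> list[float]:
    # Hypothetical stand-in for an async embedding API call.
    await asyncio.sleep(0)
    return [float(len(chunk))]

async def embed_all(chunks: list[str], limit: int = 10) -> list[list[float]]:
    # A semaphore caps in-flight requests, playing the role of the
    # current thread-pool worker count.
    sem = asyncio.Semaphore(limit)

    async def bounded(chunk: str) -> list[float]:
        async with sem:
            return await embed_one(chunk)

    return await asyncio.gather(*(bounded(c) for c in chunks))
```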
### What can you do with this?
https://github.com/Aiden-Kwak/ClimateJudgeLLM
### Release Message
v1.1.5<br/>
: The vector search method was changed to HNSW. <br/>
: Vector embedding is significantly faster, with a 90% reduction in time (10x faster than before; about 10 seconds for a 500-page PDF).
### Author Information
- 곽병혁 (https://github.com/Aiden-Kwak)