kozhindev-data-labeler


Namekozhindev-data-labeler JSON
Version 0.0.1 PyPI version JSON
download
home_pageNone
SummaryПакет, содержащий пайплайн для разметки данных, используя большие языковые модели
upload_time2025-08-11 08:18:16
maintainerNone
docs_urlNone
authorYVoskanyan
requires_python>=3.12
licenseNone
keywords kozhindev datalaber classification llm
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Usage example
```python
from pydantic import BaseModel

from kozhindev_data_labeler import LLMClient
from kozhindev_data_labeler import Pipeline


class LLMPredict(BaseModel):
    reviews: list[str]
    target: list[int]

llm_client = LLMClient(
    model='gpt-4o-mini',
    api_key='API_KEY',
    response_format=LLMPredict
)

reviews = [
    'Сегодня отличная погода, настроение супер!',
    'SOME TEXT 123312!312OKF;SEKF;',
    'PostgreSQL- свободная объектно-реляционная базами данных',
    "Congratulations! You've won a $1,000 Walmart gift card to http://bit.ly/123456 tp claim now."
]

pipeline = Pipeline(
    data={'reviews': reviews},
    prompt='Сделай классификацию сообщений на негативные и позитивные (0 - позитивный, 1 - негативный)',
    model_client=llm_client
)

pipeline.run()
pipeline.to_csv('result.csv')
```

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "kozhindev-data-labeler",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.12",
    "maintainer_email": null,
    "keywords": "kozhindev datalaber classification llm",
    "author": "YVoskanyan",
    "author_email": "yuvoskanyan@team.kozhin.dev",
    "download_url": "https://files.pythonhosted.org/packages/5c/10/6c7660df8edb8d499cbe58c1bbae2fc61dbd49b929448dfd629d80f91fba/kozhindev_data_labeler-0.0.1.tar.gz",
    "platform": null,
    "description": "# Usage example\n```python\nfrom pydantic import BaseModel\n\nfrom kozhindev_data_labeler import LLMClient\nfrom kozhindev_data_labeler import Pipeline\n\n\nclass LLMPredict(BaseModel):\n    reviews: list[str]\n    target: list[int]\n\nllm_client = LLMClient(\n    model='gpt-4o-mini',\n    api_key='API_KEY',\n    response_format=LLMPredict\n)\n\nreviews = [\n    '\u0421\u0435\u0433\u043e\u0434\u043d\u044f \u043e\u0442\u043b\u0438\u0447\u043d\u0430\u044f \u043f\u043e\u0433\u043e\u0434\u0430, \u043d\u0430\u0441\u0442\u0440\u043e\u0435\u043d\u0438\u0435 \u0441\u0443\u043f\u0435\u0440!',\n    'SOME TEXT 123312!312OKF;SEKF;',\n    'PostgreSQL- \u0441\u0432\u043e\u0431\u043e\u0434\u043d\u0430\u044f \u043e\u0431\u044a\u0435\u043a\u0442\u043d\u043e-\u0440\u0435\u043b\u044f\u0446\u0438\u043e\u043d\u043d\u0430\u044f \u0431\u0430\u0437\u0430\u043c\u0438 \u0434\u0430\u043d\u043d\u044b\u0445',\n    \"Congratulations! You've won a $1,000 Walmart gift card to http://bit.ly/123456 tp claim now.\"\n]\n\npipeline = Pipeline(\n    data={'reviews': reviews},\n    prompt='\u0421\u0434\u0435\u043b\u0430\u0439 \u043a\u043b\u0430\u0441\u0441\u0438\u0444\u0438\u043a\u0430\u0446\u0438\u044e \u0441\u043e\u043e\u0431\u0449\u0435\u043d\u0438\u0439 \u043d\u0430 \u043d\u0435\u0433\u0430\u0442\u0438\u0432\u043d\u044b\u0435 \u0438 \u043f\u043e\u0437\u0438\u0442\u0438\u0432\u043d\u044b\u0435 (0 - \u043f\u043e\u0437\u0438\u0442\u0438\u0432\u043d\u044b\u0439, 1 - \u043d\u0435\u0433\u0430\u0442\u0438\u0432\u043d\u044b\u0439)',\n    model_client=llm_client\n)\n\npipeline.run()\npipeline.to_csv('result.csv')\n```\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "\u041f\u0430\u043a\u0435\u0442, \u0441\u043e\u0434\u0435\u0440\u0436\u0430\u0449\u0438\u0439 \u043f\u0430\u0439\u043f\u043b\u0430\u0439\u043d \u0434\u043b\u044f \u0440\u0430\u0437\u043c\u0435\u0442\u043a\u0438 \u0434\u0430\u043d\u043d\u044b\u0445, \u0438\u0441\u043f\u043e\u043b\u044c\u0437\u0443\u044f \u0431\u043e\u043b\u044c\u0448\u0438\u0435 \u044f\u0437\u044b\u043a\u043e\u0432\u044b\u0435 \u043c\u043e\u0434\u0435\u043b\u0438",
    "version": "0.0.1",
    "project_urls": null,
    "split_keywords": [
        "kozhindev",
        "datalaber",
        "classification",
        "llm"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "8ed7c589cc8abd03f47d7de9a5b66f343ae02c3c5080149f62a3072c18725c90",
                "md5": "999c99f164529fb10a92dd0b4325ed5a",
                "sha256": "c715bfb7fc190d66eaefe02dfd45e2aabdfea1cb68d829df5861b61cd690ba5b"
            },
            "downloads": -1,
            "filename": "kozhindev_data_labeler-0.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "999c99f164529fb10a92dd0b4325ed5a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.12",
            "size": 4601,
            "upload_time": "2025-08-11T08:18:14",
            "upload_time_iso_8601": "2025-08-11T08:18:14.883843Z",
            "url": "https://files.pythonhosted.org/packages/8e/d7/c589cc8abd03f47d7de9a5b66f343ae02c3c5080149f62a3072c18725c90/kozhindev_data_labeler-0.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "5c106c7660df8edb8d499cbe58c1bbae2fc61dbd49b929448dfd629d80f91fba",
                "md5": "9882f5238bb265b7492a270acaaf9546",
                "sha256": "7063bd5030429435154dc0eece272cf29664061d1b6add032a47793aa0120945"
            },
            "downloads": -1,
            "filename": "kozhindev_data_labeler-0.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "9882f5238bb265b7492a270acaaf9546",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.12",
            "size": 3752,
            "upload_time": "2025-08-11T08:18:16",
            "upload_time_iso_8601": "2025-08-11T08:18:16.113329Z",
            "url": "https://files.pythonhosted.org/packages/5c/10/6c7660df8edb8d499cbe58c1bbae2fc61dbd49b929448dfd629d80f91fba/kozhindev_data_labeler-0.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-11 08:18:16",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "kozhindev-data-labeler"
}
        
Elapsed time: 1.33789s