ecko-cli


Nameecko-cli JSON
Version 1.2.0 PyPI version JSON
download
home_pageNone
SummaryCLI tool that easily converts a directory of images into a dataset for training generative ai models
upload_time2024-09-23 12:57:36
maintainerNone
docs_urlNone
authorNone
requires_python>=3.11
licenseCopyright 2024 Regi E Ellis Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the “Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
keywords ai cli dataset florence-2 generative-ai huggingface image image-classificatioon image-processing onnx timm torch torchvision training transformers
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # ecko-cli


> [!IMPORTANT]
> This tool makes use of the `SmilingWolf/wd-eva02-large-tagger-v3` library, which you will need to download 
> and place in the `models` directory inside the `ecko_cli` folder of this project. Make sure to not rename
> the file as the script will be looking for `model.onnx`.
> 
> [Huggingface Repo](https://huggingface.co/SmilingWolf/wd-eva02-large-tagger-v3/tree/main)

> [!IMPORTANT]
> This tool makes use of the `flash-attention` library, which has known to be problematic to install based on PyTorch > and CUDA versions. You may
> need to install the dependencies manually if you encounter issues. The way to install flash-attention is to clone >  the repo and install the package with pip. This is the recommended way to install the package. You can also
> install the package with pipx, but you will need to clone the repo first
>
>```bash
> git clone https://github.com/Dao-AILab/flash-attention.git
> cd flash-attention
> pip install flash_attn --no-build-isolation
> pip install timm
>```
>

> [!NOTE]
> You may notice a delay when first using the tool...normally this means that the models
> are being downloaded/update from huggingface or that they are being moved to your
> GPU. Check your terminal to make see progress

## Overview

**ecko-cli** is a simple CLI tool that streamlines the process of processingimages in a directory, generating captions, and saving them as text files.
Additionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and the ONNX Runtime engine. Images are resized to multiple sizes for better captioning results. [1024, 768, 672, 512]. The WD14 model is used for captioning all images based on a modified version of the selected tags it was trained on.


![screenshot](screen.png)

## Why

I wanted to create a tool that would allow me to process images in bulk quickly and efficiently for using in generative art projects. This tool
allows me to generate captions for images that I can use as training data captions for my training LORAs (Large OpenAI Research Agents) and other
generative models.


## Installation (Recommended)

You have a couple of options for installing/running the tool:

### Install [pipx](https://pipxproject.github.io/pipx/installation/), then run the tool with the following command

```bash
pipx install ecko-cli
```

### Alternatively, you can install using `pip`

```bash
pip install .
```

## Configuration

> [!IMPORTANT]
> Before using the tool, It's required to set up a `.env` file in the parent directory of the script or your home user dir [windows] or `$HOME/.config/civitai-cli-manager/.env`

The application intelligently locates your `.env` file, accommodating various platforms like Windows and Linux, or defaulting to the current directory.

## Usage // Available Commands

Once installed via pipx or pip:

```
ecko-cli process-images /path/to/images watercolors --padding 4
```
```
ecko-cli process-images /path/to/images doors --is_object True
```
```
ecko-cli process-images /path/to/images doors --trigger WORD
```
```
ecko-cli create-jsonl /path/to/images [dataset]
```


## Dependencies

This tool requires Python 3.11 or higher and has the following dependencies:

```bash
"typer",
"rich",
"shellingham",
"python-dotenv",
"onnxruntime-gpu",
"numpy",
"pandas",
"torch",
"pillow",
"einops"
"transformers"
"timm"
"huggingface_hub[cli]"
```

### Contact

For any inquiries, feedback, or suggestions, please feel free to open an issue on this repository.

### License

This project is licensed under the [MIT License](LICENSE).

---

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "ecko-cli",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.11",
    "maintainer_email": null,
    "keywords": "ai, cli, dataset, florence-2, generative-ai, huggingface, image, image-classificatioon, image-processing, onnx, timm, torch, torchvision, training, transformers",
    "author": null,
    "author_email": "Regi E <regi@bynine.io>",
    "download_url": "https://files.pythonhosted.org/packages/9e/b9/bd3b45dd5ad01bd1b2ec3b40b7c068eced69b573317e2c870e2dfababa07/ecko_cli-1.2.0.tar.gz",
    "platform": null,
    "description": "# ecko-cli\n\n\n> [!IMPORTANT]\n> This tool makes use of the `SmilingWolf/wd-eva02-large-tagger-v3` library, which you will need to download \n> and place in the `models` directory inside the `ecko_cli` folder of this project. Make sure to not rename\n> the file as the script will be looking for `model.onnx`.\n> \n> [Huggingface Repo](https://huggingface.co/SmilingWolf/wd-eva02-large-tagger-v3/tree/main)\n\n> [!IMPORTANT]\n> This tool makes use of the `flash-attention` library, which has known to be problematic to install based on PyTorch > and CUDA versions. You may\n> need to install the dependencies manually if you encounter issues. The way to install flash-attention is to clone >  the repo and install the package with pip. This is the recommended way to install the package. You can also\n> install the package with pipx, but you will need to clone the repo first\n>\n>```bash\n> git clone https://github.com/Dao-AILab/flash-attention.git\n> cd flash-attention\n> pip install flash_attn --no-build-isolation\n> pip install timm\n>```\n>\n\n> [!NOTE]\n> You may notice a delay when first using the tool...normally this means that the models\n> are being downloaded/update from huggingface or that they are being moved to your\n> GPU. Check your terminal to make see progress\n\n## Overview\n\n**ecko-cli** is a simple CLI tool that streamlines the process of processingimages in a directory, generating captions, and saving them as text files.\nAdditionally, it provides functionalities to create a JSONL file from images in the directory you specify. Images will be captioned using the Microsoft Florence-2-large model and the ONNX Runtime engine. Images are resized to multiple sizes for better captioning results. [1024, 768, 672, 512]. The WD14 model is used for captioning all images based on a modified version of the selected tags it was trained on.\n\n\n![screenshot](screen.png)\n\n## Why\n\nI wanted to create a tool that would allow me to process images in bulk quickly and efficiently for using in generative art projects. This tool\nallows me to generate captions for images that I can use as training data captions for my training LORAs (Large OpenAI Research Agents) and other\ngenerative models.\n\n\n## Installation (Recommended)\n\nYou have a couple of options for installing/running the tool:\n\n### Install [pipx](https://pipxproject.github.io/pipx/installation/), then run the tool with the following command\n\n```bash\npipx install ecko-cli\n```\n\n### Alternatively, you can install using `pip`\n\n```bash\npip install .\n```\n\n## Configuration\n\n> [!IMPORTANT]\n> Before using the tool, It's required to set up a `.env` file in the parent directory of the script or your home user dir [windows] or `$HOME/.config/civitai-cli-manager/.env`\n\nThe application intelligently locates your `.env` file, accommodating various platforms like Windows and Linux, or defaulting to the current directory.\n\n## Usage // Available Commands\n\nOnce installed via pipx or pip:\n\n```\necko-cli process-images /path/to/images watercolors --padding 4\n```\n```\necko-cli process-images /path/to/images doors --is_object True\n```\n```\necko-cli process-images /path/to/images doors --trigger WORD\n```\n```\necko-cli create-jsonl /path/to/images [dataset]\n```\n\n\n## Dependencies\n\nThis tool requires Python 3.11 or higher and has the following dependencies:\n\n```bash\n\"typer\",\n\"rich\",\n\"shellingham\",\n\"python-dotenv\",\n\"onnxruntime-gpu\",\n\"numpy\",\n\"pandas\",\n\"torch\",\n\"pillow\",\n\"einops\"\n\"transformers\"\n\"timm\"\n\"huggingface_hub[cli]\"\n```\n\n### Contact\n\nFor any inquiries, feedback, or suggestions, please feel free to open an issue on this repository.\n\n### License\n\nThis project is licensed under the [MIT License](LICENSE).\n\n---\n",
    "bugtrack_url": null,
    "license": "Copyright 2024 Regi E Ellis  Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the \u201cSoftware\u201d), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:  The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.  THE SOFTWARE IS PROVIDED \u201cAS IS\u201d, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.",
    "summary": "CLI tool that easily converts a directory of images into a dataset for training generative ai models",
    "version": "1.2.0",
    "project_urls": {
        "Bug Tracker": "https://github.com/regiellis/ecko-cli/issues",
        "Documentation": "https://github.com/regiellis/ecko-cli/blob/main/README.md",
        "Repository": "https://github.com/regiellis/ecko-cli"
    },
    "split_keywords": [
        "ai",
        " cli",
        " dataset",
        " florence-2",
        " generative-ai",
        " huggingface",
        " image",
        " image-classificatioon",
        " image-processing",
        " onnx",
        " timm",
        " torch",
        " torchvision",
        " training",
        " transformers"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "87b2af72940e06bbdb6315bb7958629c5e91529f7f27586bdc6ef77b1188c2e1",
                "md5": "d99d245f7c02b6809418615c6d5dd49b",
                "sha256": "ad91aa7e6dd788bfac7316970a6153ec87d13d6abdde3c0fbd3ec0b09852a9e4"
            },
            "downloads": -1,
            "filename": "ecko_cli-1.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "d99d245f7c02b6809418615c6d5dd49b",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.11",
            "size": 262956,
            "upload_time": "2024-09-23T12:57:33",
            "upload_time_iso_8601": "2024-09-23T12:57:33.940511Z",
            "url": "https://files.pythonhosted.org/packages/87/b2/af72940e06bbdb6315bb7958629c5e91529f7f27586bdc6ef77b1188c2e1/ecko_cli-1.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "9eb9bd3b45dd5ad01bd1b2ec3b40b7c068eced69b573317e2c870e2dfababa07",
                "md5": "d1cb382bdb0c428600b323553f20f252",
                "sha256": "7987da98c1a9853b96de1cccaaf2f5593eaaa7ee703b67312f72db729e2f9a30"
            },
            "downloads": -1,
            "filename": "ecko_cli-1.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "d1cb382bdb0c428600b323553f20f252",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.11",
            "size": 261436,
            "upload_time": "2024-09-23T12:57:36",
            "upload_time_iso_8601": "2024-09-23T12:57:36.169194Z",
            "url": "https://files.pythonhosted.org/packages/9e/b9/bd3b45dd5ad01bd1b2ec3b40b7c068eced69b573317e2c870e2dfababa07/ecko_cli-1.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-09-23 12:57:36",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "regiellis",
    "github_project": "ecko-cli",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "ecko-cli"
}
        
Elapsed time: 0.44389s