ezsam

| Field | Value |
| --- | --- |
| Name | ezsam |
| Version | 0.3.0 |
| Summary | Extract foreground from images or video via text prompt |
| upload_time | 2024-03-13 03:40:10 |
| docs_url | None |
| requires_python | <3.12,>=3.9 |
| license | AGPL-3.0-only |
| keywords | groundingdino segment-anything sam sam-hq grounded-segment-anything subject object foreground background filter extract select remove prompt subtract |
| requirements | No requirements were recorded. |
| Travis-CI | No Travis. |
| coveralls test coverage | No coveralls. |

# ezsam (easy segment anything model)

A command-line and GUI tool to segment images and video via text prompts.

Input images and videos, describe the subjects or objects you want to keep, and output new images and videos with the background removed.

**Check out the docs! [ezsam.org](https://www.ezsam.org)**

## Why?

Meta's [Segment Anything](https://github.com/facebookresearch/segment-anything) is a powerful tool for separating parts of images,
but it requires coordinate prompts: either bounding boxes or points.
Generating those prompts by hand is tedious for large collections of still images or video.

In contrast, a text prompt describing the foreground object(s) to segment can stay constant across a whole collection.
Inspired by [Grounded-Segment-Anything](https://github.com/IDEA-Research/Grounded-Segment-Anything),
this project tries to package a simpler-to-use tool.

If you're not interested in text-based prompts with Segment Anything, 
check out [rembg](https://github.com/danielgatis/rembg).

## How does it work?

Text prompts are passed to [GroundingDINO](https://github.com/IDEA-Research/GroundingDINO), which detects the matching objects and so selects the foreground.
Image segments are then generated using [Segment Anything](https://github.com/facebookresearch/segment-anything)
or [Segment Anything HQ (SAM-HQ)](https://github.com/SysCV/SAM-HQ).
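
The two stages above can be pictured as a small data flow: a detector turns a text prompt into bounding boxes, a segmenter turns each box into a pixel mask, and the union of those masks becomes the alpha channel of the output. The sketch below is illustrative only. `extract_foreground`, `toy_detector`, and `toy_segmenter` are hypothetical names written for this example, not ezsam, GroundingDINO, or Segment Anything APIs, and the box-to-mask hand-off is an assumption based on Segment Anything accepting bounding boxes as prompts.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]   # (x0, y0, x1, y1), end-exclusive
Mask = List[List[bool]]           # mask[y][x] is True for foreground
Pixel = Tuple[int, int, int]      # (R, G, B)
Image = List[List[Pixel]]


def extract_foreground(
    image: Image,
    prompt: str,
    detect: Callable[[Image, str], List[Box]],
    segment: Callable[[Image, Box], Mask],
) -> List[List[Tuple[int, int, int, int]]]:
    """Return an RGBA image whose alpha is 255 inside any detected object's mask."""
    height, width = len(image), len(image[0])
    combined = [[False] * width for _ in range(height)]
    for box in detect(image, prompt):   # stage 1: text prompt -> boxes
        mask = segment(image, box)      # stage 2: box prompt -> mask
        for y in range(height):
            for x in range(width):
                combined[y][x] = combined[y][x] or mask[y][x]
    return [
        [(*image[y][x], 255 if combined[y][x] else 0) for x in range(width)]
        for y in range(height)
    ]


# Toy stand-ins (hypothetical) so the sketch runs without any model weights.
def toy_detector(image: Image, prompt: str) -> List[Box]:
    # Pretends any non-empty prompt matches the centre 2x2 region.
    return [(1, 1, 3, 3)] if prompt else []


def toy_segmenter(image: Image, box: Box) -> Mask:
    # Marks every pixel inside the box as foreground.
    x0, y0, x1, y1 = box
    return [
        [x0 <= x < x1 and y0 <= y < y1 for x in range(len(image[0]))]
        for y in range(len(image))
    ]


image = [[(10, 20, 30)] * 4 for _ in range(4)]
rgba = extract_foreground(image, "a cat", toy_detector, toy_segmenter)
print(rgba[0][0], rgba[1][1])  # (10, 20, 30, 0) (10, 20, 30, 255)
```

Passing the detector and segmenter in as callables keeps the flow readable and makes it easy to see that the text prompt is only consumed by the first stage.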

## Quick start

```bash
# Ubuntu 22.04, Python 3.9 - 3.11
pip install ezsam
# System packages: ffmpeg and ImageMagick
sudo apt install ffmpeg imagemagick
# List the command line options
ezsam --help
# Start the GUI
ezsam-gui
```
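
The Python range in the comment above matches the package's `requires_python` metadata (`<3.12,>=3.9`). As a small sketch, the check below tests whether an interpreter falls inside that range before running `pip install`. `is_supported` is a hypothetical helper written for this example and is not part of ezsam.

```python
import sys
from typing import Tuple

MIN_VERSION = (3, 9)    # inclusive lower bound, from ">=3.9"
MAX_VERSION = (3, 12)   # exclusive upper bound, from "<3.12"


def is_supported(version: Tuple[int, ...]) -> bool:
    """True when (major, minor) satisfies >=3.9,<3.12."""
    return MIN_VERSION <= tuple(version[:2]) < MAX_VERSION


current = sys.version_info[:2]
status = "supported" if is_supported(current) else "not supported"
print(f"Python {current[0]}.{current[1]} is {status} by ezsam 0.3.0")
```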

For more detail, see the documentation at [ezsam.org](https://www.ezsam.org).

Raw data

{
    "_id": null,
    "home_page": "",
    "name": "ezsam",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "<3.12,>=3.9",
    "maintainer_email": "ae9is <aegis.opensource@gmail.com>",
    "keywords": "GroundingDINO Segment-Anything SAM SAM-HQ Grounded-Segment-Anything subject object foreground background filter extract select remove prompt subtract",
    "author": "",
    "author_email": "ae9is <aegis.opensource@gmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/20/2b/f9ffcfc17f3c5ff5ed3e3f99e4e9d835dbbbd053ccd8d72127b09095a20a/ezsam-0.3.0.tar.gz",
    "platform": null,
    "description": "# ezsam (easy segment anything model)\n\nA command line and gui tool to segment images and video via text prompts.\n\nInput images and videos, describe the subjects or objects you want to keep, and output new images and videos with the background removed.\n\n**Check out the docs! [ezsam.org](https://www.ezsam.org)**\n\n## Why?\n\nMeta's [Segment Anything](https://github.com/facebookresearch/segment-anything) is a powerful tool for separating parts of images,\nbut requires coordinate prompts&mdash;either bounding boxes or points.\nAnd manual prompt generation is tedious for large collections of still images or video.\n\nIn contrast, text-based prompts describing the object(s) in the foreground to segment can be constant.\nInspired by [Grounded-Segment-Anything](https://github.com/IDEA-Research/Grounded-Segment-Anything),\nthis project tries to package a simpler to use tool.\n\nIf you're not interested in text-based prompts with Segment Anything, \ncheck out [rembg](https://github.com/danielgatis/rembg).\n\n## How does it work?\n\nThe foreground is selected using text prompts to [GroundingDINO](https://github.com/IDEA-Research/GroundingDINO) to detect objects.\nImage segments are generated using [Segment Anything](https://github.com/facebookresearch/segment-anything) \nor [Segment Anything HQ (SAM-HQ)](https://github.com/SysCV/SAM-HQ).\n\n## Quick start\n\n```bash\n# Ubuntu 22.04, Python 3.9 - 3.11\npip install ezsam\nsudo apt install ffmpeg imagemagick\nezsam --help\nezsam-gui\n```\n\nFor more detailed info, see the documentation site here: [ezsam.org](https://www.ezsam.org)\n",
    "bugtrack_url": null,
    "license": "AGPL-3.0-only",
    "summary": "Extract foreground from images or video via text prompt",
    "version": "0.3.0",
    "project_urls": {
        "Bug reports": "https://github.com/ae9is/ezsam/issues",
        "Homepage": "https://www.ezsam.org",
        "Source": "https://github.com/ae9is/ezsam"
    },
    "split_keywords": [
        "groundingdino",
        "segment-anything",
        "sam",
        "sam-hq",
        "grounded-segment-anything",
        "subject",
        "object",
        "foreground",
        "background",
        "filter",
        "extract",
        "select",
        "remove",
        "prompt",
        "subtract"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "01119960cb3437b264b7d63f0b68b40ce63977364bd33b0ff3a8b861919f72bb",
                "md5": "e2915939b0ac1d177a9870a2268bb9a0",
                "sha256": "aa9d81d75df7b21de20cfcc5796602700203ed5c2f6755499820e30753a17e9a"
            },
            "downloads": -1,
            "filename": "ezsam-0.3.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "e2915939b0ac1d177a9870a2268bb9a0",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<3.12,>=3.9",
            "size": 34034,
            "upload_time": "2024-03-13T03:40:08",
            "upload_time_iso_8601": "2024-03-13T03:40:08.532081Z",
            "url": "https://files.pythonhosted.org/packages/01/11/9960cb3437b264b7d63f0b68b40ce63977364bd33b0ff3a8b861919f72bb/ezsam-0.3.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "202bf9ffcfc17f3c5ff5ed3e3f99e4e9d835dbbbd053ccd8d72127b09095a20a",
                "md5": "eefcf538f0122a3bd2d83f878d45c786",
                "sha256": "a162fc8b22586225857ef3fed4003fb504e2515c7ce84c00f064f81e93288004"
            },
            "downloads": -1,
            "filename": "ezsam-0.3.0.tar.gz",
            "has_sig": false,
            "md5_digest": "eefcf538f0122a3bd2d83f878d45c786",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<3.12,>=3.9",
            "size": 76017,
            "upload_time": "2024-03-13T03:40:10",
            "upload_time_iso_8601": "2024-03-13T03:40:10.411471Z",
            "url": "https://files.pythonhosted.org/packages/20/2b/f9ffcfc17f3c5ff5ed3e3f99e4e9d835dbbbd053ccd8d72127b09095a20a/ezsam-0.3.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-13 03:40:10",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "ae9is",
    "github_project": "ezsam",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "ezsam"
}
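
The `digests` entries above can be used to check a downloaded file before installing it. The following is a minimal sketch using only the standard library; `sha256_matches` is a hypothetical helper written for this example, not something ezsam or PyPI provides.

```python
import hashlib
import os
import tempfile


def sha256_matches(path: str, expected_hex: str, chunk_size: int = 65536) -> bool:
    """Hash the file at `path` in chunks and compare with the expected hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex.strip().lower()


# Self-check on a throwaway file, so the sketch runs without downloading anything.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"example payload")
    tmp_path = tmp.name
try:
    matches = sha256_matches(tmp_path, hashlib.sha256(b"example payload").hexdigest())
    mismatch = sha256_matches(tmp_path, "0" * 64)
finally:
    os.remove(tmp_path)
print(matches, mismatch)  # True False
```

For the wheel listed above, the call would be `sha256_matches("ezsam-0.3.0-py3-none-any.whl", "aa9d81d75df7b21de20cfcc5796602700203ed5c2f6755499820e30753a17e9a")`, assuming the file has been downloaded to the current directory.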