agentdesk_dg


Nameagentdesk_dg JSON
Version 0.2.93 PyPI version JSON
download
home_pageNone
SummaryA desktop for AI agents
upload_time2024-08-26 13:03:57
maintainerNone
docs_urlNone
authorPatrick Barker
requires_python<4.0,>=3.10
licenseMIT
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            <!-- PROJECT LOGO -->
<br />
<p align="center">
  <!-- <a href="https://github.com/agentsea/skillpacks">
    <img src="https://project-logo.png" alt="Logo" width="80">
  </a> -->

  <h1 align="center">AgentDesk</h1>

  <p align="center">
    Desktops for AI agents &nbsp; :computer:
    <br />
    <a href="https://docs.hub.agentsea.ai/agentdesk/intro"><strong>Explore the docs »</strong></a>
    <br />
    <br />
    <a href="https://youtu.be/exoOUUwFRB8">View Demo</a>
    ·
    <a href="https://github.com/agentsea/agentdesk/issues">Report Bug</a>
    ·
    <a href="https://github.com/agentsea/agentdesk/issues">Request Feature</a>
  </p>
  <br>
</p>

AgentDesk provides full-featured desktop environments which can be programatically controlled by AI agents. 
Spin them up locally or in the cloud.

▶ Built on [agentd](https://github.com/agentsea/agentd) a runtime daemon which exposes a REST API for interacting with the desktop.

▶ Implements the [DeviceBay Protocol](https://github.com/agentsea/devicebay).

## Installation

```
pip install agentdesk
```

## Quick Start

```python
from agentdesk import Desktop

# Create a local VM
desktop = Desktop.local()

# Launch the UI for it
desktop.view(background=True)

# Open a browser to Google
desktop.open_url("https://google.com")

# Take actions on the desktop
desktop.move_mouse(500, 500)
desktop.click()
img = desktop.take_screenshot()
```

## Usage

### Create a local desktop

```python
from agentdesk import Desktop

desktop = Desktop.local()
```

```bash
$ agentdesk create --provider qemu
```

_\*requires [qemu](https://www.qemu.org/)_

### Create a remote desktop on GCE

```python
desktop = Desktop.gce()
```

```bash
$ agentdesk create --provider gce
```

### Create a remote desktop on EC2

```python
desktop = Desktop.ec2()
```

```bash
$ agentdesk create --provider ec2
```

### View the desktop in the UI

```python
desktop.view()
```

```bash
$ agentdesk view old_mckinny
```

_\*requires docker_

### List desktops

```python
Desktop.find()
```

```bash
$ agentdesk get
```

### Delete a desktop

```python
Desktop.delete("old_mckinny")
```

```bash
$ agentdesk delete old_mckinny
```

### Use the desktop

```python
desktop.open_url("https://google.com")

coords = desktop.mouse_coordinates()

desktop.move_mouse(500, 500)

desktop.click()

desktop.type_text("What kind of ducks are in Canada?")

desktop.press_key('Enter')

desktop.scroll()

img = desktop.take_screenshot()
```

### Processors

Process images to make them more accessible to LMMs.

#### Grid

Add a coordinate grid on top of the image

```python
from agentdesk.processors import GridProcessor

img = desktop.take_screenshot()

processor = GridProcessor()
grid_img = processor.process_b64(img)
```

## Examples

### Drawing Bot

See how to use a web-based drawing app with AgentDesk in our [notebook](./examples/drawing/demo.ipynb).

### GPT-4V

See how to use GPT-4V with AgentDesk in our [notebook](./examples/gpt4v/note.ipynb) or [agent](./examples/gpt4v/main.py).

## Community

Come join us on [Discord](https://discord.gg/hhaq7XYPS6).

## Developing

Please open an issue before creating a PR.

Changes to the VM happen in [agentd](https://github.com/agentsea/agentd).

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "agentdesk_dg",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.10",
    "maintainer_email": null,
    "keywords": null,
    "author": "Patrick Barker",
    "author_email": "patrickbarkerco@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/b0/e8/edf6defc845a2be0abf21f94549656755d96067a8b9ff469ca4d54ea0456/agentdesk_dg-0.2.93.tar.gz",
    "platform": null,
    "description": "<!-- PROJECT LOGO -->\n<br />\n<p align=\"center\">\n  <!-- <a href=\"https://github.com/agentsea/skillpacks\">\n    <img src=\"https://project-logo.png\" alt=\"Logo\" width=\"80\">\n  </a> -->\n\n  <h1 align=\"center\">AgentDesk</h1>\n\n  <p align=\"center\">\n    Desktops for AI agents &nbsp; :computer:\n    <br />\n    <a href=\"https://docs.hub.agentsea.ai/agentdesk/intro\"><strong>Explore the docs \u00bb</strong></a>\n    <br />\n    <br />\n    <a href=\"https://youtu.be/exoOUUwFRB8\">View Demo</a>\n    \u00b7\n    <a href=\"https://github.com/agentsea/agentdesk/issues\">Report Bug</a>\n    \u00b7\n    <a href=\"https://github.com/agentsea/agentdesk/issues\">Request Feature</a>\n  </p>\n  <br>\n</p>\n\nAgentDesk provides full-featured desktop environments which can be programatically controlled by AI agents. \nSpin them up locally or in the cloud.\n\n\u25b6 Built on [agentd](https://github.com/agentsea/agentd) a runtime daemon which exposes a REST API for interacting with the desktop.\n\n\u25b6 Implements the [DeviceBay Protocol](https://github.com/agentsea/devicebay).\n\n## Installation\n\n```\npip install agentdesk\n```\n\n## Quick Start\n\n```python\nfrom agentdesk import Desktop\n\n# Create a local VM\ndesktop = Desktop.local()\n\n# Launch the UI for it\ndesktop.view(background=True)\n\n# Open a browser to Google\ndesktop.open_url(\"https://google.com\")\n\n# Take actions on the desktop\ndesktop.move_mouse(500, 500)\ndesktop.click()\nimg = desktop.take_screenshot()\n```\n\n## Usage\n\n### Create a local desktop\n\n```python\nfrom agentdesk import Desktop\n\ndesktop = Desktop.local()\n```\n\n```bash\n$ agentdesk create --provider qemu\n```\n\n_\\*requires [qemu](https://www.qemu.org/)_\n\n### Create a remote desktop on GCE\n\n```python\ndesktop = Desktop.gce()\n```\n\n```bash\n$ agentdesk create --provider gce\n```\n\n### Create a remote desktop on EC2\n\n```python\ndesktop = Desktop.ec2()\n```\n\n```bash\n$ agentdesk create --provider ec2\n```\n\n### View the desktop in the UI\n\n```python\ndesktop.view()\n```\n\n```bash\n$ agentdesk view old_mckinny\n```\n\n_\\*requires docker_\n\n### List desktops\n\n```python\nDesktop.find()\n```\n\n```bash\n$ agentdesk get\n```\n\n### Delete a desktop\n\n```python\nDesktop.delete(\"old_mckinny\")\n```\n\n```bash\n$ agentdesk delete old_mckinny\n```\n\n### Use the desktop\n\n```python\ndesktop.open_url(\"https://google.com\")\n\ncoords = desktop.mouse_coordinates()\n\ndesktop.move_mouse(500, 500)\n\ndesktop.click()\n\ndesktop.type_text(\"What kind of ducks are in Canada?\")\n\ndesktop.press_key('Enter')\n\ndesktop.scroll()\n\nimg = desktop.take_screenshot()\n```\n\n### Processors\n\nProcess images to make them more accessible to LMMs.\n\n#### Grid\n\nAdd a coordinate grid on top of the image\n\n```python\nfrom agentdesk.processors import GridProcessor\n\nimg = desktop.take_screenshot()\n\nprocessor = GridProcessor()\ngrid_img = processor.process_b64(img)\n```\n\n## Examples\n\n### Drawing Bot\n\nSee how to use a web-based drawing app with AgentDesk in our [notebook](./examples/drawing/demo.ipynb).\n\n### GPT-4V\n\nSee how to use GPT-4V with AgentDesk in our [notebook](./examples/gpt4v/note.ipynb) or [agent](./examples/gpt4v/main.py).\n\n## Community\n\nCome join us on [Discord](https://discord.gg/hhaq7XYPS6).\n\n## Developing\n\nPlease open an issue before creating a PR.\n\nChanges to the VM happen in [agentd](https://github.com/agentsea/agentd).\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "A desktop for AI agents",
    "version": "0.2.93",
    "project_urls": null,
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "beeb946faf61d07aef10e90ae4b3858bc195b6f2bf6aff7e066c9f484cf8ecf8",
                "md5": "40e97c0c648b6fab9d5e2b6086bee6e2",
                "sha256": "10755d581f27d4b2ce8e86a68c5234ac1fd4527381df0e84e00f21f3c76efe25"
            },
            "downloads": -1,
            "filename": "agentdesk_dg-0.2.93-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "40e97c0c648b6fab9d5e2b6086bee6e2",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.10",
            "size": 43097,
            "upload_time": "2024-08-26T13:03:54",
            "upload_time_iso_8601": "2024-08-26T13:03:54.602426Z",
            "url": "https://files.pythonhosted.org/packages/be/eb/946faf61d07aef10e90ae4b3858bc195b6f2bf6aff7e066c9f484cf8ecf8/agentdesk_dg-0.2.93-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b0e8edf6defc845a2be0abf21f94549656755d96067a8b9ff469ca4d54ea0456",
                "md5": "c1371b701405b6c371945c3e9c053b3b",
                "sha256": "6934a3cb111de5fae3232ca96cc70d3abe3a0cee723cfb0312566802f2cd54e2"
            },
            "downloads": -1,
            "filename": "agentdesk_dg-0.2.93.tar.gz",
            "has_sig": false,
            "md5_digest": "c1371b701405b6c371945c3e9c053b3b",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.10",
            "size": 34889,
            "upload_time": "2024-08-26T13:03:57",
            "upload_time_iso_8601": "2024-08-26T13:03:57.094757Z",
            "url": "https://files.pythonhosted.org/packages/b0/e8/edf6defc845a2be0abf21f94549656755d96067a8b9ff469ca4d54ea0456/agentdesk_dg-0.2.93.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-08-26 13:03:57",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "agentdesk_dg"
}
        
Elapsed time: 0.34792s