gpt-computer-agent

Name	gpt-computer-agent
Version	1.0.2
home_page	https://github.com/khulnasoft/gpt-computer-agent
Summary	GPT
upload_time	2024-07-28 06:33:49
maintainer	None
docs_url	None
author	KhulnaSoft DevOps
requires_python	>=3.9
license	MIT
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

# GPT Computer Agent
GPT Computer Agent is an alternative that brings the ChatGPT macOS app experience to Windows and Linux. It is a fresh and stable implementation. For now you can install it as a Python library; we plan to prepare a pipeline that provides native install scripts (.exe).

Powered by <a href="https://github.com/Upsonic/Tiger"><strong>Upsonic Tiger 🐅</strong></a>, a function hub for LLM agents.




## Installation and Run
Requires Python 3.9 or later.
```console
pip3 install 'gpt-computer-agent[base]'
```

```console
computeragent
```
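
If you are unsure whether your interpreter meets the requirement, a small check like the one below can be run first. This is only a sketch: `MIN_PYTHON` and `python_is_supported` are illustrative names that are not part of the package; the version bound comes from the package metadata (`requires_python >=3.9`).

```python
import sys

# Minimum version taken from the package metadata (requires_python >=3.9).
MIN_PYTHON = (3, 9)


def python_is_supported(version_info=None) -> bool:
    """Return True when the given (or running) interpreter is new enough."""
    # Compare only (major, minor); patch level does not matter here.
    current = tuple((version_info or sys.version_info)[:2])
    return current >= MIN_PYTHON


if __name__ == "__main__":
    if python_is_supported():
        print("Python version OK; run: pip3 install 'gpt-computer-agent[base]'")
    else:
        print("Python 3.9 or later is required.")
```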

### Wake Word | NEW
<details>


We have added Pvporcupine integration. To use it, you need to install an additional library:

```console
pip3 install 'gpt-computer-agent[wakeword]'
```

After that, please enter your [Pvporcupine](https://picovoice.ai/) API key and enable the wake word feature.
</details>



### Agent Infrastructure

This lets you create `crewai` agents and use them in the gpt-computer-agent GUI and tools.


```console
pip3 install 'gpt-computer-agent[base]'
pip3 install 'gpt-computer-agent[agentic]'
```

```python
from gpt_computer_agent import Agent, start

manager = Agent(
  role='Project Manager',
  goal='understands project needs and assist coder',
  backstory="""You're a manager at a large company.""",
)

coder = Agent(
  role='Senior Python Coder',
  goal='writing python scripts and copying to clipboard',
  backstory="""You're a python developer at a large company.""",
)


start()
```
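
When you define several agents, it can help to keep their definitions as plain data and validate them before constructing anything. The sketch below assumes `Agent` accepts exactly the `role`, `goal`, and `backstory` keyword arguments shown above; `AgentSpec` and `to_kwargs` are hypothetical helpers and not part of the package.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class AgentSpec:
    """Hypothetical container for the three fields used in the example above."""

    role: str
    goal: str
    backstory: str

    def to_kwargs(self) -> dict:
        """Return the fields as keyword arguments, rejecting blank values."""
        kwargs = asdict(self)
        for name, value in kwargs.items():
            if not value.strip():
                raise ValueError(f"{name} must not be empty")
        return kwargs


SPECS = [
    AgentSpec(
        role="Project Manager",
        goal="understands project needs and assists the coder",
        backstory="You're a manager at a large company.",
    ),
    AgentSpec(
        role="Senior Python Coder",
        goal="writing python scripts and copying to clipboard",
        backstory="You're a python developer at a large company.",
    ),
]

# With the package installed, the specs could be turned into agents like this
# (assumption: Agent takes these keyword arguments, as in the example above):
#
#   from gpt_computer_agent import Agent, start
#   agents = [Agent(**spec.to_kwargs()) for spec in SPECS]
#   start()
```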



### Adding Custom Tools

You can now add custom tools that run in the agentic infrastructure and agent processes.


```python
from gpt_computer_agent import Tool, start

@Tool
def sum_tool(first_number: int, second_number: int) -> str:
    """Useful for when you need to sum two numbers together."""
    # Convert to str so the return value matches the annotated return type.
    return str(first_number + second_number)

start()
```
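
Because a tool is an ordinary Python function, you can write and test it on its own before registering it. The sketch below uses a hypothetical `count_words` tool; the registration lines are left as comments and assume that `Tool` works as a plain decorator, as in the example above.

```python
def count_words(text: str) -> str:
    """Useful for when you need to count the words in a piece of text."""
    # split() with no arguments splits on any run of whitespace.
    return str(len(text.split()))


# Quick local check, no agent required.
print(count_words("copy this script to the clipboard"))

# Registration (assumption: Tool is applied as a plain decorator, as above):
#
#   from gpt_computer_agent import Tool, start
#   count_words = Tool(count_words)
#   start()
```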






### API | NEW

You can now use your GPT Computer Agent remotely. The GUI stays active. It takes a few steps:

```console
pip3 install 'gpt-computer-agent[base]'
pip3 install 'gpt-computer-agent[api]'
```

```console
computeragent --api
```


```python
from gpt_computer_agent.remote import remote

output = remote.input("Hi, how are you today?", screen=False, talk=False)
print(output)

remote.just_screenshot()

remote.talk("TTS test")

# Other Functionalities
remote.reset_memory()
remote.profile("default")

remote.enable_predefined_agents()
remote.disable_predefined_agents()

remote.enable_online_tools()
remote.disable_online_tools()
```
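
If you want to send several prompts in a row, a thin wrapper around `input` may be convenient. The following is a minimal sketch that relies only on the `input(message, screen=..., talk=...)` call shown above; `ask_all`, `RemoteLike`, and `EchoClient` are illustrative names, and `EchoClient` is a stand-in so the code runs without a running agent.

```python
from typing import Iterable, List, Protocol


class RemoteLike(Protocol):
    """Anything exposing input() the way `remote` is used above."""

    def input(self, message: str, screen: bool = False, talk: bool = False): ...


def ask_all(client: RemoteLike, prompts: Iterable[str]) -> List[str]:
    """Send each prompt as text only (no screenshot, no speech) and collect replies."""
    replies = []
    for prompt in prompts:
        reply = client.input(prompt, screen=False, talk=False)
        replies.append(str(reply))
    return replies


class EchoClient:
    """Stand-in client that echoes the prompt back."""

    def input(self, message, screen=False, talk=False):
        return f"echo: {message}"


print(ask_all(EchoClient(), ["Hi", "How are you today?"]))
```

With the API running (`computeragent --api`), you would presumably pass the real `remote` object from `gpt_computer_agent.remote` instead of `EchoClient()`.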







## Roadmap
| Feature                                                                   | Status                                                 | Target Release |
|---------------------------------------------------------------------------|--------------------------------------------------------|----------------|
| Clear Chat History                                                        | Completed                                              | Q2 2024        |
| Long Audio Support (Split 20 MB)                                          | Completed                                              | Q2 2024        |
| Text Inputs                                                               | Completed                                              | Q2 2024        |
| Just Text Mode (Mute Speech)                                              | Completed                                              | Q2 2024        |
| Added Profiles (Different Chats)                                          | Completed                                              | Q2 2024        |
| More Feedback About Agent Status                                          | Completed                                              | Q2 2024        |
| Local Model Vision and Text (With Ollama and vision models)               | Completed                                              | Q2 2024        |
| **Our Customizable Agent Infrastructure**                                 | Completed                                              | Q2 2024        |
| Supporting Groq Models                                                    | Completed                                              | Q2 2024        |
| **Adding Custom Tools**                                                   | Completed                                              | Q2 2024        |
| Click on something on the screen (text and icon)                          | Completed                                              | Q2 2024        |
| New UI                                                                    | Completed                                              | Q2 2024        |
| Native Applications (exe, dmg)                                            | Failed (Agentic Infra libraries not supported for now) | Q2 2024        |
| **Collaborative Speaking with Different Voice Models on Long Responses**  | Completed                                              | Q2 2024        |
| **Auto Stop Recording When You Finish Talking**                           | Completed                                              | Q2 2024        |
| **Wake Word**                                                             | Completed                                              | Q2 2024        |
| **Continuous Conversations**                                              | Completed                                              | Q2 2024        |
| **Adding More Capabilities on Device**                                    | Planned                                                | Q2 2024        |
| DeepFace Integration (Facial Recognition)                                 | Planned                                                | Q2 2024        |







## Capabilities
We already have many infrastructure elements in place. Our aim is to provide everything that the ChatGPT app already offers.

| Capability                         | Status                      |
|------------------------------------|----------------------------------|
| **Screen Read**                    |            OK                    |
| **Click on Text or an Icon on the Screen**                    |            OK                    |
| **Move to Text or an Icon on the Screen**                    |            OK                    |
| **Typing Something**                    |            OK                    |
| **Pressing Any Key**                    |            OK                    |
| **Scrolling**                    |            OK                    |
| **Microphone**                     |            OK                    |
| **System Audio**                  |            OK                    |
| **Memory**                         |            OK                    |
| **Open and Close App**             |            OK                    |
| **Open a URL**                     |            OK                    |
| **Clipboard**                       |            OK                    |
| **Search Engines**                 |            OK                    |
| **Writing and running Python**     |            OK                    |
| **Writing and running SH**    |            OK                    |
| **Using your Telegram Account**    |            OK                    |
| **Knowledge Management**           |            OK                    |
| **[Add more tools](https://github.com/khulnasoft/gpt-computer-agent/blob/master/gpt_computer_agent/standard_tools.py)**           |            ?                    |

### Predefined Agents
If you enable this feature, your agent will work with these teams:

| Team Name                         | Status                      |
|------------------------------------|----------------------------------|
| **search_on_internet_and_report_team**                    |            OK                    |
| **generate_code_with_aim_team_**                    |            OK                    |
| **[Add your own one](https://github.com/khulnasoft/gpt-computer-agent/blob/master/gpt_computer_agent/teams.py)**                    |            ?                    |
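
The remote API shown earlier exposes `enable_predefined_agents()` and `disable_predefined_agents()`. One way to make sure the teams are switched off again after a task is a small context manager. This is a sketch under that assumption; `predefined_agents` and `RecordingClient` are illustrative names, and `RecordingClient` is a stand-in so the code runs without the package.

```python
from contextlib import contextmanager


@contextmanager
def predefined_agents(client):
    """Enable the predefined teams for the duration of a `with` block."""
    client.enable_predefined_agents()
    try:
        yield client
    finally:
        # Always switch the teams off again, even if the body raised.
        client.disable_predefined_agents()


class RecordingClient:
    """Stand-in that records calls instead of contacting a running agent."""

    def __init__(self):
        self.calls = []

    def enable_predefined_agents(self):
        self.calls.append("enable")

    def disable_predefined_agents(self):
        self.calls.append("disable")

    def input(self, message, screen=False, talk=False):
        self.calls.append(f"input:{message}")
        return "ok"


client = RecordingClient()
with predefined_agents(client) as c:
    c.input("Search the web and write a short report", screen=False, talk=False)
print(client.calls)
```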


## Contributors

<a href="https://github.com/khulnasoft/gpt-computer-agent/graphs/contributors">
  <img src="https://contrib.rocks/image?repo=khulnasoft/gpt-computer-agent" />
</a>

Raw data

{
    "_id": null,
    "home_page": "https://github.com/khulnasoft/gpt-computer-agent",
    "name": "gpt-computer-agent",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": null,
    "keywords": null,
    "author": "KhulnaSoft DevOps",
    "author_email": "info@khulnasoft.com",
    "download_url": "https://files.pythonhosted.org/packages/d4/76/09284a5b878e63bcba6561736f672340db8dba8c94dc6fc2e11ae571b2f8/gpt_computer_agent-1.0.2.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "GPT",
    "version": "1.0.2",
    "project_urls": {
        "Homepage": "https://github.com/khulnasoft/gpt-computer-agent"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d47609284a5b878e63bcba6561736f672340db8dba8c94dc6fc2e11ae571b2f8",
                "md5": "d2885a6bf77ef1dd4a413d696c69c3d6",
                "sha256": "9461921f7931845c57530b0c70636af34183e6fd79ab2884a044db936154891e"
            },
            "downloads": -1,
            "filename": "gpt_computer_agent-1.0.2.tar.gz",
            "has_sig": false,
            "md5_digest": "d2885a6bf77ef1dd4a413d696c69c3d6",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 1674875,
            "upload_time": "2024-07-28T06:33:49",
            "upload_time_iso_8601": "2024-07-28T06:33:49.063354Z",
            "url": "https://files.pythonhosted.org/packages/d4/76/09284a5b878e63bcba6561736f672340db8dba8c94dc6fc2e11ae571b2f8/gpt_computer_agent-1.0.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-07-28 06:33:49",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "khulnasoft",
    "github_project": "gpt-computer-agent",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "gpt-computer-agent"
}
