HoloSTT

Name	HoloSTT JSON
Version	0.2.3 JSON
	download
home_page	None
Summary	Modern Speech Recognition with both active and ambient listening and keyboard
upload_time	2025-09-14 17:23:17
maintainer	None
docs_url	None
author	Tristan McBride Sr.
requires_python	>=3.10
license	None
keywords	ai agents speech recognition productivity automation
VCS
bugtrack_url
requirements	No requirements were recorded.
Travis-CI	No Travis.
coveralls test coverage	No coveralls.

            
---

# HoloSTT

## Overview

**HoloSTT** is a modern, thread-safe speech recognition and input manager for Python applications.
It unifies active speech, ambient listening, and keyboard input into a single, production-ready interface for any AI-driven project.

**Highlights:**

* **Multi-modal input:** Seamlessly combine voice (active/ambient) and keyboard input.
* **No vendor lock-in:** Works with any skill set, AI stack, or backend logic.
* **Advanced audio handling:** Adaptive noise management, energy thresholds, platform-aware volume support.
* **Thread-safe singleton:** Designed for multi-threaded and interactive desktop, assistant, and automation apps.
* **Real-time text processing:** Built-in cleaning, comparison, and input filtering utilities.

---

## Why HoloSTT?

Typical speech recognition modules often come with limitations:

* Limited to just microphone input, or just keyboard.
* Lack of fallback modes, or forced use of a single speech engine.
* Not suitable for multi-threaded apps, or lacking input state management.

**HoloSTT** solves these problems by:

* Offering both **active (“push-to-talk”)** and **ambient (always-on)** listening, with instant keyboard fallback.
* Providing a **centralized, extensible interface** for all input and recognition events.
* Supporting robust error handling, state tracking, and dynamic configuration—ready for modern AI workflows.

---

## Key Features

* **Flexible Audio Capture:**
  Toggle between active/ambient listening or switch to keyboard input instantly.

* **Dynamic Noise and Volume Handling:**
  Automatically adapts to background noise and adjusts thresholds for clear, accurate recognition.

* **Centralized State & Properties:**
  Track commands, input modes, timeouts, and all key settings in one place.

* **Customizable Input Processing:**
  Clean and filter input text, apply replacements, compare similarity, and trigger actions.

* **Robust Error Handling:**
  Handles microphone, audio, and network exceptions gracefully.

* **Production-Ready:**
  Built for real-world, scalable AI systems.

---

## How It Works

1. **Call the HoloSTT manager** in your application.
2. **Select input mode:** active voice, ambient voice, or keyboard.
3. **Process and clean recognized text** automatically.
4. **Use output directly** for commands, automations, or AI skills.

---

## FAQ

**Q: Does HoloSTT require a specific folder or class naming?**
A: No. Organize your project and input logic as you see fit.

**Q: Can I use my own text cleaning or filtering?**
A: Yes. HoloSTT exposes all processing utilities and is easy to extend.

**Q: Is it production-ready and thread-safe?**
A: Yes. The singleton implementation ensures safe multi-threaded operation.

---

## Code Examples

You can find code examples on my [GitHub repository](https://github.com/TristanMcBrideSr/TechBook).

---

## License

This project is licensed under the [Apache License, Version 2.0](LICENSE).
Copyright 2025 Tristan McBride Sr.

---

## Authors
- Tristan McBride Sr.
- Sybil

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "HoloSTT",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": "\"Tristan McBride Sr.\" <142635792+TristanMcBrideSr@users.noreply.github.com>",
    "keywords": "AI, Agents, Speech Recognition, Productivity, Automation",
    "author": "Tristan McBride Sr.",
    "author_email": "\"Tristan McBride Sr.\" <142635792+TristanMcBrideSr@users.noreply.github.com>",
    "download_url": "https://files.pythonhosted.org/packages/f7/65/b9af8f047c8f1311a6e920621518a3c5988fc9a8811a1a55672cf16cadef/holostt-0.2.3.tar.gz",
    "platform": null,
    "description": "\ufeff\r\n---\r\n\r\n# HoloSTT\r\n\r\n## Overview\r\n\r\n**HoloSTT** is a modern, thread-safe speech recognition and input manager for Python applications.\r\nIt unifies active speech, ambient listening, and keyboard input into a single, production-ready interface for any AI-driven project.\r\n\r\n**Highlights:**\r\n\r\n* **Multi-modal input:** Seamlessly combine voice (active/ambient) and keyboard input.\r\n* **No vendor lock-in:** Works with any skill set, AI stack, or backend logic.\r\n* **Advanced audio handling:** Adaptive noise management, energy thresholds, platform-aware volume support.\r\n* **Thread-safe singleton:** Designed for multi-threaded and interactive desktop, assistant, and automation apps.\r\n* **Real-time text processing:** Built-in cleaning, comparison, and input filtering utilities.\r\n\r\n---\r\n\r\n## Why HoloSTT?\r\n\r\nTypical speech recognition modules often come with limitations:\r\n\r\n* Limited to just microphone input, or just keyboard.\r\n* Lack of fallback modes, or forced use of a single speech engine.\r\n* Not suitable for multi-threaded apps, or lacking input state management.\r\n\r\n**HoloSTT** solves these problems by:\r\n\r\n* Offering both **active (\u201cpush-to-talk\u201d)** and **ambient (always-on)** listening, with instant keyboard fallback.\r\n* Providing a **centralized, extensible interface** for all input and recognition events.\r\n* Supporting robust error handling, state tracking, and dynamic configuration\u2014ready for modern AI workflows.\r\n\r\n---\r\n\r\n## Key Features\r\n\r\n* **Flexible Audio Capture:**\r\n  Toggle between active/ambient listening or switch to keyboard input instantly.\r\n\r\n* **Dynamic Noise and Volume Handling:**\r\n  Automatically adapts to background noise and adjusts thresholds for clear, accurate recognition.\r\n\r\n* **Centralized State & Properties:**\r\n  Track commands, input modes, timeouts, and all key settings in one place.\r\n\r\n* **Customizable Input Processing:**\r\n  Clean and filter input text, apply replacements, compare similarity, and trigger actions.\r\n\r\n* **Robust Error Handling:**\r\n  Handles microphone, audio, and network exceptions gracefully.\r\n\r\n* **Production-Ready:**\r\n  Built for real-world, scalable AI systems.\r\n\r\n---\r\n\r\n## How It Works\r\n\r\n1. **Call the HoloSTT manager** in your application.\r\n2. **Select input mode:** active voice, ambient voice, or keyboard.\r\n3. **Process and clean recognized text** automatically.\r\n4. **Use output directly** for commands, automations, or AI skills.\r\n\r\n---\r\n\r\n## FAQ\r\n\r\n**Q: Does HoloSTT require a specific folder or class naming?**\r\nA: No. Organize your project and input logic as you see fit.\r\n\r\n**Q: Can I use my own text cleaning or filtering?**\r\nA: Yes. HoloSTT exposes all processing utilities and is easy to extend.\r\n\r\n**Q: Is it production-ready and thread-safe?**\r\nA: Yes. The singleton implementation ensures safe multi-threaded operation.\r\n\r\n---\r\n\r\n## Code Examples\r\n\r\nYou can find code examples on my [GitHub repository](https://github.com/TristanMcBrideSr/TechBook).\r\n\r\n---\r\n\r\n## License\r\n\r\nThis project is licensed under the [Apache License, Version 2.0](LICENSE).\r\nCopyright 2025 Tristan McBride Sr.\r\n\r\n---\r\n\r\n## Authors\r\n- Tristan McBride Sr.\r\n- Sybil\r\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Modern Speech Recognition with both active and ambient listening and keyboard",
    "version": "0.2.3",
    "project_urls": {
        "Homepage": "https://github.com/TristanMcBrideSr"
    },
    "split_keywords": [
        "ai",
        " agents",
        " speech recognition",
        " productivity",
        " automation"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "b3715f595ead6c48b205b06ef0368371832d4a06bd11f5fdc64dd878676fb4b7",
                "md5": "cd6228e5b2fc35d0d79a0a7dc5d0ef8a",
                "sha256": "cece209b84e180552b69056f3742100a9329f3de4223cdb4ba1dc7400dc246b0"
            },
            "downloads": -1,
            "filename": "holostt-0.2.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "cd6228e5b2fc35d0d79a0a7dc5d0ef8a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 11292,
            "upload_time": "2025-09-14T17:23:16",
            "upload_time_iso_8601": "2025-09-14T17:23:16.806089Z",
            "url": "https://files.pythonhosted.org/packages/b3/71/5f595ead6c48b205b06ef0368371832d4a06bd11f5fdc64dd878676fb4b7/holostt-0.2.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f765b9af8f047c8f1311a6e920621518a3c5988fc9a8811a1a55672cf16cadef",
                "md5": "ff08207e8406e2f4d79d18c211b3b940",
                "sha256": "dbbc1bfce30f9bd728915e0604a896f51134acfba0c4272258d64052b2c93b8c"
            },
            "downloads": -1,
            "filename": "holostt-0.2.3.tar.gz",
            "has_sig": false,
            "md5_digest": "ff08207e8406e2f4d79d18c211b3b940",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 11595,
            "upload_time": "2025-09-14T17:23:17",
            "upload_time_iso_8601": "2025-09-14T17:23:17.987172Z",
            "url": "https://files.pythonhosted.org/packages/f7/65/b9af8f047c8f1311a6e920621518a3c5988fc9a8811a1a55672cf16cadef/holostt-0.2.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-09-14 17:23:17",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "holostt"
}

Tristan McBride Sr.