HoloSTT


| Field | Value |
| --- | --- |
| Name | HoloSTT |
| Version | 0.1.0 |
| Summary | Modern Speech Recognition with both active and ambient listening and keyboard |
| Upload time | 2025-08-04 02:45:11 |
| Author | Tristan McBride Sr. |
| Requires Python | >=3.10 |
| Home page | None |
| Maintainer | None |
| Docs URL | None |
| License | None |
| Keywords | ai, agents, speech recognition, productivity, automation |
| Requirements | No requirements were recorded. |
            
---

# HoloSTT

## Overview

**HoloSTT** is a modern, thread-safe speech recognition and input manager for Python applications.
It unifies active speech, ambient listening, and keyboard input into a single, production-ready interface for any AI-driven project.

**Highlights:**

* **Multi-modal input:** Seamlessly combine voice (active/ambient) and keyboard input.
* **No vendor lock-in:** Works with any skill set, AI stack, or backend logic.
* **Advanced audio handling:** Adaptive noise management, energy thresholds, platform-aware volume support.
* **Thread-safe singleton:** Designed for multi-threaded and interactive desktop, assistant, and automation apps.
* **Real-time text processing:** Built-in cleaning, comparison, and input filtering utilities.
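
To make "adaptive noise management" and "energy thresholds" concrete, here is a minimal sketch of one common technique: an energy gate whose threshold drifts toward the ambient noise level during quiet frames. It is an illustration only, not HoloSTT's implementation. The names `rms` and `AdaptiveThreshold` and the parameters `initial`, `damping`, and `ratio` are assumptions made for this example.

```python
import math


def rms(samples):
    """Root-mean-square energy of one frame of PCM samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


class AdaptiveThreshold:
    """Hypothetical energy gate; names and defaults are illustrative."""

    def __init__(self, initial=300.0, damping=0.85, ratio=1.5):
        self.threshold = initial  # current speech/noise boundary
        self.damping = damping    # how slowly the threshold moves (0..1)
        self.ratio = ratio        # speech must exceed ambient energy by this factor

    def is_speech(self, frame):
        energy = rms(frame)
        if energy > self.threshold:
            # Loud frame: treat as speech and leave the threshold alone.
            return True
        # Quiet frame: move the threshold toward ratio * ambient energy.
        target = energy * self.ratio
        self.threshold = self.damping * self.threshold + (1 - self.damping) * target
        return False
```

A higher `damping` value makes the threshold react more slowly, which helps in rooms where the noise level fluctuates briefly.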

---

## Why HoloSTT?

Typical speech recognition modules often come with limitations:

* Support for microphone input only, or keyboard input only.
* No fallback modes, or a forced dependency on a single speech engine.
* Poor fit for multi-threaded apps, or no input state management.

**HoloSTT** solves these problems by:

* Offering both **active (“push-to-talk”)** and **ambient (always-on)** listening, with instant keyboard fallback.
* Providing a **centralized, extensible interface** for all input and recognition events.
* Supporting robust error handling, state tracking, and dynamic configuration for modern AI workflows.

---

## Key Features

* **Flexible Audio Capture:**
  Toggle between active/ambient listening or switch to keyboard input instantly.

* **Dynamic Noise and Volume Handling:**
  Automatically adapts to background noise and adjusts thresholds for clear, accurate recognition.

* **Centralized State & Properties:**
  Track commands, input modes, timeouts, and all key settings in one place.

* **Customizable Input Processing:**
  Clean and filter input text, apply replacements, compare similarity, and trigger actions.

* **Robust Error Handling:**
  Handles microphone, audio, and network exceptions gracefully.

* **Production-Ready:**
  Built for real-world, scalable AI systems.
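
The "Customizable Input Processing" feature (cleaning, replacements, similarity comparison) can be approximated with the standard library alone. The sketch below is not HoloSTT's API. `clean_text`, `similarity`, and `matches` are hypothetical helpers, and the 0.8 threshold is an arbitrary assumption.

```python
import re
from difflib import SequenceMatcher


def clean_text(text, replacements=None):
    """Lowercase, apply replacements, drop punctuation, collapse whitespace."""
    text = text.lower().strip()
    for old, new in (replacements or {}).items():
        text = text.replace(old, new)
    text = re.sub(r"[^\w\s']", " ", text)     # keep word characters and apostrophes
    return re.sub(r"\s+", " ", text).strip()  # collapse runs of whitespace


def similarity(a, b):
    """Similarity ratio in [0, 1] between two cleaned strings."""
    return SequenceMatcher(None, clean_text(a), clean_text(b)).ratio()


def matches(heard, command, threshold=0.8):
    """True when the recognized text is close enough to a known command."""
    return similarity(heard, command) >= threshold
```

For example, `matches("turn on the light", "Turn on the lights!")` tolerates the missing plural and the punctuation, while an unrelated phrase such as "play music" falls below the threshold.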

---

## How It Works

1. **Call the HoloSTT manager** in your application.
2. **Select input mode:** active voice, ambient voice, or keyboard.
3. **Process and clean recognized text** automatically.
4. **Use output directly** for commands, automations, or AI skills.
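
The four steps above could look roughly like the following. This is a sketch under stated assumptions rather than HoloSTT's actual interface. `InputMode`, `get_input`, and the `sources` mapping are hypothetical names, and the capture functions are stand-ins for real microphone or keyboard readers.

```python
from enum import Enum


class InputMode(Enum):
    ACTIVE = "active"      # push-to-talk
    AMBIENT = "ambient"    # always-on listening
    KEYBOARD = "keyboard"  # typed input


def get_input(mode, sources, clean=str.strip):
    """Read text from the selected source, falling back to the keyboard.

    `sources` maps each InputMode to a zero-argument callable that
    returns the captured text, or None if nothing was captured.
    """
    try:
        text = sources[mode]()
    except OSError:
        # Assumed failure type for a missing or busy audio device.
        text = None
    if not text and mode is not InputMode.KEYBOARD:
        text = sources[InputMode.KEYBOARD]()
    return clean(text) if text else None
```

Passing the capture functions in as plain callables keeps the dispatch logic testable without audio hardware.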

---

## FAQ

**Q: Does HoloSTT require a specific folder structure or class naming convention?**
A: No. Organize your project and input logic as you see fit.

**Q: Can I use my own text cleaning or filtering?**
A: Yes. HoloSTT exposes all processing utilities and is easy to extend.

**Q: Is it production-ready and thread-safe?**
A: Yes. The singleton implementation ensures safe multi-threaded operation.
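
For readers unfamiliar with the pattern, a thread-safe singleton in Python is often written with double-checked locking, as sketched below. This is a generic illustration, not HoloSTT's source. `InputManager`, `mode`, and `set_mode` are hypothetical names. Note that the singleton only guarantees a single shared instance, so mutable state on that instance still needs its own lock.

```python
import threading


class InputManager:
    """Hypothetical manager class used only to illustrate the pattern."""

    _instance = None
    _lock = threading.Lock()

    def __new__(cls):
        # Fast path: skip the lock once the instance exists.
        if cls._instance is None:
            with cls._lock:
                # Re-check inside the lock in case another thread won the race.
                if cls._instance is None:
                    instance = super().__new__(cls)
                    instance._state_lock = threading.Lock()
                    instance.mode = "keyboard"
                    # Publish only after the instance is fully initialized.
                    cls._instance = instance
        return cls._instance

    def set_mode(self, mode):
        # Guard shared state separately from instance creation.
        with self._state_lock:
            self.mode = mode
```

Every call to `InputManager()` returns the same object, so a mode change made in one thread is visible to all others.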

---

## Code Examples

You can find code examples in my [GitHub repository](https://github.com/TristanMcBrideSr/TechBook).

---

## License

This project is licensed under the [Apache License, Version 2.0](LICENSE).
Copyright 2025 Tristan McBride Sr.

---

## Acknowledgements

Project by:
- Tristan McBride Sr.
- Sybil

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "HoloSTT",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": "\"Tristan McBride Sr.\" <142635792+TristanMcBrideSr@users.noreply.github.com>",
    "keywords": "AI, Agents, Speech Recognition, Productivity, Automation",
    "author": "Tristan McBride Sr.",
    "author_email": "\"Tristan McBride Sr.\" <142635792+TristanMcBrideSr@users.noreply.github.com>",
    "download_url": "https://files.pythonhosted.org/packages/7c/cd/af04690fae6f173a5c680f95b50bab361ea586ce7083dcd2ce708fb1dc3a/holostt-0.1.0.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": null,
    "summary": "Modern Speech Recognition with both active and ambient listening and keyboard",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/TristanMcBrideSr"
    },
    "split_keywords": [
        "ai",
        " agents",
        " speech recognition",
        " productivity",
        " automation"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "6ebf5af08fc521746c3eb5bc4f521c350ce80a7aa67dd704f5a70a28bef6baa2",
                "md5": "86d5cf507dd275fb181a2b26531ef21d",
                "sha256": "e357754fc9297cdfbd01e98c04ca0184578ec5466750c5b8d22f40d9deadfcc0"
            },
            "downloads": -1,
            "filename": "holostt-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "86d5cf507dd275fb181a2b26531ef21d",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 5971,
            "upload_time": "2025-08-04T02:45:10",
            "upload_time_iso_8601": "2025-08-04T02:45:10.521956Z",
            "url": "https://files.pythonhosted.org/packages/6e/bf/5af08fc521746c3eb5bc4f521c350ce80a7aa67dd704f5a70a28bef6baa2/holostt-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "7ccdaf04690fae6f173a5c680f95b50bab361ea586ce7083dcd2ce708fb1dc3a",
                "md5": "dc90858cb9e0f4eab374cd0659e3ffcb",
                "sha256": "3a1aa1f03f23eadea27ac9a1828ad4b9dbf7ddf53c7c369b66d83f7703fb2839"
            },
            "downloads": -1,
            "filename": "holostt-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "dc90858cb9e0f4eab374cd0659e3ffcb",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 6177,
            "upload_time": "2025-08-04T02:45:11",
            "upload_time_iso_8601": "2025-08-04T02:45:11.447696Z",
            "url": "https://files.pythonhosted.org/packages/7c/cd/af04690fae6f173a5c680f95b50bab361ea586ce7083dcd2ce708fb1dc3a/holostt-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-04 02:45:11",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "holostt"
}
        