| Name | Version | Summary | date | 
        
        
            
                | fonadalabs | 
                2.0.2 | 
                Unified Python SDK for FonadaLabs Text-to-Speech, Automatic Speech Recognition, and Audio Denoising APIs | 
                2025-11-03 09:40:47 | 
            
        
            
                | nulla | 
                0.0.5 | 
                Nulla: a local AI companion bootstrapper (Windows) with voice (Whisper ASR + XTTS v2 TTS), llama.cpp + OpenHermes GGUF, and built-in mini-games. | 
                2025-11-02 18:55:59 | 
            
        
            
                | wraipperz | 
                0.1.47 | 
                Simple wrappers for various AI APIs including LLMs, ASR, and TTS | 
                2025-11-02 14:10:35 | 
            
        
            
                | audioscope | 
                0.0.1 | 
                Audio-Scope: forensic & diagnostic toolkit for robust speech benchmarking (name reservation release) | 
                2025-10-24 20:35:37 | 
            
        
            
                | whisperpipe | 
                0.1.0 | 
                Real-time speech-to-text streaming with OpenAI Whisper | 
                2025-10-20 23:24:12 | 
            
        
            
                | wyoming-openai | 
                0.3.8 | 
                OpenAI-Compatible Proxy Middleware for the Wyoming Protocol | 
                2025-10-11 04:00:30 | 
            
        
            
                | chunkformer | 
                1.2.1 | 
                ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription | 
                2025-10-10 05:06:39 | 
            
        
            
                | sense-voice-streaming-asr | 
                0.1.1 | 
                Real-time streaming automatic speech recognition (ASR) with support for Chinese, English, Cantonese, Japanese, and Korean languages using SenseVoiceSmall model. | 
                2025-10-08 16:46:09 | 
            
        
            
                | any4any | 
                0.1.1 | 
                大模型会话、对话内容预览与修改、语音识别、文本转语音、文档重排、文本嵌入、知识库系统和MCP服务的一键式API服务 | 
                2025-10-08 10:49:23 | 
            
        
            
                | parakeet-mlx | 
                0.3.7 | 
                An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX. | 
                2025-09-17 13:37:44 | 
            
        
            
                | faster-whisper-hotkey | 
                0.4.2 | 
                Push-to-talk transcription | 
                2025-09-14 11:47:50 | 
            
        
            
                | bilibili-video-mcp | 
                0.1.0 | 
                MCP server for downloading and extracting content from Bilibili videos | 
                2025-09-14 02:06:21 | 
            
        
            
                | mythic-lite | 
                0.1.1 | 
                A lightweight, local AI chatbot system with text-to-speech capabilities | 
                2025-09-05 15:46:03 | 
            
        
            
                | pvcheetahdemo | 
                2.3.0 | 
                Cheetah speech-to-text engine demos | 
                2025-08-27 18:57:13 | 
            
        
            
                | pvcheetah | 
                2.3.0 | 
                Cheetah Speech-to-Text Engine. | 
                2025-08-27 18:39:55 | 
            
        
            
                | whisper-eval-serbian | 
                0.0.35 | 
                An evaluation framework for Serbian Whisper models. | 
                2025-08-21 20:28:55 | 
            
        
            
                | mlx_hubert | 
                0.1.0 | 
                HuBERT (Hidden Unit BERT) implementation in MLX for Apple Silicon | 
                2025-07-27 13:24:02 | 
            
        
            
                | ivrs-client | 
                0.1 | 
                智能语音应答系统客户端 | 
                2025-07-25 06:20:16 | 
            
        
            
                | whisper.py | 
                0.1.0 | 
                A Python wrapper for whisper.cpp - fast automatic speech recognition | 
                2025-07-09 10:13:23 | 
            
        
            
                | f5-tts-mlx | 
                0.2.6 | 
                F5-TTS - MLX | 
                2025-03-19 02:11:37 |