Name | Version | Summary | date |
voice-mode |
2.28.3 |
VoiceMode - Voice interaction capabilities for AI assistants (formerly voice-mcp) |
2025-08-24 06:25:20 |
pygpt-net |
2.6.21 |
Desktop AI Assistant powered by: OpenAI GPT-5, o1, o3, GPT-4, Gemini, Claude, Grok, DeepSeek, and other models supported by Llama Index, and Ollama. Chatbot, agents, completion, image generation, vision analysis, speech-to-text, plugins, internet access, file handling, command execution and more. |
2025-08-24 04:52:18 |
videosdk-plugins-speechify |
0.0.27 |
VideoSDK Agent Framework plugin for Speechify AI TTS |
2025-08-23 04:59:57 |
videosdk-plugins-rnnoise |
0.0.27 |
VideoSDK Agent Framework plugin for RNNoise. |
2025-08-23 04:59:01 |
videosdk-plugins-rime |
0.0.27 |
VideoSDK Agent Framework plugin for Rime AI Text-to-Speech services |
2025-08-23 04:58:46 |
videosdk-plugins-neuphonic |
0.0.27 |
VideoSDK Agent Framework plugin for Neuphonic AI |
2025-08-23 04:58:15 |
videosdk-plugins-lmnt |
0.0.27 |
VideoSDK Agent Framework plugin for LMNT AI Text-to-Speech services |
2025-08-23 04:57:58 |
videosdk-plugins-inworldai |
0.0.27 |
VideoSDK Agent Framework plugin for InworldAI TTS services |
2025-08-23 04:57:50 |
videosdk-plugins-humeai |
0.0.27 |
Hume AI TTS plugin for videosdk-agents |
2025-08-23 04:57:42 |
videosdk-plugins-groq |
0.0.27 |
VideoSDK Agent Framework plugin for Groq TTS services |
2025-08-23 04:57:32 |
mlx-omni-server |
0.4.9 |
MLX Omni Server is a server that provides OpenAI-compatible APIs using Apple's MLX framework. |
2025-08-20 01:48:36 |
indoxrouter |
0.1.26 |
A unified client for various AI providers |
2025-08-19 15:51:51 |
par-cli-tts |
0.2.0 |
PAR CLI TTS - Command line text-to-speech tool using ElevenLabs with voice caching and name resolution |
2025-08-19 15:24:18 |
tnzapi |
2.4.2.0 |
TNZ REST API Helper Library for Python |
2025-08-19 04:20:49 |
megatron-bridge |
0.1.0rc1 |
Megatron Bridge: Training Recipes for Megatron-based LLM and VLM models |
2025-08-18 06:29:49 |
speechlight |
2.0.3 |
A lightweight Python library providing a common interface to multiple TTS and screen reader APIs. |
2025-08-17 23:58:29 |
kokoro-tts |
2.3.0 |
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents. |
2025-08-17 21:23:00 |
webscout |
8.3.6 |
Search for anything using Google, DuckDuckGo, phind.com, Contains AI models, can transcribe yt videos, temporary email and phone number generation, has TTS support, webai (terminal gpt and open interpreter) and offline LLMs and more |
2025-08-16 04:49:40 |
chatterbox-vllm |
0.1.2 |
Chatterbox TTS ported to VLLM for efficienct and advanced inference tasks |
2025-08-16 00:18:39 |
mcp-tts |
0.3.0 |
MCP Text-to-Speech Server for Cursor IDE (and others) with cross-platform audio playback |
2025-08-13 20:58:57 |