Name | Version | Summary | Date |
patkit | 0.18.2 | Phonetic Analysis ToolKit: Tools for processing phonetic data | 2025-08-08 22:23:27 |
audiofeat | 1.0.0 | A comprehensive PyTorch-based audio feature extraction library for machine learning, research, and audio analysis | 2025-08-04 04:54:45 |
deepctl | 0.1.10 | Official Deepgram CLI for speech recognition and audio intelligence | 2025-08-01 15:23:11 |
deepctl-cmd-transcribe | 0.1.10 | Transcribe command for deepctl | 2025-08-01 15:23:07 |
auto-video-generator-mcp | 3.0.0 | An intelligent video generation system based on the MCP protocol, supporting automatic subtitling, speech synthesis, and video editing | 2025-07-27 03:59:50 |
nemo-toolkit | 2.4.0 | NeMo - a toolkit for Conversational AI | 2025-07-25 18:12:25 |
whisper-parallel-cpu | 1.2.3 | High-performance audio and video transcription using whisper.cpp with automatic model downloading and CPU parallelism | 2025-07-25 10:44:35 |
phonexia-audio-quality-estimation-client | 1.0.0 | Client for communicating with Phonexia audio quality estimation | 2025-07-21 08:39:56 |
phonexia-grpc | 2.20.0 | Library for communication with microservices developed by Phonexia using the gRPC application interface. | 2025-07-17 09:33:17 |
audiojudge | 0.1.2 | A simple package for audio comparison using large language models | 2025-07-15 20:46:33 |
voice-mode-azure | 2.13.0 | VoiceMode with Azure OpenAI support - Voice interaction capabilities for AI assistants | 2025-07-13 01:42:09 |
vocals | 1.0.984 | A Python SDK for voice processing and real-time audio communication | 2025-07-11 04:52:57 |
text-to-speech-api | 2025.0.2 | image-upscaling.net api client | 2025-07-10 23:23:25 |
pyttsx3 | 2.99 | Text to Speech (TTS) library for Python 3. Works without internet connection or delay. Supports multiple TTS engines, including Sapi5, nsss, and espeak. | 2025-07-08 12:24:21 |
podonos | 0.11.0 | Managed evaluation for audio & speech | 2025-02-19 00:59:14 |
agi-open-network-cn | 0.1.0 | AGI Open Network China Models - A Simple and Powerful Framework for Chinese AI Models | 2025-02-02 11:28:57 |
phonexia-gender-identification-client | 1.3.1 | Client script for communicating with Phonexia gender identification microservice. | 2025-01-27 08:36:12 |
phonexia-enhanced-speech-to-text-built-on-whisper-client | 1.8.0 | Client for communication with Phonexia Enhanced Speech To Text Built On Whisper microservice. | 2025-01-21 20:01:00 |
pyobjc-framework-Speech | 11.0 | Wrappers for the framework Speech on macOS | 2025-01-14 19:05:38 |
pnm | 0.0.1 | Convert audio to phonetic text and practice improving your speech accent. | 2025-01-14 04:51:36 |