Name | Version | Summary | date |
RealTimeSTT |
0.1.11 |
A fast Voice Activity Detection and Transcription System |
2024-03-16 19:46:37 |
transcribe-anything |
2.7.33 |
Uses whisper AI to transcribe speach from video and audio files. Also accepts urls for youtube, rumble, bitchute, clear file, etc. |
2024-02-24 04:39:55 |
torch-cif |
0.2.0 |
A fast parallel implementation of continuous integrate-and-fire (CIF) https://arxiv.org/abs/1905.11235 |
2024-02-09 05:22:48 |
pvcheetahdemo |
2.0.1 |
Cheetah speech-to-text engine demos |
2024-02-06 01:33:16 |
pvcheetah |
2.0.1 |
Cheetah Speech-to-Text Engine. |
2024-02-06 01:32:51 |
pvleoparddemo |
2.0.2 |
Leopard speech-to-text engine demos |
2024-02-06 01:32:20 |
pvleopard |
2.0.2 |
Leopard Speech-to-Text Engine. |
2024-02-06 01:30:24 |
verbatim |
0.1.6 |
high quality multi-lingual speech to text |
2024-02-02 06:17:23 |
video-summary |
1.0.3 |
A Python SDK for video processing, providing functionalities like speech-to-text, summarization, transcription, and chaptering. |
2024-01-04 22:27:21 |
BanterBot |
0.0.15 |
BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports speech-to-text and text-to-speech interactions with emotional tone selection. Features real-time monitoring and Tkinter frontend. |
2023-12-29 13:17:54 |
tafrigh |
1.1.2 |
تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai. |
2023-12-11 13:45:31 |
whisperyt |
0.1.4 |
Using Gladia's Whisper API for transcribing YouTube videos |
2023-12-01 23:30:20 |
werpy |
2.1.0 |
A powerful yet lightweight Python package to calculate and analyze the Word Error Rate (WER). |
2023-11-23 06:21:05 |
scraibe |
0.1.1 |
Transcription tool for audio files based on Whisper and Pyannote |
2023-09-22 18:38:58 |
stark-engine |
4.0.7 |
S.T.A.R.K - Speech and Text Algorithmic Recognition Kit. Modern framework for creating powerfull voice assistants. |
2023-09-21 18:37:29 |
stark-place |
1.1.0 |
S.T.A.R.K. Platform Library And Community Extensions |
2023-09-21 14:47:47 |
armspeech |
0.1.4 |
ArmSpeech is an offline Armenian speech recognition library (speech-to-text) and CLI tool based on Coqui STT (🐸STT) and trained on the ArmSpeech dataset. |
2023-06-06 16:25:24 |
gptalk |
0.0.4.5 |
Fast GPT-3 client for Windows and Unix that supports both text and speech in any language. |
2023-03-26 09:45:11 |