Name | Version | Summary | Date |
dillwave | 1.0.3 | dillwave | 2024-05-02 16:50:05 |
transformers | 4.40.1 | State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow | 2024-04-23 22:01:15 |
SpeechRecognition | 3.10.3 | Library for performing speech recognition, with support for several engines and APIs, online and offline. | 2024-03-30 15:23:14 |
last-asr | 0.0.4 | The LAttice-based Speech Transducer (LAST) library | 2024-03-25 20:57:08 |
spark-nlp | 5.3.2 | John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment. | 2024-03-20 19:09:21 |
s3prl | 0.4.15 | Self-Supervised Speech Pre-training and Representation Learning Toolkit | 2024-03-18 13:38:07 |
pyobjc-framework-Speech | 10.2 | Wrappers for the framework Speech on macOS | 2024-03-16 09:22:07 |
inaSpeechSegmenter | 0.7.8 | CNN-based audio segmentation toolkit. Does voice activity detection, speech detection, music detection, noise detection, speaker gender recognition. | 2024-03-15 16:27:05 |
nm-transformers-nightly | 1.7.0.20240304 | State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow | 2024-03-05 13:39:02 |
ppgs | 0.0.3 | Phonetic posteriorgrams | 2024-03-04 22:15:00 |
reseval | 0.1.6 | Reproducible Subjective Evaluation | 2024-03-03 05:32:16 |
wikipron | 1.3.1 | Scraping grapheme-to-phoneme data from Wiktionary | 2024-03-02 23:45:33 |
faster-whisper | 1.0.1 | Faster Whisper transcription with CTranslate2 | 2024-03-01 10:44:19 |
phonexia-voiceprint-extraction-client | 1.2.0 | Client for communication with Phonexia voiceprint extraction microservice. | 2024-02-29 16:19:50 |
nemo-toolkit | 1.23.0 | NeMo - a toolkit for Conversational AI | 2024-02-28 05:27:22 |
citylex | 0.1.15 | Builds a multi-source English lexicon | 2024-02-20 17:46:52 |
voxscribe | 1.1.2 | Extract text from .wav and .mp3 files. | 2024-02-17 01:28:00 |
lmnt | 1.1.2 | Python client library for the LMNT API | 2024-02-16 01:21:18 |
parrots | 1.0.3 | Parrots, Automatic Speech Recognition(**ASR**), Text-To-Speech(**TTS**) toolkit | 2024-02-13 06:16:30 |
torch-cif | 0.2.0 | A fast parallel implementation of continuous integrate-and-fire (CIF) https://arxiv.org/abs/1905.11235 | 2024-02-09 05:22:48 |