Name | Version | Summary | Date |
ArticutAPI-Taigi | 0.94 | Articut NLP system provides not only the finest results on Chinese word segmentation (CWS), Part-of-Speech tagging (POS) and Named Entity Recognition tagging (NER), but also the fastest online API service in the NLP industry. | 2023-08-21 05:09:12 |
langmo | 0.2.0 | Toolbox for various tasks in the area of vector space models in computational linguistics | 2023-08-17 01:38:51 |
YaleKorean | 1.0.1 | Korean Yale Romanizer | 2023-08-14 04:34:30 |
Boco | 0.3.2 | A corpus manager for the Tibetan language | 2023-07-29 13:01:19 |
uniparser-yawarana | 0.0.6 | A UniParser implementation for morphologically parsing and annotating Yawarana material. | 2023-07-25 15:17:59 |
samba-sampler | 0.3 | A Python package providing sampling methods via matrix-based distance measures to mitigate autocorrelation | 2023-07-13 08:15:50 |
colibricore | 2.5.9 | Colibri Core is an NLP tool as well as a C++ and Python library (all included in this package) for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller``, which allows you to build, view, manipulate and query pattern models. | 2023-07-03 10:33:51 |
clld-document-plugin | 0.0.5 | Document model and rendering in CLLD apps. | 2023-06-27 21:09:05 |
text-selection | 0.0.3 | Command-line interface (CLI) to select lines of a text file. | 2023-05-30 08:33:55 |
copius-api | 1.4.2 | Transcription & orthography toolset | 2023-05-23 16:23:09 |
kollo | 1.0.1 | Extract collocations from VERT data | 2023-04-28 09:31:41 |
corpy | 0.6.1 | Tools for processing language data. | 2023-04-05 13:44:59 |
textacy | 0.13.0 | NLP, before and after spaCy | 2023-04-02 23:05:33 |
ToMiddleChinese | 0.2.2 | 中古漢語自動標註工具 Middle Chinese Pronunciation Automatic Labeling Tool | 2023-03-13 11:53:34 |
wordseg | 0.0.5 | Word segmentation models | 2023-03-11 21:58:09 |
nskipgrams | 0.5.0 | A lightweight Python package to work with n-grams and skipgrams | 2023-03-11 21:42:45 |
arcaverborum | 0.2.1 | Library for interfacing with data from the GLED project | 2023-02-11 15:33:51 |
speech-dataset-parser | 0.0.4 | Library to parse speech datasets stored in a generic format based on TextGrids. A tool (CLI) for converting common datasets like LJ Speech into a generic format is included. | 2023-01-12 13:55:18 |
textgrid-tools | 0.0.7 | Command-line interface (CLI) to modify TextGrids and their corresponding audio files. | 2023-01-12 11:46:23 |
dict-from-pypinyin | 0.0.1 | Command-line interface (CLI) to create a pronunciation dictionary by looking up pinyin transcriptions using pypinyin, with the option of ignoring punctuation and splitting words on hyphens before transcribing them. | 2023-01-11 08:37:54 |
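
Several summaries above mention n-grams and skipgrams (patterns with one or more gaps). As a rough illustration of the idea only, the following sketch builds both in plain Python. It does not use or reproduce the API of colibricore or nskipgrams; the function names `ngrams` and `skipgrams` and the `_` gap marker are hypothetical choices made for this example, and only fixed-size, single-token gaps inside a window are covered.

```python
from itertools import combinations
from typing import List, Sequence, Tuple

GAP = "_"  # hypothetical placeholder marking one skipped token


def ngrams(tokens: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return all contiguous windows of n tokens, left to right."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def skipgrams(tokens: Sequence[str], n: int) -> List[Tuple[str, ...]]:
    """Return every n-token window with some interior tokens replaced by GAP.

    The first and last token of each window are always kept, so a window
    needs at least three tokens to contain a gap.
    """
    if n < 3:
        return []
    interior = range(1, n - 1)  # positions that are allowed to become gaps
    results = []
    for window in ngrams(tokens, n):
        for gap_count in range(1, n - 1):  # 1 .. n-2 gaps per pattern
            for gap_positions in combinations(interior, gap_count):
                pattern = tuple(
                    GAP if i in gap_positions else tok
                    for i, tok in enumerate(window)
                )
                results.append(pattern)
    return results


tokens = "to be or not to be".split()
print(ngrams(tokens, 3))
print(skipgrams(tokens, 3))
```

Dedicated libraries typically encode patterns compactly and count them efficiently over large corpora; this sketch simply enumerates them as tuples, which is enough to show what a gap pattern looks like.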