Name | Version | Summary | Date |
python-frog | 0.6.11 | Python binding to Frog, an NLP suite for Dutch doing part-of-speech tagging, lemmatisation, morphological analysis, named-entity recognition, shallow parsing, and dependency parsing. | 2024-12-17 12:31:08 |
python-ucto | 0.6.9 | This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first steps in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression based, extensible, and advanced tokeniser written in C++ (https://languagemachines.github.io/ucto). | 2024-12-17 11:56:39 |
Spacy2FoLiA | 0.3.4 | Library that adds FoLiA (format for linguistic annotation) support to spaCy | 2024-02-27 21:45:47 |
colibricore | 2.5.9 | Colibri Core is an NLP tool as well as a C++ and Python library (all included in this package) for working with basic linguistic constructions such as n-grams and skipgrams (i.e. patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate and query pattern models. | 2023-07-03 10:33:51 |
botok | 0.8.12 | Tibetan Word Tokenizer | 2023-05-17 11:36:37 |
BabelPy | 1.0.1 | BabelFy API Client | 2017-12-12 12:25:04 |