Name | Version | Summary | date |
indoxMiner |
0.0.10 |
Indox Data Extraction |
2024-11-11 05:40:57 |
yurenizer |
0.0.8 |
A library for standardizing terms with spelling variations using a synonym dictionary. |
2024-11-10 09:23:09 |
chonkie |
0.1.2 |
🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library |
2024-11-08 17:27:30 |
analiticcl |
0.4.8 |
Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation |
2024-10-17 19:47:08 |
sesdiff |
0.3.2 |
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein). This is the Python binding. |
2024-10-15 08:59:03 |
reliq |
0.0.26 |
Python ctypes bindings for reliq |
2024-10-05 12:32:03 |
abbreviation-extractor |
0.1.4 |
A library for extracting abbreviations from text. |
2024-09-14 20:02:14 |
html-to-markdown |
1.1.0 |
Convert HTML to markdown |
2024-09-09 06:26:33 |
tokenize-text |
0.2.32 |
Tokenizing and processing text inputs with transformer models |
2024-08-11 21:53:24 |
tokenize-transformer |
0.2.14 |
Tokenizing and processing text inputs with transformer models |
2024-08-01 18:39:29 |
flashtext2 |
1.1.0 |
A package for extracting keywords from large text very quickly (much faster than regex and the original flashtext package |
2024-07-04 14:40:37 |
roter |
2024.6.25 |
Rotate and combine tables (Danish: Roter og kombiner borde). |
2024-06-24 19:22:08 |