| Name | Version | Summary | date |
| spacytextblob |
5.1.0 |
A TextBlob sentiment analysis pipeline component for spaCy. |
2025-10-30 22:19:16 |
| spacypdfreader |
0.4.0 |
A PDF to text extraction pipeline component for spaCy. |
2025-10-30 21:51:51 |
| ai-data-scrubber |
0.1.1 |
A lightweight tool for removing personal data from text before uploading to LLMs |
2025-10-15 03:28:13 |
| intelli3text |
0.2.5 |
Ingestion (web/PDF/DOCX/TXT), cleaning, paragraph-level LID (PT/EN/ES), and spaCy-based normalization; PDF export. |
2025-10-13 00:46:31 |
| tamil-utils |
0.4.0 |
Tiny Tamil text utilities: normalize, tokenize, stopwords, graphemes, n-grams, syllables, Tamil collation; dataset preprocessor; optional spaCy tokenizer hook. |
2025-09-17 14:24:34 |
| coref-onnx |
0.1.2 |
Lightweight cross-lingual coreference resolution using ONNX Runtime and distilled transformer models |
2025-08-03 10:33:03 |
| textcleaner-partha |
1.1.2 |
A lightweight and reusable text preprocessing package for NLP tasks |
2025-08-02 16:21:04 |
| zensols-nlp |
1.12.8 |
This framework wraps the spaCy framework and creates light weight features in a class hierarchy that reflects the structure of natural language |
2025-07-31 21:21:02 |
| inconnu |
0.1.0 |
GDPR-compliant data privacy tool for entity redaction and de-anonymization |
2025-07-23 10:41:29 |
| simple-anonymizer |
0.1.18 |
Privacy-first text anonymization tool with enterprise-grade accuracy for removing PII from documents |
2025-07-23 07:51:26 |
| sencore |
0.1.50 |
sentence nlp parser for multilingua |
2025-02-10 04:20:32 |
| phrase-detective |
0.1.35 |
Phrase recognizer component for spacy pipeline |
2024-12-25 11:14:39 |
| tsnorm |
1.1.2 |
A library to put stress marks in Russian text |
2024-12-19 00:43:06 |
| textdescriptives |
2.8.4 |
A library for calculating a variety of features from text using spaCy |
2024-12-16 09:07:48 |
| lemon-tizer |
0.0.7 |
LemonTizer is a class that wraps the spacy library to build a lemmatizer for language learning applications. |
2024-11-27 15:22:51 |
| clause-segmenter |
0.1.1 |
A clause segmenting tool utilising Python's spacy |
2024-11-19 03:09:24 |
| news-fetch |
0.3.0 |
news-fetch is an open-source, easy-to-use news extractor with basic NLP features (cleaning text, keywords, summary) that just works. |
2024-11-03 07:10:21 |
| huspacy |
0.12.0 |
HuSpaCy: industrial strength Hungarian natural language processing |
2024-10-28 10:30:55 |
| lingpatlab |
0.2.10 |
Linguistic Pattern Lab using spaCy |
2024-10-11 18:02:37 |
| spacy-conll |
4.0.1 |
A custom pipeline component for spaCy that can convert any parsed Doc and its sentences into CoNLL-U format. Also provides a command line entry point. |
2024-07-02 08:51:06 |