Name | Version | Summary | date |
---|---|---|---|
python-ucto | 0.6.9 | This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression based, extensible, and advanced tokeniser written in C++ (https://languagemachines.github.io/ucto). | 2024-12-17 11:56:39 |
wordsprobability | 0.17 | Method to get a words probability with fixes from How to Compute the Probability of a Word. | 2024-07-10 13:28:31 |
hour | day | week | total |
---|---|---|---|
37 | 1205 | 8160 | 274904 |