# RO Diacritics module
**RO Diacritics** is a straightforward diacritics restoration module for the Romanian language. To restore the diacritics in a string:
```python
from ro_diacritics import restore_diacritics
print(restore_diacritics("fara poezie, viata e pustiu"))
```
To correct a text column of a pandas DataFrame:
```python
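import pandas as pd
from ro_diacritics import restore_diacritics
df = pd.DataFrame({'text': ["fara poezie, viata e pustiu"]})  # sample data
df['text-diacritice'] = df['text'].apply(restore_diacritics)  # new column with restored text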
```
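
The examples above take text whose diacritics are missing. As a minimal sketch of the inverse operation, the standard-library snippet below strips diacritics from correctly written Romanian text, which can be handy for producing test inputs. The helper name `strip_diacritics` is hypothetical and is not part of `ro_diacritics`.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Remove combining marks, e.g. 'ă' -> 'a', 'ș' -> 's', 'ț' -> 't'.

    Hypothetical helper for illustration; not provided by ro_diacritics.
    """
    # NFD splits each accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(strip_diacritics("fără poezie, viața e pustiu"))
# prints: fara poezie, viata e pustiu
```

Comparing the module's output on the stripped text against the original sentence gives a quick sanity check, although this says nothing about accuracy on text in general.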
## Installing
```console
$ python -m pip install ro-diacritics
```
or
```console
$ pip install ro-diacritics
```
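
To confirm that the installation succeeded, the installed version can be queried with the standard library. This is a small sketch; the helper name `installed_version` is an assumption made for illustration.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Prints the version string if the package is installed, otherwise None.
print(installed_version("ro-diacritics"))
```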
## Requirements
* `torch` and `torchtext`
* `numpy`
* `nltk` and `scikit-learn` (for training)
* the NLTK `punkt` tokenizer data, downloaded with `nltk.download('punkt')`
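
A quick way to see which of the packages listed above are missing from the current environment is sketched below, using only the standard library. The helper name `missing_modules` is hypothetical; `sklearn` is the import name of `scikit-learn`.

```python
import importlib.util
from typing import Iterable, List

# Import names of the requirements listed above.
REQUIRED = ["torch", "torchtext", "numpy", "nltk", "sklearn"]


def missing_modules(names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# An empty list means every listed module is importable.
print(missing_modules(REQUIRED))
```

This only checks that the modules can be located. The `punkt` tokenizer data is separate and still has to be fetched with `nltk.download('punkt')`.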
## References
- Ruseti, S., Cotet, T. M., & Dascalu, M. (2020). Romanian Diacritics Restoration Using Recurrent Neural Networks. arXiv preprint arXiv:2009.02743.
- https://github.com/teodor-cotet/DiacriticsRestoration
## Raw data
{
"_id": null,
"home_page": "https://github.com/AndyTheFactory/RO-Diacritics",
"name": "ro-diacritics",
"maintainer": "Andrei Paraschiv",
"docs_url": null,
"requires_python": "",
"maintainer_email": "andrei@thephpfactory.com",
"keywords": "romanian diacritcs language restoration diacritice python",
"author": "Andrei Paraschiv",
"author_email": "andrei@thephpfactory.com",
"download_url": "https://files.pythonhosted.org/packages/b4/1f/8c4f370f9ea7c229f6a5d54678fed1002acc3e7adb8bf895e3aa934211db/ro-diacritics-0.9.4.2.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": "",
"summary": "Python API for Romanian diacritics restoration",
"version": "0.9.4.2",
"project_urls": {
"Homepage": "https://github.com/AndyTheFactory/RO-Diacritics"
},
"split_keywords": [
"romanian",
"diacritcs",
"language",
"restoration",
"diacritice",
"python"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "b41f8c4f370f9ea7c229f6a5d54678fed1002acc3e7adb8bf895e3aa934211db",
"md5": "cf4ab9868049a2a2e3bcf9f28a3d0d5b",
"sha256": "92bffa40355c4120b10354b783c8950c82d48da005757546cf86c00894a3e321"
},
"downloads": -1,
"filename": "ro-diacritics-0.9.4.2.tar.gz",
"has_sig": false,
"md5_digest": "cf4ab9868049a2a2e3bcf9f28a3d0d5b",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 12504,
"upload_time": "2024-01-03T21:31:00",
"upload_time_iso_8601": "2024-01-03T21:31:00.768363Z",
"url": "https://files.pythonhosted.org/packages/b4/1f/8c4f370f9ea7c229f6a5d54678fed1002acc3e7adb8bf895e3aa934211db/ro-diacritics-0.9.4.2.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-01-03 21:31:00",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "AndyTheFactory",
"github_project": "RO-Diacritics",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "ro-diacritics"
}