ro-diacritics


Name: ro-diacritics
Version: 0.9.4.2
Home page: https://github.com/AndyTheFactory/RO-Diacritics
Summary: Python API for Romanian diacritics restoration
Upload time: 2024-01-03 21:31:00
Maintainer: Andrei Paraschiv
Author: Andrei Paraschiv
Keywords: romanian diacritcs language restoration diacritice python

# RO Diacritics module

**RO Diacritics** is a straightforward diacritics restoration module for the Romanian language. Basic usage:

```python
from ro_diacritics import restore_diacritics
print(restore_diacritics("fara poezie, viata e pustiu"))
```

The same function can be applied to a text column of a pandas DataFrame:

```python
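import pandas as pd
from ro_diacritics import restore_diacritics

# Example frame; any DataFrame with a string column works the same way.
df = pd.DataFrame({"text": ["fara poezie, viata e pustiu"]})
df["text-diacritice"] = df["text"].apply(restore_diacritics)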
```
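
To try the module on text whose correct form is already known, one option is to strip the diacritics first and then compare the restored output with the original. The sketch below shows only the stripping step. `strip_diacritics` is a hypothetical helper written for this example and is not part of `ro_diacritics`.

```python
# Hypothetical helper (not part of ro_diacritics): removes Romanian diacritics
# so that text with known-correct diacritics can serve as a test input.
_RO_DIACRITICS = str.maketrans({
    "ă": "a", "â": "a", "î": "i", "ș": "s", "ț": "t",
    "Ă": "A", "Â": "A", "Î": "I", "Ș": "S", "Ț": "T",
    # Legacy cedilla forms that still appear in older Romanian text.
    "ş": "s", "ţ": "t", "Ş": "S", "Ţ": "T",
})


def strip_diacritics(text: str) -> str:
    """Replace each Romanian diacritic with its plain ASCII base letter."""
    return text.translate(_RO_DIACRITICS)


reference = "fără poezie, viața e pustiu"
stripped = strip_diacritics(reference)
print(stripped)  # fara poezie, viata e pustiu
```

Passing `stripped` to `restore_diacritics` and comparing the result with `reference` gives a quick sanity check. How closely the two match depends on the model and is not asserted here.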

## Installing

```console
$ python -m pip install ro-diacritics
```
or

```console
$ pip install ro-diacritics
```

## Requirements

 * torch and torchtext
 * numpy
 * nltk and scikit-learn (for training)
 * the NLTK `punkt` tokenizer data, downloaded once with `nltk.download('punkt')`
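
Before the first run, it can help to confirm that the packages listed above are importable. The following is a minimal sketch using only the standard library. The import names in `REQUIRED` are assumptions derived from the list (scikit-learn is imported as `sklearn`), and `missing_modules` is a hypothetical helper, not something the package provides.

```python
from importlib.util import find_spec

# Import names assumed from the requirements list above;
# scikit-learn is imported under the name "sklearn".
REQUIRED = ["torch", "torchtext", "numpy", "nltk", "sklearn"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


print("missing:", missing_modules(REQUIRED))
```

An empty list means every package was found. Once `nltk` is importable, `nltk.download('punkt')` fetches the tokenizer data.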

## References

- Ruseti, S., Cotet, T. M., & Dascalu, M. (2020). Romanian Diacritics Restoration Using Recurrent Neural Networks. arXiv preprint arXiv:2009.02743.
- https://github.com/teodor-cotet/DiacriticsRestoration



            
