czech-syllable-splitter


Nameczech-syllable-splitter JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://github.com/adasek/czech-syllable-splitter
SummarySplitting Czech words into syllables
upload_time2024-06-14 22:01:02
maintainerNone
docs_urlNone
authorAdam Benda
requires_python<4.0,>=3.8
licenseMIT
keywords syllables syllable czech czech-language language count counter natural-language-processing nlp
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ## Czech Syllable Splitter
Alogirthm for splitting Czech words into syllables.
Inspired by a syllable counting algorithm from David Lukeš counting the vowels.

With Klára Bendová we put together rules to expand the vowels into syllables,
empirically finding some common letter groups to stay intact.

This is not a perfect solution, but it is a good start for Czech language processing.
Measuring the accuracy of this algorithm is a to-do, as well as adding more rules if needed.

### Installation
```bash
pip install czech-syllable-splitter
```
or using Poetry package manager
```bash
poetry add czech-syllable-splitter
```

### Usage

```python
from czech_syllable_splitter import count_syllables, split_to_syllables, split_to_characters

print(split_to_syllables("příliš"))
print(split_to_characters("přesný"))
print(count_syllables("přísný"))
```

### Lint & Test
```bash
poetry run python3 -m pytest
poetry run mypy czech_syllable_splitter
poetry run pylint czech_syllable_splitter

```


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/adasek/czech-syllable-splitter",
    "name": "czech-syllable-splitter",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4.0,>=3.8",
    "maintainer_email": null,
    "keywords": "syllables, syllable, czech, czech-language, language, count, counter, natural-language-processing, nlp",
    "author": "Adam Benda",
    "author_email": "adam@adasek.cz",
    "download_url": "https://files.pythonhosted.org/packages/0b/b0/23bec9b5d9aa36782610a247dc2dfd1eb8f837cd5cb229a24a9dbd9bf4e6/czech_syllable_splitter-0.1.0.tar.gz",
    "platform": null,
    "description": "## Czech Syllable Splitter\nAlogirthm for splitting Czech words into syllables.\nInspired by a syllable counting algorithm from David Luke\u0161 counting the vowels.\n\nWith Kl\u00e1ra Bendov\u00e1 we put together rules to expand the vowels into syllables,\nempirically finding some common letter groups to stay intact.\n\nThis is not a perfect solution, but it is a good start for Czech language processing.\nMeasuring the accuracy of this algorithm is a to-do, as well as adding more rules if needed.\n\n### Installation\n```bash\npip install czech-syllable-splitter\n```\nor using Poetry package manager\n```bash\npoetry add czech-syllable-splitter\n```\n\n### Usage\n\n```python\nfrom czech_syllable_splitter import count_syllables, split_to_syllables, split_to_characters\n\nprint(split_to_syllables(\"p\u0159\u00edli\u0161\"))\nprint(split_to_characters(\"p\u0159esn\u00fd\"))\nprint(count_syllables(\"p\u0159\u00edsn\u00fd\"))\n```\n\n### Lint & Test\n```bash\npoetry run python3 -m pytest\npoetry run mypy czech_syllable_splitter\npoetry run pylint czech_syllable_splitter\n\n```\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Splitting Czech words into syllables",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/adasek/czech-syllable-splitter",
        "Repository": "https://github.com/adasek/czech-syllable-splitter"
    },
    "split_keywords": [
        "syllables",
        " syllable",
        " czech",
        " czech-language",
        " language",
        " count",
        " counter",
        " natural-language-processing",
        " nlp"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "3723b2088e9eef5f94b5cfec8983403fca2b70320b7cb2f018970daf05428888",
                "md5": "092b645d8b32230132c0879e2d644c08",
                "sha256": "a787fc2d045c1df6d7b10196ed8d5f9db4dc4a7556fb5bd6305c23471cc793ec"
            },
            "downloads": -1,
            "filename": "czech_syllable_splitter-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "092b645d8b32230132c0879e2d644c08",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4.0,>=3.8",
            "size": 4331,
            "upload_time": "2024-06-14T22:01:01",
            "upload_time_iso_8601": "2024-06-14T22:01:01.186394Z",
            "url": "https://files.pythonhosted.org/packages/37/23/b2088e9eef5f94b5cfec8983403fca2b70320b7cb2f018970daf05428888/czech_syllable_splitter-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0bb023bec9b5d9aa36782610a247dc2dfd1eb8f837cd5cb229a24a9dbd9bf4e6",
                "md5": "71ab4b5c61b895cc71e553c736db5b86",
                "sha256": "2bf0b0cc55417fa750a5fb51fbde1a98038bf611f9e9efd88ba349ed2bdba509"
            },
            "downloads": -1,
            "filename": "czech_syllable_splitter-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "71ab4b5c61b895cc71e553c736db5b86",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4.0,>=3.8",
            "size": 3576,
            "upload_time": "2024-06-14T22:01:02",
            "upload_time_iso_8601": "2024-06-14T22:01:02.830418Z",
            "url": "https://files.pythonhosted.org/packages/0b/b0/23bec9b5d9aa36782610a247dc2dfd1eb8f837cd5cb229a24a9dbd9bf4e6/czech_syllable_splitter-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-06-14 22:01:02",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "adasek",
    "github_project": "czech-syllable-splitter",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "czech-syllable-splitter"
}
        
Elapsed time: 1.31866s