korean-text-normalizer


Namekorean-text-normalizer JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://github.com/yourusername/korean-text-normalizer
SummaryA tool for normalizing Korean text
upload_time2024-09-27 00:05:29
maintainerNone
docs_urlNone
authorYour Name
requires_python>=3.6
licenseNone
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Korean Text Normalizer

Korean Text Normalizer is a Python package for normalizing Korean text. It provides various functions to process and clean up Korean text data.



## Features

- Expand common Korean abbreviations
- Perform basic spell checking
- Normalize emoticons
- Detect and correct sentence boundaries
- Separate and combine Korean jamo (syllable characters)



## Installation

You can install the package using pip:

```
pip install korean-text-normalizer
```



## Usage

Here's a basic example of how to use the Korean Text Normalizer:

```python
from korean_text_normalizer import KoreanTextNormalizer

normalizer = KoreanTextNormalizer()

text = "ㅎㅇ! 오늘 날씨가 좋네요ㄱㅅ ^_^ 내일도 날씨가 좋았으면"
normalized_text = normalizer.normalize(text)

print(normalized_text)
```



## Contributing

Contributions are welcome! Please feel free to submit a Pull Request.



## License

This project is licensed under the MIT License.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/yourusername/korean-text-normalizer",
    "name": "korean-text-normalizer",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": null,
    "keywords": null,
    "author": "Your Name",
    "author_email": "your.email@example.com",
    "download_url": "https://files.pythonhosted.org/packages/86/1c/4fb03f3c27a277435e45b62f52c0fa50877b228e3474a1b00bea58bf2ca6/korean-text-normalizer-0.1.0.tar.gz",
    "platform": null,
    "description": "# Korean Text Normalizer\r\n\r\nKorean Text Normalizer is a Python package for normalizing Korean text. It provides various functions to process and clean up Korean text data.\r\n\r\n\r\n\r\n## Features\r\n\r\n- Expand common Korean abbreviations\r\n- Perform basic spell checking\r\n- Normalize emoticons\r\n- Detect and correct sentence boundaries\r\n- Separate and combine Korean jamo (syllable characters)\r\n\r\n\r\n\r\n## Installation\r\n\r\nYou can install the package using pip:\r\n\r\n```\r\npip install korean-text-normalizer\r\n```\r\n\r\n\r\n\r\n## Usage\r\n\r\nHere's a basic example of how to use the Korean Text Normalizer:\r\n\r\n```python\r\nfrom korean_text_normalizer import KoreanTextNormalizer\r\n\r\nnormalizer = KoreanTextNormalizer()\r\n\r\ntext = \"\u314e\u3147! \uc624\ub298 \ub0a0\uc528\uac00 \uc88b\ub124\uc694\u3131\u3145 ^_^ \ub0b4\uc77c\ub3c4 \ub0a0\uc528\uac00 \uc88b\uc558\uc73c\uba74\"\r\nnormalized_text = normalizer.normalize(text)\r\n\r\nprint(normalized_text)\r\n```\r\n\r\n\r\n\r\n## Contributing\r\n\r\nContributions are welcome! Please feel free to submit a Pull Request.\r\n\r\n\r\n\r\n## License\r\n\r\nThis project is licensed under the MIT License.\r\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "A tool for normalizing Korean text",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/yourusername/korean-text-normalizer"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "eab1833c44ca373071185f31e5a0e458547738b290f063153b745cb3a0143385",
                "md5": "b4cb886c1ea547e5bc91aa0de2d0fccb",
                "sha256": "fe7b98ec1786f9e5a1ca45e7f2d1c5cc1cfd3d60d5e394a194d9697a833f9424"
            },
            "downloads": -1,
            "filename": "korean_text_normalizer-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "b4cb886c1ea547e5bc91aa0de2d0fccb",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 2667,
            "upload_time": "2024-09-27T00:05:26",
            "upload_time_iso_8601": "2024-09-27T00:05:26.965210Z",
            "url": "https://files.pythonhosted.org/packages/ea/b1/833c44ca373071185f31e5a0e458547738b290f063153b745cb3a0143385/korean_text_normalizer-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "861c4fb03f3c27a277435e45b62f52c0fa50877b228e3474a1b00bea58bf2ca6",
                "md5": "fb66f4610a5cc143a89f6f7a08897d57",
                "sha256": "06b40dc01c97179fa1dcfc30ec08c9dfacd17024f72d37b932b8871649260944"
            },
            "downloads": -1,
            "filename": "korean-text-normalizer-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "fb66f4610a5cc143a89f6f7a08897d57",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6",
            "size": 2475,
            "upload_time": "2024-09-27T00:05:29",
            "upload_time_iso_8601": "2024-09-27T00:05:29.996466Z",
            "url": "https://files.pythonhosted.org/packages/86/1c/4fb03f3c27a277435e45b62f52c0fa50877b228e3474a1b00bea58bf2ca6/korean-text-normalizer-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-09-27 00:05:29",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "yourusername",
    "github_project": "korean-text-normalizer",
    "github_not_found": true,
    "lcname": "korean-text-normalizer"
}
        
Elapsed time: 0.65799s