# Korean Text Normalizer
Korean Text Normalizer is a Python package for cleaning and normalizing Korean text data.
## Features
- Expand common Korean abbreviations
- Perform basic spell checking
- Normalize emoticons
- Detect and correct sentence boundaries
- Separate Hangul syllables into jamo (individual consonant and vowel letters) and combine jamo back into syllables
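
To make the jamo feature concrete, here is a minimal standalone sketch of how Hangul syllables can be split into jamo and reassembled using standard Unicode arithmetic. It does not use this package and is not its implementation; the names `decompose` and `compose` are hypothetical and chosen only for illustration.

```python
# Standalone sketch: Hangul syllable <-> jamo conversion via Unicode arithmetic.
# Not taken from korean-text-normalizer; all names here are hypothetical.

HANGUL_BASE = 0xAC00      # code point of the first precomposed syllable, "가"
SYLLABLE_COUNT = 11172    # 19 initials * 21 vowels * 28 finals

CHOSEONG = list("ㄱㄲㄴㄷㄸㄹㅁㅂㅃㅅㅆㅇㅈㅉㅊㅋㅌㅍㅎ")                 # 19 initial consonants
JUNGSEONG = list("ㅏㅐㅑㅒㅓㅔㅕㅖㅗㅘㅙㅚㅛㅜㅝㅞㅟㅠㅡㅢㅣ")            # 21 vowels
JONGSEONG = [""] + list("ㄱㄲㄳㄴㄵㄶㄷㄹㄺㄻㄼㄽㄾㄿㅀㅁㅂㅄㅅㅆㅇㅈㅊㅋㅌㅍㅎ")  # no final + 27 finals


def decompose(text):
    """Split every precomposed Hangul syllable in text into its jamo."""
    out = []
    for ch in text:
        offset = ord(ch) - HANGUL_BASE
        if 0 <= offset < SYLLABLE_COUNT:
            initial, rest = divmod(offset, 21 * 28)
            vowel, final = divmod(rest, 28)
            out.append(CHOSEONG[initial] + JUNGSEONG[vowel] + JONGSEONG[final])
        else:
            out.append(ch)  # leave non-Hangul characters untouched
    return "".join(out)


def compose(initial, vowel, final=""):
    """Build one syllable from an initial consonant, a vowel, and an optional final."""
    index = (CHOSEONG.index(initial) * 21 + JUNGSEONG.index(vowel)) * 28
    return chr(HANGUL_BASE + index + JONGSEONG.index(final))


print(decompose("한글"))
print(compose("ㅎ", "ㅏ", "ㄴ"))
```

The sketch only handles precomposed syllables in the U+AC00 to U+D7A3 range and passes everything else through unchanged.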
## Installation
You can install the package using pip:
```
pip install korean-text-normalizer
```
## Usage
Here's a basic example of how to use the Korean Text Normalizer:
```python
from korean_text_normalizer import KoreanTextNormalizer

normalizer = KoreanTextNormalizer()

# Informal input containing chat abbreviations (ㅎㅇ, ㄱㅅ) and an emoticon (^_^)
text = "ㅎㅇ! 오늘 날씨가 좋네요ㄱㅅ ^_^ 내일도 날씨가 좋았으면"
normalized_text = normalizer.normalize(text)

print(normalized_text)
```
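
This README does not show the output of `normalize` or how it works internally. As a rough, standalone illustration of two of the listed features (abbreviation expansion and emoticon normalization), the sketch below applies simple lookup tables with the standard `re` module. The tables, the replacement labels, and the names `replace_all` and `simple_normalize` are assumptions made for this example, not the package's actual data or API.

```python
import re

# Hypothetical lookup tables; the package's real tables are not documented here.
ABBREVIATIONS = {"ㅎㅇ": "하이", "ㄱㅅ": "감사"}
EMOTICONS = {"^_^": "(웃음)", "ㅠㅠ": "(슬픔)"}


def replace_all(text, table):
    """Replace every key of table found in text with its mapped value."""
    # Try longer keys first so a short key cannot shadow a longer one.
    keys = sorted(table, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(k) for k in keys))
    return pattern.sub(lambda m: table[m.group(0)], text)


def simple_normalize(text):
    """Expand abbreviations, then normalize emoticons."""
    text = replace_all(text, ABBREVIATIONS)
    return replace_all(text, EMOTICONS)


print(simple_normalize("ㅎㅇ! ㄱㅅ ^_^"))
```

A plain table lookup like this matches the key wherever it appears, so a production normalizer would likely need word-boundary or context checks to avoid rewriting jamo sequences that are not abbreviations.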
## Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
## License
This project is licensed under the MIT License.
## Raw data
{
"_id": null,
"home_page": "https://github.com/yourusername/korean-text-normalizer",
"name": "korean-text-normalizer",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.6",
"maintainer_email": null,
"keywords": null,
"author": "Your Name",
"author_email": "your.email@example.com",
"download_url": "https://files.pythonhosted.org/packages/86/1c/4fb03f3c27a277435e45b62f52c0fa50877b228e3474a1b00bea58bf2ca6/korean-text-normalizer-0.1.0.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": null,
"summary": "A tool for normalizing Korean text",
"version": "0.1.0",
"project_urls": {
"Homepage": "https://github.com/yourusername/korean-text-normalizer"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "eab1833c44ca373071185f31e5a0e458547738b290f063153b745cb3a0143385",
"md5": "b4cb886c1ea547e5bc91aa0de2d0fccb",
"sha256": "fe7b98ec1786f9e5a1ca45e7f2d1c5cc1cfd3d60d5e394a194d9697a833f9424"
},
"downloads": -1,
"filename": "korean_text_normalizer-0.1.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "b4cb886c1ea547e5bc91aa0de2d0fccb",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.6",
"size": 2667,
"upload_time": "2024-09-27T00:05:26",
"upload_time_iso_8601": "2024-09-27T00:05:26.965210Z",
"url": "https://files.pythonhosted.org/packages/ea/b1/833c44ca373071185f31e5a0e458547738b290f063153b745cb3a0143385/korean_text_normalizer-0.1.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "861c4fb03f3c27a277435e45b62f52c0fa50877b228e3474a1b00bea58bf2ca6",
"md5": "fb66f4610a5cc143a89f6f7a08897d57",
"sha256": "06b40dc01c97179fa1dcfc30ec08c9dfacd17024f72d37b932b8871649260944"
},
"downloads": -1,
"filename": "korean-text-normalizer-0.1.0.tar.gz",
"has_sig": false,
"md5_digest": "fb66f4610a5cc143a89f6f7a08897d57",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.6",
"size": 2475,
"upload_time": "2024-09-27T00:05:29",
"upload_time_iso_8601": "2024-09-27T00:05:29.996466Z",
"url": "https://files.pythonhosted.org/packages/86/1c/4fb03f3c27a277435e45b62f52c0fa50877b228e3474a1b00bea58bf2ca6/korean-text-normalizer-0.1.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-09-27 00:05:29",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "yourusername",
"github_project": "korean-text-normalizer",
"github_not_found": true,
"lcname": "korean-text-normalizer"
}