# Script Converter for Uyghur Language
This converter converts multiple Uyghur scripts such as **ULS** (Uyghur Latin Script), **UAS** (Uyghur Arabic Script),
**CTS** (Common Turkish Script), **UCS** (Uyghur Cyrillic Script), **UYS** (Uyghur Yengi (new) Script),
International Phonetic Alphabet (IPA), Uzbek Latin Script (UZLS) and Xinjiang University Script (XJUS).
## Installation
```
pip install umsc
```
## Mapping table
| UAS | CTS | ULS| UCS|UYS| IPA | UZLS | XJUS |
|-----|----| ---- | --- | -- |-------|---|------|
| ا | a | a | а |a | /ɑ/ | o | a |
| ە | e | e | ә |ə | /æ/ | a | A |
| ب | b | b | б |b | /b/ | b | b |
| پ | p | p | п |p | /p/ | p | p |
| ت | t | t | т |t | /t/ | t | t |
| ج | c | j | җ |j | /d͡ʒ/ | j | j |
| چ | ç | ch | ч |q | /t͡ʃ/ | ch | c |
| خ | x | x | х |h | /χ/ | x | H |
| د | d | d | д |d | /d/ | d | d |
| ر | r | r | р |r | /r/ | r | r |
| ز | z | z | з |z | /z/ | z | z |
| ژ | j | zh | ж |ⱬ | /ʒ/ | j | J |
| س | s | s | с |s | /s/ | s | s |
| ش | ş | sh | ш |x | /ʃ/ | sh | x |
| ف | f | f | ф |f | /f/ | f | f |
| ڭ | ñ | ng | ң |ng | /ŋ/ | ng | N |
| ل | l | l | л |l | /l/ | l | l |
| م | m | m | м |m | /m/ | m | m |
| ھ | h | h | һ |ⱨ | /h/ | h | h |
| و | o | o | о |o | /o/ | oʻ | o |
| ۇ | u | u | у |u | /u/ | u | u |
| ۆ | ö | ö | ө |ɵ | /ø/ | oʻ | O |
| ۈ | ü | ü | ү |ü | /y/ | uʻ | U |
| ۋ | v | w | в |w | /w/ | v | w |
| ې | é | é | е |e | /ɛ/ | e | e |
| ى | i | i | и |i | /i/ | i | i |
| ي | y | y | й |y | /j/ | y | y |
| ق | q | q | қ |ⱪ | /q/ | q | q |
| ك | k | k | к |k | /k/ | k | k |
| گ | g | g | г |g | /ɡ/ | g | g |
| ن | n | n | н |n | /n/ | n | n |
| غ | ğ | gh | ғ |ƣ | /ʁ/ | gʻ | G |
| ئ | | | | | | | v |
| يا | ya | ya | я |ya | | ya | ya |
| يۇ | yu | yu | ю |yu | | yu | yu |
## Sample input and output examples
Review the files in the tests directory for examples of converting between different scripts.
## Usage
```
from umsc import UgMultiScriptConverter
# To convert text, you need to define source and target scripts
# The abbreviation of scrips
# ULS | Uyghur Latin Script
# UYS | Uyghur Yengi (New) Script
# CPS | Chinese Pinyin Script
# UAS | Uyghur Arabic Script
# CTS |Common Turkic Script
# UCS | Uyghur Cyrillic Script
# XJU | Xinjinag University English Case Sensitive
# UZLS | Uzbek Latin Script
# Convert Uyghur Arabic Script to Uyghur Latin Script
source_script = 'UAS'
target_script = 'ULS'
converter = UgMultiScriptConverter(source_script, target_script)
text1 = 'ياخشىمۇسىز!'
text1 = converter(text1)
print(text1)
# Convert Uyghur Latin Script to Uyghur Arabic Script
source_script = 'ULS'
target_script = 'UAS'
converter = UgMultiScriptConverter(source_script, target_script)
text2 = 'yaxshimusiz!'
text2 = converter(text2)
print(text2)
```
## Notes
- API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map "ئا" and "ئە" to Uzbek.
## Citation
If you wish to cite this project, please use `cite this repository`.
## Contributing
Feel free to raise issue and pull request.
## License
Distributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.
Raw data
{
"_id": null,
"home_page": "https://github.com/neouyghur/ScriptConverter4Uyghur",
"name": "umsc",
"maintainer": "Osman Tursun",
"docs_url": null,
"requires_python": ">=3.0",
"maintainer_email": "mpcabd@gmail.com",
"keywords": "uyghur script converter arabic latin cyrillic IPA \u0626\u06c7\u064a\u063a\u06c7\u0631",
"author": "Osman Tursun",
"author_email": "osmanjan.t@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/c1/58/dc5faf952c59041bb3d9966affb091a26bba0084895284ff314c96033efb/umsc-0.3.0.tar.gz",
"platform": null,
"description": "# Script Converter for Uyghur Language\nThis converter converts multiple Uyghur scripts such as **ULS** (Uyghur Latin Script), **UAS** (Uyghur Arabic Script), \n**CTS** (Common Turkish Script), **UCS** (Uyghur Cyrillic Script), **UYS** (Uyghur Yengi (new) Script), \nInternational Phonetic Alphabet (IPA), Uzbek Latin Script (UZLS) and Xinjiang University Script (XJUS).\n\n## Installation\n```\npip install umsc\n```\n\n\n## Mapping table\n| UAS | CTS | ULS| UCS|UYS| IPA | UZLS | XJUS |\n|-----|----| ---- | --- | -- |-------|---|------|\n| \u0627 | a | a | \u0430 |a | /\u0251/ | o | a |\n| \u06d5 | e | e | \u04d9 |\u0259 | /\u00e6/ | a | A |\n| \u0628 | b | b | \u0431 |b | /b/ | b | b |\n| \u067e | p | p | \u043f |p | /p/ | p | p |\n| \u062a | t | t | \u0442 |t | /t/ | t | t |\n| \u062c | c | j | \u0497 |j | /d\u0361\u0292/ | j | j |\n| \u0686 | \u00e7 | ch | \u0447 |q | /t\u0361\u0283/ | ch | c |\n| \u062e | x | x | \u0445 |h | /\u03c7/ | x | H |\n| \u062f | d | d | \u0434 |d | /d/ | d | d |\n| \u0631 | r | r | \u0440 |r | /r/ | r | r |\n| \u0632 | z | z | \u0437 |z | /z/ | z | z |\n| \u0698 | j | zh | \u0436 |\u2c6c | /\u0292/ | j | J |\n| \u0633 | s | s | \u0441 |s | /s/ | s | s |\n| \u0634 | \u015f | sh | \u0448 |x | /\u0283/ | sh | x |\n| \u0641 | f | f | \u0444 |f | /f/ | f | f |\n| \u06ad | \u00f1 | ng | \u04a3 |ng | /\u014b/ | ng | N |\n| \u0644 | l | l | \u043b |l | /l/ | l | l |\n| \u0645 | m | m | \u043c |m | /m/ | m | m |\n| \u06be | h | h | \u04bb |\u2c68 | /h/ | h | h |\n| \u0648 | o | o | \u043e |o | /o/ | o\u02bb | o |\n| \u06c7 | u | u | \u0443 |u | /u/ | u | u |\n| \u06c6 | \u00f6 | \u00f6 | \u04e9 |\u0275 | /\u00f8/ | o\u02bb | O |\n| \u06c8 | \u00fc | \u00fc | \u04af |\u00fc | /y/ | u\u02bb | U |\n| \u06cb | v | w | \u0432 |w | /w/ | v | w |\n| \u06d0 | \u00e9 | \u00e9 | \u0435 |e | /\u025b/ | e | e |\n| \u0649 | i | i | \u0438 |i | /i/ | i | i |\n| \u064a | y | y | \u0439 |y | /j/ | y | y |\n| \u0642 | q | q | \u049b |\u2c6a | /q/ | q | q |\n| \u0643 | k | k | \u043a |k | /k/ | k | k |\n| \u06af | g | g | \u0433 |g | /\u0261/ | g | g |\n| \u0646 | n | n | \u043d |n | /n/ | n | n |\n| \u063a | \u011f | gh | \u0493 |\u01a3 | /\u0281/ | g\u02bb | G |\n| \u0626 | | | | | | | v |\n| \u064a\u0627 | ya | ya | \u044f |ya | | ya | ya |\n| \u064a\u06c7 | yu | yu | \u044e |yu | | yu | yu |\n\n## Sample input and output examples\n\nReview the files in the tests directory for examples of converting between different scripts.\n\n## Usage\n\n```\nfrom umsc import UgMultiScriptConverter\n# To convert text, you need to define source and target scripts\n# The abbreviation of scrips\n# ULS | Uyghur Latin Script\n# UYS | Uyghur Yengi (New) Script\n# CPS | Chinese Pinyin Script\n# UAS | Uyghur Arabic Script\n# CTS |Common Turkic Script\n# UCS | Uyghur Cyrillic Script\n# XJU | Xinjinag University English Case Sensitive\n# UZLS | Uzbek Latin Script\n# Convert Uyghur Arabic Script to Uyghur Latin Script\nsource_script = 'UAS'\ntarget_script = 'ULS'\nconverter = UgMultiScriptConverter(source_script, target_script)\ntext1 = '\u064a\u0627\u062e\u0634\u0649\u0645\u06c7\u0633\u0649\u0632!'\ntext1 = converter(text1)\nprint(text1)\n# Convert Uyghur Latin Script to Uyghur Arabic Script\nsource_script = 'ULS'\ntarget_script = 'UAS'\nconverter = UgMultiScriptConverter(source_script, target_script)\ntext2 = 'yaxshimusiz!'\ntext2 = converter(text2)\nprint(text2)\n```\n\n## Notes\n- API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map \"\u0626\u0627\" and \"\u0626\u06d5\" to Uzbek.\n\n\n## Citation\n\nIf you wish to cite this project, please use `cite this repository`. \n\n## Contributing\nFeel free to raise issue and pull request.\n\n## License\nDistributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.\n",
"bugtrack_url": null,
"license": null,
"summary": "Script Converter for Uyghur Language",
"version": "0.3.0",
"project_urls": {
"Homepage": "https://github.com/neouyghur/ScriptConverter4Uyghur"
},
"split_keywords": [
"uyghur",
"script",
"converter",
"arabic",
"latin",
"cyrillic",
"ipa",
"\u0626\u06c7\u064a\u063a\u06c7\u0631"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "58c5a4bc5591da37a09f0b6494e2e2005deefab538bb2654eb1ed74fb5483fc7",
"md5": "871027c63e1963a57908aa2a705ca9eb",
"sha256": "0aa8eb1698782f9197e08f8d38c8a0d5f8faa9f98bf1cd22193f23c1d2493a53"
},
"downloads": -1,
"filename": "umsc-0.3.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "871027c63e1963a57908aa2a705ca9eb",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.0",
"size": 22177,
"upload_time": "2024-03-27T05:24:50",
"upload_time_iso_8601": "2024-03-27T05:24:50.823377Z",
"url": "https://files.pythonhosted.org/packages/58/c5/a4bc5591da37a09f0b6494e2e2005deefab538bb2654eb1ed74fb5483fc7/umsc-0.3.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "c158dc5faf952c59041bb3d9966affb091a26bba0084895284ff314c96033efb",
"md5": "f919868fdf3f195323d41746bac9d0c5",
"sha256": "829f17ed0f21e49d68707429cc85f20e7406eb14772dfca3131bebcee2464ce4"
},
"downloads": -1,
"filename": "umsc-0.3.0.tar.gz",
"has_sig": false,
"md5_digest": "f919868fdf3f195323d41746bac9d0c5",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.0",
"size": 16686,
"upload_time": "2024-03-27T05:24:52",
"upload_time_iso_8601": "2024-03-27T05:24:52.233518Z",
"url": "https://files.pythonhosted.org/packages/c1/58/dc5faf952c59041bb3d9966affb091a26bba0084895284ff314c96033efb/umsc-0.3.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-03-27 05:24:52",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "neouyghur",
"github_project": "ScriptConverter4Uyghur",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "umsc"
}