umsc


Nameumsc JSON
Version 0.4.0 PyPI version JSON
download
home_pagehttps://github.com/neouyghur/ScriptConverter4Uyghur
SummaryScript Converter for Uyghur Language
upload_time2025-08-17 23:45:26
maintainerOsman Tursun
docs_urlNone
authorOsman Tursun
requires_python>=3.0
licenseNone
keywords uyghur script converter arabic latin cyrillic ipa ئۇيغۇر
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Script Converter for Uyghur Language
This converter supports multiple Uyghur writing systems:
- **ULS** — Uyghur Latin Script  
- **UAS** — Uyghur Arabic Script  
- **CTS** — Common Turkish Script  
- **UCS** — Uyghur Cyrillic Script  
- **UYS** — Uyghur Yengi (New) Script  
- **IPA** — International Phonetic Alphabet  
- **UZLS** — Uzbek Latin Script  
- **XJUS** — Xinjiang University Script  

## Installation
```
pip install umsc
```


## Mapping table
| UAS | CTS | ULS| UCS|UYS| IPA   | UZLS | XJUS |
|-----|----| ---- | --- | -- |-------|---|------|
| ا   | a  | a    | а   |a | /ɑ/   | o | a    |
| ە   | e  | e    | ә   |ə | /æ/   | a | A    |
| ب   | b  | b    | б   |b | /b/   | b | b    |
| پ   | p  | p    | п   |p | /p/   | p | p    |
| ت   | t  | t    | т   |t | /t/   | t | t    |
| ج   | c  | j    | җ   |j | /d͡ʒ/ |  j | j   |
| چ   | ç  | ch   | ч   |q | /t͡ʃ/ | ch | c   |
| خ   | x  | x    | х   |h | /χ/   | x | H    |
| د   | d  | d    | д   |d | /d/   | d | d    |
| ر   | r  | r    | р   |r | /r/   | r | r    |
| ز   | z  | z    | з   |z | /z/   | z | z    |
| ژ   | j  | zh   | ж   |ⱬ | /ʒ/   |  j | J   |
| س   | s  | s    | с   |s | /s/   | s | s    |
| ش   | ş  | sh   | ш   |x | /ʃ/   | sh | x   |
| ف   | f  | f    | ф   |f | /f/   | f | f    |
| ڭ   | ñ  | ng   | ң   |ng | /ŋ/   | ng | N  |
| ل   | l  | l    | л   |l | /l/   | l | l    |
| م   | m  | m    | м   |m | /m/   | m | m    |
| ھ   | h  | h    | һ   |ⱨ | /h/   | h | h    |
| و   | o  | o    | о   |o | /o/   | oʻ | o    |
| ۇ   | u  | u    | у   |u | /u/   | u | u    |
| ۆ   | ö  | ö    | ө   |ɵ | /ø/   | oʻ | O   |
| ۈ   | ü  | ü    | ү   |ü | /y/   | uʻ | U   |
| ۋ   | v  | w    | в   |w | /w/   | v | w    |
| ې   | é  | é    | е   |e | /ɛ/   | e | e    |
| ى   | i  | i    | и   |i | /i/   | i | i    |
| ي   | y  | y    | й   |y | /j/   | y | y    |
| ق   | q  | q    | қ   |ⱪ | /q/   | q | q    |
| ك   | k  | k    | к   |k | /k/   | k | k    |
| گ   | g  | g    | г   |g | /ɡ/   | g | g    |
| ن   | n  | n    | н   |n | /n/   | n | n    |
| غ   | ğ  | gh   | ғ   |ƣ | /ʁ/   | gʻ | G   |
| ئ   |    |      |     | |       |   | v    |
| يا  | ya | ya   | я   |ya |       | ya | ya   |
| يۇ  | yu | yu   | ю   |yu |       | yu | yu   |

## Sample input and output examples

Review the files in the tests directory for examples of converting between different scripts.

## Usage

```
from umsc import UgMultiScriptConverter
# To convert text, you need to define source and target scripts
# The abbreviation of scrips
# ULS | Uyghur Latin Script
# UYS | Uyghur Yengi (New) Script
# CPS | Chinese Pinyin Script
# UAS | Uyghur Arabic Script
# CTS |Common Turkic Script
# UCS | Uyghur Cyrillic Script
# XJU | Xinjinag University English Case Sensitive
# UZLS | Uzbek Latin Script
# Convert Uyghur Arabic Script to Uyghur Latin Script
source_script = 'UAS'
target_script = 'ULS'
converter = UgMultiScriptConverter(source_script, target_script)
text1 = 'ياخشىمۇسىز!'
text1 = converter(text1)
print(text1)
# Convert Uyghur Latin Script to Uyghur Arabic Script
source_script = 'ULS'
target_script = 'UAS'
converter = UgMultiScriptConverter(source_script, target_script)
text2 = 'yaxshimusiz!'
text2 = converter(text2)
print(text2)
```

## Notes
- API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map "ئا" and "ئە" to Uzbek.


## Citation

If you wish to cite this project, please use `cite this repository`. 

## Contributing
Feel free to raise issue and pull request.

## License
Distributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/neouyghur/ScriptConverter4Uyghur",
    "name": "umsc",
    "maintainer": "Osman Tursun",
    "docs_url": null,
    "requires_python": ">=3.0",
    "maintainer_email": "mpcabd@gmail.com",
    "keywords": "uyghur script converter arabic latin cyrillic IPA \u0626\u06c7\u064a\u063a\u06c7\u0631",
    "author": "Osman Tursun",
    "author_email": "osmanjan.t@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/c0/46/1b7de1dd1698505730b727a1b8d67fcc7daf841101edd171b3f816ab434f/umsc-0.4.0.tar.gz",
    "platform": null,
    "description": "# Script Converter for Uyghur Language\nThis converter supports multiple Uyghur writing systems:\n- **ULS** \u2014 Uyghur Latin Script  \n- **UAS** \u2014 Uyghur Arabic Script  \n- **CTS** \u2014 Common Turkish Script  \n- **UCS** \u2014 Uyghur Cyrillic Script  \n- **UYS** \u2014 Uyghur Yengi (New) Script  \n- **IPA** \u2014 International Phonetic Alphabet  \n- **UZLS** \u2014 Uzbek Latin Script  \n- **XJUS** \u2014 Xinjiang University Script  \n\n## Installation\n```\npip install umsc\n```\n\n\n## Mapping table\n| UAS | CTS | ULS| UCS|UYS| IPA   | UZLS | XJUS |\n|-----|----| ---- | --- | -- |-------|---|------|\n| \u0627   | a  | a    | \u0430   |a | /\u0251/   | o | a    |\n| \u06d5   | e  | e    | \u04d9   |\u0259 | /\u00e6/   | a | A    |\n| \u0628   | b  | b    | \u0431   |b | /b/   | b | b    |\n| \u067e   | p  | p    | \u043f   |p | /p/   | p | p    |\n| \u062a   | t  | t    | \u0442   |t | /t/   | t | t    |\n| \u062c   | c  | j    | \u0497   |j | /d\u0361\u0292/ |  j | j   |\n| \u0686   | \u00e7  | ch   | \u0447   |q | /t\u0361\u0283/ | ch | c   |\n| \u062e   | x  | x    | \u0445   |h | /\u03c7/   | x | H    |\n| \u062f   | d  | d    | \u0434   |d | /d/   | d | d    |\n| \u0631   | r  | r    | \u0440   |r | /r/   | r | r    |\n| \u0632   | z  | z    | \u0437   |z | /z/   | z | z    |\n| \u0698   | j  | zh   | \u0436   |\u2c6c | /\u0292/   |  j | J   |\n| \u0633   | s  | s    | \u0441   |s | /s/   | s | s    |\n| \u0634   | \u015f  | sh   | \u0448   |x | /\u0283/   | sh | x   |\n| \u0641   | f  | f    | \u0444   |f | /f/   | f | f    |\n| \u06ad   | \u00f1  | ng   | \u04a3   |ng | /\u014b/   | ng | N  |\n| \u0644   | l  | l    | \u043b   |l | /l/   | l | l    |\n| \u0645   | m  | m    | \u043c   |m | /m/   | m | m    |\n| \u06be   | h  | h    | \u04bb   |\u2c68 | /h/   | h | h    |\n| \u0648   | o  | o    | \u043e   |o | /o/   | o\u02bb | o    |\n| \u06c7   | u  | u    | \u0443   |u | /u/   | u | u    |\n| \u06c6   | \u00f6  | \u00f6    | \u04e9   |\u0275 | /\u00f8/   | o\u02bb | O   |\n| \u06c8   | \u00fc  | \u00fc    | \u04af   |\u00fc | /y/   | u\u02bb | U   |\n| \u06cb   | v  | w    | \u0432   |w | /w/   | v | w    |\n| \u06d0   | \u00e9  | \u00e9    | \u0435   |e | /\u025b/   | e | e    |\n| \u0649   | i  | i    | \u0438   |i | /i/   | i | i    |\n| \u064a   | y  | y    | \u0439   |y | /j/   | y | y    |\n| \u0642   | q  | q    | \u049b   |\u2c6a | /q/   | q | q    |\n| \u0643   | k  | k    | \u043a   |k | /k/   | k | k    |\n| \u06af   | g  | g    | \u0433   |g | /\u0261/   | g | g    |\n| \u0646   | n  | n    | \u043d   |n | /n/   | n | n    |\n| \u063a   | \u011f  | gh   | \u0493   |\u01a3 | /\u0281/   | g\u02bb | G   |\n| \u0626   |    |      |     | |       |   | v    |\n| \u064a\u0627  | ya | ya   | \u044f   |ya |       | ya | ya   |\n| \u064a\u06c7  | yu | yu   | \u044e   |yu |       | yu | yu   |\n\n## Sample input and output examples\n\nReview the files in the tests directory for examples of converting between different scripts.\n\n## Usage\n\n```\nfrom umsc import UgMultiScriptConverter\n# To convert text, you need to define source and target scripts\n# The abbreviation of scrips\n# ULS | Uyghur Latin Script\n# UYS | Uyghur Yengi (New) Script\n# CPS | Chinese Pinyin Script\n# UAS | Uyghur Arabic Script\n# CTS |Common Turkic Script\n# UCS | Uyghur Cyrillic Script\n# XJU | Xinjinag University English Case Sensitive\n# UZLS | Uzbek Latin Script\n# Convert Uyghur Arabic Script to Uyghur Latin Script\nsource_script = 'UAS'\ntarget_script = 'ULS'\nconverter = UgMultiScriptConverter(source_script, target_script)\ntext1 = '\u064a\u0627\u062e\u0634\u0649\u0645\u06c7\u0633\u0649\u0632!'\ntext1 = converter(text1)\nprint(text1)\n# Convert Uyghur Latin Script to Uyghur Arabic Script\nsource_script = 'ULS'\ntarget_script = 'UAS'\nconverter = UgMultiScriptConverter(source_script, target_script)\ntext2 = 'yaxshimusiz!'\ntext2 = converter(text2)\nprint(text2)\n```\n\n## Notes\n- API and Uzbek are currently developing. The mapping might not be very accurate. Especially for Uzbek, it is not very clear how to map \"\u0626\u0627\" and \"\u0626\u06d5\" to Uzbek.\n\n\n## Citation\n\nIf you wish to cite this project, please use `cite this repository`. \n\n## Contributing\nFeel free to raise issue and pull request.\n\n## License\nDistributed under the Apache 2.0 License. See [`LICENSE`](LICENSE) for more information.\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Script Converter for Uyghur Language",
    "version": "0.4.0",
    "project_urls": {
        "Homepage": "https://github.com/neouyghur/ScriptConverter4Uyghur"
    },
    "split_keywords": [
        "uyghur",
        "script",
        "converter",
        "arabic",
        "latin",
        "cyrillic",
        "ipa",
        "\u0626\u06c7\u064a\u063a\u06c7\u0631"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e2fa23769066813103fc1b9b2c8df7cb0a858dd313b331b4996ceb34663343c6",
                "md5": "42090f4ff0688c061933ea5b6667fd29",
                "sha256": "558d1ac9aea3e17466c8f28c90019fa53ec0af9b56e6f0104ecdadc4f8147bef"
            },
            "downloads": -1,
            "filename": "umsc-0.4.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "42090f4ff0688c061933ea5b6667fd29",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.0",
            "size": 23533,
            "upload_time": "2025-08-17T23:45:25",
            "upload_time_iso_8601": "2025-08-17T23:45:25.124225Z",
            "url": "https://files.pythonhosted.org/packages/e2/fa/23769066813103fc1b9b2c8df7cb0a858dd313b331b4996ceb34663343c6/umsc-0.4.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c0461b7de1dd1698505730b727a1b8d67fcc7daf841101edd171b3f816ab434f",
                "md5": "d0b78342c8425385de710ba4da385ec7",
                "sha256": "3f7ec701d8897569cee95f775e29ac97fd5fec9caf9555bda32686eaa1ccf6b9"
            },
            "downloads": -1,
            "filename": "umsc-0.4.0.tar.gz",
            "has_sig": false,
            "md5_digest": "d0b78342c8425385de710ba4da385ec7",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.0",
            "size": 17762,
            "upload_time": "2025-08-17T23:45:26",
            "upload_time_iso_8601": "2025-08-17T23:45:26.497645Z",
            "url": "https://files.pythonhosted.org/packages/c0/46/1b7de1dd1698505730b727a1b8d67fcc7daf841101edd171b3f816ab434f/umsc-0.4.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-17 23:45:26",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "neouyghur",
    "github_project": "ScriptConverter4Uyghur",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "umsc"
}
        
Elapsed time: 1.44852s