pydips


Namepydips JSON
Version 0.0.4 PyPI version JSON
download
home_pageNone
SummaryMulti-criteria Cantonese segmentation with dashes, intermediates, pipes, and spaces.
upload_time2024-09-18 17:13:17
maintainerNone
docs_urlNone
authorNone
requires_python<4,>=3.9
licenseMIT
keywords cantonese chinese natural-language-processing
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # pydips

Multi-criteria Cantonese segmentation with **d**ashes, **i**ntermediates, **p**ipes, and **s**paces.

Note: This package is still in beta, there might be breaking changes in the future.
Currently supports macOS (Apple Silicon) and Linux (x86_64 with avx, avx2, and fma instructions)

## Install

```sh
pip install pydips
```

## Usage

```python
>>> from pydips import BertModel
>>> model = BertModel()

>>> model.cut('阿張先生嗰時好nice㗎', mode='coarse')
['阿張先生', '嗰時', '好', 'nice', '㗎']

>>> model.cut('阿張先生嗰時好nice㗎', mode='fine')
['阿', '張', '先生', '嗰', '時', '好', 'nice', '㗎']

>>> model.cut('阿張先生嗰時好nice㗎', mode='dips_str')
'阿-張|先生 嗰-時 好 nice 㗎'

>>> model.cut('阿張先生嗰時好nice㗎', mode='dips')
['S', 'D', 'P', 'I', 'S', 'D', 'S', 'S', 'I', 'I', 'I', 'S']
```

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "pydips",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<4,>=3.9",
    "maintainer_email": null,
    "keywords": "cantonese, chinese, natural-language-processing",
    "author": null,
    "author_email": "Kevin Xiang Li <kevinli020508@gmail.com>",
    "download_url": "https://files.pythonhosted.org/packages/00/e0/94bbfc9797b01b5d749d34e025638ea60d011f220e06ab868094462f74a4/pydips-0.0.4.tar.gz",
    "platform": null,
    "description": "# pydips\n\nMulti-criteria Cantonese segmentation with **d**ashes, **i**ntermediates, **p**ipes, and **s**paces.\n\nNote: This package is still in beta, there might be breaking changes in the future.\nCurrently supports macOS (Apple Silicon) and Linux (x86_64 with avx, avx2, and fma instructions)\n\n## Install\n\n```sh\npip install pydips\n```\n\n## Usage\n\n```python\n>>> from pydips import BertModel\n>>> model = BertModel()\n\n>>> model.cut('\u963f\u5f35\u5148\u751f\u55f0\u6642\u597dnice\u35ce', mode='coarse')\n['\u963f\u5f35\u5148\u751f', '\u55f0\u6642', '\u597d', 'nice', '\u35ce']\n\n>>> model.cut('\u963f\u5f35\u5148\u751f\u55f0\u6642\u597dnice\u35ce', mode='fine')\n['\u963f', '\u5f35', '\u5148\u751f', '\u55f0', '\u6642', '\u597d', 'nice', '\u35ce']\n\n>>> model.cut('\u963f\u5f35\u5148\u751f\u55f0\u6642\u597dnice\u35ce', mode='dips_str')\n'\u963f-\u5f35|\u5148\u751f \u55f0-\u6642 \u597d nice \u35ce'\n\n>>> model.cut('\u963f\u5f35\u5148\u751f\u55f0\u6642\u597dnice\u35ce', mode='dips')\n['S', 'D', 'P', 'I', 'S', 'D', 'S', 'S', 'I', 'I', 'I', 'S']\n```\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Multi-criteria Cantonese segmentation with dashes, intermediates, pipes, and spaces.",
    "version": "0.0.4",
    "project_urls": {
        "Bug Reports": "https://github.com/AlienKevin/pydips/issues",
        "Homepage": "https://github.com/AlienKevin/pydips",
        "Source": "https://github.com/AlienKevin/pydips"
    },
    "split_keywords": [
        "cantonese",
        " chinese",
        " natural-language-processing"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8e2b7397dc6e8b9707afa6287ab7dd11cb15d8d05d8e52fb78a4832dcd4b8368",
                "md5": "ab12395f3e25c8530f79473a4a9a0c89",
                "sha256": "9ff7f2b48fa253c9112b72482573f0185c7c8f2ead14e776e85fe10e76d64f3a"
            },
            "downloads": -1,
            "filename": "pydips-0.0.4-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "ab12395f3e25c8530f79473a4a9a0c89",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<4,>=3.9",
            "size": 3783890,
            "upload_time": "2024-09-18T17:13:15",
            "upload_time_iso_8601": "2024-09-18T17:13:15.143072Z",
            "url": "https://files.pythonhosted.org/packages/8e/2b/7397dc6e8b9707afa6287ab7dd11cb15d8d05d8e52fb78a4832dcd4b8368/pydips-0.0.4-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "00e094bbfc9797b01b5d749d34e025638ea60d011f220e06ab868094462f74a4",
                "md5": "fa601b5046c58288c8524a1f90ecd34d",
                "sha256": "c05bfadfac41a620fa28c3015cad5b4b9d54d601936b537a6d02f8eff5e2f2df"
            },
            "downloads": -1,
            "filename": "pydips-0.0.4.tar.gz",
            "has_sig": false,
            "md5_digest": "fa601b5046c58288c8524a1f90ecd34d",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<4,>=3.9",
            "size": 3786754,
            "upload_time": "2024-09-18T17:13:17",
            "upload_time_iso_8601": "2024-09-18T17:13:17.184230Z",
            "url": "https://files.pythonhosted.org/packages/00/e0/94bbfc9797b01b5d749d34e025638ea60d011f220e06ab868094462f74a4/pydips-0.0.4.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-09-18 17:13:17",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "AlienKevin",
    "github_project": "pydips",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "pydips"
}
        
Elapsed time: 0.33769s