chinormfilter


Namechinormfilter JSON
Version 0.5.3 PyPI version JSON
download
home_pagehttp://github.com/po3rin/chinormfilter
Summary
upload_time2023-07-07 15:33:25
maintainer
docs_urlNone
authorpo3rin
requires_python>=3.11,<4.0
licenseApache-2.0
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # chinormfilter

[![PyPi version](https://img.shields.io/pypi/v/chinormfilter.svg)](https://pypi.python.org/pypi/chinormfilter/)
![PyTest](https://github.com/po3rin/chinormfilter/workflows/PyTest/badge.svg)
[![](https://img.shields.io/badge/python-3.7+-blue.svg)](https://www.python.org/downloads/release/python-390/)
![](https://img.shields.io/pypi/l/chinormfilter)

Filter synonym written in lucene format to avoid duplication with Sudachi normalization. Mainly used when migrating to sudachi analyzer.

## Usage

```sh
$ chinormfilter tests/test.txt -o out.txt
```

filtered result is following.

```txt
レナリドミド,レナリドマイド
リンゴ => 林檎
飲む,呑む
tlc => tlc,全肺気量
リンたんぱく質,リン蛋白質,リンタンパク質

↓ filter

レナリドミド,レナリドマイド
tlc => tlc,全肺気量
```

### Specify system dict

```sh
$ chinormfilter tests/test.txt -s full -o out.txt
```

### Use Custom Dict

Specify dict via sudachi.json

```sh
$ chinormfilter tests/test.txt -s sudachi.json -o out.txt
```

## TODO
- [ ] custom dict test


            

Raw data

            {
    "_id": null,
    "home_page": "http://github.com/po3rin/chinormfilter",
    "name": "chinormfilter",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.11,<4.0",
    "maintainer_email": "",
    "keywords": "",
    "author": "po3rin",
    "author_email": "abctail30@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/5e/a2/25954bab0cecbcb149f9cd0fa6d15caa108b40bb8b3cb5a1627e25f06109/chinormfilter-0.5.3.tar.gz",
    "platform": null,
    "description": "# chinormfilter\n\n[![PyPi version](https://img.shields.io/pypi/v/chinormfilter.svg)](https://pypi.python.org/pypi/chinormfilter/)\n![PyTest](https://github.com/po3rin/chinormfilter/workflows/PyTest/badge.svg)\n[![](https://img.shields.io/badge/python-3.7+-blue.svg)](https://www.python.org/downloads/release/python-390/)\n![](https://img.shields.io/pypi/l/chinormfilter)\n\nFilter synonym written in lucene format to avoid duplication with Sudachi normalization. Mainly used when migrating to sudachi analyzer.\n\n## Usage\n\n```sh\n$ chinormfilter tests/test.txt -o out.txt\n```\n\nfiltered result is following.\n\n```txt\n\u30ec\u30ca\u30ea\u30c9\u30df\u30c9,\u30ec\u30ca\u30ea\u30c9\u30de\u30a4\u30c9\n\u30ea\u30f3\u30b4 => \u6797\u6a8e\n\u98f2\u3080,\u5451\u3080\ntlc => tlc,\u5168\u80ba\u6c17\u91cf\n\u30ea\u30f3\u305f\u3093\u3071\u304f\u8cea,\u30ea\u30f3\u86cb\u767d\u8cea,\u30ea\u30f3\u30bf\u30f3\u30d1\u30af\u8cea\n\n\u2193 filter\n\n\u30ec\u30ca\u30ea\u30c9\u30df\u30c9,\u30ec\u30ca\u30ea\u30c9\u30de\u30a4\u30c9\ntlc => tlc,\u5168\u80ba\u6c17\u91cf\n```\n\n### Specify system dict\n\n```sh\n$ chinormfilter tests/test.txt -s full -o out.txt\n```\n\n### Use Custom Dict\n\nSpecify dict via sudachi.json\n\n```sh\n$ chinormfilter tests/test.txt -s sudachi.json -o out.txt\n```\n\n## TODO\n- [ ] custom dict test\n\n",
    "bugtrack_url": null,
    "license": "Apache-2.0",
    "summary": "",
    "version": "0.5.3",
    "project_urls": {
        "Homepage": "http://github.com/po3rin/chinormfilter",
        "Repository": "http://github.com/po3rin/chinormfilter"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "017ee134bd54901bd904dd5dde58e45ffe3cc801732c184b6973faac0f543a67",
                "md5": "f65a69eadb22b3d77c80147030ffebdb",
                "sha256": "28b181d50d78bfb94e8c3fbe39038ef47cff4c889f6ba3d2a9fe3ef8451a8380"
            },
            "downloads": -1,
            "filename": "chinormfilter-0.5.3-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "f65a69eadb22b3d77c80147030ffebdb",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.11,<4.0",
            "size": 7013,
            "upload_time": "2023-07-07T15:33:23",
            "upload_time_iso_8601": "2023-07-07T15:33:23.367426Z",
            "url": "https://files.pythonhosted.org/packages/01/7e/e134bd54901bd904dd5dde58e45ffe3cc801732c184b6973faac0f543a67/chinormfilter-0.5.3-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "5ea225954bab0cecbcb149f9cd0fa6d15caa108b40bb8b3cb5a1627e25f06109",
                "md5": "31ea1eb8fea7151ca26c485683f0430e",
                "sha256": "67fcfe6c5d191dcf505a3d7d73292b01f3da696a3933c1f4395bb5baead34f63"
            },
            "downloads": -1,
            "filename": "chinormfilter-0.5.3.tar.gz",
            "has_sig": false,
            "md5_digest": "31ea1eb8fea7151ca26c485683f0430e",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.11,<4.0",
            "size": 6152,
            "upload_time": "2023-07-07T15:33:25",
            "upload_time_iso_8601": "2023-07-07T15:33:25.410877Z",
            "url": "https://files.pythonhosted.org/packages/5e/a2/25954bab0cecbcb149f9cd0fa6d15caa108b40bb8b3cb5a1627e25f06109/chinormfilter-0.5.3.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-07-07 15:33:25",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "po3rin",
    "github_project": "chinormfilter",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "chinormfilter"
}
        
Elapsed time: 0.08805s