# chinormfilter
[![PyPi version](https://img.shields.io/pypi/v/chinormfilter.svg)](https://pypi.python.org/pypi/chinormfilter/)
![PyTest](https://github.com/po3rin/chinormfilter/workflows/PyTest/badge.svg)
[![](https://img.shields.io/badge/python-3.11+-blue.svg)](https://www.python.org/downloads/)
![](https://img.shields.io/pypi/l/chinormfilter)
Filters synonym entries written in Lucene format, removing those that duplicate what Sudachi normalization already handles. It is mainly useful when migrating to the Sudachi analyzer.
## Usage
```sh
$ chinormfilter tests/test.txt -o out.txt
```
The example below shows the input synonyms first, then the result after filtering.
```txt
レナリドミド,レナリドマイド
リンゴ => 林檎
飲む,呑む
tlc => tlc,全肺気量
リンたんぱく質,リン蛋白質,リンタンパク質
↓ filter
レナリドミド,レナリドマイド
tlc => tlc,全肺気量
```
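The example above suggests the rule being applied: a synonym line is dropped when every term on it already normalizes to the same form, so the entry adds nothing once Sudachi normalization is in place. The following is a minimal sketch of that idea, not the actual implementation of chinormfilter. `TOY_NORMALIZED`, `toy_normalize`, `split_terms`, and `filter_synonyms` are hypothetical names, and the lookup table is a hand-written stand-in for Sudachi's normalization so that the sketch runs without a dictionary installed.

```python
from typing import Callable, Iterable, List

# Hypothetical stand-in for Sudachi normalization (illustration only).
# A real run would ask Sudachi for each term's normalized form instead.
TOY_NORMALIZED = {
    "リンゴ": "林檎",
    "呑む": "飲む",
    "リンたんぱく質": "リン蛋白質",
    "リンタンパク質": "リン蛋白質",
}


def toy_normalize(term: str) -> str:
    """Return the toy normalized form, or the term itself if unknown."""
    return TOY_NORMALIZED.get(term, term)


def split_terms(line: str) -> List[str]:
    """Collect every term on a synonym line, from both sides of '=>'."""
    terms: List[str] = []
    for side in line.split("=>"):
        terms.extend(t.strip() for t in side.split(",") if t.strip())
    return terms


def filter_synonyms(
    lines: Iterable[str],
    normalize: Callable[[str], str] = toy_normalize,
) -> List[str]:
    """Keep only lines whose terms do not all share one normalized form."""
    kept: List[str] = []
    for line in lines:
        stripped = line.strip()
        if not stripped:
            continue  # skip blank lines
        if stripped.startswith("#"):
            kept.append(stripped)  # pass comment lines through unchanged
            continue
        forms = {normalize(term) for term in split_terms(stripped)}
        if len(forms) > 1:
            kept.append(stripped)  # normalization alone does not cover this entry
    return kept


if __name__ == "__main__":
    synonyms = [
        "レナリドミド,レナリドマイド",
        "リンゴ => 林檎",
        "飲む,呑む",
        "tlc => tlc,全肺気量",
        "リンたんぱく質,リン蛋白質,リンタンパク質",
    ]
    for entry in filter_synonyms(synonyms):
        print(entry)
```

Passing the normalizer as a parameter keeps the filtering rule separate from the dictionary lookup, so the toy table could be swapped for a Sudachi-backed function without touching the rest of the sketch.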
### Specify system dict
```sh
$ chinormfilter tests/test.txt -s full -o out.txt
```
### Use Custom Dict
Specify the dictionary through a `sudachi.json` settings file.
```sh
$ chinormfilter tests/test.txt -s sudachi.json -o out.txt
```
## TODO
- [ ] custom dict test