# String Comparison
Text normalization for string comparison in Python.
This maps a Unicode code point key represented as an integer to a corresponding ASCII
character mapping. In some cases such as Hebrew niqqud & cantillation, we remove the
character altogether or replace certain punctuation with a space.
In addition, some characters are removed prior to normalization. For example, Unicode control chars.
## Requirements
This project requires Python 3.7+
## Installation
To install with pip
pip install string_comparison
## Usage
import string_comparison
normalized = string_comparison.normalize("é")
assert normalized == "e"
Raw data
{
"_id": null,
"home_page": "https://github.com/bbelyeu/string_comparison/",
"name": "string-comparison",
"maintainer": null,
"docs_url": null,
"requires_python": ">3.7.0",
"maintainer_email": null,
"keywords": "unicode",
"author": "Brad Belyeu",
"author_email": "bradley.belyeu@life.church",
"download_url": "https://files.pythonhosted.org/packages/77/5c/ee3dc2f0db3573f930852a1485e38a924def848bcf853bc95ca0cd757793/string_comparison-1.1.0.tar.gz",
"platform": "any",
"description": "# String Comparison\n\nText normalization for string comparison in Python.\n\nThis maps a Unicode code point key represented as an integer to a corresponding ASCII\ncharacter mapping. In some cases such as Hebrew niqqud & cantillation, we remove the\ncharacter altogether or replace certain punctuation with a space.\n\nIn addition, some characters are removed prior to normalization. For example, Unicode control chars.\n\n## Requirements\n\nThis project requires Python 3.7+\n\n## Installation\n\nTo install with pip\n\n pip install string_comparison\n\n## Usage\n\n import string_comparison\n normalized = string_comparison.normalize(\"\u00e9\")\n assert normalized == \"e\"\n\n\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Utility to assist with string comparisons.",
"version": "1.1.0",
"project_urls": {
"Download": "https://github.com/bbelyeu/string_comparison/archive/1.1.0.zip",
"Homepage": "https://github.com/bbelyeu/string_comparison/"
},
"split_keywords": [
"unicode"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "bb234338a943d6de7bc7c2dde669c08803e214eeeec12de9ca6a32b4592fe436",
"md5": "293b5ce21c430cd0fb4d12ddd28121f9",
"sha256": "46cda70eaafcf1bdc43c081f0eddcd2d2985efe137e7f71e530fc0c6440b2401"
},
"downloads": -1,
"filename": "string_comparison-1.1.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "293b5ce21c430cd0fb4d12ddd28121f9",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">3.7.0",
"size": 8118,
"upload_time": "2024-05-17T16:44:01",
"upload_time_iso_8601": "2024-05-17T16:44:01.712903Z",
"url": "https://files.pythonhosted.org/packages/bb/23/4338a943d6de7bc7c2dde669c08803e214eeeec12de9ca6a32b4592fe436/string_comparison-1.1.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "775cee3dc2f0db3573f930852a1485e38a924def848bcf853bc95ca0cd757793",
"md5": "be7c07814e438fbfc7a8660cf487b060",
"sha256": "09aa25595472cb2ba3adbaf0d642541d7854f6b90619c01b2a7a3381b01a8bac"
},
"downloads": -1,
"filename": "string_comparison-1.1.0.tar.gz",
"has_sig": false,
"md5_digest": "be7c07814e438fbfc7a8660cf487b060",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">3.7.0",
"size": 6813,
"upload_time": "2024-05-17T16:44:03",
"upload_time_iso_8601": "2024-05-17T16:44:03.525773Z",
"url": "https://files.pythonhosted.org/packages/77/5c/ee3dc2f0db3573f930852a1485e38a924def848bcf853bc95ca0cd757793/string_comparison-1.1.0.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-05-17 16:44:03",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "bbelyeu",
"github_project": "string_comparison",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"requirements": [],
"lcname": "string-comparison"
}