confusable_homoglyphs


Nameconfusable_homoglyphs JSON
Version 3.2.0 PyPI version JSON
download
home_pagehttps://github.com/vhf/confusable_homoglyphs
SummaryDetect confusable usage of unicode homoglyphs, prevent homograph attacks.
upload_time2018-08-31 13:04:04
maintainer
docs_urlNone
authorVictor Felder
requires_python
licenseMIT
keywords confusable
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI
coveralls test coverage No coveralls.
            confusable_homoglyphs `[doc] <http://confusable-homoglyphs.readthedocs.io/en/latest/>`__
========================================================================================

.. image:: https://img.shields.io/travis/vhf/confusable_homoglyphs.svg
        :target: https://travis-ci.org/vhf/confusable_homoglyphs

.. image:: https://img.shields.io/pypi/v/confusable_homoglyphs.svg
        :target: https://pypi.python.org/pypi/confusable_homoglyphs

.. image:: https://readthedocs.org/projects/confusable_homoglyphs/badge/?version=latest
        :target: http://confusable-homoglyphs.readthedocs.io/en/latest/
        :alt: Documentation Status

*a homoglyph is one of two or more graphemes, characters, or glyphs with
shapes that appear identical or very similar*
`wikipedia:Homoglyph <https://en.wikipedia.org/wiki/Homoglyph>`__

Unicode homoglyphs can be a nuisance on the web. Your most popular
client, AlaskaJazz, might be upset to be impersonated by a trickster who
deliberately chose the username ΑlaskaJazz.

-  ``AlaskaJazz`` is single script: only Latin characters.
-  ``ΑlaskaJazz`` is mixed-script: the first character is a greek
   letter.

You might also want to avoid people being tricked into entering their
password on ``www.microsоft.com`` or ``www.faϲebook.com`` instead of
``www.microsoft.com`` or ``www.facebook.com``. `Here is a
utility <http://unicode.org/cldr/utility/confusables.jsp>`__ to play
with these **confusable homoglyphs**.

Not all mixed-script strings have to be ruled out though, you could only
exclude mixed-script strings containing characters that might be
confused with a character from some unicode blocks of your choosing.

-  ``Allo`` and ``ρττ`` are fine: single script.
-  ``AlloΓ`` is fine when our preferred script alias is 'latin': mixed script, but ``Γ`` is not confusable.
-  ``Alloρ`` is dangerous: mixed script and ``ρ`` could be confused with
   ``p``.

This library is compatible Python 2 and Python 3.

`API documentation <http://confusable-homoglyphs.readthedocs.io/en/latest/apidocumentation.html>`__
---------------------------------------------------------------------------------------------------

Is the data up to date?
-----------------------

Yep.

The unicode blocks aliases and names for each character are extracted
from `this file <http://www.unicode.org/Public/UNIDATA/Scripts.txt>`__
provided by the unicode consortium.

The matrix of which character can be confused with which other
characters is built using `this
file <http://www.unicode.org/Public/security/latest/confusables.txt>`__
provided by the unicode consortium.

This data is stored in two JSON files: ``categories.json`` and
``confusables.json``. If you delete them, they will both be recreated by
downloading and parsing the two abovementioned files and stored as JSON
files again.




History
-------

1.0.0
=====

Initial release.

2.0.0
=====

- `allowed_categories` renamed to `allowed_aliases`

2.0.1
=====

- Fix a TypeError: https://github.com/vhf/confusable_homoglyphs/pull/2

3.0.0
=====

Courtesy of Ryan P Kilby, via https://github.com/vhf/confusable_homoglyphs/pull/6 :

- Changed file paths to be relative to the `confusable_homoglyphs` package directory instead of the user's current working directory.
- Data files are now distributed with the packaging.
- Fixes tests so that they use the installed distribution instead of the local files. (Originally, the data files were erroneously showing up during testing, despite not being included in the distribution).
- Moves the data file generation into a simple CLI. This way, users have a method for controlling when the data files are updated.
- Since the data files are now included in the distribution, the CLI is made optional. Its dependencies can be installed with the `cli` bundle, eg. `pip install confusable_homoglyphs[cli]`.


            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/vhf/confusable_homoglyphs",
    "name": "confusable_homoglyphs",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "confusable",
    "author": "Victor Felder",
    "author_email": "victorfelder@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/62/55/0aac1a100d755987e62c12367724b33419ed15921c6a915dc257c886ceff/confusable_homoglyphs-3.2.0.tar.gz",
    "platform": "",
    "description": "confusable_homoglyphs `[doc] <http://confusable-homoglyphs.readthedocs.io/en/latest/>`__\n========================================================================================\n\n.. image:: https://img.shields.io/travis/vhf/confusable_homoglyphs.svg\n        :target: https://travis-ci.org/vhf/confusable_homoglyphs\n\n.. image:: https://img.shields.io/pypi/v/confusable_homoglyphs.svg\n        :target: https://pypi.python.org/pypi/confusable_homoglyphs\n\n.. image:: https://readthedocs.org/projects/confusable_homoglyphs/badge/?version=latest\n        :target: http://confusable-homoglyphs.readthedocs.io/en/latest/\n        :alt: Documentation Status\n\n*a homoglyph is one of two or more graphemes, characters, or glyphs with\nshapes that appear identical or very similar*\n`wikipedia:Homoglyph <https://en.wikipedia.org/wiki/Homoglyph>`__\n\nUnicode homoglyphs can be a nuisance on the web. Your most popular\nclient, AlaskaJazz, might be upset to be impersonated by a trickster who\ndeliberately chose the username \u0391laskaJazz.\n\n-  ``AlaskaJazz`` is single script: only Latin characters.\n-  ``\u0391laskaJazz`` is mixed-script: the first character is a greek\n   letter.\n\nYou might also want to avoid people being tricked into entering their\npassword on ``www.micros\ufeff\u043eft.com`` or ``www.fa\u03f2ebook.com`` instead of\n``www.microsoft.com`` or ``www.facebook.com``. `Here is a\nutility <http://unicode.org/cldr/utility/confusables.jsp>`__ to play\nwith these **confusable homoglyphs**.\n\nNot all mixed-script strings have to be ruled out though, you could only\nexclude mixed-script strings containing characters that might be\nconfused with a character from some unicode blocks of your choosing.\n\n-  ``Allo`` and ``\u03c1\u03c4\u03c4`` are fine: single script.\n-  ``Allo\u0393`` is fine when our preferred script alias is 'latin': mixed script, but ``\u0393`` is not confusable.\n-  ``Allo\u03c1`` is dangerous: mixed script and ``\u03c1`` could be confused with\n   ``p``.\n\nThis library is compatible Python 2 and Python 3.\n\n`API documentation <http://confusable-homoglyphs.readthedocs.io/en/latest/apidocumentation.html>`__\n---------------------------------------------------------------------------------------------------\n\nIs the data up to date?\n-----------------------\n\nYep.\n\nThe unicode blocks aliases and names for each character are extracted\nfrom `this file <http://www.unicode.org/Public/UNIDATA/Scripts.txt>`__\nprovided by the unicode consortium.\n\nThe matrix of which character can be confused with which other\ncharacters is built using `this\nfile <http://www.unicode.org/Public/security/latest/confusables.txt>`__\nprovided by the unicode consortium.\n\nThis data is stored in two JSON files: ``categories.json`` and\n``confusables.json``. If you delete them, they will both be recreated by\ndownloading and parsing the two abovementioned files and stored as JSON\nfiles again.\n\n\n\n\nHistory\n-------\n\n1.0.0\n=====\n\nInitial release.\n\n2.0.0\n=====\n\n- `allowed_categories` renamed to `allowed_aliases`\n\n2.0.1\n=====\n\n- Fix a TypeError: https://github.com/vhf/confusable_homoglyphs/pull/2\n\n3.0.0\n=====\n\nCourtesy of Ryan P Kilby, via https://github.com/vhf/confusable_homoglyphs/pull/6 :\n\n- Changed file paths to be relative to the `confusable_homoglyphs` package directory instead of the user's current working directory.\n- Data files are now distributed with the packaging.\n- Fixes tests so that they use the installed distribution instead of the local files. (Originally, the data files were erroneously showing up during testing, despite not being included in the distribution).\n- Moves the data file generation into a simple CLI. This way, users have a method for controlling when the data files are updated.\n- Since the data files are now included in the distribution, the CLI is made optional. Its dependencies can be installed with the `cli` bundle, eg. `pip install confusable_homoglyphs[cli]`.\n\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Detect confusable usage of unicode homoglyphs, prevent homograph attacks.",
    "version": "3.2.0",
    "project_urls": {
        "Homepage": "https://github.com/vhf/confusable_homoglyphs"
    },
    "split_keywords": [
        "confusable"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "a438aadd1c42ae353943cd1523cdd38ffb267ba95861b4e8bb1a559d975ae612",
                "md5": "eff03a1e88ff8583f7c7ef896922481d",
                "sha256": "e3ce611028d882b74a5faa69e3cbb5bd4dcd9f69936da6e73d33eda42c917944"
            },
            "downloads": -1,
            "filename": "confusable_homoglyphs-3.2.0-py2.py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "eff03a1e88ff8583f7c7ef896922481d",
            "packagetype": "bdist_wheel",
            "python_version": "2.7",
            "requires_python": null,
            "size": 141197,
            "upload_time": "2018-08-31T13:04:08",
            "upload_time_iso_8601": "2018-08-31T13:04:08.055754Z",
            "url": "https://files.pythonhosted.org/packages/a4/38/aadd1c42ae353943cd1523cdd38ffb267ba95861b4e8bb1a559d975ae612/confusable_homoglyphs-3.2.0-py2.py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "62550aac1a100d755987e62c12367724b33419ed15921c6a915dc257c886ceff",
                "md5": "e8f03a9076c079f2d304db3a9848b709",
                "sha256": "3b4a0d9fa510669498820c91a0bfc0c327568cecec90648cf3819d4a6fc6a751"
            },
            "downloads": -1,
            "filename": "confusable_homoglyphs-3.2.0.tar.gz",
            "has_sig": false,
            "md5_digest": "e8f03a9076c079f2d304db3a9848b709",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 161937,
            "upload_time": "2018-08-31T13:04:04",
            "upload_time_iso_8601": "2018-08-31T13:04:04.501016Z",
            "url": "https://files.pythonhosted.org/packages/62/55/0aac1a100d755987e62c12367724b33419ed15921c6a915dc257c886ceff/confusable_homoglyphs-3.2.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2018-08-31 13:04:04",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "vhf",
    "github_project": "confusable_homoglyphs",
    "travis_ci": true,
    "coveralls": false,
    "github_actions": false,
    "requirements": [],
    "tox": true,
    "lcname": "confusable_homoglyphs"
}
        
Elapsed time: 0.18636s