:Name: ghalatawi
:Version: 0.3
:Home page: http://ghalatawi.sourceforge.net/
:Summary: ghalatawi: Arabic autocorrect library
:Upload time: 2023-01-05 15:13:15
:Docs URL: https://pythonhosted.org/ghalatawi/
:Author: Taha Zerrouki
:License: GPL

Ghalatawi غلطاوي
================

Ghalatawi: an Arabic autocorrect library.

.. figure:: doc/ghalatawi_header.png
   :alt: ghalatawi logo

   ghalatawi logo

.. figure:: https://img.shields.io/pypi/dm/ghalatawi
   :alt: PyPI - Downloads

   PyPI - Downloads

Developers: Taha Zerrouki (http://tahadz.com), taha dot zerrouki at gmail
dot com

=========  ====================================================================
Features   Value
=========  ====================================================================
Authors    `Authors.md <https://github.com/linuxscout/ghalatawi/main/AUTHORS.md>`__
Release    0.1
License    `GPL <https://github.com/linuxscout/ghalatawi/main/LICENSE>`__
Tracker    `linuxscout/ghalatawi/Issues <https://github.com/linuxscout/ghalatawi/issues>`__
Source     `GitHub <http://github.com/linuxscout/ghalatawi>`__
Feedback   `Comments <https://github.com/linuxscout/ghalatawi/>`__
Accounts   `@Twitter <https://twitter.com/linuxscout>`__
=========  ====================================================================

Description
-----------

ghalatawi: Arabic Autocorrect library

Spell checking is one of the most important tools in publishing and
electronic writing. It is used in search and queries, and it is a key
companion to office software. In some cases what is needed is
autocorrection: suggesting corrections for common writing errors, such
as two swapped letters or a missed key press that produces a wrong word.
Errors that are common in Arabic writing include confusing Hamzat
al-Wasl with Hamzat al-Qat' (Alef without Hamza versus Alef with Hamza),
confusing Dad with Zah, omitting the two dots of the Yeh, and writing
Teh Marbuta as Heh (omitting its dots).

AutoCorrect (text replacement, replace as you type) is a software
function commonly found in word processors such as
OpenOffice.org/LibreOffice. Its principal purpose is to correct common
spelling or typing errors, saving time for the user. It is also used to
automatically format text or insert special characters by recognizing
particular character usage, saving the user from having to use more
tedious functions (cf. Wikipedia).

This project aims to build a list of common misspelled words with their
corrections, together with a list of regular expressions that capture
some of these cases, for Arabic autocorrection.
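
The two ingredients named above, a word list and a list of regular
expressions, can be combined roughly as follows. This is a minimal
standalone sketch, not the library's implementation: ``WORDLIST``,
``REGEX_RULES``, ``correct_word`` and ``correct_text`` are hypothetical
names, and the entries are taken from the examples in this README
rather than from the data shipped with ghalatawi.

.. code:: python

   import re

   # Hypothetical word list: common misspelling -> correction.
   WORDLIST = {
       "اذا": "إذا",
       "او": "أو",
       "فى": "في",
       "الى": "إلى",
   }

   # Hypothetical regex rules as (compiled pattern, replacement) pairs.
   # Group 1 captures an optional definite article so it can be kept.
   REGEX_RULES = [
       (re.compile(r"^((?:ال)?)إست"), r"\1است"),
       (re.compile(r"^((?:ال)?)إجتما"), r"\1اجتما"),
   ]


   def correct_word(word):
       """Return the corrected form of a word, or the word unchanged."""
       # 1. Exact lookup in the word list.
       if word in WORDLIST:
           return WORDLIST[word]
       # 2. First matching regular-expression rule.
       for pattern, replacement in REGEX_RULES:
           if pattern.search(word):
               return pattern.sub(replacement, word)
       return word


   def correct_text(text):
       """Correct a text word by word, splitting on whitespace."""
       return " ".join(correct_word(w) for w in text.split())


   print(correct_text("اذا أردت إستعارة كتاب"))

The word list handles whole-word substitutions, while the regular
expressions cover families of words that share the same mistake.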

Features
~~~~~~~~

Applications
~~~~~~~~~~~~

-  AutoCorrect lists for office suites such as OpenOffice/LibreOffice.

-  Checking common errors on sites with very large content, such as
   Wikipedia.

-  Assisting writers in messaging, translation, and similar software.

-  Checking linguistic corpora.

-  Correcting search queries.

-  Correcting typing on mobile devices such as smartphones.

Usage
~~~~~

install
~~~~~~~

.. code:: shell

   pip install ghalatawi

Requirements
^^^^^^^^^^^^

::

   pyarabic>=0.6.8

import
^^^^^^

.. code:: python

   >>> from ghalatawi.autocorrector import AutoCorrector

Examples
--------

Detailed examples and features are described in `Features <doc/features.md>`__.

-  Autocorrect a text

.. code:: python

   from ghalatawi.autocorrector import AutoCorrector
   autoco = AutoCorrector()
   text = 'اذا أردت إستعارة كتاب ، اذهب الى المكتبه او الادارة فى الضهيرة .'
   autoco.spell(text)

Output:

.. code:: python

   'إذا أردت استعارة كتاب، اذهب إلى المكتبة أو الادارة في الظهيرة.'

-  Show methods used in spelling

.. code:: python

   from ghalatawi.autocorrector import AutoCorrector
   autoco = AutoCorrector()
   autoco.show_config()

.. code:: python

   {'regex': True, 'wordlist': True, 'punct': True, 'typo': True}

-  Allow/disallow methods used in spelling

.. code:: python

   from ghalatawi.autocorrector import AutoCorrector
   autoco = AutoCorrector()
   # Disable the regex method
   autoco.unset_allow("regex")
   text = 'اذا أردت إستعارة كتاب ، اذهب الى المكتبه او الادارة فى الضهيرة .'
   # The word إستعارة will not be corrected
   expected_text = 'إذا أردت إستعارة كتاب، اذهب إلى المكتبة أو الادارة في الظهيرة.'
   result_text = autoco.spell(text)
   print(result_text == expected_text)
   # Enable the regex method again
   autoco.set_allow("regex")
   expected_text = 'إذا أردت استعارة كتاب، اذهب إلى المكتبة أو الادارة في الظهيرة.'
   result_text = autoco.spell(text)
   print(result_text == expected_text)

.. code:: python

   True
   True
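
How a configuration dictionary like the one printed by
``show_config()`` could gate the correction methods can be sketched as
below. This is an assumption about the general design, not the
library's code: ``ToggleCorrector`` is a hypothetical class that only
borrows the method names ``set_allow``, ``unset_allow``,
``show_config`` and ``spell`` from the examples above, and its two
stages are illustrative.

.. code:: python

   import re


   class ToggleCorrector:
       """Toy corrector whose stages can be switched on and off by name."""

       def __init__(self, stages):
           # stages maps a stage name to a function taking and returning a string.
           self.stages = dict(stages)
           self.config = {name: True for name in self.stages}

       def set_allow(self, name):
           self.config[name] = True

       def unset_allow(self, name):
           self.config[name] = False

       def show_config(self):
           return dict(self.config)

       def spell(self, text):
           # Apply only the enabled stages, in insertion order.
           for name, stage in self.stages.items():
               if self.config.get(name):
                   text = stage(text)
           return text


   # Two illustrative stages (not the library's rules).
   corrector = ToggleCorrector({
       "wordlist": lambda t: " ".join({"اذا": "إذا"}.get(w, w) for w in t.split(" ")),
       "regex": lambda t: re.sub(r"\bإست", "است", t),
   })

   corrector.unset_allow("regex")
   print(corrector.spell("اذا إستعارة"))  # regex stage skipped
   corrector.set_allow("regex")
   print(corrector.spell("اذا إستعارة"))  # both stages applied

Keeping the switches in one dictionary makes it easy to inspect which
methods are active before calling ``spell``.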

-  Adjust punctuation in text

   .. code:: python

      >>> from ghalatawi.autocorrector import AutoCorrector
      >>> autoco = AutoCorrector()
      >>> text = "قال : للصائم فرحتان : فرحة حين يفطر ، وفرحة حين يلقى ربه  ."
      >>> autoco.adjust_pounct(text)
      'قال: للصائم فرحتان: فرحة حين يفطر، وفرحة حين يلقى ربه.'

-  Adjust typos in text

   .. code:: python

      >>> from ghalatawi.autocorrector import AutoCorrector
      >>> autoco = AutoCorrector()        
      >>> text = "اشتريت الخبز و الحليب و الخضر و قليلا من الفاكهة."
      >>> autoco.adjust_typo(text)
      'اشتريت الخبز والحليب والخضر وقليلا من الفاكهة.'

-  Autocorrect a word by regex

   .. code:: python

      >>> from ghalatawi.autocorrector import AutoCorrector
      >>> autoco = AutoCorrector()            
      >>> word=u"الإجتماعية"
      >>> autoco.autocorrect_by_regex(word)
      الاجتماعية

-  Autocorrect a word with a given autocorrection word list

   .. code:: python

      >>> from ghalatawi.autocorrector import AutoCorrector
      >>> autoco = AutoCorrector()            
      >>> autocorrectlist={
      ...                 u'اذا':u'إذا',
      ...                 u'او':u'أو',
      ...                 u'فى':u'في',
      ...                 u'هى':u'هي',
      ...                 u'انت':u'أنت',
      ...                 u'انتما':u'أنتما',
      ...                 u'الى':u'إلى',
      ...                 u'التى':u'التي',
      ...                 u'الذى':u'الذي',
      ...                 }
      >>> word=u"اذا"
      >>> autoco.autocorrect_by_wordlist(word, autocorrectlist)
      'إذا'

-  Autocorrect a word with the default autocorrection word list

   .. code:: python

      >>> from ghalatawi.autocorrector import AutoCorrector
      >>> autoco = AutoCorrector()                    
      >>> word=u"اذا"
      >>> # default word list
      >>> autoco.autocorrect_by_wordlist(word)
      إذا

-  Load a specific word list, then autocorrect a word with it

.. code:: python

   >>> from ghalatawi.autocorrector import AutoCorrector
   >>> autoco = AutoCorrector()                    
   >>> autocorrectlist = autoco.load_wordlist("data/arabic.acl")
   >>> word=u"اذا"
   >>> autoco.autocorrect_by_wordlist(word, autocorrectlist)
   إذا
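
The punctuation and typo adjustments shown in the examples above can
be approximated with two regular expressions. This is a standalone
sketch under stated assumptions: ``tidy_punctuation`` and
``join_isolated_waw`` are hypothetical helpers written for
illustration, and the rules actually used by ``adjust_pounct`` and
``adjust_typo`` may differ.

.. code:: python

   import re

   # Whitespace that precedes a Latin or Arabic punctuation mark.
   SPACE_BEFORE_PUNCT = re.compile(r"\s+([:،.؛؟!])")

   # A standalone conjunction Waw followed by whitespace and another word.
   ISOLATED_WAW = re.compile(r"(?<!\S)و\s+(?=\S)")


   def tidy_punctuation(text):
       """Remove whitespace placed before punctuation marks."""
       return SPACE_BEFORE_PUNCT.sub(r"\1", text)


   def join_isolated_waw(text):
       """Attach a standalone Waw to the word that follows it."""
       return ISOLATED_WAW.sub("و", text)


   print(tidy_punctuation("قال : للصائم فرحتان : فرحة حين يفطر ، وفرحة حين يلقى ربه  ."))
   print(join_isolated_waw("اشتريت الخبز و الحليب و الخضر و قليلا من الفاكهة."))

On the two sample sentences from the examples, these helpers produce
the same strings as the outputs listed there.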



            
