.. NOTES FOR CREATING A RELEASE:
..
.. * bump the version number in scrubadub/__init__.py
.. * update docs/changelog.rst
.. * git push
.. * create a release https://github.com/LeapBeyond/scrubadub/releases
.. * This should trigger a github action to upload to pypi
.. * ReadTheDocs.io should see any changes and also rebuild the docs
*********
scrubadub
*********
Remove personally identifiable information from free text. Sometimes we have
additional metadata about the people we wish to anonymize. Other times we don't.
This package makes it easy to seamlessly scrub personal information from free
text, without compromising the privacy of the people we are trying to protect.
``scrubadub`` currently supports removing:
* Names
* Email addresses
* Addresses/Postal codes (US, GB, CA)
* Credit card numbers
* Dates of birth
* URLs
* Phone numbers
* Username and password combinations
* Skype/Twitter usernames
* Social security numbers (US and GB national insurance numbers)
* Tax numbers (GB)
* Driving licence numbers (GB)
.. image:: https://img.shields.io/github/workflow/status/LeapBeyond/scrubadub/Python%20package/master
    :target: https://github.com/LeapBeyond/scrubadub/actions?query=workflow%3A%22Python+package%22+branch%3Amaster
    :alt: Build Status
.. image:: https://img.shields.io/pypi/v/scrubadub.svg
    :target: https://pypi.org/project/scrubadub/
    :alt: Version
.. image:: https://img.shields.io/pypi/dm/scrubadub.svg
    :target: https://pypi.org/project/scrubadub/
    :alt: Downloads
.. image:: https://coveralls.io/repos/github/LeapBeyond/scrubadub/badge.svg?branch=master
    :target: https://coveralls.io/r/LeapBeyond/scrubadub
    :alt: Test Coverage
.. image:: https://readthedocs.org/projects/scrubadub/badge/?version=latest
    :target: https://readthedocs.org/projects/scrubadub/?badge=latest
    :alt: Documentation Status
Quick start
-----------
Getting started with ``scrubadub`` is as easy as running ``pip install scrubadub`` and
incorporating it into your Python scripts like this:
.. code:: pycon

    >>> import scrubadub

    # My cat may be more tech-savvy than most, but he doesn't want other people to know it.
    >>> text = "My cat can be contacted on example@example.com, or 1800 555-5555"

    # Replaces the phone number and email address with anonymous IDs.
    >>> scrubadub.clean(text)
    'My cat can be contacted on {{EMAIL}}, or {{PHONE}}'

There are many ways to tailor the behavior of ``scrubadub`` using
`different Detectors and PostProcessors <https://scrubadub.readthedocs.io/en/stable/usage.html>`_.
Scrubadub is highly configurable and supports localisation for different languages and regions.
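
To make the placeholder output in the quick start example more concrete, here is a minimal, self-contained sketch of pattern-based scrubbing using only the standard library. It is not how ``scrubadub`` is implemented: the ``PATTERNS`` table and the ``scrub`` helper are hypothetical names introduced for illustration, and the two regular expressions are deliberately simple and will miss many real-world formats.

```python
import re

# Hypothetical, simplified patterns for illustration only.
# Real detectors need to handle far more formats and locales.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "PHONE": re.compile(r"\b\d{3,4}[ -]\d{3}[ -]\d{4}\b"),
}


def scrub(text):
    """Replace every match of each pattern with a {{LABEL}} placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub("{{" + label + "}}", text)
    return text


cleaned = scrub("My cat can be contacted on example@example.com, or 1800 555-5555")
print(cleaned)
```

Keeping the patterns in a dictionary keyed by label loosely mirrors the idea of pluggable detectors: supporting a new kind of information means adding one entry rather than changing the replacement loop.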
Installation
------------
To install scrubadub using pip, simply type::

    pip install scrubadub

There are several other packages that can optionally be installed to enable extra detectors.
These are `scrubadub_address <https://github.com/LeapBeyond/scrubadub_address>`_, `scrubadub_spacy <https://github.com/LeapBeyond/scrubadub_spacy>`_ and `scrubadub_stanford <https://github.com/LeapBeyond/scrubadub_stanford>`_. Because they require additional dependencies, see the relevant documentation (`address detector documentation <https://scrubadub.readthedocs.io/en/latest/addresses.html>`_ and `name detector documentation <https://scrubadub.readthedocs.io/en/latest/names.html>`_) for more information.
This package requires Python 3.6 or later.
For Python 2.7 or 3.5, use v1.2.2, which is the last version that supports them.
New maintainers
---------------
`LeapBeyond <http://leapbeyond.ai/>`_ are excited to be supporting scrubadub with ongoing maintenance and development.
Thanks to all of the contributors who made this package a success, but especially `@deanmalmgren <https://github.com/deanmalmgren>`_, `IDEO <https://www.ideo.com/>`_ and `Datascope <https://datascopeanalytics.com/>`_.
Raw data
{
"_id": null,
"home_page": "https://github.com/LeapBeyond/scrubadub",
"name": "scrubadub",
"maintainer": "",
"docs_url": null,
"requires_python": "",
"maintainer_email": "",
"keywords": "",
"author": "Dean Malmgren",
"author_email": "dean.malmgren@datascopeanalytics.com",
"download_url": "https://files.pythonhosted.org/packages/6f/24/f56c1b27689eff1809791b37660a9b1687ddfb157c0e380114245d67af1b/scrubadub-2.0.1.tar.gz",
"platform": null,
"bugtrack_url": null,
"license": "MIT",
"summary": "Clean personally identifiable information from dirty dirty text.",
"version": "2.0.1",
"project_urls": {
"Download": "https://github.com/LeapBeyond/scrubadub/archives/master",
"Homepage": "https://github.com/LeapBeyond/scrubadub"
},
"split_keywords": [],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "f4c504b959566c85914b17327e40d25b0535b0209a5a5216006443b769bebe25",
"md5": "4642d4a1cb79d070134f516d93a0efba",
"sha256": "44b9004998a03aff4c6b5d9073a52895081742f994470083a7be610b373e62b7"
},
"downloads": -1,
"filename": "scrubadub-2.0.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "4642d4a1cb79d070134f516d93a0efba",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 65152,
"upload_time": "2023-09-01T14:50:25",
"upload_time_iso_8601": "2023-09-01T14:50:25.318561Z",
"url": "https://files.pythonhosted.org/packages/f4/c5/04b959566c85914b17327e40d25b0535b0209a5a5216006443b769bebe25/scrubadub-2.0.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "6f24f56c1b27689eff1809791b37660a9b1687ddfb157c0e380114245d67af1b",
"md5": "67c901dc153479682a39a3f415b97275",
"sha256": "52a1fb8aa9bc0226043e02c3ec22d450bd4ebeede9e7e8db2def7c89b37c5aad"
},
"downloads": -1,
"filename": "scrubadub-2.0.1.tar.gz",
"has_sig": false,
"md5_digest": "67c901dc153479682a39a3f415b97275",
"packagetype": "sdist",
"python_version": "source",
"requires_python": null,
"size": 46599,
"upload_time": "2023-09-01T14:50:26",
"upload_time_iso_8601": "2023-09-01T14:50:26.964137Z",
"url": "https://files.pythonhosted.org/packages/6f/24/f56c1b27689eff1809791b37660a9b1687ddfb157c0e380114245d67af1b/scrubadub-2.0.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-09-01 14:50:26",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "LeapBeyond",
"github_project": "scrubadub",
"travis_ci": false,
"coveralls": true,
"github_actions": true,
"tox": true,
"lcname": "scrubadub"
}