elasticsearch-pii-redacter


Nameelasticsearch-pii-redacter JSON
Version 1.10.1 PyPI version JSON
download
home_pageNone
SummaryRedacting field data from your Elasticsearch indices and Searchable Snapshots
upload_time2024-03-26 20:36:37
maintainerNone
docs_urlNone
authorNone
requires_python>=3.8
licenseApache-2.0
keywords elasticsearch index pii redact
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            .. _readme:

elastic-pii-redacter
====================

Did you find PII (Personally Identifiable Information) in your Elasticsearch
indices that doesn't belong there? This is the tool for you!

The elastic-pii-redacter can help you redact information from even Searchable
Snapshot mounted indices. It works with deeply nested fields, too!


Client Configuration
--------------------

The tool connects using the ``es_client`` Python module.

You can use command-line options, or a YAML configuration file to configure the client connection.
If using a configuration file is desired, the configuration file structure requires
``elasticsearch`` at the root level as follows::

    ---
    elasticsearch:
      client:
        hosts: https://10.11.12.13:9200
        cloud_id:
        bearer_auth:
        opaque_id:
        request_timeout: 60
        http_compress:
        verify_certs:
        ca_certs:
        client_cert:
        client_key:
        ssl_assert_hostname:
        ssl_assert_fingerprint:
        ssl_version:
      other_settings:
        master_only:
        skip_version_test:
        username:
        password:
        api_key:
          id:
          api_key:
          token:

    logging:
      loglevel: INFO
      logfile: /path/to/file.log
      logformat: default
      blacklist: []


`REDACTIONS_FILE` Configuration
-------------------------------

::

  ---
  redactions:
    - job_name_20230731_redact_hot:
        pattern: hot-*
        query: {'match': {'message': 'message1'}}
        fields: ['message']
        message: REDACTED
        expected_docs: 1
        restore_settings: {'index.routing.allocation.include._tier_preference': 'data_warm,data_hot,data_content'}
    - job_name_20230731_redact_cold:
        pattern: restored-cold-*
        query: {'match': {'nested.key': 'nested19'}}
        fields: ['nested.key']
        message: REDACTED
        expected_docs: 1
        restore_settings: {'index.routing.allocation.include._tier_preference': 'data_warm,data_hot,data_content'}
        forcemerge:
          max_num_segments: 1
    - job_name_20230731_redact_frozen:
        pattern: partial-frozen-*
        query: {'range': {'number': {'gte': 8, 'lte': 11}}}
        fields: ['deep.l1.l2.l3']
        message: REDACTED
        expected_docs: 4
        forcemerge:
          only_expunge_deletes: True

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "elasticsearch-pii-redacter",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.8",
    "maintainer_email": null,
    "keywords": "elasticsearch, index, pii, redact",
    "author": null,
    "author_email": "Elastic <info@elastic.co>",
    "download_url": "https://files.pythonhosted.org/packages/0f/43/0e097735fcff66b42b9d415ca5b81bf4130748c5893edf641511fe870ce6/elasticsearch_pii_redacter-1.10.1.tar.gz",
    "platform": null,
    "description": ".. _readme:\n\nelastic-pii-redacter\n====================\n\nDid you find PII (Personally Identifiable Information) in your Elasticsearch\nindices that doesn't belong there? This is the tool for you!\n\nThe elastic-pii-redacter can help you redact information from even Searchable\nSnapshot mounted indices. It works with deeply nested fields, too!\n\n\nClient Configuration\n--------------------\n\nThe tool connects using the ``es_client`` Python module.\n\nYou can use command-line options, or a YAML configuration file to configure the client connection.\nIf using a configuration file is desired, the configuration file structure requires\n``elasticsearch`` at the root level as follows::\n\n    ---\n    elasticsearch:\n      client:\n        hosts: https://10.11.12.13:9200\n        cloud_id:\n        bearer_auth:\n        opaque_id:\n        request_timeout: 60\n        http_compress:\n        verify_certs:\n        ca_certs:\n        client_cert:\n        client_key:\n        ssl_assert_hostname:\n        ssl_assert_fingerprint:\n        ssl_version:\n      other_settings:\n        master_only:\n        skip_version_test:\n        username:\n        password:\n        api_key:\n          id:\n          api_key:\n          token:\n\n    logging:\n      loglevel: INFO\n      logfile: /path/to/file.log\n      logformat: default\n      blacklist: []\n\n\n`REDACTIONS_FILE` Configuration\n-------------------------------\n\n::\n\n  ---\n  redactions:\n    - job_name_20230731_redact_hot:\n        pattern: hot-*\n        query: {'match': {'message': 'message1'}}\n        fields: ['message']\n        message: REDACTED\n        expected_docs: 1\n        restore_settings: {'index.routing.allocation.include._tier_preference': 'data_warm,data_hot,data_content'}\n    - job_name_20230731_redact_cold:\n        pattern: restored-cold-*\n        query: {'match': {'nested.key': 'nested19'}}\n        fields: ['nested.key']\n        message: REDACTED\n        expected_docs: 1\n        restore_settings: {'index.routing.allocation.include._tier_preference': 'data_warm,data_hot,data_content'}\n        forcemerge:\n          max_num_segments: 1\n    - job_name_20230731_redact_frozen:\n        pattern: partial-frozen-*\n        query: {'range': {'number': {'gte': 8, 'lte': 11}}}\n        fields: ['deep.l1.l2.l3']\n        message: REDACTED\n        expected_docs: 4\n        forcemerge:\n          only_expunge_deletes: True\n",
    "bugtrack_url": null,
    "license": "Apache-2.0",
    "summary": "Redacting field data from your Elasticsearch indices and Searchable Snapshots",
    "version": "1.10.1",
    "project_urls": {
        "Bug Tracker": "https://github.com/elastic/elastic-pii-redacter/issues",
        "Homepage": "https://github.com/elastic/elastic-pii-redacter"
    },
    "split_keywords": [
        "elasticsearch",
        " index",
        " pii",
        " redact"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "1936c9c7df9934cfd0c0c1183f1cf7d439014d81fb3f489ab46ae2b8b75b34dd",
                "md5": "46b3bd27c15b0c6642126898cfc5dbaa",
                "sha256": "f8b983f4086bb1aec0a18b5729d581a801744920879ebbfc25d3b569bebd58fc"
            },
            "downloads": -1,
            "filename": "elasticsearch_pii_redacter-1.10.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "46b3bd27c15b0c6642126898cfc5dbaa",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8",
            "size": 29479,
            "upload_time": "2024-03-26T20:36:35",
            "upload_time_iso_8601": "2024-03-26T20:36:35.290343Z",
            "url": "https://files.pythonhosted.org/packages/19/36/c9c7df9934cfd0c0c1183f1cf7d439014d81fb3f489ab46ae2b8b75b34dd/elasticsearch_pii_redacter-1.10.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "0f430e097735fcff66b42b9d415ca5b81bf4130748c5893edf641511fe870ce6",
                "md5": "0e6dba900a39a47e9dc7b102e4093312",
                "sha256": "b2abb2e685d39c5dc53cf9dfa54eb6f5a12d31ca3322efe4be8c2195a21b5dc7"
            },
            "downloads": -1,
            "filename": "elasticsearch_pii_redacter-1.10.1.tar.gz",
            "has_sig": false,
            "md5_digest": "0e6dba900a39a47e9dc7b102e4093312",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8",
            "size": 24628,
            "upload_time": "2024-03-26T20:36:37",
            "upload_time_iso_8601": "2024-03-26T20:36:37.346921Z",
            "url": "https://files.pythonhosted.org/packages/0f/43/0e097735fcff66b42b9d415ca5b81bf4130748c5893edf641511fe870ce6/elasticsearch_pii_redacter-1.10.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-03-26 20:36:37",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "elastic",
    "github_project": "elastic-pii-redacter",
    "github_not_found": true,
    "lcname": "elasticsearch-pii-redacter"
}
        
Elapsed time: 3.05193s