docfilter


Namedocfilter JSON
Version 0.1.0 PyPI version JSON
download
home_pagehttps://github.com/DSDanielPark/docfilter
SummaryThe Python package docfilter is used to detect and remove inappropriate information from text.
upload_time2023-03-27 15:52:40
maintainer
docs_urlNone
authorparkminwoo
requires_python>=3.6
license
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Python Package: Filter
The Python package `filter` is used to detect and remove inappropriate information from text.

# Installation
```
pip install filter
```

# Features
1. `Regular expression filtering` This method is used to find and filter out specific string patterns, such as personal information like phone numbers or email addresses, or words and phrases related to profanity.
2. `Dictionary-based filtering` This method is used to filter out words that match with profanity or personal information already listed in a pre-built dictionary. While this method is quick to apply, it has the disadvantage of needing to be updated with new profanity or personal information.
3. `Machine learning-based filtering` This method uses a machine learning model to identify sentences related to profanity or personal information. While this method is more accurate than others, it requires a lot of time and resources to train the model.
4. `Special character removal` This method is used to remove special characters from strings that may contain personal information, such as phone numbers or email addresses. While this method is quick to apply, it may be difficult to handle exceptional cases.

One or a combination of the above methods can be used to implement an appropriate filtering method depending on the requirements of the application, and it should be validated in terms of performance and security.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/DSDanielPark/docfilter",
    "name": "docfilter",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.6",
    "maintainer_email": "",
    "keywords": "",
    "author": "parkminwoo",
    "author_email": "parkminwoo1991@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/ac/7a/8a67cc0852028fecfcc65f503c07aaf5b5882d3bcee3b9c453f61df499e9/docfilter-0.1.0.tar.gz",
    "platform": null,
    "description": "# Python Package: Filter\r\nThe Python package `filter` is used to detect and remove inappropriate information from text.\r\n\r\n# Installation\r\n```\r\npip install filter\r\n```\r\n\r\n# Features\r\n1. `Regular expression filtering` This method is used to find and filter out specific string patterns, such as personal information like phone numbers or email addresses, or words and phrases related to profanity.\r\n2. `Dictionary-based filtering` This method is used to filter out words that match with profanity or personal information already listed in a pre-built dictionary. While this method is quick to apply, it has the disadvantage of needing to be updated with new profanity or personal information.\r\n3. `Machine learning-based filtering` This method uses a machine learning model to identify sentences related to profanity or personal information. While this method is more accurate than others, it requires a lot of time and resources to train the model.\r\n4. `Special character removal` This method is used to remove special characters from strings that may contain personal information, such as phone numbers or email addresses. While this method is quick to apply, it may be difficult to handle exceptional cases.\r\n\r\nOne or a combination of the above methods can be used to implement an appropriate filtering method depending on the requirements of the application, and it should be validated in terms of performance and security.\r\n",
    "bugtrack_url": null,
    "license": "",
    "summary": "The Python package docfilter is used to detect and remove inappropriate information from text.",
    "version": "0.1.0",
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "d0170c3a307f9be21e4069cd8f361cb441ab5614ccb86862d6fc5195084a9472",
                "md5": "de702cc58d46b0e469e7f8b02f70fce8",
                "sha256": "2d8062e832233ace6c69c66f2e91e3b2c610bc69531327986187f22d9d384d18"
            },
            "downloads": -1,
            "filename": "docfilter-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "de702cc58d46b0e469e7f8b02f70fce8",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.6",
            "size": 3170,
            "upload_time": "2023-03-27T15:52:34",
            "upload_time_iso_8601": "2023-03-27T15:52:34.502010Z",
            "url": "https://files.pythonhosted.org/packages/d0/17/0c3a307f9be21e4069cd8f361cb441ab5614ccb86862d6fc5195084a9472/docfilter-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ac7a8a67cc0852028fecfcc65f503c07aaf5b5882d3bcee3b9c453f61df499e9",
                "md5": "db58eb887cd664eba226da69a146f5ed",
                "sha256": "543394a2180a1f5154bb4d2a2774141ee79b014211b58b90b6c4ec361a7f3b8e"
            },
            "downloads": -1,
            "filename": "docfilter-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "db58eb887cd664eba226da69a146f5ed",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.6",
            "size": 2406,
            "upload_time": "2023-03-27T15:52:40",
            "upload_time_iso_8601": "2023-03-27T15:52:40.122777Z",
            "url": "https://files.pythonhosted.org/packages/ac/7a/8a67cc0852028fecfcc65f503c07aaf5b5882d3bcee3b9c453f61df499e9/docfilter-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-03-27 15:52:40",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "github_user": "DSDanielPark",
    "github_project": "docfilter",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "docfilter"
}
        
Elapsed time: 0.20942s