PureFlow


NamePureFlow JSON
Version 1.0.1 PyPI version JSON
download
home_pagehttps://github.com/iMoHd8/PureFlow
SummaryData Cleaning Tool
upload_time2023-06-26 22:58:22
maintainer
docs_urlNone
authorMohammed Mahameed
requires_python
licenseMIT
keywords pureflow data datacleaning cleaning machine learning data cleaning data science data process
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # PureFlow - README

This tool is designed to clean datasets by removing null values, special characters, links, hashtags, and mentions from specific columns. It also performs various text cleaning operations to preprocess the data.

## Usage:
1. Prepare your dataset in a supported format (CSV, Excel, etc.).
2. Place the dataset file in the same directory as this cleaning tool.
3. Open a command prompt or terminal.
4. Navigate to the directory containing the cleaning tool and dataset file.
5. Install the cleaning tool using the this commands:
   pip install PureFlow
6. Import The PureFlow Package and start using It:
   import PureFlow as pf
   pf.clean_dataset()
   pf.remove_nulls()

## Parameters:
- `columns_name`: Provide the names of the columns to clean. Use ['all'] to clean all columns or ['column1', 'column2'] to clean specific columns.
- `remove_nan` (Optional): Specify if null values should be removed. Default is False.
- `save_it_as_csv` (Optional): Specify if the cleaned dataset should be saved as a new CSV file. Default is True.

## Output:
- The cleaned dataset will be displayed on the console.
- If `save_it_as_csv` is True, the cleaned dataset will be saved as "Cleaned_Dataset.csv" in the same directory.

## Note:
- Make sure you have Python and the necessary dependencies (Pandas) installed.
- Ensure that your dataset file is not open or locked by any other program during the cleaning process.

## Example Usage:
pip install PureFlow
import PureFlow as pf
pf.clean_dataset('dataset name', ['columns name'])
Cleaning in progress...
Cleaned dataset:
    ...
    [Display a sample of cleaned data here]
    ...
Cleaning completed successfully. The cleaned dataset has been saved as "Cleaned_Dataset.csv".

For any issues or inquiries, please contact me:

E-Mail: iMoHd8@hotmail.com
LinkedIn: https://www.linkedin.com/in/mohammed-mahameed
GitHub: https://github.com/iMoHd8
Instagram: https://www.instagram.com/i.mohd.8/

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/iMoHd8/PureFlow",
    "name": "PureFlow",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "PureFlow,Data,DataCleaning,Cleaning,Machine Learning,Data Cleaning,data science,Data Process",
    "author": "Mohammed Mahameed",
    "author_email": "iMoHd8@hotmail.com",
    "download_url": "https://files.pythonhosted.org/packages/e0/0e/6fe7bbf26d6f3c71417429199ca1540abd833159f7d65e59abd383c96ae9/PureFlow-1.0.1.tar.gz",
    "platform": null,
    "description": "# PureFlow - README\r\n\r\nThis tool is designed to clean datasets by removing null values, special characters, links, hashtags, and mentions from specific columns. It also performs various text cleaning operations to preprocess the data.\r\n\r\n## Usage:\r\n1. Prepare your dataset in a supported format (CSV, Excel, etc.).\r\n2. Place the dataset file in the same directory as this cleaning tool.\r\n3. Open a command prompt or terminal.\r\n4. Navigate to the directory containing the cleaning tool and dataset file.\r\n5. Install the cleaning tool using the this commands:\r\n   pip install PureFlow\r\n6. Import The PureFlow Package and start using It:\r\n   import PureFlow as pf\r\n   pf.clean_dataset()\r\n   pf.remove_nulls()\r\n\r\n## Parameters:\r\n- `columns_name`: Provide the names of the columns to clean. Use ['all'] to clean all columns or ['column1', 'column2'] to clean specific columns.\r\n- `remove_nan` (Optional): Specify if null values should be removed. Default is False.\r\n- `save_it_as_csv` (Optional): Specify if the cleaned dataset should be saved as a new CSV file. Default is True.\r\n\r\n## Output:\r\n- The cleaned dataset will be displayed on the console.\r\n- If `save_it_as_csv` is True, the cleaned dataset will be saved as \"Cleaned_Dataset.csv\" in the same directory.\r\n\r\n## Note:\r\n- Make sure you have Python and the necessary dependencies (Pandas) installed.\r\n- Ensure that your dataset file is not open or locked by any other program during the cleaning process.\r\n\r\n## Example Usage:\r\npip install PureFlow\r\nimport PureFlow as pf\r\npf.clean_dataset('dataset name', ['columns name'])\r\nCleaning in progress...\r\nCleaned dataset:\r\n    ...\r\n    [Display a sample of cleaned data here]\r\n    ...\r\nCleaning completed successfully. The cleaned dataset has been saved as \"Cleaned_Dataset.csv\".\r\n\r\nFor any issues or inquiries, please contact me:\r\n\r\nE-Mail: iMoHd8@hotmail.com\r\nLinkedIn: https://www.linkedin.com/in/mohammed-mahameed\r\nGitHub: https://github.com/iMoHd8\r\nInstagram: https://www.instagram.com/i.mohd.8/\r\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Data Cleaning Tool",
    "version": "1.0.1",
    "project_urls": {
        "Homepage": "https://github.com/iMoHd8/PureFlow"
    },
    "split_keywords": [
        "pureflow",
        "data",
        "datacleaning",
        "cleaning",
        "machine learning",
        "data cleaning",
        "data science",
        "data process"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "604addfe145894949d3371ab696badef14fe34b467802c1bebdf56579931aa88",
                "md5": "974244837c78222eb0258ef7c242a39f",
                "sha256": "b4653ee6acbca67a0834f809910c64dcfd3e6b856cc79d633bcfaf874dde3ca8"
            },
            "downloads": -1,
            "filename": "PureFlow-1.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "974244837c78222eb0258ef7c242a39f",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 4835,
            "upload_time": "2023-06-26T22:58:20",
            "upload_time_iso_8601": "2023-06-26T22:58:20.314500Z",
            "url": "https://files.pythonhosted.org/packages/60/4a/ddfe145894949d3371ab696badef14fe34b467802c1bebdf56579931aa88/PureFlow-1.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e00e6fe7bbf26d6f3c71417429199ca1540abd833159f7d65e59abd383c96ae9",
                "md5": "6dcd213f83a66465812e6315925d21e6",
                "sha256": "f8aba5501aafabc0c82f1383b9968881c44207ebe4d1000a3b2998d09e359b59"
            },
            "downloads": -1,
            "filename": "PureFlow-1.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "6dcd213f83a66465812e6315925d21e6",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 4547,
            "upload_time": "2023-06-26T22:58:22",
            "upload_time_iso_8601": "2023-06-26T22:58:22.201535Z",
            "url": "https://files.pythonhosted.org/packages/e0/0e/6fe7bbf26d6f3c71417429199ca1540abd833159f7d65e59abd383c96ae9/PureFlow-1.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-06-26 22:58:22",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "iMoHd8",
    "github_project": "PureFlow",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "pureflow"
}
        
Elapsed time: 0.12840s