LeEncoderML


NameLeEncoderML JSON
Version 0.1.2 PyPI version JSON
download
home_pagehttps://github.com/varunvinodh/Functions/tree/main/Encoder
SummaryPython script for managing categorical data in machine learning, ensuring proper encoding and handling of unseen labels.
upload_time2023-11-14 10:43:11
maintainer
docs_urlNone
authorVarun Vinodh
requires_python
licenseMIT
keywords encoder
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # Categorical Data Encoding for Machine Learning:
This repository contains a Python script for managing categorical data in preparation for machine learning tasks. It includes functions for encoding categorical data using LabelEncoder, decoding the encoded data, and handling unseen labels in the data. These functions help ensure that categorical data is properly processed and encoded for machine learning models.

## Key Features:

**text_to_numbers:** Encode categorical columns using LabelEncoder and save the encoders for later use.

**re_encode:** Re-encode categorical columns, handling unseen labels and maintaining consistency. Remove rows with unseen labels in a specific categorical column and re-encode the data.

**numbers_to_text:** Decode previously encoded categorical columns to their original values.

## Usage:
You can use these functions to preprocess and manage categorical data in your machine learning projects. They simplify the procedure of storing encoder files for individual datasets, allowing you to reuse them in the future. The script is designed to make it easier to work with categorical data, especially when dealing with unseen labels and encoding consistency.

## For more understanding reffer to the docstrings for detailed understanding on each function.
 

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/varunvinodh/Functions/tree/main/Encoder",
    "name": "LeEncoderML",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "",
    "maintainer_email": "",
    "keywords": "encoder",
    "author": "Varun Vinodh",
    "author_email": "varunvinodh25@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/33/7a/f852443c1e88fce2e9f571a7c660bb89f3ec08c4cd2a3fe756efc03abd52/LeEncoderML-0.1.2.tar.gz",
    "platform": null,
    "description": "# Categorical Data Encoding for Machine Learning:\r\nThis repository contains a Python script for managing categorical data in preparation for machine learning tasks. It includes functions for encoding categorical data using LabelEncoder, decoding the encoded data, and handling unseen labels in the data. These functions help ensure that categorical data is properly processed and encoded for machine learning models.\r\n\r\n## Key Features:\r\n\r\n**text_to_numbers:** Encode categorical columns using LabelEncoder and save the encoders for later use.\r\n\r\n**re_encode:** Re-encode categorical columns, handling unseen labels and maintaining consistency. Remove rows with unseen labels in a specific categorical column and re-encode the data.\r\n\r\n**numbers_to_text:** Decode previously encoded categorical columns to their original values.\r\n\r\n## Usage:\r\nYou can use these functions to preprocess and manage categorical data in your machine learning projects. They simplify the procedure of storing encoder files for individual datasets, allowing you to reuse them in the future. The script is designed to make it easier to work with categorical data, especially when dealing with unseen labels and encoding consistency.\r\n\r\n## For more understanding reffer to the docstrings for detailed understanding on each function.\r\n \r\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Python script for managing categorical data in machine learning, ensuring proper encoding and handling of unseen labels.",
    "version": "0.1.2",
    "project_urls": {
        "Homepage": "https://github.com/varunvinodh/Functions/tree/main/Encoder"
    },
    "split_keywords": [
        "encoder"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "337af852443c1e88fce2e9f571a7c660bb89f3ec08c4cd2a3fe756efc03abd52",
                "md5": "a2c2b9fb4611f1750ab06c238a21859a",
                "sha256": "aaa3e28f0a6b91ea49ac05ca1053ddf1798074d8ecd3091a7beeb13d0bb0e5c0"
            },
            "downloads": -1,
            "filename": "LeEncoderML-0.1.2.tar.gz",
            "has_sig": false,
            "md5_digest": "a2c2b9fb4611f1750ab06c238a21859a",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": null,
            "size": 3502,
            "upload_time": "2023-11-14T10:43:11",
            "upload_time_iso_8601": "2023-11-14T10:43:11.071111Z",
            "url": "https://files.pythonhosted.org/packages/33/7a/f852443c1e88fce2e9f571a7c660bb89f3ec08c4cd2a3fe756efc03abd52/LeEncoderML-0.1.2.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-11-14 10:43:11",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "varunvinodh",
    "github_project": "Functions",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "lcname": "leencoderml"
}
        
Elapsed time: 0.13870s