EDAeasy


NameEDAeasy JSON
Version 1.0.1 PyPI version JSON
download
home_page
SummaryFunctions and tools for making Exploratory Data Analysis easy!
upload_time2023-08-31 02:20:12
maintainer
docs_urlNone
authorFrancisco Jesus Ocazionez Cardozo
requires_python>=3.8,<4.0
licenseMIT
keywords exploratory analysis eda
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # EDAeasy 😀
The package for quick exploratory data analysis


## Instalation 

`pip install EDAeasy`

## Usage
The **dataframe_summary** function have relative simple summary of the columns of your dataframe
for quick look at tabular data

    Generate a summary DataFrame of the input DataFrame 'dataframe'.

    Parameters
    ----------
    dataframe : pandas.DataFrame
        The input DataFrame for which the summary needs to be generated.

    Returns
    -------
    pandas.DataFrame
        A DataFrame containing summary information for each column in 'df':
        - Type: Data type of the column.
        - Min: Minimum value in the column.
        - Max: Maximum value in the column.
        - Nan %: Percentage of NaN values in the column.
        - # Unique Values: Total number of unique values in the column.
        - Unique values: List of unique values in the column.

    Example
    -------
    >>> data = {
            'age': ['[40-50)', '[60-70)', '[70-80)'],
            'time_in_hospital': [8, 3, 5],
            'n_lab_procedures': [72, 34, 45],
            ...
        }
    >>> dataframe = pd.DataFrame(data)
    >>> result = dataframe_summary(df)
    >>> print(result)
               Type       Min        Max  Nan %  # Unique Values                                  Unique values
    Variables                                                                                                              
    age       object   [40-50)    [90-100)    0.0        3      ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...
    time_in_hospital  int64    1           14    0.0        3        [8, 3, 5]
    n_lab_procedures  int64    1          113    0.0        3        [72, 34, 45]
    ...

    Note
    ----
    The function uses vectorized operations to improve performance and memory usage.

            

Raw data

            {
    "_id": null,
    "home_page": "",
    "name": "EDAeasy",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.8,<4.0",
    "maintainer_email": "",
    "keywords": "Exploratory Analysis,EDA",
    "author": "Francisco Jesus Ocazionez Cardozo",
    "author_email": "pach812@gmail.com",
    "download_url": "https://files.pythonhosted.org/packages/ca/11/4bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8/edaeasy-1.0.1.tar.gz",
    "platform": null,
    "description": "# EDAeasy \ud83d\ude00\nThe package for quick exploratory data analysis\n\n\n## Instalation \n\n`pip install EDAeasy`\n\n## Usage\nThe **dataframe_summary** function have relative simple summary of the columns of your dataframe\nfor quick look at tabular data\n\n    Generate a summary DataFrame of the input DataFrame 'dataframe'.\n\n    Parameters\n    ----------\n    dataframe : pandas.DataFrame\n        The input DataFrame for which the summary needs to be generated.\n\n    Returns\n    -------\n    pandas.DataFrame\n        A DataFrame containing summary information for each column in 'df':\n        - Type: Data type of the column.\n        - Min: Minimum value in the column.\n        - Max: Maximum value in the column.\n        - Nan %: Percentage of NaN values in the column.\n        - # Unique Values: Total number of unique values in the column.\n        - Unique values: List of unique values in the column.\n\n    Example\n    -------\n    >>> data = {\n            'age': ['[40-50)', '[60-70)', '[70-80)'],\n            'time_in_hospital': [8, 3, 5],\n            'n_lab_procedures': [72, 34, 45],\n            ...\n        }\n    >>> dataframe = pd.DataFrame(data)\n    >>> result = dataframe_summary(df)\n    >>> print(result)\n               Type       Min        Max  Nan %  # Unique Values                                  Unique values\n    Variables                                                                                                              \n    age       object   [40-50)    [90-100)    0.0        3      ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...\n    time_in_hospital  int64    1           14    0.0        3        [8, 3, 5]\n    n_lab_procedures  int64    1          113    0.0        3        [72, 34, 45]\n    ...\n\n    Note\n    ----\n    The function uses vectorized operations to improve performance and memory usage.\n",
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "Functions and tools for making Exploratory Data Analysis easy!",
    "version": "1.0.1",
    "project_urls": null,
    "split_keywords": [
        "exploratory analysis",
        "eda"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "c2d1567410cb689218d6c87733bf49f1a41c0c75d52862e94a2147e3f523d70e",
                "md5": "266b357f8e2e32c9106b387e6e6607f6",
                "sha256": "e7d68ee856db602858c9d61ec89901eb874ad77e259c7676c71f955dd7752564"
            },
            "downloads": -1,
            "filename": "edaeasy-1.0.1-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "266b357f8e2e32c9106b387e6e6607f6",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8,<4.0",
            "size": 4103,
            "upload_time": "2023-08-31T02:20:11",
            "upload_time_iso_8601": "2023-08-31T02:20:11.134857Z",
            "url": "https://files.pythonhosted.org/packages/c2/d1/567410cb689218d6c87733bf49f1a41c0c75d52862e94a2147e3f523d70e/edaeasy-1.0.1-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "ca114bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8",
                "md5": "2ce80b09c6df2d83bf98b600d178cee4",
                "sha256": "a79e0981753bf498e39f1697af43fc7bc668dbbcb15186cd6f441b9355fd6e86"
            },
            "downloads": -1,
            "filename": "edaeasy-1.0.1.tar.gz",
            "has_sig": false,
            "md5_digest": "2ce80b09c6df2d83bf98b600d178cee4",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8,<4.0",
            "size": 2967,
            "upload_time": "2023-08-31T02:20:12",
            "upload_time_iso_8601": "2023-08-31T02:20:12.647901Z",
            "url": "https://files.pythonhosted.org/packages/ca/11/4bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8/edaeasy-1.0.1.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-08-31 02:20:12",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "lcname": "edaeasy"
}
        
Elapsed time: 0.10849s