# EDAeasy 😀
The package for quick exploratory data analysis
## Instalation
`pip install EDAeasy`
## Usage
The **dataframe_summary** function have relative simple summary of the columns of your dataframe
for quick look at tabular data
Generate a summary DataFrame of the input DataFrame 'dataframe'.
Parameters
----------
dataframe : pandas.DataFrame
The input DataFrame for which the summary needs to be generated.
Returns
-------
pandas.DataFrame
A DataFrame containing summary information for each column in 'df':
- Type: Data type of the column.
- Min: Minimum value in the column.
- Max: Maximum value in the column.
- Nan %: Percentage of NaN values in the column.
- # Unique Values: Total number of unique values in the column.
- Unique values: List of unique values in the column.
Example
-------
>>> data = {
'age': ['[40-50)', '[60-70)', '[70-80)'],
'time_in_hospital': [8, 3, 5],
'n_lab_procedures': [72, 34, 45],
...
}
>>> dataframe = pd.DataFrame(data)
>>> result = dataframe_summary(df)
>>> print(result)
Type Min Max Nan % # Unique Values Unique values
Variables
age object [40-50) [90-100) 0.0 3 ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...
time_in_hospital int64 1 14 0.0 3 [8, 3, 5]
n_lab_procedures int64 1 113 0.0 3 [72, 34, 45]
...
Note
----
The function uses vectorized operations to improve performance and memory usage.
Raw data
{
"_id": null,
"home_page": "",
"name": "EDAeasy",
"maintainer": "",
"docs_url": null,
"requires_python": ">=3.8,<4.0",
"maintainer_email": "",
"keywords": "Exploratory Analysis,EDA",
"author": "Francisco Jesus Ocazionez Cardozo",
"author_email": "pach812@gmail.com",
"download_url": "https://files.pythonhosted.org/packages/ca/11/4bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8/edaeasy-1.0.1.tar.gz",
"platform": null,
"description": "# EDAeasy \ud83d\ude00\nThe package for quick exploratory data analysis\n\n\n## Instalation \n\n`pip install EDAeasy`\n\n## Usage\nThe **dataframe_summary** function have relative simple summary of the columns of your dataframe\nfor quick look at tabular data\n\n Generate a summary DataFrame of the input DataFrame 'dataframe'.\n\n Parameters\n ----------\n dataframe : pandas.DataFrame\n The input DataFrame for which the summary needs to be generated.\n\n Returns\n -------\n pandas.DataFrame\n A DataFrame containing summary information for each column in 'df':\n - Type: Data type of the column.\n - Min: Minimum value in the column.\n - Max: Maximum value in the column.\n - Nan %: Percentage of NaN values in the column.\n - # Unique Values: Total number of unique values in the column.\n - Unique values: List of unique values in the column.\n\n Example\n -------\n >>> data = {\n 'age': ['[40-50)', '[60-70)', '[70-80)'],\n 'time_in_hospital': [8, 3, 5],\n 'n_lab_procedures': [72, 34, 45],\n ...\n }\n >>> dataframe = pd.DataFrame(data)\n >>> result = dataframe_summary(df)\n >>> print(result)\n Type Min Max Nan % # Unique Values Unique values\n Variables \n age object [40-50) [90-100) 0.0 3 ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...\n time_in_hospital int64 1 14 0.0 3 [8, 3, 5]\n n_lab_procedures int64 1 113 0.0 3 [72, 34, 45]\n ...\n\n Note\n ----\n The function uses vectorized operations to improve performance and memory usage.\n",
"bugtrack_url": null,
"license": "MIT",
"summary": "Functions and tools for making Exploratory Data Analysis easy!",
"version": "1.0.1",
"project_urls": null,
"split_keywords": [
"exploratory analysis",
"eda"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "c2d1567410cb689218d6c87733bf49f1a41c0c75d52862e94a2147e3f523d70e",
"md5": "266b357f8e2e32c9106b387e6e6607f6",
"sha256": "e7d68ee856db602858c9d61ec89901eb874ad77e259c7676c71f955dd7752564"
},
"downloads": -1,
"filename": "edaeasy-1.0.1-py3-none-any.whl",
"has_sig": false,
"md5_digest": "266b357f8e2e32c9106b387e6e6607f6",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.8,<4.0",
"size": 4103,
"upload_time": "2023-08-31T02:20:11",
"upload_time_iso_8601": "2023-08-31T02:20:11.134857Z",
"url": "https://files.pythonhosted.org/packages/c2/d1/567410cb689218d6c87733bf49f1a41c0c75d52862e94a2147e3f523d70e/edaeasy-1.0.1-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "ca114bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8",
"md5": "2ce80b09c6df2d83bf98b600d178cee4",
"sha256": "a79e0981753bf498e39f1697af43fc7bc668dbbcb15186cd6f441b9355fd6e86"
},
"downloads": -1,
"filename": "edaeasy-1.0.1.tar.gz",
"has_sig": false,
"md5_digest": "2ce80b09c6df2d83bf98b600d178cee4",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.8,<4.0",
"size": 2967,
"upload_time": "2023-08-31T02:20:12",
"upload_time_iso_8601": "2023-08-31T02:20:12.647901Z",
"url": "https://files.pythonhosted.org/packages/ca/11/4bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8/edaeasy-1.0.1.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2023-08-31 02:20:12",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "edaeasy"
}