Name | Version | Summary | date |
kedro-viz |
10.1.0 |
Kedro-Viz helps visualise Kedro data and analytics pipelines |
2024-11-21 20:16:56 |
dbt_coves |
1.8.12 |
CLI tool for dbt users adopting analytics engineering best practices. |
2024-10-24 20:29:14 |
DeepCoreML |
0.4.0 |
A collection of Machine Learning techniques for data management, engineering and augmentation. |
2024-09-30 08:53:03 |
aws-json-dataset |
0.1.0 |
Send JSON datasets to various AWS services. |
2024-02-03 22:56:45 |
scistag |
0.9.0 |
A stack of helpful libraries & applications for the rapid development of data driven solutions. |
2024-01-15 22:38:53 |
dcw |
0.0.10 |
|
2024-01-15 21:15:25 |
datasaurus |
0.0.2.dev4 |
Data Engineering framework based on Polars.rs |
2023-12-19 12:10:51 |
deepCoreML |
0.1 |
A collection of Machine Learning techniques for data management and augmentation. |
2023-11-29 22:13:22 |
pipelinex |
0.7.9 |
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more |
2023-11-28 12:52:31 |
dtflw |
0.6.7 |
dtflw is a Python framework for building modular data pipelines based on Databricks dbutils.notebook API. |
2023-10-29 13:28:29 |
wiz-craft |
1.1.1 |
A CLI-based dataset preprocessing tool for machine learning tasks. Features include data exploration, null value handling, one-hot encoding, and feature scaling, and download the modified dataset effortlessly. |
2023-10-18 08:32:14 |
pycurie |
0.1.16 |
|
2023-10-17 16:56:22 |
ParallelFileConcatenator |
0.1 |
ParallelFileConcatenator is a robust tool designed to efficiently combine data files of various formats (CSV, Feather, Parquet, XLSX, XLS) from a specified directory. |
2023-08-22 09:34:18 |
dbt-coves |
1.6.0 |
CLI tool for dbt users adopting analytics engineering best practices. |
2023-08-10 18:57:39 |
dsutils-ms |
1.10 |
My Data Science Utils |
2023-07-19 16:16:44 |
chartstag |
0.8.2 |
Charting and diagram extension for SciStag |
2023-06-15 21:28:04 |
flowrunner |
0.2.3 |
Flowrunner is a lightweight package to organize and represent Data Engineering/Science workflows |
2023-06-08 15:59:53 |
duckingit |
0.0.11 |
A framework to leverage clusters of serverless functions for analytics. Powered by DuckDB |
2023-05-23 19:26:49 |
adcpipeline |
0.2.1 |
A pipeline for a structured way of working |
2023-04-24 11:35:44 |
dataset-shuffler |
0.1.1 |
Data engineering tool for learning-based computer vision. |
2023-01-28 23:21:55 |