Name | Version | Summary | date |
typedspark |
1.4.1 |
Column-wise type annotations for pyspark DataFrames |
2024-04-13 08:03:24 |
spark-dataframe-tools |
0.6.5 |
spark_dataframe_tools |
2024-04-12 08:29:45 |
spark-dummy-tools |
0.8.3 |
spark_dummy_tools |
2024-04-12 08:13:16 |
raydp-nightly |
2024.4.10.dev0 |
RayDP: Distributed Data Processing on Ray |
2024-04-10 00:49:01 |
spark-quality-rules-tools |
0.9.10 |
spark_quality_rules_tools |
2024-04-08 23:05:07 |
spark-gaps-date-rorc-tools |
0.2.1 |
spark_gaps_date_rorc_tools |
2024-04-07 06:04:22 |
johnsnowlabs-for-databricks |
5.3.3 |
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster |
2024-04-05 05:35:03 |
johnsnowlabs |
5.3.3 |
The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster |
2024-04-05 05:35:00 |
spark-scaffolder-transforms-tools |
0.0.1 |
spark_scaffolder_transforms_tools |
2024-04-04 08:43:33 |
td-pyspark |
24.4.1 |
Treasure Data extension for pyspark |
2024-04-04 05:04:47 |
spark-dql-tools |
0.7.2 |
spark_dql_tools |
2024-04-02 00:03:13 |
aws-insurancelake-etl |
3.3.1 |
A CDK Python app for deploying ETL jobs that operate data pipelines for InsuranceLake in AWS |
2024-03-27 22:01:00 |
spark-on-k8s |
0.4.0 |
A Python package to submit and manage Apache Spark applications on Kubernetes. |
2024-03-24 23:54:45 |
spark-nlp |
5.3.2 |
John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment. |
2024-03-20 19:09:21 |
shui |
0.8.1 |
Spark-Hadoop Unix Installer |
2024-03-11 17:20:17 |
nlu |
5.3.0 |
John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained models in 200+ languages. It enables swift and simple development and research with its powerful Pythonic and Keras inspired API. It is powerd by John Snow Labs powerful Spark NLP library. |
2024-03-08 18:46:44 |
repartipy |
0.1.8 |
Helper for handling PySpark DataFrame partition size 📑🎛️ |
2024-03-08 04:47:37 |
glue-utils |
0.1.1 |
Reusable utilities for working with Glue PySpark jobs |
2024-03-07 09:37:37 |
spark-datax-tools |
0.6.6 |
spark_datax_tools |
2024-03-01 19:49:07 |
jupyterlab-sql-editor |
0.1.94 |
SQL editor support for formatting, syntax highlighting and code completion of SQL in cell magic, line magic, python string and file editor. |
2024-03-01 15:42:01 |