Name | Version | Summary | Date |
spark-nlp | 5.5.2 | John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment. | 2024-12-18 16:04:11 |
pysparta | 0.5.5 | Library to help ETL using pyspark | 2024-12-05 20:30:42 |
johnsnowlabs-for-databricks | 5.5.2 | The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster | 2024-12-05 17:16:56 |
onetl | 0.12.5 | One ETL tool to rule them all | 2024-12-03 09:32:12 |
spark-dataproc-local-tools | 0.1.4 | spark_dataproc_local_tools | 2024-11-30 08:28:43 |
raydp-nightly | 2024.11.22.dev0 | RayDP: Distributed Data Processing on Ray | 2024-11-22 01:08:19 |
spark-jdbc-ingestor | 1.0.1 | A library to handle JDBC ingestion from a SQL database in a simple and efficient way. | 2024-11-19 22:24:32 |
aws-insurancelake-etl | 4.1.3 | A CDK Python app for deploying ETL jobs that operate data pipelines for InsuranceLake in AWS | 2024-11-18 19:02:45 |
johnsnowlabs | 5.5.1 | The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster | 2024-11-17 16:16:43 |
spark-acl-tools | 0.4.0 | spark_acl_tools | 2024-11-12 17:45:54 |
emrrunner | 1.0.9 | A powerful CLI tool and API for managing Spark jobs on Amazon EMR clusters | 2024-11-03 16:44:04 |
spark-dataframe-tools | 0.6.14 | spark_dataframe_tools | 2024-10-18 05:26:52 |
spark-connect-proxy | 0.0.10 | A reverse proxy server which allows secure connectivity to a Spark Connect server | 2024-10-16 15:39:49 |
nlu | 5.4.1 | John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained models in 200+ languages. It enables swift and simple development and research with its powerful Pythonic and Keras inspired API. It is powered by John Snow Labs powerful Spark NLP library. | 2024-09-27 01:23:20 |
spark-datax-tools | 0.7.0 | spark_datax_tools | 2024-09-19 00:39:50 |
spark-gaps-date-rorc-tools | 0.2.3 | spark_gaps_date_rorc_tools | 2024-09-12 00:31:57 |
spark-datiofilesystem-tools | 0.1.7 | spark_datiofilesystem_tools | 2024-08-16 04:48:27 |
fugue-sql-antlr-cpp | 0.2.2 | Fugue SQL Antlr C++ Parser | 2024-08-15 07:36:18 |
fugue-sql-antlr | 0.2.2 | Fugue SQL Antlr Parser | 2024-08-15 07:25:57 |
typedspark | 1.5.0 | Column-wise type annotations for pyspark DataFrames | 2024-08-12 12:58:01 |