PyDigger - unearthing stuff about Python


NameVersionSummarydate
pysparta 0.5.1 Library to help ETL using pyspark 2024-10-12 17:44:14
aws-insurancelake-etl 4.1.2 A CDK Python app for deploying ETL jobs that operate data pipelines for InsuranceLake in AWS 2024-10-08 14:30:22
spark-connect-proxy 0.0.8 A reverse proxy server which allows secure connectivity to a Spark Connect server 2024-10-04 20:03:34
johnsnowlabs-for-databricks 5.4.5 The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster 2024-09-27 03:20:00
johnsnowlabs 5.4.5 The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster 2024-09-27 03:19:59
nlu 5.4.1 John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained models in 200+ languages. It enables swift and simple development and research with its powerful Pythonic and Keras inspired API. It is powerd by John Snow Labs powerful Spark NLP library. 2024-09-27 01:23:20
spark-nlp 5.5.0 John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment. 2024-09-25 14:58:30
spark-datax-tools 0.7.0 spark_datax_tools 2024-09-19 00:39:50
spark-gaps-date-rorc-tools 0.2.3 spark_gaps_date_rorc_tools 2024-09-12 00:31:57
onetl 0.12.0 One ETL tool to rule them all 2024-09-03 12:45:46
spark-datiofilesystem-tools 0.1.7 spark_datiofilesystem_tools 2024-08-16 04:48:27
fugue-sql-antlr-cpp 0.2.2 Fugue SQL Antlr C++ Parser 2024-08-15 07:36:18
fugue-sql-antlr 0.2.2 Fugue SQL Antlr Parser 2024-08-15 07:25:57
spark-dataframe-tools 0.6.13 spark_dataframe_tools 2024-08-13 20:45:31
typedspark 1.5.0 Column-wise type annotations for pyspark DataFrames 2024-08-12 12:58:01
spark-on-k8s 0.10.0 A Python package to submit and manage Apache Spark applications on Kubernetes. 2024-07-24 22:10:45
spark-acl-tools 0.3.9 spark_acl_tools 2024-07-18 05:31:34
spark-frame 0.5.2 A library containing various utility functions for playing with PySpark DataFrames 2024-07-11 16:10:52
raydp 1.6.1 RayDP: Distributed Data Processing on Ray 2024-06-26 07:38:48
pyspark-graph 0.0.7 Pure pyspark implementation of graph algorithms 2024-06-15 22:34:19
hourdayweektotal
38221210309253173
Elapsed time: 1.01505s