PyDigger - unearthing stuff about Python


Name Version Summary Date
typedspark 1.4.1 Column-wise type annotations for pyspark DataFrames 2024-04-13 08:03:24
spark-dataframe-tools 0.6.5 spark_dataframe_tools 2024-04-12 08:29:45
spark-dummy-tools 0.8.3 spark_dummy_tools 2024-04-12 08:13:16
raydp-nightly 2024.4.10.dev0 RayDP: Distributed Data Processing on Ray 2024-04-10 00:49:01
spark-quality-rules-tools 0.9.10 spark_quality_rules_tools 2024-04-08 23:05:07
spark-gaps-date-rorc-tools 0.2.1 spark_gaps_date_rorc_tools 2024-04-07 06:04:22
johnsnowlabs-for-databricks 5.3.3 The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster 2024-04-05 05:35:03
johnsnowlabs 5.3.3 The John Snow Labs Library gives you access to all of John Snow Labs Enterprise And Open Source products in an easy and simple manner. Access 10000+ state-of-the-art NLP and OCR models for Finance, Legal and Medical domains. Easily scalable to Spark Cluster 2024-04-05 05:35:00
spark-scaffolder-transforms-tools 0.0.1 spark_scaffolder_transforms_tools 2024-04-04 08:43:33
td-pyspark 24.4.1 Treasure Data extension for pyspark 2024-04-04 05:04:47
spark-dql-tools 0.7.2 spark_dql_tools 2024-04-02 00:03:13
aws-insurancelake-etl 3.3.1 A CDK Python app for deploying ETL jobs that operate data pipelines for InsuranceLake in AWS 2024-03-27 22:01:00
spark-on-k8s 0.4.0 A Python package to submit and manage Apache Spark applications on Kubernetes. 2024-03-24 23:54:45
spark-nlp 5.3.2 John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment. 2024-03-20 19:09:21
shui 0.8.1 Spark-Hadoop Unix Installer 2024-03-11 17:20:17
nlu 5.3.0 John Snow Labs NLU provides state of the art algorithms for NLP&NLU with 20000+ of pretrained models in 200+ languages. It enables swift and simple development and research with its powerful Pythonic and Keras inspired API. It is powered by John Snow Labs powerful Spark NLP library. 2024-03-08 18:46:44
repartipy 0.1.8 Helper for handling PySpark DataFrame partition size 📑🎛️ 2024-03-08 04:47:37
glue-utils 0.1.1 Reusable utilities for working with Glue PySpark jobs 2024-03-07 09:37:37
spark-datax-tools 0.6.6 spark_datax_tools 2024-03-01 19:49:07
jupyterlab-sql-editor 0.1.94 SQL editor support for formatting, syntax highlighting and code completion of SQL in cell magic, line magic, python string and file editor. 2024-03-01 15:42:01