squirrel-datasets-core


Namesquirrel-datasets-core JSON
Version 0.1.8 PyPI version JSON
download
home_page
SummarySquirrel public datasets collection
upload_time2022-07-14 13:28:14
maintainer
docs_urlNone
authorMerantix Momentum
requires_python>=3.8.0
licenseApache 2.0
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            <div align="center">
  
# <img src="https://raw.githubusercontent.com/merantix-momentum/squirrel-datasets-core/main/docs/source/_static/logo.png" width="150px"> Squirrel Datasets Core
  
[![Python](https://img.shields.io/pypi/pyversions/squirrel-datasets-core.svg?style=plastic)](https://badge.fury.io/py/squirrel-datasets-core)
[![PyPI](https://badge.fury.io/py/squirrel-datasets-core.svg)](https://badge.fury.io/py/squirrel-datasets-core)
[![Conda](https://img.shields.io/conda/vn/conda-forge/squirrel-datasets-core)](https://anaconda.org/conda-forge/squirrel-datasets-core)
[![Documentation Status](https://readthedocs.org/projects/squirrel-datasets-core/badge/?version=latest)](https://squirrel-datasets-core.readthedocs.io)
[![Downloads](https://static.pepy.tech/personalized-badge/squirrel-datasets-core?period=total&units=international_system&left_color=grey&right_color=blue&left_text=Downloads)](https://pepy.tech/project/squirrel-datasets-core)
[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://raw.githubusercontent.com/merantix-momentum/squirrel-datasets-core/main/LICENSE)
[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.6420214.svg)](https://doi.org/10.5281/zenodo.6420214)
[![Generic badge](https://img.shields.io/badge/Website-Merantix%20Momentum-blue)](https://merantix-momentum.com)
[![Slack](https://img.shields.io/badge/slack-chat-green.svg?logo=slack)](https://join.slack.com/t/squirrel-core/shared_invite/zt-14k6sk6sw-zQPHfqAI8Xq5WYd~UqgNFw)

</div>

---
# What is Squirrel Datasets Core?

`squirrel-datasets-core` is an extension of the [Squirrel](https://github.com/merantix-momentum/squirrel-core) library. `squirrel-datasets-core` is a hub where the user can 1) explore existing datasets registered in the data mesh by other users and 2) preprocess their datasets and share them with other users. As an end user, you will
be able to load many publically available datasets with ease and speed with the help of `squirrel`, or load and preprocess
your own datasets with the tools we provide here. 

For preprocessing, we currently support Spark as the main tool to carry out the task.

If you have any questions or would like to contribute, join our [Slack community](https://join.slack.com/t/squirrel-core/shared_invite/zt-14k6sk6sw-zQPHfqAI8Xq5WYd~UqgNFw)!

# Installation
Install `squirrel-core` and `squirrel-datasets-core` with pip:

```shell
pip install squirrel-core[all]
pip install squirrel-datasets-core[all]
```
# Documentation

Visit our documentation on [Readthedocs](https://squirrel-datasets-core.readthedocs.io).

# Contributing
`squirrel-datasets-core` is open source and community contributions are welcome!

# The humans behind Squirrel
We are [Merantix Momentum](https://merantix-momentum.com/), a team of ~30 machine learning engineers, developing machine learning solutions for industry and research. Each project comes with its own challenges, data types and learnings, but one issue we always faced was scalable data loading, transforming and sharing. We were looking for a solution that would allow us to load the data in a fast and cost-efficient way, while keeping the flexibility to work with any possible dataset and integrate with any API. That's why we build Squirrel – and we hope you'll find it as useful as we do! By the way, [we are hiring](https://merantix-momentum.com/about#jobs)!


# Citation

If you use Squirrel Datasets in your research, please cite Squirrel using:
```bibtex
@article{2022squirrelcore,
  title={Squirrel: A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way.},
  author={Squirrel Developer Team},
  journal={GitHub. Note: https://github.com/merantix-momentum/squirrel-core},
  year={2022}
}
```

            

Raw data

            {
    "_id": null,
    "home_page": "",
    "name": "squirrel-datasets-core",
    "maintainer": "",
    "docs_url": null,
    "requires_python": ">=3.8.0",
    "maintainer_email": "",
    "keywords": "",
    "author": "Merantix Momentum",
    "author_email": "",
    "download_url": "https://files.pythonhosted.org/packages/53/6b/1290efad1eb4d23451abdd702791f040d5222ca43b2b765731560026e175/squirrel_datasets_core-0.1.8.tar.gz",
    "platform": null,
    "description": "<div align=\"center\">\n  \n# <img src=\"https://raw.githubusercontent.com/merantix-momentum/squirrel-datasets-core/main/docs/source/_static/logo.png\" width=\"150px\"> Squirrel Datasets Core\n  \n[![Python](https://img.shields.io/pypi/pyversions/squirrel-datasets-core.svg?style=plastic)](https://badge.fury.io/py/squirrel-datasets-core)\n[![PyPI](https://badge.fury.io/py/squirrel-datasets-core.svg)](https://badge.fury.io/py/squirrel-datasets-core)\n[![Conda](https://img.shields.io/conda/vn/conda-forge/squirrel-datasets-core)](https://anaconda.org/conda-forge/squirrel-datasets-core)\n[![Documentation Status](https://readthedocs.org/projects/squirrel-datasets-core/badge/?version=latest)](https://squirrel-datasets-core.readthedocs.io)\n[![Downloads](https://static.pepy.tech/personalized-badge/squirrel-datasets-core?period=total&units=international_system&left_color=grey&right_color=blue&left_text=Downloads)](https://pepy.tech/project/squirrel-datasets-core)\n[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://raw.githubusercontent.com/merantix-momentum/squirrel-datasets-core/main/LICENSE)\n[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.6420214.svg)](https://doi.org/10.5281/zenodo.6420214)\n[![Generic badge](https://img.shields.io/badge/Website-Merantix%20Momentum-blue)](https://merantix-momentum.com)\n[![Slack](https://img.shields.io/badge/slack-chat-green.svg?logo=slack)](https://join.slack.com/t/squirrel-core/shared_invite/zt-14k6sk6sw-zQPHfqAI8Xq5WYd~UqgNFw)\n\n</div>\n\n---\n# What is Squirrel Datasets Core?\n\n`squirrel-datasets-core` is an extension of the [Squirrel](https://github.com/merantix-momentum/squirrel-core) library. `squirrel-datasets-core` is a hub where the user can 1) explore existing datasets registered in the data mesh by other users and 2) preprocess their datasets and share them with other users. As an end user, you will\nbe able to load many publically available datasets with ease and speed with the help of `squirrel`, or load and preprocess\nyour own datasets with the tools we provide here. \n\nFor preprocessing, we currently support Spark as the main tool to carry out the task.\n\nIf you have any questions or would like to contribute, join our [Slack community](https://join.slack.com/t/squirrel-core/shared_invite/zt-14k6sk6sw-zQPHfqAI8Xq5WYd~UqgNFw)!\n\n# Installation\nInstall `squirrel-core` and `squirrel-datasets-core` with pip:\n\n```shell\npip install squirrel-core[all]\npip install squirrel-datasets-core[all]\n```\n# Documentation\n\nVisit our documentation on [Readthedocs](https://squirrel-datasets-core.readthedocs.io).\n\n# Contributing\n`squirrel-datasets-core` is open source and community contributions are welcome!\n\n# The humans behind Squirrel\nWe are [Merantix Momentum](https://merantix-momentum.com/), a team of ~30 machine learning engineers, developing machine learning solutions for industry and research. Each project comes with its own challenges, data types and learnings, but one issue we always faced was scalable data loading, transforming and sharing. We were looking for a solution that would allow us to load the data in a fast and cost-efficient way, while keeping the flexibility to work with any possible dataset and integrate with any API. That's why we build Squirrel \u2013 and we hope you'll find it as useful as we do! By the way, [we are hiring](https://merantix-momentum.com/about#jobs)!\n\n\n# Citation\n\nIf you use Squirrel Datasets in your research, please cite Squirrel using:\n```bibtex\n@article{2022squirrelcore,\n  title={Squirrel: A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way.},\n  author={Squirrel Developer Team},\n  journal={GitHub. Note: https://github.com/merantix-momentum/squirrel-core},\n  year={2022}\n}\n```\n",
    "bugtrack_url": null,
    "license": "Apache 2.0",
    "summary": "Squirrel public datasets collection",
    "version": "0.1.8",
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "md5": "b7006d1e3440a46f2fae9c530120dbad",
                "sha256": "d953825974bcfb3fa3019c23596bc2d11b742a8c3acc536cf8547cbdab667c9d"
            },
            "downloads": -1,
            "filename": "squirrel_datasets_core-0.1.8-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "b7006d1e3440a46f2fae9c530120dbad",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.8.0",
            "size": 53707,
            "upload_time": "2022-07-14T13:28:12",
            "upload_time_iso_8601": "2022-07-14T13:28:12.964918Z",
            "url": "https://files.pythonhosted.org/packages/9d/49/263544db8093adfc912b1be964b4cc7451c4c473cd21b79bf1753b551751/squirrel_datasets_core-0.1.8-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "md5": "bef3f511ef7151dc9d6cc7b0bd62427c",
                "sha256": "89375ce8d543ede13c33a72f16477da996b0c82698c6009511d21c31a777d198"
            },
            "downloads": -1,
            "filename": "squirrel_datasets_core-0.1.8.tar.gz",
            "has_sig": false,
            "md5_digest": "bef3f511ef7151dc9d6cc7b0bd62427c",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.8.0",
            "size": 38068,
            "upload_time": "2022-07-14T13:28:14",
            "upload_time_iso_8601": "2022-07-14T13:28:14.796247Z",
            "url": "https://files.pythonhosted.org/packages/53/6b/1290efad1eb4d23451abdd702791f040d5222ca43b2b765731560026e175/squirrel_datasets_core-0.1.8.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2022-07-14 13:28:14",
    "github": false,
    "gitlab": false,
    "bitbucket": false,
    "lcname": "squirrel-datasets-core"
}
        
Elapsed time: 0.51549s