datasets-plus


| Field | Value |
| --- | --- |
| Name | datasets-plus |
| Version | 0.1.0 |
| home_page | None |
| Summary | A wrapper for HuggingFace datasets with additional utilities |
| upload_time | 2024-09-29 02:52:32 |
| maintainer | None |
| docs_url | None |
| author | None |
| requires_python | >=3.9 |
| license | MIT |
| keywords | huggingface datasets wrapper utilities |
| VCS | |
| bugtrack_url | |
| requirements | No requirements were recorded. |
| Travis-CI | No Travis. |
| coveralls test coverage | No coveralls. |

# 🤗 datasets-plus

A wrapper for Hugging Face datasets with extra utilities! 🚀

## 🌟 Features

- 🔧 Simplified dataset loading
- 🔀 Easy splitting and configuration
- 📁 Support for local and remote datasets
- 🧰 Additional utility functions

## 🚀 Installation

Install datasets-plus using pip:

```bash
pip install datasets-plus
```

## 📚 Usage

Here's a quick example of how to use datasets-plus:

```python
from datasets_plus import load_dataset

# Load the validation split of TriviaQA's "unfiltered" configuration
dataset = load_dataset("mandarjoshi/trivia_qa:unfiltered:validation")

# Print dataset info
print(f"Loaded dataset with {len(dataset)} examples")
print("First example:", dataset[0])

# Load the train split of a local Hugging Face dataset saved at /path/to/dataset
dataset = load_dataset("/path/to/dataset:train")
```
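
The two calls above suggest a colon-separated spec of the form `path[:config]:split`. The README does not document the grammar, so the following is only a minimal sketch of how such a string could be taken apart. `DatasetSpec` and `parse_spec` are hypothetical names introduced for illustration; they are not part of datasets-plus, and the library's actual parsing may differ.

```python
from typing import NamedTuple, Optional


class DatasetSpec(NamedTuple):
    """Hypothetical container for the pieces of a spec string."""

    path: str
    config: Optional[str]
    split: Optional[str]


def parse_spec(spec: str) -> DatasetSpec:
    """Split 'path', 'path:split', or 'path:config:split' into its parts.

    Assumption: the grammar is inferred from the two README examples only.
    Paths that themselves contain ':' (e.g. Windows drive letters) are not
    handled by this sketch.
    """
    parts = spec.split(":")
    if len(parts) == 1:
        return DatasetSpec(parts[0], None, None)
    if len(parts) == 2:
        return DatasetSpec(parts[0], None, parts[1])
    if len(parts) == 3:
        return DatasetSpec(parts[0], parts[1], parts[2])
    raise ValueError(f"Expected at most two ':' separators, got: {spec!r}")


remote = parse_spec("mandarjoshi/trivia_qa:unfiltered:validation")
local = parse_spec("/path/to/dataset:train")
print(remote)
print(local)
```

Under this reading, the first spec would correspond roughly to `datasets.load_dataset("mandarjoshi/trivia_qa", "unfiltered", split="validation")` in plain Hugging Face Datasets, and the second to loading a dataset saved on disk and selecting its `train` split.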

## 📄 License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

## 🙏 Acknowledgements

- [Hugging Face Datasets](https://github.com/huggingface/datasets) for the amazing foundation
- All our contributors and users!

Happy data loading! 🎉

            

Raw data

{
    "_id": null,
    "home_page": null,
    "name": "datasets-plus",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": null,
    "keywords": "huggingface, datasets, wrapper, utilities",
    "author": null,
    "author_email": "Maharshi Gor <mgor@cs.umd.edu>",
    "download_url": "https://files.pythonhosted.org/packages/e6/70/940389fae26c6089d83fb92b3d92160c9d50e2e957802bef8bc9a18d26c3/datasets_plus-0.1.0.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT",
    "summary": "A wrapper for HuggingFace datasets with additional utilities",
    "version": "0.1.0",
    "project_urls": {
        "Homepage": "https://github.com/maharshi95/datasets-plus"
    },
    "split_keywords": [
        "huggingface",
        " datasets",
        " wrapper",
        " utilities"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e7dad169f0f58ade845d2e0a84fff65a7a06c7669011d7d0a64d071c75cb1c35",
                "md5": "068d547395a482adc94cc17a6e070a32",
                "sha256": "8fe757e88199d37f1ff2bcae145d7ac4f233290b5c77052ad3f4f3a30674de50"
            },
            "downloads": -1,
            "filename": "datasets_plus-0.1.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "068d547395a482adc94cc17a6e070a32",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9",
            "size": 4606,
            "upload_time": "2024-09-29T02:52:31",
            "upload_time_iso_8601": "2024-09-29T02:52:31.294138Z",
            "url": "https://files.pythonhosted.org/packages/e7/da/d169f0f58ade845d2e0a84fff65a7a06c7669011d7d0a64d071c75cb1c35/datasets_plus-0.1.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e670940389fae26c6089d83fb92b3d92160c9d50e2e957802bef8bc9a18d26c3",
                "md5": "9cc331f4545eae3760e9a60a263ca012",
                "sha256": "af333ae65ea3ee91f3a2affd3acb4d5d449b660e2e28cc86bd2c1acf430c68d1"
            },
            "downloads": -1,
            "filename": "datasets_plus-0.1.0.tar.gz",
            "has_sig": false,
            "md5_digest": "9cc331f4545eae3760e9a60a263ca012",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 5018,
            "upload_time": "2024-09-29T02:52:32",
            "upload_time_iso_8601": "2024-09-29T02:52:32.696705Z",
            "url": "https://files.pythonhosted.org/packages/e6/70/940389fae26c6089d83fb92b3d92160c9d50e2e957802bef8bc9a18d26c3/datasets_plus-0.1.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-09-29 02:52:32",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "maharshi95",
    "github_project": "datasets-plus",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [],
    "lcname": "datasets-plus"
}
        