dask-expr


Namedask-expr JSON
Version 1.0.13 PyPI version JSON
download
home_pageNone
SummaryHigh Level Expressions for Dask
upload_time2024-04-25 17:48:43
maintainerNone
docs_urlNone
authorNone
requires_python>=3.9
licenseBSD
keywords dask pandas
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            Dask Expressions
================

Dask DataFrames with query optimization.

This is a rewrite of Dask DataFrame that includes query
optimization and generally improved organization.

More in our blog posts:
- [Dask Expressions overview](https://blog.dask.org/2023/08/25/dask-expr-introduction)
- [TPC-H benchmark results vs. Dask DataFrame](https://blog.coiled.io/blog/dask-expr-tpch-dask.html)

Example
-------

```python
import dask_expr as dx

df = dx.datasets.timeseries()
df.head()

df.groupby("name").x.mean().compute()
```

Query Representation
--------------------

Dask-expr encodes user code in an expression tree:

```python
>>> df.x.mean().pprint()

Mean:
  Projection: columns='x'
    Timeseries: seed=1896674884
```

This expression tree will be optimized and modified before execution:

```python
>>> df.x.mean().optimize().pprint()

Div:
  Sum:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
  Count:
    Fused(375f9):
    | Projection: columns='x'
    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884
```

Stability
---------

This is the default backend for dask.DataFrame since version 2024.3.0.

API Coverage
------------

Dask-Expr covers almost everything of the Dask DataFrame API. The only missing features are:

- ``melt``
- named GroupBy Aggregations

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "dask-expr",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.9",
    "maintainer_email": "Matthew Rocklin <mrocklin@gmail.com>",
    "keywords": "dask pandas",
    "author": null,
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/4f/7e/18919e9a57ac3ff2cd6c43fb6303e1ed24fb16ac214b1f88636e399bef75/dask_expr-1.0.13.tar.gz",
    "platform": null,
    "description": "Dask Expressions\n================\n\nDask DataFrames with query optimization.\n\nThis is a rewrite of Dask DataFrame that includes query\noptimization and generally improved organization.\n\nMore in our blog posts:\n- [Dask Expressions overview](https://blog.dask.org/2023/08/25/dask-expr-introduction)\n- [TPC-H benchmark results vs. Dask DataFrame](https://blog.coiled.io/blog/dask-expr-tpch-dask.html)\n\nExample\n-------\n\n```python\nimport dask_expr as dx\n\ndf = dx.datasets.timeseries()\ndf.head()\n\ndf.groupby(\"name\").x.mean().compute()\n```\n\nQuery Representation\n--------------------\n\nDask-expr encodes user code in an expression tree:\n\n```python\n>>> df.x.mean().pprint()\n\nMean:\n  Projection: columns='x'\n    Timeseries: seed=1896674884\n```\n\nThis expression tree will be optimized and modified before execution:\n\n```python\n>>> df.x.mean().optimize().pprint()\n\nDiv:\n  Sum:\n    Fused(375f9):\n    | Projection: columns='x'\n    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884\n  Count:\n    Fused(375f9):\n    | Projection: columns='x'\n    |   Timeseries: dtypes={'x': <class 'float'>} seed=1896674884\n```\n\nStability\n---------\n\nThis is the default backend for dask.DataFrame since version 2024.3.0.\n\nAPI Coverage\n------------\n\nDask-Expr covers almost everything of the Dask DataFrame API. The only missing features are:\n\n- ``melt``\n- named GroupBy Aggregations\n",
    "bugtrack_url": null,
    "license": "BSD",
    "summary": "High Level Expressions for Dask",
    "version": "1.0.13",
    "project_urls": {
        "Source code": "https://github.com/dask-contrib/dask-expr/"
    },
    "split_keywords": [
        "dask",
        "pandas"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "471f71529e39ad96964c0cee251e07b7a4df1856001bf0effb77cbe447340b6f",
                "md5": "3c4e766ede152b3b1b8061f3ff8797ec",
                "sha256": "b162f0b26b740d43a6fc66fae9e480dbac2ee0051eadb620d0a15ba24e65274c"
            },
            "downloads": -1,
            "filename": "dask_expr-1.0.13-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "3c4e766ede152b3b1b8061f3ff8797ec",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.9",
            "size": 195115,
            "upload_time": "2024-04-25T17:48:40",
            "upload_time_iso_8601": "2024-04-25T17:48:40.299023Z",
            "url": "https://files.pythonhosted.org/packages/47/1f/71529e39ad96964c0cee251e07b7a4df1856001bf0effb77cbe447340b6f/dask_expr-1.0.13-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "4f7e18919e9a57ac3ff2cd6c43fb6303e1ed24fb16ac214b1f88636e399bef75",
                "md5": "ac91122014a49b9244706d6f9ee1a490",
                "sha256": "9a98562c7f30a5c08ae91028a4c81f037a6682eb81967a3658d9d14411f53a83"
            },
            "downloads": -1,
            "filename": "dask_expr-1.0.13.tar.gz",
            "has_sig": false,
            "md5_digest": "ac91122014a49b9244706d6f9ee1a490",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.9",
            "size": 197745,
            "upload_time": "2024-04-25T17:48:43",
            "upload_time_iso_8601": "2024-04-25T17:48:43.221687Z",
            "url": "https://files.pythonhosted.org/packages/4f/7e/18919e9a57ac3ff2cd6c43fb6303e1ed24fb16ac214b1f88636e399bef75/dask_expr-1.0.13.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-04-25 17:48:43",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "dask-contrib",
    "github_project": "dask-expr",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "dask-expr"
}
        
Elapsed time: 0.25411s