pyspark-graph


Namepyspark-graph JSON
Version 0.0.7 PyPI version JSON
download
home_pageNone
SummaryPure pyspark implementation of graph algorithms
upload_time2024-06-15 22:34:19
maintainerNone
docs_urlNone
authorNone
requires_python>=3.10
licenseNone
keywords spark pyspark graph graphx graphframes
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ## pyspark-graph

This is a pure pyspark implementation of graph algorithms.
Many of these capabilites are already available in GraphX and GraphFrames,
but the language choice limits accessiblity for those who are not 
familiar with Scala.

Additionally, those libraries offer just the basic tools needed to implement
graph analytics whereas here we aim to offer a more batteries-included approach.

### Installation
This package is available on PyPI, to install it simply run:
```
pip install pyspark-graph
```

### Supported algorithms
The following table compares the features of pyspark-graph with GraphFrames and GraphX. The goal is to add the missing features and continue to add additional algorithms in future.

| Name                         | GraphX | GraphFrames | pyspark-graph |
|------------------------------|--------|-------------|---------------|
| AggregateMessages            | ✅      | ✅           | ✅             |
| BFS                          | ✅      | ✅           | ✅             |
| ConnectedComponents          | ✅      | ✅           | ✅             |
| LabelPropagation             | ✅      | ❌           | ✅             |
| PageRank                     | ✅      | ❌           | ❌             |
| ParallelPersonalizedPageRank | ✅      | ❌           | ❌             |
| Pregel                       | ✅      | ✅           | ✅             |
| SVDPlusPlus                  | ✅      | ❌           | ❌             |
| ShortestPaths                | ✅      | ❌           | ❌             |
| StronglyConnectedComponents  | ✅      | ❌           | ❌             |
| TriangleCount                | ✅      | ✅           | ✅             |
| JaccardSimilarity            | ❌      | ❌           | ✅             |
| OverlapCoefficient           | ❌      | ❌           | ✅             |

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "pyspark-graph",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.10",
    "maintainer_email": null,
    "keywords": "spark, pyspark, graph, graphx, graphframes",
    "author": null,
    "author_email": "Joshua <author@example.com>",
    "download_url": "https://files.pythonhosted.org/packages/f5/de/9b02b1754962f9b7c69414283ba09d5c92eade175ad12711dbb653f8b8ad/pyspark_graph-0.0.7.tar.gz",
    "platform": null,
    "description": "## pyspark-graph\n\nThis is a pure pyspark implementation of graph algorithms.\nMany of these capabilites are already available in GraphX and GraphFrames,\nbut the language choice limits accessiblity for those who are not \nfamiliar with Scala.\n\nAdditionally, those libraries offer just the basic tools needed to implement\ngraph analytics whereas here we aim to offer a more batteries-included approach.\n\n### Installation\nThis package is available on PyPI, to install it simply run:\n```\npip install pyspark-graph\n```\n\n### Supported algorithms\nThe following table compares the features of pyspark-graph with GraphFrames and GraphX. The goal is to add the missing features and continue to add additional algorithms in future.\n\n| Name                         | GraphX | GraphFrames | pyspark-graph |\n|------------------------------|--------|-------------|---------------|\n| AggregateMessages            | \u2705      | \u2705           | \u2705             |\n| BFS                          | \u2705      | \u2705           | \u2705             |\n| ConnectedComponents          | \u2705      | \u2705           | \u2705             |\n| LabelPropagation             | \u2705      | \u274c           | \u2705             |\n| PageRank                     | \u2705      | \u274c           | \u274c             |\n| ParallelPersonalizedPageRank | \u2705      | \u274c           | \u274c             |\n| Pregel                       | \u2705      | \u2705           | \u2705             |\n| SVDPlusPlus                  | \u2705      | \u274c           | \u274c             |\n| ShortestPaths                | \u2705      | \u274c           | \u274c             |\n| StronglyConnectedComponents  | \u2705      | \u274c           | \u274c             |\n| TriangleCount                | \u2705      | \u2705           | \u2705             |\n| JaccardSimilarity            | \u274c      | \u274c           | \u2705             |\n| OverlapCoefficient           | \u274c      | \u274c           | \u2705             |\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "Pure pyspark implementation of graph algorithms",
    "version": "0.0.7",
    "project_urls": {
        "Homepage": "https://github.com/aktungmak/pyspark-graph",
        "Issues": "https://github.com/aktungmak/pyspark-graph/issues"
    },
    "split_keywords": [
        "spark",
        " pyspark",
        " graph",
        " graphx",
        " graphframes"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "8896ef815233f3eb0bcceb7b7accf5ef708ba228f19182d5d2738f2293cb1852",
                "md5": "883d103c3dacd2c94df10d5fbfa8f570",
                "sha256": "661f590ae517358e0cbc8957c1019170589b0b9a3ba96d33ad594f97d849ce6b"
            },
            "downloads": -1,
            "filename": "pyspark_graph-0.0.7-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "883d103c3dacd2c94df10d5fbfa8f570",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": ">=3.10",
            "size": 20294,
            "upload_time": "2024-06-15T22:34:18",
            "upload_time_iso_8601": "2024-06-15T22:34:18.033785Z",
            "url": "https://files.pythonhosted.org/packages/88/96/ef815233f3eb0bcceb7b7accf5ef708ba228f19182d5d2738f2293cb1852/pyspark_graph-0.0.7-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "f5de9b02b1754962f9b7c69414283ba09d5c92eade175ad12711dbb653f8b8ad",
                "md5": "aa850aec9f143fb7833c6a21e98062a2",
                "sha256": "1a3e736327b29f287ccd64af48b2624a98337f73bc53829ef5dafabb80134796"
            },
            "downloads": -1,
            "filename": "pyspark_graph-0.0.7.tar.gz",
            "has_sig": false,
            "md5_digest": "aa850aec9f143fb7833c6a21e98062a2",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.10",
            "size": 16072,
            "upload_time": "2024-06-15T22:34:19",
            "upload_time_iso_8601": "2024-06-15T22:34:19.401842Z",
            "url": "https://files.pythonhosted.org/packages/f5/de/9b02b1754962f9b7c69414283ba09d5c92eade175ad12711dbb653f8b8ad/pyspark_graph-0.0.7.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-06-15 22:34:19",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "aktungmak",
    "github_project": "pyspark-graph",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": false,
    "requirements": [],
    "lcname": "pyspark-graph"
}
        
Elapsed time: 4.83017s