## pyspark-graph
This is a pure pyspark implementation of graph algorithms.
Many of these capabilites are already available in GraphX and GraphFrames,
but the language choice limits accessiblity for those who are not
familiar with Scala.
Additionally, those libraries offer just the basic tools needed to implement
graph analytics whereas here we aim to offer a more batteries-included approach.
### Installation
This package is available on PyPI, to install it simply run:
```
pip install pyspark-graph
```
### Supported algorithms
The following table compares the features of pyspark-graph with GraphFrames and GraphX. The goal is to add the missing features and continue to add additional algorithms in future.
| Name | GraphX | GraphFrames | pyspark-graph |
|------------------------------|--------|-------------|---------------|
| AggregateMessages | ✅ | ✅ | ✅ |
| BFS | ✅ | ✅ | ✅ |
| ConnectedComponents | ✅ | ✅ | ✅ |
| LabelPropagation | ✅ | ❌ | ✅ |
| PageRank | ✅ | ❌ | ❌ |
| ParallelPersonalizedPageRank | ✅ | ❌ | ❌ |
| Pregel | ✅ | ✅ | ✅ |
| SVDPlusPlus | ✅ | ❌ | ❌ |
| ShortestPaths | ✅ | ❌ | ❌ |
| StronglyConnectedComponents | ✅ | ❌ | ❌ |
| TriangleCount | ✅ | ✅ | ✅ |
| JaccardSimilarity | ❌ | ❌ | ✅ |
| OverlapCoefficient | ❌ | ❌ | ✅ |
Raw data
{
"_id": null,
"home_page": null,
"name": "pyspark-graph",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.10",
"maintainer_email": null,
"keywords": "spark, pyspark, graph, graphx, graphframes",
"author": null,
"author_email": "Joshua <author@example.com>",
"download_url": "https://files.pythonhosted.org/packages/f5/de/9b02b1754962f9b7c69414283ba09d5c92eade175ad12711dbb653f8b8ad/pyspark_graph-0.0.7.tar.gz",
"platform": null,
"description": "## pyspark-graph\n\nThis is a pure pyspark implementation of graph algorithms.\nMany of these capabilites are already available in GraphX and GraphFrames,\nbut the language choice limits accessiblity for those who are not \nfamiliar with Scala.\n\nAdditionally, those libraries offer just the basic tools needed to implement\ngraph analytics whereas here we aim to offer a more batteries-included approach.\n\n### Installation\nThis package is available on PyPI, to install it simply run:\n```\npip install pyspark-graph\n```\n\n### Supported algorithms\nThe following table compares the features of pyspark-graph with GraphFrames and GraphX. The goal is to add the missing features and continue to add additional algorithms in future.\n\n| Name | GraphX | GraphFrames | pyspark-graph |\n|------------------------------|--------|-------------|---------------|\n| AggregateMessages | \u2705 | \u2705 | \u2705 |\n| BFS | \u2705 | \u2705 | \u2705 |\n| ConnectedComponents | \u2705 | \u2705 | \u2705 |\n| LabelPropagation | \u2705 | \u274c | \u2705 |\n| PageRank | \u2705 | \u274c | \u274c |\n| ParallelPersonalizedPageRank | \u2705 | \u274c | \u274c |\n| Pregel | \u2705 | \u2705 | \u2705 |\n| SVDPlusPlus | \u2705 | \u274c | \u274c |\n| ShortestPaths | \u2705 | \u274c | \u274c |\n| StronglyConnectedComponents | \u2705 | \u274c | \u274c |\n| TriangleCount | \u2705 | \u2705 | \u2705 |\n| JaccardSimilarity | \u274c | \u274c | \u2705 |\n| OverlapCoefficient | \u274c | \u274c | \u2705 |\n",
"bugtrack_url": null,
"license": null,
"summary": "Pure pyspark implementation of graph algorithms",
"version": "0.0.7",
"project_urls": {
"Homepage": "https://github.com/aktungmak/pyspark-graph",
"Issues": "https://github.com/aktungmak/pyspark-graph/issues"
},
"split_keywords": [
"spark",
" pyspark",
" graph",
" graphx",
" graphframes"
],
"urls": [
{
"comment_text": "",
"digests": {
"blake2b_256": "8896ef815233f3eb0bcceb7b7accf5ef708ba228f19182d5d2738f2293cb1852",
"md5": "883d103c3dacd2c94df10d5fbfa8f570",
"sha256": "661f590ae517358e0cbc8957c1019170589b0b9a3ba96d33ad594f97d849ce6b"
},
"downloads": -1,
"filename": "pyspark_graph-0.0.7-py3-none-any.whl",
"has_sig": false,
"md5_digest": "883d103c3dacd2c94df10d5fbfa8f570",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.10",
"size": 20294,
"upload_time": "2024-06-15T22:34:18",
"upload_time_iso_8601": "2024-06-15T22:34:18.033785Z",
"url": "https://files.pythonhosted.org/packages/88/96/ef815233f3eb0bcceb7b7accf5ef708ba228f19182d5d2738f2293cb1852/pyspark_graph-0.0.7-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": "",
"digests": {
"blake2b_256": "f5de9b02b1754962f9b7c69414283ba09d5c92eade175ad12711dbb653f8b8ad",
"md5": "aa850aec9f143fb7833c6a21e98062a2",
"sha256": "1a3e736327b29f287ccd64af48b2624a98337f73bc53829ef5dafabb80134796"
},
"downloads": -1,
"filename": "pyspark_graph-0.0.7.tar.gz",
"has_sig": false,
"md5_digest": "aa850aec9f143fb7833c6a21e98062a2",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.10",
"size": 16072,
"upload_time": "2024-06-15T22:34:19",
"upload_time_iso_8601": "2024-06-15T22:34:19.401842Z",
"url": "https://files.pythonhosted.org/packages/f5/de/9b02b1754962f9b7c69414283ba09d5c92eade175ad12711dbb653f8b8ad/pyspark_graph-0.0.7.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2024-06-15 22:34:19",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "aktungmak",
"github_project": "pyspark-graph",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"requirements": [],
"lcname": "pyspark-graph"
}