# Jorvik
[](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)
[](https://coveralls.io/github/jorvik-io/jorvik?branch=main)
[](https://pepy.tech/projects/jorvik)
Jorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.
## Contribute
The Jorvik project welcomes your expertise and enthusiasm!
Writing code isn’t the only way to contribute. You can also:
- review pull requests
- suggest improvements through issues
- let us know your pain-points and repetitive tasks
- help us stay on top of new and old issues
- develop tutorials, videos, presentations, and other educational materials
See [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.
## Getting Started.
Jorvik is available in Pypi and can be installed with pip
```bash
pip install jorvik
```
### Packages:
- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer
- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test etl pipelines with ease
- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage
### Examples:
See the full power of jorvik when all the features come together in the examples bellow:
#### Databricks
- [Transactions](https://github.com/jorvik-io/jorvik/blob/main/examples/databricks/transactions/README.md): A multi step pipeline that creates customer statistics from customers and transaction data.
Raw data
{
"_id": null,
"home_page": "https://github.com/jorvik-io/jorvik",
"name": "jorvik",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "ETL, Pyspark, Data Engineering, Data Pipelines",
"author": "https://github.com/jorvik-io",
"author_email": null,
"download_url": null,
"platform": null,
"description": "# Jorvik\n[](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)\n[](https://coveralls.io/github/jorvik-io/jorvik?branch=main)\n[](https://pepy.tech/projects/jorvik)\n\nJorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.\n\n## Contribute\nThe Jorvik project welcomes your expertise and enthusiasm!\n\nWriting code isn\u2019t the only way to contribute. You can also:\n\n- review pull requests\n- suggest improvements through issues\n- let us know your pain-points and repetitive tasks\n- help us stay on top of new and old issues\n- develop tutorials, videos, presentations, and other educational materials\n\nSee [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.\n\n## Getting Started.\nJorvik is available in Pypi and can be installed with pip\n\n```bash\npip install jorvik\n```\n\n### Packages:\n- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer\n- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test etl pipelines with ease\n- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage\n\n### Examples:\nSee the full power of jorvik when all the features come together in the examples bellow:\n#### Databricks\n\n- [Transactions](https://github.com/jorvik-io/jorvik/blob/main/examples/databricks/transactions/README.md): A multi step pipeline that creates customer statistics from customers and transaction data.\n",
"bugtrack_url": null,
"license": "Apache License 2.0",
"summary": "A set of utilities for creating and managing ETL Pipelines with pyspark.",
"version": "1.2.4",
"project_urls": {
"Homepage": "https://github.com/jorvik-io/jorvik"
},
"split_keywords": [
"etl",
" pyspark",
" data engineering",
" data pipelines"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "b038d9730b7dc0cf1be3d85ee98c7af6cc0cca5c3778151f59506aaea7bfb9bf",
"md5": "e8fd9858c423cf7c6cc588f5bb390bfe",
"sha256": "7e9f6b43fba9b57797bcbebe07af0d82b579906f941c3da17e2e755c5c1bc132"
},
"downloads": -1,
"filename": "jorvik-1.2.4-py3-none-any.whl",
"has_sig": false,
"md5_digest": "e8fd9858c423cf7c6cc588f5bb390bfe",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 35724,
"upload_time": "2025-08-26T10:25:15",
"upload_time_iso_8601": "2025-08-26T10:25:15.795876Z",
"url": "https://files.pythonhosted.org/packages/b0/38/d9730b7dc0cf1be3d85ee98c7af6cc0cca5c3778151f59506aaea7bfb9bf/jorvik-1.2.4-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-08-26 10:25:15",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "jorvik-io",
"github_project": "jorvik",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [
{
"name": "buildtools",
"specs": []
},
{
"name": "delta-spark",
"specs": []
},
{
"name": "pandas",
"specs": []
},
{
"name": "pyarrow",
"specs": []
},
{
"name": "pyspark",
"specs": []
}
],
"lcname": "jorvik"
}