# Jorvik
[](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)
[](https://coveralls.io/github/jorvik-io/jorvik?branch=main)
Jorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.
## Contribute
The Jorvik project welcomes your expertise and enthusiasm!
Writing code isn’t the only way to contribute. You can also:
- review pull requests
- suggest improvements through issues
- let us know your painpoints and repeatitive tasks
- help us stay on top of new and old issues
- develop tutorials, videos, presentations, and other educational materials
See [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.
## Getting Started.
Jorvik is available in Pypi and can be installed with pip
```bash
pip install jorvik
```
Packages:
- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer
- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test etl pipelines with ease
- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage
Raw data
{
"_id": null,
"home_page": "https://github.com/jorvik-io/jorvik",
"name": "jorvik",
"maintainer": null,
"docs_url": null,
"requires_python": null,
"maintainer_email": null,
"keywords": "ETL, Pyspark, Data Engineering, Data Pipelines",
"author": "https://github.com/jorvik-io",
"author_email": null,
"download_url": null,
"platform": null,
"description": "# Jorvik\n[](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)\n[](https://coveralls.io/github/jorvik-io/jorvik?branch=main)\n\nJorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.\n\n## Contribute\nThe Jorvik project welcomes your expertise and enthusiasm!\n\nWriting code isn\u2019t the only way to contribute. You can also:\n\n- review pull requests\n- suggest improvements through issues\n- let us know your painpoints and repeatitive tasks\n- help us stay on top of new and old issues\n- develop tutorials, videos, presentations, and other educational materials\n\nSee [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.\n\n## Getting Started.\nJorvik is available in Pypi and can be installed with pip\n\n```bash\npip install jorvik\n```\n\nPackages:\n- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer\n- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test etl pipelines with ease\n- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage\n",
"bugtrack_url": null,
"license": "Apache License 2.0",
"summary": "A set of utilities for creating and managing ETL Pipelines with pyspark.",
"version": "1.2.0",
"project_urls": {
"Homepage": "https://github.com/jorvik-io/jorvik"
},
"split_keywords": [
"etl",
" pyspark",
" data engineering",
" data pipelines"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "e401a855c5e837e71435fa8310caa8dbfd74c4abe20c798f67a51ec542290539",
"md5": "241d52f0798bff80caf82e0a348fa8b0",
"sha256": "f507f91f870343ac66eece985be1c3c52b2aaa6b69b8425b873281b5ba8f4c2a"
},
"downloads": -1,
"filename": "jorvik-1.2.0-py3-none-any.whl",
"has_sig": false,
"md5_digest": "241d52f0798bff80caf82e0a348fa8b0",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": null,
"size": 28260,
"upload_time": "2025-08-01T14:54:23",
"upload_time_iso_8601": "2025-08-01T14:54:23.364439Z",
"url": "https://files.pythonhosted.org/packages/e4/01/a855c5e837e71435fa8310caa8dbfd74c4abe20c798f67a51ec542290539/jorvik-1.2.0-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-08-01 14:54:23",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "jorvik-io",
"github_project": "jorvik",
"travis_ci": false,
"coveralls": false,
"github_actions": true,
"requirements": [
{
"name": "buildtools",
"specs": []
},
{
"name": "delta-spark",
"specs": []
},
{
"name": "pandas",
"specs": []
},
{
"name": "pyarrow",
"specs": []
},
{
"name": "pyspark",
"specs": []
}
],
"lcname": "jorvik"
}