jorvik


| Field | Value |
| --- | --- |
| Name | jorvik |
| Version | 1.2.0 |
| home_page | https://github.com/jorvik-io/jorvik |
| Summary | A set of utilities for creating and managing ETL Pipelines with pyspark. |
| upload_time | 2025-08-01 14:54:23 |
| maintainer | None |
| docs_url | None |
| author | https://github.com/jorvik-io |
| requires_python | None |
| license | Apache License 2.0 |
| keywords | etl, pyspark, data engineering, data pipelines |
| VCS | GitHub |
| bugtrack_url | None |
| requirements | buildtools, delta-spark, pandas, pyarrow, pyspark |
| Travis-CI | No Travis. |
| coveralls test coverage | No coveralls. |

# Jorvik
[![Build](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml/badge.svg)](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)
[![Coverage Status](https://coveralls.io/repos/github/jorvik-io/jorvik/badge.svg?branch=main)](https://coveralls.io/github/jorvik-io/jorvik?branch=main)

Jorvik is a collection of utilities for creating and managing ETL pipelines in PySpark. Built by Data Engineers for Data Engineers.

## Contribute
The Jorvik project welcomes your expertise and enthusiasm!

Writing code isn’t the only way to contribute. You can also:

- review pull requests
- suggest improvements through issues
- let us know your pain points and repetitive tasks
- help us stay on top of new and old issues
- develop tutorials, videos, presentations, and other educational materials

See [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.

## Getting Started
Jorvik is available on PyPI and can be installed with pip:

```bash
pip install jorvik
```
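
To confirm that the install succeeded, one option is to query the installed version through Python's standard `importlib.metadata` module. This is a minimal sketch that does not rely on any Jorvik API; the helper name `installed_version` is hypothetical and chosen only for illustration.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed.

    `installed_version` is an illustrative helper, not part of Jorvik.
    """
    try:
        return version(package)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("jorvik")
    if found is None:
        print("jorvik is not installed in this environment")
    else:
        print(f"jorvik {found} is installed")
```

The check reads package metadata only, so it does not import Jorvik or start a Spark session.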

Packages:
- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer
- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test ETL pipelines with ease
- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/jorvik-io/jorvik",
    "name": "jorvik",
    "maintainer": null,
    "docs_url": null,
    "requires_python": null,
    "maintainer_email": null,
    "keywords": "ETL, Pyspark, Data Engineering, Data Pipelines",
    "author": "https://github.com/jorvik-io",
    "author_email": null,
    "download_url": null,
    "platform": null,
    "description": "# Jorvik\n[![Build](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml/badge.svg)](https://github.com/GiorgosPa/jorvik/actions/workflows/build.yaml)\n[![Coverage Status](https://coveralls.io/repos/github/jorvik-io/jorvik/badge.svg?branch=main)](https://coveralls.io/github/jorvik-io/jorvik?branch=main)\n\nJorvik is a collection of utilities for creating and managing ETL pipeline in Pyspark. Build from Data Engineers for Data Engineers.\n\n## Contribute\nThe Jorvik project welcomes your expertise and enthusiasm!\n\nWriting code isn\u2019t the only way to contribute. You can also:\n\n- review pull requests\n- suggest improvements through issues\n- let us know your painpoints and repeatitive tasks\n- help us stay on top of new and old issues\n- develop tutorials, videos, presentations, and other educational materials\n\nSee [How to Contribute](https://github.com/jorvik-io/jorvik/blob/main/CONTRIBUTING.md) for instructions on setting up your local machine and opening your first Pull Request.\n\n## Getting Started.\nJorvik is available in Pypi and can be installed with pip\n\n```bash\npip install jorvik\n```\n\nPackages:\n- [Storage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/storage/README.md): Interact with the storage layer\n- [Pipelines](https://github.com/jorvik-io/jorvik/blob/main/jorvik/pipelines/README.md): Build and test etl pipelines with ease\n- [Data Lineage](https://github.com/jorvik-io/jorvik/blob/main/jorvik/data_lineage/README.md): Track data lineage\n",
    "bugtrack_url": null,
    "license": "Apache License 2.0",
    "summary": "A set of utilities for creating and managing ETL Pipelines with pyspark.",
    "version": "1.2.0",
    "project_urls": {
        "Homepage": "https://github.com/jorvik-io/jorvik"
    },
    "split_keywords": [
        "etl",
        " pyspark",
        " data engineering",
        " data pipelines"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "e401a855c5e837e71435fa8310caa8dbfd74c4abe20c798f67a51ec542290539",
                "md5": "241d52f0798bff80caf82e0a348fa8b0",
                "sha256": "f507f91f870343ac66eece985be1c3c52b2aaa6b69b8425b873281b5ba8f4c2a"
            },
            "downloads": -1,
            "filename": "jorvik-1.2.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "241d52f0798bff80caf82e0a348fa8b0",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": null,
            "size": 28260,
            "upload_time": "2025-08-01T14:54:23",
            "upload_time_iso_8601": "2025-08-01T14:54:23.364439Z",
            "url": "https://files.pythonhosted.org/packages/e4/01/a855c5e837e71435fa8310caa8dbfd74c4abe20c798f67a51ec542290539/jorvik-1.2.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-08-01 14:54:23",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "jorvik-io",
    "github_project": "jorvik",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "buildtools",
            "specs": []
        },
        {
            "name": "delta-spark",
            "specs": []
        },
        {
            "name": "pandas",
            "specs": []
        },
        {
            "name": "pyarrow",
            "specs": []
        },
        {
            "name": "pyspark",
            "specs": []
        }
    ],
    "lcname": "jorvik"
}
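
The `sha256` digest in the record above can be used to check a manually downloaded wheel. The following is a sketch using only the standard library; the helper name `sha256_matches` and the assumption that the wheel sits in the current directory are illustrative, not part of Jorvik or PyPI tooling.

```python
import hashlib
from pathlib import Path

# Digest and filename copied from the "urls" entry of the raw data above.
EXPECTED_SHA256 = "f507f91f870343ac66eece985be1c3c52b2aaa6b69b8425b873281b5ba8f4c2a"
WHEEL_NAME = "jorvik-1.2.0-py3-none-any.whl"


def sha256_matches(path, expected_hex, chunk_size=65536):
    """Return True if the SHA-256 of the file at `path` equals `expected_hex`.

    Hypothetical helper for illustration; reads the file in chunks so large
    files do not need to fit in memory.
    """
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex.lower()


if __name__ == "__main__":
    wheel = Path(WHEEL_NAME)  # assumed download location
    if wheel.exists():
        print("digest OK" if sha256_matches(wheel, EXPECTED_SHA256) else "digest MISMATCH")
    else:
        print(f"{WHEEL_NAME} not found in the current directory")
```

When installing through pip, a requirements file with `--hash=sha256:...` entries and `pip install --require-hashes` performs an equivalent check automatically.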
        