data-factory-testing-framework


* Name: data-factory-testing-framework
* Version: 1.1.5
* Summary: A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric and Azure Data Factory.
* Upload time: 2024-10-11 13:16:10
* Requires Python: <3.13,>=3.9
* License: MIT License. Copyright (c) Microsoft Corporation. Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
* Keywords: fabric, datafactory, unit-testing, functional-testing, azure

# Data Factory - Testing Framework :hammer_and_wrench:

A stand-alone test framework that lets you write unit tests for Data Factory pipelines on [Microsoft Fabric](https://learn.microsoft.com/en-us/fabric/data-factory/), [Azure Data Factory](https://learn.microsoft.com/en-us/azure/data-factory/concepts-pipelines-activities?tabs=data-factory) and [Azure Synapse Analytics](https://learn.microsoft.com/en-us/azure/data-factory/concepts-pipelines-activities?tabs=data-factory).

> The framework is currently in _Public Preview_ and is not officially supported by Microsoft.

## Features :star:

The framework evaluates pipeline and activity definitions so that you can assert on the evaluated results. It does so by providing the following features:

1. Evaluate expressions by using the framework's internal expression parser. It supports all the functions and arguments that are available in the Data Factory expression language.
2. Test an activity with a specific state and assert the evaluated expressions.
3. Test a pipeline run by verifying the execution flow of activities for specific input parameters and assert the evaluated expressions of each activity.

> The framework does not support running the actual pipeline. It only gives you the ability to test the pipeline and activity definitions.
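
As a purely conceptual sketch of features 2 and 3 (not the framework's API), the toy model below evaluates a property of each activity against a given state and yields activities in dependency order, so a test can assert on both the order and the evaluated values. `ToyState`, `ToyActivity` and `evaluate_flow` are hypothetical names invented for this illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterator, List, Tuple


@dataclass
class ToyState:
    """Hypothetical stand-in for run state: parameters and variables."""
    parameters: Dict[str, str] = field(default_factory=dict)
    variables: Dict[str, str] = field(default_factory=dict)


@dataclass
class ToyActivity:
    """Hypothetical activity: a name, one property computed from state, and dependencies."""
    name: str
    url: Callable[[ToyState], str]
    depends_on: List[str] = field(default_factory=list)


def evaluate_flow(
    activities: List[ToyActivity], state: ToyState
) -> Iterator[Tuple[str, str]]:
    """Yield (name, evaluated url) for each activity once its dependencies have been yielded."""
    done = set()
    pending = list(activities)
    while pending:
        ready = [a for a in pending if all(d in done for d in a.depends_on)]
        if not ready:
            raise ValueError("circular or unsatisfied dependency")
        for activity in ready:
            yield activity.name, activity.url(state)
            done.add(activity.name)
            pending.remove(activity)


state = ToyState(
    parameters={"BaseUrl": "https://example.com/"},
    variables={"Path": "some-path"},
)
steps = list(
    evaluate_flow(
        [
            # Declared first, but runs second because it depends on "set_path".
            ToyActivity(
                "call_api",
                lambda s: s.parameters["BaseUrl"] + s.variables["Path"],
                depends_on=["set_path"],
            ),
            ToyActivity("set_path", lambda s: s.parameters["BaseUrl"]),
        ],
        state,
    )
)
assert steps == [
    ("set_path", "https://example.com/"),
    ("call_api", "https://example.com/some-path"),
]
```

Nothing is executed against a real service here: the sketch only evaluates definitions against supplied state, which is the same boundary the note above describes.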

### High-level example :bulb:

Given a `WebActivity` with a `typeProperties.url` property containing the following expression:

```datafactoryexpression
@concat(pipeline().globalParameters.BaseUrl, variables('Path'))
```

A simple test to validate that the concatenation is working as expected could look like this:

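```python
# Import path as published in the 1.x releases; adjust if your version differs.
from data_factory_testing_framework.state import (
    PipelineRunState,
    PipelineRunVariable,
    RunParameter,
    RunParameterType,
)

# Arrange
# `pipeline` is assumed to have been loaded through the framework beforehand.
activity = pipeline.get_activity_by_name("webactivity_name")
state = PipelineRunState(
    parameters=[
        # concat() adds no separator, so the base URL carries the trailing slash.
        RunParameter(RunParameterType.Global, "BaseUrl", "https://example.com/"),
    ],
    variables=[
        PipelineRunVariable("Path", "some-path"),
    ],
)

# Act
activity.evaluate(state)

# Assert
assert "https://example.com/some-path" == activity.type_properties["url"].result
```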
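
One detail worth keeping in mind when writing such assertions: `concat` joins its arguments without inserting a separator, so any slash between the base URL and the path has to come from one of the inputs. The snippet below is a plain-Python stand-in for that behaviour (it does not use the framework's expression parser) and doubles as a small table-driven check; the `concat` helper and the `cases` list are illustrative assumptions.

```python
def concat(*parts: str) -> str:
    """Plain-Python stand-in for the expression language's concat: no separator is added."""
    return "".join(parts)


# (BaseUrl, Path, expected result)
cases = [
    ("https://example.com/", "some-path", "https://example.com/some-path"),
    ("https://example.com", "/some-path", "https://example.com/some-path"),
    # Without a slash on either side, the two values are simply fused.
    ("https://example.com", "some-path", "https://example.comsome-path"),
]

for base_url, path, expected in cases:
    assert concat(base_url, path) == expected
```

The same table-of-cases pattern carries over to real activity tests: vary the state, evaluate, and compare against the expected string.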

## Why :question:

Data Factory supports neither unit testing nor local testing of pipelines. Integration and end-to-end tests that run on an actual Data Factory instance are valuable, but unit tests on top of them allow quicker iteration, validation and regression testing. Unit testing with the _Data Factory Testing Framework_ has the following benefits:

* Runs locally with immediate feedback
* Easier to cover a lot of different scenarios and edge cases
* Regression testing

## Getting started :rocket:

Before you start writing tests, you need to set up the repository and install the framework:

1. [Repository setup](docs/basic/repository_setup.md)
2. [Installing and initializing the framework](docs/basic/installing_and_initializing_framework.md)

> If you are new to Python and prefer a step-by-step guide, use the more detailed [getting started](docs/basic/getting_started.md) guide.

The framework allows you to write two types of tests:

* [Activity testing](docs/basic/activity_testing.md) - for testing activities in isolation
* [Pipeline testing](docs/basic/pipeline_testing.md) - for testing the entire pipeline

## Concepts :books:

The following pages go deeper into the framework's topics and concepts to help you get started.

### Basic :seedling:

1. [Repository setup](docs/basic/repository_setup.md)
2. [Installing and initializing the framework](docs/basic/installing_and_initializing_framework.md)
3. [State](docs/basic/state.md)
4. [Activity testing](docs/basic/activity_testing.md)
5. [Pipeline testing](docs/basic/pipeline_testing.md)

### Advanced :microscope:

1. [Debugging your activities and pipelines](docs/advanced/debugging.md)
2. [Development workflow](docs/advanced/development_workflow.md)
3. [Overriding expression functions](docs/advanced/overriding_expression_functions.md)
4. [Framework internals](docs/advanced/framework_internals.md)

## Examples :memo:

More advanced examples demonstrating the capabilities of the framework:

Fabric:

1. [Batch job example](examples/fabric/batch_job/README.md)

Azure Data Factory:

1. [Copy blobs example](examples/data_factory/copy_blobs/README.md)
2. [Batch job example](examples/data_factory/batch_job/README.md)

Azure Synapse Analytics:

1. [Copy blobs example](examples/synapse/copy_blobs/README.md)

## Contributing :handshake:

This project welcomes contributions and suggestions.  Most contributions require you to agree to a
Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us
the rights to use your contribution. For details, visit <https://cla.opensource.microsoft.com>.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide
a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions
provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the [Microsoft Open Source Code of Conduct](https://opensource.microsoft.com/codeofconduct/).
For more information see the [Code of Conduct FAQ](https://opensource.microsoft.com/codeofconduct/faq/) or
contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additional questions or comments.

## Trademarks :tm:

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft
trademarks or logos is subject to and must follow
[Microsoft's Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks/usage/general).
Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship.
Any use of third-party trademarks or logos is subject to those third parties' policies.

            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "data-factory-testing-framework",
    "maintainer": null,
    "docs_url": null,
    "requires_python": "<3.13,>=3.9",
    "maintainer_email": null,
    "keywords": "fabric, datafactory, unit-testing, functional-testing, azure",
    "author": null,
    "author_email": "Data Factory Testing Framework <dftf@microsoft.com>",
    "download_url": "https://files.pythonhosted.org/packages/e4/8c/43a061914b72ab0527e0beb512781c9174a7e3422828c691ae28d2dcf2cf/data_factory_testing_framework-1.1.5.tar.gz",
    "platform": null,
    "bugtrack_url": null,
    "license": "MIT License  Copyright (c) Microsoft Corporation.  Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the \"Software\"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:  The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.  THE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE ",
    "summary": "A stand-alone test framework that allows to write unit tests for Data Factory pipelines on Microsoft Fabric and Azure Data Factory.",
    "version": "1.1.5",
    "project_urls": {
        "Documentation": "https://github.com/microsoft/data-factory-testing-framework/blob/main/README.md",
        "Homepage": "https://github.com/microsoft/data-factory-testing-framework",
        "Issues": "https://github.com/microsoft/data-factory-testing-framework/issues",
        "Source": "https://github.com/microsoft/data-factory-testing-framework.git"
    },
    "split_keywords": [
        "fabric",
        " datafactory",
        " unit-testing",
        " functional-testing",
        " azure"
    ],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "13707dfb0d8babf3903b521208af6a340a469cb2da3115f143502ada2c47baa1",
                "md5": "2551aa79c3cb7c4e98d8ebc6f47c455a",
                "sha256": "7a438ae8cc314cb68030435e5049ed22ebc1caf85f8cdc63aca973a3b3e13804"
            },
            "downloads": -1,
            "filename": "data_factory_testing_framework-1.1.5-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "2551aa79c3cb7c4e98d8ebc6f47c455a",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "<3.13,>=3.9",
            "size": 15118514,
            "upload_time": "2024-10-11T13:16:08",
            "upload_time_iso_8601": "2024-10-11T13:16:08.434447Z",
            "url": "https://files.pythonhosted.org/packages/13/70/7dfb0d8babf3903b521208af6a340a469cb2da3115f143502ada2c47baa1/data_factory_testing_framework-1.1.5-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "e48c43a061914b72ab0527e0beb512781c9174a7e3422828c691ae28d2dcf2cf",
                "md5": "db724e43a3d639a8b34114a62902a68a",
                "sha256": "3f04858780faa37edb4a6655059f27b3e58ec64a74e7e39a900fb49248ecea7a"
            },
            "downloads": -1,
            "filename": "data_factory_testing_framework-1.1.5.tar.gz",
            "has_sig": false,
            "md5_digest": "db724e43a3d639a8b34114a62902a68a",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "<3.13,>=3.9",
            "size": 31531,
            "upload_time": "2024-10-11T13:16:10",
            "upload_time_iso_8601": "2024-10-11T13:16:10.942334Z",
            "url": "https://files.pythonhosted.org/packages/e4/8c/43a061914b72ab0527e0beb512781c9174a7e3422828c691ae28d2dcf2cf/data_factory_testing_framework-1.1.5.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2024-10-11 13:16:10",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "microsoft",
    "github_project": "data-factory-testing-framework",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "data-factory-testing-framework"
}
        