aws-ddk-core


Nameaws-ddk-core JSON
Version 1.4.0 PyPI version JSON
download
home_pagehttps://github.com/awslabs/aws-ddk/tree/main
SummaryThe AWS DataOps Development Kit is an open source development framework for customers that build data workflows and modern data architecture on AWS.
upload_time2023-12-28 23:28:03
maintainer
docs_urlNone
authorAWS Professional Services<aws-proserve-orion-dev@amazon.com>
requires_python~=3.8
licenseApache-2.0
keywords
VCS
bugtrack_url
requirements No requirements were recorded.
Travis-CI No Travis.
coveralls test coverage No coveralls.
            # AWS DataOps Development Kit (DDK)

![Actions Status](https://github.com/awslabs/aws-ddk/actions/workflows/build.yml/badge.svg)
[![Downloads](https://static.pepy.tech/personalized-badge/aws-ddk-core?period=total&units=international_system&left_color=black&right_color=orange&left_text=pypi%20downloads)](https://pepy.tech/project/aws-ddk-core)

##### Packages 🗳️

* [NPM](https://www.npmjs.com/package/aws-ddk-core/)
* [Pypi](https://pypi.org/project/aws-ddk-core/)

---


The AWS DataOps Development Kit is an open source development framework for customers that build data workflows and modern data architecture on AWS.

Based on the [AWS CDK](https://github.com/aws/aws-cdk), it offers high-level abstractions allowing you to build pipelines that manage data flows on AWS, driven by DevOps best practices.  The framework is extensible, you can add abstractions for your own data processing infrastructure or replace our best practices with your own standards. It's easy to share templates, so everyone in your organisation can concentrate on the business logic of dealing with their data, rather than boilerplate logic.

---


The **DDK Core** is a library of CDK constructs that you can use to build data workflows and modern data architecture on AWS, following our best practice. The DDK Core is modular and extensible, if our best practice doesn't work for you, then you can update and share your own version with the rest of your organisation by leveraging a private **AWS Code Artifact** repository.

You can compose constructs from the DDK Core into a **DDK App**.  Your DDK App can also add contain constructs from the CDK Framework or the [AWS Construct Library](https://docs.aws.amazon.com/cdk/api/latest/docs/aws-construct-library.html).

## Overview

For a detailed walk-through, check out our [Workshop](https://catalog.us-east-1.prod.workshops.aws/workshops/3644b48b-1d7c-43ef-a353-6edcd96385af/en-US) or
take a look at [examples](https://github.com/aws-samples/aws-ddk-examples).

### Build Data Pipelines

One of the core features of DDK is ability to create Data Pipelines. A DDK [DataPipeline](https://awslabs.github.io/aws-ddk/release/stable/api/core/stubs/aws_ddk_core.pipelines.DataPipeline.html)
is a chained series of stages. It automatically “wires” the stages together using
[AWS EventBridge Rules](https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-rules.html) .

DDK comes with a library of stages, however users can also create their own based on their use cases,
and are encouraged to share them with the community.

Let's take a look at an example below:

```python
...

firehose_s3_stage = FirehoseToS3Stage(
    self,
    "ddk-firehose-s3",
    bucket=ddk_bucket,
    data_output_prefix="raw/",
)
sqs_lambda_stage = SqsToLambdaStage(
    scope=self,
    id="ddk-sqs-lambda",
    code=Code.from_asset("./lambda"),
    handler="index.lambda_handler",
    layers=[
        LayerVersion.from_layer_version_arn(
            self,
            "ddk-lambda-layer-wrangler",
            f"arn:aws:lambda:{self.region}:336392948345:layer:AWSSDKPandas-Python39:1",
        )
    ]
)

(
    DataPipeline(scope=self, id="ddk-pipeline")
    .add_stage(firehose_s3_stage)
    .add_stage(sqs_lambda_stage)
)
...
```

First, we import the required resources from the aws_ddk_core library, including the two stage constructs:
[FirehoseToS3Stage()](https://constructs.dev/packages/aws-ddk-core/v/1.0.1/api/FirehoseToS3Stage) and
[SqsToLambdaStage()](https://constructs.dev/packages/aws-ddk-core/v/1.0.1/api/SqsToLambdaStage).
These two classes are then instantiated and the delivery stream is configured with the S3 prefix (raw/).
Finally, the DDK DataPipeline construct is used to chain these two stages together into a data pipeline.

Complete source code of the data pipeline above can be found in
[AWS DDK Examples - Basic Data Pipeline](https://github.com/aws-samples/aws-ddk-examples/tree/main/basic-data-pipeline)

### Official Resources

* [Workshop](https://catalog.us-east-1.prod.workshops.aws/workshops/3644b48b-1d7c-43ef-a353-6edcd96385af/en-US)
* [Documentation](https://awslabs.github.io/aws-ddk/)
* [API Reference](https://awslabs.github.io/aws-ddk/release/stable/api/index)
* [Examples](https://github.com/aws-samples/aws-ddk-examples/)

## Getting Help

The best way to interact with our team is through GitHub.  You can open an issue and choose from one of our templates for bug reports, feature requests, or documentation issues.  If you have a feature request, don't forget you can search existing issues and upvote or comment on existing issues before creating a new one.

## Contributing

We welcome community contributions and pull requests.  Please see [CONTRIBUTING.md](./CONTRIBUTING.md) for details on how to set up a development
environment and submit code.

## Other Ways to Support

One way you can support our project is by letting others know that your organisation uses the DDK.  If you would like us to include your company's name and/or logo in this README file, please raise a 'Support the DDK' issue.  Note that by raising a this issue (and related pull request), you are granting AWS permission to use your company’s name (and logo) for the limited purpose described here and you are confirming that you have authority to grant such permission.

## License

This project is licensed under the Apache-2.0 License.

            

Raw data

            {
    "_id": null,
    "home_page": "https://github.com/awslabs/aws-ddk/tree/main",
    "name": "aws-ddk-core",
    "maintainer": "",
    "docs_url": null,
    "requires_python": "~=3.8",
    "maintainer_email": "",
    "keywords": "",
    "author": "AWS Professional Services<aws-proserve-orion-dev@amazon.com>",
    "author_email": "",
    "download_url": "https://files.pythonhosted.org/packages/ea/ae/d01206f1fb62461570a030d68db3dfa1064c1bfe8639189c9afc542a206c/aws-ddk-core-1.4.0.tar.gz",
    "platform": null,
    "description": "# AWS DataOps Development Kit (DDK)\n\n![Actions Status](https://github.com/awslabs/aws-ddk/actions/workflows/build.yml/badge.svg)\n[![Downloads](https://static.pepy.tech/personalized-badge/aws-ddk-core?period=total&units=international_system&left_color=black&right_color=orange&left_text=pypi%20downloads)](https://pepy.tech/project/aws-ddk-core)\n\n##### Packages \ud83d\uddf3\ufe0f\n\n* [NPM](https://www.npmjs.com/package/aws-ddk-core/)\n* [Pypi](https://pypi.org/project/aws-ddk-core/)\n\n---\n\n\nThe AWS DataOps Development Kit is an open source development framework for customers that build data workflows and modern data architecture on AWS.\n\nBased on the [AWS CDK](https://github.com/aws/aws-cdk), it offers high-level abstractions allowing you to build pipelines that manage data flows on AWS, driven by DevOps best practices.  The framework is extensible, you can add abstractions for your own data processing infrastructure or replace our best practices with your own standards. It's easy to share templates, so everyone in your organisation can concentrate on the business logic of dealing with their data, rather than boilerplate logic.\n\n---\n\n\nThe **DDK Core** is a library of CDK constructs that you can use to build data workflows and modern data architecture on AWS, following our best practice. The DDK Core is modular and extensible, if our best practice doesn't work for you, then you can update and share your own version with the rest of your organisation by leveraging a private **AWS Code Artifact** repository.\n\nYou can compose constructs from the DDK Core into a **DDK App**.  Your DDK App can also add contain constructs from the CDK Framework or the [AWS Construct Library](https://docs.aws.amazon.com/cdk/api/latest/docs/aws-construct-library.html).\n\n## Overview\n\nFor a detailed walk-through, check out our [Workshop](https://catalog.us-east-1.prod.workshops.aws/workshops/3644b48b-1d7c-43ef-a353-6edcd96385af/en-US) or\ntake a look at [examples](https://github.com/aws-samples/aws-ddk-examples).\n\n### Build Data Pipelines\n\nOne of the core features of DDK is ability to create Data Pipelines. A DDK [DataPipeline](https://awslabs.github.io/aws-ddk/release/stable/api/core/stubs/aws_ddk_core.pipelines.DataPipeline.html)\nis a chained series of stages. It automatically \u201cwires\u201d the stages together using\n[AWS EventBridge Rules](https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-rules.html) .\n\nDDK comes with a library of stages, however users can also create their own based on their use cases,\nand are encouraged to share them with the community.\n\nLet's take a look at an example below:\n\n```python\n...\n\nfirehose_s3_stage = FirehoseToS3Stage(\n    self,\n    \"ddk-firehose-s3\",\n    bucket=ddk_bucket,\n    data_output_prefix=\"raw/\",\n)\nsqs_lambda_stage = SqsToLambdaStage(\n    scope=self,\n    id=\"ddk-sqs-lambda\",\n    code=Code.from_asset(\"./lambda\"),\n    handler=\"index.lambda_handler\",\n    layers=[\n        LayerVersion.from_layer_version_arn(\n            self,\n            \"ddk-lambda-layer-wrangler\",\n            f\"arn:aws:lambda:{self.region}:336392948345:layer:AWSSDKPandas-Python39:1\",\n        )\n    ]\n)\n\n(\n    DataPipeline(scope=self, id=\"ddk-pipeline\")\n    .add_stage(firehose_s3_stage)\n    .add_stage(sqs_lambda_stage)\n)\n...\n```\n\nFirst, we import the required resources from the aws_ddk_core library, including the two stage constructs:\n[FirehoseToS3Stage()](https://constructs.dev/packages/aws-ddk-core/v/1.0.1/api/FirehoseToS3Stage) and\n[SqsToLambdaStage()](https://constructs.dev/packages/aws-ddk-core/v/1.0.1/api/SqsToLambdaStage).\nThese two classes are then instantiated and the delivery stream is configured with the S3 prefix (raw/).\nFinally, the DDK DataPipeline construct is used to chain these two stages together into a data pipeline.\n\nComplete source code of the data pipeline above can be found in\n[AWS DDK Examples - Basic Data Pipeline](https://github.com/aws-samples/aws-ddk-examples/tree/main/basic-data-pipeline)\n\n### Official Resources\n\n* [Workshop](https://catalog.us-east-1.prod.workshops.aws/workshops/3644b48b-1d7c-43ef-a353-6edcd96385af/en-US)\n* [Documentation](https://awslabs.github.io/aws-ddk/)\n* [API Reference](https://awslabs.github.io/aws-ddk/release/stable/api/index)\n* [Examples](https://github.com/aws-samples/aws-ddk-examples/)\n\n## Getting Help\n\nThe best way to interact with our team is through GitHub.  You can open an issue and choose from one of our templates for bug reports, feature requests, or documentation issues.  If you have a feature request, don't forget you can search existing issues and upvote or comment on existing issues before creating a new one.\n\n## Contributing\n\nWe welcome community contributions and pull requests.  Please see [CONTRIBUTING.md](./CONTRIBUTING.md) for details on how to set up a development\nenvironment and submit code.\n\n## Other Ways to Support\n\nOne way you can support our project is by letting others know that your organisation uses the DDK.  If you would like us to include your company's name and/or logo in this README file, please raise a 'Support the DDK' issue.  Note that by raising a this issue (and related pull request), you are granting AWS permission to use your company\u2019s name (and logo) for the limited purpose described here and you are confirming that you have authority to grant such permission.\n\n## License\n\nThis project is licensed under the Apache-2.0 License.\n",
    "bugtrack_url": null,
    "license": "Apache-2.0",
    "summary": "The AWS DataOps Development Kit is an open source development framework for customers that build data workflows and modern data architecture on AWS.",
    "version": "1.4.0",
    "project_urls": {
        "Homepage": "https://github.com/awslabs/aws-ddk/tree/main",
        "Source": "https://github.com/awslabs/aws-ddk/tree/main"
    },
    "split_keywords": [],
    "urls": [
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "94f901f93c0c6ca80b5dbebf06f339a68ba3adf32e441c2f9ad790c7feb387ff",
                "md5": "50eb3925ca8f45fa78ed215c14165353",
                "sha256": "bccd64dbf4aa91653aae07c4caf6219d79b2e1e3a03b74464c6c310ac307a484"
            },
            "downloads": -1,
            "filename": "aws_ddk_core-1.4.0-py3-none-any.whl",
            "has_sig": false,
            "md5_digest": "50eb3925ca8f45fa78ed215c14165353",
            "packagetype": "bdist_wheel",
            "python_version": "py3",
            "requires_python": "~=3.8",
            "size": 5153106,
            "upload_time": "2023-12-28T23:27:57",
            "upload_time_iso_8601": "2023-12-28T23:27:57.191213Z",
            "url": "https://files.pythonhosted.org/packages/94/f9/01f93c0c6ca80b5dbebf06f339a68ba3adf32e441c2f9ad790c7feb387ff/aws_ddk_core-1.4.0-py3-none-any.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": "",
            "digests": {
                "blake2b_256": "eaaed01206f1fb62461570a030d68db3dfa1064c1bfe8639189c9afc542a206c",
                "md5": "869bf60ebbf42d735b8cef7c03722a36",
                "sha256": "e91b0fe514cc1c9b9dee26fee2d2fbb3f5ef6616dc6ee8abd2188502c87fec61"
            },
            "downloads": -1,
            "filename": "aws-ddk-core-1.4.0.tar.gz",
            "has_sig": false,
            "md5_digest": "869bf60ebbf42d735b8cef7c03722a36",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": "~=3.8",
            "size": 5154498,
            "upload_time": "2023-12-28T23:28:03",
            "upload_time_iso_8601": "2023-12-28T23:28:03.753909Z",
            "url": "https://files.pythonhosted.org/packages/ea/ae/d01206f1fb62461570a030d68db3dfa1064c1bfe8639189c9afc542a206c/aws-ddk-core-1.4.0.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2023-12-28 23:28:03",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "awslabs",
    "github_project": "aws-ddk",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "lcname": "aws-ddk-core"
}
        
Elapsed time: 0.16282s