# FlowTask DataIntegration #
FlowTask DataIntegration is a plugin-based, component-driven task execution framework for creating complex Tasks.
FlowTask runs Tasks defined in JSON, YAML or TOML files. Every Task is a combination of Components,
and the Components in a Task run sequentially or depend on one another, like a DAG.
You can create a Task by combining Commands, Shell scripts and other specific Components (such as TableInput, which opens a table using a datasource, or DownloadFromIMAP, which downloads a file from an IMAP folder). Any Python callable can be a Component inside a Task, and you can extend UserComponent to build your own components.
Every Task can be run from the CLI, programmatically, via a RESTful API (using our aiohttp-based Handler), called by WebHooks, or even dispatched to an external Worker using our built-in Scheduler.
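To make the "sequential or dependent, like a DAG" idea concrete, here is a minimal sketch in plain Python. It is not FlowTask's implementation or its task schema: the `task_definition` dictionary, the `depends` key and the helper names are assumptions made up for this illustration. It only shows how plain Python callables acting as components can be ordered so that each one runs after the steps it depends on, using the standard-library `graphlib` module.

```python
from graphlib import TopologicalSorter

# Hypothetical task definition (NOT FlowTask's real schema): each step may
# list the steps it depends on.
task_definition = {
    "download": {"depends": []},
    "load_table": {"depends": []},
    "merge": {"depends": ["download", "load_table"]},
    "export": {"depends": ["merge"]},
}


def execution_order(definition):
    """Return step names in an order that respects every 'depends' entry."""
    graph = {name: set(step.get("depends", [])) for name, step in definition.items()}
    return list(TopologicalSorter(graph).static_order())


def run_task(definition, components):
    """Call each component after its dependencies, passing their results in."""
    results = {}
    for name in execution_order(definition):
        inputs = [results[dep] for dep in definition[name].get("depends", [])]
        results[name] = components[name](*inputs)
    return results


# Any Python callable can play the role of a component in this sketch.
components = {
    "download": lambda: [1, 2, 3],
    "load_table": lambda: [10, 20, 30],
    "merge": lambda a, b: [x + y for x, y in zip(a, b)],
    "export": lambda rows: sum(rows),
}

results = run_task(task_definition, components)
print(results["export"])  # 66
```

A definition with a circular dependency would make `TopologicalSorter` raise `graphlib.CycleError`, which is a convenient place to reject an invalid task before running anything.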
## Quickstart ##
```console
pip install flowtask
```
Tasks can be organized into a directory structure like this:
tasks /
    ├── programs /
        ├── test /
            ├── tasks /
The main reason for this structure is to keep tasks organized by tenant/program, instead of filling a single directory with many task files.
FlowTask supports "TaskStorage". A Task Storage is the main repository for tasks. The main Task Storage is a directory at any filesystem path (optionally, you can synchronize that path using git), but Tasks can also be saved in a database or an S3 bucket.
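As an illustration of a filesystem-backed task storage, the sketch below looks up a task file by program and task name. It assumes the layout nests as `tasks/programs/<program>/tasks/<task>.<ext>` and that files use the JSON, YAML or TOML extensions. The function name `find_task_file` and the extension list are hypothetical and are not part of FlowTask's API.

```python
import tempfile
from pathlib import Path
from typing import Optional

# Assumed extensions, based on the formats named in this README.
TASK_EXTENSIONS = (".json", ".yaml", ".yml", ".toml")


def find_task_file(storage_root: Path, program: str, task: str) -> Optional[Path]:
    """Look for <root>/programs/<program>/tasks/<task>.<ext> (assumed layout)."""
    task_dir = storage_root / "programs" / program / "tasks"
    for ext in TASK_EXTENSIONS:
        candidate = task_dir / f"{task}{ext}"
        if candidate.is_file():
            return candidate
    return None


# Build a throwaway storage directory and resolve a task inside it.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp) / "tasks"
    task_dir = root / "programs" / "test" / "tasks"
    task_dir.mkdir(parents=True)
    (task_dir / "example.yaml").write_text("name: example\n")

    found = find_task_file(root, "test", "example")
    print(found.name)  # example.yaml
```

Keeping the lookup behind one function is what would let a database or S3 backend replace the directory without changing the callers.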
## Dependencies ##
* aiohttp (Asyncio Web Framework and Server) (required by navigator)
* AsyncDB
* QuerySource
* Navigator-api
* (Optional) Qworker (for distributing asyncio Tasks on distributed workers).
## Features ##
* Component-based Task execution framework with components covering many actions (download files, create pandas dataframes from files, map dataframe columns to a JSON dictionary, etc.)
* Built-in API for execution of Tasks.
### How do I run a Task? ###
You can run a Task from the CLI:
```console
task --program=test --task=example
```
On the CLI, you can pass an ENV (environment) variable to change the environment file used during task execution.
```console
ENV=dev task --program=test --task=example
```
or programmatically:
```python
from flowtask import Task
import asyncio
task = Task(program='test', task='example')
results = asyncio.run(task.run())
# alternatively, call the task object itself:
results = asyncio.run(task())
```
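The two forms above (`task.run()` and `task()`) suggest that the task object is callable and that calling it yields something awaitable. The sketch below shows one common way to get that behaviour in Python. `DemoTask` is a stand-in written for this illustration; it is not `flowtask.Task` and makes no claim about how FlowTask implements it.

```python
import asyncio


class DemoTask:
    """Stand-in for illustration only; not flowtask.Task."""

    def __init__(self, program: str, task: str):
        self.program = program
        self.task = task

    async def run(self) -> str:
        await asyncio.sleep(0)  # placeholder for real asynchronous work
        return f"{self.program}.{self.task}: done"

    def __call__(self):
        # Return the coroutine created by run(), so the object can be
        # called and the result awaited or handed to asyncio.run().
        return self.run()


task = DemoTask(program="test", task="example")

# Both forms execute the same coroutine and give the same result.
print(asyncio.run(task.run()))  # test.example: done
print(asyncio.run(task()))      # test.example: done
```

Inside code that is already running in an event loop, the same object would be used with `await task.run()` or `await task()` instead of `asyncio.run()`.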
### Requirements ###
* Python >= 3.9
* asyncio (part of the Python standard library since 3.4; no separate install is needed)
* aiohttp >= 3.6.2
### Contribution guidelines ###
Please have a look at the Contribution Guide
* Writing tests
* Code review
* Other guidelines
### Who do I talk to? ###
* Repo owner or admin
* Other community or team contact
### License ###
FlowTask is licensed under the Apache 2.0 License. See the LICENSE file for more details.
Raw data
{
"_id": null,
"home_page": "https://github.com/phenobarbital/flowtask",
"name": "flowtask",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.9.16",
"maintainer_email": null,
"keywords": "DataIntegration, Task, Orchestation, Task-Runner, Pipelines, Data-Pipelines",
"author": "Jesus Lara",
"author_email": "\"Jesus Lara G.\" <jesuslarag@gmail.com>",
"download_url": null,
"platform": "*nix",
"description": "# FlowTask DataIntegration #\n\nFlowTask DataIntegration is a plugin-based, component-driven task execution framework for create complex Tasks.\n\nFlowTask runs Tasks defined in JSON, YAML or TOML files, any Task is a combination of Components,\nand every component in the Task run sequentially or depend of others, like a DAG.\n\nCan create a Task combining Commands, Shell scripts and other specific Components (as TableInput: Open a Table using a datasource, DownloadFromIMAP: Download a File from a IMAP Folder, and so on), any Python Callable can be a Component inside a Task, or can extends UserComponent to build your own componets.\n\nEvery designed Task can run from CLI, programmatically, via RESTful API (using our aioHTTP-based Handler), called by WebHooks or even dispatched to a external Worker using our built-in Scheduler.\n\n## Quickstart ##\n\n```console\npip install flowtask\n```\n\nTasks can organizated into directory structure like this:\n\ntasks /\n \u251c\u2500\u2500 programs /\n \u251c\u2500\u2500 test /\n \u251c\u2500\u2500 tasks /\n\nThe main reason of this structure, is maintain organized several tasks by tenant/program, avoiding filling a directory with several task files.\n\nFlowTask support \"TaskStorage\", a Task Storage is the main repository for tasks, main Task Storage is a directory in any filesystem path (optionally you can syncronize that path using git), but Tasks can be saved onto a Database or a S3 bucket.\n\n## Dependencies ##\n\n * aiohttp (Asyncio Web Framework and Server) (required by navigator)\n * AsyncDB\n * QuerySource\n * Navigator-api\n * (Optional) Qworker (for distributing asyncio Tasks on distributed workers).\n\n## Features ##\n\n* Component-based Task execution framework with several components covering several actions (download files, create pandas dataframes from files, mapping dataframe columns to a json-dictionary, etc)\n* Built-in API for execution of Tasks.\n\n### How I run a Task? 
###\n\nCan run a Task from CLI:\n```console\ntask --program=test --task=example\n```\n\non CLI, you can pass an ENV (enviroment) to change the environment file on task execution.\n```console\nENV=dev task --program=test --task=example\n```\n\nor Programmatically:\n```python\nfrom flowtask import Task\nimport asyncio\n\ntask = Task(program='test', task='example')\nresults = asyncio.run(task.run())\n# we can alternatively, using the execution mode of task object:\nresults = asyncio.run(task())\n```\n\n### Requirements ###\n\n* Python >= 3.9\n* asyncio (https://pypi.python.org/pypi/asyncio/)\n* aiohttp >= 3.6.2\n\n### Contribution guidelines ###\n\nPlease have a look at the Contribution Guide\n\n* Writing tests\n* Code review\n* Other guidelines\n\n### Who do I talk to? ###\n\n* Repo owner or admin\n* Other community or team contact\n\n### License ###\n\nNavigator is licensed under Apache 2.0 License. See the LICENSE file for more details.\n",
"bugtrack_url": null,
"license": "Apache-2.0",
"summary": "Framework for running Tasks and from CLI and API for orchestation. Component-based Task builder/Runner for non-programmers.",
"version": "5.6.17",
"project_urls": {
"Funding": "https://paypal.me/phenobarbital",
"Homepage": "https://github.com/phenobarbital/flowtask",
"Say Thanks!": "https://saythanks.io/to/phenobarbital",
"Source": "https://github.com/phenobarbital/flowtask"
},
"split_keywords": [
"dataintegration",
" task",
" orchestation",
" task-runner",
" pipelines",
" data-pipelines"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "8ed560466e9e7863d03d2d752643fc1e6f8b417433dd66a3a871d923b3cbdd9f",
"md5": "8ced6a7d23725108173d7d939c728545",
"sha256": "f68a9901219c931fd2c92158402b42d1364981eb27ae04f10d4536e9464f705f"
},
"downloads": -1,
"filename": "flowtask-5.6.17-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "8ced6a7d23725108173d7d939c728545",
"packagetype": "bdist_wheel",
"python_version": "cp310",
"requires_python": ">=3.9.16",
"size": 3561837,
"upload_time": "2025-02-19T01:41:45",
"upload_time_iso_8601": "2025-02-19T01:41:45.817629Z",
"url": "https://files.pythonhosted.org/packages/8e/d5/60466e9e7863d03d2d752643fc1e6f8b417433dd66a3a871d923b3cbdd9f/flowtask-5.6.17-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "0ed958eb30ae7d0c9eba513d785516477aedcfb350fe99531771ed724bdb0168",
"md5": "61b25ec62340eb388ae0d6576a32164d",
"sha256": "f2af10df1c47bba417229ffd2951143663af51bef331e718ae790ac065fdfdaa"
},
"downloads": -1,
"filename": "flowtask-5.6.17-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "61b25ec62340eb388ae0d6576a32164d",
"packagetype": "bdist_wheel",
"python_version": "cp311",
"requires_python": ">=3.9.16",
"size": 3744087,
"upload_time": "2025-02-19T01:41:48",
"upload_time_iso_8601": "2025-02-19T01:41:48.848485Z",
"url": "https://files.pythonhosted.org/packages/0e/d9/58eb30ae7d0c9eba513d785516477aedcfb350fe99531771ed724bdb0168/flowtask-5.6.17-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "11d366bc88e772d76ab45d8cb82759e282e0b4c925033d4ff674aa94c3bd83da",
"md5": "3558483db12d122aeba03290e30f5318",
"sha256": "98ab77a4bbaa869433268212bc492a3e491df51dabbf4985eaf1f2fc1471262a"
},
"downloads": -1,
"filename": "flowtask-5.6.17-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "3558483db12d122aeba03290e30f5318",
"packagetype": "bdist_wheel",
"python_version": "cp312",
"requires_python": ">=3.9.16",
"size": 3907202,
"upload_time": "2025-02-19T01:41:50",
"upload_time_iso_8601": "2025-02-19T01:41:50.443774Z",
"url": "https://files.pythonhosted.org/packages/11/d3/66bc88e772d76ab45d8cb82759e282e0b4c925033d4ff674aa94c3bd83da/flowtask-5.6.17-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "ed76fd82d43190932e5eedc8875e9279b57841db9e80a9b1dcd647b9bde69d21",
"md5": "fce9ba8a119f3720c0018b33e74fff27",
"sha256": "32adf58e7adddcc44e43004656970625a7dd72189786139d33e311294d1be8d9"
},
"downloads": -1,
"filename": "flowtask-5.6.17-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"has_sig": false,
"md5_digest": "fce9ba8a119f3720c0018b33e74fff27",
"packagetype": "bdist_wheel",
"python_version": "cp39",
"requires_python": ">=3.9.16",
"size": 3578834,
"upload_time": "2025-02-19T01:41:52",
"upload_time_iso_8601": "2025-02-19T01:41:52.149047Z",
"url": "https://files.pythonhosted.org/packages/ed/76/fd82d43190932e5eedc8875e9279b57841db9e80a9b1dcd647b9bde69d21/flowtask-5.6.17-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-02-19 01:41:45",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "phenobarbital",
"github_project": "flowtask",
"github_not_found": true,
"lcname": "flowtask"
}