# DATAU - Batch Statistical Data Utilities
Cross-platform batch runner for statistical and numerical files.
## Installation
```bash
pip install pydatau
```
## Quick example
python:
```python
from datau import autorun
autorun(path_data="my_project/data", pattern='master', log_limit=1)
```
bash:
```bash
datau "my_project/data" -p "master" -l 1
```
## Supported Extensions and Execution Method
| Extension | Language/Tool | Method |
|-----------|------------------|-----------------------------|
| `.ipynb` | Jupyter Notebook | `papermill` |
| `.R` | R | `rpy2.robjects` |
| `.do` | Stata | Stata batch mode |
| `.jl` | Julia | `julia` (must be in PATH) |
| `.gms` | GAMS | `gams` (must be in PATH) |
| `.run` | AMPL | `ampl` (must be in PATH) |
| `.m` | MATLAB/Octave | `matlab.engine` or `Oct2Py` |
## User Reference
```python
autorun(path_data='...', pattern='...', *args, **kwargs)
```
Automatically runs matching statistical scripts in the given directory. Generates a number of `.log` files with outputs for each executed file.
**Parameters:**
`path_data` : *str*, default = *current working directory*
Path to the directory with input files.
`pattern` : *str*, optional
Regex pattern to match filenames (not paths), such as *'master'*. Case-insensitive.
`date_fmt` : *str*, default = *'%Y%m%d_%H%M%S'*
Datetime format for *.log* filenames.
`log_limit` : *int*, optional
If set, limits the number of log files per script. Older logs beyond this limit will be deleted. Please note that you can turn off logging by setting `log_limit` to *0*.
`use_powershell` : *bool*, default = *False*
If *True*, redirects via PowerShell on Windows. Ignored on POSIX.
## License
MIT License — see the [LICENSE](LICENSE) file.
Raw data
{
"_id": null,
"home_page": null,
"name": "pydatau",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.10",
"maintainer_email": null,
"keywords": "data-science, ipython, R, stata, gams, ampl, matlab, octave",
"author": null,
"author_email": "The Economist <29724411+econcz@users.noreply.github.com>",
"download_url": "https://files.pythonhosted.org/packages/05/dc/afb2cc4d9a88fb188fcee37ed2dfb68933d29f43864a340252f069882d1b/pydatau-1.0.6.tar.gz",
"platform": null,
"description": "# DATAU - Batch Statistical Data Utilities\n\nCross-platform batch runner for statistical and numerical files.\n\n## Installation\n\n```bash\npip install pydatau\n```\n\n## Quick example\npython: \n```python\nfrom datau import autorun\n\nautorun(path_data=\"my_project/data\", pattern='master', log_limit=1)\n```\nbash: \n```bash\ndatau \"my_project/data\" -p \"master\" -l 1\n```\n\n## Supported Extensions and Execution Method\n\n| Extension | Language/Tool | Method |\n|-----------|------------------|-----------------------------|\n| `.ipynb` | Jupyter Notebook | `papermill` |\n| `.R` | R | `rpy2.robjects` |\n| `.do` | Stata | Stata batch mode |\n| `.jl` | Julia | `julia` (must be in PATH) |\n| `.gms` | GAMS | `gams` (must be in PATH) |\n| `.run` | AMPL | `ampl` (must be in PATH) |\n| `.m` | MATLAB/Octave | `matlab.engine` or `Oct2Py` |\n\n## User Reference\n\n```python\nautorun(path_data='...', pattern='...', *args, **kwargs)\n```\n\nAutomatically runs matching statistical scripts in the given directory. Generates a number of `.log` files with outputs for each executed file.\n\n**Parameters:** \n\n`path_data` : *str*, default = *current working directory* \nPath to the directory with input files.\n\n`pattern` : *str*, optional \nRegex pattern to match filenames (not paths), such as *'master'*. Case-insensitive.\n\n`date_fmt` : *str*, default = *'%Y%m%d_%H%M%S'* \nDatetime format for *.log* filenames.\n\n`log_limit` : *int*, optional \nIf set, limits the number of log files per script. Older logs beyond this limit will be deleted. Please note that you can turn off logging by setting `log_limit` to *0*.\n\n`use_powershell` : *bool*, default = *False* \nIf *True*, redirects via PowerShell on Windows. Ignored on POSIX.\n\n## License\n\nMIT License \u2014 see the [LICENSE](LICENSE) file.\n",
"bugtrack_url": null,
"license": null,
"summary": "Batch Statistical Data Utilities",
"version": "1.0.6",
"project_urls": {
"Bug Tracker": "https://github.com/econcz/pydatau/issues",
"Homepage": "https://github.com/econcz/pydatau"
},
"split_keywords": [
"data-science",
" ipython",
" r",
" stata",
" gams",
" ampl",
" matlab",
" octave"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "f65ee888006de00323f67e10bdbf54f78e27ba3f123338015d6b23030ff27ad3",
"md5": "e4c838da24c5391ecd36b864bce4ea7d",
"sha256": "a503e9180e6af60b8daf9d015b30dd72475b25d0610c4b40aef7b1880be98764"
},
"downloads": -1,
"filename": "pydatau-1.0.6-py3-none-any.whl",
"has_sig": false,
"md5_digest": "e4c838da24c5391ecd36b864bce4ea7d",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.10",
"size": 5963,
"upload_time": "2025-08-24T06:44:52",
"upload_time_iso_8601": "2025-08-24T06:44:52.337969Z",
"url": "https://files.pythonhosted.org/packages/f6/5e/e888006de00323f67e10bdbf54f78e27ba3f123338015d6b23030ff27ad3/pydatau-1.0.6-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "05dcafb2cc4d9a88fb188fcee37ed2dfb68933d29f43864a340252f069882d1b",
"md5": "77977ea47216f9d54d9e37f439a3cd8c",
"sha256": "fd090c29d3728e668e2e2579a135092c2596a71301222b14c009a591850f1828"
},
"downloads": -1,
"filename": "pydatau-1.0.6.tar.gz",
"has_sig": false,
"md5_digest": "77977ea47216f9d54d9e37f439a3cd8c",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.10",
"size": 5081,
"upload_time": "2025-08-24T06:44:53",
"upload_time_iso_8601": "2025-08-24T06:44:53.266720Z",
"url": "https://files.pythonhosted.org/packages/05/dc/afb2cc4d9a88fb188fcee37ed2dfb68933d29f43864a340252f069882d1b/pydatau-1.0.6.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-08-24 06:44:53",
"github": true,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"github_user": "econcz",
"github_project": "pydatau",
"travis_ci": false,
"coveralls": false,
"github_actions": false,
"lcname": "pydatau"
}