xvc


Namexvc JSON
Version 0.6.13 PyPI version JSON
download
home_pageNone
SummaryAn MLOps tool to manage data files and pipelines on top of Git
upload_time2025-01-01 08:20:46
maintainerNone
docs_urlNone
authorNone
requires_python>=3.7
licenseNone
keywords machine-learning development devops git
VCS
bugtrack_url
requirements maturin jupyterlab pytest-forked pytest-cov coverage
Travis-CI No Travis.
coveralls test coverage No coveralls.
            ```python
# Xvc
```

This is the Python port of [Xvc](https://github.com/iesahin/xvc). Xvc's goal is let users perform all ML operations regarding data, files, models and pipelines from the command line, and version all of these on top of Git. Xvc.py extends this goal by allowing all commands to be run from Python shells, code and Jupyter notebooks. 

You can install `xvc` to your system or virtual environment with uncommenting the line below. 


```python
# !pip install xvc
```

Import Xvc like any other Python module. 


```python
import xvc
```

This notebook is created with the following version. Please test your issue with the latest version before creating any bug reports. 


```python
xvc.version()
```




    'v0.6.3-46-g71fc283-modified'



Python wrappings for `xvc` command uses `Xvc` class. 


```python
from xvc import Xvc
```

## Tracking files

Let's reset the directory we're in and create a directory tree. These are the data files we want to track. Note that `xvc-test-helper` is a cli application to make life easier for Xvc tests. If you want to reproduce this notebook, you can install it with `cargo install xvc-test-helper`


```python
!chmod -R +w .xvc .xvcignore .git .gitignore test-data/
!rm -rf .xvc .xvcignore .git .gitignore test-data/
!xvc-test-helper create-directory-tree --directories 3 --files 3 --root test-data --seed 20240101
```

    chmod: .xvc: No such file or directory
    chmod: .git: No such file or directory
    chmod: .gitignore: No such file or directory
    chmod: test-data/: No such file or directory


Above command created a directory structure as this. 


```python
!tree
```

    .
    ├── Debug.ipynb
    ├── Readme.ipynb
    ├── default-pipeline.json
    ├── dr-iq-scores.csv
    ├── generate_data.py
    ├── random_names_iq_scores.csv
    ├── requirements.txt
    ├── test-data
    │   ├── dir-0001
    │   │   ├── file-0001.bin
    │   │   ├── file-0002.bin
    │   │   └── file-0003.bin
    │   ├── dir-0002
    │   │   ├── file-0001.bin
    │   │   ├── file-0002.bin
    │   │   └── file-0003.bin
    │   └── dir-0003
    │       ├── file-0001.bin
    │       ├── file-0002.bin
    │       └── file-0003.bin
    └── visualize.py
    
    5 directories, 17 files


Now we can initialize Xvc repository. We first init Git in the directory. Xvc tracks binary files with their metadata (size, timestamp and digest) and this metadata is kept in text files. You can use Xvc without Git, but adding versioning to your data is usually a good idea. Xvc manages Git operations on the text files it creates automatically by default, so it won't add any extra commands to the workflow.


```python
!git init
```

    Initialized empty Git repository in /Users/iex/github.com/iesahin/xvc-notebooks/Readme/.git/


Now we can init the Xvc repository

First we create an Xvc instance. You can specify the workdir and this instance
will work on that directory. For now, only one instance of Xvc can run in a
Python process. This limitation may be removed in future releases.

```python

xvc_test_data = Xvc()
```

Let's initialize Xvc in the directory. This will  create a `.xvc` directory, will put several initial files and also create a `.gitignore` entry for those elements that shouldn't be tracked by Git. 


```python
xvc_test_data.init()
```




    ''



We can see which Xvc repository a directory belongs with `root` command. By default, it returns the relative directory but you can set `absolute` flag to get see the absolute dir. 


```python
xvc_test_data.root(absolute=True)
```

    /Users/iex/github.com/iesahin/xvc-notebooks/Readme





    ''



Let's add a set of files to Xvc. 


```python
xvc_test_data.file().track("test-data/dir-0001/")
```




    ''



If there is no output, it (usually) means the command is successful. If you want more output, you can set verbosity level when creating Xvc object. 

Let's get the list of files in the workspace. 


```python
xvc_test_data.file().list()
```




    'FX         142 2024-06-15 16:11:50          2f4850fa visualize.py\nFX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0003/file-0003.bin\nFX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0003/file-0002.bin\nFX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0003/file-0001.bin\nDX         160 2024-06-16 09:49:47                   test-data/dir-0003\nFX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0002/file-0003.bin\nFX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0002/file-0002.bin\nFX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0002/file-0001.bin\nDX         160 2024-06-16 09:49:47                   test-data/dir-0002\nFC        2003 2024-06-16 09:49:47 41e16be7 41e16be7 test-data/dir-0001/file-0003.bin\nFC        2002 2024-06-16 09:49:47 27f0efd0 27f0efd0 test-data/dir-0001/file-0002.bin\nFC        2001 2024-06-16 09:49:47 66de5084 66de5084 test-data/dir-0001/file-0001.bin\nDX         160 2024-06-16 09:49:48                   test-data/dir-0001\nFX          85 2024-06-16 09:49:48          30842230 test-data/.gitignore\nDX         192 2024-06-16 09:49:48                   test-data\nFX          14 2024-06-15 16:11:50          2b56f46f requirements.txt\nFX       19021 2024-06-16 09:44:16          95dff305 random_names_iq_scores.csv\nFX         629 2024-06-15 16:11:50          1eecb98e generate_data.py\nFX           8 2024-06-16 09:46:37          af1349b9 dr-iq-scores.csv\nFX        2183 2024-06-16 09:46:46          35763b4e default-pipeline.json\nFX       85102 2024-06-16 09:48:12          396cd3fd Readme.ipynb\nFX       30380 2024-06-10 08:25:47          a4864bca Debug.ipynb\nTotal #: 22 Workspace Size:      156254 Cached Size:        6006\n'



Note that Xvc commands return Python strings as output. You can print and process the output as any other string. 


```python
xvc_test_data_list = xvc_test_data.file().list() 
print(xvc_test_data_list)
```

    FX         142 2024-06-15 16:11:50          2f4850fa visualize.py
    FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0003/file-0003.bin
    FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0003/file-0002.bin
    FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0003/file-0001.bin
    DX         160 2024-06-16 09:49:47                   test-data/dir-0003
    FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0002/file-0003.bin
    FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0002/file-0002.bin
    FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0002/file-0001.bin
    DX         160 2024-06-16 09:49:47                   test-data/dir-0002
    FC        2003 2024-06-16 09:49:47 41e16be7 41e16be7 test-data/dir-0001/file-0003.bin
    FC        2002 2024-06-16 09:49:47 27f0efd0 27f0efd0 test-data/dir-0001/file-0002.bin
    FC        2001 2024-06-16 09:49:47 66de5084 66de5084 test-data/dir-0001/file-0001.bin
    DX         160 2024-06-16 09:49:48                   test-data/dir-0001
    FX          85 2024-06-16 09:49:48          30842230 test-data/.gitignore
    DX         192 2024-06-16 09:49:48                   test-data
    FX          14 2024-06-15 16:11:50          2b56f46f requirements.txt
    FX       19021 2024-06-16 09:44:16          95dff305 random_names_iq_scores.csv
    FX         629 2024-06-15 16:11:50          1eecb98e generate_data.py
    FX           8 2024-06-16 09:46:37          af1349b9 dr-iq-scores.csv
    FX        2183 2024-06-16 09:46:46          35763b4e default-pipeline.json
    FX       85102 2024-06-16 09:48:12          396cd3fd Readme.ipynb
    FX       30380 2024-06-10 08:25:47          a4864bca Debug.ipynb
    Total #: 22 Workspace Size:      156254 Cached Size:        6006
    


`xvc.file().list()` returns the list of all files and directories recursively. Its default formatting shows whether a file `F` is tracked and commit (`C`) and its hash values in the workspace and the cache. You can, for example, see files not tracked by processing the output string. 


```python
[l for l in xvc_test_data_list.split("\n") if l.startswith("FX")]
```




    ['FX         142 2024-06-15 16:11:50          2f4850fa visualize.py',
     'FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0003/file-0003.bin',
     'FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0003/file-0002.bin',
     'FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0003/file-0001.bin',
     'FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0002/file-0003.bin',
     'FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0002/file-0002.bin',
     'FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0002/file-0001.bin',
     'FX          85 2024-06-16 09:49:48          30842230 test-data/.gitignore',
     'FX          14 2024-06-15 16:11:50          2b56f46f requirements.txt',
     'FX       19021 2024-06-16 09:44:16          95dff305 random_names_iq_scores.csv',
     'FX         629 2024-06-15 16:11:50          1eecb98e generate_data.py',
     'FX           8 2024-06-16 09:46:37          af1349b9 dr-iq-scores.csv',
     'FX        2183 2024-06-16 09:46:46          35763b4e default-pipeline.json',
     'FX       85102 2024-06-16 09:48:12          396cd3fd Readme.ipynb',
     'FX       30380 2024-06-10 08:25:47          a4864bca Debug.ipynb']



It's also possible to specify the format string to `file().list()` with `format` argument. For example the following shows the size of all workspace files. You can see all format elements in the [`xvc file list` reference](https://docs.xvc.dev/ref/xvc-file-list). 


```python
print(xvc_test_data.file().list(format="{{name}}: {{asz}}"))
```

    visualize.py:         142
    test-data/dir-0003/file-0003.bin:        2003
    test-data/dir-0003/file-0002.bin:        2002
    test-data/dir-0003/file-0001.bin:        2001
    test-data/dir-0003:         160
    test-data/dir-0002/file-0003.bin:        2003
    test-data/dir-0002/file-0002.bin:        2002
    test-data/dir-0002/file-0001.bin:        2001
    test-data/dir-0002:         160
    test-data/dir-0001/file-0003.bin:        2003
    test-data/dir-0001/file-0002.bin:        2002
    test-data/dir-0001/file-0001.bin:        2001
    test-data/dir-0001:         160
    test-data/.gitignore:          85
    test-data:         192
    requirements.txt:          14
    random_names_iq_scores.csv:       19021
    generate_data.py:         629
    dr-iq-scores.csv:           8
    default-pipeline.json:        2183
    Readme.ipynb:       85102
    Debug.ipynb:       30380
    Total #: 22 Workspace Size:      156254 Cached Size:        6006
    


By default Xvc adds files to the cache and _copies_ them to the workspace. For data files that you only need to read, you can specify to use symlinks to track the files. This way, there will only one copy of the file in the cache and a read-only symlink will be placed in the workspace. 


```python
xvc_test_data.file().track("test-data/dir-0002/", recheck_method="symlink")
```




    ''




```python
print(xvc_test_data.file().list("test-data/dir-0002"))
```

    SS         131 2024-06-16 09:49:51 41e16be7          test-data/dir-0002/file-0003.bin
    SS         131 2024-06-16 09:49:51 27f0efd0          test-data/dir-0002/file-0002.bin
    SS         131 2024-06-16 09:49:51 66de5084          test-data/dir-0002/file-0001.bin
    Total #: 3 Workspace Size:         393 Cached Size:        6006
    


Note that the workspace size and cached size are different. The listed workspace size is only the size of symbolic links. Also the initials (symlink as recheck type and symlink in the workspace) is different the files in `test-data/dir-0001/`


```python
!ls -l test-data/dir-000[12]/
```

    
    test-data/dir-0001/:
    .rw-rw-rw- iex staff 2.0 KB Sun Jun 16 12:49:47 2024  file-0001.bin
    .rw-rw-rw- iex staff 2.0 KB Sun Jun 16 12:49:47 2024  file-0002.bin
    .rw-rw-rw- iex staff 2.0 KB Sun Jun 16 12:49:47 2024  file-0003.bin
    
    test-data/dir-0002/:
    lrwxr-xr-x iex staff 131 B Sun Jun 16 12:49:51 2024  file-0001.bin ⇒ /Users/iex/github.com/iesahin/xvc-notebooks/Readme/.xvc/b3/66d/e50/84c7bbab22aa25db35e7e28c16953764cc9036d1782a58444eb3b9394b/0.bin
    lrwxr-xr-x iex staff 131 B Sun Jun 16 12:49:51 2024  file-0002.bin ⇒ /Users/iex/github.com/iesahin/xvc-notebooks/Readme/.xvc/b3/27f/0ef/d00cd3ab2c9d9cd753b0c3a3bd10b79bf0ffc7719fac44218e8130a071/0.bin
    lrwxr-xr-x iex staff 131 B Sun Jun 16 12:49:51 2024  file-0003.bin ⇒ /Users/iex/github.com/iesahin/xvc-notebooks/Readme/.xvc/b3/41e/16b/e7387698c97b703e7607c649498594840cd8bf37295ef430de22ac72c1/0.bin


Xvc allows to track files with hardlinks and reflinks (where supported) as well. You can keep your model files as copies and data files as links. Files in the cache are read-only, so if you don't change their permissions manually, your code cannot write on them.

You can also change their [recheck (checkout) methods](https://docs.xvc.dev/ref/xvc-file-recheck/) after you add them to Xvc. So, if you change your mind and want to start to track `test-data/dir-0001` as hardlinks, you can do so. 


```python
xvc_test_data.file().recheck("test-data/dir-0001/", recheck_method="hardlink")
```




    ''




```python
print(xvc_test_data.file().list("test-data/dir-0001"))
```

    FH        2003 2024-06-16 09:49:47 41e16be7 41e16be7 test-data/dir-0001/file-0003.bin
    FH        2002 2024-06-16 09:49:47 27f0efd0 27f0efd0 test-data/dir-0001/file-0002.bin
    FH        2001 2024-06-16 09:49:47 66de5084 66de5084 test-data/dir-0001/file-0001.bin
    Total #: 3 Workspace Size:        6006 Cached Size:        6006
    


Note that the filesystem doesn't make a distinction between hardlinks and files, they all have the same attributes. So the file sizes are identical between in the workspace and the cache. In the second column you can see that these are tracked as `H`ardlinks by Xvc. 

If the files are tracked by Xvc, they are also normally in the cache. You can delete them and get them back from the cache when they are required. 


```python
!rm -rf test-data/dir-0002/
```


```python
!tree
```

    .
    ├── Debug.ipynb
    ├── Readme.ipynb
    ├── default-pipeline.json
    ├── dr-iq-scores.csv
    ├── generate_data.py
    ├── random_names_iq_scores.csv
    ├── requirements.txt
    ├── test-data
    │   ├── dir-0001
    │   │   ├── file-0001.bin
    │   │   ├── file-0002.bin
    │   │   └── file-0003.bin
    │   └── dir-0003
    │       ├── file-0001.bin
    │       ├── file-0002.bin
    │       └── file-0003.bin
    └── visualize.py
    
    4 directories, 14 files



```python
xvc_test_data.file().recheck("test-data/dir-0002/")
```




    ''




```python
print(xvc_test_data.file().list("test-data/dir-0002"))
```

    SS         131 2024-06-16 09:49:53 41e16be7          test-data/dir-0002/file-0003.bin
    SS         131 2024-06-16 09:49:53 27f0efd0          test-data/dir-0002/file-0002.bin
    SS         131 2024-06-16 09:49:53 66de5084          test-data/dir-0002/file-0001.bin
    Total #: 3 Workspace Size:         393 Cached Size:        6006
    


Note that recheck (checkout) methods for files are also tracked and you don't have to specify them every time. 

## Sharing files

Xvc can copy cached files to [S3](https://docs.xvc.dev/ref/xvc-storage-new-s3) compatible servers like [MinIO](https://docs.xvc.dev/ref/xvc-storage-new-minio) or [Google Cloud Storage](https://docs.xvc.dev/ref/xvc-storage-new-gcs); [locally accessible](https://docs.xvc.dev/ref/xvc-storage-new-local) paths like NFS or external drives; [SSH servers with Rsync](https://docs.xvc.dev/ref/xvc-storage-new-rsync) and via [external commands](https://docs.xvc.dev/ref/xvc-storage-new-generic) like rclone or s5cmd. For the purposes of this introduction, we are going to use S3.  

Note that Xvc doesn't store any credentials and rely on environment variables or existing SSH connections. 

Let's start by configuring the S3 connection that we'll use. 


```python
xvc_test_data.storage().new_s3(name="backup", bucket_name="xvc-test", region="eu-central-1", storage_prefix="xvc-storage")
```




    ''



You can now send the files you track to this storage.


```python
xvc_test_data.file().send(storage="backup")

```




    ''



When you (or someone else) want to access these files later, you can clone the Git repository and get the files from the
storage.

```shell
$ git clone https://example.com/my-machine-learning-project
Cloning into 'my-machine-learning-project'...
```



```python
xvc_test_data.file().bring("test-data/", storage="backup")
```




    ''



You can also share the files with others for a period if your storage type supports signed URLs. The default period is 24 hours. 


```python
xvc_test_data.file().share("test-data/dir-0001/file-0001.bin", storage="backup", period="1h")
```




    'https://xvc-test.s3.eu-central-1.amazonaws.com/xvc-test/xvc-storage/3c7f8146-df25-4c22-b0e6-bb20d600acaf/b3/66d/e50/84c7bbab22aa25db35e7e28c16953764cc9036d1782a58444eb3b9394b/0.bin?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAJOXNDTB55QE3GOEQ%2F20240616%2Feu-central-1%2Fs3%2Faws4_request&X-Amz-Date=20240616T094957Z&X-Amz-Expires=86400&X-Amz-SignedHeaders=host&X-Amz-Signature=d82f59453c95abc9421bac2a12605373708b678c6a1e48d8b2734eb34bd22942'



This approach ensures convenient access to files from the shared storage when needed. You can share the URL with anyone and they will be able to download your file within this period. 



You don't have to reconfigure the storage after cloning, but you need to have valid credentials as environment variables
to access the storage.

Xvc never stores any credentials.


# Pipelines

If you have commands that depend on data or code elements, you can configure a pipeline.



For this example, we'll use [a Python script](https://github.com/iesahin/xvc/blob/main/workflow_tests/templates/README.in/generate_data.py) to generate a data set with random names with random IQ scores.

The script uses the Faker library and this library must be available where you run the pipeline. To make it repeatable, we start the pipeline by adding a step that creates the virtual environment and installs dependencies.


```python
pipeline = xvc_test_data.pipeline()
```


```python
pipeline.step().new(step_name="install-deps", command="python3 -m pip install -r requirements.txt")
```




    ''



We'll make `install-deps` step to depend on `requirements.txt` file, so when the requirements file changes it will make the step run. 


```python
pipeline.step().dependency(step_name="install-deps", file="requirements.txt")

```




    ''




Xvc allows to create dependencies between pipeline steps. Dependent steps wait for dependencies to finish successfully. 

Now we create a step to run the script and make `install-deps` step a dependency of it. 


```python

```


```python
pipeline.step().new(step_name="generate-data", command="python3 generate_data.py")
pipeline.step().dependency(step_name="generate-data", step="install-deps")
```




    ''



After you define the pipeline, you can run it by:


```python
print(pipeline.run())

```

    [OUT] [install-deps] Requirement already satisfied: faker in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from -r requirements.txt (line 1)) (25.8.0)
    Requirement already satisfied: seaborn in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from -r requirements.txt (line 2)) (0.13.2)
    Requirement already satisfied: python-dateutil>=2.4 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from faker->-r requirements.txt (line 1)) (2.9.0.post0)
    Requirement already satisfied: numpy!=1.24.0,>=1.20 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from seaborn->-r requirements.txt (line 2)) (1.26.4)
    Requirement already satisfied: pandas>=1.2 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from seaborn->-r requirements.txt (line 2)) (2.2.2)
    Requirement already satisfied: matplotlib!=3.6.1,>=3.4 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from seaborn->-r requirements.txt (line 2)) (3.9.0)
    Requirement already satisfied: contourpy>=1.0.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (1.2.1)
    Requirement already satisfied: cycler>=0.10 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (0.12.1)
    Requirement already satisfied: fonttools>=4.22.0 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (4.53.0)
    Requirement already satisfied: kiwisolver>=1.3.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (1.4.5)
    Requirement already satisfied: packaging>=20.0 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (24.0)
    Requirement already satisfied: pillow>=8 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (10.3.0)
    Requirement already satisfied: pyparsing>=2.3.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (3.1.2)
    Requirement already satisfied: pytz>=2020.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from pandas>=1.2->seaborn->-r requirements.txt (line 2)) (2024.1)
    Requirement already satisfied: tzdata>=2022.7 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from pandas>=1.2->seaborn->-r requirements.txt (line 2)) (2024.1)
    Requirement already satisfied: six>=1.5 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from python-dateutil>=2.4->faker->-r requirements.txt (line 1)) (1.16.0)
     [DONE] install-deps (python3 -m pip install -r requirements.txt)[OUT] [generate-data] CSV file generated successfully.
     [DONE] generate-data (python3 generate_data.py)




Xvc allows many kinds of dependencies, like [files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#file-dependencies), 
[groups of files and directories defined by globs](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#glob-dependencies), 
[regular expression searches in files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#regex-dependencies), 
[line ranges in files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#line-dependencies), 
[hyper-parameters defined in YAML, JSON or TOML files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#hyper-parameter-dependencies)
[HTTP URLs](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#url-dependencies),
[shell command outputs](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#generic-command-dependencies), 
and [other steps](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#step-dependencies). 

> 🔅
> The rest of this document introduces some of these dependencies. **Please do not panic** if you're not into regexes or hyperparameters. 



Suppose you're only interested in the IQ scores of those with _Dr._ in front of their names and how they differ from the rest in the dataset we created. Let's create a regex search dependency to the data file that will show all _doctors_ IQ scores.



```python
pipeline.step().new(step_name="dr-iq", command='echo "${XVC_REGEX_ADDED_ITEMS}" >> dr-iq-scores.csv')
pipeline.step().dependency(step_name="dr-iq", regex_items=r'random_names_iq_scores.csv:/^Dr\..*')

```




    ''





The first line specifies a command, when run writes `${XVC_REGEX_ADDED_ITEMS}` environment variable to `dr-iq-scores.csv` file. 
The second line specifies the dependency which will also populate the `$[XVC_REGEX_ADDED_ITEMS]` environment variable in the command. 



Some dependency types like [regex items], 
[line items] and [glob items] inject environment variables in the commands they are a dependency.
For example, if you have two million files specified with a glob, but want to run a script only on the added files after the last run, you can use these environment variables. 

[regex items]: https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#regex-item-dependencies
[line items]: https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#line-item-dependencies
[glob items]: https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#glob-item-dependencies


When you run the pipeline again, a file named `dr-iq-scores.csv` will be created. Note that, as `requirements.txt` didn't change `install-deps` step and its dependent `generate-data` steps didn't run.



```python
print(pipeline.run())
```

    [DONE] dr-iq (echo "${XVC_REGEX_ADDED_ITEMS}" >> dr-iq-scores.csv)





We are using this feature to get lines starting with `Dr.` from the file and write them to another file. When the file changes, e.g. another record matching the dependency regex added to the `random_names_iq_scores.csv` file, it will also be added to `dr-iq-scores.csv` file.



```python

!zsh -cl 'echo "Dr. Albert Einstein,144" >> random_names_iq_scores.csv'

print(pipeline.run())

```

    [DONE] dr-iq (echo "${XVC_REGEX_ADDED_ITEMS}" >> dr-iq-scores.csv)




Now we want to add a another command that draws a fancy histogram from `dr-iq-scores.csv`. As this new step must wait `dr-iq-scores.csv` file to be ready, we'll define `dr-iq-scores.csv` as an _output_ of `dr-iq` step and set the file as a dependency to this new `visualize` step.



```python
pipeline.step().output(step_name="dr-iq", output_file="dr-iq-scores.csv")

pipeline.step().new(step_name="visualize", command='python3 visualize.py')
pipeline.step().dependency(step_name="visualize", file="dr-iq-scores.csv")

print(pipeline.run())


```

    [DONE] dr-iq (echo "${XVC_REGEX_ADDED_ITEMS}" >> dr-iq-scores.csv)[ERROR] Step visualize finished UNSUCCESSFULLY with command python3 visualize.py



You can get the pipeline in Graphviz DOT or mermaid format to convert to an image.



```python
print(pipeline.dag(format="mermaid"))
```

    flowchart TD
        n0["install-deps"]
        n1["requirements.txt"] --> n0
        n2["generate-data"]
        n0["install-deps"] --> n2
        n3["dr-iq"]
        n4["visualize"]
        n5["dr-iq-scores.csv"] --> n4
    


```mermaid
flowchart TD
    n0["install-deps"]
    n1["requirements.txt"] --> n0
    n2["generate-data"]
    n0["install-deps"] --> n2
    n3["dr-iq"]
    n4["visualize"]
    n5["dr-iq-scores.csv"] --> n4
```

You can also export and import the pipeline to JSON to edit in your editor.


```python
pipeline.export(file="default-pipeline.json")
```




    ''




```python
!cat default-pipeline.json
```

    ───────┬────────────────────────────────────────────────────────────────────────
           │ File: default-pipeline.json
    ───────┼────────────────────────────────────────────────────────────────────────
       1   │ {
       2   │   "name": "default",
       3   │   "steps": [
       4   │     {
       5   │       "command": "python3 -m pip install -r requirements.txt",
       6   │       "dependencies": [
       7   │         {
       8   │           "File": {
       9   │             "content_digest": {
      10   │               "algorithm": "Blake3",
      11   │               "digest": [
      12   │                 43,
      13   │                 86,
      14   │                 244,
      15   │                 111,
      16   │                 13,
      17   │                 243,
      18   │                 28,
      19   │                 110,
      20   │                 140,
      21   │                 213,
      22   │                 105,
      23   │                 20,
      24   │                 239,
      25   │                 62,
      26   │                 73,
      27   │                 75,
      28   │                 13,
      29   │                 146,
      30   │                 82,
      31   │                 17,
      32   │                 148,
      33   │                 152,
      34   │                 66,
      35   │                 86,
      36   │                 154,
      37   │                 230,
      38   │                 154,
      39   │                 246,
      40   │                 213,
      41   │                 214,
      42   │                 40,
      43   │                 119
      44   │               ]
      45   │             },
      46   │             "path": "requirements.txt",
      47   │             "xvc_metadata": {
      48   │               "file_type": "File",
      49   │               "modified": {
      50   │                 "nanos_since_epoch": 431372139,
      51   │                 "secs_since_epoch": 1718467910
      52   │               },
      53   │               "size": 14
      54   │             }
      55   │           }
      56   │         }
      57   │       ],
      58   │       "invalidate": "ByDependencies",
      59   │       "name": "install-deps",
      60   │       "outputs": []
      61   │     },
      62   │     {
      63   │       "command": "python3 generate_data.py",
      64   │       "dependencies": [
      65   │         {
      66   │           "Step": {
      67   │             "name": "install-deps"
      68   │           }
      69   │         }
      70   │       ],
      71   │       "invalidate": "ByDependencies",
      72   │       "name": "generate-data",
      73   │       "outputs": []
      74   │     },
      75   │     {
      76   │       "command": "echo \"${XVC_REGEX_ADDED_ITEMS}\" >> dr-iq-scores.csv
           │ ",
      77   │       "dependencies": [],
      78   │       "invalidate": "ByDependencies",
      79   │       "name": "dr-iq",
      80   │       "outputs": []
      81   │     },
      82   │     {
      83   │       "command": "python3 visualize.py",
      84   │       "dependencies": [
      85   │         {
      86   │           "File": {
      87   │             "content_digest": null,
      88   │             "path": "dr-iq-scores.csv",
      89   │             "xvc_metadata": null
      90   │           }
      91   │         }
      92   │       ],
      93   │       "invalidate": "ByDependencies",
      94   │       "name": "visualize",
      95   │       "outputs": []
      96   │     }
      97   │   ],
      98   │   "version": 1,
      99   │   "workdir": ""
     100   │ }
    ───────┴────────────────────────────────────────────────────────────────────────



You can edit the file (in Jupyter lab or in your editor) to change commands, add new dependencies, etc. and import it back to Xvc with `import_pipeline` method. You can create copies of pipelines by either editing the pipeline name in the file or supplying the name with `name` argument. 


> 🔅 `import_pipeline` is `xvc pipeline import` in command line. Renamed because `import` is a reserved word. 


```python
pipeline.import_pipeline(file="default-pipeline.json", name="another-pipeline")

```




    ''



Now you can list all pipelines and run `another-pipeline` if you want to.


```python
print(pipeline.list())
```

    +------------------+---------+
    | Name             | Run Dir |
    +============================+
    | default          |         |
    |------------------+---------|
    | another-pipeline |         |
    +------------------+---------+




Please create an issue or discussion for any other kinds of dependencies that you'd like to be included.

I'm planning to add [data label and annotations tracking](https://github.com/iesahin/xvc/discussions/208)), [experiments tracking](https://github.com/iesahin/xvc/discussions/207)), [model tracking](https://github.com/iesahin/xvc/discussions/211)), encrypted cache, server to control all commands from a web interface, and more as my time permits.

Please check [`docs.xvc.dev`](https://docs.xvc.dev) for documentation.





## 🤟 Big Thanks

xvc stands on the following (giant) crates:

- [trycmd] is used to run all example commands in this file, [reference, and how-to documentation](https://docs.xvc.dev) at
  every PR. It makes sure that the documentation is always up-to-date and shown commands work as described. We start
  development by writing documentation and implementing them thanks to [trycmd].

- [serde] allows all data structures to be stored in text files. Special thanks from [`xvc-ecs`] for serializing components in an ECS with a single line of code.

- Xvc processes files in parallel with pipelines and parallel iterators thanks to [crossbeam] and [rayon].

- Thanks to [strum], Xvc uses enums extensively and converts almost everything to typed values from strings.

- Xvc has a deep CLI that has subcommands of subcommands (e.g. `xvc storage new s3`), and all these work with minimum bugs thanks to [clap].

- Xvc uses [rust-s3] to connect to S3 and compatible storage services. It employs excellent [tokio] for fast async Rust. These cloud storage features can be turned off thanks to Rust conditional compilation.

- Without implementations of [BLAKE3], BLAKE2, SHA-2 and SHA-3 from Rust [crypto] crate, Xvc couldn't detect file changes so fast.

- Many thanks to small and well built crates, [reflink], [relative-path], [path-absolutize], [glob] for file system and glob handling.

- Thanks to [sad_machine] for providing a State Machine implementation that I used in `xvc pipeline run`. A DAG composed of State Machines made running pipeline steps in parallel with a clean separation of process states.

- Thanks to [thiserror] and [anyhow] for making error handling a breeze. These two crates make me feel I'm doing something good for the humanity when handling errors.

- Xvc is split into many crates and owes this organization to [cargo workspaces].

[crossbeam]: https://docs.rs/crossbeam/
[cargo workspaces]: https://crates.io/crates/cargo-workspaces
[rayon]: https://docs.rs/rayon/
[strum]: https://docs.rs/strum/
[clap]: https://docs.rs/clap/
[serde]: https://serde.rs
[blake3]: https://docs.rs/blake3/
[crypto]: https://docs.rs/rust-crypto/
[reflink]: https://docs.rs/reflink/
[relative-path]: https://docs.rs/relative-path/
[path-absolutize]: https://docs.rs/path-absolutize/
[glob]: https://docs.rs/glob/
[wax]: https://docs.rs/wax/
[trycmd]: https://docs.rs/trycmd/
[sad_machine]: https://docs.rs/sad_machine/
[thiserror]: https://docs.rs/thiserror/
[anyhow]: https://docs.rs/anyhow/
[rust-s3]: https://docs.rs/rust-s3/
[`xvc-ecs`]: https://docs.rs/xvc-ecs/
[tokio]: https://tokio.rs

And, biggest thanks to Rust designers, developers and contributors. Although I can't see myself expert to appreciate it all, it's a fabulous language and environment to work with.


## 🚁 Support

- You can use [Discussions](https://github.com/iesahin/xvc/discussions) to ask questions. I'll answer as much as possible. Thank you.
- I don't follow any other sites regularly. You can also reach me at [emre@xvc.dev](mailto:emre@xvc.dev)



## 👐 Contributing

- Star this repo. I feel very happy for every star and send my best wishes to you. That's a certain win to spend your two seconds for me. Thanks.
- Use xvc. Tell me how it works for you, read the [documentation](https://docs.xvc.dev), [report bugs](https://github.com/iesahin/xvc/issues), [discuss features](https://github.com/iesahin/xvc/discussions).
- Please note that, I don't accept large code PRs. Please open an issue to discuss your idea and write/modify a
  reference page before sending a PR. I'm happy to discuss and help you to implement your idea. Also, it may require a copyright transfer to me, as there may be cases which I provide the code in other licenses. 



## 📜 License

Xvc and these Python bindings are licensed under the [GNU GPL 3.0 License](https://github.com/iesahin/xvc/blob/main/LICENSE). If you want to use the code in your project with other licenses, please contact me.


## 🌦️ Future and Maintenance

I'm using Xvc daily and I'm happy with it. Tracking all my files with Git via arbitrary servers and cloud providers is
something I always need. I'm happy to improve and maintain it as long as I use it.

Given that I'm working on this for the last two years for pure technical bliss, you can expect me to work on it more. 



## ⚠️ Disclaimer

This software is fresh and ambitious. Although I use it and test it close to real-world conditions, it didn't go under
the test of time. **Xvc can eat your files and spit them into the eternal void!** Please take backups.



            

Raw data

            {
    "_id": null,
    "home_page": null,
    "name": "xvc",
    "maintainer": null,
    "docs_url": null,
    "requires_python": ">=3.7",
    "maintainer_email": "Emre Sahin <contact@emresahin.net>",
    "keywords": "machine-learning, development, devops, Git",
    "author": null,
    "author_email": null,
    "download_url": "https://files.pythonhosted.org/packages/33/f3/1a04090e05edbf2b13588a28e22c4ee10956a1fdfa201c5e99dd4ed2ece7/xvc-0.6.13.tar.gz",
    "platform": null,
    "description": "```python\n# Xvc\n```\n\nThis is the Python port of [Xvc](https://github.com/iesahin/xvc). Xvc's goal is let users perform all ML operations regarding data, files, models and pipelines from the command line, and version all of these on top of Git. Xvc.py extends this goal by allowing all commands to be run from Python shells, code and Jupyter notebooks. \n\nYou can install `xvc` to your system or virtual environment with uncommenting the line below. \n\n\n```python\n# !pip install xvc\n```\n\nImport Xvc like any other Python module. \n\n\n```python\nimport xvc\n```\n\nThis notebook is created with the following version. Please test your issue with the latest version before creating any bug reports. \n\n\n```python\nxvc.version()\n```\n\n\n\n\n    'v0.6.3-46-g71fc283-modified'\n\n\n\nPython wrappings for `xvc` command uses `Xvc` class. \n\n\n```python\nfrom xvc import Xvc\n```\n\n## Tracking files\n\nLet's reset the directory we're in and create a directory tree. These are the data files we want to track. Note that `xvc-test-helper` is a cli application to make life easier for Xvc tests. If you want to reproduce this notebook, you can install it with `cargo install xvc-test-helper`\n\n\n```python\n!chmod -R +w .xvc .xvcignore .git .gitignore test-data/\n!rm -rf .xvc .xvcignore .git .gitignore test-data/\n!xvc-test-helper create-directory-tree --directories 3 --files 3 --root test-data --seed 20240101\n```\n\n    chmod: .xvc: No such file or directory\n    chmod: .git: No such file or directory\n    chmod: .gitignore: No such file or directory\n    chmod: test-data/: No such file or directory\n\n\nAbove command created a directory structure as this. \n\n\n```python\n!tree\n```\n\n    \u001b[1;36m.\u001b[0m\n    \u251c\u2500\u2500 Debug.ipynb\n    \u251c\u2500\u2500 Readme.ipynb\n    \u251c\u2500\u2500 default-pipeline.json\n    \u251c\u2500\u2500 dr-iq-scores.csv\n    \u251c\u2500\u2500 generate_data.py\n    \u251c\u2500\u2500 random_names_iq_scores.csv\n    \u251c\u2500\u2500 requirements.txt\n    \u251c\u2500\u2500 \u001b[1;36mtest-data\u001b[0m\n    \u2502\u00a0\u00a0 \u251c\u2500\u2500 \u001b[1;36mdir-0001\u001b[0m\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u251c\u2500\u2500 file-0001.bin\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u251c\u2500\u2500 file-0002.bin\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u2514\u2500\u2500 file-0003.bin\n    \u2502\u00a0\u00a0 \u251c\u2500\u2500 \u001b[1;36mdir-0002\u001b[0m\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u251c\u2500\u2500 file-0001.bin\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u251c\u2500\u2500 file-0002.bin\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u2514\u2500\u2500 file-0003.bin\n    \u2502\u00a0\u00a0 \u2514\u2500\u2500 \u001b[1;36mdir-0003\u001b[0m\n    \u2502\u00a0\u00a0     \u251c\u2500\u2500 file-0001.bin\n    \u2502\u00a0\u00a0     \u251c\u2500\u2500 file-0002.bin\n    \u2502\u00a0\u00a0     \u2514\u2500\u2500 file-0003.bin\n    \u2514\u2500\u2500 visualize.py\n    \n    5 directories, 17 files\n\n\nNow we can initialize Xvc repository. We first init Git in the directory. Xvc tracks binary files with their metadata (size, timestamp and digest) and this metadata is kept in text files. You can use Xvc without Git, but adding versioning to your data is usually a good idea. Xvc manages Git operations on the text files it creates automatically by default, so it won't add any extra commands to the workflow.\n\n\n```python\n!git init\n```\n\n    Initialized empty Git repository in /Users/iex/github.com/iesahin/xvc-notebooks/Readme/.git/\n\n\nNow we can init the Xvc repository\n\nFirst we create an Xvc instance. You can specify the workdir and this instance\nwill work on that directory. For now, only one instance of Xvc can run in a\nPython process. This limitation may be removed in future releases.\n\n```python\n\nxvc_test_data = Xvc()\n```\n\nLet's initialize Xvc in the directory. This will  create a `.xvc` directory, will put several initial files and also create a `.gitignore` entry for those elements that shouldn't be tracked by Git. \n\n\n```python\nxvc_test_data.init()\n```\n\n\n\n\n    ''\n\n\n\nWe can see which Xvc repository a directory belongs with `root` command. By default, it returns the relative directory but you can set `absolute` flag to get see the absolute dir. \n\n\n```python\nxvc_test_data.root(absolute=True)\n```\n\n    /Users/iex/github.com/iesahin/xvc-notebooks/Readme\n\n\n\n\n\n    ''\n\n\n\nLet's add a set of files to Xvc. \n\n\n```python\nxvc_test_data.file().track(\"test-data/dir-0001/\")\n```\n\n\n\n\n    ''\n\n\n\nIf there is no output, it (usually) means the command is successful. If you want more output, you can set verbosity level when creating Xvc object. \n\nLet's get the list of files in the workspace. \n\n\n```python\nxvc_test_data.file().list()\n```\n\n\n\n\n    'FX         142 2024-06-15 16:11:50          2f4850fa visualize.py\\nFX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0003/file-0003.bin\\nFX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0003/file-0002.bin\\nFX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0003/file-0001.bin\\nDX         160 2024-06-16 09:49:47                   test-data/dir-0003\\nFX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0002/file-0003.bin\\nFX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0002/file-0002.bin\\nFX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0002/file-0001.bin\\nDX         160 2024-06-16 09:49:47                   test-data/dir-0002\\nFC        2003 2024-06-16 09:49:47 41e16be7 41e16be7 test-data/dir-0001/file-0003.bin\\nFC        2002 2024-06-16 09:49:47 27f0efd0 27f0efd0 test-data/dir-0001/file-0002.bin\\nFC        2001 2024-06-16 09:49:47 66de5084 66de5084 test-data/dir-0001/file-0001.bin\\nDX         160 2024-06-16 09:49:48                   test-data/dir-0001\\nFX          85 2024-06-16 09:49:48          30842230 test-data/.gitignore\\nDX         192 2024-06-16 09:49:48                   test-data\\nFX          14 2024-06-15 16:11:50          2b56f46f requirements.txt\\nFX       19021 2024-06-16 09:44:16          95dff305 random_names_iq_scores.csv\\nFX         629 2024-06-15 16:11:50          1eecb98e generate_data.py\\nFX           8 2024-06-16 09:46:37          af1349b9 dr-iq-scores.csv\\nFX        2183 2024-06-16 09:46:46          35763b4e default-pipeline.json\\nFX       85102 2024-06-16 09:48:12          396cd3fd Readme.ipynb\\nFX       30380 2024-06-10 08:25:47          a4864bca Debug.ipynb\\nTotal #: 22 Workspace Size:      156254 Cached Size:        6006\\n'\n\n\n\nNote that Xvc commands return Python strings as output. You can print and process the output as any other string. \n\n\n```python\nxvc_test_data_list = xvc_test_data.file().list() \nprint(xvc_test_data_list)\n```\n\n    FX         142 2024-06-15 16:11:50          2f4850fa visualize.py\n    FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0003/file-0003.bin\n    FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0003/file-0002.bin\n    FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0003/file-0001.bin\n    DX         160 2024-06-16 09:49:47                   test-data/dir-0003\n    FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0002/file-0003.bin\n    FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0002/file-0002.bin\n    FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0002/file-0001.bin\n    DX         160 2024-06-16 09:49:47                   test-data/dir-0002\n    FC        2003 2024-06-16 09:49:47 41e16be7 41e16be7 test-data/dir-0001/file-0003.bin\n    FC        2002 2024-06-16 09:49:47 27f0efd0 27f0efd0 test-data/dir-0001/file-0002.bin\n    FC        2001 2024-06-16 09:49:47 66de5084 66de5084 test-data/dir-0001/file-0001.bin\n    DX         160 2024-06-16 09:49:48                   test-data/dir-0001\n    FX          85 2024-06-16 09:49:48          30842230 test-data/.gitignore\n    DX         192 2024-06-16 09:49:48                   test-data\n    FX          14 2024-06-15 16:11:50          2b56f46f requirements.txt\n    FX       19021 2024-06-16 09:44:16          95dff305 random_names_iq_scores.csv\n    FX         629 2024-06-15 16:11:50          1eecb98e generate_data.py\n    FX           8 2024-06-16 09:46:37          af1349b9 dr-iq-scores.csv\n    FX        2183 2024-06-16 09:46:46          35763b4e default-pipeline.json\n    FX       85102 2024-06-16 09:48:12          396cd3fd Readme.ipynb\n    FX       30380 2024-06-10 08:25:47          a4864bca Debug.ipynb\n    Total #: 22 Workspace Size:      156254 Cached Size:        6006\n    \n\n\n`xvc.file().list()` returns the list of all files and directories recursively. Its default formatting shows whether a file `F` is tracked and commit (`C`) and its hash values in the workspace and the cache. You can, for example, see files not tracked by processing the output string. \n\n\n```python\n[l for l in xvc_test_data_list.split(\"\\n\") if l.startswith(\"FX\")]\n```\n\n\n\n\n    ['FX         142 2024-06-15 16:11:50          2f4850fa visualize.py',\n     'FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0003/file-0003.bin',\n     'FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0003/file-0002.bin',\n     'FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0003/file-0001.bin',\n     'FX        2003 2024-06-16 09:49:47          41e16be7 test-data/dir-0002/file-0003.bin',\n     'FX        2002 2024-06-16 09:49:47          27f0efd0 test-data/dir-0002/file-0002.bin',\n     'FX        2001 2024-06-16 09:49:47          66de5084 test-data/dir-0002/file-0001.bin',\n     'FX          85 2024-06-16 09:49:48          30842230 test-data/.gitignore',\n     'FX          14 2024-06-15 16:11:50          2b56f46f requirements.txt',\n     'FX       19021 2024-06-16 09:44:16          95dff305 random_names_iq_scores.csv',\n     'FX         629 2024-06-15 16:11:50          1eecb98e generate_data.py',\n     'FX           8 2024-06-16 09:46:37          af1349b9 dr-iq-scores.csv',\n     'FX        2183 2024-06-16 09:46:46          35763b4e default-pipeline.json',\n     'FX       85102 2024-06-16 09:48:12          396cd3fd Readme.ipynb',\n     'FX       30380 2024-06-10 08:25:47          a4864bca Debug.ipynb']\n\n\n\nIt's also possible to specify the format string to `file().list()` with `format` argument. For example the following shows the size of all workspace files. You can see all format elements in the [`xvc file list` reference](https://docs.xvc.dev/ref/xvc-file-list). \n\n\n```python\nprint(xvc_test_data.file().list(format=\"{{name}}: {{asz}}\"))\n```\n\n    visualize.py:         142\n    test-data/dir-0003/file-0003.bin:        2003\n    test-data/dir-0003/file-0002.bin:        2002\n    test-data/dir-0003/file-0001.bin:        2001\n    test-data/dir-0003:         160\n    test-data/dir-0002/file-0003.bin:        2003\n    test-data/dir-0002/file-0002.bin:        2002\n    test-data/dir-0002/file-0001.bin:        2001\n    test-data/dir-0002:         160\n    test-data/dir-0001/file-0003.bin:        2003\n    test-data/dir-0001/file-0002.bin:        2002\n    test-data/dir-0001/file-0001.bin:        2001\n    test-data/dir-0001:         160\n    test-data/.gitignore:          85\n    test-data:         192\n    requirements.txt:          14\n    random_names_iq_scores.csv:       19021\n    generate_data.py:         629\n    dr-iq-scores.csv:           8\n    default-pipeline.json:        2183\n    Readme.ipynb:       85102\n    Debug.ipynb:       30380\n    Total #: 22 Workspace Size:      156254 Cached Size:        6006\n    \n\n\nBy default Xvc adds files to the cache and _copies_ them to the workspace. For data files that you only need to read, you can specify to use symlinks to track the files. This way, there will only one copy of the file in the cache and a read-only symlink will be placed in the workspace. \n\n\n```python\nxvc_test_data.file().track(\"test-data/dir-0002/\", recheck_method=\"symlink\")\n```\n\n\n\n\n    ''\n\n\n\n\n```python\nprint(xvc_test_data.file().list(\"test-data/dir-0002\"))\n```\n\n    SS         131 2024-06-16 09:49:51 41e16be7          test-data/dir-0002/file-0003.bin\n    SS         131 2024-06-16 09:49:51 27f0efd0          test-data/dir-0002/file-0002.bin\n    SS         131 2024-06-16 09:49:51 66de5084          test-data/dir-0002/file-0001.bin\n    Total #: 3 Workspace Size:         393 Cached Size:        6006\n    \n\n\nNote that the workspace size and cached size are different. The listed workspace size is only the size of symbolic links. Also the initials (symlink as recheck type and symlink in the workspace) is different the files in `test-data/dir-0001/`\n\n\n```python\n!ls -l test-data/dir-000[12]/\n```\n\n    \n    test-data/dir-0001/:\n    .\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;6m\u001b[39m \u001b[38;5;230miex\u001b[39m \u001b[38;5;187mstaff\u001b[39m \u001b[38;5;229m2.0\u001b[39m \u001b[38;5;229mKB\u001b[39m \u001b[38;5;40mSun Jun 16 12:49:47 2024\u001b[39m \ueae8 file-0001.bin\n    .\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;6m\u001b[39m \u001b[38;5;230miex\u001b[39m \u001b[38;5;187mstaff\u001b[39m \u001b[38;5;229m2.0\u001b[39m \u001b[38;5;229mKB\u001b[39m \u001b[38;5;40mSun Jun 16 12:49:47 2024\u001b[39m \ueae8 file-0002.bin\n    .\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;6m\u001b[39m \u001b[38;5;230miex\u001b[39m \u001b[38;5;187mstaff\u001b[39m \u001b[38;5;229m2.0\u001b[39m \u001b[38;5;229mKB\u001b[39m \u001b[38;5;40mSun Jun 16 12:49:47 2024\u001b[39m \ueae8 file-0003.bin\n    \n    test-data/dir-0002/:\n    \u001b[38;5;5ml\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;6m\u001b[39m \u001b[38;5;230miex\u001b[39m \u001b[38;5;187mstaff\u001b[39m \u001b[38;5;229m131\u001b[39m \u001b[38;5;229mB\u001b[39m \u001b[38;5;40mSun Jun 16 12:49:51 2024\u001b[39m \u001b[38;5;5m\uf481 file-0001.bin\u001b[39m \u21d2 \u001b[38;5;5m/Users/iex/github.com/iesahin/xvc-notebooks/Readme/.xvc/b3/66d/e50/84c7bbab22aa25db35e7e28c16953764cc9036d1782a58444eb3b9394b/0.bin\u001b[39m\n    \u001b[38;5;5ml\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;6m\u001b[39m \u001b[38;5;230miex\u001b[39m \u001b[38;5;187mstaff\u001b[39m \u001b[38;5;229m131\u001b[39m \u001b[38;5;229mB\u001b[39m \u001b[38;5;40mSun Jun 16 12:49:51 2024\u001b[39m \u001b[38;5;5m\uf481 file-0002.bin\u001b[39m \u21d2 \u001b[38;5;5m/Users/iex/github.com/iesahin/xvc-notebooks/Readme/.xvc/b3/27f/0ef/d00cd3ab2c9d9cd753b0c3a3bd10b79bf0ffc7719fac44218e8130a071/0.bin\u001b[39m\n    \u001b[38;5;5ml\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;3mw\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;2mr\u001b[39m\u001b[38;5;245m-\u001b[39m\u001b[38;5;1mx\u001b[39m\u001b[38;5;6m\u001b[39m \u001b[38;5;230miex\u001b[39m \u001b[38;5;187mstaff\u001b[39m \u001b[38;5;229m131\u001b[39m \u001b[38;5;229mB\u001b[39m \u001b[38;5;40mSun Jun 16 12:49:51 2024\u001b[39m \u001b[38;5;5m\uf481 file-0003.bin\u001b[39m \u21d2 \u001b[38;5;5m/Users/iex/github.com/iesahin/xvc-notebooks/Readme/.xvc/b3/41e/16b/e7387698c97b703e7607c649498594840cd8bf37295ef430de22ac72c1/0.bin\u001b[39m\n\n\nXvc allows to track files with hardlinks and reflinks (where supported) as well. You can keep your model files as copies and data files as links. Files in the cache are read-only, so if you don't change their permissions manually, your code cannot write on them.\n\nYou can also change their [recheck (checkout) methods](https://docs.xvc.dev/ref/xvc-file-recheck/) after you add them to Xvc. So, if you change your mind and want to start to track `test-data/dir-0001` as hardlinks, you can do so. \n\n\n```python\nxvc_test_data.file().recheck(\"test-data/dir-0001/\", recheck_method=\"hardlink\")\n```\n\n\n\n\n    ''\n\n\n\n\n```python\nprint(xvc_test_data.file().list(\"test-data/dir-0001\"))\n```\n\n    FH        2003 2024-06-16 09:49:47 41e16be7 41e16be7 test-data/dir-0001/file-0003.bin\n    FH        2002 2024-06-16 09:49:47 27f0efd0 27f0efd0 test-data/dir-0001/file-0002.bin\n    FH        2001 2024-06-16 09:49:47 66de5084 66de5084 test-data/dir-0001/file-0001.bin\n    Total #: 3 Workspace Size:        6006 Cached Size:        6006\n    \n\n\nNote that the filesystem doesn't make a distinction between hardlinks and files, they all have the same attributes. So the file sizes are identical between in the workspace and the cache. In the second column you can see that these are tracked as `H`ardlinks by Xvc. \n\nIf the files are tracked by Xvc, they are also normally in the cache. You can delete them and get them back from the cache when they are required. \n\n\n```python\n!rm -rf test-data/dir-0002/\n```\n\n\n```python\n!tree\n```\n\n    \u001b[1;36m.\u001b[0m\n    \u251c\u2500\u2500 Debug.ipynb\n    \u251c\u2500\u2500 Readme.ipynb\n    \u251c\u2500\u2500 default-pipeline.json\n    \u251c\u2500\u2500 dr-iq-scores.csv\n    \u251c\u2500\u2500 generate_data.py\n    \u251c\u2500\u2500 random_names_iq_scores.csv\n    \u251c\u2500\u2500 requirements.txt\n    \u251c\u2500\u2500 \u001b[1;36mtest-data\u001b[0m\n    \u2502\u00a0\u00a0 \u251c\u2500\u2500 \u001b[1;36mdir-0001\u001b[0m\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u251c\u2500\u2500 file-0001.bin\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u251c\u2500\u2500 file-0002.bin\n    \u2502\u00a0\u00a0 \u2502\u00a0\u00a0 \u2514\u2500\u2500 file-0003.bin\n    \u2502\u00a0\u00a0 \u2514\u2500\u2500 \u001b[1;36mdir-0003\u001b[0m\n    \u2502\u00a0\u00a0     \u251c\u2500\u2500 file-0001.bin\n    \u2502\u00a0\u00a0     \u251c\u2500\u2500 file-0002.bin\n    \u2502\u00a0\u00a0     \u2514\u2500\u2500 file-0003.bin\n    \u2514\u2500\u2500 visualize.py\n    \n    4 directories, 14 files\n\n\n\n```python\nxvc_test_data.file().recheck(\"test-data/dir-0002/\")\n```\n\n\n\n\n    ''\n\n\n\n\n```python\nprint(xvc_test_data.file().list(\"test-data/dir-0002\"))\n```\n\n    SS         131 2024-06-16 09:49:53 41e16be7          test-data/dir-0002/file-0003.bin\n    SS         131 2024-06-16 09:49:53 27f0efd0          test-data/dir-0002/file-0002.bin\n    SS         131 2024-06-16 09:49:53 66de5084          test-data/dir-0002/file-0001.bin\n    Total #: 3 Workspace Size:         393 Cached Size:        6006\n    \n\n\nNote that recheck (checkout) methods for files are also tracked and you don't have to specify them every time. \n\n## Sharing files\n\nXvc can copy cached files to [S3](https://docs.xvc.dev/ref/xvc-storage-new-s3) compatible servers like [MinIO](https://docs.xvc.dev/ref/xvc-storage-new-minio) or [Google Cloud Storage](https://docs.xvc.dev/ref/xvc-storage-new-gcs); [locally accessible](https://docs.xvc.dev/ref/xvc-storage-new-local) paths like NFS or external drives; [SSH servers with Rsync](https://docs.xvc.dev/ref/xvc-storage-new-rsync) and via [external commands](https://docs.xvc.dev/ref/xvc-storage-new-generic) like rclone or s5cmd. For the purposes of this introduction, we are going to use S3.  \n\nNote that Xvc doesn't store any credentials and rely on environment variables or existing SSH connections. \n\nLet's start by configuring the S3 connection that we'll use. \n\n\n```python\nxvc_test_data.storage().new_s3(name=\"backup\", bucket_name=\"xvc-test\", region=\"eu-central-1\", storage_prefix=\"xvc-storage\")\n```\n\n\n\n\n    ''\n\n\n\nYou can now send the files you track to this storage.\n\n\n```python\nxvc_test_data.file().send(storage=\"backup\")\n\n```\n\n\n\n\n    ''\n\n\n\nWhen you (or someone else) want to access these files later, you can clone the Git repository and get the files from the\nstorage.\n\n```shell\n$ git clone https://example.com/my-machine-learning-project\nCloning into 'my-machine-learning-project'...\n```\n\n\n\n```python\nxvc_test_data.file().bring(\"test-data/\", storage=\"backup\")\n```\n\n\n\n\n    ''\n\n\n\nYou can also share the files with others for a period if your storage type supports signed URLs. The default period is 24 hours. \n\n\n```python\nxvc_test_data.file().share(\"test-data/dir-0001/file-0001.bin\", storage=\"backup\", period=\"1h\")\n```\n\n\n\n\n    'https://xvc-test.s3.eu-central-1.amazonaws.com/xvc-test/xvc-storage/3c7f8146-df25-4c22-b0e6-bb20d600acaf/b3/66d/e50/84c7bbab22aa25db35e7e28c16953764cc9036d1782a58444eb3b9394b/0.bin?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAJOXNDTB55QE3GOEQ%2F20240616%2Feu-central-1%2Fs3%2Faws4_request&X-Amz-Date=20240616T094957Z&X-Amz-Expires=86400&X-Amz-SignedHeaders=host&X-Amz-Signature=d82f59453c95abc9421bac2a12605373708b678c6a1e48d8b2734eb34bd22942'\n\n\n\nThis approach ensures convenient access to files from the shared storage when needed. You can share the URL with anyone and they will be able to download your file within this period. \n\n\n\nYou don't have to reconfigure the storage after cloning, but you need to have valid credentials as environment variables\nto access the storage.\n\nXvc never stores any credentials.\n\n\n# Pipelines\n\nIf you have commands that depend on data or code elements, you can configure a pipeline.\n\n\n\nFor this example, we'll use [a Python script](https://github.com/iesahin/xvc/blob/main/workflow_tests/templates/README.in/generate_data.py) to generate a data set with random names with random IQ scores.\n\nThe script uses the Faker library and this library must be available where you run the pipeline. To make it repeatable, we start the pipeline by adding a step that creates the virtual environment and installs dependencies.\n\n\n```python\npipeline = xvc_test_data.pipeline()\n```\n\n\n```python\npipeline.step().new(step_name=\"install-deps\", command=\"python3 -m pip install -r requirements.txt\")\n```\n\n\n\n\n    ''\n\n\n\nWe'll make `install-deps` step to depend on `requirements.txt` file, so when the requirements file changes it will make the step run. \n\n\n```python\npipeline.step().dependency(step_name=\"install-deps\", file=\"requirements.txt\")\n\n```\n\n\n\n\n    ''\n\n\n\n\nXvc allows to create dependencies between pipeline steps. Dependent steps wait for dependencies to finish successfully. \n\nNow we create a step to run the script and make `install-deps` step a dependency of it. \n\n\n```python\n\n```\n\n\n```python\npipeline.step().new(step_name=\"generate-data\", command=\"python3 generate_data.py\")\npipeline.step().dependency(step_name=\"generate-data\", step=\"install-deps\")\n```\n\n\n\n\n    ''\n\n\n\nAfter you define the pipeline, you can run it by:\n\n\n```python\nprint(pipeline.run())\n\n```\n\n    [OUT] [install-deps] Requirement already satisfied: faker in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from -r requirements.txt (line 1)) (25.8.0)\n    Requirement already satisfied: seaborn in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from -r requirements.txt (line 2)) (0.13.2)\n    Requirement already satisfied: python-dateutil>=2.4 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from faker->-r requirements.txt (line 1)) (2.9.0.post0)\n    Requirement already satisfied: numpy!=1.24.0,>=1.20 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from seaborn->-r requirements.txt (line 2)) (1.26.4)\n    Requirement already satisfied: pandas>=1.2 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from seaborn->-r requirements.txt (line 2)) (2.2.2)\n    Requirement already satisfied: matplotlib!=3.6.1,>=3.4 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from seaborn->-r requirements.txt (line 2)) (3.9.0)\n    Requirement already satisfied: contourpy>=1.0.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (1.2.1)\n    Requirement already satisfied: cycler>=0.10 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (0.12.1)\n    Requirement already satisfied: fonttools>=4.22.0 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (4.53.0)\n    Requirement already satisfied: kiwisolver>=1.3.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (1.4.5)\n    Requirement already satisfied: packaging>=20.0 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (24.0)\n    Requirement already satisfied: pillow>=8 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (10.3.0)\n    Requirement already satisfied: pyparsing>=2.3.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from matplotlib!=3.6.1,>=3.4->seaborn->-r requirements.txt (line 2)) (3.1.2)\n    Requirement already satisfied: pytz>=2020.1 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from pandas>=1.2->seaborn->-r requirements.txt (line 2)) (2024.1)\n    Requirement already satisfied: tzdata>=2022.7 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from pandas>=1.2->seaborn->-r requirements.txt (line 2)) (2024.1)\n    Requirement already satisfied: six>=1.5 in /Users/iex/github.com/iesahin/xvc.py/.venv/lib/python3.12/site-packages (from python-dateutil>=2.4->faker->-r requirements.txt (line 1)) (1.16.0)\n     [DONE] install-deps (python3 -m pip install -r requirements.txt)[OUT] [generate-data] CSV file generated successfully.\n     [DONE] generate-data (python3 generate_data.py)\n\n\n\n\nXvc allows many kinds of dependencies, like [files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#file-dependencies), \n[groups of files and directories defined by globs](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#glob-dependencies), \n[regular expression searches in files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#regex-dependencies), \n[line ranges in files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#line-dependencies), \n[hyper-parameters defined in YAML, JSON or TOML files](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#hyper-parameter-dependencies)\n[HTTP URLs](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#url-dependencies),\n[shell command outputs](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#generic-command-dependencies), \nand [other steps](https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#step-dependencies). \n\n> \ud83d\udd05\n> The rest of this document introduces some of these dependencies. **Please do not panic** if you're not into regexes or hyperparameters. \n\n\n\nSuppose you're only interested in the IQ scores of those with _Dr._ in front of their names and how they differ from the rest in the dataset we created. Let's create a regex search dependency to the data file that will show all _doctors_ IQ scores.\n\n\n\n```python\npipeline.step().new(step_name=\"dr-iq\", command='echo \"${XVC_REGEX_ADDED_ITEMS}\" >> dr-iq-scores.csv')\npipeline.step().dependency(step_name=\"dr-iq\", regex_items=r'random_names_iq_scores.csv:/^Dr\\..*')\n\n```\n\n\n\n\n    ''\n\n\n\n\n\nThe first line specifies a command, when run writes `${XVC_REGEX_ADDED_ITEMS}` environment variable to `dr-iq-scores.csv` file. \nThe second line specifies the dependency which will also populate the `$[XVC_REGEX_ADDED_ITEMS]` environment variable in the command. \n\n\n\nSome dependency types like [regex items], \n[line items] and [glob items] inject environment variables in the commands they are a dependency.\nFor example, if you have two million files specified with a glob, but want to run a script only on the added files after the last run, you can use these environment variables. \n\n[regex items]: https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#regex-item-dependencies\n[line items]: https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#line-item-dependencies\n[glob items]: https://docs.xvc.dev/ref/xvc-pipeline-step-dependency#glob-item-dependencies\n\n\nWhen you run the pipeline again, a file named `dr-iq-scores.csv` will be created. Note that, as `requirements.txt` didn't change `install-deps` step and its dependent `generate-data` steps didn't run.\n\n\n\n```python\nprint(pipeline.run())\n```\n\n    [DONE] dr-iq (echo \"${XVC_REGEX_ADDED_ITEMS}\" >> dr-iq-scores.csv)\n\n\n\n\n\nWe are using this feature to get lines starting with `Dr.` from the file and write them to another file. When the file changes, e.g. another record matching the dependency regex added to the `random_names_iq_scores.csv` file, it will also be added to `dr-iq-scores.csv` file.\n\n\n\n```python\n\n!zsh -cl 'echo \"Dr. Albert Einstein,144\" >> random_names_iq_scores.csv'\n\nprint(pipeline.run())\n\n```\n\n    [DONE] dr-iq (echo \"${XVC_REGEX_ADDED_ITEMS}\" >> dr-iq-scores.csv)\n\n\n\n\nNow we want to add a another command that draws a fancy histogram from `dr-iq-scores.csv`. As this new step must wait `dr-iq-scores.csv` file to be ready, we'll define `dr-iq-scores.csv` as an _output_ of `dr-iq` step and set the file as a dependency to this new `visualize` step.\n\n\n\n```python\npipeline.step().output(step_name=\"dr-iq\", output_file=\"dr-iq-scores.csv\")\n\npipeline.step().new(step_name=\"visualize\", command='python3 visualize.py')\npipeline.step().dependency(step_name=\"visualize\", file=\"dr-iq-scores.csv\")\n\nprint(pipeline.run())\n\n\n```\n\n    [DONE] dr-iq (echo \"${XVC_REGEX_ADDED_ITEMS}\" >> dr-iq-scores.csv)[ERROR] Step visualize finished UNSUCCESSFULLY with command python3 visualize.py\n\n\n\nYou can get the pipeline in Graphviz DOT or mermaid format to convert to an image.\n\n\n\n```python\nprint(pipeline.dag(format=\"mermaid\"))\n```\n\n    flowchart TD\n        n0[\"install-deps\"]\n        n1[\"requirements.txt\"] --> n0\n        n2[\"generate-data\"]\n        n0[\"install-deps\"] --> n2\n        n3[\"dr-iq\"]\n        n4[\"visualize\"]\n        n5[\"dr-iq-scores.csv\"] --> n4\n    \n\n\n```mermaid\nflowchart TD\n    n0[\"install-deps\"]\n    n1[\"requirements.txt\"] --> n0\n    n2[\"generate-data\"]\n    n0[\"install-deps\"] --> n2\n    n3[\"dr-iq\"]\n    n4[\"visualize\"]\n    n5[\"dr-iq-scores.csv\"] --> n4\n```\n\nYou can also export and import the pipeline to JSON to edit in your editor.\n\n\n```python\npipeline.export(file=\"default-pipeline.json\")\n```\n\n\n\n\n    ''\n\n\n\n\n```python\n!cat default-pipeline.json\n```\n\n    \u001b[38;5;238m\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u252c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u001b[0m\n           \u001b[38;5;238m\u2502 \u001b[0mFile: \u001b[1mdefault-pipeline.json\u001b[0m\n    \u001b[38;5;238m\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u253c\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u001b[0m\n    \u001b[38;5;238m   1\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m   2\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m  \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mname\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdefault\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m   3\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m  \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47msteps\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\n    \u001b[38;5;238m   4\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m   5\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mcommand\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mpython3 -m pip install -r requirements.txt\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m   6\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\n    \u001b[38;5;238m   7\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m        \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m   8\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m          \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mFile\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m   9\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mcontent_digest\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  10\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47malgorithm\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mBlake3\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  11\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdigest\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\n    \u001b[38;5;238m  12\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m43\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  13\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m86\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  14\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m244\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  15\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m111\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  16\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m13\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  17\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m243\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  18\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m28\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  19\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m110\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  20\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m140\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  21\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m213\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  22\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m105\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  23\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m20\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  24\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m239\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  25\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m62\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  26\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m73\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  27\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m75\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  28\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m13\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  29\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m146\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  30\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m82\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  31\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m17\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  32\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m148\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  33\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m152\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  34\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m66\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  35\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m86\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  36\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m154\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  37\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m230\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  38\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m154\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  39\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m246\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  40\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m213\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  41\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m214\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  42\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m40\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  43\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;104;77;153m119\u001b[0m\n    \u001b[38;5;238m  44\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\n    \u001b[38;5;238m  45\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  46\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mpath\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mrequirements.txt\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  47\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mxvc_metadata\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  48\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mfile_type\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mFile\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  49\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mmodified\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  50\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mnanos_since_epoch\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;104;77;153m431372139\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  51\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m                \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47msecs_since_epoch\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;104;77;153m1718467910\u001b[0m\n    \u001b[38;5;238m  52\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  53\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m              \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47msize\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;104;77;153m14\u001b[0m\n    \u001b[38;5;238m  54\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  55\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m          \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  56\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m        \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  57\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  58\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47minvalidate\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mByDependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  59\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mname\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47minstall-deps\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  60\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47moutputs\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\n    \u001b[38;5;238m  61\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  62\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  63\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mcommand\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mpython3 generate_data.py\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  64\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\n    \u001b[38;5;238m  65\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m        \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  66\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m          \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mStep\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  67\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mname\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47minstall-deps\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\n    \u001b[38;5;238m  68\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m          \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  69\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m        \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  70\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  71\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47minvalidate\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mByDependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  72\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mname\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mgenerate-data\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  73\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47moutputs\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\n    \u001b[38;5;238m  74\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  75\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  76\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mcommand\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mecho \u001b[0m\u001b[38;2;104;77;153m\\\"\u001b[0m\u001b[38;2;153;143;47m${XVC_REGEX_ADDED_ITEMS}\u001b[0m\u001b[38;2;104;77;153m\\\"\u001b[0m\u001b[38;2;153;143;47m >> dr-iq-scores.csv\u001b[0m\n    \u001b[38;5;238m    \u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  77\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  78\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47minvalidate\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mByDependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  79\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mname\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdr-iq\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  80\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47moutputs\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\n    \u001b[38;5;238m  81\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  82\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  83\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mcommand\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mpython3 visualize.py\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  84\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\n    \u001b[38;5;238m  85\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m        \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  86\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m          \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mFile\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m{\u001b[0m\n    \u001b[38;5;238m  87\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mcontent_digest\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;104;77;153mnull\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  88\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mpath\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mdr-iq-scores.csv\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  89\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m            \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mxvc_metadata\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;104;77;153mnull\u001b[0m\n    \u001b[38;5;238m  90\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m          \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  91\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m        \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  92\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  93\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47minvalidate\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mByDependencies\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  94\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mname\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mvisualize\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  95\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m      \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47moutputs\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;73;72;62m[\u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\n    \u001b[38;5;238m  96\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m    \u001b[0m\u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m  97\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m  \u001b[0m\u001b[38;2;73;72;62m]\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  98\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m  \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mversion\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;104;77;153m1\u001b[0m\u001b[38;2;73;72;62m,\u001b[0m\n    \u001b[38;5;238m  99\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m  \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47mworkdir\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;73;72;62m:\u001b[0m\u001b[38;2;73;72;62m \u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\u001b[38;2;153;143;47m\"\u001b[0m\n    \u001b[38;5;238m 100\u001b[0m   \u001b[38;5;238m\u2502\u001b[0m \u001b[38;2;73;72;62m}\u001b[0m\n    \u001b[38;5;238m\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2534\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u2500\u001b[0m\n\n\n\nYou can edit the file (in Jupyter lab or in your editor) to change commands, add new dependencies, etc. and import it back to Xvc with `import_pipeline` method. You can create copies of pipelines by either editing the pipeline name in the file or supplying the name with `name` argument. \n\n\n> \ud83d\udd05 `import_pipeline` is `xvc pipeline import` in command line. Renamed because `import` is a reserved word. \n\n\n```python\npipeline.import_pipeline(file=\"default-pipeline.json\", name=\"another-pipeline\")\n\n```\n\n\n\n\n    ''\n\n\n\nNow you can list all pipelines and run `another-pipeline` if you want to.\n\n\n```python\nprint(pipeline.list())\n```\n\n    +------------------+---------+\n    | Name             | Run Dir |\n    +============================+\n    | default          |         |\n    |------------------+---------|\n    | another-pipeline |         |\n    +------------------+---------+\n\n\n\n\nPlease create an issue or discussion for any other kinds of dependencies that you'd like to be included.\n\nI'm planning to add [data label and annotations tracking](https://github.com/iesahin/xvc/discussions/208)), [experiments tracking](https://github.com/iesahin/xvc/discussions/207)), [model tracking](https://github.com/iesahin/xvc/discussions/211)), encrypted cache, server to control all commands from a web interface, and more as my time permits.\n\nPlease check [`docs.xvc.dev`](https://docs.xvc.dev) for documentation.\n\n\n\n\n\n## \ud83e\udd1f Big Thanks\n\nxvc stands on the following (giant) crates:\n\n- [trycmd] is used to run all example commands in this file, [reference, and how-to documentation](https://docs.xvc.dev) at\n  every PR. It makes sure that the documentation is always up-to-date and shown commands work as described. We start\n  development by writing documentation and implementing them thanks to [trycmd].\n\n- [serde] allows all data structures to be stored in text files. Special thanks from [`xvc-ecs`] for serializing components in an ECS with a single line of code.\n\n- Xvc processes files in parallel with pipelines and parallel iterators thanks to [crossbeam] and [rayon].\n\n- Thanks to [strum], Xvc uses enums extensively and converts almost everything to typed values from strings.\n\n- Xvc has a deep CLI that has subcommands of subcommands (e.g. `xvc storage new s3`), and all these work with minimum bugs thanks to [clap].\n\n- Xvc uses [rust-s3] to connect to S3 and compatible storage services. It employs excellent [tokio] for fast async Rust. These cloud storage features can be turned off thanks to Rust conditional compilation.\n\n- Without implementations of [BLAKE3], BLAKE2, SHA-2 and SHA-3 from Rust [crypto] crate, Xvc couldn't detect file changes so fast.\n\n- Many thanks to small and well built crates, [reflink], [relative-path], [path-absolutize], [glob] for file system and glob handling.\n\n- Thanks to [sad_machine] for providing a State Machine implementation that I used in `xvc pipeline run`. A DAG composed of State Machines made running pipeline steps in parallel with a clean separation of process states.\n\n- Thanks to [thiserror] and [anyhow] for making error handling a breeze. These two crates make me feel I'm doing something good for the humanity when handling errors.\n\n- Xvc is split into many crates and owes this organization to [cargo workspaces].\n\n[crossbeam]: https://docs.rs/crossbeam/\n[cargo workspaces]: https://crates.io/crates/cargo-workspaces\n[rayon]: https://docs.rs/rayon/\n[strum]: https://docs.rs/strum/\n[clap]: https://docs.rs/clap/\n[serde]: https://serde.rs\n[blake3]: https://docs.rs/blake3/\n[crypto]: https://docs.rs/rust-crypto/\n[reflink]: https://docs.rs/reflink/\n[relative-path]: https://docs.rs/relative-path/\n[path-absolutize]: https://docs.rs/path-absolutize/\n[glob]: https://docs.rs/glob/\n[wax]: https://docs.rs/wax/\n[trycmd]: https://docs.rs/trycmd/\n[sad_machine]: https://docs.rs/sad_machine/\n[thiserror]: https://docs.rs/thiserror/\n[anyhow]: https://docs.rs/anyhow/\n[rust-s3]: https://docs.rs/rust-s3/\n[`xvc-ecs`]: https://docs.rs/xvc-ecs/\n[tokio]: https://tokio.rs\n\nAnd, biggest thanks to Rust designers, developers and contributors. Although I can't see myself expert to appreciate it all, it's a fabulous language and environment to work with.\n\n\n## \ud83d\ude81 Support\n\n- You can use [Discussions](https://github.com/iesahin/xvc/discussions) to ask questions. I'll answer as much as possible. Thank you.\n- I don't follow any other sites regularly. You can also reach me at [emre@xvc.dev](mailto:emre@xvc.dev)\n\n\n\n## \ud83d\udc50 Contributing\n\n- Star this repo. I feel very happy for every star and send my best wishes to you. That's a certain win to spend your two seconds for me. Thanks.\n- Use xvc. Tell me how it works for you, read the [documentation](https://docs.xvc.dev), [report bugs](https://github.com/iesahin/xvc/issues), [discuss features](https://github.com/iesahin/xvc/discussions).\n- Please note that, I don't accept large code PRs. Please open an issue to discuss your idea and write/modify a\n  reference page before sending a PR. I'm happy to discuss and help you to implement your idea. Also, it may require a copyright transfer to me, as there may be cases which I provide the code in other licenses. \n\n\n\n## \ud83d\udcdc License\n\nXvc and these Python bindings are licensed under the [GNU GPL 3.0 License](https://github.com/iesahin/xvc/blob/main/LICENSE). If you want to use the code in your project with other licenses, please contact me.\n\n\n## \ud83c\udf26\ufe0f Future and Maintenance\n\nI'm using Xvc daily and I'm happy with it. Tracking all my files with Git via arbitrary servers and cloud providers is\nsomething I always need. I'm happy to improve and maintain it as long as I use it.\n\nGiven that I'm working on this for the last two years for pure technical bliss, you can expect me to work on it more. \n\n\n\n## \u26a0\ufe0f Disclaimer\n\nThis software is fresh and ambitious. Although I use it and test it close to real-world conditions, it didn't go under\nthe test of time. **Xvc can eat your files and spit them into the eternal void!** Please take backups.\n\n\n",
    "bugtrack_url": null,
    "license": null,
    "summary": "An MLOps tool to manage data files and pipelines on top of Git",
    "version": "0.6.13",
    "project_urls": {
        "Bug Reports": "https://github.com/iesahin/xvc.py/issues",
        "Documentation": "https://docs.xvc.dev",
        "Homepage": "https://xvc.dev",
        "Source": "https://github.com/iesahin/xvc.py/"
    },
    "split_keywords": [
        "machine-learning",
        " development",
        " devops",
        " git"
    ],
    "urls": [
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "2ef35819211e1700a93bf9f60ea176070c7433f48f86824add1da0609fdfc721",
                "md5": "799662a31dfb23df2dec2b82ef92144d",
                "sha256": "12a2e4119a0d32a4f5f867f2bf82ac23c272442ca885e07dd3a4712773f01dad"
            },
            "downloads": -1,
            "filename": "xvc-0.6.13-cp37-abi3-macosx_11_0_arm64.whl",
            "has_sig": false,
            "md5_digest": "799662a31dfb23df2dec2b82ef92144d",
            "packagetype": "bdist_wheel",
            "python_version": "cp37",
            "requires_python": ">=3.7",
            "size": 7818789,
            "upload_time": "2025-01-01T08:20:42",
            "upload_time_iso_8601": "2025-01-01T08:20:42.547742Z",
            "url": "https://files.pythonhosted.org/packages/2e/f3/5819211e1700a93bf9f60ea176070c7433f48f86824add1da0609fdfc721/xvc-0.6.13-cp37-abi3-macosx_11_0_arm64.whl",
            "yanked": false,
            "yanked_reason": null
        },
        {
            "comment_text": null,
            "digests": {
                "blake2b_256": "33f31a04090e05edbf2b13588a28e22c4ee10956a1fdfa201c5e99dd4ed2ece7",
                "md5": "0520f61def95cd838f3421ac08877434",
                "sha256": "4f9cc1940146a1d86847dcb2aba61383fab49df3b1fca528dcd6418b91ecfd06"
            },
            "downloads": -1,
            "filename": "xvc-0.6.13.tar.gz",
            "has_sig": false,
            "md5_digest": "0520f61def95cd838f3421ac08877434",
            "packagetype": "sdist",
            "python_version": "source",
            "requires_python": ">=3.7",
            "size": 81272,
            "upload_time": "2025-01-01T08:20:46",
            "upload_time_iso_8601": "2025-01-01T08:20:46.671264Z",
            "url": "https://files.pythonhosted.org/packages/33/f3/1a04090e05edbf2b13588a28e22c4ee10956a1fdfa201c5e99dd4ed2ece7/xvc-0.6.13.tar.gz",
            "yanked": false,
            "yanked_reason": null
        }
    ],
    "upload_time": "2025-01-01 08:20:46",
    "github": true,
    "gitlab": false,
    "bitbucket": false,
    "codeberg": false,
    "github_user": "iesahin",
    "github_project": "xvc.py",
    "travis_ci": false,
    "coveralls": false,
    "github_actions": true,
    "requirements": [
        {
            "name": "maturin",
            "specs": []
        },
        {
            "name": "jupyterlab",
            "specs": []
        },
        {
            "name": "pytest-forked",
            "specs": []
        },
        {
            "name": "pytest-cov",
            "specs": []
        },
        {
            "name": "coverage",
            "specs": []
        }
    ],
    "lcname": "xvc"
}
        
Elapsed time: 0.80773s