# MTB Video Sync MVP — Iterative Ticket Flow
[![PyPI version](https://badge.fury.io/py/mtbsync.svg)](https://badge.fury.io/py/mtbsync)
[![CI](https://github.com/markg72/goprosync/actions/workflows/ci.yml/badge.svg)](https://github.com/markg72/goprosync/actions/workflows/ci.yml)
[![Quality Gate](https://github.com/markg72/goprosync/actions/workflows/quality-gate.yml/badge.svg)](https://github.com/markg72/goprosync/actions/workflows/quality-gate.yml)
[![Python 3.11+](https://img.shields.io/badge/python-3.11%2B-blue.svg)](https://www.python.org/downloads/)
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
A modular, step‑by‑step plan to build a Python MVP that aligns a new MTB GoPro run to a reference run, then transfers reference markers to the new run. Designed for use with Claude Code or the OpenAI CLI.
## Quickstart
Install from PyPI:
```bash
pip install mtbsync
```
The package includes helper scripts and documentation:
- **Helper scripts**: `examples/visual_diagnostic.sh`, `diagnose_alignment.sh`, `resync_strict.sh`, `dashboard_cleanup.sh`
- **Documentation**: `QUICK_VISUAL_DIAGNOSTIC.md`, `examples/CI_CD_INTEGRATION.md`, `examples/README.md`
Basic usage:
```bash
# Index reference video
mtbsync index --reference ref.mp4 --fps 3 --out ref_index.npz
# Sync new video to reference
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv --fps 3
# Transfer markers
mtbsync transfer-markers --ref-markers ref_markers.csv --timewarp-json timewarp.json --out new_markers.csv
# Export to NLE formats
mtbsync export-markers new_markers.csv --preset fcpxml
mtbsync export-markers new_markers.csv --preset premiere
mtbsync export-markers new_markers.csv --preset resolve-edl
```
---
## Project Brief
**Goal:** Build a Python MVP that aligns a new MTB GoPro run to a reference run of the same trail, then transfers reference markers to the new run. Use computer vision (CV) with optional GPS. Export markers for NLEs (EDL/CSV).
**Core pipeline**
1. **Ingest**: MP4 + optional GPX/FIT.
2. **Reference indexing**: extract keyframes (2–5 fps), compute descriptors (start with ORB).
3. **Coarse sync**: (a) if GPS present → distance-based alignment; else (b) visual keyframe retrieval.
4. **Fine sync**: local feature matching + RANSAC → time pairs `(t_new → t_ref, confidence)`.
5. **Time‑warp fit**: monotonic mapping via DTW or a monotone spline with smoothing.
6. **Marker transfer**: map `t_ref → t_new`, attach confidence; flag low‑confidence.
7. **Export**: CSV + CMX3600 EDL; optional FCPXML later.
8. **Preview**: simple Streamlit UI for side‑by‑side, scrub‑synced playback and marker review.
**Non‑goals for MVP:** cloud, user auth, multi‑hour videos, SuperPoint/RAFT (phase 2).
**Tech:** Python 3.11+, OpenCV, NumPy, SciPy, ffmpeg‑python, pandas, fastdtw, gpxpy, Streamlit, Typer/Rich.
---
## Repository Layout
```
mtb-sync/
  README.md
  pyproject.toml              # or requirements.txt
  src/
    mtbsync/__init__.py
    mtbsync/cli.py
    mtbsync/io/video.py
    mtbsync/io/gps.py
    mtbsync/features/keyframes.py
    mtbsync/features/descriptors.py
    mtbsync/match/retrieval.py
    mtbsync/match/local.py
    mtbsync/align/timewarp.py
    mtbsync/markers/schema.py
    mtbsync/markers/transfer.py
    mtbsync/export/csv_export.py
    mtbsync/export/edl_export.py
    mtbsync/ui/app.py
  tests/
    test_timewarp.py
    test_marker_transfer.py
    test_edl_export.py
  data/
    sample_reference.mp4
    sample_new.mp4
    sample_reference_markers.csv
    sample_reference.gpx
```
---
## Tickets
> Work through these tickets in order. Each has acceptance criteria so an AI assistant can implement, verify, and move on cleanly.
### 0) Repo Scaffold
**Implement**
- Create the folder structure above.
- Configure packaging: `pip install -e .` enables local dev.
- Provide a `pyproject.toml` (or `requirements.txt`) with pinned deps.
**Acceptance Criteria**
- Virtual env setup works.
- `mtbsync --help` available after editable install.
---
### 1) CLI Surface
**Implement**
- Subcommands:
```
mtbsync index --reference ref.mp4 --fps 3 --out cache/ref_index.npz
mtbsync sync --reference ref.mp4 --new new.mp4 \
[--ref-gpx ref.gpx] [--new-gpx new.gpx] \
--index cache/ref_index.npz --out cache/pairs.csv
mtbsync warp --pairs cache/pairs.csv --out cache/warp.npz
mtbsync transfer --reference-markers data/ref_markers.csv --warp cache/warp.npz \
--out out/new_markers.csv --review-threshold 0.6
mtbsync export edl --markers out/new_markers.csv --out out/new_markers.edl \
[--reel 001] [--fps 29.97]
mtbsync preview --reference ref.mp4 --new new.mp4 --markers out/new_markers.csv
```
**Acceptance Criteria**
- Robust arg validation and concise summaries via `rich`.
- No global state; each command is independently runnable.
---
### 2) Video IO & Keyframes
**Implement**
- `io/video.py`:
- `extract_keyframes(video_path, fps) -> List[(t_sec, frame_bgr)]`
- `video_fps(video_path) -> float`
- `features/keyframes.py`:
- Resize frames to max dim 960 px (preserve aspect).
**Acceptance Criteria**
- Deterministic timestamp sampling (handles VFR).
- Errors handled gracefully (bad file, zero‑length, etc.).
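For illustration, a minimal sketch of the two `io/video.py` functions under the OpenCV decode path; the exact error handling beyond the spec above is an assumption:
```python
# Sketch only: sample frames at a fixed wall-clock rate using decoder timestamps,
# which keeps sampling deterministic even for variable-frame-rate (VFR) files.
from typing import List, Tuple

import cv2
import numpy as np


def video_fps(video_path: str) -> float:
    """Return the container-reported FPS (nominal for VFR sources)."""
    cap = cv2.VideoCapture(video_path)
    if not cap.isOpened():
        raise IOError(f"Cannot open video: {video_path}")
    fps = cap.get(cv2.CAP_PROP_FPS)
    cap.release()
    if fps <= 0:
        raise ValueError(f"Invalid FPS reported for {video_path}")
    return fps


def extract_keyframes(video_path: str, fps: float) -> List[Tuple[float, np.ndarray]]:
    """Decode sequentially; keep the first frame at or past each sample time."""
    cap = cv2.VideoCapture(video_path)
    if not cap.isOpened():
        raise IOError(f"Cannot open video: {video_path}")
    step = 1.0 / fps
    next_t = 0.0
    out: List[Tuple[float, np.ndarray]] = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        t = cap.get(cv2.CAP_PROP_POS_MSEC) / 1000.0  # presentation timestamp
        if t >= next_t:
            out.append((t, frame))
            next_t += step
    cap.release()
    if not out:
        raise ValueError(f"No frames decoded from {video_path}")
    return out
```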
---
### 3) Descriptors
**Implement**
- `features/descriptors.py` with ORB (grayscale + optional CLAHE).
- Functions return keypoints and descriptors and can handle sparse scenes.
**Acceptance Criteria**
- Target ~800 keypoints per keyframe when available.
- Save index with timestamps, (x,y,angle,scale), and descriptors (uint8) to `.npz`.
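A minimal sketch of the descriptor step; the CLAHE parameters here are illustrative assumptions:
```python
# Sketch only: ORB on grayscale, with optional CLAHE for low-contrast scenes.
import cv2
import numpy as np


def compute_orb(frame_bgr: np.ndarray, n_features: int = 800, use_clahe: bool = True):
    """Return (keypoints, descriptors); descriptors is None for featureless frames."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    if use_clahe:
        # Local contrast equalisation helps in shaded or backlit trail sections.
        gray = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8)).apply(gray)
    orb = cv2.ORB_create(nfeatures=n_features)
    kps, desc = orb.detectAndCompute(gray, None)
    # desc is uint8 with shape (n_kp, 32), matching the .npz layout above.
    return kps, desc
```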
---
### 4) Retrieval (Coarse Visual Alignment)
**Implement**
- `match/retrieval.py`: brute‑force Hamming + Lowe ratio + voting to pick top‑K reference keyframes for each new keyframe.
**Acceptance Criteria**
- Output `pairs_raw.csv` with `t_new, t_ref, score, n_inliers`.
- A 5-minute clip sampled at 3 fps completes in roughly 2 minutes or less on a laptop.
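One possible shape for the retrieval step, as a sketch; the ratio value and voting scheme are assumptions:
```python
# Sketch only: brute-force Hamming matching with Lowe's ratio test, then
# vote for the reference keyframes with the most surviving matches.
from collections import Counter

import cv2


def top_k_candidates(desc_new, ref_descs, k: int = 3, ratio: float = 0.75):
    """ref_descs: list of (t_ref, descriptors) per reference keyframe."""
    if desc_new is None or len(desc_new) < 2:
        return []
    bf = cv2.BFMatcher(cv2.NORM_HAMMING)
    votes = Counter()
    for t_ref, desc_ref in ref_descs:
        if desc_ref is None or len(desc_ref) < 2:
            continue
        matches = bf.knnMatch(desc_new, desc_ref, k=2)
        good = sum(
            1 for pair in matches
            if len(pair) == 2 and pair[0].distance < ratio * pair[1].distance
        )
        votes[t_ref] = good
    return votes.most_common(k)  # [(t_ref, n_good), ...] best first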
---
### 5) Local Refinement
**Implement**
- `match/local.py`: For each candidate, refine around ±1 keyframe window, compute homography/affine with RANSAC, and keep best match.
**Acceptance Criteria**
- Output `pairs.csv` with `(t_new, t_ref, confidence)`.
- Reject outliers by reprojection error; ≥70% valid pairs on sample data.
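A sketch of the geometric verification; reporting confidence as the inlier ratio is an assumption:
```python
# Sketch only: verify a candidate pair with a RANSAC homography and report
# an inlier count plus a crude confidence score.
import cv2
import numpy as np


def verify_pair(kps_new, desc_new, kps_ref, desc_ref, ratio: float = 0.75):
    """Return (n_inliers, confidence) for one candidate keyframe pair."""
    bf = cv2.BFMatcher(cv2.NORM_HAMMING)
    good = []
    for pair in bf.knnMatch(desc_new, desc_ref, k=2):
        if len(pair) == 2 and pair[0].distance < ratio * pair[1].distance:
            good.append(pair[0])
    if len(good) < 8:  # too few matches for a stable model
        return 0, 0.0
    src = np.float32([kps_new[m.queryIdx].pt for m in good])
    dst = np.float32([kps_ref[m.trainIdx].pt for m in good])
    # 3 px reprojection threshold rejects outliers per the acceptance criteria.
    H, mask = cv2.findHomography(src, dst, cv2.RANSAC, 3.0)
    if H is None:
        return 0, 0.0
    n_inliers = int(mask.sum())
    return n_inliers, n_inliers / len(good)
```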
---
### 6) GPS Alignment (Optional but Implemented)
**Implement**
- `io/gps.py` using `gpxpy`:
- Parse GPX/FIT (start with GPX).
- Compute cumulative distance and resample to 10 Hz.
- In `sync`, if GPS provided for either side, align distance curves (cross‑correlation) to estimate offset/scale and pre‑seed candidate `t_ref`.
**Acceptance Criteria**
- Works with one or both GPS tracks.
- Falls back cleanly to visual retrieval.
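A sketch of the GPS path, assuming `gpxpy` plus plain NumPy; the resampling and correlation normalisation details are assumptions:
```python
# Sketch only: cumulative distance resampled to 10 Hz, and a cross-correlation
# offset estimate between two runs' speed profiles.
import gpxpy
import numpy as np


def distance_curve(gpx_path: str, hz: float = 10.0):
    """Return (t_grid, cumulative_distance_m) on a uniform time grid."""
    with open(gpx_path) as f:
        gpx = gpxpy.parse(f)
    pts = [p for trk in gpx.tracks for seg in trk.segments for p in seg.points]
    t = np.array([(p.time - pts[0].time).total_seconds() for p in pts])
    d = np.zeros(len(pts))
    for i in range(1, len(pts)):
        d[i] = d[i - 1] + pts[i].distance_2d(pts[i - 1])
    grid = np.arange(0.0, t[-1], 1.0 / hz)
    return grid, np.interp(grid, t, d)


def estimate_offset_sec(d_ref: np.ndarray, d_new: np.ndarray, hz: float = 10.0) -> float:
    """Lag (seconds) that best aligns the two normalised speed profiles."""
    v_ref = np.diff(d_ref) * hz
    v_new = np.diff(d_new) * hz
    v_ref = (v_ref - v_ref.mean()) / (v_ref.std() + 1e-9)
    v_new = (v_new - v_new.mean()) / (v_new.std() + 1e-9)
    xc = np.correlate(v_ref, v_new, mode="full")
    lag = int(xc.argmax()) - (len(v_new) - 1)
    return lag / hz
```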
---
### 7) Time‑Warp Fit
**Implement**
- `align/timewarp.py`:
- Fit a monotonic mapping via **fastdtw** or piecewise linear monotone spline with L2 smoothing.
- Expose `map_t_new_to_t_ref(t_new)` and inverse `map_t_ref_to_t_new(t_ref)`.
**Acceptance Criteria**
- Strict monotonicity; median residual < 0.3 s on sample data.
- Save to `warp.npz` with arrays + metadata.
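One simple way to meet the monotonicity requirement with NumPy alone; the shipped module may instead use fastdtw or a smoothing spline as stated above:
```python
# Sketch only: enforce a non-decreasing mapping on the matched pairs, then
# interpolate both directions with np.interp.
import numpy as np


def fit_monotone_warp(t_new: np.ndarray, t_ref: np.ndarray):
    """Return (map_t_new_to_t_ref, map_t_ref_to_t_new) from noisy pairs."""
    order = np.argsort(t_new)
    x, y = t_new[order], t_ref[order]
    y = np.maximum.accumulate(y)          # clamp out non-monotone noise
    y = y + np.arange(len(y)) * 1e-9      # break ties so the inverse is defined

    def map_t_new_to_t_ref(t):
        return np.interp(t, x, y)

    def map_t_ref_to_t_new(t):
        return np.interp(t, y, x)

    return map_t_new_to_t_ref, map_t_ref_to_t_new
```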
---
### 8) Marker Schema & Transfer
**Implement**
- `markers/schema.py`: reference CSV schema `name,t_ref,colour?,comment?` (seconds float).
- `markers/transfer.py`: apply inverse warp to map each `t_ref → t_new`, interpolate confidence, flag `needs_review` by threshold.
**Acceptance Criteria**
- Output `new_markers.csv` with `name,t_new,confidence,needs_review,comment`.
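A pandas sketch of the transfer step; interpolating confidence from the pair table is an assumption about the implementation:
```python
# Sketch only: map t_ref -> t_new with the inverse warp, attach an
# interpolated confidence, and flag markers below the review threshold.
import numpy as np
import pandas as pd


def transfer_markers(ref_markers: pd.DataFrame, map_t_ref_to_t_new,
                     pair_t_ref: np.ndarray, pair_conf: np.ndarray,
                     threshold: float = 0.6) -> pd.DataFrame:
    """ref_markers columns: name, t_ref[, comment]; pair_t_ref must be sorted."""
    t_ref = ref_markers["t_ref"].to_numpy(dtype=float)
    conf = np.interp(t_ref, pair_t_ref, pair_conf)
    return pd.DataFrame({
        "name": ref_markers["name"],
        "t_new": map_t_ref_to_t_new(t_ref),
        "confidence": conf,
        "needs_review": conf < threshold,
        "comment": ref_markers.get("comment", ""),
    })
```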
---
### 9) Exports
**Implement**
- `export/csv_export.py`: already covered by transfer CSV.
- `export/edl_export.py`: write CMX3600 EDL; 1‑frame events with comments.
**Acceptance Criteria**
- EDL imports in Resolve/Premiere with visible markers.
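A hedged sketch of a CMX3600 writer: exact column spacing varies slightly between NLEs, and the rounded non-drop-frame timecode below is only approximate for 29.97 fps material:
```python
# Sketch only: write markers as 1-frame CMX3600 events with comment lines.
def to_timecode(t_sec: float, fps: float) -> str:
    fr = int(round(fps))                 # NDF approximation for 29.97
    total = int(round(t_sec * fps))
    hh, rem = divmod(total, fr * 3600)
    mm, rem = divmod(rem, fr * 60)
    ss, ff = divmod(rem, fr)
    return f"{hh:02d}:{mm:02d}:{ss:02d}:{ff:02d}"


def write_edl(markers, out_path: str, reel: str = "001", fps: float = 29.97,
              title: str = "MTB SYNC MARKERS") -> None:
    """markers: iterable of (t_new_seconds, comment) tuples."""
    with open(out_path, "w") as f:
        f.write(f"TITLE: {title}\nFCM: NON-DROP FRAME\n\n")
        for i, (t, comment) in enumerate(markers, start=1):
            tc_in = to_timecode(t, fps)
            tc_out = to_timecode(t + 1.0 / fps, fps)   # 1-frame event
            f.write(f"{i:03d}  {reel:<8s} V     C        "
                    f"{tc_in} {tc_out} {tc_in} {tc_out}\n")
            f.write(f"* FROM CLIP NAME: {comment}\n\n")
```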
---
### 10) Preview UI
**Implement**
- `ui/app.py` (Streamlit):
- Inputs: reference/new MP4, markers CSV, warp.npz.
- Side‑by‑side players with linked scrubbing.
- Marker list ±5 s around playhead; colour‑coded by confidence; CSV download.
**Acceptance Criteria**
- `streamlit run src/mtbsync/ui/app.py` launches; approximate sync is acceptable.
---
### 11) Tests & Sample Data
**Implement**
- Unit tests for:
- Monotonic time‑warp and inverse mapping.
- EDL formatting round‑trip sanity.
- Marker transfer shape and thresholds.
- Synthetic fixtures: generate warped time with noise so tests don’t depend on large files.
**Acceptance Criteria**
- `pytest` passes in < 10 s locally.
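A sketch of such a synthetic fixture; it assumes the hypothetical `fit_monotone_warp` from the Ticket 7 sketch lives at `mtbsync.align.timewarp`:
```python
# Sketch only: a synthetic scale+offset warp with Gaussian noise exercises
# monotonicity, residuals, and the round trip without any video files.
import numpy as np

from mtbsync.align.timewarp import fit_monotone_warp  # assumed module path


def test_warp_monotone_and_invertible():
    rng = np.random.default_rng(42)
    t_new = np.sort(rng.uniform(0, 300, 200))              # a 5-minute run
    t_ref = 1.02 * t_new + 1.5 + rng.normal(0, 0.1, 200)   # drift + jitter
    fwd, inv = fit_monotone_warp(t_new, t_ref)
    mapped = fwd(t_new)
    assert np.all(np.diff(mapped) >= 0)                    # monotone mapping
    assert np.median(np.abs(mapped - t_ref)) < 0.3         # residual budget
    assert np.allclose(inv(mapped), t_new, atol=0.5)       # round trip
```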
---
### 12) Dev Ergonomics
**Implement**
- `Makefile`/`justfile`: `setup`, `lint`, `test`, `run-preview`.
- Pre‑commit: `ruff`, `black`, `isort`.
- `README.md`: quick start + example commands.
**Acceptance Criteria**
- One‑command setup and test run.
---
## Prompt Templates
### Claude Code (per‑ticket)
```
You are a senior Python engineer. Implement the next ticket in the mtb-sync repo.
Follow the acceptance criteria exactly. Keep code modular and documented.
<TICKET NAME>: <paste ticket content>
Constraints:
- Python 3.11, no GPU assumptions.
- Pure functions where possible; no global state.
- Use 'rich' for concise logging.
- Add docstrings and type hints.
- If format ambiguities arise, decide and document in README.
After changes:
- List files changed.
- Provide example CLI usage.
- Run unit tests (simulate if environment is not executable) and report results.
```
### OpenAI CLI (chat)
```bash
openai chat.completions.create -m gpt-4.1 \
-g "
You are a senior Python engineer. Implement the following ticket for a project called mtb-sync. Provide complete code blocks for the mentioned files. Explain only what's necessary to run it.
<TICKET NAME + CONTENT>
"
```
---
## Quick Run Guide (for README)
```bash
# 1) Setup
python -m venv .venv && source .venv/bin/activate
pip install -U pip
pip install -e .
# 2) Index reference (3 fps keyframes)
mtbsync index --reference data/sample_reference.mp4 --fps 3 --out cache/ref_index.npz
# 3) Build pairs (with or without GPS)
mtbsync sync --reference data/sample_reference.mp4 --new data/sample_new.mp4 \
--index cache/ref_index.npz --out cache/pairs.csv
# Optional (GPS-assisted):
mtbsync sync --reference data/sample_reference.mp4 --new data/sample_new.mp4 \
--ref-gpx data/sample_reference.gpx --new-gpx data/sample_new.gpx \
--index cache/ref_index.npz --out cache/pairs.csv
# 4) Fit time-warp (automatic during sync, or standalone)
# Note: sync command automatically generates timewarp.json during retrieval
mtbsync warp --pairs cache/pairs.csv --out cache/warp.npz
# 5) Transfer markers (using timewarp.json from sync)
mtbsync transfer-markers --ref-markers data/sample_reference_markers.csv \
--timewarp-json timewarp.json \
--out out/new_markers.csv \
--plot-overlay
# Outputs:
# - new_markers.csv with marker_id,t_ref,t_new_est (+ preserved metadata)
# - new_markers_overlay.png preview
# 6) Export EDL
mtbsync export edl --markers out/new_markers.csv --out out/new_markers.edl --fps 29.97
# 7) Preview UI
streamlit run src/mtbsync/ui/app.py
```
---
## Performance
### Parallel Retrieval
Use `--threads` to enable multi-threaded frame matching:
```bash
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref.npz --out pairs.csv --threads 4
```
### Fast Preset for Bulk Jobs
Use `--fast` to auto-tune parameters for large-scale processing:
```bash
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref.npz --out pairs.csv --fast
```
The `--fast` preset automatically:
- Sets `threads >= 4` (parallel retrieval)
- Tunes warp parameters for speed (relaxed RANSAC iterations/thresholds)
### Timing Information
The `sync` command prints per-stage timings:
- `retrieval_sec` - Frame matching time
- `warp_sec` - Time-warp fitting/gating time
- `markers_sec` - Marker auto-export time
- `total_sec` - End-to-end pipeline time
Batch processing writes timings to `batch_summary.csv` for analysis across multiple pairs.
### ⚡ Performance Benchmarks
| Stage | Mean Time (s) | Notes |
|---------------|---------------|------------------------------------|
| GPS Alignment | 0.28 | Vectorised (`np.interp`) fast-path |
| Retrieval | 1.42 | 4 threads (`ThreadPoolExecutor`) |
| Warp Fit | 0.06 | RANSAC + IRLS refinement |
| Marker Export | 0.11 | CSV → JSON + overlay |
| **Total** | 1.87 ± 0.15 | Typical 1080p pair (8k frames) |
> 💡 Use `--threads 4` or `--fast` for large jobs.
> `batch_summary.csv` records per-stage timings for every pair.
[Source on GitHub](https://github.com/markg72/goprosync)
## Dashboard
Launch a local, zero-dependency dashboard to inspect artefacts:
```bash
# Default port 8760 (no browser auto-open)
mtbsync dashboard --root ./batch_out
# Enable server-side export
mtbsync dashboard --root ./batch_out --allow-write
# Explicitly open browser on launch
mtbsync dashboard --root ./batch_out --open-browser
```
### Ports & Browser Behavior (VS Code/Codespaces)
All UI commands now use **stable default ports** and **never auto-open a browser** by default:
| App | CLI Command | Env Variable | Default Port |
|-----|-------------|--------------|--------------|
| Dashboard | `mtbsync dashboard` | `MTB_DASHBOARD_PORT` | 8760 |
| Telemetry UI | `mtbsync telemetry-ui` | `MTB_TELEMETRY_PORT` | 8761 |
| Compare UI | `mtbsync compare-ui` | `MTB_COMPARE_PORT` | 8762 |
**Port precedence:** `--port` flag > environment variable > default
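The precedence rule is simple to express directly; a minimal sketch (the helper name is hypothetical):
```python
# Sketch only: --port flag beats the env variable, which beats the default.
import os
from typing import Optional


def resolve_port(cli_port: Optional[int], env_var: str, default: int) -> int:
    if cli_port is not None:
        return cli_port                    # explicit --port always wins
    env = os.environ.get(env_var)
    if env:
        return int(env)                    # documented MTB_*_PORT variable
    return default                         # stable default (8760/8761/8762)


# e.g. resolve_port(None, "MTB_DASHBOARD_PORT", 8760) -> 8760 unless overridden
```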
**VS Code/Codespaces behavior:**
- Ports are auto-forwarded but **won't open surprise browser tabs**
- Use the **Ports panel** to open the forwarded URL when ready
- VS Code may remap to a local random port (e.g., 5752) — this is expected
- Config is in `.vscode/settings.json` with `"onAutoForward": "ignore"`
**Quick setup:**
```bash
# Load port environment defaults
source scripts/mtbsync-ports.env
# Override specific ports if needed
export MTB_DASHBOARD_PORT=9000
```
**Optional shell aliases** (add to `~/.bashrc` or `~/.zshrc`):
```bash
# Quick launch aliases for mtbsync UIs
alias dash='mtbsync dashboard --root ./batch_out'
alias tele='mtbsync telemetry-ui --root ./batch_out'
alias comp='mtbsync compare-ui --root ./batch_out'
```
**Features:**
- Marker selector (multiple `new_markers*.csv`)
- Timing sparklines from `batch_summary.csv`
- Download artefacts; optional `POST /api/export-json` when `--allow-write` is set
- Live telemetry updates via Server-Sent Events (`/api/perf/stream`)
### 🧩 Threaded Dashboard Server
As of v0.10.4, the dashboard uses a threaded HTTP server to support long-lived Server-Sent Event (SSE) connections. This allows `/api/perf/stream` (the live telemetry feed) to run continuously while other endpoints (e.g. `/api/files`, `/api/markers`, `/api/timewarp`) remain responsive.
**Key details:**
- The dashboard runs via `ThreadingHTTPServer` with `daemon_threads=True` for safe shutdown
- Multiple browser tabs or connected clients can receive live telemetry simultaneously
- Existing single-threaded usage is now fully backward-compatible
**Usage example:**
```bash
mtbsync dashboard --root ./batch_out --port 8000
```
Then open http://localhost:8000 — the telemetry table will update live as new runs finish.
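For reference, a minimal standalone sketch of the same pattern using only the standard library; this is not mtbsync's actual server code:
```python
# Sketch only: ThreadingHTTPServer gives each client its own thread, so a
# long-lived SSE stream does not block other endpoints.
import json
import time
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        if self.path == "/api/perf/stream":
            self.send_response(200)
            self.send_header("Content-Type", "text/event-stream")
            self.send_header("Cache-Control", "no-cache")
            self.end_headers()
            try:
                while True:  # one thread per connected client
                    event = json.dumps({"ts": time.time()})
                    self.wfile.write(f"data: {event}\n\n".encode())
                    self.wfile.flush()
                    time.sleep(2.0)
            except BrokenPipeError:
                pass  # client went away
        else:
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.end_headers()
            self.wfile.write(b'{"ok": true}')


server = ThreadingHTTPServer(("127.0.0.1", 8000), Handler)
server.daemon_threads = True  # worker threads won't block shutdown
server.serve_forever()
```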
### 🚀 GPU & Extended Telemetry
From v0.10.5, mtbsync records extended runtime metrics in `perf.json`:
- **CPU%** and **RSS memory (MB)** when `psutil` is available
- **GPU utilisation (%)** and **VRAM used (MB)** when NVIDIA NVML (`pynvml`) is available
- Metrics surface in the dashboard table and update live via SSE
Telemetry is best-effort and never blocks the pipeline. To disable GPU probing:
```bash
# Disable GPU telemetry for sync
mtbsync sync ... --no-gpu
# Disable GPU telemetry for batch
mtbsync batch input_dir ... --no-gpu
```
> **Note:** psutil/pynvml are optional. If not installed, GPU/CPU fields are omitted or set to `null`.
### Telemetry Retention (perf.json)
For large batch runs, you can prune older telemetry artefacts:
**CLI**
```bash
# Keep newest 200 perf.json files under ./batch_out
mtbsync prune-perf --root ./batch_out --keep 200
# Dry-run first to see what would be deleted
mtbsync prune-perf --root ./batch_out --keep 200 --dry-run
```
**Dashboard (requires --allow-write)**
```bash
mtbsync dashboard --root ./batch_out --allow-write
```
Open the dashboard and use the **Prune perf.json** button to keep the newest N files.
All operations are constrained to the selected root.
### Streamlit Telemetry UI
Launch a rich, interactive telemetry dashboard using Streamlit:
```bash
mtbsync telemetry-ui --root ./batch_out
```
**Features:**
- Interactive time-series charts for FPS, CPU%, GPU utilisation, VRAM, and RSS memory
- Filtering controls (time range slider, recent N runs)
- Smoothing toggle for cleaner trend visualization
- CSV export of filtered telemetry data
- Optional live updates (experimental)
- Clean sidebar layout with metric selection
**Installation:**
Requires optional dependency:
```bash
pip install "mtbsync[telemetry]"
```
**Usage example:**
```bash
# Launch on default port (8761)
mtbsync telemetry-ui --root ./batch_out
# Custom port
mtbsync telemetry-ui --root ./batch_out --port 9000
```
The Streamlit UI complements the built-in HTML dashboard from `mtbsync dashboard`, providing richer interactivity and data exploration capabilities.
### Side-by-Side Video Comparison UI
Review and annotate two videos side-by-side with synced markers and delta inspection:
```bash
mtbsync compare-ui --ref ref.mp4 --new new.mp4 --ref-markers ref_markers.csv --timewarp-json timewarp.json
```
**Features:**
- Dual video panes (reference + new) for visual comparison
- Marker overlay with timeline visualization
- Delta summary statistics (MAE, P90, median, count)
- Interactive search and filter for markers
- Annotation system: add notes and labels to markers
- Export annotations to CSV or JSON
**Installation:**
Requires optional dependency:
```bash
pip install "mtbsync[compare]"
```
**Usage examples:**
```bash
# Full comparison with all inputs
mtbsync compare-ui --ref ref.mp4 --new new.mp4 \
--ref-markers ref_markers.csv \
--new-markers new_markers.csv \
--timewarp-json timewarp.json
# Launch from batch output directory
mtbsync compare-ui --root ./batch_out
# Custom port (default is 8762)
mtbsync compare-ui --ref ref.mp4 --new new.mp4 \
--ref-markers ref_markers.csv --port 9000
```
**Workflow:**
1. Configure file paths in the sidebar (or pass via CLI)
2. Click "Load Data" to import videos and markers
3. Review marker deltas in the timeline chart
4. Select markers to inspect details and jump to timestamps
5. Add annotations (notes/labels) for quality review
6. Export annotations for documentation or QA workflows
The comparison UI helps validate sync quality by showing actual vs predicted marker times, computing deltas, and allowing visual inspection of both videos at critical moments.
**Tips:**
- **CLI pre-population**: Arguments passed via CLI (--ref, --new, etc.) automatically pre-fill the sidebar fields through environment variables
- **No browser auto-open**: All UIs default to headless mode. Use the VS Code Ports panel to open when ready
- **Multiple instances**: Use different `--port` values to run multiple comparison sessions simultaneously
### Time-series (CPU/GPU/FPS)
From v0.10.7, the dashboard includes lightweight time-series charts for performance metrics:
- **FPS** (frames/sec) — computed as `frames_processed / retrieval_sec` when not recorded directly in `perf.json`
- **CPU %** — CPU utilisation percentage
- **GPU Util %** — GPU utilisation percentage (requires NVML)
- **VRAM (MB)** — GPU memory used (requires NVML)
- **RSS Memory (MB)** — resident set size
**How it works:**
- Charts pull from the last 500 `perf.json` artefacts (configurable via query parameter)
- Data is ordered oldest→newest for temporal visualisation
- Null values are skipped (shown as gaps in the chart)
- No external JavaScript libraries required — pure inline SVG
**API endpoint:**
```bash
curl "http://localhost:8000/api/perf/history?limit=500&fields=fps,cpu_pct,gpu_util,gpu_mem_mb,rss_mb"
```
**Note:** GPU metrics require optional NVML (`pynvml`) at runtime. If unavailable, GPU charts show gaps or "(no data)".
---
## Benchmark Quality
Evaluate sync quality across many runs (e.g., batch outputs) and generate comprehensive reports with CSV/JSON/HTML outputs.
### Usage
```bash
# Basic usage — generates CSV + JSON by default
mtbsync benchmark-quality --root batch_out
# Include HTML report with inline charts
mtbsync benchmark-quality --root batch_out --html
# Custom output directory and fields
mtbsync benchmark-quality --root batch_out --out benchmark_reports --fields fps,cpu_pct,gpu_util,gpu_mem_mb,rss_mb
# Limit to 500 most recent runs
mtbsync benchmark-quality --root batch_out --limit 500 --html
# Show strict evaluation in summary
mtbsync benchmark-quality --root batch_out --strict
```
### What It Does
The benchmark command:
1. **Discovers run folders** under the specified root directory (sorted by modification time, newest first)
2. **Loads artefacts** from each run:
- `timewarp.json` (required) — time-warp fit quality metrics
- `perf.json` (optional) — performance metrics (FPS, CPU, GPU, etc.)
- `pairs.csv` or `pairs_raw.csv` (optional) — residual fallback computation
3. **Aggregates metrics**:
- Time-warp quality: `ppm`, `inlier_frac`, `res_mae`, `res_p90`
- Performance: `fps`, `cpu_pct`, `gpu_util`, `gpu_mem_mb`, `rss_mb`
- Success rates: `ok` (basic), `strict_ok` (strict criteria)
4. **Generates reports**:
- **CSV**: Detailed table with all metrics
- **JSON**: Machine-readable format for automation
- **HTML**: Self-contained report with KPIs and inline SVG charts
### Strict Quality Criteria
When using `--strict`, runs are evaluated against production-grade gates:
- **Inlier fraction** ≥ 0.50 (50% of pairs support the fit)
- **P90 residual** ≤ 1.0 seconds (90th percentile within 1s)
- **PPM drift** ≤ 500 (0.05% speed variation)
Runs passing all three criteria are marked `strict_ok=True`.
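Expressed as a predicate over the `timewarp.json` fields, the strict gate is simply:
```python
# Sketch only: the three strict criteria combined with AND.
def strict_ok(inlier_frac: float, res_p90: float, ppm: float) -> bool:
    return inlier_frac >= 0.50 and res_p90 <= 1.0 and abs(ppm) <= 500
```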
### HTML Report Features
When `--html` is specified, the report includes:
- **KPI Dashboard**: Total runs, OK%, Strict OK%, Mean P90, Mean PPM
- **Distribution Charts**: Inline SVG histograms for P90 and PPM buckets
- **Detailed Table**: Sortable table with all metrics per run
- **Self-contained**: No external dependencies, works offline
### Example Output
```
Benchmark Summary
Total runs: 50
OK: 47 (94.0%)
Strict OK: 38 (76.0%)
Mean P90: 0.45s
Mean PPM: 234
Outputs:
CSV: benchmark_out/benchmark_quality.csv
JSON: benchmark_out/benchmark_quality.json
HTML: benchmark_out/benchmark_quality.html
```
### Quick QA (local)
```bash
./examples/run_benchmark.sh ./batch_out ./benchmark_out && open ./benchmark_out/report.html
```
*On Linux, replace `open` with `xdg-open`.*
### When to Use
- **QA workflows**: Validate batch processing quality before delivering results
- **Parameter tuning**: Compare sync quality across different configurations
- **Continuous integration**: Generate quality reports as part of automated pipelines
- **Performance analysis**: Track FPS, CPU, GPU trends across many runs
- **Documentation**: Generate HTML reports for stakeholders or audits
### Advanced Options
```bash
# Custom pattern for run folders
mtbsync benchmark-quality --root batch_out --pattern "run_2024*"
# Quiet mode (suppress console output, just write files)
mtbsync benchmark-quality --root batch_out --quiet
# Generate only JSON (skip CSV)
mtbsync benchmark-quality --root batch_out --no-csv --json
```
### Exit Codes
- `0`: Success
- `1`: Unexpected error
- `2`: No runs discovered in root directory
---
## Regression & Auto-Gating
Track quality trends across runs and automatically detect performance regressions. The system maintains a rolling history of quality metrics (P90, PPM, inlier fraction, FPS) and compares each new run against a baseline computed from recent history.
### Quick Start
```bash
# Sync with automatic regression tracking (enabled by default)
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv
# Check regression status
mtbsync quality-check --root ./batch_out
# Sync with strict regression gates (fail on severe regression)
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv \
--fail-on-regression \
--reg-p90-pct 0.10 --reg-p90-abs 0.10
```
### How It Works
1. **History Tracking**: Each successful sync/batch run appends a quality record to `quality_history.json`:
- Timestamp, run ID
- P90 residual (seconds), PPM drift, inlier fraction
- Processing FPS, optional CPU/GPU metrics
2. **Baseline Computation**: Computes rolling median from last N runs (default 50)
- Requires ≥10 runs for reliable gating
- Stores separately for each root directory
3. **Regression Detection**: Compares the current run against the baseline (see the sketch after this list):
- **OK**: All metrics within thresholds
- **WARN**: Minor regression detected
- **FAIL**: Severe regression (2× threshold)
- **LEARNING**: Insufficient history (<10 runs)
4. **Auto-gating**: Writes `regression.json` with status and deltas
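A minimal sketch of steps 2 and 3 for a single metric (P90), assuming the "+pct or +abs" warn thresholds combine with OR logic and the fail gate sits at the severe multiplier:
```python
# Sketch only: rolling-median baseline plus OK/WARN/FAIL/LEARNING gating.
import numpy as np


def regression_status(history_p90, current_p90, window: int = 50,
                      warn_pct: float = 0.25, warn_abs: float = 0.25,
                      severe_multiplier: float = 2.0, min_runs: int = 10) -> str:
    recent = list(history_p90)[-window:]
    if len(recent) < min_runs:
        return "LEARNING"                      # not enough history to gate
    baseline = float(np.median(recent))
    delta = current_p90 - baseline
    warn = delta > warn_pct * baseline or delta > warn_abs
    fail = (delta > severe_multiplier * warn_pct * baseline
            or delta > severe_multiplier * warn_abs)
    if fail:
        return "FAIL"
    if warn:
        return "WARN"
    return "OK"
```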
### Default Thresholds
| Metric | Warning Threshold | Fail Threshold (2×) |
|--------|------------------|---------------------|
| **P90** | +25% or +0.25s | +50% or +0.50s |
| **PPM** | +25% or +150 ppm | +50% or +300 ppm |
| **Inlier Fraction** | -15% or -0.05 | -30% or -0.10 |
All thresholds are configurable via CLI flags.
### CLI Commands
#### sync / batch with Regression Tracking
```bash
# Basic usage (auto-tracking enabled)
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv
# Disable regression tracking
mtbsync sync ... --regression-off
# Fail on severe regression (exit code 20)
mtbsync sync ... --fail-on-regression
# Custom thresholds
mtbsync sync ... \
--reg-window 100 \
--reg-p90-pct 0.10 --reg-p90-abs 0.10 \
--reg-ppm-pct 0.15 --reg-ppm-abs 100 \
--reg-inlier-pct 0.10 --reg-inlier-abs 0.05
```
#### quality-check Command
Standalone checker for regression status:
```bash
# Check last run against baseline
mtbsync quality-check --root ./batch_out
# With custom thresholds
mtbsync quality-check --root ./batch_out \
--window 100 \
--reg-p90-pct 0.10 --reg-p90-abs 0.10
# Fail on severe regression (for CI)
mtbsync quality-check --root ./batch_out --fail-on-regression
```
**Exit Codes:**
- `0`: OK or learning
- `10`: Warning (minor regression)
- `20`: Fail (severe regression)
- `2`: No history found
- `1`: Error
### Dashboard Integration
The dashboard displays a **Regression** card showing:
- Color-coded status badge (green/yellow/red/cyan)
- Current vs baseline metrics table
- Delta percentages for P90, PPM, Inlier Fraction
- Regression reasons and baseline window size
Access via:
```bash
mtbsync dashboard --root batch_out --port 8000
```
Then navigate to `http://localhost:8000` and check the Regression card.
### CI/CD Integration
#### Automated Quality Gate Workflow
The repository includes a complete GitHub Actions workflow (`.github/workflows/quality-gate.yml`) that:
1. **Runs on every PR and push to main** (plus daily at 02:17 UTC)
2. **Generates benchmark reports** with CSV/JSON/HTML outputs
3. **Executes quality gate checks** with automatic pass/fail/warn status
4. **Posts PR comments** with quality status and links to artifacts
5. **Uploads artifacts** for download (14-day retention)
6. **Displays job summary** in GitHub Actions UI
**What's Checked:**
The quality gate monitors three key metrics against historical baseline:
- **P90 Residual (seconds)**: 90th percentile alignment error
- Measures how well videos sync at challenging points
- Default thresholds: +25% or +0.25s (warn), +50% or +0.50s (fail)
- **PPM (parts per million)**: Speed drift from 1:1 playback
- abs(speed_ratio - 1.0) × 10⁶
- Default thresholds: +25% or +150 ppm (warn), +50% or +300 ppm (fail)
- **Inlier Fraction**: Proportion of matches supporting the time-warp model
- Indicates robustness of alignment
- Default thresholds: -15% or -0.05 (warn), -30% or -0.10 (fail)
Baseline is computed from the last 50 successful runs (requires ≥10 runs for gating).
**Exit Code Mapping:**
| Exit Code | Status | Meaning | CI Behavior |
|-----------|--------|---------|-------------|
| 0 | PASS ✓ | Quality is good | Job passes |
| 10 | WARN ⚠ | Quality degraded but within warning threshold | Job passes (but flagged) |
| 20 | FAIL ✗ | Quality regression exceeds failure threshold | **Job fails** |
| 2 | NO DATA ○ | No history or insufficient data | Job passes (first run) |
| 1 | ERROR ✗ | Command error or exception | **Job fails** |
**Learning Window Protection:**
To prevent false-positives during initial data collection, the quality gate includes a learning window guard:
- **Requirement**: 25+ runs in quality history before allowing hard-fail
- **Behavior**: FAIL status is treated as WARN_LEARNING during learning period
- **Result**: Job passes (exit 0) but status is flagged in PR comment and job summary
- **Display**: History size shown in job summary: "Learning mode: Only 12 runs in history (need 25 for hard-fail)"
This ensures your CI doesn't fail due to unstable baselines while collecting initial quality data.
**Local Reproduction:**
```bash
# Run quality gate locally (same as CI)
python tools/ci_quality_gate.py
# Or run full benchmark + quality check
mtbsync benchmark-quality --root ./batch_out --out ./benchmark_out \
--csv --json --html --strict --limit 500
python tools/ci_quality_gate.py
open ./benchmark_out/report.html
```
**Configurable Thresholds:**
You can override thresholds without code changes using manual workflow dispatch:
1. Go to: Actions → quality-gate → Run workflow
2. Select a **preset** (or choose custom for fine-grained control):
- **strict**: Production quality gates (tighter tolerances)
- **normal**: Default balanced thresholds
- **permissive**: Development mode (relaxed gates)
- **custom**: Manually adjust all 6 threshold parameters below
3. If preset=custom, adjust threshold inputs:
- `reg_p90_pct`: P90 % threshold (warn) - default 0.25
- `reg_p90_abs`: P90 abs seconds (warn) - default 0.25
- `reg_ppm_pct`: PPM % threshold (warn) - default 0.25
- `reg_ppm_abs`: PPM abs value (warn) - default 150
- `reg_inlier_pct`: Inlier fraction % threshold (warn) - default 0.15
- `reg_inlier_abs`: Inlier fraction abs threshold (warn) - default 0.05
4. Click "Run workflow"
Fail thresholds are automatically 2× the warn thresholds.
**Accessing Artifacts:**
After each quality gate run, downloadable artifacts are available for 14 days:
1. Navigate to the workflow run (click the badge or PR comment link)
2. Scroll to "Artifacts" section at bottom
3. Download:
- `benchmark-report`: Full HTML/CSV/JSON quality reports
- `quality-gate-summary`: Includes `quality_gate.txt` (plain text) and `quality_gate.json` (machine-readable)
The `quality_gate.json` file enables downstream tooling integration with schema:
```json
{
"status": "PASS|WARN|FAIL|NO_DATA|ERROR",
"exit_code": 0|10|20|2|1,
"timestamp": "ISO-8601",
"regression": {
"severity": "ok|warn|fail",
"current": {"p90_sec": 1.5, "ppm": 250, "inlier_frac": 0.7},
"baseline": {"p90_sec": 1.0, "ppm": 200, "inlier_frac": 0.75},
"deltas": {"p90_sec": 0.5, "ppm": 50, "inlier_frac": -0.05},
"thresholds": {...},
"reasons": [...]
}
}
```
**PR Comment Example:**
The quality gate workflow automatically posts/updates a single PR comment with:
- Current status (PASS/WARN/FAIL/NO DATA)
- Link to downloadable benchmark report artifact
- Quality gate summary with deltas vs baseline
- Timestamp of last update
**Manual Integration Example:**
```yaml
- name: Run batch processing
run: mtbsync batch input_videos/ --out-dir batch_out
- name: Check for regressions
run: |
mtbsync quality-check --root batch_out --fail-on-regression \
--reg-p90-pct 0.15 --reg-p90-abs 0.15 \
--reg-ppm-pct 0.20 --reg-ppm-abs 150 \
--reg-inlier-pct 0.10 --reg-inlier-abs 0.05
```
The job will:
- Exit 0 (pass) if status is OK or LEARNING
- Exit 10 (continue) if status is WARN
- Exit 20 (fail) if status is FAIL
### Quality History Management
History files are automatically capped at 2000 records (most recent). To manually inspect or clean:
```bash
# View history
cat batch_out/quality_history.json | jq '.'
# View regression status
cat batch_out/regression.json | jq '.'
# Manually prune history (if needed)
# Just delete the file to reset — it will be recreated on next run
rm batch_out/quality_history.json
```
### Interpreting Results
**Example output from `quality-check`:**
```
Regression Status: WARN
┏━━━━━━━━━━━━━━━┳━━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━━━━━━┓
┃ Metric ┃ Current ┃ Baseline ┃ Delta ┃
┡━━━━━━━━━━━━━━━╇━━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━━━━━━┩
│ P90 (s) │ 1.35 │ 1.00 │ +0.35 (+35.0%) │
│ PPM │ 245 │ 200 │ +45 (+22.5%) │
│ Inlier Frac │ 0.72 │ 0.75 │ -0.03 (-4.0%) │
└───────────────┴─────────┴──────────┴────────────────┘
→ P90 regression: 1.35s vs baseline 1.00s (+35.0%)
Baseline window: 50 runs (120 total)
Run ID: sync:new_video.mp4
```
**Status Meanings:**
- **OK**: All metrics within thresholds. No action needed.
- **WARN**: Minor regression detected. Review changes, consider tuning params.
- **FAIL**: Severe regression. Investigate immediately — likely quality degradation or bug.
- **LEARNING**: <10 runs in history. Collecting baseline data.
### When to Use
- **Continuous monitoring**: Detect quality drift early across batch runs
- **CI/CD quality gates**: Fail builds on severe performance regressions
- **Parameter tuning**: Validate that config changes don't degrade quality
- **Production deployment**: Ensure new releases maintain quality standards
- **Performance tracking**: Monitor FPS, CPU, GPU trends over time
### Tuning Thresholds
Start with **permissive thresholds** during development:
```bash
--reg-p90-pct 0.25 --reg-p90-abs 0.25 # ±25% or 0.25s
--reg-ppm-pct 0.25 --reg-ppm-abs 150 # ±25% or 150ppm
--reg-inlier-pct 0.15 --reg-inlier-abs 0.05 # -15% or 0.05
```
Tighten for **production quality gates**:
```bash
--reg-p90-pct 0.10 --reg-p90-abs 0.10 # ±10% or 0.10s (strict)
--reg-ppm-pct 0.15 --reg-ppm-abs 100 # ±15% or 100ppm
--reg-inlier-pct 0.10 --reg-inlier-abs 0.03 # -10% or 0.03
```
**General rules:**
- Lower percentages = stricter gates
- Smaller absolute values = stricter gates
- OR logic: a metric is flagged once its delta from the baseline exceeds either the relative bound (`pct × baseline`) or the absolute bound (`abs`)
- Fail threshold = 2× warn threshold (configurable via `severe_multiplier`)
---
## Troubleshooting & Quality Tuning
### Interpreting Time-Warp Quality
After running `mtbsync sync`, inspect `timewarp.json` to assess alignment quality:
```json
{
"ok": true,
"model": "affine",
"params": {"a": 1.0, "b": 0.0},
"ppm": 0,
"inlier_frac": 0.33,
"residuals": {"mae": 0.89, "p50": 0.72, "p90": 2.0, "p95": 2.5}
}
```
**Quality indicators:**
- **`a=1, b=0`** → Identity warp (no temporal scaling/offset)
- **`ppm=0`** → Zero drift from 1:1 playback (consistent with identity)
- **`inlier_frac=0.33`** → Only 33% of pairs supported the model during RANSAC
- **`P90=2.0s`** → 90th percentile residual is 2 seconds (⚠️ **weak alignment**)
**Why `ok:true` despite poor quality?**
Default gates are permissive:
- `max_ppm=1000` (allows up to 0.1% speed drift)
- `min_inlier_frac=0.25` (accepts models with only 25% support)
With 33% inliers and identity warp, it technically passes. However, **P90=2s residuals mean marker transfer will be inaccurate by ±2 seconds**.
### Tightening Quality Gates
For **stricter alignment requirements**, use these parameters:
```bash
mtbsync sync \
  --reference ref.mp4 \
  --new new.mp4 \
  --index cache/ref_index.npz \
  --out cache \
  --warp-window-sec 0.3 \
  --warp-inlier-thresh 0.05 \
  --warp-min-inlier-frac 0.5 \
  --warp-ransac-iters 1000 \
  --warp-max-ppm 500
```
- `--warp-window-sec 0.3`: tighter acceptance window (default is wider)
- `--warp-inlier-thresh 0.05`: stricter inlier definition (50 ms at 3 fps)
- `--warp-min-inlier-frac 0.5`: require 50% support (up from 0.25)
- `--warp-ransac-iters 1000`: more iterations for noisy data
- `--warp-max-ppm 500`: tighter drift tolerance (0.05% vs 0.1%)
**Expected outcome:** With P90=2s, these settings will likely produce `ok:false`, which is **useful signal**—the alignment isn't reliable for that pair.
### Visual Inspection
Generate a scatter plot overlay to see alignment quality:
```bash
# Headless mode (generates PNG)
mtbsync viewer --headless \
--ref-markers ref_markers.csv \
--timewarp-json timewarp.json \
--out-png alignment_overlay.png
# Interactive GUI (file picker dialogs)
mtbsync viewer
```
The overlay shows:
- **Diagonal line** = perfect alignment
- **Deviations** = temporal misalignment
- **Clusters** = consistent offset regions
### Parameter Tuning Guidelines
| Symptom | Likely Cause | Suggested Fix |
|---------|--------------|---------------|
| Low `inlier_frac` (<0.4) | Poor feature matches, scene cuts | Increase `--warp-ransac-iters 1000`, tighten `--warp-min-inlier-frac 0.5` |
| High `ppm` (>500) | Videos at different speeds | Check source material, reduce `--warp-max-ppm 500` |
| High `P90` (>1s) | Inconsistent temporal alignment | Reduce `--warp-window-sec 0.3`, tighten `--warp-inlier-thresh 0.05` |
| Identity warp (`a=1, b=0`) with high residuals | Visual features don't match temporal structure | Add GPS data, or manually verify videos are from same track |
### Batch Quality Review
Use the dashboard to quickly review alignment quality across many pairs:
```bash
mtbsync dashboard --root ./batch_output --port 8000
```
In the **Batch Summary** card, look for:
- ✅ `timewarp_ok=true` with `P90 < 0.5s` → good alignment
- ⚠️ `timewarp_ok=true` with `P90 > 1.0s` → weak alignment (investigate)
- ❌ `timewarp_ok=false` → alignment failed gates (expected for dissimilar runs)
---
## Phase 2 (Later Tickets)
- Add **SuperPoint + SuperGlue** path (`--matcher super`).
- RAFT optical flow refinement around matched frames.
- FCPXML export for Final Cut; Premiere Pro XML.
- Manual **anchor pairs** UI to re‑fit time‑warp interactively.
- SQLite cache for descriptor indexes and run metadata.
- Basic metrics dashboard (pair coverage, residuals, confidence histogram).
---
*Why an iterative ticket flow?*
It enforces small, testable steps with explicit acceptance criteria, which yields cleaner commits, faster debugging, and easier refactors as the project grows.
Raw data
{
"_id": null,
"home_page": null,
"name": "mtbsync",
"maintainer": null,
"docs_url": null,
"requires_python": ">=3.11",
"maintainer_email": null,
"keywords": "gopro, video, sync, mtb, markers, opencv",
"author": "MTB Sync Contributors",
"author_email": null,
"download_url": "https://files.pythonhosted.org/packages/f0/2c/35628931a96a9172daa14eb80ef6a07773fec1e686001b083301b8a9b747/mtbsync-0.10.16.tar.gz",
"platform": null,
"description": "# MTB Video Sync MVP \u2014 Iterative Ticket Flow\n\n[](https://badge.fury.io/py/mtbsync)\n[](https://github.com/markg72/goprosync/actions/workflows/ci.yml)\n[](https://github.com/markg72/goprosync/actions/workflows/quality-gate.yml)\n[](https://www.python.org/downloads/)\n[](https://opensource.org/licenses/MIT)\n\nA modular, step\u2011by\u2011step plan to build a Python MVP that aligns a new MTB GoPro run to a reference run, then transfers reference markers to the new run. Designed for use with Claude Code or the OpenAI CLI.\n\n## Quickstart\n\nInstall from PyPI:\n\n```bash\npip install mtbsync\n```\n\nThe package includes helper scripts and documentation:\n- **Helper scripts**: `examples/visual_diagnostic.sh`, `diagnose_alignment.sh`, `resync_strict.sh`, `dashboard_cleanup.sh`\n- **Documentation**: `QUICK_VISUAL_DIAGNOSTIC.md`, `examples/CI_CD_INTEGRATION.md`, `examples/README.md`\n\nBasic usage:\n\n```bash\n# Index reference video\nmtbsync index --reference ref.mp4 --fps 3 --out ref_index.npz\n\n# Sync new video to reference\nmtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv --fps 3\n\n# Transfer markers\nmtbsync transfer-markers --ref-markers ref_markers.csv --timewarp-json timewarp.json --out new_markers.csv\n\n# Export to NLE formats\nmtbsync export-markers new_markers.csv --preset fcpxml\nmtbsync export-markers new_markers.csv --preset premiere\nmtbsync export-markers new_markers.csv --preset resolve-edl\n```\n\n---\n\n## Project Brief\n\n**Goal:** Build a Python MVP that aligns a new MTB GoPro run to a reference run of the same trail, then transfers reference markers to the new run. Use computer vision (CV) with optional GPS. Export markers for NLEs (EDL/CSV).\n\n**Core pipeline**\n\n1. **Ingest**: MP4 + optional GPX/FIT.\n2. **Reference indexing**: extract keyframes (2\u20135 fps), compute descriptors (start with ORB).\n3. **Coarse sync**: (a) if GPS present \u2192 distance-based alignment; else (b) visual keyframe retrieval.\n4. **Fine sync**: local feature matching + RANSAC \u2192 time pairs `(t_new \u2192 t_ref, confidence)`.\n5. **Time\u2011warp fit**: monotonic mapping via DTW or a monotone spline with smoothing.\n6. **Marker transfer**: map `t_ref \u2192 t_new`, attach confidence; flag low\u2011confidence.\n7. **Export**: CSV + CMX3600 EDL; optional FCPXML later.\n8. **Preview**: simple Streamlit UI for side\u2011by\u2011side, scrub\u2011synced playback and marker review.\n\n**Non\u2011goals for MVP:** cloud, user auth, multi\u2011hour videos, SuperPoint/RAFT (phase 2).\n\n**Tech:** Python 3.11+, OpenCV, NumPy, SciPy, ffmpeg\u2011python, pandas, fastdtw, gpxpy, Streamlit, Typer/Rich.\n\n---\n\n## Repository Layout\n\n```\nmtb-sync/\n README.md\n pyproject.toml # or requirements.txt\n src/\n mtbsync/__init__.py\n mtbsync/cli.py\n mtbsync/io/video.py\n mtbsync/io/gps.py\n mtbsync/features/keyframes.py\n mtbsync/features/descriptors.py\n mtbsync/match/retrieval.py\n mtbsync/match/local.py\n mtbsync/align/timewarp.py\n mtbsync/markers/schema.py\n mtbsync/markers/transfer.py\n mtbsync/export/csv_export.py\n mtbsync/export/edl_export.py\n mtbsync/ui/app.py\n tests/\n test_timewarp.py\n test_marker_transfer.py\n test_edl_export.py\n data/\n sample_reference.mp4\n sample_new.mp4\n sample_reference_markers.csv\n sample_reference.gpx\n```\n\n---\n\n## Tickets\n\n> Work through these tickets in order. 
Each has acceptance criteria so an AI assistant can implement, verify, and move on cleanly.\n\n### 0) Repo Scaffold\n\n**Implement**\n\n- Create the folder structure above.\n- Configure packaging: `pip install -e .` enables local dev.\n- Provide a `pyproject.toml` (or `requirements.txt`) with pinned deps.\n\n**Acceptance Criteria**\n\n- Virtual env setup works.\n- `mtbsync --help` available after editable install.\n\n---\n\n### 1) CLI Surface\n\n**Implement**\n\n- Subcommands:\n\n ```\n mtbsync index --reference ref.mp4 --fps 3 --out cache/ref_index.npz\n mtbsync sync --reference ref.mp4 --new new.mp4 \\\n [--ref-gpx ref.gpx] [--new-gpx new.gpx] \\\n --index cache/ref_index.npz --out cache/pairs.csv\n mtbsync warp --pairs cache/pairs.csv --out cache/warp.npz\n mtbsync transfer --reference-markers data/ref_markers.csv --warp cache/warp.npz \\\n --out out/new_markers.csv --review-threshold 0.6\n mtbsync export edl --markers out/new_markers.csv --out out/new_markers.edl \\\n [--reel 001] [--fps 29.97]\n mtbsync preview --reference ref.mp4 --new new.mp4 --markers out/new_markers.csv\n ```\n\n**Acceptance Criteria**\n\n- Robust arg validation and concise summaries via `rich`.\n- No global state; each command is independently runnable.\n\n---\n\n### 2) Video IO & Keyframes\n\n**Implement**\n\n- `io/video.py`:\n - `extract_keyframes(video_path, fps) -> List[(t_sec, frame_bgr)]`\n - `video_fps(video_path) -> float`\n- `features/keyframes.py`:\n - Resize frames to max dim 960 px (preserve aspect).\n\n**Acceptance Criteria**\n\n- Deterministic timestamp sampling (handles VFR).\n- Errors handled gracefully (bad file, zero\u2011length, etc.).\n\n---\n\n### 3) Descriptors\n\n**Implement**\n\n- `features/descriptors.py` with ORB (grayscale + optional CLAHE).\n- Functions return keypoints and descriptors and can handle sparse scenes.\n\n**Acceptance Criteria**\n\n- Target ~800 keypoints per keyframe when available.\n- Save index with timestamps, (x,y,angle,scale), and descriptors (uint8) to `.npz`.\n\n---\n\n### 4) Retrieval (Coarse Visual Alignment)\n\n**Implement**\n\n- `match/retrieval.py`: brute\u2011force Hamming + Lowe ratio + voting to pick top\u2011K reference keyframes for each new keyframe.\n\n**Acceptance Criteria**\n\n- Output `pairs_raw.csv` with `t_new, t_ref, score, n_inliers`.\n- 5 min @ 3 fps runs in ~\u22642 minutes on a laptop.\n\n---\n\n### 5) Local Refinement\n\n**Implement**\n\n- `match/local.py`: For each candidate, refine around \u00b11 keyframe window, compute homography/affine with RANSAC, and keep best match.\n\n**Acceptance Criteria**\n\n- Output `pairs.csv` with `(t_new, t_ref, confidence)`.\n- Reject outliers by reprojection error; \u226570% valid pairs on sample data.\n\n---\n\n### 6) GPS Alignment (Optional but Implemented)\n\n**Implement**\n\n- `io/gps.py` using `gpxpy`:\n - Parse GPX/FIT (start with GPX).\n - Compute cumulative distance and resample to 10 Hz.\n- In `sync`, if GPS provided for either side, align distance curves (cross\u2011correlation) to estimate offset/scale and pre\u2011seed candidate `t_ref`.\n\n**Acceptance Criteria**\n\n- Works with one or both GPS tracks.\n- Falls back cleanly to visual retrieval.\n\n---\n\n### 7) Time\u2011Warp Fit\n\n**Implement**\n\n- `align/timewarp.py`:\n - Fit a monotonic mapping via **fastdtw** or piecewise linear monotone spline with L2 smoothing.\n - Expose `map_t_new_to_t_ref(t_new)` and inverse `map_t_ref_to_t_new(t_ref)`.\n\n**Acceptance Criteria**\n\n- Strict monotonicity; median residual < 0.3 s on sample data.\n- 
Save to `warp.npz` with arrays + metadata.\n\n---\n\n### 8) Marker Schema & Transfer\n\n**Implement**\n\n- `markers/schema.py`: reference CSV schema `name,t_ref,colour?,comment?` (seconds float).\n- `markers/transfer.py`: apply inverse warp to map each `t_ref \u2192 t_new`, interpolate confidence, flag `needs_review` by threshold.\n\n**Acceptance Criteria**\n\n- Output `new_markers.csv` with `name,t_new,confidence,needs_review,comment`.\n\n---\n\n### 9) Exports\n\n**Implement**\n\n- `export/csv_export.py`: already covered by transfer CSV.\n- `export/edl_export.py`: write CMX3600 EDL; 1\u2011frame events with comments.\n\n**Acceptance Criteria**\n\n- EDL imports in Resolve/Premiere with visible markers.\n\n---\n\n### 10) Preview UI\n\n**Implement**\n\n- `ui/app.py` (Streamlit):\n - Inputs: reference/new MP4, markers CSV, warp.npz.\n - Side\u2011by\u2011side players with linked scrubbing.\n - Marker list \u00b15 s around playhead; colour\u2011coded by confidence; CSV download.\n\n**Acceptance Criteria**\n\n- `streamlit run src/mtbsync/ui/app.py` launches; approximate sync is acceptable.\n\n---\n\n### 11) Tests & Sample Data\n\n**Implement**\n\n- Unit tests for:\n - Monotonic time\u2011warp and inverse mapping.\n - EDL formatting round\u2011trip sanity.\n - Marker transfer shape and thresholds.\n- Synthetic fixtures: generate warped time with noise so tests don\u2019t depend on large files.\n\n**Acceptance Criteria**\n\n- `pytest` passes in < 10 s locally.\n\n---\n\n### 12) Dev Ergonomics\n\n**Implement**\n\n- `Makefile`/`justfile`: `setup`, `lint`, `test`, `run-preview`.\n- Pre\u2011commit: `ruff`, `black`, `isort`.\n- `README.md`: quick start + example commands.\n\n**Acceptance Criteria**\n\n- One\u2011command setup and test run.\n\n---\n\n## Prompt Templates\n\n### Claude Code (per\u2011ticket)\n\n```\nYou are a senior Python engineer. Implement the next ticket in the mtb-sync repo.\nFollow the acceptance criteria exactly. Keep code modular and documented.\n\n<TICKET NAME>: <paste ticket content>\n\nConstraints:\n- Python 3.11, no GPU assumptions.\n- Pure functions where possible; no global state.\n- Use 'rich' for concise logging.\n- Add docstrings and type hints.\n- If format ambiguities arise, decide and document in README.\n\nAfter changes:\n- List files changed.\n- Provide example CLI usage.\n- Run unit tests (simulate if environment is not executable) and report results.\n```\n\n### OpenAI CLI (chat)\n\n```bash\nopenai chat.completions.create -m gpt-4.1 \\\n -g \"\nYou are a senior Python engineer. Implement the following ticket for a project called mtb-sync. Provide complete code blocks for the mentioned files. 
Explain only what's necessary to run it.\n\n<TICKET NAME + CONTENT>\n\"\n```\n\n---\n\n## Quick Run Guide (for README)\n\n```bash\n# 1) Setup\npython -m venv .venv && source .venv/bin/activate\npip install -U pip\npip install -e .\n\n# 2) Index reference (3 fps keyframes)\nmtbsync index --reference data/sample_reference.mp4 --fps 3 --out cache/ref_index.npz\n\n# 3) Build pairs (with or without GPS)\nmtbsync sync --reference data/sample_reference.mp4 --new data/sample_new.mp4 \\\n --index cache/ref_index.npz --out cache/pairs.csv\n# Optional (GPS-assisted):\nmtbsync sync --reference data/sample_reference.mp4 --new data/sample_new.mp4 \\\n --ref-gpx data/sample_reference.gpx --new-gpx data/sample_new.gpx \\\n --index cache/ref_index.npz --out cache/pairs.csv\n\n# 4) Fit time-warp (automatic during sync, or standalone)\n# Note: sync command automatically generates timewarp.json during retrieval\nmtbsync warp --pairs cache/pairs.csv --out cache/warp.npz\n\n# 5) Transfer markers (using timewarp.json from sync)\nmtbsync transfer-markers --ref-markers data/sample_reference_markers.csv \\\n --timewarp-json timewarp.json \\\n --out out/new_markers.csv \\\n --plot-overlay\n# Outputs:\n# - new_markers.csv with marker_id,t_ref,t_new_est (+ preserved metadata)\n# - new_markers_overlay.png preview\n\n# 6) Export EDL\nmtbsync export edl --markers out/new_markers.csv --out out/new_markers.edl --fps 29.97\n\n# 7) Preview UI\nstreamlit run src/mtbsync/ui/app.py\n```\n\n---\n\n## Performance\n\n### Parallel Retrieval\n\nUse `--threads` to enable multi-threaded frame matching:\n\n```bash\nmtbsync sync --reference ref.mp4 --new new.mp4 --index ref.npz --out pairs.csv --threads 4\n```\n\n### Fast Preset for Bulk Jobs\n\nUse `--fast` to auto-tune parameters for large-scale processing:\n\n```bash\nmtbsync sync --reference ref.mp4 --new new.mp4 --index ref.npz --out pairs.csv --fast\n```\n\nThe `--fast` preset automatically:\n- Sets `threads >= 4` (parallel retrieval)\n- Tunes warp parameters for speed (relaxed RANSAC iterations/thresholds)\n\n### Timing Information\n\nThe `sync` command prints per-stage timings:\n- `retrieval_sec` - Frame matching time\n- `warp_sec` - Time-warp fitting/gating time\n- `markers_sec` - Marker auto-export time\n- `total_sec` - End-to-end pipeline time\n\nBatch processing writes timings to `batch_summary.csv` for analysis across multiple pairs.\n\n### \u26a1 Performance Benchmarks\n\n| Stage | Mean Time (s) | Notes |\n|---------------|---------------|------------------------------------|\n| GPS Alignment | 0.28 | Vectorised (`np.interp`) fast-path |\n| Retrieval | 1.42 | 4 threads (`ThreadPoolExecutor`) |\n| Warp Fit | 0.06 | RANSAC + IRLS refinement |\n| Marker Export | 0.11 | CSV \u2192 JSON + overlay |\n| **Total** | 1.87 \u00b1 0.15 | Typical 1080p pair (8 k frames) |\n\n> \ud83d\udca1 Use `--threads 4` or `--fast` for large jobs.\n> `batch_summary.csv` records per-stage timings for every pair.\n\n[](https://github.com/markg72/goprosync)\n\n## Dashboard\n\nLaunch a local, zero-dependency dashboard to inspect artefacts:\n\n```bash\n# Default port 8760 (no browser auto-open)\nmtbsync dashboard --root ./batch_out\n\n# Enable server-side export\nmtbsync dashboard --root ./batch_out --allow-write\n\n# Explicitly open browser on launch\nmtbsync dashboard --root ./batch_out --open-browser\n```\n\n### Ports & Browser Behavior (VS Code/Codespaces)\n\nAll UI commands now use **stable default ports** and **never auto-open a browser** by default:\n\n| App | CLI Command | Env Variable | 
Default Port |\n|-----|-------------|--------------|--------------|\n| Dashboard | `mtbsync dashboard` | `MTB_DASHBOARD_PORT` | 8760 |\n| Telemetry UI | `mtbsync telemetry-ui` | `MTB_TELEMETRY_PORT` | 8761 |\n| Compare UI | `mtbsync compare-ui` | `MTB_COMPARE_PORT` | 8762 |\n\n**Port precedence:** `--port` flag > environment variable > default\n\n**VS Code/Codespaces behavior:**\n- Ports are auto-forwarded but **won't open surprise browser tabs**\n- Use the **Ports panel** to open the forwarded URL when ready\n- VS Code may remap to a local random port (e.g., 5752) \u2014 this is expected\n- Config is in `.vscode/settings.json` with `\"onAutoForward\": \"ignore\"`\n\n**Quick setup:**\n```bash\n# Load port environment defaults\nsource scripts/mtbsync-ports.env\n\n# Override specific ports if needed\nexport MTB_DASHBOARD_PORT=9000\n```\n\n**Optional shell aliases** (add to `~/.bashrc` or `~/.zshrc`):\n```bash\n# Quick launch aliases for mtbsync UIs\nalias dash='mtbsync dashboard --root ./batch_out'\nalias tele='mtbsync telemetry-ui --root ./batch_out'\nalias comp='mtbsync compare-ui --root ./batch_out'\n```\n\n**Features:**\n- Marker selector (multiple `new_markers*.csv`)\n- Timing sparklines from `batch_summary.csv`\n- Download artefacts; optional `POST /api/export-json` when `--allow-write` is set\n- Live telemetry updates via Server-Sent Events (`/api/perf/stream`)\n\n### \ud83e\udde9 Threaded Dashboard Server\n\nAs of v0.10.4, the dashboard uses a threaded HTTP server to support long-lived Server-Sent Event (SSE) connections. This allows `/api/perf/stream` (the live telemetry feed) to run continuously while other endpoints (e.g. `/api/files`, `/api/markers`, `/api/timewarp`) remain responsive.\n\n**Key details:**\n- The dashboard runs via `ThreadingHTTPServer` with `daemon_threads=True` for safe shutdown\n- Multiple browser tabs or connected clients can receive live telemetry simultaneously\n- Existing single-threaded usage is now fully backward-compatible\n\n**Usage example:**\n\n```bash\nmtbsync dashboard --root ./batch_out --port 8000\n```\n\nThen open http://localhost:8000 \u2014 the telemetry table will update live as new runs finish.\n\n### \ud83d\ude80 GPU & Extended Telemetry\n\nFrom v0.10.5, mtbsync records extended runtime metrics in `perf.json`:\n\n- **CPU%** and **RSS memory (MB)** when `psutil` is available\n- **GPU utilisation (%)** and **VRAM used (MB)** when NVIDIA NVML (`pynvml`) is available\n- Metrics surface in the dashboard table and update live via SSE\n\nTelemetry is best-effort and never blocks the pipeline. To disable GPU probing:\n\n```bash\n# Disable GPU telemetry for sync\nmtbsync sync ... --no-gpu\n\n# Disable GPU telemetry for batch\nmtbsync batch input_dir ... --no-gpu\n```\n\n> **Note:** psutil/pynvml are optional. 
If not installed, GPU/CPU fields are omitted or set to `null`.\n\n### Telemetry Retention (perf.json)\n\nFor large batch runs, you can prune older telemetry artefacts:\n\n**CLI**\n```bash\n# Keep newest 200 perf.json files under ./batch_out\nmtbsync prune-perf --root ./batch_out --keep 200\n\n# Dry-run first to see what would be deleted\nmtbsync prune-perf --root ./batch_out --keep 200 --dry-run\n```\n\n**Dashboard (requires --allow-write)**\n```bash\nmtbsync dashboard --root ./batch_out --allow-write\n```\nOpen the dashboard and use the **Prune perf.json** button to keep the newest N files.\nAll operations are constrained to the selected root.\n\n### Streamlit Telemetry UI\n\nLaunch a rich, interactive telemetry dashboard using Streamlit:\n\n```bash\nmtbsync telemetry-ui --root ./batch_out\n```\n\n**Features:**\n- Interactive time-series charts for FPS, CPU%, GPU utilisation, VRAM, and RSS memory\n- Filtering controls (time range slider, recent N runs)\n- Smoothing toggle for cleaner trend visualization\n- CSV export of filtered telemetry data\n- Optional live updates (experimental)\n- Clean sidebar layout with metric selection\n\n**Installation:**\n\nRequires optional dependency:\n\n```bash\npip install \"mtbsync[telemetry]\"\n```\n\n**Usage example:**\n\n```bash\n# Launch on default port (8761)\nmtbsync telemetry-ui --root ./batch_out\n\n# Custom port\nmtbsync telemetry-ui --root ./batch_out --port 9000\n```\n\nThe Streamlit UI complements the built-in HTML dashboard from `mtbsync dashboard`, providing richer interactivity and data exploration capabilities.\n\n### Side-by-Side Video Comparison UI\n\nReview and annotate two videos side-by-side with synced markers and delta inspection:\n\n```bash\nmtbsync compare-ui --ref ref.mp4 --new new.mp4 --ref-markers ref_markers.csv --timewarp-json timewarp.json\n```\n\n**Features:**\n- Dual video panes (reference + new) for visual comparison\n- Marker overlay with timeline visualization\n- Delta summary statistics (MAE, P90, median, count)\n- Interactive search and filter for markers\n- Annotation system: add notes and labels to markers\n- Export annotations to CSV or JSON\n\n**Installation:**\n\nRequires optional dependency:\n\n```bash\npip install \"mtbsync[compare]\"\n```\n\n**Usage examples:**\n\n```bash\n# Full comparison with all inputs\nmtbsync compare-ui --ref ref.mp4 --new new.mp4 \\\n --ref-markers ref_markers.csv \\\n --new-markers new_markers.csv \\\n --timewarp-json timewarp.json\n\n# Launch from batch output directory\nmtbsync compare-ui --root ./batch_out\n\n# Custom port (default is 8762)\nmtbsync compare-ui --ref ref.mp4 --new new.mp4 \\\n --ref-markers ref_markers.csv --port 9000\n```\n\n**Workflow:**\n1. Configure file paths in the sidebar (or pass via CLI)\n2. Click \"Load Data\" to import videos and markers\n3. Review marker deltas in the timeline chart\n4. Select markers to inspect details and jump to timestamps\n5. Add annotations (notes/labels) for quality review\n6. Export annotations for documentation or QA workflows\n\nThe comparison UI helps validate sync quality by showing actual vs predicted marker times, computing deltas, and allowing visual inspection of both videos at critical moments.\n\n**Tips:**\n- **CLI pre-population**: Arguments passed via CLI (--ref, --new, etc.) automatically pre-fill the sidebar fields through environment variables\n- **No browser auto-open**: All UIs default to headless mode. 
**Tips:**
- **CLI pre-population**: Arguments passed via CLI (`--ref`, `--new`, etc.) automatically pre-fill the sidebar fields through environment variables
- **No browser auto-open**: All UIs default to headless mode; use the VS Code Ports panel to open when ready
- **Multiple instances**: Use different `--port` values to run multiple comparison sessions simultaneously

### Time-series (CPU/GPU/FPS)

From v0.10.7, the dashboard includes lightweight time-series charts for performance metrics:

- **FPS** (frames/sec): computed as `frames_processed / retrieval_sec` when not recorded directly
- **CPU %**: CPU utilisation percentage
- **GPU Util %**: GPU utilisation percentage (requires NVML)
- **VRAM (MB)**: GPU memory used (requires NVML)
- **RSS Memory (MB)**: resident set size

**How it works:**
- Charts pull from the last 500 `perf.json` artefacts (configurable via query parameter)
- Data is ordered oldest→newest for temporal visualisation
- Null values are skipped (shown as gaps in the chart)
- No external JavaScript libraries required: pure inline SVG

**API endpoint:**
```bash
curl "http://localhost:8000/api/perf/history?limit=500&fields=fps,cpu_pct,gpu_util,gpu_mem_mb,rss_mb"
```

**Note:** GPU metrics require optional NVML (`pynvml`) at runtime. If unavailable, GPU charts show gaps or "(no data)".
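The history endpoint works for ad-hoc scripting as well as for the built-in charts. A minimal sketch; the endpoint and query parameters are as documented above, while the response layout (one list of samples per requested field, with nulls for gaps) is an assumption:

```python
import json
from urllib.request import urlopen

URL = "http://localhost:8000/api/perf/history?limit=500&fields=fps,cpu_pct"

with urlopen(URL) as resp:
    history = json.load(resp)

# Assumed shape: {"fps": [...], "cpu_pct": [...]}; nulls mark chart gaps.
fps = [sample for sample in history.get("fps", []) if sample is not None]
if fps:
    print(f"fps: n={len(fps)} min={min(fps):.1f} max={max(fps):.1f}")
```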
---

## Benchmark Quality

Evaluate sync quality across many runs (e.g., batch outputs) and generate comprehensive reports with CSV/JSON/HTML outputs.

### Usage

```bash
# Basic usage: generates CSV + JSON by default
mtbsync benchmark-quality --root batch_out

# Include HTML report with inline charts
mtbsync benchmark-quality --root batch_out --html

# Custom output directory and fields
mtbsync benchmark-quality --root batch_out --out benchmark_reports --fields fps,cpu_pct,gpu_util,gpu_mem_mb,rss_mb

# Limit to the 500 most recent runs
mtbsync benchmark-quality --root batch_out --limit 500 --html

# Show strict evaluation in summary
mtbsync benchmark-quality --root batch_out --strict
```

### What It Does

The benchmark command:
1. **Discovers run folders** under the specified root directory (sorted by modification time, newest first)
2. **Loads artefacts** from each run:
   - `timewarp.json` (required): time-warp fit quality metrics
   - `perf.json` (optional): performance metrics (FPS, CPU, GPU, etc.)
   - `pairs.csv` or `pairs_raw.csv` (optional): fallback for residual computation
3. **Aggregates metrics**:
   - Time-warp quality: `ppm`, `inlier_frac`, `res_mae`, `res_p90`
   - Performance: `fps`, `cpu_pct`, `gpu_util`, `gpu_mem_mb`, `rss_mb`
   - Success rates: `ok` (basic), `strict_ok` (strict criteria)
4. **Generates reports**:
   - **CSV**: Detailed table with all metrics
   - **JSON**: Machine-readable format for automation
   - **HTML**: Self-contained report with KPIs and inline SVG charts

### Strict Quality Criteria

When using `--strict`, runs are evaluated against production-grade gates:
- **Inlier fraction** ≥ 0.50 (50% of pairs support the fit)
- **P90 residual** ≤ 1.0 second (90th percentile within 1s)
- **PPM drift** ≤ 500 (0.05% speed variation)

Runs passing all three criteria are marked `strict_ok=True`.

### HTML Report Features

When `--html` is specified, the report includes:
- **KPI Dashboard**: Total runs, OK%, Strict OK%, Mean P90, Mean PPM
- **Distribution Charts**: Inline SVG histograms for P90 and PPM buckets
- **Detailed Table**: Sortable table with all metrics per run
- **Self-contained**: No external dependencies, works offline

### Example Output

```
Benchmark Summary
  Total runs: 50
  OK: 47 (94.0%)
  Strict OK: 38 (76.0%)
  Mean P90: 0.45s
  Mean PPM: 234

Outputs:
  CSV: benchmark_out/benchmark_quality.csv
  JSON: benchmark_out/benchmark_quality.json
  HTML: benchmark_out/benchmark_quality.html
```

### Quick QA (local)

```bash
./examples/run_benchmark.sh ./batch_out ./benchmark_out && open ./benchmark_out/report.html
```

*On Linux, replace `open` with `xdg-open`.*

### When to Use

- **QA workflows**: Validate batch processing quality before delivering results
- **Parameter tuning**: Compare sync quality across different configurations
- **Continuous integration**: Generate quality reports as part of automated pipelines
- **Performance analysis**: Track FPS, CPU, GPU trends across many runs
- **Documentation**: Generate HTML reports for stakeholders or audits

### Advanced Options

```bash
# Custom pattern for run folders
mtbsync benchmark-quality --root batch_out --pattern "run_2024*"

# Quiet mode (suppress console output, just write files)
mtbsync benchmark-quality --root batch_out --quiet

# Generate only JSON (skip CSV)
mtbsync benchmark-quality --root batch_out --no-csv --json
```

### Exit Codes

- `0`: Success
- `1`: Unexpected error
- `2`: No runs discovered in root directory
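The strict gates reduce to three comparisons over the aggregated metrics listed above. A minimal sketch of the `strict_ok` evaluation as described (not the package's internal code):

```python
def strict_ok(inlier_frac: float, res_p90: float, ppm: float) -> bool:
    """Strict quality gate: all three production-grade criteria must hold."""
    return (
        inlier_frac >= 0.50   # at least 50% of pairs support the fit
        and res_p90 <= 1.0    # 90th-percentile residual within 1 second
        and ppm <= 500        # at most 0.05% speed drift
    )

# E.g. a run with 60% inliers, a 0.45s P90 and 234 ppm drift passes all gates.
assert strict_ok(0.60, 0.45, 234)
```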
---

## Regression & Auto-Gating

Track quality trends across runs and automatically detect performance regressions. The system maintains a rolling history of quality metrics (P90, PPM, inlier fraction, FPS) and compares each new run against a baseline computed from recent history.

### Quick Start

```bash
# Sync with automatic regression tracking (enabled by default)
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv

# Check regression status
mtbsync quality-check --root ./batch_out

# Sync with strict regression gates (fail on severe regression)
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv \
  --fail-on-regression \
  --reg-p90-pct 0.10 --reg-p90-abs 0.10
```

### How It Works

1. **History Tracking**: Each successful sync/batch run appends a quality record to `quality_history.json`:
   - Timestamp, run ID
   - P90 residual (seconds), PPM drift, inlier fraction
   - Processing FPS, optional CPU/GPU metrics

2. **Baseline Computation**: Computes a rolling median from the last N runs (default 50)
   - Requires ≥10 runs for reliable gating
   - Stored separately for each root directory

3. **Regression Detection**: Compares the current run against the baseline:
   - **OK**: All metrics within thresholds
   - **WARN**: Minor regression detected
   - **FAIL**: Severe regression (2× threshold)
   - **LEARNING**: Insufficient history (<10 runs)

4. **Auto-gating**: Writes `regression.json` with status and deltas

### Default Thresholds

| Metric | Warning Threshold | Fail Threshold (2×) |
|--------|------------------|---------------------|
| **P90** | +25% or +0.25s | +50% or +0.50s |
| **PPM** | +25% or +150 ppm | +50% or +300 ppm |
| **Inlier Fraction** | -15% or -0.05 | -30% or -0.10 |

All thresholds are configurable via CLI flags.
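Numerically, the warn cutoff is the baseline plus the larger of the percentage and absolute tolerances, and the fail cutoff doubles that allowance (for the inlier fraction the same allowance is applied downward). A minimal sketch of the arithmetic behind the table above (not the package's internal code):

```python
def cutoffs(baseline: float, pct: float, abs_tol: float) -> tuple[float, float]:
    """Return (warn, fail) cutoffs: the allowed delta is the larger of the
    percentage and absolute tolerances; fail doubles the warn allowance."""
    warn_delta = max(baseline * pct, abs_tol)
    return baseline + warn_delta, baseline + 2 * warn_delta

# P90 defaults: +25% or +0.25s (warn). With a 1.00s baseline this gives
# warn above 1.25s and fail above 1.50s.
print(cutoffs(1.00, 0.25, 0.25))  # (1.25, 1.5)
```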
### CLI Commands

#### sync / batch with Regression Tracking

```bash
# Basic usage (auto-tracking enabled)
mtbsync sync --reference ref.mp4 --new new.mp4 --index ref_index.npz --out pairs.csv

# Disable regression tracking
mtbsync sync ... --regression-off

# Fail on severe regression (exit code 20)
mtbsync sync ... --fail-on-regression

# Custom thresholds
mtbsync sync ... \
  --reg-window 100 \
  --reg-p90-pct 0.10 --reg-p90-abs 0.10 \
  --reg-ppm-pct 0.15 --reg-ppm-abs 100 \
  --reg-inlier-pct 0.10 --reg-inlier-abs 0.05
```

#### quality-check Command

Standalone checker for regression status:

```bash
# Check the last run against the baseline
mtbsync quality-check --root ./batch_out

# With custom thresholds
mtbsync quality-check --root ./batch_out \
  --window 100 \
  --reg-p90-pct 0.10 --reg-p90-abs 0.10

# Fail on severe regression (for CI)
mtbsync quality-check --root ./batch_out --fail-on-regression
```

**Exit Codes:**
- `0`: OK or learning
- `10`: Warning (minor regression)
- `20`: Fail (severe regression)
- `2`: No history found
- `1`: Error

### Dashboard Integration

The dashboard displays a **Regression** card showing:
- Color-coded status badge (green/yellow/red/cyan)
- Current vs baseline metrics table
- Delta percentages for P90, PPM, and inlier fraction
- Regression reasons and baseline window size

Access via:
```bash
mtbsync dashboard --root batch_out --port 8000
```

Then navigate to `http://localhost:8000` and check the Regression card.

### CI/CD Integration

#### Automated Quality Gate Workflow

The repository includes a complete GitHub Actions workflow (`.github/workflows/quality-gate.yml`) that:

1. **Runs on every PR and push to main** (plus daily at 02:17 UTC)
2. **Generates benchmark reports** with CSV/JSON/HTML outputs
3. **Executes quality gate checks** with automatic pass/fail/warn status
4. **Posts PR comments** with quality status and links to artifacts
5. **Uploads artifacts** for download (14-day retention)
6. **Displays a job summary** in the GitHub Actions UI

**What's Checked:**

The quality gate monitors three key metrics against the historical baseline:

- **P90 Residual (seconds)**: 90th-percentile alignment error
  - Measures how well videos sync at challenging points
  - Default thresholds: +25% or +0.25s (warn), +50% or +0.50s (fail)

- **PPM (parts per million)**: Speed drift from 1:1 playback
  - Computed as abs(speed_ratio - 1.0) × 10⁶
  - Default thresholds: +25% or +150 ppm (warn), +50% or +300 ppm (fail)

- **Inlier Fraction**: Proportion of matches supporting the time-warp model
  - Indicates robustness of the alignment
  - Default thresholds: -15% or -0.05 (warn), -30% or -0.10 (fail)

The baseline is computed from the last 50 successful runs (gating requires ≥10 runs).

**Exit Code Mapping:**

| Exit Code | Status | Meaning | CI Behavior |
|-----------|--------|---------|-------------|
| 0 | PASS ✓ | Quality is good | Job passes |
| 10 | WARN ⚠ | Quality degraded but within warning threshold | Job passes (but flagged) |
| 20 | FAIL ✗ | Quality regression exceeds failure threshold | **Job fails** |
| 2 | NO DATA ○ | No history or insufficient data | Job passes (first run) |
| 1 | ERROR ✗ | Command error or exception | **Job fails** |

**Learning Window Protection:**

To prevent false positives during initial data collection, the quality gate includes a learning-window guard:

- **Requirement**: 25+ runs in quality history before allowing a hard fail
- **Behavior**: FAIL status is treated as WARN_LEARNING during the learning period
- **Result**: The job passes (exit 0), but the status is flagged in the PR comment and job summary
- **Display**: History size is shown in the job summary, e.g. "Learning mode: Only 12 runs in history (need 25 for hard-fail)"

This ensures your CI doesn't fail due to unstable baselines while collecting initial quality data.

**Local Reproduction:**

```bash
# Run the quality gate locally (same as CI)
python tools/ci_quality_gate.py

# Or run the full benchmark + quality check
mtbsync benchmark-quality --root ./batch_out --out ./benchmark_out \
  --csv --json --html --strict --limit 500
python tools/ci_quality_gate.py
open ./benchmark_out/report.html
```

**Configurable Thresholds:**

You can override thresholds without code changes using manual workflow dispatch:

1. Go to: Actions → quality-gate → Run workflow
2. Select a **preset** (or choose custom for fine-grained control):
   - **strict**: Production quality gates (tighter tolerances)
   - **normal**: Default balanced thresholds
   - **permissive**: Development mode (relaxed gates)
   - **custom**: Manually adjust all 6 threshold parameters below
3. If preset=custom, adjust the threshold inputs:
   - `reg_p90_pct`: P90 % threshold (warn), default 0.25
   - `reg_p90_abs`: P90 absolute seconds (warn), default 0.25
   - `reg_ppm_pct`: PPM % threshold (warn), default 0.25
   - `reg_ppm_abs`: PPM absolute value (warn), default 150
   - `reg_inlier_pct`: Inlier fraction % threshold (warn), default 0.15
   - `reg_inlier_abs`: Inlier fraction absolute threshold (warn), default 0.05
4. Click "Run workflow"

Fail thresholds are automatically 2× the warn thresholds.
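Outside GitHub Actions, the same gate can be scripted around `quality-check`'s documented exit codes. A minimal sketch that keeps warnings non-fatal:

```python
import subprocess
import sys

# Documented exit codes: 0 = OK/learning, 10 = warn, 20 = fail,
# 2 = no history, 1 = error.
STATUS = {0: "PASS", 10: "WARN", 20: "FAIL", 2: "NO DATA", 1: "ERROR"}

result = subprocess.run(
    ["mtbsync", "quality-check", "--root", "./batch_out", "--fail-on-regression"]
)
code = result.returncode
print(f"quality gate: {STATUS.get(code, 'UNKNOWN')} (exit {code})")
# Pass on OK/learning, warnings, and missing history; hard-fail otherwise.
sys.exit(0 if code in (0, 10, 2) else code)
```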
**Accessing Artifacts:**

After each quality gate run, downloadable artifacts are available for 14 days:

1. Navigate to the workflow run (click the badge or the PR comment link)
2. Scroll to the "Artifacts" section at the bottom
3. Download:
   - `benchmark-report`: Full HTML/CSV/JSON quality reports
   - `quality-gate-summary`: Includes `quality_gate.txt` (plain text) and `quality_gate.json` (machine-readable)

The `quality_gate.json` file enables downstream tooling integration with this schema:
```json
{
  "status": "PASS|WARN|FAIL|NO_DATA|ERROR",
  "exit_code": 0|10|20|2|1,
  "timestamp": "ISO-8601",
  "regression": {
    "severity": "ok|warn|fail",
    "current": {"p90_sec": 1.5, "ppm": 250, "inlier_frac": 0.7},
    "baseline": {"p90_sec": 1.0, "ppm": 200, "inlier_frac": 0.75},
    "deltas": {"p90_sec": 0.5, "ppm": 50, "inlier_frac": -0.05},
    "thresholds": {...},
    "reasons": [...]
  }
}
```
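Downstream tooling can key off this file directly. A minimal sketch against the schema above:

```python
import json

with open("quality_gate.json") as fh:
    gate = json.load(fh)

print(f"status={gate['status']} exit_code={gate['exit_code']}")

regression = gate.get("regression") or {}
for metric, delta in (regression.get("deltas") or {}).items():
    print(f"  {metric}: {delta:+}")
for reason in regression.get("reasons", []):
    print(f"  reason: {reason}")
```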
**PR Comment Example:**

The quality gate workflow automatically posts/updates a single PR comment with:
- Current status (PASS/WARN/FAIL/NO DATA)
- A link to the downloadable benchmark report artifact
- A quality gate summary with deltas vs baseline
- Timestamp of the last update

**Manual Integration Example:**

```yaml
- name: Run batch processing
  run: mtbsync batch input_videos/ --out-dir batch_out

- name: Check for regressions
  run: |
    mtbsync quality-check --root batch_out --fail-on-regression \
      --reg-p90-pct 0.15 --reg-p90-abs 0.15 \
      --reg-ppm-pct 0.20 --reg-ppm-abs 150 \
      --reg-inlier-pct 0.10 --reg-inlier-abs 0.05
```

The job will:
- Exit 0 (pass) if status is OK or LEARNING
- Exit 10 (continue) if status is WARN
- Exit 20 (fail) if status is FAIL

### Quality History Management

History files are automatically capped at the 2000 most recent records. To inspect or clean them manually:

```bash
# View history
cat batch_out/quality_history.json | jq '.'

# View regression status
cat batch_out/regression.json | jq '.'

# Manually reset history (if needed)
# Just delete the file: it will be recreated on the next run
rm batch_out/quality_history.json
```

### Interpreting Results

**Example output from `quality-check`:**

```
Regression Status: WARN

┏━━━━━━━━━━━━━━━┳━━━━━━━━━┳━━━━━━━━━━┳━━━━━━━━━━━━━━━━┓
┃ Metric        ┃ Current ┃ Baseline ┃ Delta          ┃
┡━━━━━━━━━━━━━━━╇━━━━━━━━━╇━━━━━━━━━━╇━━━━━━━━━━━━━━━━┩
│ P90 (s)       │ 1.35    │ 1.00     │ +0.35 (+35.0%) │
│ PPM           │ 245     │ 200      │ +45 (+22.5%)   │
│ Inlier Frac   │ 0.72    │ 0.75     │ -0.03 (-4.0%)  │
└───────────────┴─────────┴──────────┴────────────────┘

  → P90 regression: 1.35s vs baseline 1.00s (+35.0%)

Baseline window: 50 runs (120 total)
Run ID: sync:new_video.mp4
```

**Status Meanings:**
- **OK**: All metrics within thresholds. No action needed.
- **WARN**: Minor regression detected. Review changes; consider tuning parameters.
- **FAIL**: Severe regression. Investigate immediately: likely quality degradation or a bug.
- **LEARNING**: <10 runs in history. Collecting baseline data.

### When to Use

- **Continuous monitoring**: Detect quality drift early across batch runs
- **CI/CD quality gates**: Fail builds on severe performance regressions
- **Parameter tuning**: Validate that config changes don't degrade quality
- **Production deployment**: Ensure new releases maintain quality standards
- **Performance tracking**: Monitor FPS, CPU, GPU trends over time

### Tuning Thresholds

Start with **permissive thresholds** during development:
```bash
--reg-p90-pct 0.25 --reg-p90-abs 0.25        # ±25% or 0.25s
--reg-ppm-pct 0.25 --reg-ppm-abs 150         # ±25% or 150 ppm
--reg-inlier-pct 0.15 --reg-inlier-abs 0.05  # -15% or 0.05
```

Tighten for **production quality gates**:
```bash
--reg-p90-pct 0.10 --reg-p90-abs 0.10        # ±10% or 0.10s (strict)
--reg-ppm-pct 0.15 --reg-ppm-abs 100         # ±15% or 100 ppm
--reg-inlier-pct 0.10 --reg-inlier-abs 0.03  # -10% or 0.03
```

**General rules:**
- Lower percentages = stricter gates
- Smaller absolute values = stricter gates
- Thresholds use OR logic: the allowed delta is `max(baseline × pct, abs)`
- The fail threshold is 2× the warn threshold (configurable via `severe_multiplier`)

---

## Troubleshooting & Quality Tuning

### Interpreting Time-Warp Quality

After running `mtbsync sync`, inspect `timewarp.json` to assess alignment quality:

```json
{
  "ok": true,
  "model": "affine",
  "params": {"a": 1.0, "b": 0.0},
  "ppm": 0,
  "inlier_frac": 0.33,
  "residuals": {"mae": 0.89, "p50": 0.72, "p90": 2.0, "p95": 2.5}
}
```

**Quality indicators:**

- **`a=1, b=0`** → identity warp (no temporal scaling/offset)
- **`ppm=0`** → zero drift from 1:1 playback (consistent with identity)
- **`inlier_frac=0.33`** → only 33% of pairs supported the model during RANSAC
- **`P90=2.0s`** → the 90th-percentile residual is 2 seconds (⚠️ **weak alignment**)
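A script-level version of this checklist, reading the fields shown in the example above (the 0.4 and 1 s heuristics come from the tuning table later in this section):

```python
import json

with open("timewarp.json") as fh:
    tw = json.load(fh)

inlier_frac = tw["inlier_frac"]
p90 = tw["residuals"]["p90"]

if inlier_frac < 0.4:
    print(f"low inlier_frac={inlier_frac:.2f}: weak support for the model")
if p90 > 1.0:
    print(f"high P90={p90:.1f}s: transferred markers may be off by that much")
```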
**Why `ok:true` despite poor quality?**

The default gates are permissive:
- `max_ppm=1000` (allows up to 0.1% speed drift)
- `min_inlier_frac=0.25` (accepts models with only 25% support)

With 33% inliers and an identity warp, this run technically passes. However, **P90=2s residuals mean marker transfer will be inaccurate by ±2 seconds**.

### Tightening Quality Gates

For **stricter alignment requirements**, use these parameters:

```bash
mtbsync sync \
  --reference ref.mp4 \
  --new new.mp4 \
  --index cache/ref_index.npz \
  --out cache \
  --warp-window-sec 0.3 \
  --warp-inlier-thresh 0.05 \
  --warp-min-inlier-frac 0.5 \
  --warp-ransac-iters 1000 \
  --warp-max-ppm 500
```

- `--warp-window-sec 0.3`: tighter acceptance window (default: wider)
- `--warp-inlier-thresh 0.05`: stricter inlier definition (50 ms @ 3 fps)
- `--warp-min-inlier-frac 0.5`: require 50% support (up from 0.25)
- `--warp-ransac-iters 1000`: more iterations for noisy data
- `--warp-max-ppm 500`: tighter drift tolerance (0.05% vs 0.1%)

**Expected outcome:** With P90=2s, these settings will likely produce `ok:false`, which is **useful signal**: the alignment isn't reliable for that pair.

### Visual Inspection

Generate a scatter-plot overlay to see alignment quality:

```bash
# Headless mode (generates a PNG)
mtbsync viewer --headless \
  --ref-markers ref_markers.csv \
  --timewarp-json timewarp.json \
  --out-png alignment_overlay.png

# Interactive GUI (file picker dialogs)
mtbsync viewer
```

The overlay shows:
- **Diagonal line** = perfect alignment
- **Deviations** = temporal misalignment
- **Clusters** = consistent offset regions

### Parameter Tuning Guidelines

| Symptom | Likely Cause | Suggested Fix |
|---------|--------------|---------------|
| Low `inlier_frac` (<0.4) | Poor feature matches, scene cuts | Increase `--warp-ransac-iters 1000`, tighten `--warp-min-inlier-frac 0.5` |
| High `ppm` (>500) | Videos at different speeds | Check source material, reduce `--warp-max-ppm 500` |
| High `P90` (>1s) | Inconsistent temporal alignment | Reduce `--warp-window-sec 0.3`, increase `--warp-inlier-thresh 0.05` |
| Identity warp (`a=1, b=0`) with high residuals | Visual features don't match temporal structure | Add GPS data, or manually verify the videos are from the same track |

### Batch Quality Review

Use the dashboard to quickly review alignment quality across many pairs:

```bash
mtbsync dashboard --root ./batch_output --port 8000
```

In the **Batch Summary** card, look for:
- ✅ `timewarp_ok=true` with `P90 < 0.5s` → good alignment
- ⚠️ `timewarp_ok=true` with `P90 > 1.0s` → weak alignment (investigate)
- ❌ `timewarp_ok=false` → alignment failed gates (expected for dissimilar runs)

---

## Phase 2 (Later Tickets)

- Add **SuperPoint + SuperGlue** path (`--matcher super`).
- RAFT optical flow refinement around matched frames.
- FCPXML export for Final Cut; Premiere Pro XML.
- Manual **anchor pairs** UI to re‑fit the time‑warp interactively.
- SQLite cache for descriptor indexes and run metadata.
- Basic metrics dashboard (pair coverage, residuals, confidence histogram).

---

*Why an iterative ticket flow?*
It enforces small, testable steps with explicit acceptance criteria, which yields cleaner commits, faster debugging, and easier refactors as the project grows.
"bugtrack_url": null,
"license": "MIT",
"summary": "Align MTB GoPro runs and transfer markers using computer vision",
"version": "0.10.16",
"project_urls": null,
"split_keywords": [
"gopro",
" video",
" sync",
" mtb",
" markers",
" opencv"
],
"urls": [
{
"comment_text": null,
"digests": {
"blake2b_256": "f40bc451fd0de4c9c4b5c834cb34859f54592253d8fe205817ee4cdac7529ce2",
"md5": "49a16c87eec4fd4e5d5b816b86270a0b",
"sha256": "b5d76dde17a072d8ca6f6b99d1bf3af5e3ef13cab9a50cdd7b3caa3893100579"
},
"downloads": -1,
"filename": "mtbsync-0.10.16-py3-none-any.whl",
"has_sig": false,
"md5_digest": "49a16c87eec4fd4e5d5b816b86270a0b",
"packagetype": "bdist_wheel",
"python_version": "py3",
"requires_python": ">=3.11",
"size": 96411,
"upload_time": "2025-11-02T12:06:36",
"upload_time_iso_8601": "2025-11-02T12:06:36.753610Z",
"url": "https://files.pythonhosted.org/packages/f4/0b/c451fd0de4c9c4b5c834cb34859f54592253d8fe205817ee4cdac7529ce2/mtbsync-0.10.16-py3-none-any.whl",
"yanked": false,
"yanked_reason": null
},
{
"comment_text": null,
"digests": {
"blake2b_256": "f02c35628931a96a9172daa14eb80ef6a07773fec1e686001b083301b8a9b747",
"md5": "7137fbf7bdec1455e370ac388c7dc649",
"sha256": "905ad69087432aaed55053e560eeddcff162bd6a56a082d560495acd4b3cc27f"
},
"downloads": -1,
"filename": "mtbsync-0.10.16.tar.gz",
"has_sig": false,
"md5_digest": "7137fbf7bdec1455e370ac388c7dc649",
"packagetype": "sdist",
"python_version": "source",
"requires_python": ">=3.11",
"size": 197683,
"upload_time": "2025-11-02T12:06:38",
"upload_time_iso_8601": "2025-11-02T12:06:38.135505Z",
"url": "https://files.pythonhosted.org/packages/f0/2c/35628931a96a9172daa14eb80ef6a07773fec1e686001b083301b8a9b747/mtbsync-0.10.16.tar.gz",
"yanked": false,
"yanked_reason": null
}
],
"upload_time": "2025-11-02 12:06:38",
"github": false,
"gitlab": false,
"bitbucket": false,
"codeberg": false,
"lcname": "mtbsync"
}