Prediction and benchmarking

After generating simulation data, use the prediction script to refit the simulated TACs and compare the estimates against the known ground truth.

Prediction script

Entrypoint:

uv run python predict_method.py \
  simulated-data/synth-run \
  lls_4k_vB \
  v1 \
  predictions

Arguments:

  • dataset_dir: directory containing parquet simulation data
  • method: one of lls_3k, lls_3k_vB, lls_4k, lls_4k_vB
  • version: run label used in the output path
  • save_dir: base directory for predictions
  • --vB: required for the fixed-vB methods lls_3k and lls_4k; the _vB variants fit vB instead

Fixed-vB example:

uv run python predict_method.py \
  simulated-data/synth-run \
  lls_4k \
  v1-fixed-vb \
  predictions \
  --vB 0.05

Output layout

Predictions are written to:

<save_dir>/<method>/<version>/

The parquet output contains:

  • fitted parameters such as K1_fit, k2_fit, k3_fit, k4_fit, vB_fit
  • mse
  • fitted TAC columns tac_fit_000 ... tac_fit_XXX
  • the run identifiers sample_id, method, and version
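A minimal sketch of inspecting that output with pandas. The column values and the commented-out parquet filename below are hypothetical; only the schema (the *_fit columns, mse, and the run identifiers) comes from the layout described above.

```python
import pandas as pd

# Hypothetical rows mimicking the prediction parquet schema.
# In practice, read the real file instead, e.g. (hypothetical filename):
#   preds = pd.read_parquet("predictions/lls_4k_vB/v1/predictions.parquet")
preds = pd.DataFrame({
    "sample_id": [0, 1],
    "method": ["lls_4k_vB", "lls_4k_vB"],
    "version": ["v1", "v1"],
    "K1_fit": [0.12, 0.09],
    "k2_fit": [0.30, 0.25],
    "mse": [1.2e-4, 3.4e-4],
})

# Pick out the fitted-parameter columns by their _fit suffix.
param_cols = [c for c in preds.columns if c.endswith("_fit")]
print(preds[["sample_id", *param_cols, "mse"]])
```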

How to use this for validation

The intended workflow is:

  1. Simulate data with known ground truth.
  2. Fit the same samples with one or more solver variants.
  3. Aggregate per-parameter error metrics.
  4. Inspect representative plots and summary tables before accepting a solver change.
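Step 3 above can be sketched as a join of truth and predictions on sample_id followed by simple error statistics. The frames and the ground-truth column name K1 are hypothetical stand-ins for the simulation and prediction parquet files; the metric names are illustrative, not a fixed convention of this repository.

```python
import pandas as pd

# Hypothetical ground truth (from the simulation data) and fitted
# values (from the prediction output), keyed by sample_id.
truth = pd.DataFrame({"sample_id": [0, 1], "K1": [0.10, 0.10]})
preds = pd.DataFrame({"sample_id": [0, 1], "K1_fit": [0.12, 0.09]})

# Align truth and estimates per sample, then compute signed errors.
merged = truth.merge(preds, on="sample_id")
err = merged["K1_fit"] - merged["K1"]

metrics = {
    "bias": err.mean(),                           # signed mean error
    "mae": err.abs().mean(),                      # mean absolute error
    "rel_mae": (err / merged["K1"]).abs().mean(), # relative absolute error
}
print(metrics)
```

Repeating this per parameter (k2, k3, k4, vB) gives the per-parameter summary table that step 4 inspects.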

The benchmarks/ directory in this repository illustrates the kinds of artifacts worth keeping:

  • metric summary tables
  • error distributions
  • truth-versus-estimate plots
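A truth-versus-estimate plot of the third kind can be sketched with matplotlib as below. The data here is synthetic noise around an identity line, purely for illustration; real plots would use the merged truth and *_fit columns.

```python
import matplotlib
matplotlib.use("Agg")  # render without a display
import matplotlib.pyplot as plt
import numpy as np

# Hypothetical truth/estimate pairs for K1; in practice these come
# from joining the simulation data with the prediction parquet.
rng = np.random.default_rng(0)
truth = rng.uniform(0.05, 0.20, size=50)
est = truth + rng.normal(0.0, 0.01, size=50)

fig, ax = plt.subplots()
ax.scatter(truth, est, s=10)
lims = [truth.min(), truth.max()]
ax.plot(lims, lims, "k--", label="identity")  # perfect-recovery line
ax.set_xlabel("true K1")
ax.set_ylabel("estimated K1")
ax.legend()
fig.savefig("k1_truth_vs_estimate.png")
```

Points hugging the dashed identity line indicate good recovery; systematic offsets or fanning suggest bias or noise sensitivity in the solver.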

Interpreting results

Treat benchmark numbers as validation evidence, not general performance guarantees. They are:

  • representative of the chosen simulation regimes
  • sensitive to the forward model and noise assumptions in the YAML configs
  • useful for regression detection across commits