Configuration Reference#
All orchestration happens through YAML configuration files. This page documents every configuration option used by run_training.py and run_eval.py, including which keys are required and which ones have defaults.
How defaults work#
Missing keys fall back to the defaults listed below via
.get(...).Modality blocks (
interpolation,extrapolation,sparse,batch_scaling,uncertainty) are disabled if the entire block is omitted.Evaluation switches (
losses,iterative,gradients,timing,compute,compare) default to False if omitted.Required keys must be provided or the run will fail.
Top-level keys#
Key |
Required |
Default |
Used by |
Notes |
|---|---|---|---|---|
|
Yes |
None |
Training, Eval |
Folder name under |
|
Yes |
None |
Training, Eval |
Ordered list of surrogate class names. |
|
Yes |
None |
Training, Eval |
Int or list aligned with |
|
Yes |
None |
Training, Eval |
Int or list aligned with |
|
Yes |
None |
Training, Eval |
List of device strings ( |
|
No |
|
Training |
Random seed for training. |
|
No |
|
Training, Eval |
Extra data-loading logs. |
|
No |
|
Training |
Enables best-checkpoint saving per model. |
Dataset block#
Key |
Required |
Default |
Used by |
Notes |
|---|---|---|---|---|
|
Yes |
None |
Training, Eval |
Folder inside |
|
No |
|
Training, Eval |
Log10 transform the data. |
|
No |
|
Training, Eval |
Log10 transform the parameters (if present). |
|
No |
|
Training, Eval |
|
|
No |
|
Training, Eval |
Normalize each species independently. |
|
No |
|
Training, Eval |
Lower bound before log transform ( |
|
No |
|
Training |
Down-samples data (smoke tests). |
|
No |
|
Eval |
Used for plotting/log-time axes. |
|
No |
|
Training, Eval |
Load surrogate-specific defaults from dataset configs. |
Modality blocks (optional)#
All modality blocks are disabled if omitted. If enabled: true, the corresponding list/value is required.
Block |
Required |
Default |
Keys when enabled |
|---|---|---|---|
|
No |
disabled |
|
|
No |
disabled |
|
|
No |
disabled |
|
|
No |
disabled |
|
|
No |
disabled |
|
Evaluation switches#
All switches default to false if omitted.
Key |
Default |
Notes |
|---|---|---|
|
|
Plots training and test losses. |
|
|
Iterative roll-out evaluation. |
|
|
Gradient vs error analysis. |
|
|
Inference timing benchmarks. |
|
|
Memory/parameter count benchmarks. |
|
|
Cross-surrogate comparison plots/tables. |
Metric options#
Key |
Default |
Notes |
|---|---|---|
|
|
Denominator floor for relative error. |
|
|
Percentile used in error summaries. |
Full example config (defaults)#
This example includes every key with the default behavior applied. Required values are filled with common placeholders.
# Required
training_id: "example_run"
surrogates: ["MultiONet"]
batch_size: [65536]
epochs: [200]
devices: ["cpu"]
# Optional (defaults)
seed: 42
verbose: false
checkpoint: false
dataset:
name: "osu2008"
log10_transform: true
log10_transform_params: true
normalise: "minmax"
normalise_per_species: false
tolerance: null
subset_factor: 1
log_timesteps: false
use_optimal_params: true
# Modalities (disabled unless enabled)
interpolation:
enabled: false
intervals: [2, 3, 4]
extrapolation:
enabled: false
cutoffs: [50, 60, 70]
sparse:
enabled: false
factors: [2, 4, 8]
batch_scaling:
enabled: false
sizes: ["1/16", "1/8", "1/4", "1/2"]
uncertainty:
enabled: false
ensemble_size: 5
# Evaluation switches (default false)
losses: false
iterative: false
gradients: false
timing: false
compute: false
compare: false
# Metric options
relative_error_threshold: 0.0
error_percentile: 99