Skip to content

Validation Deficiencies — scpn-control

This document records physics and model-data alignment deficiencies revealed by tightened validation thresholds in v0.15.0. These thresholds were tightened to meet publication standards (e.g., JOSS, Nuclear Fusion).

1. Equilibrium Self-Consistency (Lane 1)

Threshold: psi_nrmse < 2.5% (Tightened from 5.0%) Current result: 12/13 pass (92%) — PASS Remaining outlier: sparc_1300.eqdsk remains outside the tightened threshold because it is a low-current start-up equilibrium with anomalous q-profile normalisation.

Former result: 7/13 pass (54%) — FAIL

Identified Deficiencies

  1. DIII-D High-Beta and Shaped Shots:
  2. diiid_hmode_1p5MA.geqdsk (6.98%)
  3. diiid_hmode_2MA.geqdsk (6.81%)
  4. diiid_negdelta.geqdsk (8.82%)
  5. Root Cause: The validation proxy for NRMSE computes the \(\Delta^* \psi\) operator without subtracting the plasma source term (\(R p' + FF'/\mu_0 R\)). In highly shaped or high-beta DIII-D shots, the neglected source term leads to a large residual, which the proxy misinterprets as non-consistency.
  6. Impact: Tightened thresholds reveal that the current validation script is inadequate for highly active plasma regions.

  7. SPARC Start-up Phase (sparc_1300.eqdsk):

  8. NRMSE: 7.84%
  9. q95: -36.67 (Anomalous)
  10. Root Cause: This equilibrium represents a start-up or transition phase where the magnetic axis is poorly defined or the plasma current is extremely low. The \(q\)-profile in the reference GEQDSK is likely unphysical or uses a different normalization convention.

Mitigation Implemented (v2.1)

  • validate_real_shots.py reconstructs the toroidal current density \(J_\phi\) from GEQDSK \(p'\) and \(FF'\) profiles.
  • The validation lane now uses a true, source-balanced GS residual check: \(|| \Delta^* \psi + \mu_0 R J_\phi || / \max(||\mu_0 R J_\phi||, ||\psi||_\mathrm{range})\).

2. Disruption Prediction (Lane 3)

Threshold: recall > 80%, FPR < 25% Result: Recall 100%, FPR 0% — PASS

Note: Although the current test set passes, the synthetic ROC analysis (Task V4) suggests that the FPR may degrade under more diverse noise conditions.

3. Transport Scaling (Lane 2)

Threshold: 80% within 2-sigma Result: 95% within 2-sigma — PASS

The IPB98(y,2) scaling remains robust across the ITPA database for the current parameter range.