pub fn dot_f64_dispatch(a: &[f64], b: &[f64]) -> f64
Dot product of two f64 slices using the best available SIMD path.
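
The entry above does not say how the dispatch is implemented. One common pattern in Rust is to check CPU features at runtime and fall back to a scalar loop, which could look roughly like the sketch below. Everything in it is an assumption for illustration: the names `dot_f64_dispatch_sketch`, `dot_scalar`, and `dot_avx` are hypothetical, the choice of AVX on x86_64 as the only SIMD path is arbitrary, and truncating to the shorter slice on a length mismatch is a guess, not documented behavior.

```rust
// Hypothetical sketch of runtime SIMD dispatch; not the actual implementation.

/// Portable scalar fallback. Assumption: mismatched lengths are
/// truncated to the shorter slice (this is what `zip` does).
fn dot_scalar(a: &[f64], b: &[f64]) -> f64 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

/// AVX path: processes four f64 lanes per iteration, then a scalar tail.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx")]
unsafe fn dot_avx(a: &[f64], b: &[f64]) -> f64 {
    use std::arch::x86_64::*;

    let n = a.len().min(b.len());
    let chunks = n / 4;
    let mut acc = _mm256_setzero_pd();
    for i in 0..chunks {
        // Unaligned loads: slices carry no 32-byte alignment guarantee.
        let va = _mm256_loadu_pd(a.as_ptr().add(i * 4));
        let vb = _mm256_loadu_pd(b.as_ptr().add(i * 4));
        acc = _mm256_add_pd(acc, _mm256_mul_pd(va, vb));
    }

    // Horizontal sum of the four accumulator lanes.
    let mut lanes = [0.0f64; 4];
    _mm256_storeu_pd(lanes.as_mut_ptr(), acc);
    let mut sum = lanes[0] + lanes[1] + lanes[2] + lanes[3];

    // Remaining 0..=3 elements that did not fill a full vector.
    for i in chunks * 4..n {
        sum += a[i] * b[i];
    }
    sum
}

/// Picks the AVX path when the CPU supports it, otherwise the scalar one.
pub fn dot_f64_dispatch_sketch(a: &[f64], b: &[f64]) -> f64 {
    #[cfg(target_arch = "x86_64")]
    {
        if is_x86_feature_detected!("avx") {
            // SAFETY: AVX support was verified at runtime just above.
            return unsafe { dot_avx(a, b) };
        }
    }
    dot_scalar(a, b)
}
```

Because the SIMD path accumulates in four independent lanes, its summation order differs from the scalar loop, so results for general inputs may differ in the last few bits between paths. On non-x86_64 targets the sketch compiles down to the scalar fallback only.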