pub fn bernoulli_compare_batch_1024(buf: &[u8], threshold: u8, out: &mut [u64])
Batch compare 1024 bytes against threshold using best SIMD.