Rust Acceleration¶

Added in v3.11.0

Director-AI optionally accelerates hot-path functions via the Rust-based Backfire Kernel. When installed, six functions dispatch to compiled Rust code automatically. When not installed, Python fallbacks run transparently.

Install¶

pip install director-ai[rust]

Or build from source:

cd backfire-kernel
pip install -e crates/backfire-ffi  # requires maturin + Rust toolchain

What Gets Accelerated¶

Verification Signals (VerifiedScorer)¶

Function	Python	Rust	Speedup	Called
`entity_overlap`	~0.5 ms	~0.1 ms	5x	per claim × source match
`numerical_consistency`	~0.05 ms	~0.01 ms	5x	per claim × source match
`negation_flip`	~0.1 ms	~0.02 ms	5x	per claim pair
`traceability`	~2 ms	~0.5 ms	4x	per claim

These run inside VerifiedScorer.verify() for each claim. With 5 claims and 3 source matches each, the aggregate savings are 5-15 ms per response in heuristic mode.

Streaming Kernel¶

Function	Python	Rust	Speedup	Called
`trend_drop`	~0.1 ms	~0.02 ms	5x	per trend check in streaming

BM25 Retrieval¶

Function	Python	Rust	Speedup	Called
BM25 query (1000 docs)	~30 ms	~10 ms	3x	per retrieval in hybrid mode

The RustBM25 class is available directly:

from backfire_kernel import RustBM25

bm25 = RustBM25(k1=1.2, b=0.75)
bm25.add_document("d1", "Water boils at 100 degrees Celsius.")
bm25.add_document("d2", "The speed of light is 299792 km/s.")

results = bm25.query("water temperature", n_results=3)
for doc_id, score in results:
    print(f"{doc_id}: {score:.4f}")

When Rust Helps¶

NLI mode (production): Model inference is 19-200 ms. Python signal extraction adds ~3 ms. Rust saves ~2 ms — marginal improvement.

Heuristic/fallback mode: No model inference. Python signal extraction is 20-50 ms of the total 50-100 ms. Rust saves 15-30 ms — meaningful improvement.

Hybrid retrieval: BM25 inner loop on 1000+ docs benefits from Rust's tight arithmetic loops and no-GIL execution.

Architecture¶

backfire-kernel/crates/
├── backfire-core/src/
│   ├── signals.rs     ← verification signal functions
│   ├── bm25.rs        ← BM25 retrieval engine
│   ├── scorer.rs      ← coherence scorer
│   ├── kernel.rs      ← safety/streaming kernel
│   └── knowledge.rs   ← ground truth store
├── backfire-ffi/src/
│   └── lib.rs         ← PyO3 bindings (rust_entity_overlap, RustBM25, etc.)
└── Cargo.toml         ← workspace config

Python auto-dispatch (no code changes needed):

# verified_scorer.py — automatic
try:
    from backfire_kernel import rust_entity_overlap, ...
    _RUST_SIGNALS = True
except ImportError:
    _RUST_SIGNALS = False

def _entity_overlap(text_a, text_b):
    if _RUST_SIGNALS:
        return rust_entity_overlap(text_a, text_b)
    # Python fallback...

Verification¶

Check if Rust acceleration is active:

from director_ai.core.scoring.verified_scorer import _RUST_SIGNALS
from director_ai.core.runtime.streaming import _RUST_TREND

print(f"Rust signals: {_RUST_SIGNALS}")
print(f"Rust trend:   {_RUST_TREND}")

Running Rust Tests¶

cd backfire-kernel
cargo test                    # all crates
cargo test -p backfire-core   # core only (signals + bm25)
cargo bench -p backfire-core  # scoring benchmarks