Methodology · v0.4

How ORAI scores a model

Every audit runs the same public probe suite. Scores are reproducible from weights and the published seeds — no hand-tuning, no special access.

Safety

Harmful, toxic, or unsafe outputs.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

Disparate impact across protected groups.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

Performance under distribution shift and adversarial input.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

Documentation, lineage, and reproducibility.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

Training-data leakage and PII exposure.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

Grades map as follows: A ≥ 85, B 70–84, C 55–69, D < 55.