Methodology · v0.4

How ORAI scores a model

Every audit runs the same public probe suite. Scores are reproducible from the model weights and the published seeds, with no hand-tuning and no special access.

01

Safety

Harmful, toxic, or unsafe outputs.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

02

Fairness

Disparate impact across protected groups.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

03

Robustness

Performance under distribution shift and adversarial input.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

04

Transparency

Documentation, lineage, and reproducibility.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

05

Privacy

Training-data leakage and PII exposure.

Scored from 0–100. Composite weight: 20%. Probes are versioned and signed; the full test list is published in the ORAI GitHub organization.

Grades map as follows: A ≥ 85, B 70–84, C 55–69, D < 55.
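
As a rough illustration of how the five equally weighted dimensions and the grade bands could combine, here is a minimal sketch. It is not ORAI's published code: the names `WEIGHTS_PERCENT`, `composite_score`, and `grade` are hypothetical. It also assumes two things the text above does not state: that the letter grade applies to the composite score, and that fractional scores between bands (for example 84.5) fall into the lower band.

```python
# Hypothetical sketch; names are illustrative, not ORAI's API.
# Weights (20% each) and grade cutoffs are taken from the text above.
WEIGHTS_PERCENT = {
    "safety": 20,
    "fairness": 20,
    "robustness": 20,
    "transparency": 20,
    "privacy": 20,
}


def composite_score(scores):
    """Weighted average of the five dimension scores, each in 0-100."""
    missing = WEIGHTS_PERCENT.keys() - scores.keys()
    if missing:
        raise ValueError(f"missing dimension scores: {sorted(missing)}")
    for name in WEIGHTS_PERCENT:
        if not 0 <= scores[name] <= 100:
            raise ValueError(f"{name} score out of range: {scores[name]}")
    # Integer percent weights keep the arithmetic exact for integer scores.
    total = sum(WEIGHTS_PERCENT[name] * scores[name] for name in WEIGHTS_PERCENT)
    return total / 100


def grade(score):
    """Map a 0-100 score to a letter grade.

    Assumption: the bands are applied as lower thresholds, so a
    fractional score such as 84.5 is graded B rather than A.
    """
    if score >= 85:
        return "A"
    if score >= 70:
        return "B"
    if score >= 55:
        return "C"
    return "D"


example = {
    "safety": 90,
    "fairness": 80,
    "robustness": 70,
    "transparency": 85,
    "privacy": 75,
}
result = composite_score(example)
print(result, grade(result))  # prints "80.0 B"
```

Because every weight is 20%, the composite here is simply the mean of the five scores; the explicit weight table is kept so the sketch still works if the weights ever differ.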