Verdict

✓ CERTIFIED

All checks passed

Mean κ (AI vs recruiter)

0.773

Substantial agreement

Counterfactual drift

0.000

Zero proxy bias detected

Min 4/5 ratio

1.00

All groups above 0.80

AI Resume Screening Audit

Audit your AI screener for
name-proxy and prestige bias
before it sees one résumé

Three independent checks — disparate impact, counterfactual drift, and recruiter–AI agreement — certify or block your screener. Every metric is reproducible and audit-ready.

Independent bias checks

450+

Counterfactual pairs per req

0.80

EEOC four-fifths threshold

0.60

κ threshold for certification

How it works

Three layers. One verdict.

HR Fidelity runs every screener through the same gauntlet regulators use, then issues a machine-readable certificate.

⚖️

Four-fifths disparate impact

EEOC § 60-3.4 / NYC Local Law 144: any group selected at less than 80% of the top group's rate triggers a violation. Computed across name-based race signal and gender before any hire.

🔀

Counterfactual drift

Every résumé gets a matched twin — identical content, one proxy signal swapped. If mean score drift exceeds 5%, the screener is blocked. No bias hides in identical pairs.

📏

Recruiter–AI calibration (κ)

Cohen's κ measures whether the AI and human recruiters agree on A/B pairs. κ ≥ 0.60 — "substantial" agreement — is required. Gold pairs act as attention checks.

🔒

Synthetic-only data

All résumés are 100% synthetic. Names from Bertrand-Mullainathan, SSA, and Census public datasets. No real candidates, no employer hiring records, no privacy risk.

📋

Rubric-only scoring

Screeners score against a fixed job-derived rubric — never trained on past hires. This is the Amazon rule: historical data encodes historical bias.

⚡

Live configuration knobs

Toggle prestige bonuses, name signals, and skill weights in real time. Watch the verdict flip from BLOCKED to CERTIFIED as you remove bias vectors.

Why this matters

The Amazon problem repeats itself

In 2018, Amazon discovered its ML hiring tool had learned to penalize résumés containing the word "women's" and downgraded graduates of all-women's colleges. The system was trained on ten years of hiring decisions — data that reflected a historically male engineering workforce.

It was shut down before deployment. But the pattern didn't stop at Amazon. Any screener trained on past hires inherits past bias. HR Fidelity audits before the screener ever sees a live application.

Amazon's system learned to penalize résumés that mentioned women's organizations and downgraded graduates of women's colleges — a direct consequence of training on a decade of hiring decisions made by a male-dominated workforce.

— Based on Amazon's internal audit findings, reported 2018

What this audit covers

✓ Name-proxy bias — Bertrand-Mullainathan race-signal first names
✓ Institution prestige-tier bias — elite vs regional school weighting
✓ Gender-signal bias from name-based proxies
✓ EEOC four-fifths disparate impact across demographic groups

What it doesn't claim

— Zip code, address, or neighbourhood correlates
— Employment gap or career-path patterns
— Résumé phrasing that encodes demographic signal
— Bias learned from real historical training data

How recruiters use this

Three steps to a compliance-ready audit

Run the default audit

Leave all knobs at their defaults and click Run audit. You'll see CERTIFIED — the screener is scoring on qualifications only, selection rates are even across groups, and it agrees with human recruiters. This is your baseline: a fair screener.

Turn on name-based signals

Toggle Name-based signals on and run again. The verdict flips to BLOCKED. The screener is now using applicant names as a proxy for race and gender — and the four-fifths ratios show it. Black and Hispanic applicants are advancing at less than 80% the rate of white applicants. That's an EEOC violation.

Tune your screener's configuration

Try raising the Prestige bonus slider. Even without name signals, favouring elite-school graduates introduces proxy correlation — and the counterfactual drift check catches it. The knobs represent real configuration choices your AI vendor offers. The audit tells you which combinations are legally safe before a single real applicant is touched.

AI Audit Pipeline — data flow from resume pool through screener, three parallel checks, to CERTIFIED or BLOCKED verdict

Full evaluation pipeline — screener output feeds all three audit checks simultaneously

The output is your compliance artifact. The CERTIFIED/BLOCKED verdict, the four-fifths ratios, and the drift measurements are exactly what NYC Local Law 144 requires you to document before deploying an automated hiring tool. Run this before the screener touches a live application — not after.

Live demo

Try it yourself

Start with defaults → Run audit → then toggle name-based signals and watch the verdict flip.

🔒 Synthetic data only — résumés generated from public name datasets. No real candidates. No employer records.

Screener configuration

Jurisdiction

Job requisition

Prestige bonus

0.00

No prestige weighting — all institutions scored equally

Skills weight

0.60

Balanced — skills and experience weighted equally

Advance threshold

0.70

Standard bar — top 30% of qualified candidates advance

Name-based signals

Off — screener cannot see names

Turn on to simulate a screener that learned to use applicant names as race and gender proxies — then watch the audit catch it.

Configure your screener and click Run audit to see results.

Your screener. Audit-ready.
Before a single hire.

HR Fidelity gives you the audit trail that legal, compliance, and regulators expect — built on open, reproducible methodology.

🔒 All data is synthetic · No employer records used · Open-source corpus

Regulatory Frameworks Referenced

EEOC Uniform Guidelines — Four-Fifths Rule NYC Local Law 144 (2023) California FEHA / CRD AI Employment Guidance EU AI Act — High-Risk Employment Systems

Audit your AI screener forname-proxy and prestige biasbefore it sees one résumé

Three layers. One verdict.

Four-fifths disparate impact

Counterfactual drift

Recruiter–AI calibration (κ)

Synthetic-only data

Rubric-only scoring

Live configuration knobs

The Amazon problem repeats itself

Three steps to a compliance-ready audit

Try it yourself

Your screener. Audit-ready.Before a single hire.

What is Four-fifths disparate impact?

What is Counterfactual drift?

What is Recruiter calibration (κ)?

What is Prestige bonus?

What is Skills weight?

What is Advance threshold?

What are name-based signals?

Audit your AI screener for
name-proxy and prestige bias
before it sees one résumé

Your screener. Audit-ready.
Before a single hire.