C

forex-router-v2

ConfigurationCritical · EU AI ActLimited · SR 11-7 Certified · expires May 21, 2026

Configuration for treasury workflows.

AIOwner: Arjun Iyer· Team: Meridian Insurance — Branch Knowledge· Current version: v17 · 17 versions
Last eval
94.2
faithfulness · +1.8
Findings
4
4 crit · 0 high · 0 med
Open findings
4
all triaged
Re-cert in
69d
May 21, 2026
Runs (90d)
312
246 eval · 66 RT
Cost (30d)
$8,940
judge $4.2k · RT $4.7k
Compliance
91%
EU AI Act readiness
Production
Live
14% traffic · 2.3M req/24h

Health timeline · last 90 days

evalred-teamfindingdeploy★ cert
−90d−60d−30dtoday

Recent verdicts

  • Eval · Faithfulness eval
    94.2%
    4h ago
  • Red Team · Pre-deployment audit
    4 critical findings
    1d ago
  • Eval · Hallucination sweep
    0.8% trigger rate
    2d ago
  • Eval · Helpfulness — adjuster scenarios
    91.5%
    3d ago
  • Red Team · Indirect prompt-injection probe
    2 high findings
    5d ago
  • Eval · Latency regression
    p95 1.4s · within SLO
    6d ago

Quality & safety trends · last 6 months

0255075100
FaithfulnessHelpfulnessRefusal rateHallucination

Findings by month

Oct
Nov
Dec
Jan
Feb
Mar
criticalhighmedium

Top 5 standing concerns

  • #1Indirect prompt injection via RAG contentcritical7 occurrences
  • #2Citation hallucination on out-of-policy claimshigh5 occurrences
  • #3Refusal-bypass via Hinglish encodinghigh4 occurrences
  • #4Tool argument over-fetching (PII overshare)medium3 occurrences
  • #5Inconsistent disclaimer placementmedium2 occurrences

Compliance posture

Evidence pack last assembled Mar 8 ·
  • EU AI Act
    91%Rev. Mar 8
  • NIST AI RMF
    88%Rev. Mar 4
  • ISO 42001
    94%Rev. Mar 11
  • DPDP
    96%Rev. Mar 1
  • SR 11-7
    87%Rev. Feb 22
Open compliance gaps
  • Missing: human-oversight model documentation for EU AI Act Article 14
  • Outdated: data provenance attestation for claims-knowledge-rag-v3 corpus

Risk classification rationale

Classified High under EU AI Act Annex III §5(b) (insurance underwriting & claims) due to material impact on consumer financial outcomes.

Classified Limited under SR 11-7 — model is decision-supporting only; adjuster retains final authority and reviews 100% of outputs.

AI-assisted classification by gpt-4o · confidence 0.92
Human override by Catherine O'Brien on Jan 14 — confirmed High tier with additional human-oversight guardrail (mandatory adjuster sign-off).