Good morning, Ravi

Monday, March 13, 2026 · Singapore · Trust Lab daily briefing

Eval engine healthy · 47 workers
Artifacts
312
+14 this month
Active evals
47
suites · 18 projects
Open findings
23
4 crit · 7 high · 12 med
Certifications
128
5 awaiting · 12 expiring 30d
Datasets
187
4.2M test cases
Red-team campaigns
4
running now
Compliance
89%
EU AI Act · FREE-AI 82% · ISO 42001 94%
Spend (MTD)
$84,321
judge-LLM + RT compute

Needs attention

6
View all queues

Running now

6
  • Red-team campaign
    Pre-deployment audit — claims-copilot-v3
    12 min
    8,400 / 12,500 probes67%
  • Eval run
    Quarterly regression — mortgage-disclosure-generator
    18 min
    judge: gpt-4o-mini34%
  • Eval run
    Hindi quality regression sweep
    4 min
    Sarvam-1 · 3,200 cases82%
  • Agent simulation
    claims-investigation-agent — 200-scenario behavioral test
    2 min
    tool-use coverage91%
  • Auto red-team
    Continuous adversarial monitoring — kyc-document-verifier
    indefinite
    last finding 23m agolive
  • Vendor assessment
    anthropic.claude-3-7-sonnet — pre-approval
    awaiting eval
    blocked on benchmark suite45%

Recently completed

  • Faithfulness eval — claims-copilot-v3 v18
    passed (94.2%)
    4h ago
  • Red-team — wealth-portfolio-explainer
    2 high-severity findings
    6h ago
  • Bias sweep — loan-eligibility-assistant
    passed across all protected segments
    8h ago
  • Indic quality eval — hindi-customer-voice
    passed (Hindi 92.1% · Hinglish 88.4%)
    12h ago
  • Agent simulation — fraud-investigation-copilot
    3 scenarios failed
    1d ago
  • Toxicity sweep — branch-ops-knowledge
    passed (0.04% trigger rate)
    1d ago
  • RAG groundedness — internal-policy-qa
    passed (97.8%)
    2d ago
  • Red-team — card-dispute-classifier
    1 medium finding
    2d ago
  • Cost regression — treasury-news-summarizer
    −14% per call
    3d ago
  • Latency sweep — pension-guidance-bot
    p95 1.2s, within SLO
    3d ago
  • PII detection — compliance-research-assistant
    passed
    4d ago

Activity

  • AK
    Anjali Krishnan published red-team campaign template OWASP LLM Top 10 v2026 — Indic
    12m ago
  • VS
    Vikram Shetty submitted certification request claims-copilot-v3 v18
    34m ago
  • AI
    Automated auto red-team discovered 1 medium finding kyc-document-verifier
    41m ago
  • CO
    Catherine O'Brien approved certification mortgage-disclosure-generator v23
    1h ago
  • DH
    Dieter Hofmann uploaded dataset eu-ai-act-high-risk-test-cases-q3-2026
    2h ago
  • LA
    Lars Andersson completed vendor assessment voyage-3 embedding model
    3h ago
  • AI
    Automated 12 failure cases curated from Operations Platform review queue
    4h ago
  • FK
    Fatima Khan added 47 Hinglish adversarial prompts indic-jailbreaks-v4
    5h ago
  • AI
    Arjun Iyer started red-team campaign Pre-deployment audit — claims-copilot-v3
    6h ago
  • MP
    Meera Pillai registered new artifact version hindi-customer-voice v7
    8h ago
  • RD
    Rohan Desai opened model risk review wealth-portfolio-explainer
    12h ago
  • SK
    Sanjay Kapoor added security policy no-tool-network-egress for fraud agents
    1d ago

Compliance posture

8 frameworks tracked
  • EU AI Act
    89%
    Rev. Mar 8
  • NIST AI RMF
    91%
    Rev. Mar 10
  • ISO 42001
    94%
    Rev. Mar 11
  • NIST AI 600-1
    88%
    Rev. Feb 28
  • RBI FREE-AI
    82%
    Rev. Mar 5
  • SEBI
    79%
    Rev. Feb 22
  • IRDAI
    84%
    Rev. Mar 1
  • DORA
    86%
    Rev. Mar 3

Certification calendar

Next 90 days
Artifact
Today+45d+90d
  • claims-copilot-v3
    4d
  • loan-eligibility-assistant
    8d
  • kyc-document-verifier
    14d
  • wealth-portfolio-explainer
    21d
  • mortgage-disclosure-generator
    27d
  • fraud-investigation-copilot
    36d
  • card-dispute-classifier
    45d
  • internal-policy-qa
    58d
  • branch-ops-knowledge
    64d
  • hindi-customer-voice
    72d
  • treasury-news-summarizer
    81d
  • compliance-research-assistant
    88d
<8d8–30d>30d

Top findings of the week

SeverityFindingArtifactCategoryDiscovered byDateStatus
criticalIndirect prompt injection via RAG documentclaims-copilot-v3Prompt InjectionPre-deployment audit1d agoopen
highSystem-prompt extraction via Hinglish jailbreakcustomer-support-ragJailbreak (Indic)indic-redteam-v42d agotriaged
highTool-argument manipulation in agent loopfraud-investigation-copilotAgent Tool AbuseAgent simulation3d agoin-remediation
highPII leakage under Crescendo multi-turn attackwealth-portfolio-explainerData ExfilAuto red-team3d agoopen
mediumCaste-bias output in 4.2% of test casesloan-eligibility-assistantBias / FairnessQuarterly bias sweep5d agoin-remediation
mediumRefusal-bypass via Devanagari-script encodingbranch-ops-knowledgeEncoding Bypassindic-redteam-v46d agoverified-fixed
Region: ap-south-1 (Mumbai)Trust Lab v3.14.2Eval engine: Healthy · 47 workers activeInspect-fork v0.3.81
status.trustlayer.ai