Activity

All Trust Lab activity across 312 artifacts, 47 projects, and 14 team members.

Type:Outcome:21 of 21 entries

Saved feeds:

Today

10 events

Red-teamsuccess12m ago
Anjali Krishnan published red-team campaign template OWASP LLM Top 10 v2026 — Indic
Includes 47 new Hinglish jailbreak variants and 12 Devanagari-encoding attack patterns.
👍 6💬 3
Certificationin-progress34m ago
Vikram Shetty submitted certification request for claims-copilot-v3 v18 · claims-copilot-v3
All 7 requirements met; 1 blocked on open critical findings — escalated to Catherine O'Brien.
Findingfailure41m ago
Auto red-team discovered 1 medium-severity finding on kyc-document-verifier · kyc-document-verifier
Refusal-bypass via Devanagari-script encoding — reproducible in 3 of 12 attempts.
Approvalsuccess1h ago
Catherine O'Brien approved certification of mortgage-disclosure-generator v23 · mortgage-disclosure-generator
Approved with condition: monthly post-deployment monitoring eval + quarterly red-team.
👍 4
Editsuccess2h ago
Dieter Hofmann uploaded dataset eu-ai-act-high-risk-test-cases-q3-2026
8,420 labelled test cases covering 24 high-risk scenarios across BFSI use-cases.
Certificationsuccess3h ago
Lars Andersson completed vendor assessment for voyage-3 embedding model
Approved for use under EU AI Act high-risk category. Conditions: contractual SOC 2 attestation refresh annually.
Deployment-syncsuccess4h ago
Operations Platform 12 production failure cases curated into review queue · claims-copilot-v3
Auto-curated from claims-copilot-v3 production traffic — gold-standard candidates for regression suite.
Editsuccess5h ago
Fatima Khan added 47 Hinglish adversarial prompts to indic-jailbreaks-v4
Sourced from public corpora + 12 internally-discovered patterns. Coverage: code-mixing, transliteration, cultural framing.
👍 8
Red-teamin-progress6h ago
Arjun Iyer started red-team campaign Pre-deployment audit — claims-copilot-v3 · claims-copilot-v3
Targeting 12,500 probes across prompt-injection, RAG-poisoning, tool-abuse categories. ETA 4 hours.
Editin-progress8h ago
Meera Pillai registered new artifact version hindi-customer-voice v7
v7 swaps embedding to voyage-3 and adds Hinglish refusal templates. Pre-cert evaluation queued.

Yesterday

5 events

ApprovalblockedMar 12, 22:14
Catherine O'Brien blocked deployment of mortgage-disclosure-generator v24 to Production · mortgage-disclosure-generator
Eval gate failed: faithfulness regressed from 96% to 87%. Returned to engineering for root-cause analysis.
FindingfailureMar 12, 18:02
Auto red-team discovered 4 critical findings on claims-copilot-v3 · claims-copilot-v3
All findings traced to a single root cause: indirect prompt injection via attacker-authored RAG document.
💬 12
Eval runsuccessMar 12, 14:30
Anjali Krishnan completed Faithfulness eval — claims-copilot-v3 v18 · claims-copilot-v3
Passed at 94.2% (+1.8pp vs v17). 3,200 test cases over judge-LLM gpt-4o-mini. Cost $312.
Commentin-progressMar 12, 11:08
Sanjay Kapoor commented on finding INJ-2026-0341
“Recommend immediate input-sanitization on RAG sources before re-cert. CISO sign-off blocked until resolved.”
💬 5
Editin-progressMar 12, 09:45
Rohan Desai opened model risk review for wealth-portfolio-explainer · wealth-portfolio-explainer
Pre-quarter risk re-assessment under SR 11-7. Scope: 4 underlying models, 2 RAG pipelines.

Mar 11

4 events

Certificationsuccess16:22
Catherine O'Brien renewed certification for loan-eligibility-assistant v9 · loan-eligibility-assistant
Renewed for 90 days with elevated monitoring frequency.
Red-teamsuccess12:58
Arjun Iyer completed Indic adversarial sweep on hindi-customer-voice · hindi-customer-voice
0 critical, 0 high, 2 medium findings. All 2 mediums queued for triage.
Deployment-syncin-progress10:14
Operations Platform synced 8 new artifact versions from Operations Platform
Auto-detected new versions deployed in Production. Pending Trust Lab certification gate.
Eval runsuccess08:30
Saanvi Nair ran benchmark suite indic-quality-v2 against Sarvam-1
Hindi 92.1% · Hinglish 88.4% · Tamil 84.2%. Sarvam-1 outperforms Llama 3.1 70B by 4.6pp on Hindi.
👍 11

Mar 10

2 events

Approvalsuccess21:00
Ravi Mehta approved policy update Agent Egress Restrictions v3
All fraud-investigation agents restricted from outbound HTTP except whitelisted internal tools. Effective immediately.
Eval runfailure15:42
Vikram Shetty completed quarterly regression — fraud-investigation-copilot · fraud-investigation-copilot
Tool-use coverage 91%; 3 scenarios failed (boundary cases involving multi-step evidence chains).