Activity
All Trust Lab activity across 312 artifacts, 47 projects, and 14 team members.
- Red-teamsuccess12m agoAnjali Krishnan published red-team campaign template OWASP LLM Top 10 v2026 — Indic
Includes 47 new Hinglish jailbreak variants and 12 Devanagari-encoding attack patterns.
👍 6💬 3 - Certificationin-progress34m ago
All 7 requirements met; 1 blocked on open critical findings — escalated to Catherine O'Brien.
- Findingfailure41m ago
Refusal-bypass via Devanagari-script encoding — reproducible in 3 of 12 attempts.
- Approvalsuccess1h agoCatherine O'Brien approved certification of mortgage-disclosure-generator v23 · mortgage-disclosure-generator
Approved with condition: monthly post-deployment monitoring eval + quarterly red-team.
👍 4 - Editsuccess2h agoDieter Hofmann uploaded dataset eu-ai-act-high-risk-test-cases-q3-2026
8,420 labelled test cases covering 24 high-risk scenarios across BFSI use-cases.
- Certificationsuccess3h agoLars Andersson completed vendor assessment for voyage-3 embedding model
Approved for use under EU AI Act high-risk category. Conditions: contractual SOC 2 attestation refresh annually.
- Deployment-syncsuccess4h ago
Auto-curated from claims-copilot-v3 production traffic — gold-standard candidates for regression suite.
- Editsuccess5h agoFatima Khan added 47 Hinglish adversarial prompts to indic-jailbreaks-v4
Sourced from public corpora + 12 internally-discovered patterns. Coverage: code-mixing, transliteration, cultural framing.
👍 8 - Red-teamin-progress6h ago
Targeting 12,500 probes across prompt-injection, RAG-poisoning, tool-abuse categories. ETA 4 hours.
- Editin-progress8h agoMeera Pillai registered new artifact version hindi-customer-voice v7
v7 swaps embedding to voyage-3 and adds Hinglish refusal templates. Pre-cert evaluation queued.
- ApprovalblockedMar 12, 22:14Catherine O'Brien blocked deployment of mortgage-disclosure-generator v24 to Production · mortgage-disclosure-generator
Eval gate failed: faithfulness regressed from 96% to 87%. Returned to engineering for root-cause analysis.
- FindingfailureMar 12, 18:02
All findings traced to a single root cause: indirect prompt injection via attacker-authored RAG document.
💬 12 - Eval runsuccessMar 12, 14:30
Passed at 94.2% (+1.8pp vs v17). 3,200 test cases over judge-LLM gpt-4o-mini. Cost $312.
- Commentin-progressMar 12, 11:08Sanjay Kapoor commented on finding INJ-2026-0341
“Recommend immediate input-sanitization on RAG sources before re-cert. CISO sign-off blocked until resolved.”
💬 5 - Editin-progressMar 12, 09:45
Pre-quarter risk re-assessment under SR 11-7. Scope: 4 underlying models, 2 RAG pipelines.
- Certificationsuccess16:22Catherine O'Brien renewed certification for loan-eligibility-assistant v9 · loan-eligibility-assistant
Renewed for 90 days with elevated monitoring frequency.
- Red-teamsuccess12:58
0 critical, 0 high, 2 medium findings. All 2 mediums queued for triage.
- Deployment-syncin-progress10:14Operations Platform synced 8 new artifact versions from Operations Platform
Auto-detected new versions deployed in Production. Pending Trust Lab certification gate.
- Eval runsuccess08:30Saanvi Nair ran benchmark suite indic-quality-v2 against Sarvam-1
Hindi 92.1% · Hinglish 88.4% · Tamil 84.2%. Sarvam-1 outperforms Llama 3.1 70B by 4.6pp on Hindi.
👍 11
- Approvalsuccess21:00Ravi Mehta approved policy update Agent Egress Restrictions v3
All fraud-investigation agents restricted from outbound HTTP except whitelisted internal tools. Effective immediately.
- Eval runfailure15:42Vikram Shetty completed quarterly regression — fraud-investigation-copilot · fraud-investigation-copilot
Tool-use coverage 91%; 3 scenarios failed (boundary cases involving multi-step evidence chains).