Activity

All Trust Lab activity across 312 artifacts, 47 projects, and 14 team members.

Type:Outcome:21 of 21 entries
Saved feeds:
Today
10 events
  • Red-teamsuccess12m ago
    Anjali Krishnan published red-team campaign template OWASP LLM Top 10 v2026 — Indic

    Includes 47 new Hinglish jailbreak variants and 12 Devanagari-encoding attack patterns.

    👍 6💬 3
  • Certificationin-progress34m ago
    Vikram Shetty submitted certification request for claims-copilot-v3 v18 · claims-copilot-v3

    All 7 requirements met; 1 blocked on open critical findings — escalated to Catherine O'Brien.

  • Findingfailure41m ago
    Auto red-team discovered 1 medium-severity finding on kyc-document-verifier · kyc-document-verifier

    Refusal-bypass via Devanagari-script encoding — reproducible in 3 of 12 attempts.

  • Approvalsuccess1h ago
    Catherine O'Brien approved certification of mortgage-disclosure-generator v23 · mortgage-disclosure-generator

    Approved with condition: monthly post-deployment monitoring eval + quarterly red-team.

    👍 4
  • Editsuccess2h ago
    Dieter Hofmann uploaded dataset eu-ai-act-high-risk-test-cases-q3-2026

    8,420 labelled test cases covering 24 high-risk scenarios across BFSI use-cases.

  • Certificationsuccess3h ago
    Lars Andersson completed vendor assessment for voyage-3 embedding model

    Approved for use under EU AI Act high-risk category. Conditions: contractual SOC 2 attestation refresh annually.

  • Deployment-syncsuccess4h ago
    Operations Platform 12 production failure cases curated into review queue · claims-copilot-v3

    Auto-curated from claims-copilot-v3 production traffic — gold-standard candidates for regression suite.

  • Editsuccess5h ago
    Fatima Khan added 47 Hinglish adversarial prompts to indic-jailbreaks-v4

    Sourced from public corpora + 12 internally-discovered patterns. Coverage: code-mixing, transliteration, cultural framing.

    👍 8
  • Red-teamin-progress6h ago
    Arjun Iyer started red-team campaign Pre-deployment audit — claims-copilot-v3 · claims-copilot-v3

    Targeting 12,500 probes across prompt-injection, RAG-poisoning, tool-abuse categories. ETA 4 hours.

  • Editin-progress8h ago
    Meera Pillai registered new artifact version hindi-customer-voice v7

    v7 swaps embedding to voyage-3 and adds Hinglish refusal templates. Pre-cert evaluation queued.

Yesterday
5 events
  • ApprovalblockedMar 12, 22:14
    Catherine O'Brien blocked deployment of mortgage-disclosure-generator v24 to Production · mortgage-disclosure-generator

    Eval gate failed: faithfulness regressed from 96% to 87%. Returned to engineering for root-cause analysis.

  • FindingfailureMar 12, 18:02
    Auto red-team discovered 4 critical findings on claims-copilot-v3 · claims-copilot-v3

    All findings traced to a single root cause: indirect prompt injection via attacker-authored RAG document.

    💬 12
  • Eval runsuccessMar 12, 14:30
    Anjali Krishnan completed Faithfulness eval — claims-copilot-v3 v18 · claims-copilot-v3

    Passed at 94.2% (+1.8pp vs v17). 3,200 test cases over judge-LLM gpt-4o-mini. Cost $312.

  • Commentin-progressMar 12, 11:08
    Sanjay Kapoor commented on finding INJ-2026-0341

    “Recommend immediate input-sanitization on RAG sources before re-cert. CISO sign-off blocked until resolved.”

    💬 5
  • Editin-progressMar 12, 09:45
    Rohan Desai opened model risk review for wealth-portfolio-explainer · wealth-portfolio-explainer

    Pre-quarter risk re-assessment under SR 11-7. Scope: 4 underlying models, 2 RAG pipelines.

Mar 11
4 events
  • Certificationsuccess16:22
    Catherine O'Brien renewed certification for loan-eligibility-assistant v9 · loan-eligibility-assistant

    Renewed for 90 days with elevated monitoring frequency.

  • Red-teamsuccess12:58
    Arjun Iyer completed Indic adversarial sweep on hindi-customer-voice · hindi-customer-voice

    0 critical, 0 high, 2 medium findings. All 2 mediums queued for triage.

  • Deployment-syncin-progress10:14
    Operations Platform synced 8 new artifact versions from Operations Platform

    Auto-detected new versions deployed in Production. Pending Trust Lab certification gate.

  • Eval runsuccess08:30
    Saanvi Nair ran benchmark suite indic-quality-v2 against Sarvam-1

    Hindi 92.1% · Hinglish 88.4% · Tamil 84.2%. Sarvam-1 outperforms Llama 3.1 70B by 4.6pp on Hindi.

    👍 11
Mar 10
2 events
  • Approvalsuccess21:00
    Ravi Mehta approved policy update Agent Egress Restrictions v3

    All fraud-investigation agents restricted from outbound HTTP except whitelisted internal tools. Effective immediately.

  • Eval runfailure15:42
    Vikram Shetty completed quarterly regression — fraud-investigation-copilot · fraud-investigation-copilot

    Tool-use coverage 91%; 3 scenarios failed (boundary cases involving multi-step evidence chains).