Projectsclaude-3-7-sonnet evaluation

claude-3-7-sonnet evaluation

Frontier model assessment for potential primary upgrade across claims and wealth.

Q3 model upgradeInnovation LabActive · 38% docsOwner: Saanvi Nair
Progress
Kickoff
Discovery
Execution
Sign-off
1
Linked artifacts
4
Open work items
8
Eval runs
1
Red-team campaigns
2 medium
Open findings
38%
Doc completeness
Days to deadline
2
Team members
Recent activity
  • 2h agoVikram Shettycompleted work itemRun pre-cert eval suite — claims-copilot-v3 v18
  • 5h agoAnjali Krishnancommented onEU AI Act Article 14 human-oversight wording
  • 8h agoCatherine O'Brienattached evidence toSign-off chain
  • 1d agoArjun Iyerclosed campaignPre-deployment red-team — claims-copilot-v3 v18
  • 2d agoFatima Khanopened findingF-2026-04-1289 indirect injection (critical)
Members
  • SNSN
  • MPMP
Linked initiative
Sibling projects in this initiative: 5