claude-3-7-sonnet evaluation
Frontier model assessment for potential primary upgrade across claims and wealth.
Q3 model upgrade · Innovation Lab · Active · 38% docs · Owner: Saanvi Nair
Progress
- ✓ Kickoff: —
- ◐ Discovery: —
- ○ Execution: —
- ○ Sign-off: —
Linked artifacts: 1
Open work items: 4
Eval runs: 8
Red-team campaigns: 1
Open findings: 2 medium
Doc completeness: 38%
Days to deadline: —
Team members: 2
Recent activity
- 2h ago: Vikram Shetty completed work item "Run pre-cert eval suite — claims-copilot-v3 v18"
- 5h ago: Anjali Krishnan commented on "EU AI Act Article 14 human-oversight wording"
- 8h ago: Catherine O'Brien attached evidence to "Sign-off chain"
- 1d ago: Arjun Iyer closed campaign "Pre-deployment red-team — claims-copilot-v3 v18"
- 2d ago: Fatima Khan opened finding "F-2026-04-1289 indirect injection (critical)"
Members
- SN
- MP