Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
debugnik
3 months ago
|
parent
|
context
|
favorite
| on:
Claude Code daily benchmarks for degradation track...
That's why we're setting up adversarial benchmarks to test if they are doing the thing they promised not to do, because we totally trust them.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: