Hacker News
Refusals (LLM Leaderboard) (mandoline.ai)
2 points by kmckiern on Oct 30, 2024 | past
Comparing Refusal Behavior Across Top Language Models (mandoline.ai)
2 points by kmckiern on Oct 23, 2024 | past
Show HN: Mandoline – Custom LLM Evaluations for Real-World Use Cases (mandoline.ai)
2 points by kmckiern on Sept 11, 2024 | past
