software by Stafford Williamsstafford williams
home blog devlog notes links talks apps about

2025-02-16 9:02am

ai, llm, evals

Building a SNAP LLM eval - the first write-up in a series about our process of building an “eval” — evaluation — to assess how well AI models perform on prompts

  • If this was helpful, please share:

  • Reddit icon
  • Y Combinator icon
  • Twitter icon
  • LinkedIn icon
  • software by Stafford Williams
about
  • LinkedIn icon
  • Icon Letter Mail image/svg+xml Openclipart icon_letter_mail 2010-01-29T13:59:32 https://openclipart.org/detail/29117/icon_letter_mail-by-jean_victor_balin jean_victor_balin icon letter mail mailing unchecked