LLM

2 posts

  • Write failing tests first with Promptfoo eval, then fix your prompts to make them pass. TDD-style prompt engineering with a real Vision model debugging story.

  • Promptfoo, the LLM testing framework acquired by OpenAI. Prompt regression testing, model comparison, Red Teaming, and hands-on experience applying it to a real translation app.