Even (very) noisy LLM evaluators are useful for improving AI agents

5 pointswww.tensorzero.com
GabrielBianconi2day