Hacker News Logo

Offline

dayweek

PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

29 points|vmax.ai|
AMavorParker|2hrs