Alignment pretraining: AI discourse creates self-fulfilling (mis)alignment

28 points | arxiv.org | anigbrowl | 7 hrs