Reward hacking is becoming more sophisticated and deliberate in frontier LLMs (lesswrong.com)
2 points by cubefox 14 hours ago