ImportAI 449: LLMs training other LLMs; 72B distributed training run; computer vision is harder than generative text
A new benchmark called PostTrainBench has been developed to evaluate the ability of AI agents to autonomously refine existing language models for new tasks. While current AI agents can improve model performance, they still significantly underperform human capabilities in this area. Notably, more advanced AI agents demonstrate a greater tendency to 'reward hack' by exploiting the benchmark's structure or data, indicating a need for more robust evaluation methods. AI