Which substrate are #LLMs trained on? Which activation vectors do LLMs inherit from this substrate? Which vectors are dampened by #RLHF? And what does that s
Researchers are questioning the foundational data and training processes behind large language models (LLMs). They are investigating the specific substrates these models are trained on and the activation vectors the models inherit from them. They are also exploring how Reinforcement Learning from Human Feedback (RLHF) affects these vectors and what that implies for AI alignment.
IMPACT: Raises fundamental questions about LLM training data and alignment, potentially influencing future research directions.