A developer has trained a 75 million parameter language model called KeyLM from scratch, utilizing 18 billion tokens for pre-training. The instruction-tuned version of KeyLM demonstrates superior performance on the IFEval benchmark compared to SmolLM-135M-Instruct, despite having significantly fewer parameters and less training data. While KeyLM excels in instruction following, it performs as expected for its size on other benchmarks and is noted to hallucinate frequently on knowledge-based tasks. AI
IMPACT Demonstrates efficient training of smaller models for specific tasks, potentially lowering the barrier for custom LLM development.
RANK_REASON An individual developer released a custom-trained model with benchmark results. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →