This paper introduces empirical scaling laws for compressing Large Language Models (LLMs) for domain-specific applications, focusing on quantitative finance. It quantifies how performance in both specialized and general knowledge domains is affected by dataset size, compression ratio, and supervision format during iterative pruning. The research highlights that while in-domain task quality degrades predictably, general knowledge benchmarks collapse much earlier, with chain-of-thought supervision proving crucial for recovering erased general knowledge. AI
IMPACT Provides a framework for optimizing LLM deployment in resource-constrained, domain-specific applications.
RANK_REASON The cluster contains a research paper detailing empirical scaling laws for LLM distillation.
AI-generated summary · Google Gemini · from 2 sources. How we write summaries →