How Small Can You Go? LoRA Fine-Tuning 270M-8B Models for Merchant Information Extraction in Financial Transactions
Researchers explored fine-tuning smaller language models for financial transaction merchant information extraction, aiming to reduce the costs associated with larger models. Their study evaluated 24 variants across four model families, including Gemma, Qwen, Aya, and LLaMA, focusing on accuracy, throughput, and training cost. Findings indicate that models like Qwen 3.5 4B and even the 0.8B version offer competitive performance with significantly fewer parameters and better latency, making them viable alternatives for production deployment. AI
IMPACT Demonstrates that smaller, more efficient models can achieve comparable performance to larger ones for specific tasks, potentially lowering operational costs and increasing accessibility.