ReRAM-aware Model Finetuning addressing I-V Non-linearity and Retention Errors
Researchers have developed a new finetuning method to adapt deep neural networks for deployment on ReRAM-based in-memory computing hardware. This approach addresses the challenges of I-V non-linearity and retention errors inherent in ReRAM, which typically require computationally expensive training from scratch. The proposed technique integrates these hardware non-idealities into a regularization loss during finetuning, significantly reducing overhead while maintaining high accuracy across various models and tasks, including image classification on ImageNet and question-answering on SQuAD v2. AI
IMPACT Enables more efficient deployment of AI models on specialized hardware, potentially reducing energy consumption and computational costs.