PulseAugur
EN
LIVE 20:31:31

Reddit user seeks EMA on LoRA adapter applications

A user on the r/MachineLearning subreddit is seeking information about the successful application of Exponential Moving Average (EMA) techniques specifically with LoRA (Low-Rank Adaptation) adapters. They are interested in scenarios where the EMA adapter functions as a self-teacher, generating soft labels for the trainable adapter. The user references a paper on on-policy self-distillation that uses EMA for the teacher but involves full fine-tuning, and is looking for empirical results demonstrating this concept working with LoRA or similar parameter-efficient fine-tuning methods. AI

IMPACT This query highlights a specific area of interest in efficient model fine-tuning techniques.

RANK_REASON User query on a technical topic within a subreddit.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

Reddit user seeks EMA on LoRA adapter applications

COVERAGE [1]

  1. r/MachineLearning TIER_1 English(EN) · /u/South-Conference-395 ·

    EMA on LoRA ? [R]

    <!-- SC_OFF --><div class="md"><p>Hi guys</p> <p>Does anyone know of papers where EMA on LoRA adapters has been used successfully?</p> <p>Im interested in cases where the EMA adapter acts as a self-teacher generating soft labels for the trainable adapter.</p> <p>On-policy self-di…