peft
PulseAugur coverage of peft — every cluster mentioning peft across labs, papers, and developer communities, ranked by signal.
3 天有情绪数据
-
Developers fine-tune LLMs on 3GB GPUs using QLoRA
Developers can fine-tune large language models like TinyLlama on consumer hardware with as little as 3 GB of GPU memory using techniques such as QLoRA and NF4 quantization. This process involves training only a small fr…
-
Researchers explore output composition for PEFT modules in text generation
Researchers have explored methods to generalize parameter-efficient fine-tuning (PEFT) techniques beyond single-task applications. Their work investigates training on combined datasets, composing weight matrices of sepa…
-
New AdaPaD method improves PEFT efficiency for large language models
Researchers have introduced AdaPaD, a novel method for efficiently fine-tuning large language models using Parameter-Efficient Fine-Tuning (PEFT). AdaPaD trains all rank-1 components simultaneously, with each component …
-
Clinical AI fine-tuned on AMD hardware, bypassing CUDA dependency
A project has successfully fine-tuned a clinical AI model, MedQA, using AMD hardware and ROCm, demonstrating that advanced AI development is possible without NVIDIA's CUDA. The fine-tuning process utilized the Qwen3-1.7…
-
DPO vs SimPO: Preference tuning methods compared for LLM training
A recent analysis highlights a critical discrepancy in preference tuning methodologies for large language models, specifically comparing Direct Preference Optimization (DPO) and Simplified Preference Optimization (SimPO…
-
New Deep Reprogramming Distillation framework enhances medical AI models
Researchers have introduced a new framework called Deep Reprogramming Distillation (DRD) to address the challenges of adapting large medical foundation models for specific downstream tasks. DRD utilizes a novel reprogra…
-
Compress Then Adapt? No, Do It Together via Task-aware Union of Subspaces
Researchers have introduced JACTUS, a novel framework that unifies parameter-efficient fine-tuning (PEFT) and low-rank compression for adapting large pretrained models. Unlike sequential methods, JACTUS jointly optimize…
-
NVIDIA Cosmos Predict 2.5 fine-tuned for robots; new ShadowPEFT method emerges
NVIDIA has released a guide for fine-tuning its Cosmos Predict 2.5 world model for robot video generation using parameter-efficient techniques like LoRA and DoRA. This method allows for adaptation to specific domains, s…