实体 Lora

Lora

PulseAugur coverage of Lora — every cluster mentioning Lora across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

134

90 天内 134

发布 · 30天

90 天内 0

论文 · 30天

111

90 天内 111

层级分布 · 90 天

frontier release 1
significant 1
research 50
tool 74
commentary 4
meme 4

关系

used by Vít 90%
instance of Low Rank Adaptation 90%
used by large-language models 70%
instance of Direct Preference Optimization 70%
used by magazine 70%
used by Glue 70%
used by supervised fine-tuning 70%
developed large-language models 70%
used by Bert 70%
used by Dopravní podnik Ostrava 70%
used by Transformer Reinforcement Learning 70%
instance of peft 70%

时间线

2026-05-12 research_milestone A paper is published detailing findings on parameter placement in LoRA for fine-tuning. 来源

情绪 · 30 天

16 天有情绪数据

最近 · 第 2/7 页 · 共 134 条

RESEARCH · CL_40249 · May 20 · 07:14

开发者使用 QLoRA 在 3GB GPU 上微调 LLM

开发者可以使用 QLoRA 和 NF4 量化等技术，在仅需 3 GB GPU 内存的消费级硬件上微调 TinyLlama 等大型语言模型。此过程仅训练模型的一小部分参数，显著降低了计算需求。尽管该过程可能很复杂，在调试、提示格式化和依赖管理方面存在挑战，但它为独立开发者构建复杂的 AI 应用程序提供了一条途径。
TOOL · CL_40799 · May 19 · 12:39

New AR1-ZO method boosts LoRA fine-tuning with Zeroth-Order optimization

Researchers have developed AR1-ZO, a novel method for fine-tuning large language models using Zeroth-Order optimization and Low-Rank Adaptation (LoRA). This technique addresses the challenge of effectively increasing Lo…
RESEARCH · CL_39194 · May 19 · 00:00

Qwen 3.7 支持可调式审查，NVIDIA 微调机器人视频生成

Qwen 发布了其语言模型 3.7 版本，该版本包含一个可修改的政治审查专用电路，且修改后不损失事实知识。NVIDIA 的 Cosmos Predict 2.5 模型现在可以使用高效的 LoRA/DoRA 方法进行机器人视频生成的微调。此外，新的 HRM-Text 模型为预训练基础模型提供了更易于访问且成本效益更高的方法。
TOOL · CL_37646 · May 18 · 20:54

NamelessDogLab releases character LoRAs for AI web UIs

NamelessDogLab is releasing character LoRAs for web UI use to Patreon supporters. These LoRAs are designed to enable the creation of specific characters within AI-driven web interfaces. The project encourages support th…
TOOL · CL_37619 · May 18 · 19:31

New infrastructure enables one base AI model to serve millions of LoRA policies

Researchers have developed a new infrastructure that allows a single base AI model to efficiently serve millions of LoRA (Low-Rank Adaptation) policies. This approach avoids the need to copy weights for each policy, sig…
TOOL · CL_35797 · May 17 · 18:40

Fine-tuning Qwen2.5 with LoRA yields structured, not necessarily correct, outputs

This article explores the process of fine-tuning the Qwen2.5 model using the LoRA technique. It demonstrates that while fine-tuning can lead to more structured outputs, this does not necessarily equate to improved reaso…
TOOL · CL_35086 · May 17 · 00:01

LLM Fine-Tuning Explained: SFT, RAG, and Data Preparation

This blog post explains the process and necessity of fine-tuning large language models (LLMs) for specific tasks. It differentiates fine-tuning from Retrieval-Augmented Generation (RAG), stating that fine-tuning is best…
RESEARCH · CL_32530 · May 14 · 16:36

EverAnimate method improves long-form animated video generation

Researchers have developed EverAnimate, a novel post-training method designed to improve the generation of long-form animated videos. This technique addresses issues like visual quality degradation and inconsistent char…
TOOL · CL_33395 · May 14 · 00:19

PreFT method boosts LLM serving throughput with prefill-only finetuning

Researchers have developed PreFT, a novel parameter-efficient finetuning method designed to improve the efficiency of serving personalized large language models. PreFT optimizes for serving throughput by applying adapte…
TOOL · CL_36930 · May 13 · 19:25

PEML method optimizes LLM prompts and weights for multi-task learning

Researchers have introduced PEML, a new method for parameter-efficient multi-task learning in large language models. PEML optimizes both continuous prompts and model weights simultaneously, addressing limitations of exi…
RESEARCH · CL_29549 · May 13 · 04:00

New GPart method offers efficient LLM fine-tuning

Researchers have introduced GPart, a novel parameter-efficient fine-tuning method that bypasses the low-rank bottleneck inherent in LoRA. GPart utilizes a single isometric partition matrix to map a low-dimensional train…
TOOL · CL_29395 · May 12 · 14:46

LoRA parameter placement impacts GRPO fine-tuning, not SFT

Researchers have investigated the parameter placement problem within Low-Rank Adaptation (LoRA) for fine-tuning large language models. Their study reveals that for Supervised Fine-Tuning (SFT), the specific placement of…
RESEARCH · CL_36937 · May 12 · 09:41

New research explores efficient LLM alignment and federated fine-tuning

Researchers are developing new methods for efficient large language model (LLM) alignment and fine-tuning. One approach, P2D, uses task-sensitive attention heads to guide data selection and parameter pruning, achieving …
TOOL · CL_49396 · May 11 · 18:59

新研究分解LoRA架构，识别出路由器重写是关键性能驱动因素

研究人员开发了一种方法，将进化式LoRA混合架构分解为三个关键组成部分：路由器重写、每域评估范围和生命周期机制。他们在约1.5亿参数的基底上进行的实验表明，路由器重写是大部分性能提升的原因，具体表现为+0.0426 nat的平衡对数困惑度（log-PPL）增益。然而，生命周期机制被发现对性能有净负面影响，而评估范围在种子分辨率下未显示出显著影响。
TOOL · CL_28343 · May 11 · 15:44

New AdaPaD method improves PEFT efficiency for large language models

Researchers have introduced AdaPaD, a novel method for efficiently fine-tuning large language models using Parameter-Efficient Fine-Tuning (PEFT). AdaPaD trains all rank-1 components simultaneously, with each component …
TOOL · CL_28266 · May 11 · 00:04

Fashion Florence model extracts structured clothing attributes

Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing…
TOOL · CL_27578 · May 10 · 21:50

EvoPref算法通过进化优化增强语言模型对齐

研究人员开发了EvoPref，这是一种新颖的多目标进化算法，旨在改进大型语言模型（LLM）的对齐。与可能导致偏好崩溃和狭窄行为模式的传统基于梯度的方法不同，EvoPref维护了针对有用性、无害性和诚实性进行优化的适配器多样化种群。这种方法显著增强了偏好覆盖范围并降低了崩溃率，同时实现了具有竞争力的对齐质量，确立了进化优化作为多样化LLM对齐的可行范式。
TOOL · CL_25332 · May 10 · 19:15

LoRA fine-tuning reduces LLM parameter updates

Low-Rank Adaptation (LoRA) is a technique for efficiently fine-tuning large language models. Instead of modifying all model weights, LoRA freezes the original weights and introduces small, trainable matrices to learn ad…
TOOL · CL_24634 · May 10 · 04:16

LoRA fine-tuning explained with matrix-level detail

This article provides a detailed, number-by-number explanation of how LoRA (Low-Rank Adaptation) works for fine-tuning large language models. It aims to go beyond simply stating what LoRA achieves and instead illustrate…
TOOL · CL_24209 · May 9 · 13:29

LoRA Explained: Mathematical Intuition Behind Low-Rank Adaptation

This article delves into the mathematical underpinnings of Low-Rank Adaptation (LoRA), a technique used for efficient fine-tuning of large language models. It explains how LoRA leverages the concept of low intrinsic dim…

开发者使用 QLoRA 在 3GB GPU 上微调 LLM

New AR1-ZO method boosts LoRA fine-tuning with Zeroth-Order optimization

Qwen 3.7 支持可调式审查，NVIDIA 微调机器人视频生成

NamelessDogLab releases character LoRAs for AI web UIs

New infrastructure enables one base AI model to serve millions of LoRA policies

Fine-tuning Qwen2.5 with LoRA yields structured, not necessarily correct, outputs

LLM Fine-Tuning Explained: SFT, RAG, and Data Preparation

EverAnimate method improves long-form animated video generation

PreFT method boosts LLM serving throughput with prefill-only finetuning

PEML method optimizes LLM prompts and weights for multi-task learning

New GPart method offers efficient LLM fine-tuning

LoRA parameter placement impacts GRPO fine-tuning, not SFT

New research explores efficient LLM alignment and federated fine-tuning

新研究分解LoRA架构，识别出路由器重写是关键性能驱动因素

New AdaPaD method improves PEFT efficiency for large language models

Fashion Florence model extracts structured clothing attributes

EvoPref算法通过进化优化增强语言模型对齐

LoRA fine-tuning reduces LLM parameter updates

LoRA fine-tuning explained with matrix-level detail

LoRA Explained: Mathematical Intuition Behind Low-Rank Adaptation