PulseAugur
实时 11:34:36
实体 Lora

Lora

PulseAugur coverage of Lora — every cluster mentioning Lora across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
134
90 天内 134
发布 · 30天
0
90 天内 0
论文 · 30天
111
90 天内 111
层级分布 · 90 天
关系
时间线
  1. 2026-05-12 research_milestone A paper is published detailing findings on parameter placement in LoRA for fine-tuning. 来源
情绪 · 30 天

16 天有情绪数据

最近 · 第 2/7 页 · 共 134 条
  1. RESEARCH · CL_40249 ·

    开发者使用 QLoRA 在 3GB GPU 上微调 LLM

    开发者可以使用 QLoRA 和 NF4 量化等技术,在仅需 3 GB GPU 内存的消费级硬件上微调 TinyLlama 等大型语言模型。此过程仅训练模型的一小部分参数,显著降低了计算需求。尽管该过程可能很复杂,在调试、提示格式化和依赖管理方面存在挑战,但它为独立开发者构建复杂的 AI 应用程序提供了一条途径。

  2. TOOL · CL_40799 ·

    New AR1-ZO method boosts LoRA fine-tuning with Zeroth-Order optimization

    Researchers have developed AR1-ZO, a novel method for fine-tuning large language models using Zeroth-Order optimization and Low-Rank Adaptation (LoRA). This technique addresses the challenge of effectively increasing Lo…

  3. RESEARCH · CL_39194 ·

    Qwen 3.7 支持可调式审查,NVIDIA 微调机器人视频生成

    Qwen 发布了其语言模型 3.7 版本,该版本包含一个可修改的政治审查专用电路,且修改后不损失事实知识。NVIDIA 的 Cosmos Predict 2.5 模型现在可以使用高效的 LoRA/DoRA 方法进行机器人视频生成的微调。此外,新的 HRM-Text 模型为预训练基础模型提供了更易于访问且成本效益更高的方法。

  4. TOOL · CL_37646 ·

    NamelessDogLab releases character LoRAs for AI web UIs

    NamelessDogLab is releasing character LoRAs for web UI use to Patreon supporters. These LoRAs are designed to enable the creation of specific characters within AI-driven web interfaces. The project encourages support th…

  5. TOOL · CL_37619 ·

    New infrastructure enables one base AI model to serve millions of LoRA policies

    Researchers have developed a new infrastructure that allows a single base AI model to efficiently serve millions of LoRA (Low-Rank Adaptation) policies. This approach avoids the need to copy weights for each policy, sig…

  6. TOOL · CL_35797 ·

    Fine-tuning Qwen2.5 with LoRA yields structured, not necessarily correct, outputs

    This article explores the process of fine-tuning the Qwen2.5 model using the LoRA technique. It demonstrates that while fine-tuning can lead to more structured outputs, this does not necessarily equate to improved reaso…

  7. TOOL · CL_35086 ·

    LLM Fine-Tuning Explained: SFT, RAG, and Data Preparation

    This blog post explains the process and necessity of fine-tuning large language models (LLMs) for specific tasks. It differentiates fine-tuning from Retrieval-Augmented Generation (RAG), stating that fine-tuning is best…

  8. RESEARCH · CL_32530 ·

    EverAnimate method improves long-form animated video generation

    Researchers have developed EverAnimate, a novel post-training method designed to improve the generation of long-form animated videos. This technique addresses issues like visual quality degradation and inconsistent char…

  9. TOOL · CL_33395 ·

    PreFT method boosts LLM serving throughput with prefill-only finetuning

    Researchers have developed PreFT, a novel parameter-efficient finetuning method designed to improve the efficiency of serving personalized large language models. PreFT optimizes for serving throughput by applying adapte…

  10. TOOL · CL_36930 ·

    PEML method optimizes LLM prompts and weights for multi-task learning

    Researchers have introduced PEML, a new method for parameter-efficient multi-task learning in large language models. PEML optimizes both continuous prompts and model weights simultaneously, addressing limitations of exi…

  11. RESEARCH · CL_29549 ·

    New GPart method offers efficient LLM fine-tuning

    Researchers have introduced GPart, a novel parameter-efficient fine-tuning method that bypasses the low-rank bottleneck inherent in LoRA. GPart utilizes a single isometric partition matrix to map a low-dimensional train…

  12. TOOL · CL_29395 ·

    LoRA parameter placement impacts GRPO fine-tuning, not SFT

    Researchers have investigated the parameter placement problem within Low-Rank Adaptation (LoRA) for fine-tuning large language models. Their study reveals that for Supervised Fine-Tuning (SFT), the specific placement of…

  13. RESEARCH · CL_36937 ·

    New research explores efficient LLM alignment and federated fine-tuning

    Researchers are developing new methods for efficient large language model (LLM) alignment and fine-tuning. One approach, P2D, uses task-sensitive attention heads to guide data selection and parameter pruning, achieving …

  14. TOOL · CL_49396 ·

    新研究分解LoRA架构,识别出路由器重写是关键性能驱动因素

    研究人员开发了一种方法,将进化式LoRA混合架构分解为三个关键组成部分:路由器重写、每域评估范围和生命周期机制。他们在约1.5亿参数的基底上进行的实验表明,路由器重写是大部分性能提升的原因,具体表现为+0.0426 nat的平衡对数困惑度(log-PPL)增益。然而,生命周期机制被发现对性能有净负面影响,而评估范围在种子分辨率下未显示出显著影响。

  15. TOOL · CL_28343 ·

    New AdaPaD method improves PEFT efficiency for large language models

    Researchers have introduced AdaPaD, a novel method for efficiently fine-tuning large language models using Parameter-Efficient Fine-Tuning (PEFT). AdaPaD trains all rank-1 components simultaneously, with each component …

  16. TOOL · CL_28266 ·

    Fashion Florence model extracts structured clothing attributes

    Researchers have developed Fashion Florence, a vision-language model based on Florence-2, specifically fine-tuned for extracting structured fashion attributes from images. This model can generate a JSON object detailing…

  17. TOOL · CL_27578 ·

    EvoPref算法通过进化优化增强语言模型对齐

    研究人员开发了EvoPref,这是一种新颖的多目标进化算法,旨在改进大型语言模型(LLM)的对齐。与可能导致偏好崩溃和狭窄行为模式的传统基于梯度的方法不同,EvoPref维护了针对有用性、无害性和诚实性进行优化的适配器多样化种群。这种方法显著增强了偏好覆盖范围并降低了崩溃率,同时实现了具有竞争力的对齐质量,确立了进化优化作为多样化LLM对齐的可行范式。

  18. TOOL · CL_25332 ·

    LoRA fine-tuning reduces LLM parameter updates

    Low-Rank Adaptation (LoRA) is a technique for efficiently fine-tuning large language models. Instead of modifying all model weights, LoRA freezes the original weights and introduces small, trainable matrices to learn ad…

  19. TOOL · CL_24634 ·

    LoRA fine-tuning explained with matrix-level detail

    This article provides a detailed, number-by-number explanation of how LoRA (Low-Rank Adaptation) works for fine-tuning large language models. It aims to go beyond simply stating what LoRA achieves and instead illustrate…

  20. TOOL · CL_24209 ·

    LoRA Explained: Mathematical Intuition Behind Low-Rank Adaptation

    This article delves into the mathematical underpinnings of Low-Rank Adaptation (LoRA), a technique used for efficient fine-tuning of large language models. It explains how LoRA leverages the concept of low intrinsic dim…