PulseAugur
EN
LIVE 07:17:27
ENTITY knowledge distillation

knowledge distillation

PulseAugur coverage of knowledge distillation — every cluster mentioning knowledge distillation across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
29
29 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
28
28 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

13 day(s) with sentiment data

RECENT · PAGE 1/2 · 29 TOTAL
  1. COMMENTARY · CL_113656 ·

    Model distillation attacks pose growing AI security threat

    Model distillation attacks, where a smaller model learns from a larger one's outputs, pose an under-recognized security threat to AI systems. These attacks can bypass safety alignments, leading to models that generate h…

  2. RESEARCH · CL_109613 ·

    New IF-Beta framework streamlines knowledge distillation with data pruning

    Researchers have developed IF-Beta, a novel framework for efficient knowledge distillation that utilizes learnable data pruning. This method combines influence functions with a Beta distribution-parameterized sampling p…

  3. RESEARCH · CL_105088 ·

    Knowledge distillation outperforms SFT in low-data LLM training

    A new paper explores knowledge distillation (KD) for post-training large language models (LLMs), finding it outperforms supervised fine-tuning (SFT) in low-data scenarios. The effectiveness of KD diminishes as more data…

  4. TOOL · CL_113322 ·

    Hugging Face paper reveals "subliminal learning" in LLMs, impacting auditability

    A new paper from Hugging Face explores the concept of "subliminal learning" in language models, where a student model can inherit hidden traits from a teacher model through distillation data that doesn't explicitly name…

  5. TOOL · CL_104773 ·

    New TALAS framework improves language model distillation efficiency

    Researchers have introduced TALAS, a novel framework for knowledge distillation in pre-trained language models. TALAS synchronizes hierarchical alignment with advanced optimization techniques to improve efficiency and p…

  6. TOOL · CL_100232 ·

    New LEAP curriculum boosts Vision Transformer distillation efficiency

    Researchers from the University of Oxford have introduced LEAP, a novel training curriculum designed to improve the efficiency of knowledge distillation for Vision Transformers (ViTs). LEAP utilizes a progressive approa…

  7. TOOL · CL_93232 ·

    New knowledge distillation method boosts land-use image classification accuracy

    Researchers have developed an improved knowledge distillation framework to compress deep convolutional neural networks for land-use image classification. This approach uses a teacher-student learning paradigm where a VG…

  8. RESEARCH · CL_84487 ·

    Mixup distillation enhances student model accuracy and calibration

    Researchers have explored the interaction between Knowledge Distillation (KD) and mixup techniques in machine learning, particularly when mixup is applied only during the student model's training. They found that this s…

  9. RESEARCH · CL_79607 ·

    Soft prompt distillation enhances on-device LLM safety

    Researchers have developed a new method for making large language models safer and more efficient for use on devices with limited resources. The technique involves using "soft prompts" combined with distillation to tran…

  10. RESEARCH · CL_79099 ·

    New research advances flow matching models with theoretical and algorithmic improvements

    Researchers have developed new theoretical foundations and practical algorithms for flow matching models, a type of generative model. One paper establishes convergence guarantees for neural network-parameterized conditi…

  11. TOOL · CL_68471 ·

    New Confusion Distillation method enhances self-distillation in ML

    Researchers have developed a new method called Confusion Distillation (CD) to improve self-distillation in machine learning models. This technique analyzes the feature learning process in student models, revealing that …

  12. TOOL · CL_65555 ·

    New framework unifies knowledge transfer analysis in ML

    Researchers have developed a unified spectral analysis framework to understand knowledge transfer in machine learning, particularly in high-dimensional linear regression. This framework explains how knowledge distillati…

  13. RESEARCH · CL_66251 ·

    New KD method improves dead tree detection in diverse forests

    Researchers have developed a new method for detecting dead trees in aerial imagery using knowledge distillation (KD) to improve model generalization across different forest types. The TreeMort-1T-UNet model, initially t…

  14. TOOL · CL_60427 ·

    NVIDIA's X-Token enables cross-tokenizer knowledge distillation for AI models

    NVIDIA researchers have developed X-Token, a novel method for knowledge distillation that allows smaller AI models to learn from larger, incompatible teacher models. Unlike previous methods that struggle with different …

  15. RESEARCH · CL_62946 ·

    Student capacity and architecture correctness key to knowledge distillation

    A new study published on arXiv investigates the effectiveness of knowledge distillation (KD) in ResNet models for image classification on CIFAR-10. The research found that a student model's capacity significantly impact…

  16. RESEARCH · CL_58708 ·

    LoopFM framework enhances foundation model knowledge transfer for recommendation systems

    Researchers have developed LoopFM, a novel framework designed to improve knowledge transfer from large foundation models (FMs) to smaller vertical models (VMs). Unlike traditional knowledge distillation, LoopFM structur…

  17. TOOL · CL_51141 ·

    New BackWeak method implants stealthy backdoors in AI model distillation

    Researchers have developed a new method called BackWeak to implant backdoors into knowledge distillation processes. This technique uses subtle, imperceptible triggers and simple fine-tuning of teacher models. The backdo…

  18. RESEARCH · CL_51269 ·

    Lightweight Bangla Medical Entity Recognition Framework Developed

    Researchers have developed a new, lightweight framework for Bangla medical entity recognition designed for resource-constrained environments. The system utilizes a hybrid Transformer-CRF architecture, starting with a 12…

  19. TOOL · CL_45378 ·

    Research tackles knowledge distillation attacks with adaptive defenses

    A research paper explores knowledge distillation attacks and defenses, proposing efficient methods to counter adaptive attacks. This work is particularly useful for teams focused on the security and robustness of distil…

  20. TOOL · CL_44697 ·

    New CIST technique enhances knowledge distillation with adaptive temperatures

    Researchers have developed a new knowledge distillation technique called CIST, which addresses the limitations of fixed temperature scaling in transferring knowledge from teacher to student models. CIST assigns separate…