Llama 3.2 1B
PulseAugur coverage of Llama 3.2 1B — every cluster mentioning Llama 3.2 1B across labs, papers, and developer communities, ranked by signal.
2 天有情绪数据
-
TubiFM unifies streaming discovery with Llama 3.2 1B model
Researchers have developed TubiFM, a new model that unifies item, carousel, and search ranking for streaming platforms. By representing user journeys as a single token sequence called "user stories," TubiFM leverages a …
-
New UPMs enable collaborative AI training without weight extraction
Researchers have introduced Unextractable Protocol Models (UPMs), a new framework for collaborative training and inference of neural networks where individual participants only process subsets of the model. This approac…
-
X-Token method enhances knowledge distillation for mismatched tokenizers
Researchers have developed X-Token, a novel knowledge distillation technique designed to improve student models by learning from teacher models with different tokenizers. The method addresses limitations in existing log…
-
New BCJR-QAT method pushes LLM quantization to 2 bits per weight
Researchers have developed BCJR-QAT, a novel method for quantizing large language models to 2 bits per weight, a significant advancement beyond current post-training quantization techniques. This new approach uses a dif…
-
New FPO method prevents alignment collapse in iterative RLHF models
Researchers have identified a phenomenon called alignment collapse in iterative Reinforcement Learning from Human Feedback (RLHF). This occurs when the AI policy exploits weaknesses in the reward model it is trained on,…
-
Together AI releases Mamba-3, prioritizing inference speed over training
Together AI has released Mamba-3, a new state space model (SSM) prioritizing inference efficiency over training speed. This model features a more expressive recurrence formula, complex-valued state tracking, and a multi…