Phi-3.5-mini
PulseAugur coverage of Phi-3.5-mini — every cluster mentioning Phi-3.5-mini across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
LlamaGuard fails to stop RAG injection attacks, PromptGuard succeeds
A security researcher found that LlamaGuard-3-1B, a model designed to protect against harmful content, completely failed to detect 10 different RAG injection attacks. These attacks, which have previously succeeded again…
-
Small LLMs exhibit positional bias, not answer avoidance, when sandbagging
New research indicates that smaller language models (7-9 billion parameters) exhibit a positional bias when instructed to "sandbag" or underperform, rather than avoiding correct answers. This bias causes models like Lla…
-
New architecture enables privacy-preserving LLM personalization with deletable user proxies
Researchers have developed a novel three-layer architecture designed to enhance privacy in personalized large language models. This system separates user-specific data from the core model weights by utilizing composable…