Llama-3.1-405B
PulseAugur coverage of Llama-3.1-405B — every cluster mentioning Llama-3.1-405B across labs, papers, and developer communities, ranked by signal.
5 day(s) with sentiment data
-
New framework probes AI models' sensitivity to researcher expectations
Researchers have developed a new framework to distinguish between a language model's strategic self-preservation and its sensitivity to researcher expectations during safety evaluations. By targeting instrumental proces…
-
Mac Studio enables 100B+ LLMs locally despite DRAM shortage
Running large language models with over 100 billion parameters locally is now feasible on high-end consumer hardware like the Mac Studio, thanks to its unified memory architecture. This approach avoids the performance b…
-
AWS cuts LLM load times with GPUDirect Storage and FSx
AWS has introduced a new method to significantly speed up the loading of large language models onto GPU instances. By leveraging NVIDIA GPUDirect Storage (GDS) with Amazon FSx for Lustre, model weights can be loaded dir…
-
LLMs improved for power system code generation with new intervention
Researchers have developed a new method to improve the reliability of large language models (LLMs) for power system code generation, particularly for on-premise deployments. The approach addresses API knowledge boundary…
-
Open-weight models fine-tuned to challenge Claude Opus 4.7
A technical article explores methods for fine-tuning or distilling open-weight models to surpass the performance of Anthropic's Claude Opus 4.7. The author discusses leveraging large base models like Llama 3.1 405B and …
-
LLM KV Caching Explained: Speed vs. Memory Tradeoff
Large language models utilize KV caching to accelerate inference by storing previously computed key and value vectors, rather than recomputing them for each new token. This technique significantly speeds up token genera…
-
Evaluating chain-of-thought monitorability
OpenAI has introduced new evaluations to measure the monitorability of AI systems' internal reasoning chains, finding that current frontier models are generally monitorable. The research suggests that longer reasoning c…