Llama
PulseAugur coverage of Llama — every cluster mentioning Llama across labs, papers, and developer communities, ranked by signal.
26 day(s) with sentiment data
-
Guide details choosing open-source AI models for production
Choosing the right open-source AI model for production requires careful consideration of factors like transparency, adaptability, and control. While proprietary models offer tiered options, open models allow for deeper …
-
NVIDIA Nemotron Diffusion models offer 6.4x faster AI inference
NVIDIA has released the Nemotron-Labs Diffusion family of language models, available in 3B, 8B, and 14B parameter sizes. These models uniquely support autoregressive (AR), diffusion, and self-speculation decoding modes …
-
Together AI deploys 100,000 GPUs in Europe via Hypertec, 5C
Together AI is significantly expanding its infrastructure in Europe through a partnership with Hypertec and 5C Group. This initiative aims to provide up to 2 gigawatts of AI-dedicated data center capacity and nearly 100…
-
New research tackles LLM evaluation, training, and inference efficiency
Researchers are developing new methods to improve the evaluation and training of large language models (LLMs). One approach, SCOPE, calibrates LLM judges to ensure reliable pairwise evaluations with controlled error rat…
-
Meta launches Llama Startup Program offering funding and support for AI builders
Meta has launched the Llama Startup Program to support early-stage companies building generative AI applications with its Llama models. The initiative offers financial reimbursements of up to $6,000 per month for six mo…
-
Tinfoil launches cloud AI service with verifiable privacy using secure enclaves
Tinfoil, a startup founded by researchers from MIT and Cloudflare, has launched a new service designed to provide verifiable privacy for AI workloads hosted in the cloud. The platform utilizes secure enclave technology,…
-
Together AI launches platform for continuous LLM fine-tuning
Together AI has launched a new fine-tuning platform that allows users to continuously improve open-weight language models. The platform now supports preference optimization and continued training, enabling models to ada…
-
Eugene Yan curates essential language modeling papers for study groups
Eugene Yan has compiled a reading list of fundamental language modeling papers, intended to facilitate group study sessions. The list includes seminal works like "Attention Is All You Need," "BERT," and "GPT-3," each ac…
-
New research tackles LLM hallucinations with novel methods and benchmarks
Multiple research papers released on arXiv address the challenge of hallucinations in large language and vision-language models. One paper introduces In-Context Visual Contrastive Optimization (IC-VCO) to mitigate multi…
-
Meta's Llama 2 overtakes open LLM leaderboards, enables commercial use
Meta has released Llama 2, an open-source large language model that has quickly become the state-of-the-art in its weight class, outperforming other open models. The model was pre-trained on 2 trillion tokens with an ex…
-
George Hotz's tiny corp unveils $15K AI computer and RISC-based tinygrad framework
George Hotz's company, tiny corp, has launched the tinybox, a $15,000 personal AI computer designed for local model training and inference. The tinybox boasts 738 FP16 TFLOPS and 144 GB of GPU RAM, capable of running a …
-
Safetensors library audited as secure, set to become default for ML models
The safetensors library, developed by Hugging Face in collaboration with EleutherAI and Stability AI, has undergone a security audit by Trail of Bits, confirming its safety. This audit allows the organizations to move t…