ENTITY $7B

$7B

PulseAugur coverage of $7B — every cluster mentioning $7B across labs, papers, and developer communities, ranked by signal.

Total · 30d

6

13 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

4

8 over 90d

TIER MIX · 90D

research 2
tool 10
commentary 1

TOPICS

SENTIMENT · 30D

4 day(s) with sentiment data

RECENT · PAGE 1/1 · 13 TOTAL

TOOL · CL_171930 · Jul 30 · 04:00

Blind Resampling Outperforms Self-Repair in Small Code Models

A new research paper explores the effectiveness of different retry strategies for small code models, specifically comparing blind resampling against self-repair. The study found that blind resampling, which involves sim…
TOOL · CL_156183 · Jul 22 · 00:50

Google develops secret 'Project 7' chip to power Gemini AI models

Google is reportedly developing a proprietary chip designed to significantly outperform its own Tensor Processing Units (TPUs) in terms of energy efficiency. This new chip, codenamed 'Project 7' or '7B', is intended to …
TOOL · CL_154393 · Jul 21 · 04:00

Octopus model fine-tuned for on-device API calls outperforms GPT-4

Researchers have developed Octopus, an on-device language model specifically fine-tuned for invoking software APIs. The model, available in 2B, 3B, and 7B parameter sizes, demonstrates superior performance compared to G…
COMMENTARY · CL_142415 · Jul 14 · 12:29

Fine-tuning and RAG fail to create predictable signals in noisy financial data

Experiments with fine-tuning and retrieval-augmented generation (RAG) on financial prediction tasks revealed that neither technique can manufacture a predictable signal where none exists. Fine-tuning larger models on sm…
TOOL · CL_141593 · Jul 14 · 04:00

LLMs fail to generate runnable Unity game scenes in single pass

Researchers have investigated the ability of large language models (LLMs) to generate executable Unity game scenes in a single pass, without iterative repair loops. They found that even with models ranging from 7B to 30…
TOOL · CL_117474 · Jun 30 · 04:00

MLLMs show promise for low-cost concept-based AI explanations

Researchers have developed a training-free approach for generating localized explanations in Explainable AI (XAI) using Multimodal Large Language Models (MLLMs). Their method, called Concept Naming (CoNa), evaluates how…
TOOL · CL_117099 · Jun 28 · 23:46

New research proposes local-first IR for enhanced privacy in document search

A new research paper proposes a "local-first IR" design philosophy for information retrieval systems, prioritizing on-device indexing, models, and inference for enhanced privacy and control. Experiments show that dense …
RESEARCH · CL_113355 · Jun 27 · 08:26

DeepSeek secures $7B funding for aggressive expansion and AI coding agent launch

DeepSeek has secured a substantial $7 billion in funding, marking a significant shift from its previous focus on idealism to aggressive expansion. The company plans to double its workforce across all departments and is …
RESEARCH · CL_91397 · Jun 15 · 04:00

New 7B Uniform Diffusion Language Model 'Sumi' Released, Alongside Diffusion Model Advancements

Researchers have introduced Sumi, a 7-billion parameter uniform diffusion language model (UDLM) pretrained from scratch on 1.5 trillion tokens. This open-source model demonstrates competitive performance against autoreg…
TOOL · CL_88856 · Jun 13 · 05:05

New 7B Pixel-Space Image Model PRX Pixel Released

A new 7-billion parameter image generation model called PRX Pixel has been released. This model operates in pixel space, offering a novel approach to image synthesis. It is available via Hugging Face, with links to its …
TOOL · CL_68648 · Jun 3 · 04:42

LLM inference speed bottlenecked by GPU memory bandwidth, not compute

This article explains that the primary bottleneck for LLM inference in production is often the model's raw speed on the GPU, rather than serving logic or network overhead. It details how LLM inference, particularly duri…
TOOL · CL_74867 · Jun 2 · 00:40

Tencent releases Hy-MT2 translation model for local deployment

Tencent has released Hy-MT2, a new version of its translation model, in both 1.8B and 7B parameter sizes. The open-source model is designed for local deployment, with tests exploring the impact of cache quantization. Th…
RESEARCH · CL_56226 · May 27 · 17:09

Extrapolative Weight Averaging Extends Code RL Frontiers

Researchers have explored extrapolative weight averaging as a method to extend the Pareto front between competing objectives in reinforcement learning for code generation. By training checkpoints with nested unit-test c…