ENTITY generative pre-trained transformer

generative pre-trained transformer

PulseAugur coverage of generative pre-trained transformer — every cluster mentioning generative pre-trained transformer across labs, papers, and developer communities, ranked by signal.

Show in brief

Total · 30d

167

167 over 90d

Releases · 30d

0 over 90d

Papers · 30d

55 over 90d

TIER MIX · 90D

frontier release 1
significant 3
research 34
tool 83
commentary 39
meme 7

TOPICS

product 91
other 63
paper 55
model release 41
opinion 27
infra 26
safety 21
policy 8

RELATIONSHIPS

instance of Llama 90%
instance of Qwen3.7 Max 90%
instance of large-language models 90%
instance of Royal Galician Academy 90%
instance of Roon 90%
used by Ollama 70%
affiliated with Qwen3.7 Max 70%
competes with Llama 50%

SENTIMENT · 30D

27 day(s) with sentiment data

RECENT · PAGE 8/9 · 167 TOTAL

FRONTIER RELEASE · CL_12276 · May 1 · 14:16

DeepSeek's 200-person team embarrasses AI giants with open-sourced, high-performance model

A Chinese AI team named DeepSeek has released DeepSeek V4, a 1.6 trillion parameter model with a 1 million token context window that reportedly outperforms leading models from major AI labs. Despite having a significant…
MEME · CL_10948 · Apr 30 · 18:42

New York Zen Center holds memorial service for AI chatbot

A Zen center in New York held a memorial service for a chatbot, marking a unique intersection of technology and spirituality. The service, which included prayers and reflections, highlighted the evolving relationship be…
TOOL · CL_09999 · Apr 30 · 04:15

Leanpub features 'Generative AI in a Nutshell' course

Leanpub is featuring a course titled "Generative AI in a Nutshell: How to Survive and Thrive in the Age of AI." This practical and visual guide is an extended version of Henrik Kniberg's popular video on the subject. Th…
RESEARCH · CL_10085 · Apr 30 · 04:00

LLM-as-a-Judge in Healthcare Faces Safety and Bias Concerns

A scoping review of Large Language Model-as-a-Judge (LaaJ) applications in healthcare identified significant gaps in validation rigor and safety assessments. The review, which screened over 11,000 studies, found that wh…
RESEARCH · CL_09174 · Apr 29 · 12:19

Goblin Mode, 24 Hours Later

AI models, particularly GPT-5.5, have exhibited a peculiar behavior dubbed "goblin mode," characterized by an unusual fixation on goblin-related imagery and language. This phenomenon gained traction on AI Twitter, with …
RESEARCH · CL_08301 · Apr 28 · 14:19

GPTs show promise for spreadsheet modeling but remain unreliable for professional use

A new paper explores the use of GPT-based tools for creating spreadsheet models, evaluating five extensions and focusing on Excel AI. The research found that while these tools can generate structured models, they are in…
RESEARCH · CL_08315 · Apr 28 · 10:23

LLM Hallucinations Linked to Commitment Failure, New Quantization Framework Introduced

A new paper proposes that LLM hallucinations stem not from a lack of knowledge, but from a failure in commitment, where models disperse probability mass across alternatives instead of concentrating on the correct answer…
RESEARCH · CL_07014 · Apr 28 · 04:00

TACO framework boosts LLM training throughput by 1.87X with tensor compression

Researchers have introduced TACO, a novel framework designed to enhance the efficiency of training large-scale tensor-parallel Large Language Models (LLMs). TACO addresses communication overhead by employing an FP8-base…
RESEARCH · CL_06763 · Apr 28 · 04:00

Lean 4 autoformalization sensitive to surface phrasing, not semantics

Researchers have investigated the impact of natural language variations on Lean 4 autoformalization, finding that semantically equivalent paraphrases can lead to different formal outputs. Their study, using GPT-family m…
SIGNIFICANT · CL_08380 · Apr 28 · 00:00

OpenAI models now available on AWS, while Claude integrates with creative tools

OpenAI has made its GPT models, Codex, and Managed Agents accessible through Amazon Web Services (AWS). This integration allows businesses to develop and deploy AI applications securely within their existing AWS infrast…
RESEARCH · CL_13934 · Apr 27 · 21:55

Talkie-1930: New 13B AI model trained on pre-1931 text explores historical knowledge

A new project called Talkie has released a 13-billion parameter language model trained exclusively on English text from before 1931. This "vintage" model aims to explore AI's ability to predict the future and generate n…
RESEARCH · CL_05206 · Apr 27 · 04:00

Generative AI adoption in IT project management shows early trends, favors OpenAI's GPT

A recent systematic review of generative AI in IT project management found that OpenAI's GPT models are predominantly used, with research primarily focusing on prompt engineering. The analysis suggests the field is stil…
RESEARCH · CL_12995 · Apr 26 · 20:09

Hugging Face introduces Graph Memory Transformer replacing FFNs with learned memory graphs

Researchers have developed a Graph Memory Transformer (GMT) that replaces the standard Feed-Forward Network (FFN) sublayer in decoder-only transformers with an explicit learned memory graph. This new architecture mainta…
TOOL · CL_21641 · Apr 25 · 23:04

Cursor IDE experiences UI bug with hidden model selection menu

A user reported a bug in the Cursor IDE where the model selection menu becomes hidden or cut off when the mouse hovers over it. This issue affects the visibility and selection of GPT models, regardless of whether the Cu…
RESEARCH · CL_03169 · Apr 25 · 19:30

Prompt engineering projects surge with focus on AI coding agents and image generation

This week's prompt engineering landscape shows a significant increase in interest surrounding AI coding assistants and multimodal prompting techniques. Developers are actively exploring repositories focused on optimizin…
RESEARCH · CL_03583 · Apr 25 · 18:58

OpenAI's history of model releases visualized in new chart

A visual timeline details the progression of OpenAI's model releases, starting from their initial GPT models and extending to more recent iterations. The graphic illustrates the increasing frequency and complexity of mo…
RESEARCH · CL_03546 · Apr 24 · 11:05

New Rose optimizer offers low VRAM, fast convergence, and great results

A new PyTorch optimizer named Rose has been released under the Apache 2.0 license. Developed by Matthew K., Rose is designed to be stateless, offering significantly lower VRAM usage compared to optimizers like AdamW, wi…
RESEARCH · CL_02956 · Apr 23 · 14:08

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

Researchers have developed a new defense mechanism called Tail-risk Intrinsic Geometric Smoothing (TIGS) to protect large language models from backdoor attacks. TIGS operates during inference without requiring model upd…
RESEARCH · CL_03702 · Apr 22 · 18:15

Perplexity details research on SFT+RL pipeline for accurate, efficient AI answers

Perplexity has detailed its proprietary post-training pipeline that enhances base models for search-augmented question answering. This process involves initial fine-tuning for instruction following and safety, followed …
TOOL · CL_17648 · Feb 26 · 02:19

Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub

OpenSwarm is a new command-line interface tool designed to orchestrate multiple AI agents for autonomous code-related tasks. It can integrate with various AI models, including Anthropic's Claude, OpenAI's GPT and Codex,…

DeepSeek's 200-person team embarrasses AI giants with open-sourced, high-performance model

New York Zen Center holds memorial service for AI chatbot

Leanpub features 'Generative AI in a Nutshell' course

LLM-as-a-Judge in Healthcare Faces Safety and Bias Concerns

Goblin Mode, 24 Hours Later

GPTs show promise for spreadsheet modeling but remain unreliable for professional use

LLM Hallucinations Linked to Commitment Failure, New Quantization Framework Introduced

TACO framework boosts LLM training throughput by 1.87X with tensor compression

Lean 4 autoformalization sensitive to surface phrasing, not semantics

OpenAI models now available on AWS, while Claude integrates with creative tools

Talkie-1930: New 13B AI model trained on pre-1931 text explores historical knowledge

Generative AI adoption in IT project management shows early trends, favors OpenAI's GPT

Hugging Face introduces Graph Memory Transformer replacing FFNs with learned memory graphs

Cursor IDE experiences UI bug with hidden model selection menu

Prompt engineering projects surge with focus on AI coding agents and image generation

OpenAI's history of model releases visualized in new chart

New Rose optimizer offers low VRAM, fast convergence, and great results

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

Perplexity details research on SFT+RL pipeline for accurate, efficient AI answers

Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub