实体 generative pre-trained transformer

generative pre-trained transformer

PulseAugur coverage of generative pre-trained transformer — every cluster mentioning generative pre-trained transformer across labs, papers, and developer communities, ranked by signal.

Show in brief

总计 · 30天

90 天内 74

发布 · 30天

90 天内 0

论文 · 30天

90 天内 31

层级分布 · 90 天

significant 3
research 15
tool 42
commentary 10
meme 4

关系

instance of Llama 90%
instance of Royal Galician Academy 90%
instance of Roon 90%
instance of Qwen3.7 Max 90%
affiliated with Qwen3.7 Max 70%
used by Ollama 70%
competes with Llama 50%

情绪 · 30 天

15 天有情绪数据

最近 · 第 4/4 页 · 共 74 条

TOOL · CL_21641 · Apr 25 · 23:04

Cursor IDE experiences UI bug with hidden model selection menu

A user reported a bug in the Cursor IDE where the model selection menu becomes hidden or cut off when the mouse hovers over it. This issue affects the visibility and selection of GPT models, regardless of whether the Cu…
RESEARCH · CL_03169 · Apr 25 · 19:30

Prompt engineering projects surge with focus on AI coding agents and image generation

This week's prompt engineering landscape shows a significant increase in interest surrounding AI coding assistants and multimodal prompting techniques. Developers are actively exploring repositories focused on optimizin…
RESEARCH · CL_03583 · Apr 25 · 18:58

OpenAI's history of model releases visualized in new chart

A visual timeline details the progression of OpenAI's model releases, starting from their initial GPT models and extending to more recent iterations. The graphic illustrates the increasing frequency and complexity of mo…
RESEARCH · CL_03546 · Apr 24 · 11:05

New Rose optimizer offers low VRAM, fast convergence, and great results

A new PyTorch optimizer named Rose has been released under the Apache 2.0 license. Developed by Matthew K., Rose is designed to be stateless, offering significantly lower VRAM usage compared to optimizers like AdamW, wi…
RESEARCH · CL_02956 · Apr 23 · 14:08

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

Researchers have developed a new defense mechanism called Tail-risk Intrinsic Geometric Smoothing (TIGS) to protect large language models from backdoor attacks. TIGS operates during inference without requiring model upd…
RESEARCH · CL_03702 · Apr 22 · 18:15

Perplexity details research on SFT+RL pipeline for accurate, efficient AI answers

Perplexity has detailed its proprietary post-training pipeline that enhances base models for search-augmented question answering. This process involves initial fine-tuning for instruction following and safety, followed …
TOOL · CL_17648 · Feb 26 · 02:19

Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub

OpenSwarm is a new command-line interface tool designed to orchestrate multiple AI agents for autonomous code-related tasks. It can integrate with various AI models, including Anthropic's Claude, OpenAI's GPT and Codex,…
RESEARCH · CL_01207 · Oct 16 · 00:00

Google Cloud C4, Intel, and Hugging Face partner for 70% TCO improvement on GPT OSS

Google Cloud's C4 platform, in collaboration with Intel and Hugging Face, has achieved a significant total cost of ownership (TCO) improvement of 70% for running open-source GPT models. This optimization is realized thr…
RESEARCH · CL_33152 · Sep 10 · 12:58

Offtoco — count GPT, Claude and Gemini tokens offline for web/CLI/desktop

New research highlights the limitations of current large language models in understanding complex human narratives and social situations. A benchmark called LitVISTA reveals that models like GPT, Claude, and Gemini stru…
TOOL · CL_47741 · Mar 9 · 01:05

Replit推出MCP以连接AI模型与外部工具

Replit推出了模型上下文协议（MCP），这是一个旨在使AI模型能够连接到外部数据源和工具的新标准。该协议充当通用连接器，允许AI模型访问其初始训练数据之外的信息并执行操作，类似于USB-C如何实现各种设备的连接。MCP采用客户端-服务器架构，其中客户端发起请求，通信层定义协议，服务器提供对数据库、Web服务和文件等资源的访问。这种标准化旨在简化集成，允许更轻松地在AI提供商之间切换，并增强AI应用程序的安全性。
COMMENTARY · CL_17721 · Feb 23 · 14:27

应对破碎的开发文化

一位在AI团队工作的开发者描述了一种功能失调的公司文化，其中工程实践几乎不存在，管理层过度依赖AI炒作。这位开发者自学了各种AI和开发技能，目前正在寻找全职的FOSS职位。另一篇文章详细介绍了如何使用FastAPI、React和Docker为忠诚度计划构建一个分析和推荐仪表板。
FRONTIER RELEASE · CL_02286 · Dec 9 · 00:00

Sora 2 System Card

OpenAI has released Sora 2, an advanced video and audio generation model that builds upon its predecessor. This new iteration boasts improved physics simulation, enhanced realism, synchronized audio, and greater user co…
RESEARCH · CL_04679 · Jan 7 · 00:00

Eugene Yan curates essential language modeling papers for study groups

Eugene Yan has compiled a reading list of fundamental language modeling papers, intended to facilitate group study sessions. The list includes seminal works like "Attention Is All You Need," "BERT," and "GPT-3," each ac…
RESEARCH · CL_00875 · May 15 · 00:00

RWKV project revives RNNs to challenge Transformer dominance in LLMs

The RWKV (Receptance Weighted Key Value) project introduces a novel architecture that revives Recurrent Neural Networks (RNNs) while incorporating advantages typically found in Transformers. This approach aims to overco…

Cursor IDE experiences UI bug with hidden model selection menu

Prompt engineering projects surge with focus on AI coding agents and image generation

OpenAI's history of model releases visualized in new chart

New Rose optimizer offers low VRAM, fast convergence, and great results

Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

Perplexity details research on SFT+RL pipeline for accurate, efficient AI answers

Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub

Google Cloud C4, Intel, and Hugging Face partner for 70% TCO improvement on GPT OSS

Offtoco — count GPT, Claude and Gemini tokens offline for web/CLI/desktop

Replit推出MCP以连接AI模型与外部工具

应对破碎的开发文化

Sora 2 System Card

Eugene Yan curates essential language modeling papers for study groups

RWKV project revives RNNs to challenge Transformer dominance in LLMs