generative pre-trained transformer
PulseAugur coverage of generative pre-trained transformer — every cluster mentioning generative pre-trained transformer across labs, papers, and developer communities, ranked by signal.
-
Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub
OpenSwarm is a new command-line interface tool designed to orchestrate multiple AI agents for autonomous code-related tasks. It can integrate with various AI models, including Anthropic's Claude, OpenAI's GPT and Codex,…
-
Google Cloud C4, Intel, and Hugging Face partner for 70% TCO improvement on GPT OSS
Google Cloud's C4 platform, working with Intel and Hugging Face, has achieved a 70% improvement in total cost of ownership (TCO) for running open-source GPT models. This optimization is realized thr…
-
Offtoco — count GPT, Claude and Gemini tokens offline for web/CLI/desktop
New research highlights the limitations of current large language models in understanding complex human narratives and social situations. A benchmark called LitVISTA reveals that models like GPT, Claude, and Gemini stru…
-
Replit launches MCP to connect AI models with external tools
Replit has introduced the Model Context Protocol (MCP), a new standard designed to enable AI models to connect with external data sources and tools. This protocol acts as a universal connector, allowing AI models to acc…
-
Navigating a Broken Dev Culture
A developer working on an AI team describes a dysfunctional corporate culture with nonexistent engineering practices, where management is overly reliant on AI hype. The developer, who has self-taught various AI and deve…
-
Sora 2 System Card
OpenAI has released Sora 2, an advanced video and audio generation model that builds upon its predecessor. This new iteration boasts improved physics simulation, enhanced realism, synchronized audio, and greater user co…
-
Eugene Yan curates essential language modeling papers for study groups
Eugene Yan has compiled a reading list of fundamental language modeling papers, intended to facilitate group study sessions. The list includes seminal works like "Attention Is All You Need," "BERT," and "GPT-3," each ac…
-
RWKV project revives RNNs to challenge Transformer dominance in LLMs
The RWKV (Receptance Weighted Key Value) project introduces a novel architecture that revives Recurrent Neural Networks (RNNs) while incorporating advantages typically found in Transformers. This approach aims to overco…