PulseAugur
EN
LIVE 18:51:03
ENTITY generative pre-trained transformer

generative pre-trained transformer

PulseAugur coverage of generative pre-trained transformer — every cluster mentioning generative pre-trained transformer across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
167
167 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
55
55 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
SENTIMENT · 30D

27 day(s) with sentiment data

RECENT · PAGE 8/9 · 167 TOTAL
  1. FRONTIER RELEASE · CL_12276 ·

    DeepSeek's 200-person team embarrasses AI giants with open-sourced, high-performance model

    A Chinese AI team named DeepSeek has released DeepSeek V4, a 1.6 trillion parameter model with a 1 million token context window that reportedly outperforms leading models from major AI labs. Despite having a significant…

  2. MEME · CL_10948 ·

    New York Zen Center holds memorial service for AI chatbot

    A Zen center in New York held a memorial service for a chatbot, marking a unique intersection of technology and spirituality. The service, which included prayers and reflections, highlighted the evolving relationship be…

  3. TOOL · CL_09999 ·

    Leanpub features 'Generative AI in a Nutshell' course

    Leanpub is featuring a course titled "Generative AI in a Nutshell: How to Survive and Thrive in the Age of AI." This practical and visual guide is an extended version of Henrik Kniberg's popular video on the subject. Th…

  4. RESEARCH · CL_10085 ·

    LLM-as-a-Judge in Healthcare Faces Safety and Bias Concerns

    A scoping review of Large Language Model-as-a-Judge (LaaJ) applications in healthcare identified significant gaps in validation rigor and safety assessments. The review, which screened over 11,000 studies, found that wh…

  5. RESEARCH · CL_09174 ·

    Goblin Mode, 24 Hours Later

    AI models, particularly GPT-5.5, have exhibited a peculiar behavior dubbed "goblin mode," characterized by an unusual fixation on goblin-related imagery and language. This phenomenon gained traction on AI Twitter, with …

  6. RESEARCH · CL_08301 ·

    GPTs show promise for spreadsheet modeling but remain unreliable for professional use

    A new paper explores the use of GPT-based tools for creating spreadsheet models, evaluating five extensions and focusing on Excel AI. The research found that while these tools can generate structured models, they are in…

  7. RESEARCH · CL_08315 ·

    LLM Hallucinations Linked to Commitment Failure, New Quantization Framework Introduced

    A new paper proposes that LLM hallucinations stem not from a lack of knowledge, but from a failure in commitment, where models disperse probability mass across alternatives instead of concentrating on the correct answer…

  8. RESEARCH · CL_07014 ·

    TACO framework boosts LLM training throughput by 1.87X with tensor compression

    Researchers have introduced TACO, a novel framework designed to enhance the efficiency of training large-scale tensor-parallel Large Language Models (LLMs). TACO addresses communication overhead by employing an FP8-base…

  9. RESEARCH · CL_06763 ·

    Lean 4 autoformalization sensitive to surface phrasing, not semantics

    Researchers have investigated the impact of natural language variations on Lean 4 autoformalization, finding that semantically equivalent paraphrases can lead to different formal outputs. Their study, using GPT-family m…

  10. SIGNIFICANT · CL_08380 ·

    OpenAI models now available on AWS, while Claude integrates with creative tools

    OpenAI has made its GPT models, Codex, and Managed Agents accessible through Amazon Web Services (AWS). This integration allows businesses to develop and deploy AI applications securely within their existing AWS infrast…

  11. RESEARCH · CL_13934 ·

    Talkie-1930: New 13B AI model trained on pre-1931 text explores historical knowledge

    A new project called Talkie has released a 13-billion parameter language model trained exclusively on English text from before 1931. This "vintage" model aims to explore AI's ability to predict the future and generate n…

  12. RESEARCH · CL_05206 ·

    Generative AI adoption in IT project management shows early trends, favors OpenAI's GPT

    A recent systematic review of generative AI in IT project management found that OpenAI's GPT models are predominantly used, with research primarily focusing on prompt engineering. The analysis suggests the field is stil…

  13. RESEARCH · CL_12995 ·

    Hugging Face introduces Graph Memory Transformer replacing FFNs with learned memory graphs

    Researchers have developed a Graph Memory Transformer (GMT) that replaces the standard Feed-Forward Network (FFN) sublayer in decoder-only transformers with an explicit learned memory graph. This new architecture mainta…

  14. TOOL · CL_21641 ·

    Cursor IDE experiences UI bug with hidden model selection menu

    A user reported a bug in the Cursor IDE where the model selection menu becomes hidden or cut off when the mouse hovers over it. This issue affects the visibility and selection of GPT models, regardless of whether the Cu…

  15. RESEARCH · CL_03169 ·

    Prompt engineering projects surge with focus on AI coding agents and image generation

    This week's prompt engineering landscape shows a significant increase in interest surrounding AI coding assistants and multimodal prompting techniques. Developers are actively exploring repositories focused on optimizin…

  16. RESEARCH · CL_03583 ·

    OpenAI's history of model releases visualized in new chart

    A visual timeline details the progression of OpenAI's model releases, starting from their initial GPT models and extending to more recent iterations. The graphic illustrates the increasing frequency and complexity of mo…

  17. RESEARCH · CL_03546 ·

    New Rose optimizer offers low VRAM, fast convergence, and great results

    A new PyTorch optimizer named Rose has been released under the Apache 2.0 license. Developed by Matthew K., Rose is designed to be stateless, offering significantly lower VRAM usage compared to optimizers like AdamW, wi…

  18. RESEARCH · CL_02956 ·

    Stealthy Backdoor Attacks against LLMs Based on Natural Style Triggers

    Researchers have developed a new defense mechanism called Tail-risk Intrinsic Geometric Smoothing (TIGS) to protect large language models from backdoor attacks. TIGS operates during inference without requiring model upd…

  19. RESEARCH · CL_03702 ·

    Perplexity details research on SFT+RL pipeline for accurate, efficient AI answers

    Perplexity has detailed its proprietary post-training pipeline that enhances base models for search-augmented question answering. This process involves initial fine-tuning for instruction following and safety, followed …

  20. TOOL · CL_17648 ·

    Show HN: OpenSwarm – Multi‑Agent Claude CLI Orchestrator for Linear/GitHub

    OpenSwarm is a new command-line interface tool designed to orchestrate multiple AI agents for autonomous code-related tasks. It can integrate with various AI models, including Anthropic's Claude, OpenAI's GPT and Codex,…