PulseAugur
EN
LIVE 21:31:38
ENTITY graphics processing unit

graphics processing unit

PulseAugur coverage of graphics processing unit — every cluster mentioning graphics processing unit across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
224
224 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
69
69 over 90d
TIER MIX · 90D
TOPICS
RELATIONSHIPS
SENTIMENT · 30D

29 day(s) with sentiment data

RECENT · PAGE 10/10 · 200 TOTAL
  1. SIGNIFICANT · CL_11581 ·

    Datavault AI raises $120M to build nationwide GPU network for AI compute

    Datavault AI has secured $120 million in funding from Scilex Holding to establish a nationwide GPU network. This initiative aims to provide increased computing power for companies engaged in artificial intelligence deve…

  2. RESEARCH · CL_11925 ·

    FluxMoE system decouples expert weights for faster LLM serving

    Researchers have developed FluxMoE, a new system designed to improve the efficiency of serving Mixture-of-Experts (MoE) models. FluxMoE addresses the challenge of large parameter sizes in MoE models by decoupling expert…

  3. RESEARCH · CL_11722 ·

    RoundPipe enables efficient LLM fine-tuning on consumer GPUs

    Researchers have developed RoundPipe, a new pipeline scheduling method designed to make fine-tuning large language models on consumer-grade GPUs more efficient. This approach addresses the limitations of existing method…

  4. RESEARCH · CL_14183 ·

    Study finds switchless networks more cost-effective for MoE LLM serving

    A new paper analyzes network topologies for Mixture-of-Experts (MoE) Large Language Model (LLM) serving, finding that lower-cost, switchless networks can be more cost-effective than expensive scale-up infrastructures. T…

  5. RESEARCH · CL_14104 ·

    VkSplat pipeline boosts 3D Gaussian Splatting training with Vulkan compute

    Researchers have developed VkSplat, a novel training pipeline for 3D Gaussian Splatting (3DGS) that utilizes Vulkan compute for enhanced performance and broader compatibility. This new approach offers a significant spee…

  6. RESEARCH · CL_14105 ·

    Researchers combine DPUs and GPUs for faster neural network inference

    Researchers have developed a novel method for accelerating neural network inference by splitting Convolutional Neural Network (CNN) computations between Deep Learning Processing Units (DPUs) and Graphics Processing Unit…

  7. MEME · CL_10938 ·

    AI tools read code, not minds; Chinese GPU maker revenue hits $423M

    A Chinese GPU maker, Cambricon, reported its first-quarter revenue at $423 million. Separately, a blog post discusses how AI tools can read code but not minds, and another mentions AI breaking Silicon Valley's global pl…

  8. RESEARCH · CL_11513 ·

    Strait system enhances ML inference serving with priority-aware scheduling

    Researchers have developed Strait, a new system designed to improve the efficiency of machine learning inference serving, particularly in on-premises environments. Strait addresses limitations in task prioritization and…

  9. COMMENTARY · CL_23141 ·

    China Dominates Critical Minerals for AI Supply Chain

    Six critical chokepoints in the AI supply chain, from raw materials to finished chips, are dominated by China. The country processes 90% of rare earths, highlighting its significant control over the production of GPUs, …

  10. SIGNIFICANT · CL_09991 ·

    T-Head unveils Panmai 920 smartNIC, completing its AI infrastructure chip lineup

    Pingtan, a subsidiary of Alibaba, has launched its first intelligent network card, the "Panmai 920," designed to address bottlenecks in AI computing infrastructure. This new network card utilizes advanced PCIe 5.0 and 1…

  11. SIGNIFICANT · CL_09985 ·

    Google to sell its TPUs to some customers, who also fancy big-G GPUs

    Alphabet announced a significant increase in its 2026 capital expenditure guidance, raising it to $180-$190 billion, driven by unprecedented demand for AI computing resources. The company's CFO highlighted strong growth…

  12. RESEARCH · CL_09247 ·

    Visual explainers detail GPU's AI role and embedding vector meaning

    A visual explainer details why Graphics Processing Units (GPUs) are highly effective for artificial intelligence tasks, highlighting their strengths in matrix multiplication, parallel processing, memory bandwidth, and b…

  13. RESEARCH · CL_09880 ·

    FloatSOM framework accelerates distributed Self-Organizing Maps with flexible topologies

    Researchers have developed FloatSOM, a new framework designed for large-scale Self-Organizing Map (SOM) analysis that overcomes memory limitations on GPUs. This framework enables multi-GPU execution and supports out-of-…

  14. COMMENTARY · CL_08729 ·

    GPU firmware lags behind hardware, throttling AI workloads

    The article argues that current GPU firmware is outdated, relying on early 2000s logic to manage modern AI workloads. This outdated firmware is identified as a bottleneck, potentially throttling the performance of advan…

  15. SIGNIFICANT · CL_08093 ·

    GPU shortage becomes AI's biggest bottleneck, spurring efficiency focus

    The escalating demand for Graphics Processing Units (GPUs) has become the primary constraint for the advancement of artificial intelligence. In response, organizations are increasingly adopting strategies focused on dev…

  16. COMMENTARY · CL_17320 ·

    AI era demands flexible data center investments, moving beyond old refresh cycles

    The AI era is forcing a significant shift in data center infrastructure investments, moving away from traditional refresh cycles. Companies are now navigating multiple, often misaligned, technology lifecycles for comput…

  17. RESEARCH · CL_07820 ·

    Stanford researchers develop new hardware to efficiently process sparse AI models

    Researchers at Stanford University have developed a novel hardware chip designed to efficiently process sparse AI models. Sparsity, where most AI model parameters are zero, offers significant computational savings but i…

  18. RESEARCH · CL_08328 ·

    AHASD architecture boosts LLM speculative decoding on mobile devices

    Researchers have developed AHASD, a novel asynchronous heterogeneous architecture designed to optimize large language model (LLM) inference on mobile devices. This architecture employs task-level decoupling for parallel…

  19. RESEARCH · CL_07203 ·

    DeepSeek V4 prioritizes batch invariance, sacrificing GPU efficiency for stability

    DeepSeek V4's technical report reveals a core design choice of "batch invariance" to ensure consistent outputs across different batch configurations and processing pipelines. This feature is crucial for maintaining repr…

  20. RESEARCH · CL_07063 ·

    New GPU framework accelerates quantum state calculations for complex systems

    Researchers have developed QiankunNet-cuSCI, a novel framework that fully accelerates the NNQS-SCI method for solving complex quantum systems using GPUs. This new approach addresses the scalability limitations of previo…