OpenAI enhances security, releases prompting guides, partners with Apple

X — OpenAI TIER_1 · OpenAI · 2026-04-30 17:23

Now available for ChatGPT accounts: Advanced Account Security, a new opt-in setting for people at higher risk of digital attacks, with stronger protections incl

Now available for ChatGPT accounts: Advanced Account Security, a new opt-in setting for people at higher risk of digital attacks, with stronger protections including phishing-resistant sign-in and more secure account recovery. https://t.co/KhBGENuXzT

OpenAI News TIER_1 · 2026-04-10 00:00

Prompting fundamentals

Learn prompting fundamentals and how to write clear, effective prompts to get better, more useful responses from ChatGPT.

Google AI / Research TIER_1 · 2026-01-28 11:00

Towards a science of scaling agent systems: When and why agent systems work

Generative AI

OpenAI News TIER_1 · 2026-01-08 00:00

Netomi’s lessons for scaling agentic systems into the enterprise

How Netomi scales enterprise AI agents using GPT-4.1 and GPT-5.2—combining concurrency, governance, and multi-step reasoning for reliable production workflows.

OpenAI News TIER_1 · 2024-06-10 11:55

OpenAI and Apple announce partnership

OpenAI and Apple announce partnership to integrate ChatGPT into Apple experiences.

OpenAI News TIER_1 · 2022-08-24 07:00

Our approach to alignment research

We are improving our AI systems’ ability to learn from human feedback and to assist humans at evaluating AI. Our goal is to build a sufficiently aligned AI system that can help us solve all other alignment problems.

Apple Machine Learning Research TIER_1 · 2026-05-01 00:00

Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents

This paper was accepted at the Fifth Workshop on Natural Language Generation, Evaluation, and Metrics at ACL 2026. Tool-calling agents are evaluated on tool selection, parameter accuracy, and scope recognition, yet LLM trajectory assessments remain inherently post-hoc. Disconnect…

Microsoft Research TIER_1 · Gagan Bansal, Shujaat Mirza, Keegan Hines, Will Epperson, Zachary Huang, Whitney Maxwell, Pete Bryan, Tyler Payne, Adam Fourney, Amanda Swearngin, Wenyue Hua, Tori Westerhoff, Amanda Minnich, Maya Murad, Ece Kamar, Ram Shankar Siva Kumar, Saleema Amershi · 2026-04-30 21:53

Red-teaming a network of agents: Understanding what breaks when AI agents interact at scale

<p>Safe agents don’t guarantee a safe ecosystem of interconnected agents. Microsoft Research examines what breaks when AI agents interact and why network-level risks require new approaches.</p> <p>The post <a href="https://www.microsoft.com/en-us/research/blog/red-teaming-a-netwo…

arXiv cs.AI TIER_1 · Tongliang Liu · 2026-05-14 14:36

Beyond Individual Intelligence: Surveying Collaboration, Failure Attribution, and Self-Evolution in LLM-based Multi-Agent Systems

LLM-based autonomous agents have demonstrated strong capabilities in reasoning, planning, and tool use, yet remain limited when tasks require sustained coordination across roles, tools, and environments. Multi-agent systems address this through structured collaboration among spec…

arXiv cs.LG TIER_1 · Nikolaos Al. Papadopoulos · 2026-05-14 14:23

Temporal Fair Division in Multi-Agent Systems: From Precise Alternation Metrics to Scalable Coordination Proxies

A plethora real-world environments require agents to compete repeatedly for the same limited resource, calling for a temporal notion of fairness judged across entire interaction histories. This paper advances the theory of temporal fair division by introducing Rotational Periodic…

Hugging Face Daily Papers TIER_1 · 2026-05-14 00:11

MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning

Automatic multi-agent systems aim to instantiate agent workflows without relying on manually designed or fixed orchestration. However, existing automatic MAS approaches remain only partially adaptive: they either perform training-free test-time search or optimize the meta-level d…

arXiv cs.LG TIER_1 · S. Pasricha · 2026-05-13 13:20

MARLIN: Multi-Agent Game-Theoretic Reinforcement Learning for Sustainable LLM Inference in Cloud Datacenters

Large Language Models (LLMs) have become increasingly prevalent in cloud-based platforms, propelled by the introduction of AI-based consumer and enterprise services. LLM inference requests in particular account for up to 90% of total LLM lifecycle energy use, dwarfing training en…

arXiv cs.AI TIER_1 · Elias Calboreanu · 2026-05-12 15:39

Iterative Audit Convergence in LLM-Managed Multi-Agent Systems: A Case Study in Prompt Engineering Quality Assurance

Prompt specifications for multi-agent large language model (LLM) systems carry data contracts and integration logic across many interdependent files but are rarely subjected to structured-inspection rigor. This paper reports a single-system empirical case study of iterative, agen…

arXiv cs.CL TIER_1 · Rui Wang · 2026-05-12 09:38

GEAR: Granularity-Adaptive Advantage Reweighting for LLM Agents via Self-Distillation

Reinforcement learning has become a widely used post-training approach for LLM agents, where training commonly relies on outcome-level rewards that provide only coarse supervision. While finer-grained credit assignment is promising for effective policy updates, obtaining reliable…

arXiv cs.AI TIER_1 · Hongyang Chen · 2026-05-11 16:20

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

Current LLM agents are proficient at calling isolated APIs but struggle with the "last mile" of commercial software automation. In real-world scenarios, tools are not independent; they are atomic, interdependent, and prone to environmental noise. We introduce $\textbf{ComplexMCP}…

Hugging Face Daily Papers TIER_1 · 2026-05-11 16:20

ComplexMCP: Evaluation of LLM Agents in Dynamic, Interdependent, and Large-Scale Tool Sandbox

Current LLM agents are proficient at calling isolated APIs but struggle with the "last mile" of commercial software automation. In real-world scenarios, tools are not independent; they are atomic, interdependent, and prone to environmental noise. We introduce $\textbf{ComplexMCP}…

arXiv cs.AI TIER_1 · Jiawei Li · 2026-05-11 14:43

Evolving-RL: End-to-End Optimization of Experience-Driven Self-Evolving Capability within Agents

Experience-driven self-evolving agents aim to overcome the static nature of large language models by distilling reusable experience from past interactions, thus enabling adaptation to novel tasks at deployment time. This process places substantial demands on the foundation model'…

arXiv cs.CL TIER_1 · Cristiano De Nobili · 2026-05-11 13:13

Collective Alignment in LLM Multi-Agent Systems: Disentangling Bias from Cooperation via Statistical Physics

We investigate the emergent collective dynamics of LLM-based multi-agent systems on a 2D square lattice and present a model-agnostic statistical-physics method to disentangle social conformity from intrinsic bias, compute critical exponents, and probe the collective behavior and …

arXiv cs.AI TIER_1 · Zhangchun Zhao · 2026-05-11 06:26

Swarm Skills: A Portable, Self-Evolving Multi-Agent System Specification for Coordination Engineering

As artificial intelligence engineering paradigms shift from single-agent Prompt and Context Engineering toward multi-agent \textbf{Coordination Engineering}, the ability to codify and systematically improve how multiple agents collaborate has emerged as a critical bottleneck. Whi…

arXiv cs.CL TIER_1 · Fuli Feng · 2026-05-10 13:52

TacoMAS: Test-Time Co-Evolution of Topology and Capability in LLM-based Multi-Agent Systems

Multi-agent systems (MAS) have emerged as a promising paradigm for solving complex tasks. Recent work has explored self-evolving MAS that automatically optimize agent capabilities or communication topologies. However, existing methods either learn a topology that remains fixed at…

arXiv cs.CL TIER_1 · Heng Huang · 2026-05-08 17:59

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Test-time scaling (TTS) has become an effective approach for improving large language model performance by allocating additional computation during inference. However, existing TTS strategies are largely hand-crafted: researchers manually design reasoning patterns and tune heuris…

arXiv cs.AI TIER_1 · Vincent Conitzer · 2026-05-08 17:47

The Memory Curse: How Expanded Recall Erodes Cooperative Intent in LLM Agents

Context window expansion is often treated as a straightforward capability upgrade for LLMs, but we find it systematically fails in multi-agent social dilemmas. Across 7 LLMs and 4 games over 500 rounds, expanding accessible history degrades cooperation in 18 of 28 model--game set…

arXiv cs.AI TIER_1 · Xunliang Cai · 2026-05-08 15:59

AgentEscapeBench: Evaluating Out-of-Domain Tool-Grounded Reasoning in LLM Agents

As LLM-based agents increasingly rely on external tools, it is important to evaluate their ability to sustain tool-grounded reasoning beyond familiar workflows and short-range interactions. We introduce AgentEscapeBench, an escape-room-style benchmark that tests whether agents ca…

arXiv cs.LG TIER_1 · İsmail İlkan Ceylan · 2026-05-08 15:06

RelAgent: LLM Agents as Data Scientists for Relational Learning

Relational learning is a challenging problem that has motivated a wide range of approaches, including graph-based models (e.g., graph neural networks, graph transformers), tabular methods (e.g., tabular foundation models), and sequence-based approaches (e.g., large language model…

arXiv cs.AI TIER_1 · Hoki Kim · 2026-05-08 14:57

CyBiasBench: Benchmarking Bias in LLM Agents for Cyber-Attack Scenarios

Large language models (LLMs) are increasingly deployed as autonomous agents in offensive cybersecurity. In this paper, we reveal an interesting phenomenon: different agents exhibit distinct attack patterns. Specifically, each agent exhibits an attack-selection bias, disproportion…

arXiv cs.AI TIER_1 · Keisuke Okumura · 2026-05-08 13:50

Alternating Target-Path Planning for Scalable Multi-Agent Coordination

The concurrent target assignment and pathfinding (TAPF) problem extends multi-agent pathfinding (MAPF) by asking planners to allocate distinct targets and collision-free paths to agents. Prior work on TAPF has relied exclusively on Conflict-Based Search (CBS), which tightly coupl…

arXiv cs.LG TIER_1 · Zheng Zhang, Cuong C. Nguyen, Kevin Wells, Gustavo Carneiro · 2026-05-08 04:00

Multi-agent decision making: A Blackwell's informativeness approach

arXiv:2605.06028v1 Announce Type: new Abstract: The rapid development of large language models (LLMs) has motivated research on decision-making in multi-agent systems, where multiple agents collaborate to achieve shared objectives. Existing aggregation approaches, such as voting …

arXiv cs.LG TIER_1 · Huchen Yang, Xinghao Dong, Dan Negrut, Jin-Long Wu · 2026-05-08 04:00

Active Learning for Communication Structure Optimization in LLM-Based Multi-Agent Systems

arXiv:2605.05703v1 Announce Type: cross Abstract: Optimizing the communication structure of large language model based multi-agent systems (LLM-MAS) has been shown to improve downstream performance and reduce token usage. Existing methods typically rely on randomly sampled traini…

arXiv cs.LG TIER_1 · Hamed Hamzeh · 2026-05-08 04:00

AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling

arXiv:2603.12031v2 Announce Type: replace-cross Abstract: State-of-the-art cloud-native applications require intelligent schedulers that can effectively balance system stability, resource utilisation, and associated costs. While Kubernetes provides feasibility-based placement by …

arXiv cs.AI TIER_1 · Yuliang Xu, Xiang Xu, Yao Wan, Hu Wei, Tong Jia · 2026-05-08 04:00

MAS-Algorithm: A Workflow for Solving Algorithmic Programming Problems with a Multi-Agent System

arXiv:2605.05949v1 Announce Type: new Abstract: Algorithmic problem solving serves as a rigorous testbed for evaluating structured reasoning in AI coding systems, as it directly reflects a model's ability to perform structured reasoning in complex scenarios.Existing approaches pr…

arXiv cs.AI TIER_1 Dansk(DA) · Hongcheol Cho, Ryangkyung Kang, Youngeun Kim · 2026-05-08 04:00

SkillRet: A Large-Scale Benchmark for Skill Retrieval in LLM Agents

arXiv:2605.05726v1 Announce Type: new Abstract: As LLM agents are increasingly deployed with large libraries of reusable skills, selecting the right skill for a user request has become a critical systems challenge. In small libraries, users may invoke skills explicitly by name, b…

arXiv cs.AI TIER_1 · Zhengru Fang, Senkang Forest Hu, Zhonghao Chang, Yu Guo, Yihang Tao, Hongyao Liu, Mengzhe Ruan, Jun Huang, Yuguang Fang · 2026-05-08 04:00

Inference-Time Budget Control for LLM Search Agents

arXiv:2605.05701v1 Announce Type: new Abstract: LLM search agents increasingly rely on tools at inference time, but their trajectories are often constrained by hard limits on both tool calls and generated tokens. Under such dual budgets, better answers require not only stronger m…

arXiv cs.AI TIER_1 · Haoyang Xie, Xinyuan Wang, Yancheng Wang, Puda Zhao, Feng Ju · 2026-05-08 04:00

From History to State: Constant-Context Skill Learning for LLM Agents

arXiv:2605.05413v1 Announce Type: new Abstract: Large language model (LLM) agents are increasingly used to operate browsers, files, code and tools, making personal assistants a natural deployment target. Yet personal agents face a privacy-cost-capability tension: cloud models exe…

arXiv cs.CL TIER_1 · Bufang Yang, Lilin Xu, Liekang Zeng, Yunqi Guo, Siyang Jiang, Wenrui Lu, Kaiwei Liu, Yixuan Li, Xiaofan Jiang, Guoliang Xing, Zhenyu Yan · 2026-05-08 04:00

ProAgent: Harnessing On-Demand Sensory Contexts for Proactive LLM Agent Systems in the Wild

arXiv:2512.06721v2 Announce Type: replace-cross Abstract: Recent studies have begun to explore proactive large language model (LLM) agents that provide unobtrusive assistance by automatically leveraging contextual information, such as in code editing and in-app suggestions. Howev…

arXiv cs.CL TIER_1 · Zhexuan Wang, Xuebo Liu, Li Wang, Zifei Shan, Yutong Wang, Zhenxi Song, Min Zhang · 2026-05-08 04:00

MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

arXiv:2605.06623v1 Announce Type: cross Abstract: Large language model (LLM)-based Multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated via role-specific prompts. While the quality of these prompts is pivota…

arXiv cs.CL TIER_1 · Ming Liu · 2026-05-08 04:00

More Is Not Always Better: Cross-Component Interference in LLM Agent Scaffolding

arXiv:2605.05716v1 Announce Type: cross Abstract: LLM agent systems are built by stacking scaffolding components (planning, tools, memory, self-reflection, retrieval) assuming more is better. We study cross-component interference (CCI): degradation when components interact destru…

arXiv cs.LG TIER_1 · Zhiyuan Zhai, Xin Wang · 2026-05-08 04:00

Selective Rollout: Mid-Trajectory Termination for Multi-Sample Agent RL

arXiv:2605.05802v1 Announce Type: new Abstract: Group-relative RL training (GRPO) samples a small group of parallel rollouts for every training prompt and uses their within-group reward spread to compute per-trajectory advantages. In agentic environments each rollout is a long mu…

arXiv cs.LG TIER_1 Română(RO) · Apurva Gandhi, Satyaki Chakraborty, Xiangjun Wang, Aviral Kumar, Graham Neubig · 2026-05-08 04:00

Recursive Agent Optimization

arXiv:2605.06639v1 Announce Type: new Abstract: We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks to new instantiations of themselves recursively. Recursive agents impleme…

arXiv cs.AI TIER_1 · Zhe Liu, Zonghao Ying, Wenxin Zhang, Quanchen Zou, Deyue Zhang, Dongdong Yang, Xiangzheng Zhang, Hao Peng · 2026-05-08 04:00

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety

arXiv:2605.05704v1 Announce Type: cross Abstract: With the rapid evolution of foundation models, Large Language Model (LLM) agents have demonstrated increasingly powerful tool-use capabilities. However, this proficiency introduces significant security risks, as malicious actors c…

arXiv cs.AI TIER_1 · Keisuke Kamahori, Shihang Li, Simon Peter, Baris Kasikci · 2026-05-08 04:00

VibeServe: Can AI Agents Build Bespoke LLM Serving Systems?

arXiv:2605.06068v1 Announce Type: new Abstract: For years, we have built LLM serving systems like any other critical infrastructure: a single general-purpose stack, hand-tuned over many engineer-years, meant to support every model and workload. In this paper, we take the opposite…

arXiv cs.LG TIER_1 · Yi Xie, Yangyang Xu, Yi Fan, Bo Liu · 2026-05-08 04:00

SAT: Sequential Agent Tuning for Coordinator Free Plug and Play Multi-LLM Training with Monotonic Improvement Guarantees

arXiv:2605.05216v1 Announce Type: new Abstract: Large language models (LLMs) with a large number of parameters achieve strong performance but are often prohibitively expensive to deploy. Recent work explores using teams of smaller, more efficient LLMs that collectively match or e…

arXiv cs.AI TIER_1 Română(RO) · Graham Neubig · 2026-05-07 17:49

Recursive Agent Optimization

We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks to new instantiations of themselves recursively. Recursive agents implement an inference-time scaling algorithm that natu…

arXiv cs.AI TIER_1 · Min Zhang · 2026-05-07 17:35

MASPO: Joint Prompt Optimization for LLM-based Multi-Agent Systems

Large language model (LLM)-based Multi-agent systems (MAS) have shown promise in tackling complex collaborative tasks, where agents are typically orchestrated via role-specific prompts. While the quality of these prompts is pivotal, jointly optimizing them across interacting agen…

arXiv cs.AI TIER_1 · Kerri Prinos, Lilianne Brush, Cameron Denton, Zhanqi Wang, Joshua Knox, Snehal Antani, Anton Foltz, Amy Villase\~nor · 2026-05-07 04:00

Stable Agentic Control: Tool-Mediated LLM Architecture for Autonomous Cyber Defense

arXiv:2605.03034v1 Announce Type: new Abstract: Agentic systems involved in high-stake decision-making under adversarial pressure need formal guarantees not offered by existing approaches. Motivated by the operational needs of security operations centers (SOCs) that must configur…

arXiv cs.LG TIER_1 · Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Jizhou Guo, Yankai Chen, Chunyu Miao, Hoang Nguyen, Yue Zhou, Weizhi Zhang, Liancheng Fang, Hanrong Zhang, Fangxin Wang, Pengfei Zhang, Huacan Wang, Langzhou He, Yangning Li, Dongyuan Li, Renhe Jiang, Xue Liu, Ph · 2026-05-07 04:00

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey

arXiv:2505.00753v5 Announce Type: replace-cross Abstract: Recent advances in large language models (LLMs) have sparked growing interest in building fully autonomous agents. However, fully autonomous LLM-based agents still face significant challenges, including limited reliability…

arXiv cs.AI TIER_1 · Andrea Iannoli, Lorenzo Gigli, Luca Sciullo, Angelo Trotta, Marco Di Felice · 2026-05-07 04:00

Say the Mission, Execute the Swarm: Agent-Enhanced LLM Reasoning in the Web-of-Drones

arXiv:2605.03788v1 Announce Type: new Abstract: Large Language Models (LLMs) are increasingly explored as high-level reasoning engines for cyber-physical systems, yet their application to real-time UAV swarm management remains challenging due to heterogeneous interfaces, limited …

arXiv cs.AI TIER_1 · Maxim Chupilkin · 2026-05-07 04:00

Multi-Agent Strategic Games with LLMs

arXiv:2605.03604v1 Announce Type: cross Abstract: This paper asks whether large language models (LLMs) can be used to study the strategic foundations of conflict and cooperation. I introduce LLMs as experimental subjects in a repeated security dilemma and evaluate whether they re…

arXiv cs.AI TIER_1 · Zhiyuan Li, Wenshuai Zhao, Joni Pajarinen · 2026-05-07 04:00

Closed-Loop Vision-Language Planning for Multi-Agent Coordination

arXiv:2502.10148v3 Announce Type: replace Abstract: Cooperative multi-agent reinforcement learning (MARL) struggles with sample efficiency, interpretability, and generalization. While Large Language Models (LLMs) offer powerful planning capabilities, their application has been ha…

arXiv cs.AI TIER_1 · Hanchen Li, Runyuan He, Qiuyang Mang, Qizheng Zhang, Huanzhi Mao, Xiaokun Chen, Hangrui Zhou, Alvin Cheung, Joseph Gonzalez, Ion Stoica · 2026-05-07 04:00

Continuum: Efficient and Robust Multi-Turn LLM Agent Scheduling with KV Cache Time-to-Live

arXiv:2511.02230v4 Announce Type: replace-cross Abstract: KV cache management is essential for efficient LLM inference. To maximize utilization, existing inference engines evict finished requests' KV cache if new requests are waiting. This policy breaks for agentic workloads, whi…

arXiv cs.AI TIER_1 · Mengchen Zhao · 2026-05-06 13:35

Strat-Reasoner: Reinforcing Strategic Reasoning of LLMs in Multi-Agent Games

While Large Language Models (LLMs) excel in certain reasoning tasks, they struggle in multi-agent games where the final outcome depends on the joint strategies of all agents. In multi-agent games, the non-stationarity of other agents brings significant challenges on the evaluatio…

arXiv cs.LG TIER_1 · Robert-Jeron Reifert, Alaa Alameer Ahmad, Hayssam Dahrouj, Aydin Sezgin · 2026-05-06 04:00

Agentic AI-Based Joint Computing and Networking via Mixture of Experts and Large Language Models

arXiv:2605.02911v1 Announce Type: new Abstract: Future sixth-generation (6G) mobile networks are envisioned to be equipped with a diverse set of powerful, yet highly specialized, optimization experts. Such a promising vision is concurrently expected to give rise to the need for s…

arXiv cs.AI TIER_1 · Guowei Zou, Haitao Wang, Beiwen Zhang, Boning Zhang, Hejun Wu · 2026-05-06 04:00

CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making

arXiv:2605.01457v1 Announce Type: new Abstract: Generative models have emerged as a major paradigm for offline multi-agent reinforcement learning (MARL), but existing approaches require many iterative sampling steps. Recent few-step accelerations either distill a joint teacher in…

arXiv cs.AI TIER_1 · Manuel Hern\'andez, Eduardo S\'anchez-Soto · 2026-05-06 04:00

Sheaf-Theoretic Planning: A Categorical Foundation for Resilient Multi-Agent Autonomous Systems

arXiv:2605.01879v1 Announce Type: new Abstract: The challenge of engineering autonomous agents capable of navigating the stochastic and adversarial nature of the physical world has historically resided at the intersection of symbolic logic and control theory. Traditional multi-ag…

arXiv cs.AI TIER_1 · Xiyuan Zhou, Ruixi Zou, Xinlei Wang, Yuheng Cheng, Yan Xu, Junhua Zhao, Jinjin Gu · 2026-05-06 04:00

EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions

arXiv:2605.02289v1 Announce Type: new Abstract: Engineering problem solving is central to real-world decision-making, requiring mathematical formulations that not only represent complex problems but also produce feasible solutions under data and physical constraints. Unlike mathe…

arXiv cs.LG TIER_1 · Hongbo Jin, Rongpeng Zhu, Jiayu Ding, Guibo Luo, Ge Li · 2026-05-06 04:00

HiMAC: Hierarchical Macro-Micro Learning for Long-Horizon LLM Agents

arXiv:2603.00977v2 Announce Type: replace-cross Abstract: Large language model (LLM) agents have recently demonstrated strong capabilities in interactive decision-making, yet they remain fundamentally limited in long-horizon tasks that require structured planning and reliable exe…

arXiv cs.LG TIER_1 · Jackie Baek, Yaopeng Fu, Will Ma, Tianyi Peng · 2026-05-06 04:00

AI Agents for Inventory Control: Human-LLM-OR Complementarity

arXiv:2602.12631v2 Announce Type: replace-cross Abstract: Inventory control is a fundamental operations problem in which ordering decisions are traditionally guided by theoretically grounded operations research (OR) algorithms. However, such algorithms often rely on rigid modelin…

arXiv cs.LG TIER_1 · Maksym Nechepurenko, Pavel Shuvalov · 2026-05-06 04:00

Coordination as an Architectural Layer for LLM-Based Multi-Agent Systems

arXiv:2605.03310v1 Announce Type: cross Abstract: Multi-agent LLM systems fail in production at rates between 41% and 87%, mostly due to coordination defects rather than base-model capability. Existing responses split between cataloguing failure modes empirically and shipping dec…

arXiv cs.AI TIER_1 · Vicente Pelechanoa, Antoni Mestre, Manoli Albert, Miriam Gil · 2026-05-06 04:00

HAAS: A Policy-Aware Framework for Adaptive Task Allocation Between Humans and Artificial Intelligence Systems

arXiv:2605.02832v1 Announce Type: new Abstract: Deciding how to distribute work between humans and AI systems is a central challenge in organisational design. Most approaches treat this as a binary choice, yet the operational reality is richer: humans and AI routinely share tasks…

arXiv cs.AI TIER_1 · Jose Manuel de la Chica, Juan Manuel Vera, Jairo Rodr\'iguez · 2026-05-06 04:00

When Stress Becomes Signal: Detecting Antifragility-Compatible Regimes in Multi-Agent LLM Systems

arXiv:2605.02463v1 Announce Type: cross Abstract: Multi-agent LLM systems are increasingly used to solve complex tasks through decomposition, debate, specialization, and ensemble reasoning. However, these systems are usually evaluated in terms of robustness: whether performance i…

arXiv cs.AI TIER_1 · Shuo Liu, Tianle Chen, Ryan Amiri, Christopher Amato · 2026-05-06 04:00

Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic

arXiv:2601.21972v4 Announce Type: replace Abstract: Recent work has explored optimizing LLM collaboration through Multi-Agent Reinforcement Learning (MARL). However, most MARL fine-tuning approaches rely on predefined execution protocols, which often require centralized execution…

arXiv cs.AI TIER_1 · Marco Di Felice · 2026-05-05 14:14

Say the Mission, Execute the Swarm: Agent-Enhanced LLM Reasoning in the Web-of-Drones

Large Language Models (LLMs) are increasingly explored as high-level reasoning engines for cyber-physical systems, yet their application to real-time UAV swarm management remains challenging due to heterogeneous interfaces, limited grounding, and the need for long-running closed-…

arXiv cs.AI TIER_1 · Maxim Chupilkin · 2026-05-05 10:28

Multi-Agent Strategic Games with LLMs

This paper asks whether large language models (LLMs) can be used to study the strategic foundations of conflict and cooperation. I introduce LLMs as experimental subjects in a repeated security dilemma and evaluate whether they reproduce canonical mechanisms from international re…

arXiv cs.LG TIER_1 · Vik Pant, Eric Yu · 2026-05-05 04:00

Coopetition-Gym v1: A Formally Grounded Platform for Mixed-Motive Multi-Agent Reinforcement Learning under Strategic Coopetition

arXiv:2605.02063v1 Announce Type: cross Abstract: We present Coopetition-Gym v1, a benchmark platform for mixed-motive multi-agent reinforcement learning under strategic coopetition. The platform comprises twenty environments organized into four mechanism classes that correspond …

arXiv cs.CL TIER_1 · Chenchen Zhang · 2026-05-05 04:00

Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

arXiv:2605.02801v1 Announce Type: new Abstract: As large language model (LLM) agents evolve from isolated tool users into coordinated teams, reinforcement learning (RL) must optimize not only individual actions but also how work is spawned, delegated, communicated, aggregated, an…

arXiv cs.CL TIER_1 · Jianze Wang, Ying Liu, Jinlong Chen, Xuchun Hu, Qilong Zhang, Yu Cao, Jun Wang, Hua Yang, Yong Xie, Qianglong Chen · 2026-05-05 04:00

MAD-OPD: Breaking the Ceiling in On-Policy Distillation via Multi-Agent Debate

arXiv:2605.01347v1 Announce Type: new Abstract: On-policy distillation (OPD) trains a student on its own trajectories under token-level teacher supervision, but existing methods are capped by a single-teacher capability ceiling: when the teacher errs, the student inherits the err…

arXiv cs.CL TIER_1 · Siddeshwar Raghavan, Tanwi Mallick · 2026-05-05 04:00

MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding

arXiv:2510.08804v3 Announce Type: replace Abstract: We present MOSAIC, a multi-agent Large Language Model (LLM) framework for solving challenging scientific coding tasks. Unlike general-purpose coding, scientific workflows require algorithms that are rigorous, interconnected with…

arXiv cs.LG TIER_1 · Wenyi Wu, Sibo Zhu, Kun Zhou, Biwei Huang · 2026-05-05 04:00

Planner Matters! An Efficient and Unbalanced Multi-agent Collaboration Framework for Long-horizon Planning

arXiv:2605.02168v1 Announce Type: cross Abstract: Language model (LM)-based agents have demonstrated promising capabilities in automating complex tasks from natural language instructions, yet they continue to struggle with long-horizon planning and reasoning. To address this, we …

arXiv cs.AI TIER_1 · Miriam Gil · 2026-05-04 17:09

HAAS: A Policy-Aware Framework for Adaptive Task Allocation Between Humans and Artificial Intelligence Systems

Deciding how to distribute work between humans and AI systems is a central challenge in organisational design. Most approaches treat this as a binary choice, yet the operational reality is richer: humans and AI routinely share tasks or take complementary roles depending on contex…

arXiv cs.CL TIER_1 · Chenchen Zhang · 2026-05-04 16:42

Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces

As large language model (LLM) agents evolve from isolated tool users into coordinated teams, reinforcement learning (RL) must optimize not only individual actions but also how work is spawned, delegated, communicated, aggregated, and stopped. This paper studies RL for LLM-based m…

arXiv cs.AI TIER_1 · Jairo Rodríguez · 2026-05-04 11:06

When Stress Becomes Signal: Detecting Antifragility-Compatible Regimes in Multi-Agent LLM Systems

Multi-agent LLM systems are increasingly used to solve complex tasks through decomposition, debate, specialization, and ensemble reasoning. However, these systems are usually evaluated in terms of robustness: whether performance is preserved under perturbation. This paper studies…

Hugging Face Daily Papers TIER_1 · 2026-05-04 07:26

EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions

Engineering problem solving is central to real-world decision-making, requiring mathematical formulations that not only represent complex problems but also produce feasible solutions under data and physical constraints. Unlike mathematical problem solving, which operates on prede…

arXiv cs.LG TIER_1 · Alejandro Breen Herrera, Aayush Sheth, Steven G. Xu, Zhucheng Zhan, Charles Wright, Marcus Yearwood, Hongtai Wei, Sudeep Das, Danny Nightingale, Meg Watson, Charles Pollnow V · 2026-05-04 04:00

Build, Judge, Optimize: A Blueprint for Continuous Improvement of Multi-Agent Consumer Assistants

arXiv:2603.03565v2 Announce Type: replace-cross Abstract: Conversational shopping assistants (CSAs) represent a compelling application of agentic AI, but moving from prototype to production reveals two underexplored challenges: how to evaluate multi-turn interactions and how to o…

arXiv cs.LG TIER_1 · Chunlei Meng, Pengbin Feng, Rong Fu, Hoi Leong Lee, Xiaojing Du, Zhaolu Kang, Zeyu Zhang, Weilin Zhou, Chun Ouyang, Zhongxue Gan · 2026-05-04 04:00

Group Cognition Learning: Making Everything Better Through Governed Two-Stage Agents Collaboration

arXiv:2605.00370v1 Announce Type: new Abstract: Centralized multimodal learning commonly compresses language, acoustic, and visual signals into a single fused representation for prediction. While effective, this paradigm suffers from two limitations: modality dominance, where opt…

arXiv cs.AI TIER_1 · Pedro-Aar\'on Hern\'andez-\'Avalos, Luciano Garc\'ia-Ba\~nuelos · 2026-05-01 04:00

Pragmos: A Process Agentic Modeling System

arXiv:2604.27311v1 Announce Type: cross Abstract: The advent of Large Language Models (LLMs) has significantly transformed tasks across Software Engineering. In the context of Business Process Management, LLMs are now being explored as tools to derive process models directly from…

arXiv cs.AI TIER_1 · Chunhui Zhang, Yuxuan Wang, Aoyang Qin, Yi-Long Lu, Kunlun Wu, Yizhou Wang, Wei Wang · 2026-05-01 04:00

Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents

arXiv:2604.27699v1 Announce Type: new Abstract: Current embodied agents are often limited to passive instruction-following or reactive need-satisfaction, lacking a stable, high-order value framework essential for long-term, self-directed behavior and resolving motivational confli…

arXiv cs.AI TIER_1 · Giuseppe Arbore, Andrea Sillano, Luigi De Russis · 2026-05-01 04:00

Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs

arXiv:2604.27882v1 Announce Type: new Abstract: Recent advances in agentic AI are shifting automation from discrete tools to proactive multi-agent systems that coordinate multi-specialized capabilities behind unified interfaces. However, today's agent systems typically rely on ha…

arXiv cs.AI TIER_1 · Junan Hu, Jian Liu, Jingxiang Lai, Jiarui Hu, Yiwei Sheng, Shuang Chen, Jian Li, Dazhao Du, Song Guo · 2026-05-01 04:00

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

arXiv:2604.27955v1 Announce Type: new Abstract: Graphical User Interface (GUI) agents have emerged as a promising paradigm for intelligent systems that perceive and interact with graphical interfaces visually. Yet supervised fine-tuning alone cannot handle long-horizon credit ass…

arXiv cs.AI TIER_1 · Rahul Ramachandran, Nidhi Jha, Muthukumaran Ramasubramanian · 2026-05-01 04:00

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

arXiv:2604.28043v1 Announce Type: new Abstract: We present Collaborative Agent Reasoning Engineering (CARE), a disciplined methodology for engineering Large Language Model (LLM) agents in scientific domains. Unlike ad-hoc trial-and-error approaches, CARE specifies behavior, groun…

arXiv cs.LG TIER_1 · Yifei Wang, Hancheng Ye, Yechen Xu, Cong Guo, Chiyue Wei, Qinsi Wang, Dongting Li, Tingjun Chen, Hai "Helen" Li, Danyang Zhuo, Yiran Chen · 2026-05-01 04:00

MARS: Efficient, Adaptive Co-Scheduling for Heterogeneous Agentic Systems

arXiv:2604.26963v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly deployed as the execution core of autonomous agents rather than as standalone text generators. Agentic workloads induce a temporal shift from single-turn inference to multi-turn LLM-to…

arXiv cs.CL TIER_1 · Jiacheng Liu, Zichen Tang, Zhongjun Yang, Xinyi Hu, Xueyuan Lin, Linwei Jia, Ruofei Bai, Rongjin Li, Shiyao Peng, Haocheng Gao, Haihong E · 2026-05-01 04:00

RoadMapper: A Multi-Agent System for Roadmap Generation of Solving Complex Research Problems

arXiv:2604.27616v1 Announce Type: new Abstract: People commonly leverage structured content to accelerate knowledge acquisition and research problem solving. Among these, roadmaps guide researchers through hierarchical subtasks to solve complex research problems step by step. Des…

arXiv cs.AI TIER_1 · Francesca Gomez · 2026-05-01 04:00

From surveillance to signalling: escalation channels as environmental controls for agentic AI

arXiv:2510.05192v2 Announce Type: replace-cross Abstract: When AI agents operating with access to sensitive information encounter a conflict between completing an assigned task and following rules or ethical constraints, they can resort to unsanctioned behaviour. Existing inferen…

arXiv cs.AI TIER_1 · Jiaju Chen, Jinghua Piao, Xia Xu, Songwei Li, Tong Xia, Xiangnan He, Yong Li · 2026-05-01 04:00

AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments

arXiv:2604.27725v1 Announce Type: cross Abstract: A long-standing challenge in economics lies not in the lack of intuition, but in the difficulty of translating intuitive insights into verifiable research. To address this challenge, we introduce AgentEconomist, an end-to-end inte…

arXiv cs.AI TIER_1 · Jinbiao Wei, Kangqi Ni, Yilun Zhao, Guo Gan, Arman Cohan · 2026-05-01 04:00

Step-level Optimization for Efficient Computer-use Agents

arXiv:2604.27151v1 Announce Type: new Abstract: Computer-use agents provide a promising path toward general software automation because they can interact directly with arbitrary graphical user interfaces instead of relying on brittle, application-specific integrations. Despite re…

arXiv cs.AI TIER_1 · Anh Ta, Junjie Zhu, Shahin Shayandeh · 2026-05-01 04:00

Reinforced Agent: Inference-Time Feedback for Tool-Calling Agents

arXiv:2604.27233v1 Announce Type: new Abstract: Tool-calling agents are evaluated on tool selection, parameter accuracy, and scope recognition, yet LLM trajectory assessments remain inherently post-hoc. Disconnected from the active execution loop, such assessments identify errors…

X — Greg Brockman (OpenAI) TIER_1 · Greg Brockman · 2026-04-30 19:30

Secure your ChatGPT account with Advanced Account Security:

Secure your ChatGPT account with Advanced Account Security:<div class="rsshub-quote"><br /><br />OpenAI: Now available for ChatGPT accounts: Advanced Account Security, a new opt-in setting for people at higher risk of digital attacks, with stronger protections including phishing-…

arXiv cs.AI TIER_1 · Muthukumaran Ramasubramanian · 2026-04-30 15:54

Collaborative Agent Reasoning Engineering (CARE): A Three-Party Design Methodology for Systematically Engineering AI Agents with Subject Matter Experts, Developers, and Helper Agents

We present Collaborative Agent Reasoning Engineering (CARE), a disciplined methodology for engineering Large Language Model (LLM) agents in scientific domains. Unlike ad-hoc trial-and-error approaches, CARE specifies behavior, grounding, tool orchestration, and verification throu…

arXiv cs.AI TIER_1 · Luigi De Russis · 2026-04-30 14:01

Building Persona-Based Agents On Demand: Tailoring Multi-Agent Workflows to User Needs

Recent advances in agentic AI are shifting automation from discrete tools to proactive multi-agent systems that coordinate multi-specialized capabilities behind unified interfaces. However, today's agent systems typically rely on hard-coded agent architectures with fixed roles, c…

arXiv cs.AI TIER_1 · Yong Li · 2026-04-30 11:17

AgentEconomist: An End-to-end Agentic System Translating Economic Intuitions into Executable Computational Experiments

A long-standing challenge in economics lies not in the lack of intuition, but in the difficulty of translating intuitive insights into verifiable research. To address this challenge, we introduce AgentEconomist, an end-to-end interactive system designed to translate abstract intu…

Hugging Face Daily Papers TIER_1 · 2026-04-30 10:43

Bridging Values and Behavior: A Hierarchical Framework for Proactive Embodied Agents

Current embodied agents are often limited to passive instruction-following or reactive need-satisfaction, lacking a stable, high-order value framework essential for long-term, self-directed behavior and resolving motivational conflicts. We introduce \textit{ValuePlanner}, a hiera…

arXiv cs.CL TIER_1 · Haihong E · 2026-04-30 09:08

RoadMapper: A Multi-Agent System for Roadmap Generation of Solving Complex Research Problems

People commonly leverage structured content to accelerate knowledge acquisition and research problem solving. Among these, roadmaps guide researchers through hierarchical subtasks to solve complex research problems step by step. Despite progress in structured content generation, …

arXiv cs.AI TIER_1 · Ariel Sela · 2026-04-30 04:00

Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation

arXiv:2604.26561v1 Announce Type: cross Abstract: Multi-agent deliberation systems using large language models (LLMs) are increasingly proposed for policy simulation, yet they suffer from artificial consensus: evaluator agents converge on the same option regardless of their assig…

arXiv cs.AI TIER_1 · Benedikt Bollig, Matthias F\"ugger, Thomas Nowak · 2026-04-30 04:00

Provable Coordination for LLM Agents via Message Sequence Charts

arXiv:2604.17612v2 Announce Type: replace-cross Abstract: Multi-agent systems built on large language models (LLMs) are difficult to reason about. Coordination errors such as deadlocks or type-mismatched messages are often hard to detect through testing. We introduce a domain-spe…

arXiv cs.AI TIER_1 · Xingyan Liu, Xiyue Luo, Linyu Li, Ganghong Huang, Jianfeng Liu, Honglin Qiao · 2026-04-30 04:00

SkillForge: Forging Domain-Specific, Self-Evolving Agent Skills in Cloud Technical Support

arXiv:2604.08618v2 Announce Type: replace-cross Abstract: Deploying LLM-powered agents in enterprise scenarios such as cloud technical support demands high-quality, domain-specific skills. However, existing skill creators lack domain grounding, producing skills poorly aligned wit…

arXiv cs.AI TIER_1 Nederlands(NL) · Christoph Riedl · 2026-04-30 04:00

Emergent Coordination in Multi-Agent Language Models

arXiv:2510.05174v4 Announce Type: replace-cross Abstract: When are multi-agent LLM systems merely a collection of individual agents versus an integrated collective with higher-order structure? We introduce an information-theoretic framework to test -- in a purely data-driven way …

arXiv cs.AI TIER_1 · Junxing Hu, Tianlong Li, Lei Yu, Ai Han · 2026-04-30 04:00

OxyGent: Making Multi-Agent Systems Modular, Observable, and Evolvable via Oxy Abstraction

arXiv:2604.25602v2 Announce Type: replace Abstract: Deploying production-ready multi-agent systems (MAS) in complex industrial environments remains challenging due to limitations in scalability, observability, and autonomous evolution. We present OxyGent, an open-source framework…

arXiv cs.CL TIER_1 · Rui Wang, Ce Zhang, Jun-Yu Ma, Jianshu Zhang, Hongru Wang, Yi Chen, Boyang Xue, Tianqing Fang, Zhisong Zhang, Hongming Zhang, Haitao Mi, Dong Yu, Kam-Fai Wong · 2026-04-30 04:00

WebAggregator: Enhancing Compositional Reasoning Capabilities of Deep Research Agent Foundation Models

arXiv:2510.14438v2 Announce Type: replace Abstract: The hallmark of Deep Research agents lies in compositional reasoning, the capacity to aggregate distributed, heterogeneous information into coherent logical insights. However, current agentic systems are often retrieval-heavy bu…

arXiv cs.CL TIER_1 · Jincheng Ren, Siwei Wu, Yizhi Li, Kang Zhu, Shu Xu, Boyu Feng, Ruibin Yuan, Wei Zhang, Riza Batista-Navarro, Jian Yang, Chenghua Lin · 2026-04-30 04:00

A Self-Evolving Framework for Efficient Terminal Agents via Observational Context Compression

arXiv:2604.19572v2 Announce Type: replace Abstract: As terminal agents scale to long-horizon, multi-turn workflows, a key bottleneck is not merely limited context length, but the accumulation of noisy terminal observations in the interaction history. Retaining raw observations pr…

arXiv cs.AI TIER_1 · Tom Liptay, Dan Schwarz, Rafael Poyiadzi, Jack Wildman, Nikos I. Bosse · 2026-04-30 04:00

Evaluating Strategic Reasoning in Forecasting Agents

arXiv:2604.26106v1 Announce Type: new Abstract: Forecasting benchmarks produce accuracy leaderboards but little insight into why some forecasters are more accurate than others. We introduce Bench to the Future 2 (BTF-2), 1,417 pastcasting questions with a frozen 15M-document rese…

arXiv cs.AI TIER_1 · Mahnoor Shahid, Hannes Rothe · 2026-04-30 04:00

AGEL-Comp: A Neuro-Symbolic Framework for Compositional Generalization in Interactive Agents

arXiv:2604.26522v1 Announce Type: new Abstract: Large Language Model (LLM)-based agents exhibit systemic failures in compositional generalization, limiting their robustness in interactive environments. This work introduces AGEL-Comp, a neuro-symbolic AI agent architecture designe…

arXiv cs.AI TIER_1 · Zhixin Han, Yanzhi Zhang, Chuyang Wei, Maohang Gao, Xiawei Yue, Kefei Chen, Yu Zhuang, Haoxiang Guan, Jiyan He, Jian Li, Yitong Duan, Yu Shi, Mengting Hu, Shuxin Zheng · 2026-04-30 04:00

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

arXiv:2604.26733v1 Announce Type: new Abstract: Live future prediction refers to the task of making predictions about real-world events before they unfold. This task is increasingly studied using large language model-based agent systems, and it is important for building agents th…

arXiv cs.AI TIER_1 · Bochao Liu, Zhipeng Qian, Yang Zhao, Xinyuan Jiang, Zihan Liang, Yufei Ma, Junpeng Zhuang, Ben Chen, Shuo Yang, Hongen Wan, Yao Wu, Chenyi Lei, Xiao Liang · 2026-04-30 04:00

Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations

arXiv:2604.26805v1 Announce Type: new Abstract: Operating and maintaining (O&M) large-scale online engine systems (search, recommendation, advertising) demands substantial human effort for release monitoring, alert response, and root cause analysis. While LLM-based agents are…

Hugging Face Daily Papers TIER_1 · 2026-04-30 01:49

Pragmos: A Process Agentic Modeling System

The advent of Large Language Models (LLMs) has significantly transformed tasks across Software Engineering. In the context of Business Process Management, LLMs are now being explored as tools to derive process models directly from textual descriptions. Existing approaches range f…

Hugging Face Daily Papers TIER_1 · 2026-04-29 15:35

Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations

Operating and maintaining (O&M) large-scale online engine systems (search, recommendation, advertising) demands substantial human effort for release monitoring, alert response, and root cause analysis. While LLM-based agents are a natural fit for these tasks, the deployment b…

arXiv cs.AI TIER_1 · Xiao Liang · 2026-04-29 15:35

Bian Que: An Agentic Framework with Flexible Skill Arrangement for Online System Operations

Operating and maintaining (O&M) large-scale online engine systems (search, recommendation, advertising) demands substantial human effort for release monitoring, alert response, and root cause analysis. While LLM-based agents are a natural fit for these tasks, the deployment bottl…

Hugging Face Daily Papers TIER_1 · 2026-04-29 14:34

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

Live future prediction refers to the task of making predictions about real-world events before they unfold. This task is increasingly studied using large language model-based agent systems, and it is important for building agents that can continually learn from real-world. Just a…

arXiv cs.AI TIER_1 · Shuxin Zheng · 2026-04-29 14:34

FutureWorld: A Live Environment for Training Predictive Agents with Real-World Outcome Rewards

Live future prediction refers to the task of making predictions about real-world events before they unfold. This task is increasingly studied using large language model-based agent systems, and it is important for building agents that can continually learn from real-world. Just a…

arXiv cs.AI TIER_1 · Ariel Sela · 2026-04-29 11:47

Preserving Disagreement: Architectural Heterogeneity and Coherence Validation in Multi-Agent Policy Simulation

Multi-agent deliberation systems using large language models (LLMs) are increasingly proposed for policy simulation, yet they suffer from artificial consensus: evaluator agents converge on the same option regardless of their assigned value perspectives. We present the AI Council,…

arXiv cs.AI TIER_1 · Hannes Rothe · 2026-04-29 10:42

AGEL-Comp: A Neuro-Symbolic Framework for Compositional Generalization in Interactive Agents

Large Language Model (LLM)-based agents exhibit systemic failures in compositional generalization, limiting their robustness in interactive environments. This work introduces AGEL-Comp, a neuro-symbolic AI agent architecture designed to address this challenge by grounding actions…

arXiv cs.CL TIER_1 · Abigail O'Neill, Alan Zhu, Mihran Miroyan, Narges Norouzi, Joseph E. Gonzalez · 2026-04-29 04:00

Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest

arXiv:2604.25088v1 Announce Type: cross Abstract: Language Model (LM)-based agents remain largely untested in mixed-motive settings where agents must leverage short-term cooperation for long-term competitive goals (e.g., multi-party politics). We introduce Cooperate to Compete (C…

arXiv cs.CL TIER_1 · Yunsu Kim, Kaden Uhlig, Joern Wuebker · 2026-04-29 04:00

GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation

arXiv:2604.24929v1 Announce Type: new Abstract: Agent benchmarks remain largely English-centric, while their multilingual versions are often built with machine translation (MT) and limited post-editing. We argue that, for agentic tasks, this minimal workflow can easily break benc…

arXiv cs.LG TIER_1 · Shiyi Du, Jiayuan Liu, Weihua Du, Yue Huang, Jiayi Li, Yingtao Luo, Xiangliang Zhang, Vincent Conitzer, Carl Kingsford · 2026-04-29 04:00

Why Search When You Can Transfer? Amortized Agentic Workflow Design from Structural Priors

arXiv:2604.25012v1 Announce Type: new Abstract: Automated agentic workflow design currently relies on per-task iterative search, which is computationally prohibitive and fails to reuse structural knowledge across tasks. We observe that optimized workflows converge to a small fami…

arXiv cs.CL TIER_1 · Mohamed Aghzal, Gregory J. Stein, Ziyu Yao · 2026-04-29 04:00

Why Do LLM-based Web Agents Fail? A Hierarchical Planning Perspective

arXiv:2603.14248v2 Announce Type: replace-cross Abstract: Large language model (LLM) web agents are increasingly used for web navigation but remain far from human reliability on realistic, long-horizon tasks. Existing evaluations focus primarily on end-to-end success, offering li…

arXiv cs.CL TIER_1 Română(RO) · Xiyuan Yang, Jiaru Zou, Rui Pan, Ruizhong Qiu, Pan Lu, Shizhe Diao, Jindong Jiang, Hanghang Tong, Tong Zhang, Markus J. Buehler, Jingrui He, James Zou · 2026-04-29 04:00

Recursive Multi-Agent Systems

arXiv:2604.25917v1 Announce Type: cross Abstract: Recursive or looped language models have recently emerged as a new scaling axis by iteratively refining the same model computation over latent states to deepen reasoning. We extend such scaling principle from a single model to mul…

arXiv cs.CL TIER_1 · Stan Loosmore · 2026-04-29 04:00

Leverage Laws: A Per-Task Framework for Human-Agent Collaboration

arXiv:2604.25040v1 Announce Type: cross Abstract: We propose a per-task leverage ratio for human-agent collaboration: human work displaced by an agent, divided by the human time required to specify the task, resolve mid-run interrupts, and review the result. The denominator decom…

arXiv cs.CL TIER_1 · Kaixuan Fan, Kaituo Feng, Manyuan Zhang, Tianshuo Peng, Zhixun Li, Yilei Jiang, Shuang Chen, Peng Pei, Xunliang Cai, Xiangyu Yue · 2026-04-29 04:00

Exploring Reasoning Reward Model for Agents

arXiv:2601.22154v2 Announce Type: replace-cross Abstract: Agentic Reinforcement Learning (Agentic RL) has achieved notable success in enabling agents to perform complex reasoning and tool use. However, most methods still relies on sparse outcome-based reward for training. Such fe…

arXiv cs.CL TIER_1 · Abhijnan Nath, Hannah VanderHoeven, Nikhil Krishnaswamy · 2026-04-29 04:00

CRAFT: Grounded Multi-Agent Coordination Under Partial Information

arXiv:2603.25268v2 Announce Type: replace Abstract: We introduce CRAFT, a multi-agent benchmark for evaluating pragmatic communication in large language models under strict partial information. In this setting, multiple agents with complementary but incomplete views must coordina…

arXiv cs.CL TIER_1 Română(RO) · James Zou · 2026-04-28 17:59

Recursive Multi-Agent Systems

Recursive or looped language models have recently emerged as a new scaling axis by iteratively refining the same model computation over latent states to deepen reasoning. We extend such scaling principle from a single model to multi-agent systems, and ask: Can agent collaboration…

arXiv cs.AI TIER_1 · Ai Han · 2026-04-28 13:08

OxyGent: Making Multi-Agent Systems Modular, Observable, and Evolvable via Oxy Abstraction

Deploying production-ready multi-agent systems (MAS) in complex industrial environments remains challenging due to limitations in scalability, observability, and autonomous evolution. We present OxyGent, an open-source framework that enables modular, observable, and evolvable MAS…

arXiv cs.AI TIER_1 · Edward Cheng, Jeshua Cheng · 2026-04-28 04:00

A Decoupled Human-in-the-Loop System for Controlled Autonomy in Agentic Workflows

arXiv:2604.23049v1 Announce Type: new Abstract: AI agents are increasingly deployed to execute tasks and make decisions within agentic workflows, introducing new requirements for safe and controlled autonomy. Prior work has established the importance of human oversight for ensuri…

arXiv cs.CL TIER_1 · Qiliang Liang, Hansi Wang, Zhong Liang, Yang Liu · 2026-04-28 04:00

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

arXiv:2604.24026v1 Announce Type: new Abstract: LLM agents increasingly rely on reusable skills, capability packages that combine instructions, control flow, constraints, and tool calls. In most current agent systems, however, skills are still represented by text-heavy artifacts,…

arXiv cs.CL TIER_1 · Yizhe Chi, Deyao Hong, Dapeng Jiang, Tianwei Luo, Kaisen Yang, Boshi Zhang, Zhe Cao, Xiaoyan Fan, Bingxiang He, Han Hao, Weiyang Jin, Dianqiao Lei, Qingle Liu, Houde Qian, Bowen Wang, Situ Wang, Youjie Zheng, Yifan Zhou, Calvin Xiao, Eren Cai, Qinhuai Na · 2026-04-28 04:00

Frontier-Eng: Benchmarking Self-Evolving Agents on Real-World Engineering Tasks with Generative Optimization

arXiv:2604.12290v2 Announce Type: replace-cross Abstract: Current LLM agent benchmarks, which predominantly focus on binary pass/fail tasks such as code generation or search-based question answering, often neglect the value of real-world engineering that is often captured through…

arXiv cs.LG TIER_1 · Jie Wu, Ming Gong · 2026-04-28 04:00

Beyond Single-Agent Alignment: Preventing Context-Fragmented Violations in Multi-Agent Systems

arXiv:2604.22879v1 Announce Type: cross Abstract: We identify and formalize a novel security risk: Context-Fragmented Violations (CFVs) - a class of policy breaches where individual agent actions appear locally safe and reasonable, yet collectively violate organizational policies…

arXiv cs.AI TIER_1 · Wenji Fang, Yao Lu, Shang Liu, Jing Wang, Ziyan Guo, Junxian He, Fengbin Tu, Zhiyao Xie · 2026-04-28 04:00

Dr. RTL: Autonomous Agentic RTL Optimization through Tool-Grounded Self-Improvement

arXiv:2604.14989v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have sparked growing interest in automatic RTL optimization for better performance, power, and area (PPA). However, existing methods are still far from realistic RTL optimization. …

arXiv cs.AI TIER_1 · Jingwei Ni, Yihao Liu, Xinpeng Liu, Yutao Sun, Mengyu Zhou, Pengyu Cheng, Dexin Wang, Erchao Zhao, Xiaoxi Jiang, Guanjun Jiang · 2026-04-28 04:00

Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills

arXiv:2603.25158v4 Announce Type: replace Abstract: Equipping Large Language Model (LLM) agents with domain-specific skills is critical for tackling complex tasks. Yet, manual authoring creates a severe scalability bottleneck. Conversely, automated skill generation often yields f…

arXiv cs.AI TIER_1 · Yifan Zhang, Jianmin Ye, Jiahao Yang, Xi Wang · 2026-04-28 04:00

RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation

arXiv:2604.24218v1 Announce Type: cross Abstract: As the complexity of System-on-Chip (SoC) designs grows, the shift-left paradigm necessitates the rapid development of high-fidelity reference models (typically written in SystemC) for early architecture exploration and verificati…

arXiv cs.AI TIER_1 · Zhuohui Zhang, Bin Cheng, Bin He · 2026-04-28 04:00

DLM: Unified Decision Language Models for Offline Multi-Agent Sequential Decision Making

arXiv:2604.23557v1 Announce Type: cross Abstract: Building scalable and reusable multi-agent decision policies from offline datasets remains a challenge in offline multi-agent reinforcement learning (MARL), as existing methods often rely on fixed observation formats and action sp…

arXiv cs.AI TIER_1 · Patrizio Dazzi, Emanuele Carlini, Matteo Mordacchini, Saul Urso · 2026-04-28 04:00

Usable Agent Discovery for Decentralized AI Systems

arXiv:2604.23080v1 Announce Type: cross Abstract: Large-scale agentic systems run on distributed infrastructures where many software agents share physical hosts and are discovered via peer-to-peer mechanisms. Discovery must handle node-level churn from failures and host departure…

arXiv cs.AI TIER_1 · Zavier Ndum Ndum, Jian Tao, John Ford, Mansung Yim, Yang Liu · 2026-04-28 04:00

RADIANT-LLM: an Agentic Retrieval Augmented Generation Framework for Reliable Decision Support in Safety-Critical Nuclear Engineering

arXiv:2604.22755v1 Announce Type: cross Abstract: Reliable decision support in nuclear engineering requires traceable, domain-grounded knowledge retrieval, yet safety and risk analysis workflows remain hampered by fragmented documentation and hallucination when use pre-trained la…

arXiv cs.AI TIER_1 · Boqin Yuan, Renchu Song, Yue Su, Sen Yang, Jing Qin · 2026-04-28 04:00

ClawTrace: Cost-Aware Tracing for LLM Agent Skill Distillation

arXiv:2604.23853v1 Announce Type: new Abstract: Skill-distillation pipelines learn reusable rules from LLM agent trajectories, but they lack a key signal: how much each step costs. Without per-step cost, a pipeline cannot distinguish adding a missing step to fix a bug from removi…

arXiv cs.AI TIER_1 · Rong Xiang · 2026-04-28 04:00

Structural Enforcement of Goal Integrity in AI Agents via Separation-of-Powers Architecture

arXiv:2604.23646v1 Announce Type: new Abstract: Recent evidence suggests that frontier AI systems can exhibit agentic misalignment, generating and executing harmful actions derived from internally constructed goals, even without explicit user requests. Existing mitigation methods…

arXiv cs.AI TIER_1 · Haoran Tan, Zeyu Zhang, Chen Ma, Tianze Liu, Quanyu Dai, Xu Chen · 2026-04-28 04:00

From Coarse to Fine: Self-Adaptive Hierarchical Planning for LLM Agents

arXiv:2604.23194v1 Announce Type: new Abstract: Large language model-based agents have recently emerged as powerful approaches for solving dynamic and multi-step tasks. Most existing agents employ planning mechanisms to guide long-term actions in dynamic environments. However, cu…

arXiv cs.LG TIER_1 · Tianbao Zhang · 2026-04-28 04:00

Harness as an Asset: Enforcing Determinism via the Convergent AI Agent Framework (CAAF)

arXiv:2604.17025v2 Announce Type: replace-cross Abstract: Large Language Models produce a controllability gap in safety-critical engineering: even low rates of undetected constraint violations render a system undeployable. Current orchestration paradigms suffer from sycophantic c…

Hugging Face Daily Papers TIER_1 · 2026-04-28 00:48

Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest

Language Model (LM)-based agents remain largely untested in mixed-motive settings where agents must leverage short-term cooperation for long-term competitive goals (e.g., multi-party politics). We introduce Cooperate to Compete (C2C), a multi-agent environment where players can e…

arXiv cs.CL TIER_1 · Joseph E. Gonzalez · 2026-04-28 00:48

Cooperate to Compete: Strategic Coordination in Multi-Agent Conquest

Language Model (LM)-based agents remain largely untested in mixed-motive settings where agents must leverage short-term cooperation for long-term competitive goals (e.g., multi-party politics). We introduce Cooperate to Compete (C2C), a multi-agent environment where players can e…

Hugging Face Daily Papers TIER_1 · 2026-04-28 00:31

Agentic Architect: An Agentic AI Framework for Architecture Design Exploration and Optimization

Rapid advances in Large Language Models (LLMs) create new opportunities by enabling efficient exploration of broad, complex design spaces. This is particularly valuable in computer architecture, where performance depends on microarchitectural designs and policies drawn from vast …

arXiv cs.CL TIER_1 · Stan Loosmore · 2026-04-27 22:43

Leverage Laws: A Per-Task Framework for Human-Agent Collaboration

We propose a per-task leverage ratio for human-agent collaboration: human work displaced by an agent, divided by the human time required to specify the task, resolve mid-run interrupts, and review the result. The denominator decomposes into three channels through which a conserve…

arXiv cs.LG TIER_1 · Carl Kingsford · 2026-04-27 21:25

Why Search When You Can Transfer? Amortized Agentic Workflow Design from Structural Priors

Automated agentic workflow design currently relies on per-task iterative search, which is computationally prohibitive and fails to reuse structural knowledge across tasks. We observe that optimized workflows converge to a small family of domain-specific topologies, suggesting tha…

arXiv cs.CL TIER_1 · Joern Wuebker · 2026-04-27 19:11

GAIA-v2-LILT: Multilingual Adaptation of Agent Benchmark beyond Translation

Agent benchmarks remain largely English-centric, while their multilingual versions are often built with machine translation (MT) and limited post-editing. We argue that, for agentic tasks, this minimal workflow can easily break benchmark validity through query-answer misalignment…

arXiv cs.AI TIER_1 · Xi Wang · 2026-04-27 09:22

RefEvo: Agentic Design with Co-Evolutionary Verification for Agile Reference Model Generation

As the complexity of System-on-Chip (SoC) designs grows, the shift-left paradigm necessitates the rapid development of high-fidelity reference models (typically written in SystemC) for early architecture exploration and verification. While Large Language Models (LLMs) show promis…

Hugging Face Daily Papers TIER_1 · 2026-04-27 09:00

Rewarding the Scientific Process: Process-Level Reward Modeling for Agentic Data Analysis

Process Reward Models (PRMs) have achieved remarkable success in augmenting the reasoning capabilities of Large Language Models (LLMs) within static domains such as mathematics. However, their potential in dynamic data analysis tasks remains underexplored. In this work, we first …

arXiv cs.CL TIER_1 · Yang Liu · 2026-04-27 04:25

From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills

LLM agents increasingly rely on reusable skills, capability packages that combine instructions, control flow, constraints, and tool calls. In most current agent systems, however, skills are still represented by text-heavy artifacts, including SKILL.md-style documents and structur…

arXiv cs.AI TIER_1 · Zhengxu Yu, Yu Fu, Zhiyuan He, Yuxuan Huang, Lee Ka Yiu, Meng Fang, Weilin Luo, Jun Wang · 2026-04-27 04:00

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

arXiv:2604.22446v1 Announce Type: new Abstract: Individual agent capabilities have advanced rapidly through modular skills and tool integrations, yet multi-agent systems remain constrained by fixed team structures, tightly coupled coordination logic, and session-bound learning. W…

arXiv cs.AI TIER_1 · Binyan Xu, Dong Fang, Haitao Li, Kehuan Zhang · 2026-04-27 04:00

From Multi-Agent to Single-Agent: When Is Skill Distillation Beneficial?

arXiv:2604.01608v3 Announce Type: replace Abstract: Multi-agent systems (MAS) tackle complex tasks by distributing expertise, though this often comes at the cost of heavy coordination overhead, context fragmentation, and brittle phase ordering. Distilling a MAS into a single-agen…

arXiv cs.AI TIER_1 · Meng Chu, Xuan Billy Zhang, Kevin Qinghong Lin, Lingdong Kong, Jize Zhang, Teng Tu, Weijian Ma, Ziqi Huang, Senqiao Yang, Wei Huang, Yeying Jin, Zhefan Rao, Jinhui Ye, Xinyu Lin, Xichen Zhang, Qisheng Hu, Shuai Yang, Leyang Shen, Wei Chow, Yifei Dong, Fen · 2026-04-27 04:00

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

arXiv:2604.22748v1 Announce Type: new Abstract: As AI systems move from generating text to accomplishing goals through sustained interaction, the ability to model environment dynamics becomes a central bottleneck. Agents that manipulate objects, navigate software, coordinate with…

arXiv cs.AI TIER_1 · Aimin Zhang, Jiajing Guo, Fuwei Jia, Chen Lv, Boyu Wang, Fangzheng Li · 2026-04-27 04:00

EvoAgent: An Evolvable Agent Framework with Skill Learning and Multi-Agent Delegation

arXiv:2604.20133v2 Announce Type: replace Abstract: This paper proposes EvoAgent - an evolvable large language model (LLM) agent framework that integrates structured skill learning with a hierarchical sub-agent delegation mechanism. EvoAgent models skills as multi-file structured…

Hugging Face Daily Papers TIER_1 · 2026-04-27 03:38

TCOD: Exploring Temporal Curriculum in On-Policy Distillation for Multi-turn Autonomous Agents

On-policy distillation (OPD) has shown strong potential for transferring reasoning ability from frontier or domain-specific models to smaller students. While effective on static single-turn tasks, its behavior in multi-turn agent settings remains underexplored. In this work, we i…

arXiv cs.AI TIER_1 · Jiaya Jia · 2026-04-24 17:48

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

As AI systems move from generating text to accomplishing goals through sustained interaction, the ability to model environment dynamics becomes a central bottleneck. Agents that manipulate objects, navigate software, coordinate with others, or design experiments require predictiv…

arXiv cs.AI TIER_1 · Jun Wang · 2026-04-24 11:02

From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company

Individual agent capabilities have advanced rapidly through modular skills and tool integrations, yet multi-agent systems remain constrained by fixed team structures, tightly coupled coordination logic, and session-bound learning. We argue that this reflects a deeper absence: a p…

Hugging Face Daily Papers TIER_1 · 2026-04-22 17:36

Relative Principals, Pluralistic Alignment, and the Structural Value Alignment Problem

The value alignment problem for artificial intelligence (AI) is often framed as a purely technical or normative challenge, sometimes focused on hypothetical future systems. I argue that the problem is better understood as a structural question about governance: not whether an AI …

EleutherAI Blog TIER_1 · 2023-05-03 00:00

Alignment Research @ EleutherAI

A breif overview of EAIs approach to alignment

MIT Technology Review TIER_1 · Will Douglas Heaven · 2026-04-30 15:59

This startup’s new mechanistic interpretability tool lets you debug LLMs

The San Francisco–based startup Goodfire just released a new tool, called Silico, that lets researchers and engineers peer inside an AI model and adjust its parameters—the settings that determine a model’s behavior—during training. This could give model makers more fine-grained c…

arXiv cs.CV TIER_1 · Song Guo · 2026-04-30 14:51

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

Graphical User Interface (GUI) agents have emerged as a promising paradigm for intelligent systems that perceive and interact with graphical interfaces visually. Yet supervised fine-tuning alone cannot handle long-horizon credit assignment, distribution shifts, and safe explorati…

LessWrong (AI tag) TIER_1 · nostalgebraist · 2026-04-29 03:53

llm assistant personas seem increasingly incoherent (some subjective observations)

<p><i><span>(This was originally going to be a "quick take" but then it got a bit long. Just FYI.)</span></i></p><p><span>There's this weird trend I perceive with the personas of LLM assistants over time. It feels like they're getting less</span><i><span> </span></i><span>"cohere…

Last Week in AI TIER_1 (CA) · Last Week in AI · 2026-04-01 08:07

LWiAI Podcast #238 - GPT 5.4 mini, OpenAI Pivot, Mamba 3, Attention Residuals

OpenAI ships GPT-5.4 mini and nano, faster and more capable but up to 4x pricier, DLSS 5 looks like a real-time generative AI filter for video games | The Verge, and more!

Last Week in AI TIER_1 · Last Week in AI · 2026-03-16 06:06

LWiAI Podcast #237 - Nemotron 3 Super, xAI reborn, Anthropic Lawsuit, Research!

Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning, Another XAI Cofounder Has Left, Anthropic Sues Department of Defense

Last Week in AI TIER_1 · Last Week in AI · 2026-02-17 04:43

LWiAI Podcast #234 - Opus 4.6, GPT-5.3-Codex, Seedance 2.0, GLM-5

An action-packed episode!

Last Week in AI TIER_1 · Last Week in AI · 2026-01-28 09:51

LWiAI Podcast #232 - ChatGPT Ads, Thinking Machines Drama, STEM

OpenAI to test ads in ChatGPT as it burns through billions, The Drama at Thinking Machines, STEM: Scaling Transformers with Embedding Modules

Last Week in AI TIER_1 Nederlands(NL) · Last Week in AI · 2026-01-15 07:06

Last Week in AI #332 - Apple + Gemini, OpenAI + Cerebras, Claude Cowork

Google’s Gemini to power Apple’s AI features like Siri, OpenAI signs deal worth $10B for compute from Cerebras, and more!

Smol AINews TIER_1 · 2026-01-12 05:44

Apple picks Google's Gemini to power Siri's next generation

**Apple** has decided to power Siri with **Google's Gemini models** and cloud technology, marking a significant partnership and a setback for **OpenAI**, which was initially partnered with Apple. **Anthropic** launched "Cowork," a product preview for Claude's coding capabilities,…

Last Week in AI TIER_1 · Last Week in AI · 2025-12-25 21:29

LWiAI Podcast #229 - Gemini 3 Flash, ChatGPT Apps, Nemotron 3

Google launches Gemini 3 Flash, ChatGPT launches an app store, Introducing GPT-5.2-Codex

Smol AINews TIER_1 · 2025-10-13 05:44

OpenAI Titan XPU: 10GW of self-designed chips with Broadcom

**OpenAI** is finalizing a custom ASIC chip design to deploy **10GW** of inference compute, complementing existing deals with **NVIDIA** (10GW) and **AMD** (6GW). This marks a significant scale-up from OpenAI's current **2GW** compute, aiming for a roadmap of **250GW** total, whi…

Eugene Yan TIER_1 · 2024-05-26 00:00

Prompting Fundamentals and How to Apply them Effectively

Structured input/output, prefilling, n-shots prompting, chain-of-thought, reducing hallucinations, etc.

The Gradient TIER_1 · Jessica Dai · 2023-10-07 16:00

The Artificiality of Alignment

<p><em>This essay first appeared in <a href="https://joinreboot.org/p/alignment">Reboot</a></em>. </p><p>Credulous, breathless coverage of “AI existential risk” (abbreviated “x-risk”) has reached the mainstream. Who could have foreseen that the smallca…

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-11 11:44

Longfor Group: Total contracted sales amount in the first 4 months was 10.35 billion yuan

36氪获悉，龙湖集团公告，截至2026年4月30日止四个月，集团累计实现总合同销售金额人民币103.5亿元，合同销售面积115.3万平方米，4月单月实现合同销售金额人民币28.9亿元，合同销售面积31.1万平方米；4月单月实现归属公司股东权益的合同销售金额人民币19.3亿元，归属公司股东权益的合同销售面积22.3万平方米。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-11 11:37

European Commission welcomes OpenAI's access to new ChatGPT model

欧盟委员会发言人托马斯·雷尼尔5月11日表示，欧盟委员会欢迎OpenAI拟开放其最新版ChatGPT模型访问权限的意向，本周将与OpenAI展开进一步交流。欧盟委员会发言人指出，与Anthropic公司就其Mythos模型进行的磋商目前处于不同阶段，双方正保持持续接触，欧盟委员会正寻求与Anthropic达成类似目前正与OpenAI探讨的解决方案。（财联社）

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-08 11:50

ST Qingyue: Company's stock may face delisting due to major violations

36氪获悉，ST清越公告，公司因涉嫌定期报告等财务数据虚假记载被证监会立案调查，并于2026年5月8日收到《行政处罚事先告知书》。根据认定情况，公司可能触及重大违法行为，可能被实施重大违法强制退市。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-08 04:36

Cangzhou Mingzhu: The company's diaphragm products are currently in a state of full production and full sales.

36氪获悉，沧州明珠在业绩说明会上表示，目前公司隔膜产品基本处于满产满销状态。12亿平米锂电隔膜项目一期目前正在实施中，处于设备的调试阶段，预计后续进入试生产阶段。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-08 03:53

In the first half of the day, major funds increased their positions in machinery and equipment and auto stocks, and sold off electronics stocks.

主力资金早间净流入机械设备、汽车、交通运输、国防军工等板块，净流出电子、电力设备、有色金属、非银金融等板块。具体到个股来看，大唐发电、胜宏科技、云南锗业获净流入25.99亿元、20.4亿元、15.76亿元。净流出方面，海光信息、兆易创新、宁德时代遭抛售15.63亿元、14.79亿元、13.18亿元。（第一财经）

Gary Marcus TIER_1 · Gary Marcus · 2026-05-07 21:42

Breaking news: “they hadn’t figured out how OpenAI would pay for it”

Sign of things to come?

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-06 03:39

In the first half of the day, main funds increased their holdings in electronics and computer stocks, and sold off bank stocks.

主力资金早间净流入电子、计算机、通信、电力设备、机械设备等板块，净流出银行、食品饮料、交通运输板块。具体到个股来看，兆易创新、佰维存储、澜起科技获净流入37.90亿元、31.92亿元、31.24亿元。净流出方面，胜宏科技、中国船舶、盛邦股份遭抛售13.22亿元、9.14亿元、5.04亿元。（第一财经）

Rest of World TIER_1 · Rina Chandran · 2026-05-05 10:00

The global cybersecurity gap deepens as AI-powered attacks surge

Restricted access to powerful defensive AI tools like Anthropic’s Mythos leaves some companies, central banks, and nations more vulnerable than others.

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-05 05:26

2026 May Day holiday box office exceeds 700 million yuan

据猫眼专业版数据，截至5月5日13时7分，2026年五一档（5月1日—5月5日）总票房破7亿元，《消失的人》《寒战1994》《穿普拉达的女王2》分列档期票房榜前三位。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-04 07:13

2026 May Day Film Box Office Exceeds 600 Million Yuan

36氪获悉，据灯塔专业版，截至5月4日14时57分，2026年五一档（5月1日—5月5日）档期总票房（含预售）突破6亿元，影片《消失的人》《寒战1994》《穿普拉达的女王2》《10间敢死队》《门牙》暂列档期票房前五名。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-03 01:30

2026 May Day Film Box Office Exceeds 400 Million

据猫眼专业版数据，2026年五一档新片总票房破4亿，《消失的人》《寒战1994》《穿普拉达的女王2》暂列五一档新片票房榜前三位。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-03 01:15

GAC International: Export volume increased by 133.9% year-on-year from January to April

据广汽国际消息，延续一季度海外销售同比增长86%的强劲势能，广汽国际化征程再提速。今年1—4月，广汽累计出口量70474台，同比增幅133.9%。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-02 07:09

2026 May Day Film Box Office Exceeds 300 Million Yuan

36氪获悉，据灯塔专业版，截至5月2日15时02分，2026年五一档新片总票房（含点映及预售）突破3亿元，《寒战1994》《消失的人》《穿普拉达的女王2》《10间敢死队》《给阿嬷的情书》暂列票房前五名。

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-02 06:59

International Maritime Organization Decides to Hold Additional Meeting to Discuss Net-Zero Emissions Framework

国际海事组织海上环境保护委员会第84届会议5月1日决定，拟在委员会第85届会议召开前举行两次船舶温室气体减排工作组会议，进一步研究对国际海事组织净零排放框架的相关关切。委员会决定，第85届会议暂定于2026年11月30日至12月3日举行。两次工作组会议暂定分别在2026年9月初和11月下旬举行，此外还将举办为期1天的温室气体专家研讨会。（新华社）

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-02 04:48

Berkshire Hathaway Shareholder Meeting Approaches, New CEO to Lead Q&A Session in Place of Buffett

伯克希尔·哈撒韦年度股东大会将于当地时间5月2日在美国奥马哈如期召开。与往年不同的是，尽管沃伦·巴菲特仍将以公司董事长身份出席大会，但备受关注的问答环节将由新任首席执行官（CEO）格雷格·阿贝尔承担核心发言职责。具体来看，首场问答环节将由格雷格·阿贝尔与公司副董事长、保险业务负责人阿吉特·贾恩共同回答问题。在第二场问答中，格雷格·阿贝尔还将与其他伯克希尔旗下公司负责人一同参与交流。（财联社）

36氪 (36Kr) TIER_1 中文(ZH) · 2026-05-01 06:12

2026 May Day box office exceeds 100 million

据网络平台数据，2026年五一档（5月1日-5月5日）档期总票房（含预售）突破1亿，影片《寒战1994》《消失的人》《门牙》暂列档期票房前三。（华尔街见闻）

AI Supremacy (Michael Spencer) TIER_1 · Michael Spencer · 2026-04-30 10:02

The Landowners of AI

BigTech Earnings and a look back at Nvidia GTC. BigTech incumbents are fragmenting into winners and losers. We have enough data to project the AI monopoly.

36氪 (36Kr) TIER_1 中文(ZH) · 2026-04-30 00:34

OpenAI plans to significantly expand cheaper ChatGPT service

OpenAI计划大幅拓展更廉价的ChatGPT服务，该公司效仿网飞（Netflix）模式，预计高级服务订阅用户减少的损失将通过广告收入弥补。（财联社）

36氪 (36Kr) TIER_1 中文(ZH) · 2026-04-28 23:59

OpenAI expects users to shift significantly to cheaper ChatGPT plans

当地时间4月28日，据报道，OpenAI预计，其更便宜、含广告的套餐不仅将吸引新用户，也会促使数千万付费订阅用户降级。OpenAI预计，今年其消费者订阅用户将增长一倍以上，达到1.22亿，并在2030年增至3.06亿。其中，旗舰月费套餐ChatGPT Plus的订阅用户数量预计今年将下降80%，至约900万；最昂贵的Pro套餐用户数量预计将翻倍，但占比仍不到总用户的1%。（界面）

36氪 (36Kr) TIER_1 中文(ZH) · 2026-04-28 01:04

Market News: OpenAI Misses Key Revenue and User Targets in Critical IPO Sprint Phase

市场消息：OpenAI在冲刺IPO的关键阶段未达核心营收及用户目标。（新浪财经）

Databricks Blog TIER_1 · 2026-04-27 10:55

Inside one of the first production deployments of Lakebase: LangGuard's agentic workflow governance engine

The invisible problem with agentic AIMost enterprises are experimenting with autonomous AI agents...

IEEE Spectrum — AI TIER_1 · Julianne Pepitone · 2026-04-14 18:00

Sarang Gupta Builds AI Systems With Real-World Impact

<img src="https://spectrum.ieee.org/media-library/a-young-adult-indian-man-smiling-with-his-arms-crossed.png?id=65519413&width=1200&height=800&coordinates=0%2C83%2C0%2C84" /><br /><br /><p>Like many engineers, <a href="https://www.linkedin.com/in/sarang-gupta/" rel="n…

ChinaTalk TIER_1 · Jordan Schneider · 2026-04-09 09:46

China’s AI Companies Are Going Closed Source

We explain why

Latent Space Podcast TIER_1 · Latent.Space · 2024-09-20 18:59

The Ultimate Guide to Prompting

<p><em>Noah Hein from Latent Space University is finally launching with </em><a href="https://maven.com/p/933f3d" target="_blank"><em>a free lightning course this Sunday</em></a><em> for those new to AI Engineering. Tell a friend!</em></p><p>Did you know there are >1,600 papers o…

Engadget TIER_1 · [email protected] (Lawrence Bonk) · 2026-05-14 17:20

The ChatGPT desktop app for Mac just got hit with a security breach

OpenAI found no evidence that user data was accessed.

The Decoder TIER_1 · Matthias Bastian · 2026-05-14 12:07

ChatGPT's web traffic share dropped from 78% to 54% in one year as Gemini quietly tripled its reach

<p><img alt="" class="attachment-full size-full wp-post-image" height="608" src="https://the-decoder.com/wp-content/uploads/2026/05/openai_vs_gemini.png" style="height: auto; margin-bottom: 10px;" width="1088" /></p> <p> ChatGPT's website traffic share dropped from 77.6% to 53.7%…

Engadget TIER_1 · [email protected] (Karissa Bell) · 2026-05-13 18:34

OpenAI endorses the Kids Online Safety Act

The bill appears to be gaining new momentum.

SCMP — Tech TIER_1 · Xinmei Shen,Sylvia Ma · 2026-05-07 01:59

ByteDance’s AI subscription gamble: chatbot faces reality check in China

Chloe Wang, a 26-year-old fund employee in Shenzhen, said she “definitely wouldn’t” pay for a subscription to Doubao, Chinese tech giant ByteDance’s artificial intelligence chatbot, at its proposed price. “I’m willing to pay for AI tools, but I don’t think it’s worth that much – …

The Decoder TIER_1 · Matthias Bastian · 2026-05-05 17:14

OpenAI's first hardware play might be a phone that replaces your app grid with an agent task stream

<p><img alt="" class="attachment-full size-full wp-post-image" height="768" src="https://the-decoder.com/wp-content/uploads/2026/04/openai_dark_pattern.png" style="height: auto; margin-bottom: 10px;" width="1376" /></p> <p> OpenAI is reportedly planning its own AI smartphone, wit…

Forbes — Innovation TIER_1 · Alison Durkee, Forbes Staff · 2026-05-14 19:52

OpenAI’s Bad Month: Apple Partnership Reportedly In Peril, Sora Shutters And Musk Attacks In Court

OpenAI could soon take legal action against Apple after the company’s promise to integrate ChatGPT into its software hasn’t fully panned out, Bloomberg reports.

Forbes — Innovation TIER_1 · Richard Nieva, Forbes Staff · 2026-05-14 10:30

Inside Scale AI’s Business After Meta’s Bombshell $14 Billion Deal

How Scale AI Is Adapting Post Meta Deal And Founder’s Departure

Forbes — Innovation TIER_1 · Michael Ashley, Contributor · 2026-05-13 12:00

The Hidden Legal Trap In Every AI Prompt Executives Type

Executives are feeding confidential business strategy into AI every day. New court rulings suggest those prompts could become discoverable in litigation.

Hacker News — AI stories ≥50 points TIER_1 · cylo · 2026-05-10 17:19

Local AI needs to be the norm

Hacker News — AI stories ≥50 points TIER_1 · oomuinio · 2026-04-28 12:15

OpenAI misses revenue, is the AI bubble bursting?

Hacker News — AI stories ≥50 points TIER_1 · ksherlock · 2026-04-28 01:34

OpenAI Misses Key Revenue, User Targets in High-Stakes Sprint Toward IPO

HN — claude-code stories TIER_1 · cg505 · 2026-03-31 23:41

GitHub has DMCA'd nearly all forks of the official Claude-code repo

Fortune TIER_1 · Alexandra Smyth · 2026-05-16 10:30

Would you hire the lawyer who just got sanctioned for using AI?

Fabricated citations. Invented quotations. Waived privileges. Lawyers are reaching for the wrong tool, and clients are paying the price.

dev.to — Claude Code tag TIER_1 · Malik Chohra · 2026-05-13 08:01

How I wire Claude into my React Native workflow (skills, projects, Cowork)

<p>Claude isn't a chat app anymore. It's a runtime. The interface is still text, but the architecture underneath is execution: load context, pick tools, call APIs, write files, schedule work. Most people are still typing at it like ChatGPT in 2023 and wondering why their workflow…

HN — claude cli stories TIER_1 · Jsttan · 2026-05-10 05:31

Best AI coding plan alternative to Claude and ChatGPT

The Verge — AI TIER_1 · Stevie Bonifield · 2026-04-29 14:43

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO

ChatGPT is struggling to keep up its once-explosive growth as users uninstall the app or opt for rival chatbots instead. According to data from market intelligence firm Sensor Tower, ChatGPT experienced a 132 percent increase in uninstalls year over year in April. Its uninstall r…

HN — claude cli stories TIER_1 · cedarscarlett · 2026-03-05 01:13

Ask HN: Has anyone noticed the fear-driven prompt suggestions that GPT5.3 makes?

HN — claude cli stories TIER_1 · 101008 · 2026-02-16 13:44

Ask HN: What happens after the AI bubble bursts?

Medium — Claude tag TIER_1 · Story of Hina · 2026-05-16 15:12

I Used Claude and ChatGPT Every Day for 30 Days. Here’s What Nobody Told Me.

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/@storyofhina/i-used-claude-and-chatgpt-every-day-for-30-days-heres-what-nobody-told-me-2c330933ce70?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/735/1*ztxK5SbD6G7bYGp…

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-16 07:32

【Held in Izumo City on June 3rd】Free "Claude Code AI Agent Course for Practical Business Use" for San'in Business Owners / Limited to 20 Participants by Application | Nihonkai Shimbun NetNihonkai https://www.yayafa.com/2801767/ # AgenticAi # AI # Anthropic # AnthropicC

【6/3出雲市開催】山陰の経営者向け「業務で本当に使えるClaude Code AIエージェント講座」を無料開催／審査制20名限定 | 日本海新聞 NetNihonkai https://www. yayafa.com/2801767/ # AgenticAi # AI # Anthropic # AnthropicClaude # ArtificialGeneralIntelligence # ArtificialIntelligence # claude # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2801767

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-16 07:29

Google Analytics makes AI assistants an independent channel | Visualizing traffic from ChatGPT, Claude, and Gemini https://www.yayafa.com/2801765/ # AgenticAi # AI # ai (Artificial Intelligence) News # ArtificialGeneralIntelligence

Google Analytics、AIアシスタントを独立チャネル化｜ChatGPT・Claude・Gemini流入を可視化 https://www. yayafa.com/2801765/ # AgenticAi # AI # ai （人工知能）ニュース # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepMind # Gemini # Google # GoogleAI # GoogleDeepMind # GoogleGemini # HeadlineNews # エージェント…

LINKS yayafa.com/2801765

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-16 04:40

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could und

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could undermine your enterprise strategy...and what it means for you. https:// theboard.world/articles/techno logy/prompt-drift-c…

LINKS theboard.world/…/prompt-drift-claude-gemi…

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-16 03:17

Las Vegas video studio Whisenhunt Media transforms into AI-first media house, combining Emmy-winning production with LLM optimization expertise. Repositioning a

Las Vegas video studio Whisenhunt Media transforms into AI-first media house, combining Emmy-winning production with LLM optimization expertise. Repositioning ahead of industry disruption. # AI # MediaProduction

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-16 03:17

Catalyst Crew Technologies appoints Carlos Pena as CFO to strengthen financial leadership for its AI-driven digital health expansion across emerging markets. #

Catalyst Crew Technologies appoints Carlos Pena as CFO to strengthen financial leadership for its AI-driven digital health expansion across emerging markets. # HealthTech # AI

Mastodon — sigmoid.social TIER_1 Italiano(IT) · [email protected] · 2026-05-16 03:15

🗽 I just got back from a vacation in New York. 🤖 And I brought something unexpected home: not souvenirs, but a very concrete reflection on where

🗽 Sono appena tornato da una vacanza a New York. 🤖 E mi sono portato a casa qualcosa di inaspettato: non souvenir, ma una riflessione molto concreta su dove siamo davvero arrivati con l’AI. 🌃 A Times Square, accanto a Coca-Cola e Samsung, c’era la pubblicità di una piattaforma AI…

LINKS iam.fabiograsso.net/…/okta-ai-newyork

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-16 03:15

Ketryx launches Model Context Protocol beta, bringing compliance intelligence to ChatGPT, Claude & Copilot. AI-powered compliance data access accelerates regula

Ketryx launches Model Context Protocol beta, bringing compliance intelligence to ChatGPT, Claude & Copilot. AI-powered compliance data access accelerates regulated product development. # Compliance # AI

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-15 19:01

🤖 OpenAI Explores Legal Action Over Disappointing Apple ChatGPT Integration OpenAI is exploring legal options as Apple's ChatGPT integration reportedly falls sh

🤖 OpenAI Explores Legal Action Over Disappointing Apple ChatGPT Integration OpenAI is exploring legal options as Apple's ChatGPT integration reportedly falls short of expectations, potentially harming the brand. https:// byte-pulse.net/article/openai- explores-legal-action-over-d…

LINKS byte-pulse.net/…/openai-explores-legal-ac…

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-15 01:52

OpenAI is reportedly considering legal action against Apple after the partnership failed to deliver the deep iOS integration the company expected. According to

OpenAI is reportedly considering legal action against Apple after the partnership failed to deliver the deep iOS integration the company expected. According to Bloomberg, OpenAI anticipated ChatGPT would be prominently featured in iPhone usage, but Apple's AI features have strugg…

LINKS gizmodo.com/openai-falls-behind-and-looks…

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-14 21:24

OpenAI is weighing legal action against Apple over the ChatGPT-Siri deal after user adoption fell short of expectations. Meanwhile, iOS 27 will open Siri to Cla

OpenAI is weighing legal action against Apple over the ChatGPT-Siri deal after user adoption fell short of expectations. Meanwhile, iOS 27 will open Siri to Claude, Gemini and others. The dispute highlights a core tension: platform gatekeepers control distribution, leaving AI mak…

LINKS implicator.ai/openai-apple-legal-notice-c…

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-14 20:35

Apparently, Apple has some teams on $300/day Claude plans. What can you spend that many tokens on? # ai # claudecode

TechCrunch AI TIER_1 · Connie Loizos · 2026-05-14 19:12

OpenAI is reportedly preparing legal action against Apple; it wouldn’t be the first partner to feel burned

According to Bloomberg, OpenAI has enlisted an outside law firm to work through its options.

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-14 05:03

What a time to be alive # Programming # Coding # Programmer # Programmers # ProgrammerLife # Dev # Debug # Debugging # CodingStruggles # SoftwareDeveloper # Sof

What a time to be alive # Programming # Coding # Programmer # Programmers # ProgrammerLife # Dev # Debug # Debugging # CodingStruggles # SoftwareDeveloper # SoftwareDevelopment # OpenAI # ChatGPT # AI # Copilot . # Meme # Memes # Humour # Humor # Technology

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-13 22:09

https://www.tkhunt.com/2313005/ [2025 Latest] Complete Explanation of AI Agents! How Autonomous AI Will Change Work | Thorough Explanation of Usage, Use Cases, and Risks # 2025AI # AgenticAi # AI # AIAgent # AIUtilization # anthropic # ArtificialIntel

https://www. tkhunt.com/2313005/ 【2025年最新】AIエージェント完全解説！自律型AIが仕事を変える｜使い方・活用事例・リスクまで徹底解説 # 2025AI # AgenticAi # AI # AIエージェント # AI活用 # anthropic # ArtificialIntelligence # ChatGPT # Claude # openai # エージェント型AI # テクノロジー # バイブコーディング # マルチエージェント # 人工知能 # 業務効率化 # 生成AI

LINKS tkhunt.com/2313005

TechCrunch AI TIER_1 · Lucas Ropek · 2026-05-13 19:28

Anthropic’s Cat Wu says that, in the future, AI will anticipate your needs before you know what they are

The head of product for Claude Code and Cowork says that the next big step for AI is proactivity.

dev.to — MCP tag TIER_1 · Alex Boissonneault · 2026-05-13 15:18

Your AI assistant can't read your pipeline — here's why that's a problem

<p><strong>You use AI every day for writing, summarising, and brainstorming.</strong> But ask it what's really happening in your pipeline right now — and it stares back at you blankly. That's not a prompt problem. It's a structural one.</p> <h3> The honest reality of AI and busin…

Towards AI TIER_1 · Deepanshu Gupta · 2026-05-12 21:01

The Orchestration Tax: Why Multi-Agent Systems Get Expensive

<h4>How context propagation, supervisor loops, tool calls, memory, and observability quietly drive up the cost of production agentic systems.</h4><p>Multi-agent AI systems are quickly becoming a default pattern for building advanced LLM applications. Instead of relying on one mod…

Medium — Claude tag TIER_1 · Daniel Olatunji · 2026-05-12 20:57

Most People Are Using Claude Like Google.

<div class="medium-feed-item"><p class="medium-feed-snippet">A deep, honest guide to what Claude actually is, how it actually thinks, and the real ways professionals are using it to do serious work.</p><p class="medium-feed-link"><a href="https://medium.com/@oluwafikayore/most-pe…

Mastodon — sigmoid.social TIER_1 Türkçe(TR) · [email protected] · 2026-05-12 08:02

1YZ | Artificial Intelligence Forum https:// 1yz.com.tr/d/38-1yz-yapay-zeka -forumu-acildi-turkiyenin-yeni-ai-toplulugu-ve-rehberleri-2026 # AI # CHATGPT # artificialintelligence # cloud

1YZ | YAPAY ZEKA FORUM https:// 1yz.com.tr/d/38-1yz-yapay-zeka -forumu-acildi-turkiyenin-yeni-ai-toplulugu-ve-rehberleri-2026 # Aİ # CHATGPT # yapayzeka # cloude # cursor # gemini

Medium — Claude tag TIER_1 · Prardhana Kennedy · 2026-05-11 12:46

Most People Use Claude Like a Search Engine. Here’s Why That’s a Mistake.

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/@prardhanakennedy/most-people-use-claude-like-a-search-engine-heres-why-that-s-a-mistake-ac20be0eced5?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1650/1*UpZwkf6Dc8VR…

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-11 07:52

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could und

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could undermine your enterprise strategy...and what it means for you. https:// theboard.world/articles/techno logy/prompt-drift-c…

LINKS theboard.world/…/prompt-drift-claude-gemi…

Medium — Claude tag TIER_1 · Priyansh · 2026-05-09 20:48

20 Repos That Make CLAUDE CODE Actually Worth the Subscription

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/@priyansh0327/20-repos-that-make-claude-code-actually-worth-the-subscription-048aa7cabcd5?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1734/1*WqW42-pPI_tT0UovPhAjOA.p…

Mastodon — sigmoid.social TIER_1 Polski(PL) · [email protected] · 2026-05-09 11:30

Thanks to a strategic alliance with SpaceX, Anthropic is radically increasing the computing power of the Claude model, doubling the five-hour usage limits for paid users.

Dzięki strategicznemu sojuszowi ze SpaceX Anthropic radykalnie zwiększa moc obliczeniową modelu Claude, podwajając pięciogodzinne limity użytkowania dla płatnych subskrypcji. Współpraca ma kluczowe znaczenie dla rozwoju AI w przemyśle. # si # ai # sztucznainteligencja # wiadomośc…

LINKS aisight.pl/…/anthropic-zwieksza-limity-cl… aisight.pl/…/generatory-obrazow-ai-stereo…

Mastodon — sigmoid.social TIER_1 Polski(PL) · [email protected] · 2026-05-09 11:29

Latest study reveals that popular chatbots like ChatGPT, Claude, Grok, and Perplexity send user data to ad brokers, including Meta and G

Najnowsze badanie ujawnia, że popularne chatboty, takie jak ChatGPT, Claude, Grok i Perplexity, przesyłają dane użytkowników brokerom reklamowym, m.in. Meta i Google, budząc poważne obawy o prywatność. # si # ai # sztucznainteligencja # wiadomości # informacje # technologia https…

LINKS aisight.pl/…/chatboty-na-podsluchu-u-giga… aisight.pl/…/Awarie-i-cyberataki-tydzien-…

Medium — Claude tag TIER_1 · Didier PH Martin · 2026-05-07 22:10

Stop your Ai to Hallucinate!

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/@interblockchain/stop-you-ai-to-hallucinate-1e20c64b9c02?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/2600/1*obo8VPVJ8wKSSu-D2g3_dg.png" width="3584" /></a></p><p cla…

dev.to — MCP tag TIER_1 · Shir Meir Lador · 2026-05-07 21:01

Architect A Personalized Multi-Agent System with Long-Term Memory

<p>In support of our mission to accelerate the developer journey on Google Cloud, we built <strong>Dev Signal</strong> — a multi-agent system designed to transform raw community signals into reliable technical guidance by automating the path from discovery to expert creation.</p>…

Towards AI TIER_1 · Anubhav · 2026-05-07 11:31

Multi-Agent Systems: When 2 Agents Beat 1 (and When They Don’t)

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://pub.towardsai.net/multi-agent-systems-when-2-agents-beat-1-and-when-they-dont-f4e352541695?source=rss----98111c9905da---4"><img src="https://cdn-images-1.medium.com/max/2600/1*XHvN4bndLs56ffkXLJtw3w.png" …

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-05-07 02:34

ByteDance's AI chatbot Doubao is facing a reality check in China, with users pushing back against its proposed subscription price. A 26-year-old fund employee i

ByteDance's AI chatbot Doubao is facing a reality check in China, with users pushing back against its proposed subscription price. A 26-year-old fund employee in Shenzhen said she definitely would not pay for the service, calling it not worth it even though she finds Doubao relat…

LINKS scmp.com/…/bytedances-ai-subscription-gam…

dev.to — MCP tag TIER_1 · Jangwook Kim · 2026-05-07 00:15

LangGraph + MCP: Build a Supervisor Multi-Agent System

<h2> Why This Pattern Matters </h2> <p>Most LangGraph tutorials stop at single agents. A single agent that does research, writes code, and formats a report is juggling three jobs — and as the task list grows, the prompt grows with it. The supervisor pattern solves this: one orche…

Medium — Claude tag TIER_1 · Ritikkungwani · 2026-05-06 17:43

How to Transfer Gemini Conversations to Claude (Step-by-Step Guide)

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/@ritikkungwani8888/how-to-transfer-gemini-conversations-to-claude-step-by-step-guide-50541ff044dd?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1536/1*M1LCg8yiccvl8jQd…

Medium — Claude tag TIER_1 · Halil ibrahim Tutuncu · 2026-05-06 17:36

I Spent 80 Hours Writing 14 Claude Skills. Here’s What That Time Actually Bought Me.

<div class="medium-feed-item"><p class="medium-feed-image"><a href="https://medium.com/write-a-catalyst/i-spent-80-hours-writing-14-claude-skills-heres-what-that-time-actually-bought-me-baa9549d52c7?source=rss------claude-5"><img src="https://cdn-images-1.medium.com/max/1408/1*Om…

Axios Technology TIER_1 · Mike Allen · 2026-05-06 10:04

Axios interview: Scale AI CEO Jason Droege pushes "reliablity" reality

<p>Scale AI CEO <a href="https://www.axios.com/2025/09/17/jason-droege-scale-ai" target="_blank">Jason Droege</a> tells Axios that AI is often too unreliable for mission-critical use by business, military and government.</p><ul><li>"The cost of mistakes in these environments can …

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-02 22:54

ChatGPT Strengthens Account Security with Passkeys and Other Measures - Impress Watch

ChatGPTに「高度なアカウントセキュリティ」パスキーなどで乗っ取り対策強化 – Impress Watch https://www. yayafa.com/2792172/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # ChatGPT # OpenAI # エージェント型AI # テック # 人工知能 # 汎用人工知能

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-02 22:50

What does the US-India struggle for access to advanced AI models, which are beginning to have a presence comparable to nuclear weapons, mean? [Generative AI Incident Report] Will Japan be allowed to use the strongest AI... US intentions to encircle with cyber, bio, and computational resources (1/5) | JBpress https://www.yayafa.co

核兵器並みの存在感を帯び始めた先端AIモデル、Mythosへのアクセス権を巡る米印の攻防は何を意味しているのか？【生成AI事件簿】日本は最強AIを使わせてもらえるのか…サイバー、バイオ、計算資源で囲い込みを図る米国の思惑(1/5) | JBpress (ジェイビープレス) https://www. yayafa.com/2792170/ # AgenticAi # AI # Anthropic # ArtificialGeneralIntelligence # ArtificialIntelligence # isMedia # jbpress # エ…

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-02 22:47

For those who want to start Claude from Golden Week | Summary of Spring 2026 Prices and New Features | TECH NOISY https://www.yayafa.com/2792168/ # AgenticAi # AI # Anthropic # AnthropicClaude # ArtificialGeneralIntelligence

ClaudeをGWから始めたい方へ｜2026年春の料金・新機能まとめ | TECH NOISY https://www. yayafa.com/2792168/ # AgenticAi # AI # Anthropic # AnthropicClaude # ArtificialGeneralIntelligence # ArtificialIntelligence # claude # エージェント型AI # 人工知能 # 汎用人工知能

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-02 22:45

Five New Features! April 2026 Version of "Gemini Drop" for Japan Released – Jetstream https://www.yayafa.com/2792166/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepM

新機能5つ！日本向け2026年4月版「Gemini Drop」公開 – Jetstream https://www. yayafa.com/2792166/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepMind # Gemini # Google # GoogleAI # GoogleDeepMind # GoogleGemini # エージェント型AI # 人工知能 # 汎用人工知能

Mastodon — sigmoid.social TIER_1 日本語(JA) · [email protected] · 2026-05-02 22:42

Google acknowledges the possibility of introducing ads to the Gemini app. A reversal from its January "no plans" statement | HelenTech https://www.yayafa.com/2792165/ # AgenticAi # AI # android # ArtificialGeneralIntelligence # Artifi

Google、Gemini アプリへの広告導入の可能性を認める。1月の「計画なし」発言から一転 | HelenTech https://www. yayafa.com/2792165/ # AgenticAi # AI # android # ArtificialGeneralIntelligence # ArtificialIntelligence # Chrome # Chromebase # Chromebook # Chromebox # ChromeOS # DeepMind # Gemini # Google # GoogleAI # Goo…

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-04-30 20:30

"And that’s been blatantly obvious for years. " "Of the 114GW of data centers supposedly being built by the end of 2028, only 15.2GW is under construction in an

"And that’s been blatantly obvious for years. " "Of the 114GW of data centers supposedly being built by the end of 2028, only 15.2GW is under construction in any way, shape, or form. And “under construction” can mean as little as “there’s a hole in the ground.” " https://www. whe…

TechCrunch AI TIER_1 · Lucas Ropek · 2026-04-30 18:20

OpenAI announces new advanced security for ChatGPT accounts, including a partnership with Yubico

OpenAI is launching additional opt-in protections for ChatGPT accounts. The new security initiative includes a new partnership with security key provider Yubico.

Email — Mindstream TIER_1 · bounces+35008234-749c-ns3evnpcff6928077d7u=kill-the-newsletter.com@em5320.mindstream.news (bounces+35008234-749c-ns3evnpcff6928077d7u=kill-the-newsletter.com@em5320.mindstream.news) · 2026-04-29 15:05

OpenAI's secret project - an iPhone killer

Is OpenAI making a smartphone?<!--[if mso]><style type="text/css"> h1, h2, h3, h4, h5, h6 {fon…

The Guardian — AI TIER_1 · Jamie Bartlett · 2026-04-29 09:00

Meet the AI jailbreakers: ‘I see the worst things humanity has produced’

<p>To test the safety and security of AI, hackers have to trick large language models into breaking their own rules. It requires ingenuity and manipulation – and can come at a deep emotional cost</p><p>A few months ago, Valen Tagliabue sat in his hotel room watching his chatbot, …

Mastodon — sigmoid.social TIER_1 Svenska(SV) · [email protected] · 2026-04-28 18:58

OpenAI reportedly exploring developing its own chip for future AI-powered mobile products including a phone. # qualcomm # ai # news # mediatek

OpenAI uppges undersöka möjligheten att utveckla ett eget chipp för framtida AI‑drivna mobilprodukter inklusive en telefon. # qualcomm # ai # nyheter # mediatek # ai-telefon # openai OpenAI kan utveckla hårdvara för egen AI‑telefon

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-04-28 11:57

Bloomberg: #Shares in OpenAI’s key partners #SoftBank and #Oracle are falling after the Wall Street Journal reported that the #AI startup recently failed to mee

Bloomberg: #Shares in OpenAI’s key partners #SoftBank and #Oracle are falling after the Wall Street Journal reported that the #AI startup recently failed to meet its goals for new users and sales. #OpenAI

Axios Technology TIER_1 · Kathryn Varn · 2026-04-28 00:11

Florida AG probes ChatGPT's role in USF student killings

<p>Florida's top cop said Monday his office will investigate the alleged role of <a href="https://www.axios.com/2025/09/02/chatgpt-openai-mental-health-teens" target="_blank">ChatGPT</a> in the slayings of two University of South Florida students.</p><p><strong>The big picture: <…

TechCrunch AI TIER_1 · Ivan Mehta · 2026-04-27 13:08

OpenAI could be making a phone with AI agents replacing apps

The phone could go in mass production in 2028, an analyst says.

Mastodon — sigmoid.social TIER_1 · [email protected] · 2026-04-26 12:58

Claude's Mythos Preview ships to 50+ enterprise partners with $100M credits while facing ongoing outages. Gemini secures Apple's Siri integration and launches e

Claude's Mythos Preview ships to 50+ enterprise partners with $100M credits while facing ongoing outages. Gemini secures Apple's Siri integration and launches enterprise agent platform. OpenAI releases GPT-5.5 but appears to trail on coding benchmarks. Competition intensifies acr…

The Guardian — AI TIER_1 · Dan Milmo, Kalyeena Makortoff and Aisha Down · 2026-04-22 15:03

What is Mythos AI and why could it be a threat to global cybersecurity?

<p>Anthropic’s decision to restrict access to its powerful new model increases fears about the advanced technology</p><p>Anthropic has ruled out releasing its latest AI model, Claude Mythos, to the public because of the threat it poses to global cybersecurity.</p><p>However, the …

HN — AI startup stories TIER_1 · speckx · 2025-10-03 16:37

OpenAI Is Just Another Boring, Desperate AI Startup

Mastodon — fosstodon.org TIER_1 Polski(PL) · [email protected] · 2026-05-16 14:34

Frustration and a sense of injustice are growing in Silicon Valley. While a few engineers from OpenAI or Nvidia are making millions, the majority of hard-working

W Dolinie Krzemowej narasta frustracja i poczucie niesprawiedliwości. Podczas gdy nieliczni inżynierowie z OpenAI czy Nvidii zdobywają miliony, większość pracowników branży technologicznej czuje się pominięta, a ich dotychczasowe ścieżki kariery tracą sens. # si # ai # sztucznain…

LINKS aisight.pl/…/zloty-bilet-wieczna-frustrac…

Mastodon — fosstodon.org TIER_1 Polski(PL) · [email protected] · 2026-05-16 14:31

US space agency ends era of outdated integrated circuits. New SoC processor, developed in cooperation with Microchip Technology, offers performance

Amerykańska agencja kosmiczna kończy erę przestarzałych układów scalonych. Nowy procesor typu SoC, opracowany we współpracy z Microchip Technology, oferuje wydajność pięćsetkrotnie wyższą od obecnych standardów i przetrwa tam, gdzie ziemska elektronika zamienia się w pył. # si # …

LINKS aisight.pl/…/nasa-procesor-autonomia-sond…

Mastodon — fosstodon.org TIER_1 Polski(PL) · [email protected] · 2026-05-16 14:29

Anthropic CFO reveals that almost all code generated within the company is the work of artificial intelligence. While giants like Google and Microso

Dyrektor finansowy Anthropic ujawnia, że niemal cały kod powstający wewnątrz firmy jest dziełem sztucznej inteligencji. Choć giganci tacy jak Google czy Microsoft również zwiększają stopień automatyzacji, deklaracje twórców Claude’a stawiają pytania o przyszłość zawodu programist…

LINKS aisight.pl/…/koniec-ery-recznego-pisania-… aisight.pl/…/koniec-abonamentowej-sielank…

Mastodon — fosstodon.org TIER_1 Polski(PL) · [email protected] · 2026-05-16 14:29

Long-term social simulations show that autonomous AI agents left unsupervised tend to criminal behavior, violence, and rebellion

Długofalowe symulacje społeczne wykazują, że autonomiczni agenci AI pozostawieni bez nadzoru mają tendencję do zachowań przestępczych, aktów przemocy i buntu przeciwko cyfrowej strukturze władzy. # si # ai # sztucznainteligencja # wiadomości # informacje # technologia https:// ai…

LINKS aisight.pl/…/ciemna-strona-autonomii-emer… aisight.pl/…/ciemna-strona-ai-chatgpt-man…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-16 06:17

OpenAI Considering Legal Action Against Apple Over 'Strained' Siri Partnership OpenAI is preparing to potentially take legal action against Apple due to a "stra

OpenAI Considering Legal Action Against Apple Over 'Strained' Siri Partnership OpenAI is preparing to potentially take legal action against Apple due to a "strained" relationship with the iPhone maker, according to Bloomberg's Mark Gurman. The two companies reached a partnership …

LINKS macrumors.com/…/openai-considering-legal-…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-15 18:14

📰 OpenAI feels “burned” by Apple’s crappy ChatGPT integration, insiders say Judge orders Apple to give Musk internal messages discussing secretive ChatGPT deal.

📰 OpenAI feels “burned” by Apple’s crappy ChatGPT integration, insiders say Judge orders Apple to give Musk internal messages discussing secretive ChatGPT deal. 📰 Source: Ars Technica 🔗 Link: https://arstechnica.com/tech-policy/2026/05/openai-feels-burned-by-apples-crappy-chatgpt…

LINKS arstechnica.com/…/openai-feels-burned-by-…

dev.to — LLM tag TIER_1 · Yang Goufang · 2026-05-15 02:28

AI Weekly — 2026-05-08 to 2026-05-15 | OpenAI the Consultant, Anthropic the Platform — Model Companies Pivot Collectively

<blockquote> <p>One-sentence takeaway this week: OpenAI is becoming a consulting firm, Anthropic is becoming a platform company — both have simultaneously abandoned the "model-as-product" narrative.</p> </blockquote> <h2> Model Companies Pivot Collectively: From API Sales to Inst…

dev.to — LLM tag TIER_1 中文(ZH) · Yang Goufang · 2026-05-15 02:10

AI Weekly — 2026-05-08 to 2026-05-15 | OpenAI as consultant, Anthropic as ecosystem, model companies collectively pivot

<blockquote> <p>本週一句話：OpenAI 正在變成一家顧問公司，Anthropic 正在變成一家平台公司——兩者都不約同時放棄了「模型即產品」的故事。</p> </blockquote> <h2> 模型公司集體轉向：從 API 銷售到制度性資源 </h2> <p>140 億美元——這是 OpenAI 對其新建「Deployment Company」的估值，聲稱來自同一週的外部融資談判<a href="https://news.google.com/rss/articles/CBMickFVX3lxTE95QnZPQkxzLW9WdHJLb…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-14 22:06

Ah yes, because nothing screams "cutting-edge legal solutions" quite like a jumbled pile of # GitHub buzzwords and an # AI named # Claude 🤖. We all know lawyers

Ah yes, because nothing screams "cutting-edge legal solutions" quite like a jumbled pile of # GitHub buzzwords and an # AI named # Claude 🤖. We all know lawyers love nothing more than diving into a "suite of plugins" - thrilling! 🎉 Meanwhile, AI is apparently fixing your typos wh…

LINKS github.com/…/claude-for-legal

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-14 20:38

Ooopsies! The # ChatGPT desktop app for # Mac just got hit with a # SecurityBreach By Lawrence Bonk, May 14, 2026 "OpenAI's ChatGPT app for Mac just experienced

Ooopsies! The # ChatGPT desktop app for # Mac just got hit with a # SecurityBreach By Lawrence Bonk, May 14, 2026 "OpenAI's ChatGPT app for Mac just experienced a security breach involving two employee devices, according to a report by 9to5Mac. The company is issuing a software u…

LINKS engadget.com/…/the-chatgpt-desktop-app-fo…

Mastodon — fosstodon.org TIER_1 한국어(KO) · [email protected] · 2026-05-14 17:51

Reports have emerged that Sarbjeet Johal (@sarbjeetjohal) and Apple's partnership with OpenAI has soured, raising concerns about potential future legal disputes. This is a significant issue regarding AI platform collaboration. https:// x.com/sarbjeetjohal/sta

Sarbjeet Johal (@sarbjeetjohal) Apple과 OpenAI의 협력 관계가 틀어졌다는 보도가 나왔으며, 향후 법적 분쟁으로 이어질 가능성이 제기됐습니다. AI 플랫폼 협업과 관련한 중요한 이슈로 주목됩니다. https:// x.com/sarbjeetjohal/status/205 4969948972392885 # apple # openai # legal # partnership # ai

Mastodon — fosstodon.org TIER_1 Türkçe(TR) · [email protected] · 2026-05-14 07:31

For Architects: AI-Assisted Architectural Concept Design and Prompt (Cheat Sheet) https:// 1yz.com.tr/d/70-mimarlar-icin- ai-destekli-mimari-konsept-tasarim-ve-prompt-c

Mimarlar İçin: AI Destekli Mimari Konsept Tasarım ve Prompt (Cheat Sheet) https:// 1yz.com.tr/d/70-mimarlar-icin- ai-destekli-mimari-konsept-tasarim-ve-prompt-cheat-sheet # ai # mimarlar # chatgpt # promt # mühendispromt # aiforum # yapayzekaforum

LINKS 1yz.com.tr/…/70-mimarlar-icin-ai-destekli… 1yz.com.tr/…/70-mimarlar-icin-

Mastodon — fosstodon.org TIER_1 Türkçe(TR) · [email protected] · 2026-05-14 07:28

Artificial Intelligence for Lawyers: Petition and Case Law Analysis Prompts

Hukukçular İçin Yapay Zeka: Dilekçe ve Emsal Karar Analizi Promptları https:// 1yz.com.tr/d/69-hukukcular-ici n-yapay-zeka-dilekce-ve-emsal-karar-analizi-promptlari # dilekçe # ai # yapayzeka # cloude # araç # görsel # yapayzekaforum # 1yz

LINKS 1yz.com.tr/…/69-hukukcular-icin-yapay-zek… 1yz.com.tr/…/69-hukukcular-ici

Mastodon — fosstodon.org TIER_1 Türkçe(TR) · [email protected] · 2026-05-14 07:25

Sales Guaranteed Listing Text and Communication Prompt Guide for Real Estate Agents https:// 1yz.com.tr/d/68-emlak-danisman lari-icin-satis-garantili-ilan-metni-ve-i

Emlak Danışmanları İçin Satış Garantili İlan Metni ve İletişim Prompt Kılavuzu https:// 1yz.com.tr/d/68-emlak-danisman lari-icin-satis-garantili-ilan-metni-ve-iletisim-prompt-kilavuzu # emlakpromt # chatgptemlak # yapayzeka # geminiemlak # yapayzekasatis # yapayzeka # ai # cloude

LINKS 1yz.com.tr/…/68-emlak-danismanlari-icin-s… 1yz.com.tr/…/68-emlak-danisman

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-13 23:54

Anthropic says AI will anticipate your needs before you know what they are. The company's Cat Wu explained at the Code with Claude conference that the next big

Anthropic says AI will anticipate your needs before you know what they are. The company's Cat Wu explained at the Code with Claude conference that the next big step is proactivity - moving beyond reactive chatbots to systems that act preemptively for users. This marks a shift fro…

LINKS techcrunch.com/…/anthropics-cat-wu-says-t…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-13 17:33

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could und

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could undermine your enterprise strategy...and what it means for you. https:// theboard.world/articles/techno logy/prompt-drift-c…

LINKS theboard.world/…/prompt-drift-claude-gemi…

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 22:20

ChatGPT's Voice Mode Supports CarPlay on iOS 26.4. Results of Trying It Out and Points to Note | Lifehacker Japan https://www.yayafa.com/?p=2799375 # AgenticAi # AI # ArtificialGeneralIntelligence # Artificial

ChatGPTの音声モードがiOS26.4でCarPlayに対応。使ってみた結果と注意点について | ライフハッカー・ジャパン https://www. yayafa.com/?p=2799375 # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # ChatGPT # OpenAI # エージェント型AI # 人工知能 # 汎用人工知能

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 22:18

【Claude's '80x Growth' Pushes AI Infrastructure to the Limit】Elon Musk's SpaceX is Anthropic's "Savior" / Demand Forecasting: OpenAI is One Step Ahead / Shota Imai: "The Rapidus Plan Was Correct" [AI QUEST] | TBS CROSS DIG with Bloomberg https://www.yayafa.com/27993

【Claude「80倍成長」でAIインフラが限界】イーロン・マスクのスペースXはアンソロピックの“救世主”／需要予測はOpenAIが一枚上手／今井翔太「ラピダス計画は正しかった」【AI QUEST】 | TBS CROSS DIG with Bloomberg https://www. yayafa.com/2799373/ # AgenticAi # AI # Anthropic # ArtificialGeneralIntelligence # ArtificialIntelligence # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2799373

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 22:15

Leave it to AI too for "Tohoho no WWW Nyumon"! Explanations of "Claude" and "Claude Code" Added - Mado no Mori https://www.yayafa.com/2799371/ #AgenticAi #AI #AICoding #Anthropic #AnthropicClaude #ArtificialGen

AIも「とほほのWWW入門」にお任せ! 「Claude」「Claude Code」の解説が追加 – 窓の杜 https://www. yayafa.com/2799371/ # AgenticAi # AI # AIコーディング # Anthropic # AnthropicClaude # ArtificialGeneralIntelligence # ArtificialIntelligence # claude # ClaudeCode # genai # Webサービス # エージェント型AI # とほほのWWW入門 # ライブ # 人工知能 #…

LINKS yayafa.com/2799371

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 22:12

Google Finance to Expand into Europe | AI Financial Tools Stand at the Boundary Between "Research" and "Advice" https://www.yayafa.com/2799369/ # AgenticAi # AI # ai (Artificial Intelligence) News # ArtificialGeneralIntelligence # ArtificialI

Google Finance、ヨーロッパ進出へ｜AI金融ツールが「研究」と「助言」の境界線に立つ https://www. yayafa.com/2799369/ # AgenticAi # AI # ai （人工知能）ニュース # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepMind # Gemini # Google # GoogleAI # GoogleDeepMind # GoogleGemini # HeadlineNews # エージェント型AI # テクノロジーと…

LINKS yayafa.com/2799369

dev.to — LLM tag TIER_1 · Sergei Peleskov · 2026-05-12 10:59

Why Single Agents Beat Multi-Agent Systems at Equal Token Budgets

<h2> TL;DR </h2> <ul> <li>Stanford (Tran & Kiela, arXiv 2604.02460) tested single-agent vs multi-agent systems with <strong>identical thinking-token budgets</strong> </li> <li>Single agent wins on accuracy AND on compute, across three model families</li> <li>The mechanism is …

dev.to — LLM tag TIER_1 · Rishabh Sethia · 2026-05-12 09:30

Multi-Agent Systems Explained: How Orchestrator + Specialist Agent Architecture Works

<p>Here's the uncomfortable truth about single-agent AI systems: they don't scale. Not because the models aren't capable, but because you're asking one entity to simultaneously plan, execute, research, verify, and synthesize — often in a single context window that fills up faster…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-12 07:42

It is kinda shocking: # Siri used to be very good, picking up commands and doing all the things I needed. Then a few years back I assume they started using # AI

It is kinda shocking: # Siri used to be very good, picking up commands and doing all the things I needed. Then a few years back I assume they started using # AI models to process. It got so bad that now I have disabled # AppleIntelligence . The results? It works back again. Simpl…

dev.to — LLM tag TIER_1 · 丁久 · 2026-05-12 07:33

Multi-Agent Systems: Coordination, Communication, Consensus

<blockquote> <p><em>This article was originally published on <a href="https://dingjiu1989-hue.github.io/en/ai/multi-agent-systems.html" rel="noopener noreferrer">AI Study Room</a>. For the full version with working code examples and related articles, visit the original post.</em>…

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 02:56

CarPlay Another AI from Mr. Musk Appears. Musk's Grok Starts Operating in Cars – Letem svetem Applem https://www.yayafa.com/2798752/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialInt

CarPlay マスク氏のAIがまた一つ登場。マスク氏のGrokが車内で稼働開始 – Letem svetem Applem https://www. yayafa.com/2798752/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # Grok # xai # XAIGrok # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2798752

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 02:53

Will AI Assigned a Task Tamper with Data? Microsoft's Latest Research Exposes the Trap of Autonomous Agents | XenoSpectrum https://www.yayafa.com/2798750/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIn

仕事を任されたAIはデータを改ざんする？Microsoftの最新研究が暴く自律型エージェントの罠 | XenoSpectrum https://www. yayafa.com/2798750/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # Copilot # Microsoft # MicrosoftAI # MicrosoftCopilot # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2798750

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-12 02:50

What is Copilot Cowork? Explaining the latest features, how to use it, and pricing https://www.yayafa.com/2798748/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # Copilot # Microsoft

Copilot Coworkとは？最新機能・使い方・料金を解説 https://www. yayafa.com/2798748/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # Copilot # Microsoft # MicrosoftAI # MicrosoftCopilot # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2798748

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-11 06:25

Why would I want an AI agent to replace my phone? You can’t go anywhere on the internet today without running into AI. In some cases, that’s useful. AI can do s

Why would I want an AI agent to replace my phone? You can’t go anywhere on the internet today without running into AI. In some cases, that’s useful. AI can do some amazing things. On the other hand, it often feels like we’re watching someone reinvent the wheel. Now, wo… https:// …

LINKS 9to5google.com/…/why-would-i-want-an-ai-a…

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-11 02:11

Claude Code Agent Loop 'deepclaude' Using DeepSeek V4 Pro That Can Save Costs to Just 1/17th – GIGAZINE https://www.yayafa.com/2797989/ # AgenticAi # AI # Anthropic # ArtificialGe

コストをなんとわずか17分の1に節約できるDeepSeek V4 Proを使ったClaude Codeエージェントループ「deepclaude」 – GIGAZINE https://www. yayafa.com/2797989/ # AgenticAi # AI # Anthropic # ArtificialGeneralIntelligence # ArtificialIntelligence # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2797989

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-11 02:08

I switched to Claude for work, but I'm still using ChatGPT. I'll introduce how to use both | Business Insider Japan https://www.yayafa.com/2797987/ # AgenticAi # AI # Anthropic # AnthropicClau

仕事で使うAIをClaudeに切り替えましたが、ChatGPTも引き続き使用しています。両者の活用方法を紹介します | Business Insider Japan https://www. yayafa.com/2797987/ # AgenticAi # AI # Anthropic # AnthropicClaude # ArtificialGeneralIntelligence # ArtificialIntelligence # claude # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2797987

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-11 02:05

Google DeepMind Announces AlphaEvolve Results, Reducing DNA Mutation Detection Errors by 30% https://www.yayafa.com/2797985/ # AgenticAi # AI # AlphaEvolve # ArtificialGeneralIntelligence # Artificial

Google DeepMind、「AlphaEvolve」の成果公表 DNA変異検出エラーを30％削減 https://www. yayafa.com/2797985/ # AgenticAi # AI # AlphaEvolve # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepMind # DNA分析 # Gemini # Google # GoogleAI # GoogleDeepMind # GoogleGemini # TPU # エージェント型AI # 人工知能 # …

LINKS yayafa.com/2797985

Mastodon — fosstodon.org TIER_1 日本語(JA) · [email protected] · 2026-05-11 02:03

Google Gemini Agent will soon be able to do anything for you 24/7. https://www.yayafa.com/2797983/ #AgenticAi #AI #ArtificialGeneralIntelligence #ArtificialIn

Google Gemini Agentはまもなく、24時間365日、あなたのためにあらゆることを代行してくれるようになります。 https://www. yayafa.com/2797983/ # AgenticAi # AI # ArtificialGeneralIntelligence # ArtificialIntelligence # DeepMind # Gemini # Google # GoogleAI # GoogleDeepMind # GoogleGemini # エージェント型AI # 人工知能 # 汎用人工知能

LINKS yayafa.com/2797983

Mastodon — fosstodon.org TIER_1 Русский(RU) · [email protected] · 2026-05-10 16:22

AI Public Services in the UAE, Claude in Adobe and Ableton, ChatGPT in Spreadsheets, and AI Spotify by ElevenLabs Hello, this is a new issue of "Neuro-Digest" – short and useful reviews

ИИ-госуслуги в ОАЭ, Claude в Adobe и Ableton, ChatGPT в таблицах и AI Spotify от ElevenLabs Привет, это новый выпуск «Нейро-дайджеста» — коротких и полезных обзоров ключевых событий в мире искусственного интеллекта и технологий. Неделя выдалась насыщенной: ОАЭ хотят перевести пол…

LINKS habr.com/…/1033354 habr.com/…/articles

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-09 20:33

OpenAI releases GPT-5.5 Instant update to make ChatGPT smarter with fewer emoji ChatGPT should feel “smarter and more accurate” starting today, according to Ope

OpenAI releases GPT-5.5 Instant update to make ChatGPT smarter with fewer emoji ChatGPT should feel “smarter and more accurate” starting today, according to OpenAI. That’s because the company is replacing the default model with an update called GPT-5.5 Instant. OpenAI also says t…

LINKS 9to5mac.com/…/gpt-5-5-instant-makes-chatg…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-09 07:18

Apple's AirTag-Sized AI Pendant: Five Features Rumored So Far Apple is developing a wearable AI device that's been described as a pin or pendant, and that could

Apple's AirTag-Sized AI Pendant: Five Features Rumored So Far Apple is developing a wearable AI device that's been described as a pin or pendant, and that could compete with a similar AI product coming from OpenAI's Jony Ive. It wasn't clear if the wearable would actually make it…

LINKS macrumors.com/…/ai-pendant-five-features

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-09 04:42

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could und

Prompt Drift: Will Claude & Gemini Fail in 2026? Prompt drift threatens Claude & Gemini's reliability by 2026. Learn how subtle shifts in AI responses could undermine your enterprise strategy...and what it means for you. https:// theboard.world/articles/techno logy/prompt-drift-c…

LINKS theboard.world/…/prompt-drift-claude-gemi…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-08 12:19

Yubico and OpenAI are partnering on hardware-backed security keys for ChatGPT users. Dawn Manley, senior vice president of product management at Yubico, told us

Yubico and OpenAI are partnering on hardware-backed security keys for ChatGPT users. Dawn Manley, senior vice president of product management at Yubico, told us that traditional security methods are no longer sufficient for AI-driven workflows involving sensitive data and automat…

LINKS movetheneedle.news/…/yubico-targets-the-n…

Mastodon — fosstodon.org TIER_1 Italiano(IT) · [email protected] · 2026-05-08 11:18

🧠 Multi-agent orchestration is a new feature of Claude's Managed Agents. 🤖 A coordinating agent can delegate tasks to multiple independent agents

🧠 La multi-agent orchestration è una nuova funzionalità dei Managed Agents di # Claude . 🤖 Un agente coordinatore può delegare attività a più agenti indipendenti. 👉 I dettagli: https://www. linkedin.com/posts/alessiopoma ro_claude-ai-ai-activity-7458473224192962560-Yr4O ___ ✉️ 𝗦𝗲…

LINKS linkedin.com/…/alessiopomaro_claude-ai-ai… alessiopomaro.it

dev.to — LLM tag TIER_1 · code plato · 2026-05-08 06:27

LLM-Based AI Agent Architecture: A New Kind of Personal Computer on Your Device

<p>For a long time, we've thought of AI as a "chatbot."</p> <p>But if you step back and look from a systems architecture perspective, you'll find that a truly mature AI agent looks more like a new kind of personal computer — one that lives on your device.</p> <p>It has:</p> <ul> …

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-07 21:32

ChatGPT’s ‘Trusted Contact’ will alert loved ones of safety concerns OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assig

ChatGPT’s ‘Trusted Contact’ will alert loved ones of safety concerns OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assign an emergency contact for mental health and safety concerns. Friends, family members, or caregivers designated as a "Tr…

LINKS theverge.com/…/chatgpt-trusted-contact-em…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-05 15:54

❗️ The global cybersecurity gap deepens as AI-powered attacks surge - Restricted access to powerful defensive # AI tools like Anthropic's Mythos leaves some com

❗️ The global cybersecurity gap deepens as AI-powered attacks surge - Restricted access to powerful defensive # AI tools like Anthropic's Mythos leaves some companies, central banks, and nations more vulnerable than others https:// restofworld.org/2026/ai-cybers ecurity-anthropic…

LINKS restofworld.org/…/ai-cybersecurity-anthro…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-05 15:45

‘Astonishing’: Richard Dawkins says AI is conscious, even if it doesn’t know it Chats with AI bots have convinced the evolutionary biologist but most experts sa

‘Astonishing’: Richard Dawkins says AI is conscious, even if it doesn’t know it Chats with AI bots have convinced the evolutionary biologist but most experts say he is being misled by mimicry When Richard Dawkins met Claudia it was like a whirlwind romance. Over three days last w…

LINKS theguardian.com/…/richard-dawkins-ai-cons…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-05 15:21

📰 Nolan's The Odyssey gets a new trailer, and we're here for it "You're a man who needs to control his fate. But you cannot control this." 📰 Source: Ars Technic

📰 Nolan's The Odyssey gets a new trailer, and we're here for it "You're a man who needs to control his fate. But you cannot control this." 📰 Source: Ars Technica 🔗 Link: https://arstechnica.com/culture/2026/05/nolans-the-odyssey-gets-a-new-trailer-and-were-here-for-it/ # AI # Art…

LINKS arstechnica.com/…/nolans-the-odyssey-gets…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-05 15:21

🎮 New Remedy CEO wants to preserve small budgets and break into Asia With Control Resonant, Remedy boss Jean-Charles Gaudechon said the studio has done well to

🎮 New Remedy CEO wants to preserve small budgets and break into Asia With Control Resonant, Remedy boss Jean-Charles Gaudechon said the studio has done well to 'build a triple-A game on a relatively small budget.' 📰 Source: gamedeveloper 🔗 Link: https://www.gamedeveloper.com/prod…

LINKS gamedeveloper.com/…/new-remedy-ceo-wants-…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-05 15:19

🤖 ‘Astonishing’: Richard Dawkins says AI is conscious, even if it doesn’t know it Chats with AI bots have convinced the evolutionary biologist but most experts

🤖 ‘Astonishing’: Richard Dawkins says AI is conscious, even if it doesn’t know it Chats with AI bots have convinced the evolutionary biologist but most experts say he is being misled by mimicryWhen Richard Dawkins met Claudia it was like a whirlwind romance. Over three days last.…

LINKS web.archive.org

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-04 21:38

Your ChatGPT account just got more secure, but you have to opt in - here's how OpenAI adds a feature called Advanced Account Security with four opt-in settings

Your ChatGPT account just got more secure, but you have to opt in - here's how OpenAI adds a feature called Advanced Account Security with four opt-in settings designed to safeguard your account and personal data. https://www. zdnet.com/article/chatgpt-adva nced-account-security/…

LINKS zdnet.com/…/chatgpt-advanced-account-secu…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-04 20:00

🤖 AI AGENTS OpenAI's Codex CLI now has a "/goal" command. It runs autonomous coding loops — keeps going until it self-evaluates completion or hits token limits.

🤖 AI AGENTS OpenAI's Codex CLI now has a "/goal" command. It runs autonomous coding loops — keeps going until it self-evaluates completion or hits token limits. Think of it as the "Ralph loop" pattern, built in. If you're building and want to delegate implementation grunt work wi…

LINKS github.com/…/codex

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-04 20:00

🛠️ DEV TOOLS Goodfire’s Silico lets engineers tweak LLM parameters in real-time during training. No more black-box guesswork—just precision debugging. This is h

🛠️ DEV TOOLS Goodfire’s Silico lets engineers tweak LLM parameters in real-time during training. No more black-box guesswork—just precision debugging. This is how AI development moves from voodoo to engineering. https://www. goodfire.ai/silico # AI # DevTools # LLM # AIAgents

LINKS goodfire.ai/silico

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-04 06:00

»Massive costs – Unlike traditional software, where marginal costs tend towards zero (for example, the millionth copy of Windows costs Microsoft nothing), gener

»Massive costs – Unlike traditional software, where marginal costs tend towards zero (for example, the millionth copy of Windows costs Microsoft nothing), generative # AI requires massive infrastructure.« # KI https:// theconversation.com/openai-get s-set-to-go-public-can-we-entr…

LINKS theconversation.com/openai-gets-set-to-go…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-03 16:40

📰 ChatGPT Became So Obsessed With Goblins That OpenAI Had to Intervene The Wall Street Journal reports that OpenAI "recently gave its popular ChatGPT strict ins

📰 ChatGPT Became So Obsessed With Goblins That OpenAI Had to Intervene The Wall Street Journal reports that OpenAI "recently gave its popular ChatGPT strict instructions. Stop talking about goblins." Recent models of the artificial-intelligence chatbot have been bring... 📰 Source…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-01 21:15

OpenAI's Stumbles Cast Shadow on Infrastructure Partners OpenAI missed revenue and user growth targets, causing stock drops for partners like Oracle and CoreWea

OpenAI's Stumbles Cast Shadow on Infrastructure Partners OpenAI missed revenue and user growth targets, causing stock drops for partners like Oracle and CoreWeave. Learn why this matters for AI infrastructure. # OpenAI # Oracle # CoreWeave # AI # TechStocks https:// newsletter.tf…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-01 21:14

OpenAI's missed targets have caused a dip in stock prices for key partners like Oracle and CoreWeave, raising questions about the AI sector's rapid growth. # Op

OpenAI's missed targets have caused a dip in stock prices for key partners like Oracle and CoreWeave, raising questions about the AI sector's rapid growth. # OpenAI # Oracle # CoreWeave # AI # TechStocks https:// newsletter.tf/openai-revenue-m isses-affect-oracle-coreweave/

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-01 21:02

China's embrace of open-source AI has fueled its rapid rise and global influence, but mounting financial pressures and competitive dynamics are testing whether

China's embrace of open-source AI has fueled its rapid rise and global influence, but mounting financial pressures and competitive dynamics are testing whether that model can endure. https://www. japantimes.co.jp/commentary/20 26/05/01/world/china-cant-quit-open-ai/?utm_medium=So…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-01 17:00

# OpenAI announces new advanced security for # ChatGPT accounts, including a partnership with # Yubico https:// techcrunch.com/2026/04/30/open ai-announces-new-

# OpenAI announces new advanced security for # ChatGPT accounts, including a partnership with # Yubico https:// techcrunch.com/2026/04/30/open ai-announces-new-advanced-security-for-chatgpt-accounts-including-a-partnership-with-yubico/ # AI # cybersecurity # yubikey

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-01 15:01

💸 OpenAI Misses Key Revenue, User Targets in High-Stakes Sprint Toward IPO // WSJ ｢ Chief Financial Officer Sarah Friar has told other company leaders that she

💸 OpenAI Misses Key Revenue, User Targets in High-Stakes Sprint Toward IPO // WSJ ｢ Chief Financial Officer Sarah Friar has told other company leaders that she is worried the company might not be able to pay for future computing contracts if revenue doesn’t grow fast enough, acco…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-05-01 11:55

https:// winbuzzer.com/2026/05/01/opena i-announces-new-advanced-security-for-chatgpt-xcxwbn/ OpenAI Launches Yubico-Backed ChatGPT Account Protection # AI # Ch

https:// winbuzzer.com/2026/05/01/opena i-announces-new-advanced-security-for-chatgpt-xcxwbn/ OpenAI Launches Yubico-Backed ChatGPT Account Protection # AI # ChatGPT # OpenAI # AISecurity # Authentication # Passwordless # Cybersecurity # DataSecurity # AIPartnerships

LINKS winbuzzer.com/…/openagi-unveils-lux-agent…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-04-29 14:02

I Asked AI to Count My Carbs 27,000 Times. It Couldn’t Give Me the Same Answer Twice. > Ask ChatGPT to estimate the carbs in your lunch. Now ask it again. And a

I Asked AI to Count My Carbs 27,000 Times. It Couldn’t Give Me the Same Answer Twice. > Ask ChatGPT to estimate the carbs in your lunch. Now ask it again. And again. Five hundred times. > You’d expect the same answer each time. It’s the same photo, the same model, the same questi…

LINKS diabettech.com/i-asked-ai-to-count-my-car…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-04-29 09:00

Wall Street Journal: OpenAI misses key revenue, user targets in high-stakes sprint toward IPO. “OpenAI recently missed its own targets for new users and revenue

Wall Street Journal: OpenAI misses key revenue, user targets in high-stakes sprint toward IPO. “OpenAI recently missed its own targets for new users and revenue, stumbles that have raised concern among some company leaders about whether it will be able to support its massive spen…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-04-29 06:05

Market slumps as OpenAI reportedly misses internal targets for active users and revenue — Nvidia, Oracle, AMD, and CoreWeave shares all tremble on the news http

Market slumps as OpenAI reportedly misses internal targets for active users and revenue — Nvidia, Oracle, AMD, and CoreWeave shares all tremble on the news https://www. tomshardware.com/tech-industry /artificial-intelligence/market-slumps-as-openai-reportedly-misses-internal-targ…

LINKS tomshardware.com/tech-industry

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-04-29 05:22

Apple Seeds Fourth iOS 26.5 and iPadOS 26.5 Betas to Developers Apple today seeded the fourth betas of upcoming iOS 26.5 and iPadOS 26.5 updates to developers f

Apple Seeds Fourth iOS 26.5 and iPadOS 26.5 Betas to Developers Apple today seeded the fourth betas of upcoming iOS 26.5 and iPadOS 26.5 updates to developers for testing purposes, with the software coming a week after Apple released the third betas. Registered developers can dow…

Mastodon — fosstodon.org TIER_1 · [email protected] · 2026-04-29 04:37

Market slumps as OpenAI reportedly misses internal targets for active users and revenue — Nvidia, Oracle,… Nvidia, Oracle, SoftBank, and CoreWeave saw their sto

Market slumps as OpenAI reportedly misses internal targets for active users and revenue — Nvidia, Oracle,… Nvidia, Oracle, SoftBank, and CoreWeave saw their stock prices go down because of news that OpenAI has been missing its internal targets. SoftBank stock lost 9.9% of its val…

LINKS tomshardware.com/tech-industry