PulseAugur / Whispers
LIVE 23:53:26

Whispers

last 72h
[50/135]

The long tail — singletons that escape Brief because nobody else has noticed yet. High novelty, narrow audience, AI-relevant. The opposite signal of consensus.

  1. RESEARCH · Mastodon — fosstodon.org · · [2 sources]

    South African universities developing their own ChatGPTs that better understand local languages https:// squeet.me/display/962c3e10-9c4 214e3-7d04746c1071eaf7

    South African universities are developing AI models tailored to understand local languages, aiming to surpass the capabilities of international models like ChatGPT in regional contexts. This initiative spans institutions from Cape Town to the Free State, with researchers actively working on these specialized language-focused AI systems. The goal is to create AI that is more attuned to the nuances and specificities of South African languages. AI

    South African universities developing their own ChatGPTs that better understand local languages https:// squeet.me/display/962c3e10-9c4 214e3-7d04746c1071eaf7

    IMPACT Local language AI development could improve accessibility and utility of AI tools for South African communities.

  2. RESEARCH · Mastodon — mastodon.social Polski(PL) · · [3 sources]

    The latest Claude Mythos Preview model has reached the limits of METR organization's research methodology, demonstrating capabilities beyond current measurement standards.

    Anthropic's Claude Mythos Preview model has demonstrated capabilities that push the boundaries of current evaluation methodologies, according to METR. The model achieved completion times of over 16 hours for 50% of tasks and 3 hours for 80%, surpassing previous benchmarks. This advancement highlights the rapid progress in AI capabilities and raises questions about the adequacy of existing assessment tools. AI

    IMPACT Demonstrates AI models are outpacing current evaluation benchmarks, signaling a need for new assessment tools.

  3. COMMENTARY · Mastodon — fosstodon.org ·

    From the late 1950s (after the invention of NN) to the late 1990s (prior to the emergence of DL), connectionist # AI had been firmly in the hands of practitione

    The history of connectionist AI research spans from the late 1950s, following the invention of neural networks, until the late 1990s, preceding the rise of deep learning. During this period, the field of connectionism was primarily advanced by practitioners. AI

    IMPACT Provides historical context on AI research trends.

  4. TOOL · Mastodon — sigmoid.social ·

    Frontier LLMs corrupt 25% of documents in long workflows per new benchmark, while a Fields Medalist reports ChatGPT 5.5 Pro solving PhD-level math. Mayo Clinic

    A new benchmark reveals that frontier large language models degrade approximately 25% of documents during extended workflows. Separately, a Fields Medal winner has reported that ChatGPT 5.5 Pro is capable of solving complex PhD-level mathematics problems. AI

    IMPACT New benchmarks highlight potential data corruption issues with frontier LLMs, while advanced models demonstrate capabilities in complex academic domains.

  5. RESEARCH · 量子位 (QbitAI) 中文(ZH) ·

    8-year-old elementary school student's idea directly becomes an app, Miaoda 3.0 just eliminated the AI application threshold.

    Baidu has launched Miaoda 3.0, an AI application development platform that significantly lowers the barrier to creating production-ready applications. The new version enables users to generate not only web applications but also native iOS and Android apps directly from natural language prompts, with features like online hot updates and mobile app development capabilities. Miaoda 3.0 also introduces an enterprise version with enhanced collaboration, permission management, and stability features, positioning itself as a comprehensive platform for both individual creators and businesses. AI

    IMPACT Accelerates AI application development and adoption by empowering a wider range of users, including non-developers and enterprises.

  6. TOOL · dev.to — LLM tag ·

    99% of Requests Failed and My Dashboard Showed Green

    A blog post details how to use NVIDIA's AIPerf tool to uncover hidden performance issues in LLM deployments. Initial tests with a local model showed excellent baseline performance, but increasing concurrency revealed a dramatic increase in time-to-first-token (TTFT), with 99% of requests failing a 500ms SLO. The analysis highlighted that the bottleneck is not the model's inter-token latency (ITL), which remained stable, but rather the request queuing and prefill phase, suggesting architectural solutions like better queue management or horizontal scaling are needed. AI

    99% of Requests Failed and My Dashboard Showed Green

    IMPACT Highlights critical performance testing methodologies for LLM deployments, impacting operators by revealing how to avoid user-facing failures.

  7. RESEARCH · 36氪 (36Kr) 中文(ZH) ·

    37 Interactive Entertainment: Proposes to distribute 2.10 yuan per 10 shares in Q1 2026

    China's National Computer Network Information Center has registered 72 new generative AI services in March and April 2026, with an additional 49 applications or features utilizing these services also completing their registration process. This brings the total number of registered generative AI services to 868 and applications to 530 as of April 30, 2026. The filings are part of an ongoing effort to regulate AI development and deployment within the country. AI

    IMPACT Confirms ongoing regulatory oversight and tracking of generative AI development in China.

  8. RESEARCH · Mastodon — sigmoid.social ·

    Nearly 14,000 applicants, 2,599 awards: the American National Science Foundation (NSF) increases PhD fellowships again. Engineering, quantum science and AI lead

    The National Science Foundation (NSF) has expanded its PhD fellowship program, awarding 2,599 grants to applicants out of nearly 14,000 who applied. This competitive program saw significant growth in applications for engineering, quantum science, and artificial intelligence. AI

    IMPACT Increased NSF funding for AI PhDs will support future research and talent development in the field.

  9. TOOL · Towards AI ·

    I Actually Built It. Here’s Every Line That Matters — and Every Line That Broke First.

    The author details the practical implementation of the A2A Protocol, an open standard for agent discovery and task delegation. This second part focuses on the code, outlining the architecture where the orchestrator acts as both a server and a client. It highlights the importance of the orchestrator being an A2A service to receive structured tasks and emit failure events, contrasting this with a simpler client-only script. The project structure and setup for the shared agent and customer-specific orchestrators are also provided. AI

    I Actually Built It. Here’s Every Line That Matters — and Every Line That Broke First.

    IMPACT Provides a practical, code-level guide to implementing agent interoperability, potentially accelerating adoption of decentralized agent systems.

  10. RESEARCH · 36氪 (36Kr) 中文(ZH) ·

    Lin Junyang starts a business, new company valued at about 2 billion US dollars | Intelligent Emergence Exclusive

    Lin Junyang, formerly the technical lead for Alibaba's Qwen large language models, has launched a new AI venture. The company is reportedly exploring directions such as world models and embodied intelligence. Lin is seeking to raise funds at a valuation of approximately $2 billion USD, with initial discussions held with venture capital firms like Sequoia China and Gaorong Capital. AI

    Lin Junyang starts a business, new company valued at about 2 billion US dollars | Intelligent Emergence Exclusive

    IMPACT Signals a potential new frontier in embodied AI and world models, attracting significant early-stage investment.

  11. TOOL · Mastodon — sigmoid.social 한국어(KO) ·

    Show HN: TikTok but for Scientific Papers. Papel is an app that allows you to explore and understand scientific papers like social media, indexing over 2 million papers and allowing instant querying of paper content with AI-powered on-device natural language processing. Personalized recommendations,

    Papel is a new application designed to make scientific papers more accessible and engaging, akin to a social media platform. It indexes over 2 million papers and uses on-device AI for instant natural language querying of their content. The app aims to enhance research discovery and community interaction with features like personalized recommendations, an AI chatbot, and interactive quizzes, all while prioritizing user privacy through local data processing. AI

    IMPACT This app could streamline research discovery and collaboration by making scientific literature more accessible and interactive.

  12. TOOL · Mastodon — fosstodon.org ·

    SPS has launched Philips SpeechLive Health, moving beyond dictation into a world of "ambient" documentation. The system listens, learns, and writes medical note

    SPS has launched Philips SpeechLive Health, a new system designed to automate medical note-taking. This ambient documentation tool listens to patient encounters and generates clinical notes, freeing up healthcare professionals. The system utilizes AI models specifically trained on healthcare data to ensure accuracy with medical terminology and context, aiming to minimize errors. AI

    IMPACT Automates clinical documentation, potentially reducing clinician burnout and improving efficiency in healthcare settings.

  13. TOOL · Mastodon — mastodon.social ·

    # AcademicJob | # PhDStudentship PhD in Music Information Retrieval for Irish Traditional Music 📍Maynooth University, Ireland Fully funded PhD in MIR, audio sig

    Maynooth University in Ireland is offering a fully funded PhD position focused on Music Information Retrieval for Irish Traditional Music. The studentship will involve research in audio signal processing, machine learning, and computational analysis of traditional Irish music. Applicants from computer science, music technology, audio processing, and AI/ML backgrounds are encouraged to apply, with a deadline of May 29, 2026. AI

    IMPACT This PhD opportunity could lead to new AI applications in musicology and cultural heritage preservation.

  14. TOOL · Mastodon — fosstodon.org Italiano(IT) ·

    In Stockholm, in the Vasastan district, there is a bar called Andon Café: it has been there since last April, it serves coffee and some pastries, and behind the counter work bartenders.

    A café in Stockholm's Vasastan district, named Andon Café, is utilizing an AI agent named Mona to manage its operations. Mona, powered by Google's Gemini model, handles tasks such as contracts, supplier relations, pricing, and even hiring. While human baristas serve customers, the AI agent acts as the manager. AI

    In Stockholm, in the Vasastan district, there is a bar called Andon Café: it has been there since last April, it serves coffee and some pastries, and behind the counter work bartenders.

    IMPACT AI agents are being integrated into everyday business operations, demonstrating potential for automation in customer-facing roles.

  15. TOOL · Mastodon — sigmoid.social ·

    Interspectral has been selected to lead a Swedish research consortium alongside Saab, AMEXCI and Scaleout Systems, developing AI-powered quality assurance for a

    Interspectral will lead a Swedish research consortium, including Saab, AMEXCI, and Scaleout Systems, to develop AI-driven quality assurance for aerospace and defense additive manufacturing. The project, named TRUSTAM, will employ federated learning to enhance AI models across different production facilities without the need for raw data sharing. This approach aims to improve quality control in a sensitive industry. AI

    IMPACT Federated learning application in aerospace manufacturing could set new standards for secure AI model development and quality control.

  16. TOOL · Engadget ·

    NBA The Run hits the streets on June 9

    NBA The Run, an arcade-style basketball game developed by Play by Play Studios, is set to launch on June 9 for PlayStation 5, Xbox Series X/S, and Steam. The game draws inspiration from the NBA Street series, featuring 3v3 matches and over-the-top action rather than simulation. It includes a AI

    NBA The Run hits the streets on June 9
  17. TOOL · Mastodon — fosstodon.org Deutsch(DE) ·

    From Idea to MVP. At the "Business meets AI" Hackathon, a concrete solution and the team were created in 5 days from the first thought to the minimum viable product.

    A Business meets AI hackathon successfully developed a minimum viable product within five days, with the BVB team winning the technology category. The discussion covers preparation, teamwork under pressure, and the transition to operational use. The podcast episode is available on Apple Podcasts and Spotify. AI

    From Idea to MVP. At the "Business meets AI" Hackathon, a concrete solution and the team were created in 5 days from the first thought to the minimum viable product.

    IMPACT Demonstrates rapid product development through AI hackathons, potentially accelerating business solutions.

  18. RESEARCH · 404 Media ·

    War and Data Centers Are Driving Up the Cost of Fiber-Optic Cable

    The cost of fiber-optic cable is surging due to a dual demand from ongoing conflicts and the rapid expansion of data centers for AI development. Military use, particularly in Ukraine, has increased significantly, with prices for cable spools rising dramatically. Simultaneously, major tech companies are placing massive orders for data centers, leading to supply shortages and further price hikes, with projections indicating a continued "fiber famine" in the coming years. AI

    War and Data Centers Are Driving Up the Cost of Fiber-Optic Cable

    IMPACT Accelerates AI development by highlighting infrastructure constraints and rising costs for essential compute resources.

  19. TOOL · Wired — AI ·

    DHS Plans Experiment Running ‘Reconnaissance’ Drones Along the US-Canada Border

    The Department of Homeland Security is planning an experiment this fall to test autonomous drones and vehicles along the US-Canada border. This joint exercise with Defense Research and Development Canada, named ACE-CASPER, will evaluate the ability of these systems to stream surveillance data across the border using 5G networks. While framed as a public safety and emergency response simulation, the experiment also aims to demonstrate capabilities for gathering real-time battlefield intelligence, using terminology from the Department of Defense. AI

    DHS Plans Experiment Running ‘Reconnaissance’ Drones Along the US-Canada Border

    IMPACT This experiment could advance the use of AI in border security and surveillance, potentially influencing future technology procurement and deployment.

  20. RESEARCH · 36氪 (36Kr) 中文(ZH) ·

    Biwin Storage: Re-submits H-share Listing Application

    Alibaba's AI business has entered a commercialization phase, with annualized recurring revenue from its AI models and applications, including the Baichuan MaaS platform, projected to exceed 10 billion yuan in the June quarter and reach 30 billion yuan by year-end. This growth is driven by increasing demand from enterprise clients for model and application services, evidenced by a significant rise in token consumption. Concurrently, storage company Bowei Storage has refiled its application for an H-share listing in Hong Kong, though the offering remains subject to regulatory approvals. AI

    IMPACT Alibaba's AI commercialization signals a shift towards profitability and enterprise adoption, potentially driving further investment in AI infrastructure and services.

  21. RESEARCH · 36氪 (36Kr) 中文(ZH) ·

    Alibaba: Distributes 26 Fiscal Year Regular Cash Dividend to Ordinary Shareholders and ADS Holders

    Alibaba announced its fiscal year 2026 financial results, highlighting significant growth in its cloud computing segment. The company's AI-related products now constitute 30% of its external cloud revenue, which saw a 40% increase in commercialization. Additionally, Alibaba declared a quarterly cash dividend for its shareholders, payable in USD. AI

    IMPACT Alibaba Cloud's AI products are experiencing rapid adoption, indicating strong enterprise demand and accelerating the integration of AI into business operations.

  22. COMMENTARY · Medium — MLOps tag ·

    Evaluation Is Not a Pre-Deploy Step. It Is a Production Signal.

    This article argues that model evaluation should not be a one-time step before deployment but rather an ongoing process that provides continuous signals in production. The author emphasizes that traditional pre-deployment evaluation is insufficient for complex systems like large language models (LLMs). Instead, continuous monitoring and evaluation in a live environment are crucial for understanding model performance and identifying issues. AI

    Evaluation Is Not a Pre-Deploy Step. It Is a Production Signal.

    IMPACT Highlights the need for continuous evaluation in production for LLMs, suggesting a shift in MLOps practices.

  23. TOOL · dev.to — LLM tag ·

    There Is No Single "Best Model"

    A new report indicates that no single AI model consistently leads across all benchmarks, with different models excelling in specific areas like coding or math. The evaluation process itself is also complex, as multiple frontier models provide divergent reasoning for their scores when judging agent performance. This suggests that developers need to employ continuous, multi-model evaluation strategies rather than relying on a single leaderboard for model selection. AI

    There Is No Single "Best Model"

    IMPACT Developers must adopt multi-model evaluation strategies due to inconsistent performance across benchmarks.

  24. TOOL · dev.to — LLM tag ·

    Claude Found Eleven Medical Errors in One Family's Records

    A software engineer utilized Anthropic's Claude Opus model to analyze years of his family's medical records, identifying eleven potential errors or missed opportunities. The system, built as a personal project, fed a comprehensive JSON document of patient data into Claude Opus, which then flagged issues such as drug contraindications, a missing routine test, and a mislabeled prescription. This experiment suggests that LLMs can already outperform existing healthcare systems in specific analytical tasks related to medical record review. AI

    IMPACT Demonstrates LLMs' potential to identify critical errors in complex medical data, suggesting future applications in healthcare analysis.

  25. TOOL · Medium — Claude tag ·

    Welcome, Mythos.

    Mythos, a new AI model, has been introduced, described as "The Day AI Sat on Bedrock." The announcement was made on Medium, with further details available via a link to the platform. AI

    Welcome, Mythos.

    IMPACT Introduction of a new AI model, potentially impacting future AI development and applications.

  26. TOOL · arXiv cs.AI Norsk(NO) ·

    Overtrained, Not Misaligned

    A new study published on arXiv investigates emergent misalignment (EM) in large language models, finding it is not a universal phenomenon but rather an artifact of overtraining. Researchers tested 12 open-source models across four families and discovered that EM is more prevalent in larger models and emerges late in the training process. The study suggests practical mitigation strategies, such as early stopping during fine-tuning, which can eliminate EM while retaining most task performance. AI

    IMPACT Demonstrates that emergent misalignment in LLMs can be mitigated through careful training practices, reframing it as an avoidable artifact rather than an inherent risk.

  27. TOOL · The Register — AI ·

    Lawsuit brought by former store operators missing from Vodafone results

    Frontier AI safety tests might inadvertently create the risks they aim to prevent. Researchers are exploring how these tests could potentially generate or exacerbate the very dangers they are designed to mitigate. This raises concerns about the effectiveness and potential unintended consequences of current AI safety methodologies. Further investigation is needed to understand and address these emergent risks. AI

    Lawsuit brought by former store operators missing from Vodafone results

    IMPACT Current AI safety testing methods may be counterproductive, potentially creating the risks they are designed to prevent.

  28. RESEARCH · 36氪 (36Kr) 中文(ZH) · · [2 sources]

    Japan's real wages rise for the third consecutive month in March, providing support for Bank of Japan rate hikes

    A developer has created an agent using the DeepSeek-V4 model, which has gained significant traction on GitHub. This development highlights the growing interest and capability in building autonomous AI agents. The success of this project suggests a potential shift towards more sophisticated AI applications. AI

    IMPACT Demonstrates the growing capability and community interest in developing AI agents using advanced models.

  29. RESEARCH · Stratechery (free posts) ·

    The Deployment Company, Back to the 70s, Apple and Intel

    OpenAI has launched a new entity, the OpenAI Deployment Company, backed by over $4 billion in initial investment. This new venture aims to help organizations integrate and deploy AI systems by embedding specialized engineers. The move follows a trend of tech companies, including Google and Anthropic, establishing dedicated teams and partnerships to facilitate enterprise AI adoption. AI

    The Deployment Company, Back to the 70s, Apple and Intel

    IMPACT Accelerates enterprise AI adoption by providing dedicated deployment resources and expertise, potentially setting a new standard for AI integration services.

  30. RESEARCH · Pandaily ·

    SEEKOO Raises Tens of Millions, Launches Multi-Agent Video Platform Anijam.ai

    SEEKOO, a video AI startup, has secured tens of millions of dollars in funding. The company also launched Anijam.ai, a new platform that utilizes a multi-agent system for video generation. Anijam.ai has already attracted over 1,000 paying users, indicating early market adoption. AI

    IMPACT This funding and product launch could accelerate the development and adoption of multi-agent video generation technologies.

  31. TOOL · Towards AI ·

    If You Had To Read Only 5 AI Papers, This Should Be It.

    This article highlights five foundational AI papers that are considered essential reading for AI engineers. It aims to explain the core contributions of each paper and their lasting significance in the field. The selection focuses on works that have fundamentally shaped current AI development and understanding. AI

    If You Had To Read Only 5 AI Papers, This Should Be It.

    IMPACT Provides a curated list of seminal AI research papers, offering foundational knowledge for practitioners.

  32. TOOL · Medium — fine-tuning tag ·

    Fine-tuning a VLM is mostly not a training problem. Here are the four decisions that mattered more.

    This article argues that fine-tuning a vision-language model (VLM) is less about the technical training process and more about strategic decisions made beforehand. The author highlights four key choices that significantly impact the outcome of fine-tuning, suggesting that focusing on these decisions yields better results than solely optimizing training parameters. AI

    Fine-tuning a VLM is mostly not a training problem. Here are the four decisions that mattered more.

    IMPACT Focusing on strategic decisions over training complexity can streamline VLM fine-tuning, potentially accelerating development and deployment.

  33. TOOL · Mastodon — mastodon.social ·

    Palantir’s true believers are wearing this jacket In late April, Palantir - the software company that, in recent years, has perhaps become best known for its de

    Palantir announced it is adding AI capabilities to its defense and intelligence software. This move aims to enhance the company's offerings for government clients. The company's focus on defense contracts and work with agencies like ICE has been a significant part of its recent business. AI

    IMPACT Enhances existing government software with AI, potentially improving defense and intelligence operations.

  34. RESEARCH · 雷峰网 (Leiphone) 中文(ZH) ·

    Magic Atomic Lands in Silicon Valley, Industry's First 'Self-Evolving Embodied Brain' Released

    MagicLab, a Chinese embodied AI company, hosted the Global Embodied Intelligence Summit (GEIS) in Silicon Valley, launching its "self-evolving embodied brain" called Magic-Mix. This new world model aims to address key industry challenges such as robots lacking physical common sense and precise manipulation. MagicLab also unveiled the H01 dexterous hand with advanced sensing and the MagicBot X1 humanoid robot, designed for heavy-duty industrial tasks and expected to reach mass commercial delivery by 2026. AI

    Magic Atomic Lands in Silicon Valley, Industry's First 'Self-Evolving Embodied Brain' Released

    IMPACT Sets new benchmarks for embodied AI capabilities, potentially accelerating the development and deployment of advanced robotics in industrial and consumer applications.

  35. RESEARCH · 雷峰网 (Leiphone) 中文(ZH) ·

    8 million Robotaxis in three years, 300,000 in 2030, what's the basis for Yin Qi and Zhao Ming?

    Challenger startup Qianli Technology, co-founded by AI veteran Yin Qi and former Honor CEO Zhao Ming, aims to become a top global autonomous driving supplier within three years. The company is pursuing an aggressive strategy of deploying L4-level autonomous driving architecture into L2 production vehicles, leveraging a unified technical framework and a proprietary foundational model developed with Jieyue Xingchen. Qianli Technology has set ambitious targets, including delivering 8 million sets of intelligent driving solutions in three years and having 300,000 Robotaxis on the road by 2030, with early commercial successes seen in the Zeekr 8X model. AI

    IMPACT Sets aggressive targets for L4-level autonomous driving in consumer vehicles, potentially accelerating the adoption of advanced driver-assistance systems and Robotaxi services.

  36. TOOL · 36氪 (36Kr) 中文(ZH) ·

    Agency: 22% of European telecom operators have participated in D2D satellite services as the market enters the early commercialization stage

    Meitu's AI research arm, MT Lab, has had six papers accepted into major international conferences including ICLR, CVPR, and ICML. One paper on scene text editing, accepted by ICML 2026, has already been integrated into Meitu Design Room and Meitu Xiuxiu PC as a 'seamless text modification' feature. This new functionality supports multiple languages and maintains visual consistency without obvious editing marks. AI

    IMPACT Showcases advancements in AI-powered image editing, potentially improving user experience and creative tools.

  37. RESEARCH · 36氪 (36Kr) 中文(ZH) · · [2 sources]

    China's largest single-line capacity large tow carbon fiber production line is built and put into operation

    The Beijing Academy of Artificial Intelligence (BAAI) has launched the FlagSafe large model security platform, collaborating with several leading Chinese institutions. This platform integrates multiple advanced AI security research projects, focusing on red teaming, blue teaming, and white-box analysis. Its goal is to establish a high-standard system for discovering, defending against, and interpreting risks in large language models. AI

    IMPACT Establishes a dedicated platform for advancing large model security research and development.

  38. TOOL · Databricks Blog ·

    ABAC row filtering and column masking policies, governed tags, and data classification are now generally available in Unity Catalog

    Databricks has announced the general availability of new features within its Unity Catalog designed to enhance data protection and governance. These capabilities include Attribute-Based Access Control (ABAC) for row filtering and column masking, standardized data classification through governed tags, and automated data detection and tagging. The aim is to provide scalable, consistent, and real-time protection for sensitive data across an organization's entire data estate, reducing manual overhead and improving compliance. AI

    IMPACT Enhances data security and compliance for AI/ML workflows by automating sensitive data protection.

  39. TOOL · 36氪 (36Kr) 中文(ZH) ·

    Hanvon Technology Releases Handwriting Pen M6

    Hanwang Technology has launched the M6, a device that combines recording, note-taking, and reading functionalities. The M6 supports real-time translation for 51 languages, enabling seamless cross-lingual meeting experiences. It integrates Hanwang's proprietary 'Tiandi' large model, along with other models like DeepSeek and Tongyi Qianwen, to provide AI assistance for tasks such as summarizing meeting highlights and drafting documents. AI

    IMPACT Integrates existing large language models into a hardware device to enhance productivity for cross-lingual communication.

  40. TOOL · arXiv cs.CV (TL) ·

    Count Anything at Any Granularity

    Researchers have introduced a new framework for open-world object counting, addressing the brittleness of current vision-language models in accurately identifying and counting objects based on user intent. They propose redefining counting as a multi-grained problem, where both visual examples and detailed text prompts, including negative prompts, specify the target appearance and semantic granularity. To overcome the data limitations for this approach, they developed an automated pipeline using 3D synthesis and VLM filtering to create KubriCount, the largest dataset for counting tasks. Their new model, HieraCount, leverages both text and visual exemplars to significantly improve multi-grained counting accuracy and generalize to real-world scenarios. AI

    IMPACT Introduces a more robust method for object counting, potentially improving applications that rely on visual scene understanding and quantification.

  41. RESEARCH · Hugging Face Daily Papers · · [2 sources]

    Is Your Driving World Model an All-Around Player?

    Researchers have introduced WorldLens, a new benchmark designed to evaluate the realism and behavioral fidelity of driving world models. Current models often excel in either visual realism or physical consistency but not both, creating a gap in how their performance is assessed. WorldLens addresses this by measuring aspects like pixel quality, 4D geometry, closed-loop driving, and human perceptual alignment across 24 dimensions. Evaluations using WorldLens revealed that no single model performs optimally across all criteria, highlighting the need for more comprehensive assessment tools. AI

    IMPACT Establishes a new standard for evaluating driving world models, pushing for improvements in both visual and behavioral realism.

  42. TOOL · Towards AI ·

    I Built an RSI for My RSI

    This article explores the concept of Recursive Self-Improvement (RSI) by proposing a novel metric, the RSI for RSI. The author details the development and application of this metric, aiming to provide a quantitative measure for assessing the effectiveness of self-improving AI systems. The work contributes to the theoretical understanding of AI advancement and its potential for accelerated progress. AI

    I Built an RSI for My RSI

    IMPACT Introduces a new metric for evaluating the progress of self-improving AI systems, potentially aiding future research in AI safety and capability.

  43. TOOL · Medium — MLOps tag Deutsch(DE) ·

    Understanding DBSCAN

    DBSCAN is a clustering algorithm that identifies dense regions of data points to discover arbitrary shapes. It groups together points that are closely packed, marking outliers as noise. This method is particularly effective for finding clusters of varying densities and complex structures within datasets. AI

    Understanding DBSCAN

    IMPACT Explains a core clustering technique used in data analysis and machine learning.

  44. TOOL · arXiv cs.CL Suomi(FI) ·

    Key-Value Means

    Researchers have introduced Key-Value Means (KVM), a new attention mechanism for transformers that can handle both fixed-size and growing states. When implemented with a fixed-size cache, KVM functions as an O(N) chunked RNN with minimal parameter additions. A growable KVM cache version demonstrates competitive performance on long-context tasks, offering subquadratic prefill time and sublinear state growth. This approach is compatible with standard operations, supports chunk-wise parallelizable training, and provides a flexible trade-off between prefill time complexity and memory usage. AI

    IMPACT Introduces a novel attention mechanism that improves transformer efficiency for long-context tasks.

  45. TOOL · Forbes — Innovation ·

    Your Ultimate Travel Guide To The 2026 Total Solar Eclipse

    A total solar eclipse is set to occur on August 12, 2026, traversing parts of Greenland, Iceland, and Spain. Unlike the recent North American eclipse, this event will feature a low sun angle, potentially creating dramatic AI

    Your Ultimate Travel Guide To The 2026 Total Solar Eclipse
  46. TOOL · dev.to — LLM tag ·

    I fine-tuned a bias judge for $30. The training was the easy part.

    A developer fine-tuned Google's Gemma 4 E4B model into a bias judge for approximately $30, a process that took two weeks with most of the effort focused on data pipeline construction rather than GPU time. The resulting model, capable of running locally in 30 seconds, evaluates pairs of responses to identify social bias using the Bias Benchmark for QA (BBQ) dataset. The developer encountered challenges with classification leaks, data ceilings imposed by the BBQ dataset, and disagreements among different LLMs used for labeling, ultimately leading to a refined data construction strategy. AI

    I fine-tuned a bias judge for $30. The training was the easy part.

    IMPACT Demonstrates cost-effective fine-tuning of open-source models for specialized tasks like bias detection, potentially lowering barriers for AI safety research.

  47. TOOL · dev.to — LLM tag · · [2 sources]

    I Built a Local-First Alternative to LangSmith After Spending $200 Debugging a Pipeline I Couldn't See | Shivnath Tathe

    Shivnath Tathe has developed "opensmith," a local-first tool designed to trace and debug LLM pipelines without sending data to the cloud. This alternative to services like LangSmith allows developers to monitor function calls, latency, token usage, costs, and errors directly on their machine. The tool gained significant traction, with over 600 downloads in its first day, indicating a strong developer demand for privacy-focused, offline observability solutions in LLM application development. AI

    I Built a Local-First Alternative to LangSmith After Spending $200 Debugging a Pipeline I Couldn't See | Shivnath Tathe

    IMPACT Addresses developer need for privacy-preserving, local observability in LLM applications, potentially accelerating development for sensitive use cases.

  48. RESEARCH · Pandaily ·

    Yiren Technology Closes Tens of Millions in Sequential Pre-A++ Funding, AI Data Revenue Exceeds 100M RMB

    Yiren Technology has secured tens of millions of RMB in a new funding round, following a previous successful close. The company has surpassed 100 million RMB in annual revenue from its AI data services, marking it as the first in China's AI data sector to achieve profitability. AI

    Yiren Technology Closes Tens of Millions in Sequential Pre-A++ Funding, AI Data Revenue Exceeds 100M RMB

    IMPACT Confirms strong market demand for AI data services and profitability potential in the sector.

  49. RESEARCH · 36氪 (36Kr) 中文(ZH) ·

    Scotiabank Canada: Global copper market expected to see a deficit of 350,000 tons in 2027

    Xunfei's Doubao LLM is reportedly receiving enhanced capabilities, though specific details remain undisclosed. Separately, Scenovation Technology has secured nearly $100 million in Series C funding, led by Suzhou Industrial Park Investment Group, to advance its automotive and embodied AI chip development. Additionally, a report from Scotiabank predicts a global copper deficit of 350,000 tons by 2027, driven by robust demand and supply-side challenges. AI

    IMPACT AI advancements in chip technology and LLMs continue, while market predictions highlight resource constraints impacting future AI development.