PulseAugur
实时 16:16:11
English(EN) Baidu has unveiled Unlimited-OCR, a new model that solves a fundamental bottleneck in long-document transcription. By introducing Reference Sliding Window Atten

百度发布Unlimited OCR以实现高级长文档解析

百度发布了Unlimited OCR,一个专为高级文档解析设计的新模型。该模型利用恒定的KV缓存机制,在长文档上取得了最先进的性能。它可在Hugging Face上获取,并与Transformers等流行库以及vLLM和SGLang等推理提供商集成,提供了包括Docker在内的灵活部署选项。 AI

影响 此次发布提供了改进的长文档解析能力,可能惠及处理大量文本数据的行业。

排序理由 来自知名AI实验室(百度)的模型发布,具有特定名称和功能。[lever_c_demoted from frontier_release: ic=2 ai=1.0]

在 Mastodon — fosstodon.org 阅读 →

AI 生成摘要 · Google Gemini · 来自 7 个来源。 我们如何撰写摘要 →

百度发布Unlimited OCR以实现高级长文档解析

报道来源 [7]

  1. Hugging Face Trending Models TIER_1 (ET) · baidu ·

    baidu/Unlimited-OCR

    image-text-to-text · 47 downloads · 55 likes

  2. Pandaily TIER_1 English(EN) · [email protected] (Pandaily) ·

    Baidu Unveils Unlimited-OCR: Constant KV Cache Delivers SOTA Performance on Long Documents

    Baidu Unveils Unlimited-OCR: Constant KV Cache Delivers SOTA Performance on Long Documents

  3. r/LocalLLaMA TIER_1 English(EN) · /u/zxyzyxz ·

    Baidu: One-shot Long-horizon Parsing

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1udiz7h/baidu_oneshot_longhorizon_parsing/"> <img alt="Baidu: One-shot Long-horizon Parsing" src="https://external-preview.redd.it/otatLedMYkn2U0gyOiLWHhD0bBH0tDg6xQ-7MlaT4Wk.png?width=640&amp;crop=smart&amp;a…

  4. Mastodon — fosstodon.org TIER_1 中文(ZH) · [email protected] ·

    🌘 GitHub - baidu/Unlimited-OCR: The Era of Unlimited OCR: Embracing the Revolution of Single-Pass Long-View Analysis ➤ Building a High-Performance, Long-Text Industrial-Grade OCR Analysis Solution ✤ https://github.com/baidu/Unlimited-OCR Baidu has open-sourced the "Unlimited-OCR" project, aiming to further push the boundaries of document analysis technology

    🌘 GitHub - baidu/Unlimited-OCR:無限 OCR 時代:迎接單次長視野解析的革命 ➤ 打造高效能、長文本的工業級 OCR 解析方案 ✤ https:// github.com/baidu/Unlimited-OCR 百度開源了「Unlimited-OCR」專案,旨在進一步推進文檔解析技術的邊界。該工具專注於「單次長視野解析」(One-shot Long-horizon Parsing),能夠高效處理單頁與多頁文件的 OCR 需求。該模型不僅支援 Huggingface Transformers 的標準推理,還針對高效能需求提供了…

  5. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    🚀 # GitHub and # Baidu introduce "Unlimited OCR: One-Shot Long-Horizon Parsing," proving that even # AI can get lost in its own overcomplicated jargon maze. 🙄 W

    🚀 # GitHub and # Baidu introduce "Unlimited OCR: One-Shot Long-Horizon Parsing," proving that even # AI can get lost in its own overcomplicated jargon maze. 🙄 With promises of "direct agents" and "automate any workflow," it's like they've discovered the fax machine of the digital…

  6. Mastodon — fosstodon.org TIER_1 English(EN) · [email protected] ·

    Baidu has unveiled Unlimited-OCR, a new model that solves a fundamental bottleneck in long-document transcription. By introducing Reference Sliding Window Atten

    Baidu has unveiled Unlimited-OCR, a new model that solves a fundamental bottleneck in long-document transcription. By introducing Reference Sliding Window Attention, it compresses memory from linear to constant growth, achieving 93.92 percent on the OmniDocBench benchmark. The 3B…

  7. Mastodon — mastodon.social TIER_1 English(EN) · AI_Tech_News_UK ·

    🔥 Unlimited OCR: One-Shot Long-Horizon Parsing Researchers have developed a new OCR (Optical Character Recognition) system that can parse long-horizon text with

    🔥 Unlimited OCR: One-Shot Long-Horizon Parsing Researchers have developed a new OCR (Optical Character Recognition) system that can parse long-horizon text with unprecedented accuracy. This technology has significant implications for document scanning and data extraction, and cou…