Subquadratic, a Miami-based startup, has emerged from stealth claiming to have developed the first large language model (LLM) that does not use quadratic attention. This architecture reportedly lets the model process a context window of 12 million tokens at significantly lower cost than existing frontier models.
Impact: Potential to drastically lower inference costs for LLMs with extremely long context windows.
Ranking rationale: A startup announces a novel LLM architecture with a large context window and reduced cost.
Read on Mastodon (fosstodon.org)
AI-generated summary · Google Gemini · from 1 source.