PulseAugur
EN
LIVE 11:46:30

Meta uses SOLARIS to scale recommendation model inference

Researchers have developed SOLARIS, a new framework designed to make large recommendation models more efficient for real-time serving. SOLARIS uses a speculative approach to precompute user-item interaction embeddings, generating foundation model representations ahead of time for predicted future requests. This method, deployed within Meta's advertising system, has shown a 0.67% gain in revenue-driving metrics by decoupling expensive inference from the critical serving path. AI

IMPACT Enables real-time serving of complex recommendation models, potentially improving user experience and revenue for large-scale systems.

RANK_REASON Academic paper introducing a novel framework with real-world deployment results. [lever_c_demoted from research: ic=1 ai=1.0]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Zikun Liu, Liang Luo, Qianru Li, Zhengyu Zhang, Wei Ling, Jingyi Shen, Zeliang Chen, Yaning Huang, Jingxian Huang, Abdallah Aboelela, Chonglin Sun, Feifan Gu, Fenggang Wu, Hang Qu, Huayu Li, Jill Pan, Kaidi Pei, Laming Chen, Longhao Jin, Qin Huang, Tongy… ·

    SOLARIS: Speculative Offloading of Latent-bAsed Representation for Inference Scaling

    arXiv:2604.12110v2 Announce Type: replace Abstract: Recent advances in recommendation scaling laws have led to foundation models of unprecedented complexity. While these models offer superior performance, their computational demands make real-time serving impractical, often forci…