DLRM
PulseAugur coverage of DLRM — every cluster mentioning DLRM across labs, papers, and developer communities, ranked by signal.
2 days with sentiment data
-
New metric quantifies AI model vulnerability to hardware faults
Researchers have developed a new metric called Parameter Vulnerability Factor (PVF) to quantify the susceptibility of AI models to hardware faults, specifically silent data corruptions (SDCs). This metric aims to standa…
-
New lossless compression speeds up ML training and inference
Researchers have developed a new lossless compression algorithm called Invariant Bit Packing (IBP) to address GPU memory limitations in machine learning. IBP identifies and removes redundant bits across tensor groups, e…
-
New ML-based GPU caching algorithm LCR boosts LLM inference speed
Researchers have developed a new GPU caching algorithm called Learning-Augmented LRU (LALRU) designed to improve efficiency during AI inference. This algorithm integrates learned predictions with caching policies to ens…