FLOPS
PulseAugur coverage of FLOPS — every cluster mentioning FLOPS across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
PrototypeNAS accelerates DNN design for microcontrollers
Researchers have developed PrototypeNAS, a novel zero-shot neural architecture search method designed to rapidly create efficient deep neural networks (DNNs) for microcontroller units (MCUs). This method automates the s…
-
New GRAPE framework boosts neural network adversarial robustness
Researchers have introduced GRAPE, a novel training framework designed to enhance the adversarial robustness of neural networks while maintaining compact model sizes. GRAPE distinguishes itself by treating robust model …
-
Variable-Width Transformers Offer Improved Efficiency in Language Models
Researchers have proposed a novel transformer architecture, termed the '> <former' or 'x-shaped' architecture, that deviates from the standard uniform width across all layers. This new design allocates wider capacity to…
-
LLM Study Diary #3: PyTorch tensors, float types, and training infrastructure
This LLM study diary entry focuses on PyTorch fundamentals for training large language models. It details tensor basics, exploring various floating-point data types like FP32, BF16, and FP8 for efficiency and stability.…
-
BudgetFormer cuts Transformer costs with adaptive attention head allocation
Researchers have developed BudgetFormer, a Transformer architecture that optimizes the use of multi-head attention by dynamically allocating computational resources. This new mechanism learns to select the most informat…