Brief · PulseAugur

RESEARCH · arXiv cs.LG English(EN) · 4d · [3 sources]

Scaling Neural Network Verification with Tensor Parallelism and Fully Sharded Data Parallelism

Researchers have adapted tensor parallelism and fully sharded data parallelism techniques, typically used for training large models, to improve the scalability of neural network verification. These methods address the GPU memory limitations that have previously constrained formal verification algorithms. The study demonstrates significant memory reductions, with FSDP achieving up to 90% baseline memory drops while maintaining bitwise identical bounds to single-GPU systems. AI

IMPACT Enables verification of larger and more complex neural networks, crucial for safety-critical AI applications.

CIFAR-100
ResNet-large
VNN-COMP 2022
auto_LiRPA
MNIST-FC
VNN-COMP 2024
alpha_beta-CROWN
Tensor Parallelism
CIFAR-100 ResNet-large
Fully Sharded Data Parallelism