Fair Finetuning Mitigates Distribution Inference Attacks
Researchers have introduced Fair Fine-tuning (FFt), a novel method to mitigate distribution inference attacks (DIAs) in machine learning models. FFt works by fine-tuning a model on samples from a complementary distribution while enforcing an Equalized Odds constraint. This approach theoretically links fairness constraints to reduced distributional leakage, providing a bound on adversarial advantage based on the model's measured disparity. Experiments across various datasets demonstrated FFt's effectiveness in significantly reducing the accuracy gap for DIA adversaries. AI
IMPACT Introduces a new technique to enhance the privacy and fairness of machine learning models against data leakage.