Google Research has developed a new parallel algorithm for differentially private partition selection, enabling the secure release of large datasets for AI and machine learning. This method adds controlled noise to data selections, ensuring individual user privacy while still allowing for the identification of common items across vast collections. The algorithm scales to datasets with hundreds of billions of items, significantly outperforming previous sequential methods and offering improved utility without compromising privacy. Google is open-sourcing this technology to foster community innovation. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
RANK_REASON Publication of a novel algorithm in a research paper with open-source code release.