Brief

last 24h

[3/3] 224 sources

Multi-source AI news clustered, deduplicated, and scored 0–100 across authority, cluster strength, headline signal, and time decay.

TOOL · arXiv stat.ML English(EN) · 10h

Optimality in importance sampling: a gentle survey

This paper provides a comprehensive review of optimality within importance sampling techniques, a critical component for the performance of Monte Carlo sampling methods. It explores various frameworks for designing adaptive proposal densities, including marginal likelihood approximation for model selection, the use of multiple proposal densities, and sequences of tempered posteriors. The survey also delves into applications in noisy scenarios such as approximate Bayesian computation and reinforcement learning, offering theoretical and empirical comparisons. AI

IMPACT Provides a theoretical foundation for advanced sampling techniques used in AI research.
RESEARCH · arXiv cs.AI English(EN) · 2w · [2 sources]

Quotient DAGs for Off-Policy Evaluation:Forward-Flow Importance Sampling and Exact Slate Propensities

Two new research papers explore advanced techniques for off-policy evaluation (OPE) in machine learning, a critical process for assessing the performance of new policies using existing data. The first paper introduces "Quotient DAGs" to handle situations where the reward depends only on an unordered set of items, even if the generation process is ordered, thereby reducing nuisance variance. The second paper, "CANDOR," proposes a doubly robust OPE estimator that effectively leverages imperfect expert-annotated counterfactual samples, particularly for healthcare applications, by incorporating annotations into the reward model component. AI

IMPACT These papers introduce novel methods for off-policy evaluation, potentially improving the reliability and safety of deploying new policies in critical domains like healthcare.
TOOL · arXiv cs.LG English(EN) · 1mo

Disagreement-Regularized Importance Sampling for Adversarial Label Corruption

Researchers have developed a new sub-sampling method called Disagreement-Regularized Importance Sampling (DR-IS) to improve robustness against adversarial label corruption in machine learning. This method leverages the disagreement in loss rankings across independent proxy ensembles to identify and down-weight corrupted data points. DR-IS provides theoretical guarantees on sample concentration and contamination bounds, demonstrating empirical superiority over magnitude-based methods like EL2N, particularly under targeted attacks. AI

IMPACT Enhances machine learning model reliability by providing a robust method to handle noisy or intentionally corrupted labels.

Brief

Optimality in importance sampling: a gentle survey

Quotient DAGs for Off-Policy Evaluation:Forward-Flow Importance Sampling and Exact Slate Propensities

Disagreement-Regularized Importance Sampling for Adversarial Label Corruption