PulseAugur
实时 23:58:20

Researchers propose new bootstrap methods for AI-generated labels in economics

A new paper from Timothy Christensen proposes a coupled-label bootstrap method to address biases in OLS estimators that arise when using AI/ML-generated labels as covariates in economic regressions. The research highlights that standard fixed-label bootstrap methods are often invalid unless specific independence conditions are met. The proposed coupled-label bootstrap jointly resamples true and imputed labels, offering a more robust solution without these stringent conditions, and includes finite-sample adjustments for improved accuracy. This work is illustrated with simulations and applied to analyze the relationship between wages and remote work status. AI

影响 Provides a statistical method to improve the reliability of economic analyses that incorporate AI-generated data labels.

排序理由 Academic paper on a statistical method for using AI-generated labels in economic regressions.

在 Eugene Yan 阅读 →

AI 生成摘要 · Google Gemini · 来自 3 个来源。 我们如何撰写摘要 →

Researchers propose new bootstrap methods for AI-generated labels in economics

报道来源 [3]

  1. arXiv stat.ML TIER_1 English(EN) · Timothy Christensen, Silvia Goncalves, Benoit Perron ·

    Bootstrapping with AI/ML-generated labels

    arXiv:2604.23770v1 Announce Type: cross Abstract: AI/ML methods are increasingly used in economics to generate binary variables (or labels) via classification algorithms. When these generated variables are included as covariates in regressions, even small misclassification errors…

  2. arXiv stat.ML TIER_1 English(EN) · Benoit Perron ·

    Bootstrapping with AI/ML-generated labels

    AI/ML methods are increasingly used in economics to generate binary variables (or labels) via classification algorithms. When these generated variables are included as covariates in regressions, even small misclassification errors can induce large biases in OLS estimators and inv…

  3. Eugene Yan TIER_1 English(EN) ·

    Bootstrapping Labels via ___ Supervision & Human-In-The-Loop

    How to generate labels from scratch with semi, active, and weakly supervised learning.