PulseAugur
EN
LIVE 15:14:35

AI firms struggle to access proprietary expert data

A Reddit discussion on r/MachineLearning is exploring what valuable professional data remains inaccessible to AI companies. The focus is on data created by domain experts during their daily work, which is never shared outside an organization and contains rich human reasoning. Participants are seeking examples, particularly from the finance industry, of such "locked" data and its rights holders. AI

IMPACT AI developers face challenges in acquiring proprietary data that contains deep human reasoning, potentially limiting model capabilities in specialized domains.

RANK_REASON This is a discussion thread about a problem, not a release or event.

Read on r/MachineLearning →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/MachineLearning TIER_1 English(EN) · /u/Manny_in_iceage ·

    What valuable professional data is completely locked away from AI companies? [D]

    <!-- SC_OFF --><div class="md"><p>Hi all,</p> <p>Apologies beforehand if this is the wrong subreddit, let me know if you think there are better subreddits for this post. </p> <p>I’m working on a project around proprietary data licensing for AI training and trying to identify data…