Donate your coding sessions to an open CC-BY-4.0 dataset to help train open-weight and open source models
A new initiative called Trace Commons is aiming to create an open dataset of coding sessions to counter the data advantage held by large AI labs like Anthropic and OpenAI. The project encourages individuals to donate their coding traces, allowing open-weight and open-source models to be trained on this valuable data. This effort seeks to democratize access to training data and prevent an oligopoly in AI development. AI
IMPACT Could democratize access to coding data for open-source models, potentially leveling the playing field against proprietary AI labs.