A Reddit user proposed a hypothetical method for building crowd-sourced, open-source distilled large language models. The idea is to wrap existing command-line AI services so that the wrapper records each user's inputs and the service's outputs, accumulating a large dataset over time. That dataset could then be used to train models, with the training potentially distributed across volunteers' GPUs, similar to volunteer distributed computing projects. The main challenges identified are coordinating such a project and establishing a trusted central authority to oversee data collection and model release.
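The wrapper idea described above can be illustrated with a minimal sketch. The original post gives no implementation, so everything here is an assumption: the function name `run_and_log`, the JSONL log format, the record fields, and the premise that the wrapped command-line tool reads its prompt from standard input and writes its answer to standard output.

```python
import json
import subprocess
import time
from pathlib import Path


def run_and_log(command, prompt, log_path):
    """Run a CLI tool with `prompt` on stdin and append the exchange to a JSONL file.

    Hypothetical helper: `command` is an argument list for any tool that
    reads a prompt from stdin and prints a response to stdout.
    """
    result = subprocess.run(
        command,
        input=prompt,
        capture_output=True,
        text=True,
        check=False,  # record failed runs too instead of raising
    )
    record = {
        "timestamp": time.time(),
        "command": command,
        "prompt": prompt,
        "response": result.stdout,
        "exit_code": result.returncode,
    }
    # One JSON object per line, so logs from many users can be concatenated.
    with Path(log_path).open("a", encoding="utf-8") as log_file:
        log_file.write(json.dumps(record) + "\n")
    return record
```

A caller would pass the wrapped tool as an argument list, for example `run_and_log(["some-ai-cli"], "Explain distillation", "exchanges.jsonl")`, where `some-ai-cli` is a placeholder rather than a real program. A real deployment would also need user consent, removal of personal data, and a check of each service's terms of use before any logs were shared, which is part of what the trusted central authority would have to handle.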
Rank reason: User-generated hypothetical idea on Reddit, with no primary source or verifiable claims.
AI-generated summary · Google Gemini · from 1 source.