PulseAugur
实时 16:12:19
实体 EleutherAI

EleutherAI

PulseAugur coverage of EleutherAI — every cluster mentioning EleutherAI across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
7
90 天内 7
发布 · 30天
0
90 天内 0
论文 · 30天
4
90 天内 4
层级分布 · 90 天
情绪 · 30 天

1 天有情绪数据

最近 · 第 1/1 页 · 共 7 条
  1. TOOL · CL_31715 ·

    使用Qwen2.5-0.5B评估LLM的成本低于1美元

    这篇博文详细介绍了一种经济高效的评估大型语言模型的方法,证明了运行全面的基准测试的成本可以低于一美元。作者使用免费的Google Colab T4实例在三个不同的任务上测试了Qwen2.5-0.5B模型:GSM8K用于数学推理,HellaSwag用于常识,TruthfulQA-MC2用于真实性。实验重点是测量运行时间和成本,利用lm-evaluation-harness并进行特定调整以优化性能和降低费用,例如限制生成令牌的长度。

  2. RESEARCH · CL_14791 ·

    AI Safety Bootcamp Oxford offers technical and generalist tracks

    OAISI is organizing its fourth AI Safety Research Bootcamp (ARBOx4) in Oxford from June 28 to July 10, 2026. The program offers two tracks: a Technical Research Stream focusing on ML safety techniques and a new Generali…

  3. RESEARCH · CL_09277 ·

    AI model evaluations are becoming a costly bottleneck, surpassing training expenses

    AI model evaluations are becoming prohibitively expensive, with recent benchmarks costing tens of thousands of dollars and consuming thousands of GPU hours. This high cost is particularly pronounced for agent-based eval…

  4. RESEARCH · CL_00954 ·

    EleutherAI发布开源工具用于解释AI模型特征

    EleutherAI发布了一个开源库,用于自动解释稀疏自编码器中的特征,这是一种用于分解模型激活的方法。该工具利用Llama 3.1和Claude 3.5 Sonnet等大型语言模型为这些特征生成自然语言解释,与之前的手动方法相比,大大降低了成本和工作量。该库旨在使社区更容易研究这些可解释的特征。

  5. SIGNIFICANT · CL_00112 ·

    OpenAI launches EU AI Blueprint 2.0, signs EU Code of Practice

    OpenAI has launched its EU Economic Blueprint 2.0, aiming to boost AI adoption across Europe by training 20,000 SMEs and providing grants for youth safety research. The initiative highlights a "capability overhang" wher…

  6. RESEARCH · CL_00966 ·

    Safetensors library audited as secure, set to become default for ML models

    The safetensors library, developed by Hugging Face in collaboration with EleutherAI and Stability AI, has undergone a security audit by Trail of Bits, confirming its safety. This audit allows the organizations to move t…

  7. RESEARCH · CL_00875 ·

    RWKV project revives RNNs to challenge Transformer dominance in LLMs

    The RWKV (Receptance Weighted Key Value) project introduces a novel architecture that revives Recurrent Neural Networks (RNNs) while incorporating advantages typically found in Transformers. This approach aims to overco…