PulseAugur

FineWeb-Gutenberg corpus

PulseAugur coverage of the FineWeb-Gutenberg corpus: every cluster that mentions it across labs, papers, and developer communities, ranked by signal.

Total · 30 days: 1 (90 days: 1)
Releases · 30 days: 0 (90 days: 0)
Papers · 30 days: 1 (90 days: 1)
Tier distribution · 90 days
Sentiment · 30 days

1 day with sentiment data

Recent · Page 1 of 1 · 1 item
  1. RESEARCH · CL_48726

    FIM pretraining increases language model verbatim memorization

    Researchers investigated how the fill-in-the-middle (FIM) pretraining objective affects language model memorization compared to standard left-to-right (LTR) training. Their study, using Llama 3.2 models and a corpus wi…