PulseAugur
EN
LIVE 21:29:58

New 'vintage' LLM trained on pre-1931 English text

A new language model, talkie-1930-13b-it, has been developed, trained exclusively on English text from before 1931. This 13 billion parameter model is an instruction-tuned version of talkie-1930-13b-base, which was initially trained on 260 billion tokens. The fine-tuning process utilized a unique dataset of instruction-response pairs derived from historical reference works, followed by reinforcement learning to enhance its ability to follow instructions. AI

IMPACT This model offers a novel approach to historical language simulation, potentially enabling new research into past communication styles and knowledge.

RANK_REASON The cluster describes a new language model release with a unique training dataset and methodology, accompanied by a report and GitHub repository. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/pmttyji ·

    model : add support for talkie-1930-13b by niklassheth · Pull Request #22596 · ggml-org/llama.cpp

    <!-- SC_OFF --><div class="md"><blockquote> <p><a href="https://huggingface.co/talkie-lm/talkie-1930-13b-it">https://huggingface.co/talkie-lm/talkie-1930-13b-it</a> </p> </blockquote> <p><strong>talkie-1930-13b-it</strong> </p> <p>talkie-1930-13b-it is a 13B vintage language mode…