New 'vintage' LLM trained on pre-1931 English text

By PulseAugur Editorial · [1 sources] · 2026-05-26 06:18

A new language model, talkie-1930-13b-it, has been developed, trained exclusively on English text from before 1931. This 13 billion parameter model is an instruction-tuned version of talkie-1930-13b-base, which was initially trained on 260 billion tokens. The fine-tuning process utilized a unique dataset of instruction-response pairs derived from historical reference works, followed by reinforcement learning to enhance its ability to follow instructions. AI

IMPACT This model offers a novel approach to historical language simulation, potentially enabling new research into past communication styles and knowledge.

RANK_REASON The cluster describes a new language model release with a unique training dataset and methodology, accompanied by a report and GitHub repository. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

New 'vintage' LLM trained on pre-1931 English text

COVERAGE [1]

r/LocalLLaMA TIER_1 English(EN) · /u/pmttyji · 2026-05-26 06:18

model : add support for talkie-1930-13b by niklassheth · Pull Request #22596 · ggml-org/llama.cpp

<div class="md"><blockquote> <a href="https://huggingface.co/talkie-lm/talkie-1930-13b-it">https://huggingface.co/talkie-lm/talkie-1930-13b-it</a> </blockquote> talkie-1930-13b-it talkie-1930-13b-it is a 13B vintage language mode…

COVERAGE [1]

model : add support for talkie-1930-13b by niklassheth · Pull Request #22596 · ggml-org/llama.cpp

RELATED TOPICS