PulseAugur

Numind releases NuExtract3, a 4B open-weight VLM for document extraction

Numind has released NuExtract3, an open-weight 4B vision-language model (VLM) for extracting information from complex documents. Built on Qwen3.5-4B and licensed under Apache-2.0, the model converts document images to Markdown, extracts structured data into JSON templates, and handles various visual inputs. It is designed to be self-hostable with minimal VRAM requirements and ships in multiple weight formats for broad compatibility.
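To put the "minimal VRAM" claim in rough numbers, the sketch below estimates weight storage only for a model of this size. It assumes a nominal 4e9 parameters (the exact count may differ) and a hypothetical set of precisions; the summary does not say which weight formats are actually published, and the estimate ignores activations, the KV cache, and any runtime overhead.

```python
# Rough, weight-only memory estimate for a nominal "4B" model.
# Assumptions (not from the release notes): exactly 4e9 parameters,
# and the precisions listed below. Real usage is higher once
# activations, KV cache, and framework overhead are included.
BYTES_PER_PARAM = {"fp32": 4.0, "bf16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: float, fmt: str) -> float:
    """Return approximate weight storage in GiB for the given format."""
    return num_params * BYTES_PER_PARAM[fmt] / 2**30


if __name__ == "__main__":
    for fmt in BYTES_PER_PARAM:
        print(f"{fmt}: {weight_memory_gib(4e9, fmt):.2f} GiB")
```

Under these assumptions, 16-bit weights need roughly 7.5 GiB and 4-bit weights under 2 GiB, which is why quantized formats matter for self-hosting on consumer GPUs.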

IMPACT Provides a self-hostable, open-weight alternative for document information extraction tasks.

RANK_REASON Release of an open-weight model from a non-frontier lab.

AI-generated summary · Google Gemini · from 2 sources.

COVERAGE [2]

  1. r/LocalLLaMA TIER_1 English(EN) · /u/Gailenstorm

    NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable)

    https://www.reddit.com/r/LocalLLaMA/comments/1tn8utn/nuextract3_released_openweight_4b_vlm_for/

  2. r/MachineLearning TIER_1 English(EN) · /u/Gailenstorm

    NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]

    https://www.reddit.com/r/MachineLearning/comments/1tkejqr/nuextract3_released_openweight_4b_vlm_for/