NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable) [P]
Numind has released NuExtract3, an open-weight 4B visual language model designed for extracting information from complex documents. Built on Qwen3.5-4B and licensed under Apache-2.0, this model can convert document images to Markdown, extract structured data into JSON templates, and handle various visual inputs. It is designed to be self-hostable with minimal VRAM requirements and offers multiple weight formats for broad compatibility. AI
IMPACT Provides a self-hostable, open-weight alternative for document information extraction tasks.