A user tested the Qwen3.6 35B model to compare output quality and efficiency across different formatting styles: raw text, markdown, unstyled HTML, and styled HTML. The experiment revealed that while markdown produced the best overall quality score from ChatGPT 5.5 Extended Reasoning, unstyled and styled HTML formats resulted in significantly larger token counts and longer generation times. Raw text was the most token-efficient for content but lacked formatting. AI
IMPACT Demonstrates that while HTML can increase token counts significantly, markdown remains a strong choice for balanced quality and efficiency in LLM responses.
RANK_REASON User-generated benchmark comparing model output formats. [lever_c_demoted from research: ic=1 ai=1.0]
AI-generated summary · Google Gemini · from 1 sources. How we write summaries →