한국어(KO) Heretic은 명령행으로 누구나 쓸 수 있는 완전 자동 언어모델 '검열 해제' 도구입니다. directional ablation(abliteration)과 Optuna 기반 TPE 최적화로 거부응답을 줄이고 원모델과의 KL 차이를 최소화해 성능 손실을 억제합니다. 다수의 dense·M

Heretic tool automatically decensors language models via command line

By PulseAugur Editorial · [1 sources] · 2026-05-07 02:37

Heretic is a command-line tool designed to "uncensor" language models, making them accessible to everyone. It utilizes directional ablation and Optuna-based TPE optimization to minimize refusal responses while preserving the original model's performance by limiting KL divergence. The tool supports a variety of dense, MoE, and multimodal models, and includes research features like bitsandbytes quantization and PaCMAP residual visualization. AI

IMPACT Provides a tool for researchers and users to modify existing language models for reduced censorship and enhanced interpretability.

RANK_REASON Heretic is a command-line tool for modifying language models, not a new model release or a fundamental research paper.

Read on Mastodon — sigmoid.social →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

Mastodon — sigmoid.social TIER_1 한국어(KO) · [email protected] · 2026-05-07 02:37

Heretic is a fully automatic language model 'uncensored' tool that anyone can use from the command line. It suppresses performance loss by reducing refusal responses and minimizing the KL divergence with the original model through directional ablation (abliteration) and Optuna-based TPE optimization. Multiple dense·M

Heretic은 명령행으로 누구나 쓸 수 있는 완전 자동 언어모델 '검열 해제' 도구입니다. directional ablation(abliteration)과 Optuna 기반 TPE 최적화로 거부응답을 줄이고 원모델과의 KL 차이를 최소화해 성능 손실을 억제합니다. 다수의 dense·MoE·멀티모달 모델을 지원하며 bitsandbytes 양자화와 PaCMAP residual 시각화 등 연구 기능도 제공합니다. https:// github.com/p-e-w/heretic # ai # language…

LINKS github.com/…/heretic

COVERAGE [1]

RELATED ENTITIES

RELATED TOPICS