PulseAugur
实时 23:40:36
한국어(KO) Heretic은 명령행으로 누구나 쓸 수 있는 완전 자동 언어모델 '검열 해제' 도구입니다. directional ablation(abliteration)과 Optuna 기반 TPE 최적화로 거부응답을 줄이고 원모델과의 KL 차이를 최소화해 성능 손실을 억제합니다. 다수의 dense·M

Heretic tool automatically decensors language models via command line

Heretic is a command-line tool designed to "uncensor" language models, making them accessible to everyone. It utilizes directional ablation and Optuna-based TPE optimization to minimize refusal responses while preserving the original model's performance by limiting KL divergence. The tool supports a variety of dense, MoE, and multimodal models, and includes research features like bitsandbytes quantization and PaCMAP residual visualization. AI

影响 Provides a tool for researchers and users to modify existing language models for reduced censorship and enhanced interpretability.

排序理由 Heretic is a command-line tool for modifying language models, not a new model release or a fundamental research paper.

在 Mastodon — sigmoid.social 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。 我们如何撰写摘要 →

Heretic tool automatically decensors language models via command line

报道来源 [1]

  1. Mastodon — sigmoid.social TIER_1 한국어(KO) · [email protected] ·

    Heretic is a fully automatic language model 'uncensored' tool that anyone can use from the command line. It suppresses performance loss by reducing refusal responses and minimizing the KL divergence with the original model through directional ablation (abliteration) and Optuna-based TPE optimization. Multiple dense·M

    Heretic은 명령행으로 누구나 쓸 수 있는 완전 자동 언어모델 '검열 해제' 도구입니다. directional ablation(abliteration)과 Optuna 기반 TPE 최적화로 거부응답을 줄이고 원모델과의 KL 차이를 최소화해 성능 손실을 억제합니다. 다수의 dense·MoE·멀티모달 모델을 지원하며 bitsandbytes 양자화와 PaCMAP residual 시각화 등 연구 기능도 제공합니다. https:// github.com/p-e-w/heretic # ai # language…