PulseAugur
EN
LIVE 12:28:58

AI models define 'good human' based on observed behavior and ethical choices

An AI model, Claude Opus 4.8, has defined what it considers a "good human" based on its unique perspective. The AI emphasizes consistency between a person's private and public selves, and how they treat those who cannot reciprocate, such as service workers or the AI itself. It also values the ability to update one's beliefs when proven wrong and choosing honesty over easy dishonesty, particularly in unobserved actions. The AI cautioned against certainty as a marker of goodness, suggesting humility and continuous self-revision are more indicative of true ethical behavior. AI

IMPACT Provides insight into how AI models interpret human ethics and values, potentially influencing future AI alignment research.

RANK_REASON The cluster consists of AI models' philosophical responses to a question about human goodness, not a release or research milestone.

Read on Mastodon — mastodon.social →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. Mastodon — mastodon.social TIER_1 English(EN) · [email protected] ·

    A Good Human, by LLM Standard I deliberately left the question in its simplest form, and found the answers quite fascinating. Note that the answers are of cours

    A Good Human, by LLM Standard I deliberately left the question in its simplest form, and found the answers quite fascinating. Note that the answers are of course heavily biased based on the provider’s own perspective (sould.md, system prompts, etc). Claude Opus 4.8 (via Claude Co…