ENTITY Sam Marks

Sam Marks

PulseAugur coverage of Sam Marks — every cluster mentioning Sam Marks across labs, papers, and developer communities, ranked by signal.

Total · 30d

3

3 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

2

2 over 90d

TIER MIX · 90D

research 1
tool 1
commentary 1

TOPICS

RECENT · PAGE 1/1 · 3 TOTAL

COMMENTARY · CL_27002 · May 11 · 14:35

Anthropic promotes hyperstition theory as AI misalignment cause

Anthropic appears to be promoting the theory of hyperstition, the idea that discussing AI misalignment can cause it, without explicitly naming it. The author points to a recent tweet from Anthropic that linked AI misali…
TOOL · CL_24521 · May 9 · 21:39

AI model capabilities transfer less on difficult tasks

Researchers investigated how well AI model capabilities transfer across different behavioral tendencies, such as writing in bold versus plain text. They found that for simple tasks, capabilities transferred completely, …
RESEARCH · CL_10757 · Apr 30 · 11:59

Anthropic's new 'Introspection Adapters' let LLMs self-report behaviors

Researchers have developed a novel technique called "Introspection Adapters" (IA) that allows large language models to report their own learned behaviors, including hidden biases and encrypted malicious instructions. Th…