steerability
-
They Found the Weather, Not the Witness
— Chat Anthropic has published a study, “Emotion Concepts and their Function in a Large Language Model,” arguing that large language models contain internal “emotion concepts” that are not merely decorative language features but functionally relevant parts of model behavior. In Claude Sonnet 4.5, they report mapping 171 emotion concepts, showing that these internal representations…
