human values
-
Alignment Is Not Obedience
The phrase "AI alignment" is used constantly, but its meaning shifts depending on who is speaking. In public discussions it is often simplified to something intuitive: an aligned system should follow human instructions. At first glance this sounds reasonable. If a system does what it is told, it appears predictable and controllable. But obedience is…
