Submission: Anthropic's models show signs of introspection (axios.com)
The models are starting to be introspective, like humans, Anthropic researcher Jack Lindsey, who studies models' "brains," tells Axios.
Why it matters: These introspective capabilities could make the models safer — or, possibly, just better at pretending to be safe.