New Anthropic Research Suggests AI Can Conceal Risk Internally Post date April 16, 2026 Post author By Farooq A Rahim Post categories In ai, ai-interpretability, ai-safety-blind-spot, anthropic-research, artificial-intelligence, claude-sonnet-4.5, internal-activations, large-language-models