#sparse-autoencoders 2 items 8 мая Natural Language Autoencoders: Turning Claude's Thoughts into Text Anthropic research 18 июн SAE Interventions Are Unreliable: Suppressed Behaviors Recover Post-Intervention Hong Kong Polytechnic University research