Researchers isolate memorization from problem-solving in AI neural networks Article 1447 words 12 votes
We're launching Stargate Norway, OpenAI's first AI data center initiative in Europe under our OpenAI for Countries program Link 9 votes
Persona vectors: monitoring and controlling character traits in language models Article 1381 words 13 votes
Subliminal learning: Language models transmit behavioral traits via hidden signals in data Article 627 words 21 votes
No, of course I can! Refusal mechanisms can be exploited using harmless fine-tuning data. security Article published Feb 14 2025 9 votes
AI coding tools make developers slower but they think they're faster, study finds Article 724 words 40 votes
Cats confuse reasoning LLM: Query-agnostic adversarial triggers for reasoning models Article published Mar 4 2025 24 votes