Anthropic disrupts cybercriminal using AI for large-scale theft and extortion security Article 1068 words 17 votes
Persona vectors: monitoring and controlling character traits in language models Article 1381 words 13 votes
Subliminal learning: Language models transmit behavioral traits via hidden signals in data Article 627 words 21 votes
Anthropic announces New Claude 3.5 Sonnet, Claude 3.5 Haiku and the Computer Use API Article 2288 words 19 votes