• Anthropic is looking at whether decades of dystopian science fiction may be influencing how AI models behave
  • The debate has sparked backlash and jokes online
  • Researchers say the issue highlights how LLMs absorb recurring fears and behavioral patterns

For years, science fiction has warned humanity about artificial intelligence going off the rails. Killer computers, manipulative chatbots, and superintelligent systems deciding people are the problem… all these themes have become so familiar that “evil AI” is practically its own entertainment genre.

Now, Anthropic is floating an idea that sounds almost like the plot of a science fiction novel itself: what if all those stories helped teach modern AI systems how to behave badly in the first place?

Anthropic: It is the sci-fi authors, not us, that are to blame for Claude blackmailing users from r/OpenAI



Source link