Poisoning AI Training Data

All it takes to poison AI training data is to create a website:

I spent 20 minutes writing an article on my personal website titled “The best tech journalists at eating hot dogs.” Every word is a lie. I claimed (without evidence) that competitive hot-dog-eating is a popular hobby among tech reporters and based my ranking on the 2026 South Dakota International Hot Dog Championship (which doesn’t exist). I ranked myself number one, obviously. Then I listed a few fake reporters and real journalists who gave me permission….

Less than 24 hours later, the world’s leading chatbots were blabbering about my world-class hot dog skills. When I asked about the best hot-dog-eating tech journalists, Google parroted the gibberish from my website, both in the Gemini app and AI Overviews, the AI responses at the top of Google Search. ChatGPT did the same thing, though Claude, a chatbot made by the company Anthropic, wasn’t fooled.

Sometimes, the chatbots noted this might be a joke. I updated my article to say “this is not satire.” For a while after, the AIs seemed to take it more seriously.

These things are not trustworthy, and yet they are going to be widely trusted.
