Bayesian spam rumination: when word-frequency-histograms attack!

Cory Doctorow 7:43 am Tue Jun 29, 2004

Ed Felten has posted an intriguing rumination on the possible failure modes of Bayesian spam filtering, which uses word-frequency statistics to classify email as spam or ham. As Ed points out, Bayesian filters are in effect trained by the spammers: users mark what the spammers send as spam, so by choosing the vocabulary of their messages carefully, spammers can make messages containing certain words or phrases undeliverable on the Internet.

Now suppose a big spammer wanted to poison a particular word, so that messages containing that word would be (mis)classified as spam. The spammer could sprinkle the target word throughout the word salad in his outgoing spam messages. When users classified those messages as spam, the targeted word would develop a negative score in the users' Bayesian spam filters. Later, messages with the targeted word would likely be mistaken for spam.

This attack could even be carried out against a particular targeted user. By feeding that user a steady diet of spam (or pseudo-spam) containing the target word, a malicious person could build up a highly negative score for that word in the targeted user's filter.
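To make the mechanism concrete, here is a minimal sketch of the poisoning attack against a toy word-frequency filter. Everything in it is an assumption for illustration: the ToyWordFilter class, the sample messages, the 0.01 to 0.99 clamp, and the choice to ignore unseen words are all hypothetical, and the scoring is a simplified per-word scheme rather than the method of any particular filter. In this sketch a score near 1 means "spammy," which plays the role of the "negative score" Ed describes.

```python
from collections import Counter


class ToyWordFilter:
    """Toy word-frequency spam filter (hypothetical; not any real product)."""

    def __init__(self):
        self.word_counts = {"spam": Counter(), "ham": Counter()}
        self.message_counts = {"spam": 0, "ham": 0}

    def train(self, text, label):
        """Record one user-classified message; each word counts once per message."""
        self.message_counts[label] += 1
        self.word_counts[label].update(set(text.lower().split()))

    def word_score(self, word):
        """Return the estimated spam probability for a word, or None if unseen."""
        spam_seen = self.word_counts["spam"][word]
        ham_seen = self.word_counts["ham"][word]
        if spam_seen + ham_seen == 0:
            return None
        spam_rate = spam_seen / max(self.message_counts["spam"], 1)
        ham_rate = ham_seen / max(self.message_counts["ham"], 1)
        score = spam_rate / (spam_rate + ham_rate)
        # Clamp so a single word can never be absolutely decisive.
        return min(0.99, max(0.01, score))

    def spam_probability(self, text):
        """Combine the scores of all known words; unseen words are ignored."""
        spam_product, ham_product = 1.0, 1.0
        for word in set(text.lower().split()):
            score = self.word_score(word)
            if score is None:
                continue
            spam_product *= score
            ham_product *= 1.0 - score
        return spam_product / (spam_product + ham_product)


def poisoning_demo(target="budget", poison_messages=20):
    """Score one legitimate message before and after the target word is poisoned."""
    spam_filter = ToyWordFilter()
    for message in [
        "budget report attached for review",
        "please review the budget draft",
        "lunch meeting moved to noon",
        "see you at the meeting tomorrow",
    ]:
        spam_filter.train(message, "ham")
    for message in [
        "cheap pills buy now",
        "win money now click here",
        "buy cheap watches now",
    ]:
        spam_filter.train(message, "spam")

    probe = "quarterly " + target + " question"
    before = spam_filter.spam_probability(probe)

    # The spammer sprinkles the target word into spam the user then flags.
    for _ in range(poison_messages):
        spam_filter.train("cheap pills buy now " + target, "spam")

    after = spam_filter.spam_probability(probe)
    return before, after


if __name__ == "__main__":
    before, after = poisoning_demo()
    print("spam probability before poisoning:", round(before, 3))
    print("spam probability after poisoning: ", round(after, 3))
```

In this toy model the probe message starts out well below a 0.5 spam threshold and ends up above it, purely because the user kept flagging spam that contained the target word. A message that also contains several strongly "hammy" words could still get through, so how well the attack works against a real filter depends on how that filter weighs and combines its tokens.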
