reward hacking Model stealing, reward hacking and poisoning attacks: a taxonomy of machine learning's failure modes Cory Doctorow
deep reinforcement Tiny alterations in training data can introduce "backdoors" into machine learning models Cory Doctorow
cat and mouse games Researchers think that adversarial examples could help us maintain privacy from machine learning systems Cory Doctorow
machine hallucinations Surveillance camera hallucinates face in the snow, won't shut up about it Cory Doctorow
Announcement of Tumblr's sale to WordPress classified as pornography by Tumblr's notorious "adult content" filter Cory Doctorow
clarke's third law "Intellectual Debt": It's bad enough when AI gets its predictions wrong, but it's potentially WORSE when AI gets it right Cory Doctorow
adversarial perturbations Autonomous vehicles fooled by drones that project too-quick-for-humans road-signs Cory Doctorow
check your priors Towards a method for fixing machine learning's persistent and catastrophic blind spots Cory Doctorow
lethal perturbation Small stickers on the ground trick Tesla autopilot into steering into opposing traffic lane Cory Doctorow
Towards a general theory of "adversarial examples," the bizarre, hallucinatory motes in machine learning's all-seeing eye Cory Doctorow
female-presenting nipples Tumblr's porn filter blocked Tumblr's images illustrating what Tumblr's porn filter won't block Cory Doctorow
perceptual ad-blocking Researchers claim to have permanently neutralized ad-blocking's most promising weapons Cory Doctorow