Last May, Dave at Euri.ca took a crack at expanding Gabriel Rossman's excellent post on spurious correlation in data. It's an important read for anyone wondering whether the core hypothesis of the Big Data movement is that every sufficiently large pile of horseshit must have a pony in it somewhere. As O'Reilly's Nat Torkington says, "Anyone who thinks it’s possible to draw truthful conclusions from data analysis without really learning statistics needs to read this."
* If good looks and smarts are distributed normally, and
* If good looks and smarts have nothing to do with each other, and
* If movie producers want both smarts and looks,
* Then, by observing employed actors, we’ll assume that looks and smarts have a negative correlation,
* Even though we constructed this experiment with no correlation.
Here’s a graph of 250 randomly generated points (with no correlation), with the red circles representing “actors who are smart and good looking enough to get a job” (looks+smarts>2), and lighter blue x’s representing “people who wanted to be actors.”
If we only look at actors with jobs, we’ll see a clearly negative correlation between smarts and good looks. In fact, some brilliant actors are less attractive than an average person, and some gorgeous actors are dumber than an average person. Even more interesting, though, is that if we try to rule out bias by looking at aspiring but unsuccessful actors as well, we’ll find that they exhibit a similar correlation...
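The thought experiment is easy to try for yourself. What follows is a minimal sketch, not Dave's original code: the function names `pearson` and `simulate`, the seed, and the choice of standard normal scores are all assumptions made for illustration. Only the 250 points and the looks+smarts>2 hiring rule come from the quoted post.

```python
import random
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def simulate(n=250, cutoff=2.0, seed=0):
    """Draw independent looks and smarts scores, then split by the hiring rule.

    Assumption: both traits are standard normal and independent, so any
    correlation inside a subgroup comes from the selection step alone.
    """
    rng = random.Random(seed)
    people = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]

    # Hiring rule from the post: looks + smarts must exceed the cutoff.
    employed = [p for p in people if p[0] + p[1] > cutoff]
    rejected = [p for p in people if p[0] + p[1] <= cutoff]

    return {
        "everyone": pearson(*zip(*people)),
        "employed": pearson(*zip(*employed)),
        "rejected": pearson(*zip(*rejected)),
    }


if __name__ == "__main__":
    for group, r in simulate().items():
        print(f"{group:>8}: r = {r:+.3f}")
```

With only 250 points, roughly 20 people clear the cutoff, so the numbers jump around from seed to seed. Raising `n` (say, `simulate(n=100_000)`) should make the pattern steadier: close to zero for everyone, strongly negative among the employed, and negative (though weaker, in this setup) among the rejected.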
You’re probably polluting your statistics more than you think
(via O'Reilly Radar)