Last May, Dave at Euri.ca took at crack at expanding Gabriel Rossman's excellent post on spurious correlation in data. It's an important read for anyone wondering whether the core hypothesis of the Big Data movement is that every sufficiently large pile of horseshit must have a pony in it somewhere. As O'Reilly's Nat Torkington says, "Anyone who thinks it’s possible to draw truthful conclusions from data analysis without really learning statistics needs to read this."
* If good looks and smarts are distributed normally, and
If good looks and smarts have nothing to do with each other, and
If movie producers want both smarts and looks
Then, by observing employed actors we’ll assume that looks and smarts have a negative correlation
Even though we constructed this experiment with no correlation
Here’s a graph of 250 randomly generated points (with no correlation). With the red circles representing “actors who are smart and good looking enough to get a job (looks+smarts>2), and lighter blue x’s representing “people who wanted to be actors”
Clearly if we only look at actors with jobs, we’ll see a clearly negative correlation between smarts and good looks. In fact, some brilliant actors are less attractive than an average person, and some gorgeous actors are dumber than an average person. Even more interesting though, is that if we try to rule out bias by looking at aspiring but unsuccessful actors as well, we’ll find that they exhibit a similar correlation...
You’re probably polluting your statistics more than you think
(via O'Reilly Radar)
It turns out that folding a pizza slice lengthwise to improve its rigidity is a great example of the “Remarkable Theorem” by Gauss. Cliff Stoll explains.
Data-scientist Kevin H Wilson argues that computers are tools for manipulating data — from companies’ sales data to the input from games controllers — but we teach computer programming as either a way to make cool stuff (like games) or as a gateway to “rigorous implementation details of complicated language,” while we should be focusing […]
Meet Danica McKellar who as an undergraduate in college co-published a paper titled “Percolation and Gibbs states multiplicity for ferromagnetic Ashkin-Teller models on Z2,” research that resulted in the Chayes–McKellar–Winn theorem. Oh yeah, before that, McKellar was Winnie on The Wonder Years. (And just to confirm, Josh Saviano who played Paul Pfeiffer did not grow […]
Looks like all of your potential employers are hiring candidates with programming skills (which you don’t have). With all of the languages out there today, it’s tough to know where to start.With the Complete Front-End to Back-End Coding Bundle, you can beef your resume up in all the right places, no confusion necessary. This package of […]
Those of us who love music wish we could listen to it 24/7. But it’s impossible when we’re trying to converse with our friends, or when are swimming in the local pool.That is, until now. The KOAR Bone Conduction Bluetooth Headset, now 48% off, has changed the audio game.Made with lightweight titanium memory metal, this headset boasts patented bone conduction technology to transport sound […]
It’s one thing to enjoy dinner at home and a nice glass of Cabernet Sauvignon with your best friend, Netflix, but it’s another thing entirely to make that meal from scratch and get that wine delivered right to your doorstep.But what if we told you there’s a way to make this possible? To keep your social life, […]