Last May, Dave at Euri.ca took a crack at expanding Gabriel Rossman's excellent post on spurious correlation in data. It's an important read for anyone wondering whether the core hypothesis of the Big Data movement is that every sufficiently large pile of horseshit must have a pony in it somewhere. As O'Reilly's Nat Torkington says, "Anyone who thinks it’s possible to draw truthful conclusions from data analysis without really learning statistics needs to read this."
* If good looks and smarts are distributed normally, and
* If good looks and smarts have nothing to do with each other, and
* If movie producers want both smarts and looks
* Then, by observing employed actors we’ll assume that looks and smarts have a negative correlation
* Even though we constructed this experiment with no correlation
Here’s a graph of 250 randomly generated points (with no correlation), with the red circles representing “actors who are smart and good looking enough to get a job” (looks+smarts>2), and lighter blue x’s representing “people who wanted to be actors.”
If we only look at actors with jobs, we’ll see a clearly negative correlation between smarts and good looks. In fact, some brilliant actors are less attractive than an average person, and some gorgeous actors are dumber than an average person. Even more interesting, though, is that if we try to rule out bias by looking at aspiring but unsuccessful actors as well, we’ll find that they exhibit a similar correlation...
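The experiment is easy to try at home. Here's a minimal sketch, assuming looks and smarts are independent standard-normal draws and using the same looks+smarts>2 cutoff as the graph. The function names (`pearson`, `simulate`), the seed, and the larger second sample are illustrative choices of mine, not something taken from Dave's post.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def simulate(n=250, cutoff=2.0, seed=0):
    """Draw n aspiring actors with independent looks and smarts, then
    report the looks/smarts correlation for everyone, for those who
    clear the cutoff ("hired"), and for those who do not ("rejected")."""
    rng = random.Random(seed)
    people = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
    hired = [p for p in people if p[0] + p[1] > cutoff]
    rejected = [p for p in people if p[0] + p[1] <= cutoff]

    def corr(group):
        # Too few points to define a correlation: report NaN instead.
        if len(group) < 2:
            return float("nan")
        looks, smarts = zip(*group)
        return pearson(looks, smarts)

    return {
        "everyone": corr(people),
        "hired": corr(hired),
        "rejected": corr(rejected),
    }


if __name__ == "__main__":
    # 250 points as in the graph, then a larger sample to cut the noise.
    for n in (250, 20000):
        print(n, simulate(n=n))
```

With only 250 points, roughly twenty people clear the cutoff, so the "hired" figure bounces around from seed to seed. The larger run is there to show that the negative correlation in both filtered groups comes from the filter itself and is not a fluke of a small sample.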
You’re probably polluting your statistics more than you think
(via O'Reilly Radar)