The Pitfalls of "Big Data"

I greatly enjoyed this article ("Big data: are we making a big mistake?", by Tim Harford) on the dangers of "big data", which certainly has opened up new avenues of research but nevertheless still requires understanding to avoid statistical pitfalls. Understanding causes beyond mere correlation is still necessary: correlations can predict trends with uncanny accuracy, until they break down.

Very useful is this idea of "digital exhaust" : "…we might call 'found data', the digital exhaust of web searches, credit card payments and mobiles pinging the nearest phone mast". I rather like the phrase, too.

Read the article and heed the warning: "But while big data promise much to scientists, entrepreneurs and governments, they are doomed to disappoint us if we ignore some very familiar statistical lessons."

Posted on April 3, 2014 at 19.05 by jns · Permalink
In: All, It's Only Rocket Science, Plus Ca Change...

Leave a Reply

To thwart spam, comments by new people are held for moderation; give me a bit of time and your comment will show up.

I welcome comments -- even dissent -- but I will delete without notice irrelevant, rude, psychotic, or incomprehensible comments, particularly those that I deem homophobic, unless they are amusing. The same goes for commercial comments and trackbacks. Sorry, but it's my blog and my decisions are final.