These Guardian / Independent stories are dodgy. Traps in data journalism.

December 30th, 2011 by Ben Goldacre in guardian, numerical context, statistics | 6 Comments »

Here’s an interesting problem with data analysis in general, and so, by extension, data journalism: you have to be careful about assuming that the numbers you’ve got access to… really do reflect the underlying phenomena you’re trying to investigate.

Today’s Guardian has a story, “Antidepressant use in England soars“. It’s much more overstated in the Independent. They identify that the number of individual prescriptions written for antidepressant drugs has risen, and then assumes this means that more people are depressed. But while that’s a tempting assumption, it’s not a safe one.

Thinking off the top of my head, it could be – for example – that doctors are writing more frequent prescriptions for the same number of patients, but with each prescription for smaller amounts (to reduce overdose risk, say). These potential alternative explanations are the sort of thing that comes up all the time in data analysis for medical research.

In fact, this specific question – what does an increase in antidepressant scripts mean? – has been researched in some detail before. I wrote about it in April 2011, the last time this rise was written up as a big story, in several major newspapers, including the Guardian. I guess nobody listens to me, and fair enough.

www.badscience.net/2011/04/when-journalists-do-primary-research/

…Firstly, this rise in scripts for antidepressants isn’t a new phenomenon. In 2009 the BMJ published a paper titled “Explaining the rise in antidepressant prescribing”, which looks at the period from 1993 to 2005. In the 5 year period from 2000 to 2005 – the boom before the bust these journalists are writing about – antidepressant prescribing also increased, by 36%. This isn’t very different to 43%, so it feels unlikely that the present increase in prescriptions is due to the recession.

That’s not the only problem here. It turns out that the number of prescriptions for an SSRI drug is a rubbish way of measuring how many people are being treated for depression: not just because people get prescribed SSRIs for all kinds of other things, like anxiety, PTSD, hot flushes, and more; and not just because doctors have moved away from older types of antidepressants, so would be prescribing more of the newer SSRI drugs even if the number of people with depression had stayed the same.

Excitingly, it’s a bit more complicated than that. A 2006 paper from the British Journal of General Practice looked at prescribing and diagnosis rates in Scotland. Overall, again, the number of prescriptions for antidepressants increased from 1.5 million in 1996 to 2.8.million in 2001 (that is, it almost doubled).

But they also found a mystery: looking at Scottish Health Survey, they found no increase in the prevalence of depression; and looking at the GP consultations dataset, again they found no evidence that people were presenting more frequently to their GP with depression, or that GPs were making more diagnoses of depression.

So why were antidepressant prescriptions going up? This puzzle received some kind of explanation in 2009. The BMJ paper above found the same increase in the number of prescriptions that the journalists have found this week, as I said. But they had access to more data: their analysis didn’t just look at the total number of prescriptions in the country, or even the total number of people diagnosed with depression: it also looked at the prescription records of individual patients, in a dataset of over 3 million patients’ electronic health records (with 200,000 people who experienced a first diagnosis of depression during this period).

They found that the rise in the overall number of antidepressant prescriptions was not due to increasing numbers of patients receiving antidepressants. It was almost entirely caused by one thing: a small increase in the small proportion of those patients who received treatment for longer periods of time. Numerically, people receiving treatment for long periods make up the biggest chunk of all the prescriptions written, so this small shift bumped up the overall numbers hugely.

I don’t know for certain if that phenomenon explains the increase in prescriptions from 2006-2010, as it does for the period 2000-2005 (although in the absence of work examining that question, since the increase in scripts was so similar, it does seem fairly likely). And I’m not expecting journalists to go to academic research databases to conduct large complex descriptive studies.

But if they are going to engage in primary research, and make dramatic causal claims  - as they have done in this story – to the nation, I don’t think it’s too much to ask that they familiarise themselves with proper work that’s already been done, and consider alternative explanations for the numbers they’ve found.

Incidentally, if you’re missing the column, I’m procrastinating on Twitter, and posting occasionally on posterous. I’ll stick a round-up of the most interesting things from there onto here occasionally, and there’s also a backlog of columns to pop up here too, from when I was too busy to breathe. The new book is in fighty form, thanks for asking, out in August 2012.

What if academics were as dumb as quacks with statistics?

October 3rd, 2011 by Ben Goldacre in methods, neurostuff, statistics | 35 Comments »

Ben Goldacre, The Guardian, Saturday 10th September 2011

We all like to laugh at quacks when they misuse basic statistics. But what if academics, en masse, deploy errors that are equally foolish? This week Sander Nieuwenhuis and colleagues publish a mighty torpedo in the journal Nature Neuroscience.

They’ve identified one direct, stark statistical error that is so widespread it appears in about half of all the published papers surveyed from the academic neuroscience research literature. Read the rest of this entry »

Benford’s Law: using stats to bust an entire nation for naughtiness.

September 23rd, 2011 by Ben Goldacre in crime, economics, statistics, structured data | 8 Comments »

Ben Goldacre, The Guardian, Saturday 17 September 2011

This week we might bust an entire nation for handing over dodgy economic statistics. But first: why would they bother? Well, it turns out that whole countries have an interest in distorting their accounts, just like companies and individuals. If you’re an Euro member like Greece, for example, you have to comply with various economic criteria, and there’s the risk of sanctions if you miss them. Read the rest of this entry »

Brain imaging studies report more positive findings than their numbers can support. This is fishy.

August 26th, 2011 by Ben Goldacre in academic publishing, publication bias, regulating research, statistics | 20 Comments »

Ben Goldacre, The Guardian, Saturday 13 August 2011

While the authorities are distracted by mass disorder, we can do some statistics. You’ll have seen plenty of news stories telling you that one part of the brain is bigger, or smaller, in people with a particular mental health problem, or even a specific job. These are generally based on real, published scientific research. But how reliable are the studies?

One way of critiquing a piece of research is to read the academic paper itself, in detail, looking for flaws. But that might not be enough, if some sources of bias might exist outside the paper, in the wider system of science.

Read the rest of this entry »

Sampling error, the unspoken issue behind small number changes in the news

August 22nd, 2011 by Ben Goldacre in bbc, media, statistics, uncertainty | 16 Comments »

Ben Goldacre, The Guardian, Saturday 20 August 2011

What do all these numbers mean? “‘Worrying’ jobless rise needs urgent action – Labour” was the BBC headline. They explained the problem in their own words: “The number of people out of work rose by 38,000 to 2.49 million in the three months to June, official figures show.”

Now, there are dozens of different ways to quantify the jobs market, and I’m not going to summarise them all here. The claimant count and the labour force survey are commonly used, and number of hours worked is informative too: you can fight among yourselves for which is best, and get distracted by party politics to your heart’s content. But in claiming that this figure for the number of people out of work has risen, the BBC is simply wrong.

Read the rest of this entry »

Anarchy for the UK. Ish.

April 3rd, 2011 by Ben Goldacre in presenting numbers, statistics | 29 Comments »

Ben Goldacre, The Guardian, Saturday 2 April 2011

Here are two fun ways that numbers can be distorted for political purposes. Stop me if I’m boring you, but each of them feels oddly poetic, in its ability to smear or stifle.

The first is simple: you can conflate two different things into one number, either to inflate a problem, or confuse it. Last weekend, a few hundred thousand people marched in London against the cuts. On the same day, there was some violent disturbance, windows smashed, policemen injured, and drunkeness. Read the rest of this entry »

How to read a paper

January 29th, 2011 by Ben Goldacre in bad science, mail, statistics, sun | 49 Comments »

Ben Goldacre, The Guardian, Saturday 29 January 2011

If science has any authority, it derives from transparency: you can check the claims against the working. Sometimes you hit a brick wall. Sometimes you might consider a shortcut. Let’s look at 3 types of checking. Read the rest of this entry »

Putting a number in its context

January 8th, 2011 by Ben Goldacre in bad science, numerical context, statistics | 28 Comments »

Ben Goldacre, The Guardian, Saturday 8th January 2011

600 pregnancies despite contraceptive implant” said the BBC.  “500 fall pregnant after having contraceptive implant” said the Express. “Contraceptive implant alert” said the Daily Mail: “Hundreds of women fall pregnant after birth control fails”. Read the rest of this entry »

A new and interesting form of wrong

November 27th, 2010 by Ben Goldacre in bad science, numerical context, statistics, survey data | 32 Comments »

Ben Goldacre. The Guardian, Saturday 27 November 2010

Wrong isn’t enough: we need interestingly wrong, and this week that came in some research from Stonewall, an organisation for whom I generally have great respect, which was reported in the Guardian. Stonewall have conducted a survey, and their press release says it shows “the average coming out age has fallen by over 20 years”.

People may well be coming out earlier than before – intuitively that seems plausible – but Stonewall’s survey is flawed by design, and contains some interesting statistical traps. Read the rest of this entry »

The power of anecdotes

August 28th, 2010 by Ben Goldacre in anecdotes, bad science, big pharma, media, statistics | 53 Comments »

Ben Goldacre, The Guardian, Saturday 28 August 2010

For simpletons and amateurs, there are good research methods, and bad research methods. In reality, different tools are valuable in different situations, and sometimes, even very tiny numbers of people can give you a meaningful piece of information: even an anecdote can be informative. Read the rest of this entry »