/r/badstats

Photograph via snooOG

Exposés of bad statistical analyses and screwed-up graphs.

Exposés of bad statistical analyses and screwed-up graphs.

  • If you post a link, please provide some discussion about what's wrong, and why it's wrong.

  • Remember to play the ball, not the player. Attack the stats, not the person using them.

  • If you're reading a popular article that's referencing a journal article, make sure the original is being reported correctly before attacking the original paper.

Relevant other subreddits:

/r/badstats

2,392 Subscribers

2

bad stats assignment

Ok so my assignment for my prob and stat class is to find a bad an misleading stat in household products or advertisements. I found one but I’m getting mixed opinions on whether it’s really a bad stat or not. I think it is. Help!

2 Comments
2024/10/10
01:07 UTC

17

They Knew What They Were Doing

5 Comments
2024/10/09
05:23 UTC

2

3.3 billion people use as much as 1 old American fridge (385 kWh/year) according too this "fun" fact?

https://www.youtube.com/shorts/06tSa0qSv_4

Tell me his stats or wrong or my brain is wrong

Ah i mean 3.3B vs 949kWh/year

0 Comments
2024/10/03
13:29 UTC

2

NRK representing cheating cases pr student at different norwegian universities

2 Comments
2024/02/03
16:28 UTC

9

Germany lost 2:0 to Austria or.....

1 Comment
2023/11/22
07:37 UTC

2

Fun axes

Love these axes not even trying

0 Comments
2023/11/02
07:10 UTC

29

This absolutely horrible graph on "Reading Gains" from George W. Bush's masterclass.

1 Comment
2023/10/29
23:13 UTC

11

Also hate the graph but decided to post it here 🤣

1 Comment
2023/09/05
01:17 UTC

11

Um... I don't believe you.

2 Comments
2023/03/16
01:45 UTC

57

They Knew What They Were Doing

3 Comments
2023/01/25
00:24 UTC

31

Interesting way to show a survey result (The Times, 27th June 22)

5 Comments
2022/06/27
17:32 UTC

6

No statistical significance? Invent alternate metric!

I was commented this study recently and was really trying to give leeway because am aware of some major bias on my part. Then I got to the Saliva Cortisol results, and saw Figure 5. Astounded, I went to look up the peer review process but it appears there actually isn't one? Unless I can't find it because of language barrier issues. The premise is super flawed and there's all kinds of major issues but seriously, Figure 5?!? It's hard to even imagine they are working in good faith here.

https://spca.bc.ca/wp-content/uploads/shock-collar-assets-Salgirli-Efficacy-and-stress-effects-between-3-training-methods.pdf

Edit: Link fix

7 Comments
2022/05/03
18:51 UTC

0

Sendgrid Deliverability Metrics

This is a graph in Sendgrid, which is a company which sends lots of emails. It annoys me every day because there is no reason to add up 'Unique Opens' with Delivered and Bounced & Blocked. Emails that were delivered and opened will count for both categories, and therefore be counted twice.

\"How are your overall deliverability metrics trending?\" No clue cause this graph is useless.

0 Comments
2022/02/01
14:35 UTC

0

This hilariously dishonest graph

3 Comments
2021/11/27
07:41 UTC

27

US has more than 3X as many people

3 Comments
2021/11/16
02:03 UTC

1

Does it count as bad statictics when he won't show the data source?

1 Comment
2021/09/10
19:31 UTC

41

This linear fit clearly makes sense

1 Comment
2021/07/22
20:46 UTC

18

Tell me this is not misleading

5 Comments
2021/04/10
20:40 UTC

14

Samsung making it seem like their sata drive is faster than my nvme drive

1 Comment
2020/04/17
16:09 UTC

28

It’s oddly convenient that there aren’t 10,001-29,999 cases in any of the states

4 Comments
2020/03/28
02:36 UTC

0 Comments
2020/02/24
04:31 UTC

3

Messing with polling crosstabs to get a number you like.

Subtle one, but I keep seeing these same numbers:

https://twitter.com/LukewSavage/status/1217895333230972931

They claim it's explained by this:

https://pbs.twimg.com/media/EOf32bKW4AA7PNZ?format=jpg&name=4096x4096

I'm not 100% sure what math they're doing on the numbers. I THINK they just took an average of the other 3 highlighted numbers. This average doesn't really tell you anything. It's the expected percentage of trump voters given a randomly selected candidate who isn't their preferred candidate.

They seem to be ignoring the fact that a bunch of people would vote for trump over their preferred democrat. For instance, for buttigieg you get 5% who would vote for trump... over buttigieg. 15% of them would also vote for trump over sanders. Since presumably 0% would vote sanders over buttigieg, given they most prefer buttigieg, it should be around 10% who would switch to trump over sanders, not 12%. Similarly, you'd get 8% of biden, 5% warren, 4% sanders switching to trump over their least favorite other.

That still wouldn't really be completely accurate though, since the percentages given in the other candidates might have less than total overlap. So for instance, for biden it could be from that 8% up to 17% (the sum of the other trump percentages minus the biden percentage) who would back trump over another.

So I think a reasonable guess for a more accurate number would be

Buttigieg: 12%

Biden: 8%

Warren: 5%

Sanders: 4%

But really the best we can say is more like

Biden: 8%-17%

Warren 5%-12%

Buttigieg: 12%-20%

Sanders 4%-15%

Though chances are the real number would be near the bottom of that range.

I'm still assuming nobody would vote trump over their own candidate, but wouldn't vote for trump over some other candidate, but I feel like that's fair.

0 Comments
2020/01/18
00:51 UTC

20

3 y axes, not a unit in sight

0 Comments
2019/10/23
05:34 UTC

Back To Top