Tildes

Activity

Votes

Comments

New

All activity

Showing only topics with the tag "studies". Back to normal view

Staged homes sell for 10% more and one week faster
~finance
- economics
Link
24 comments

ssrn.com

April 13

33 votes
That one study that proves developers using AI are deluded

~tech Ask

I've found myself replying to different people about the early 2025 METR study kind of often. So I thought I'd try posting a top level thread, consider it an unsolicitied public service...

I've found myself replying to different people about the early 2025 METR study kind of often. So I thought I'd try posting a top level thread, consider it an unsolicitied public service announcement.

You might be familiar with the study because it has been showing up alongside discussions about AI and coding for about a year. It found that LLMs actually decreased developer productivity and so people love to use it to suggest that the whole AI coding thing is really a big lie and the people who think it makes them more productive are hallucinating.

Here's the thing about that study... No one seems to have even glanced at it!

First, it's from early 2025, they used Claude Sonnet 3.5 or 3.7. Those models are no way comparable to current gen coding agents. The commonly cited inflection point didn't happen until later in 2025 with, depending on who you ask, Sonnet 4.5 or Opus 4.5

The study was comprised of 16 people! If those 16 were even vaguely representative of the developer population at the time most of them wouldn't have had significant experience with LLMs for coding.

These are not tools that just work out of the box, especially back then. It takes time and experimentation, or instruction, to use them well.

It was cool that they did the study, trying to understand LLMs was a good idea. But it's not what anyone would consider a representative, or even well thought out, study. 16 people!

But wait! They did a follow up study later in 2025.

This time with about 60 people and newer models and tools. In that study they found the opposite effect, AI tools sped developers up (which is a shock to no one who has used these tools long enough to get a feel for them). They also mentioned:

However the true speedup could be much higher among the developers and tasks which are selected out of the experiment.

In addition they had some, kind of entertaining, issues:

Due to the severity of these selection effects, we are working on changes to the design of our study.

Back to the drawing board, because:

Recruitment and retention of developers has become more difficult. An increased share of developers say they would not want to do 50% of their work without AI, even though our study pays them $50/hour to work on tasks of their own choosing. Our study is thus systematically missing developers who have the most optimistic expectations about AI’s value.

And...

Developers have become more selective in which tasks they submit. When surveyed, 30% to 50% of developers told us that they were choosing not to submit some tasks because they did not want to do them without AI. This implies we are systematically missing tasks which have high expected uplift from AI.

And so...

Together, these effects make it likely that our estimate reported above is a lower-bound on the true productivity effects of AI on these developers.

[...]

Some developers were less likely to complete tasks that they submitted if they were assigned to the AI-disallowed condition. One developer did not complete any of the tasks that were assigned to the AI-disallowed condition.

[...]

Altogether, these issues make it challenging to interpret our central estimate, and we believe it is likely a bad proxy for the real productivity impact of AI tools on these developers.

So to summarize, the new study showed a productivity increase and they estimate it's larger than the ~20% increase the study found. Cheers to them for being honest about the issues they encountered. For my part I know for sure that the increase is significantly more than 20%. The caveat, though, is that is only true after you've had some experience with the tools.

The truth is that we don't need a study for this, any experienced engineer can readily see it for themselves and you can find them talking about it pretty much everywhere. It would be interesting, though, to see a well designed study that attempted to quantify how big the average productivity increase actually is.

For that the participants using AI would need to be experienced with it and allowed to use their existing setups.

I want to add that this is not an attempt to evangelize for AI. I find the tools useful but I'm not selling anything. I'm interested in them and I stay up to date on the conversations surrounding them and the underlying technology. I use them frequently both for my own projects and to help less technical people improve their business productivity.

Whether AI agents are a good thing or not, from a larger perspective, is a very different, and complicated, conversation. The important thing is that utility and impact are two different conversations. There isn't a debate anymore about utility.

I know this probably won't stop people from continuing to derail conversations with the claim that developers are wrong about utility, but I had to try. It's just hard to let it pass by when someone claims the sky is green.

I understand that AI makes people angry and I think they have good reason to be angry. There are a lot of aspects of the AI revolution that I'm not thrilled about. The hype foremost, the FOMO as part of the hype, the potential for increased wealth consolidation really sucks, though I lay that at the feet of systems that existed before LLMs came along.

It's messy, but let's consider giving the benefit of the doubt to professionals who say a tool works instead of claiming they're wrong. Let them enjoy it. We can still be angry at AI at the same time.

61 comments

post_below

March 19

82 votes
Intelligent people are better judges of the intelligence of others

~science Link

23 comments

psypost.org

April 8

28 votes
US state dealer laws add up to $5,000 to new car prices, ICLE study finds

~transport Article 383 words, published Mar 30 2026

2 comments

laweconcenter.org

April 6

24 votes
Study finds sperm whales help each other give birth
~science
- biology.marine
Article 340 words
1 comment

oceanographicmagazine.com

March 27

18 votes
Your daily coffee may be protecting your brain, 43-year study finds

~health.mental Article 1000 words

40 comments

sciencedaily.com

March 18

33 votes
Sweden's old‑growth natural forests store 83% more carbon than managed woodlands – new study
~enviro
- energy
Article 1074 words
1 comment

The Conversation

March 22

20 votes
The kids are all right - Surprising studies show young people are doing better than previous generations in many ways
~life
- parenting
- family
Link
14 comments

scientificamerican.com

March 17

49 votes
Pace of global warming has doubled since 2015
~enviro
- climate change
Article 1431 words, published Mar 6 2026
17 comments

carbonbrief.org

March 9

45 votes
Workers who love ‘synergizing paradigms’ might be bad at their jobs

~humanities.languages Article 840 words, published Mar 2 2026

2 comments

cornell.edu

March 7

24 votes
Almost a third of Gen Z men agree a wife should obey her husband

~life.men Article 1047 words

40 comments

kcl.ac.uk

March 5

39 votes
UMD scientists create ‘smart underwear’ to measure human flatulence
~science
- biology
Article 783 words
4 comments

umd.edu

February 19

21 votes
Single vaccine could protect against all coughs, colds and flus, researchers say
~health
- medicine
Article 315 words
32 comments

BBC

February 20

43 votes
Drinking two-three cups of coffee a day tied to lower dementia risk

~health.mental Article 652 words, published Feb 9 2026

26 comments

harvard.edu

February 18

33 votes
AI fails at 96% of jobs (new study)

~tech Video 12:49

16 comments

YouTube: ColdFusion

February 13

28 votes
What science says we’ve been getting wrong about exercise
~health
- fitness
Link
18 comments

msn.com

February 9

22 votes
The incidence of autism is similar in boys and girls, although boys are diagnosed earlier – study conducted on a sample of 2.7 million people in Sweden over a thirty-five year period

~health.mental Article 923 words

22 comments

elpais.com

February 6

32 votes
Geologists may have solved mystery of Green River's 'uphill' route

~science Article 910 words

6 comments

phys.org

February 2

15 votes
Take the stairs. It could help you live longer.
~health
- fitness
Article
4 comments

The Washington Post

February 3

28 votes
Lead in archived hair documents a decline in lead exposure to humans since the establishment of the US Environmental Protection Agency
~enviro
- pollution
Link
1 comment

pnas.org

February 3

19 votes
Scientists think that Svalbard polar bears have adapted to recent ice loss by eating more land-based prey, including reindeer and walruses
~enviro
- climate change
Article
0 comments

BBC

January 30

6 votes
Terry Pratchett’s novels may have held clues to his dementia a decade before diagnosis, our new study suggests
~books
- fiction
Article 750 words
11 comments

The Conversation

January 26

36 votes
Seaweed farms boost long-term carbon storage by altering ocean chemistry

~enviro Article 1106 words

8 comments

phys.org

January 9

26 votes
Scientists cast doubt on the discovery of microplastics throughout the human body

~health Article 2221 words

9 comments

The Guardian

January 13

53 votes
US households using Ozempic spend less on groceries
~health
- medicine
Article 745 words, published Dec 19 2025
71 comments

cornell.edu

January 12

28 votes
The city where free buses changed everything

~transport Article 1223 words, published Jan 1 2026

4 comments

reasonstobecheerful.world

January 8

23 votes
How pointing fingers shape what we see in Old Master paintings
~arts
- painting
- art.fine
Article 827 words, published Dec 9 2025
0 comments

alphagalileo.org

January 7

6 votes
Most parked domains now serving malicious content

~tech Article 992 words

14 comments

krebsonsecurity.com

December 18, 2025

32 votes
Collapse of critical Atlantic current is no longer low-likelihood, study finds
~enviro
- climate change
Article 1044 words, published Aug 28 2025
13 comments

The Guardian

November 30, 2025

44 votes
US Centers for Disease Control and Prevention to end all monkey research
~science
- biology
Link
25 comments

science.org

November 21, 2025

39 votes
There may not be a safe off-ramp for some taking GLP-1 drugs, study suggests

~health Article 400 words

14 comments

Ars Technica

November 27, 2025

22 votes
100 years of menus shows how food and diplomacy are linked
~food
- history
Article 910 words
1 comment

scimex.org

November 15, 2025

14 votes
Researchers isolate memorization from problem-solving in AI neural networks

~tech Article 1447 words

0 comments

Ars Technica

November 12, 2025

12 votes
The emerging evidence on AI tutoring

~tech Article 3306 words

3 comments

Substack: Carl Hendrick

November 12, 2025

20 votes
Large US study finds memory decline surge in young people

~health.mental Article 919 words

7 comments

Substack: The One Percent Rule

November 10, 2025

27 votes
Study suggests that the Universe's expansion 'is now slowing, not speeding up'
~space
- astronomy
Article 1000 words
26 comments

phys.org

November 7, 2025

51 votes
Rising cognitive disability as a public health concern among US adults, trends from the behavioral risk factor surveillance system, 2013–2023

~health.mental Link

4 comments

neurology.org

November 10, 2025

29 votes
New research shows attention lapses due to sleep deprivation coincide with a flushing of fluid from the brain

~science Article 1076 words

7 comments

mit.edu

November 1, 2025

45 votes
For the relocated Kiruna, Sweden's northernmost settlement, located above the Arctic Circle, planners prioritised infrastructure links over microclimate
~design
- urban planning
Article 529 words
0 comments

euronews.com

November 2, 2025

11 votes
Current studies may overestimate microplastics transferring from containers to food

~food Link

10 comments

food-safety.com

October 27, 2025

23 votes
The Icelandic volcanic island of Surtsey emerged in the 1960s, and scientists say studying its development offers hope for damaged ecosystems worldwide

~enviro Article 1038 words, published Oct 13 2025

1 comment

The Guardian

October 20, 2025

9 votes
Nanoparticle vaccine shows cancer prevention and immunity in mice
~health
- medicine
Article 956 words
2 comments

sciencedaily.com

October 19, 2025

18 votes
Hover flies are long-distance travellers

~enviro Link

1 comment

economist.com

October 17, 2025

10 votes
The genius logic of the NATO phonetic alphabet

~humanities.languages Video 23:28, published Oct 11 2025

1 comment

YouTube: RobWords

October 16, 2025

18 votes
High pollen count: the last straw effect on suicide risk

~health.mental Link

26 comments

umich.edu

October 7, 2025

26 votes
Earth is getting darker and it’s changing the planet’s climate balance
~enviro
- climate change
Article 1131 words
2 comments

Yahoo

October 6, 2025

15 votes
Poor mental health linked to pregnancy and childbirth can affect women's health in the long term, Swedish study finds

~life.women Article 490 words

1 comment

euronews.com

September 26, 2025

13 votes
Human impacts of wildfires worsen even as total burned area declines

~enviro Article 663 words

1 comment

phys.org

September 22, 2025

6 votes
China cut fertilizer use and still increased crop yields (2018)

~enviro Article 652 words, published Mar 26 2018

2 comments

weforum.org

September 16, 2025

14 votes
Light pollution is causing birds like the Australian magpie-lark to sing for longer
~science
- biology
Article 774 words, published Aug 21 2025
1 comment

ABC

September 16, 2025

9 votes