26 votes

Undergraduate upends a 40-year-old Data Science conjecture

Posted February 12 by skybrian

Tags: hash tables, source.quanta magazine, author.steve nadis, data science, computer science, data structures, pointers.tiny

https://www.quantamagazine.org/undergraduate-upends-a-40-year-old-data-science-conjecture-20250210/

Link information

This data is scraped automatically and may be incorrect.

Published: Feb 10 2025
Word count: 644 words

6 comments

[4]
sparksbet
February 12
Link
This is probably nitpicky, but why is this called a "data science" conjecture rather than one in computer science? I get that "data science" is a buzzword now, but it wasn't back in the 80s when...

This is probably nitpicky, but why is this called a "data science" conjecture rather than one in computer science? I get that "data science" is a buzzword now, but it wasn't back in the 80s when Yao's conjecture was... conjected. I already have trouble, as someone whose last job title was "data scientist", with "data science" being used to describe too many dissimilar things. It goes from having a very nebulous meaning to being nigh-meaningless if you use it for any computer science concept that remotely involves data.

18 votes
1. [3]
  stu2b50
  February 12
  Link Parent
  It's just the layman reporter subbing in words that they know. You don't have to get further than the second paragraph yeeaaaahhhhh
  
  It's just the layman reporter subbing in words that they know. You don't have to get further than the second paragraph
  
  The paper’s title, “Tiny Pointers,” referred to arrowlike entities that can direct you to a piece of information, or element, in a computer’s memory
  
  yeeaaaahhhhh
  
  19 votes
  1. [2]
    tanglisha
    February 12 (edited February 13)
    Link Parent
    I like how the reporter tried to explain pointers. I thought they did a decent job for someone with no background. I imagine the arrows came from someone drawing it out for them.
    
    I like how the reporter tried to explain pointers. I thought they did a decent job for someone with no background.
    
    I imagine the arrows came from someone drawing it out for them.
    
    8 votes
    
    Plik
    February 13
    Link Parent
    Oof, I remember pointers. ...Vaguely.
    
    Oof, I remember pointers.
    
    ...Vaguely.
    
    1 vote
skybrian (OP)
February 12
Link
Here is the paper. From the abstract:

Together, Krapivin (now a graduate student at the University of Cambridge), Farach-Colton (now at New York University) and Kuszmaul demonstrated in a January 2025 paper that this new hash table can indeed find elements faster than was considered possible. ln so doing, they had disproved a conjecture long held to be true.

Here is the paper. From the abstract:

We show that, even without reordering elements over time, it is possible to construct a hash table that achieves far better expected search complexities (both amortized and worst-case) than were previously thought possible. Along the way, we disprove the central conjecture left by Yao in his seminal paper ``Uniform Hashing is Optimal''. All of our results come with matching lower bounds.

11 votes
skybrian (OP)
February 12
Link
From discussion on Hacker News, the reason this isn't a practical result is that normally, when a hash table gets full enough, it's automatically resized.

From discussion on Hacker News, the reason this isn't a practical result is that normally, when a hash table gets full enough, it's automatically resized.

9 votes