Showing only topics with the tag "machine learning". Back to normal view

Aurora: A leverage-aware optimizer for rectangular matrices

~comp Article 5001 words

6 comments

tilderesearch.com

17 hours ago

10 votes
Robot golf vs holes that keep getting harder

~tech Video 24:46

5 comments

YouTube: Stuff Made Here

April 17

24 votes

Predicting the NBA MVP with Machine Learning

~sports.basketball Text 989 words

Predicting the NBA MVP with Machine Learning Thesis Every season, basketball fans debate who deserves the MVP award. We built 3 machine learning models that attempt to answer that question using...

Predicting the NBA MVP with Machine Learning

Thesis

Every season, basketball fans debate who deserves the MVP award. We built 3 machine learning models that attempt to answer that question using box score statistics. At the end of each season, this award is determined by a panel of voters.

Methodology

Each model is trained on every NBA season from 1974 to 2017. For each player season, it looks at nine statistics:

Points, assists, blocks, defensive rebounds, and field goals per game the core production numbers
Win Shares (WS): an estimate of how many wins a player contributed to their team
Value Over Replacement Player (VORP): how much better a player is than a league average replacement
Box Plus/Minus (BPM): a player's net impact per 100 possessions
Usage Rate (USG%): what share of team plays run through that player

From those nine numbers, the model learns what a typical MVP season looks like versus a non MVP season, then applies that knowledge to current players. Each model outputs an independent probability that a given player wins MVP, not a share of a single pool, so the values do not sum to 1. Think of it as each player's individual odds.

Three Models, One Question

Rather than relying on a single approach, the system runs three different models and lets you compare:

Logistic Regression

The simplest of the three. It draws a straight line through the data, each statistic gets a weight, and a player's score is the weighted sum of their stats. It's easy to interpret (a higher coefficient means that stat matters more).

Win Shares (WS) is by far the most influential feature, with an absolute coefficient of ~1.85, nearly double the next most important feature. Box Plus/Minus (BPM) ranks second at ~1.0, followed by Defensive Rebounds per Game (DRBPG, ~0.85) and Assists per Game (ASTPG, ~0.70). VORP and Field Goals per Game (FGPG) contribute moderately at ~0.50. Blocks per Game (BLKPG), Points per Game (PTSPG), and Usage Rate (USG%) have minimal weight, all under 0.15.

Random Forest

Builds hundreds of decision trees, each one asking a series of "is this stat above or below X?" questions and averages their answers. It handles complex relationships between stats well and is less sensitive to any one unusual data point. Think of it as a large committee of simple rules voting together.

WS again dominates at ~0.31, accounting for roughly twice the importance of the next feature. VORP (~0.15) and BPM (~0.125) rank second and third. DRBPG (~0.10), PTSPG (~0.08), BLKPG (~0.07), FGPG (~0.065), and ASTPG (~0.06) contribute in a fairly tight mid-range band. USG% is the least important at ~0.05. Compared to logistic regression, the Random Forest spreads importance more evenly across features.

Gradient Boosting

Also uses decision trees, but builds them sequentially: each new tree focuses on correcting the mistakes the previous ones made.

This model is heavily concentrated on just two features: BPM (~0.47) and WS (~0.41) together account for roughly 88% of total feature importance. All remaining features, PTSPG, VORP, ASTPG, DRBPG, contribute ~0.02–0.03 each, and BLKPG, USG%, and FGPG are effectively unused (near zero). This suggests the gradient boosting model learned that BPM and WS alone are nearly sufficient to separate MVP candidates.

Historical Results

The models were trained on data through 2017, so every season from 2018 onward is a genuine out of sample test, the models have never seen these players or seasons before.

Season	Actual MVP	LR	RF	GB
2018	James Harden	#2	#2	#1 ✓
2019	Giannis Antetokounmpo	#1 ✓	#1 ✓	#1 ✓
2020	Giannis Antetokounmpo	#1 ✓	#1 ✓	#1 ✓
2021	Nikola Jokić	#1 ✓	#1 ✓	#1 ✓
2022	Nikola Jokić	#1 ✓	#1 ✓	#1 ✓
2023	Joel Embiid	#2	#4	#2
2024	Nikola Jokić	#1 ✓	#1 ✓	#1 ✓
2025	Shai Gilgeous-Alexander	#3	#2	#569

Top-1 accuracy: LR 5/8 · RF 5/8 · GB 6/8

Top-3 accuracy: LR 8/8 · RF 7/8 · GB 7/8

Top-3 accuracy: LR 8/8 · RF 7/8 · GB 7/8

For five straight seasons (2019–2022 + 2024), all three models agreed on the same #1 pick, and were right every time.

In 2023, every model ranked Nikola Jokić #1, and by the numbers, he arguably had the better season. Joel Embiid won the award anyway, the kind of outcome that may reflect voter narrative/fatigue and team performance rather than pure statistics. In 2025, Gradient Boosting ranked Shai Gilgeous-Alexander outside the top 500, while Logistic Regression and Random Forest had him at #3 and #2 respectively. I have no idea why GB did this. Likely a bug.

Future Direction

No model is perfect, and these have known blind spots. Team record is not included, MVP voters have historically punished players on losing teams regardless of individual stats. Injuries and narrative don't appear in a box score. And the training data skews toward an older era; the three point revolution and the rise of players like SGA have introduced statistical profiles the 1970s–1990s data doesn't fully capture.

Current Season Predictions (2025–26)

	LR	RF	GB
#1	Nikola Jokić	Shai Gilgeous-Alexander	Nikola Jokić
#2	Shai Gilgeous-Alexander	Nikola Jokić	Victor Wembanyama
#3	Victor Wembanyama	Victor Wembanyama	Giannis Antetokounmpo
#4	Luka Dončić	Giannis Antetokounmpo	Kawhi Leonard
#5	Jalen Johnson	Luka Dončić	Luka Dončić

Two of the three models have Nikola Jokić as the frontrunner. Random Forest is the dissenter, putting Shai Gilgeous-Alexander ahead. Victor Wembanyama appears in all three top 3s in just his second season, which is notable. Before running the models, I expected him to be #1 for all of them considering the way the models use advanced stats.

Conclusion

Thank you for reading. I hope you found this interesting. Basketball reference also has their own model if you would like to see a different result. Please do not gamble on my models!

13 votes

Top twenty worldwide with social-engineering and a cheat that's still undetected

~games Article 3150 words, published Jan 29 2026

8 comments

ud2.rip

February 19

27 votes
AI doesn’t reduce work—it intensifies it

~tech Link

9 comments

hbr.org

February 9

41 votes
An explainer: physical AI must sense, think, act, and optimize

~tech Article 1218 words, published Jan 6 2026

0 comments

aptiv.com

January 13

4 votes
The spy who came in from the WiFi: Beware of radio network surveillance!
~comp
- security.cyber
- privacy
Article 740 words
18 comments

alphagalileo.org

November 13, 2025

27 votes
Video models are zero-shot learners and reasoners
~tech
- google
Article 262 words
1 comment

video-zero-shot.github.io

September 28, 2025

17 votes
Increasing trust in automated driving

~transport Article 1013 words

0 comments

aptiv.com

September 24, 2025

6 votes
Can AI tell if I'm writing AI slop? A machine learning journey.

~comp Article 1708 words

3 comments

mattsayar.com

September 15, 2025

21 votes
Air Spot | Reinforcement Learning behavior research

~tech Video 4:59

1 comment

YouTube: Boston Dynamics

August 27, 2025

6 votes
Subliminal learning: Language models transmit behavioral traits via hidden signals in data

~tech Article 627 words

3 comments

anthropic.com

July 22, 2025

21 votes
AI coding tools make developers slower but they think they're faster, study finds

~tech Article 724 words

11 comments

theregister.com

July 13, 2025

40 votes
The Common Pile v0.1: An 8TB dataset of public domain and openly licensed text

~tech Article 432 words

19 comments

arXiv

June 10, 2025

26 votes
Counting all of the yurts in Mongolia using machine learning

~comp Article 3450 words

3 comments

monroeclinton.com

June 18, 2025

15 votes
Waymos are getting more assertive. Why the driverless taxis are learning to drive like humans.

~transport Article 1241 words

29 comments

San Francisco Chronicle

June 3, 2025

45 votes
Intelligent Agent Technology: Open Sesame! (1993)

~tech Article 252 words

2 comments

gingerbeardman.com

May 31, 2025

7 votes
Visualising how AI training avoids getting stuck in local minima

~comp Video 22:57

8 comments

YouTube: Welch Labs

May 11, 2025

18 votes
US scientists are using machine learning to find new treatments among thousands of old medicines
~health
- medicine
Article 2024 words, published Mar 20 2025
3 comments

The New York Times

April 1, 2025

12 votes
Tracing the thoughts of a large language model

~tech Link

1 comment

anthropic.com

March 28, 2025

10 votes
Combining machine learning and homomorphic encryption in the Apple ecosystem
~tech
- privacy
Article 1733 words
1 comment

apple.com

March 26, 2025

9 votes
Show Tildes: we built the world's first legal AI API

~tech Article 727 words

11 comments

isaacus.com

March 9, 2025

22 votes

Funny output from Claude.ai: a codeblock in a question about an early rennaissance theological work

~comp

programming

Ask

I asked Claude for references to some ideas the ai had output earlier about a 16th century cardinal (Sadoleto). It's response was, Let me analyze the letter, and then this: async function...

I asked Claude for references to some ideas the ai had output earlier about a 16th century cardinal (Sadoleto). It's response was, Let me analyze the letter, and then this:

async function searchForSadoletoLetter() {
  try {
    const files = ['sadoleto_letter.txt', 'sadoleto.txt', 'sadoleto_to_geneva.txt', 'letter_to_geneva.txt'];
    
    for (const filename of files) {
      try {
        const content = await window.fs.readFile(filename, { encoding: 'utf8' });
        console.log(`Found file: ${filename}`);
        console.log("First 500 characters:");
        console.log(content.substring(0, 500));
        return content;
      } catch (err) {
        // File not found, try the next one
      }
    }
    console.log("Could not find any files containing Sadoleto's letter");
    return null;
  } catch (err) {
    console.error("Error searching for files:", err);
    return null;
  }
}

const content = await searchForSadoletoLetter();

// Since we don't have the letter available as a file, let's search for published versions online
console.log("The letter isn't available as a file. We would need to rely on scholarly knowledge about Sadoleto's letter to Geneva (1539).");
console.log("Sadoleto's letter was published in 'A Reformation Debate: Sadoleto's Letter to the Genevans and Calvin's Reply' (ed. John C. Olin).");

This strikes me as odd? Did Claude write a script to generate its own response? Have coders put something like this in as a gaurdrail?

edit: details about earlier convo

15 votes

Overfitting to theories of overfitting
~science
- mathematics
- statistics
Article 1064 words
10 comments

argmin.net

February 15, 2025

10 votes
What trustworthy resources are you using for AI/LLMs/ML education?

~tech Ask (recommendations)

Every company is trying to shoehorn AI into every product, and many online materials provide a general snake oil vibe, making it increasingly difficult to parse. So far, my primary sources have...

Every company is trying to shoehorn AI into every product, and many online materials provide a general snake oil vibe, making it increasingly difficult to parse. So far, my primary sources have been GitHub, Medium, and some YouTube.

My goal is to better understand the underlying technology so that I can manipulate it better, train models, and use it most effectively. This goes beyond just experimenting with prompts and trying to overcome guardrails. It includes running local, like Ollama on my M1 Max, which I'm not opposed to.

5 comments

GreasyGoose

January 24, 2025

8 votes
Task-Specific LLM Evals that Do & Don't Work

~comp Article 6254 words, published Mar 31 2024

2 comments

eugeneyan.com

December 9, 2024

4 votes
Someone made a dataset of one million Bluesky posts for 'machine learning research'
~tech
- social media
Link
30 comments

404media.co

November 27, 2024

20 votes
When Machine Learning Tells the Wrong Story
~comp
- security
- hardware
Article 7485 words
1 comment

jackcook.com

November 10, 2024

6 votes
Real-time speech-to-speech translation
~comp
- open source
Ask (recommendations)
Has anyone used a free, offline, open-source, real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)? There are a few libraries that written that...

Has anyone used a free, offline, open-source, real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)? There are a few libraries that written that purportedly can do or help with local speech-to-speech:
I'm looking for a simple app that can listen for English, translate into Korean (and other languages), then perform speech synthesis on the translation. Although real-time would be great, a short delay would work.

RTranslator is awkward (couldn't get it to perform speech-to-speech using a single phone). 3PO sprouts errors like dandelions and requires an online connection.

Any suggestions?
8 comments

DaveJarvis

October 25, 2024

6 votes
GSM-Symbolic: Understanding the limitations of mathematical reasoning in large language models

~tech Article 278 words

12 comments

apple.com

October 19, 2024

15 votes
On the path to delivering next generation UK weather forecasts

~tech Article 1297 words, published Sep 5 2024

2 comments

metoffice.gov.uk

September 11, 2024

7 votes
The LLMentalist effect: how chat-based large language models replicate the mechanisms of a psychic's con

~tech Article 4404 words, published Jul 4 2023

14 comments

softwarecrisis.dev

August 16, 2024

29 votes
Six distinct types of depression identified in Stanford Medicine-led study

~health.mental Article 886 words, published Jun 22 2023

26 comments

stanford.edu

July 17, 2024

51 votes
"Mechanistic interpretability" for LLMs, explained

~comp Article 3670 words

1 comment

Substack: Sean Trott

July 8, 2024

6 votes
Can I have some advice on the neural net I've been working on?
~comp
- programming.csharp
- programming
Ask (advice)
Apologies if this isn't an appropriate place to post this. Inspired by a paper I found a while back (https://publications.lib.chalmers.se/records/fulltext/215545/local_215545.pdf), I tried my hand...

Apologies if this isn't an appropriate place to post this.

Inspired by a paper I found a while back (https://publications.lib.chalmers.se/records/fulltext/215545/local_215545.pdf), I tried my hand at implementing a program (in C#) to create ASCII art from an image. It works pretty well, but like they observed in the paper, it's pretty slow to compare every tile to 90-some glyphs. In the paper, they make a decision tree to replicate this process at a faster speed.

Recently, I revisited this. I thought I'd try making a neural net, since I found the idea interesting. I've watched some videos on neural nets, and refreshed myself on my linear algebra, and I think I've gotten pretty close. That said, I feel like there's something I'm missing (especially given the fact that the loss isn't really decreasing). I think my problem is specifically during backpropagation.

Here is a link to the TrainAsync method in GitHub: https://github.com/bendstein/ImageToASCII/blob/1c2e2260f5d4cfb45443fac8737566141f5eff6e/LibI2A/Converter/NNConverter.cs#L164C59-L164C69. The forward and backward propagation methods are below it.

If anyone can give me any feedback or advice on what I might be missing, I'd really appreciate it.

6 comments

a_sharp_soprano_sax

July 7, 2024

14 votes
I will fucking piledrive you if you mention AI again

~comp Article 4269 words

32 comments

mataroa.blog

June 19, 2024

119 votes
Extracting interpretable features from Claude 3 Sonnet

~tech Article 219 words

5 comments

transformer-circuits.pub

May 22, 2024

13 votes
Hallucination-free RAG: Making LLMs safe for healthcare

~tech Article 2467 words, published Apr 21 2024

2 comments

mattyyeung.github.io

May 8, 2024

12 votes
Turning old maps into 3D digital models of lost neighborhoods

~tech Link

0 comments

osu.edu

April 20, 2024

9 votes
MDN’s AI Help and lucid lies

~comp Article 1837 words

2 comments

seirdy.one

April 6, 2024

7 votes
Stability AI reportedly ran out of cash to pay its bills for rented cloudy GPUs
~tech
- amazon.web services
- google.cloud platform
Article 874 words
13 comments

theregister.com

April 4, 2024

28 votes
Noam Chomsky: The false promise of ChatGPT

~tech Article 1740 words, published Mar 8 2023

37 comments

The New York Times

March 31, 2024

30 votes
What useful tasks are possible with an LLM with only 3B parameters?

~comp Ask (advice)

Playing with Llama 7B and 13B, I found that the 13B model was capable of doing a simple task, rewriting titles in sentence case for Tildes submissions. The 7B model doesn't appear capable of the...

Playing with Llama 7B and 13B, I found that the 13B model was capable of doing a simple task, rewriting titles in sentence case for Tildes submissions. The 7B model doesn't appear capable of the same task, out of the box.

I heard about Android's new AICore available on a couple of new devices. But it sounds like Gemini Nano, which runs on-device, can only handle 2B or 3B parameters.

Is this size of model useful for real tasks? Does it only become useful after training on a specific domain? I'm a novice and wanting to learn a little bit about it. On-device AI is an appealing concept to me.

2 comments

talklittle

March 26, 2024

12 votes
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

~comp Article 2634 words

15 comments

arXiv

March 1, 2024

21 votes
Polymath - Toolkit to automatically segment music tracks and convert to MIDI

~comp Link

2 comments

GitHub: samim23

March 7, 2024

10 votes
What are some interesting machine learning research papers you found?

~tech Ask (survey)

Here's a place to share machine learning research papers that seem interesting to you. I'm no expert, but sometimes I skim them, and maybe there are some folks on Tilde who know more than I do?...

Here's a place to share machine learning research papers that seem interesting to you. I'm no expert, but sometimes I skim them, and maybe there are some folks on Tilde who know more than I do?

One paper per top-level post, and please link to arXiv (if relevant) and quote a bit of the abstract.

35 comments

skybrian

June 21, 2023

11 votes
Google Bard is now Gemini; Gemini Advanced launched

~tech Article 425 words

12 comments

blog.google

February 8, 2024

24 votes
Google's Gemini 1.5 Pro is a new, more efficient AI model
~tech
- google
Article 761 words
1 comment

Engadget

February 16, 2024

10 votes
Vesuvius Challenge 2023 Grand Prize awarded: we can read the first scroll!

~comp Article 4322 words

9 comments

scrollprize.org

February 7, 2024

34 votes
Why autonomous trucking is harder than autonomous rideshare

~transport Article 2273 words, published Jan 10 2024

25 comments

kevinchen.co

January 14, 2024

12 votes