Activity

Votes

Comments

New

All activity

Showing only topics with the tag "machine learning". Back to normal view

US scientists are using machine learning to find new treatments among thousands of old medicines
~health
- medicine
Article 2024 words, published Mar 20 2025
3 comments

The New York Times

2 days ago

12 votes
Tracing the thoughts of a large language model

~tech Link

1 comment

anthropic.com

March 28

10 votes
Combining machine learning and homomorphic encryption in the Apple ecosystem
~tech
- privacy
Article 1733 words
1 comment

apple.com

March 26

9 votes
Show Tildes: we built the world's first legal AI API

~tech Article 727 words

14 comments

isaacus.com

March 9

22 votes

Funny output from Claude.ai: a codeblock in a question about an early rennaissance theological work

~comp

programming

Ask

I asked Claude for references to some ideas the ai had output earlier about a 16th century cardinal (Sadoleto). It's response was, Let me analyze the letter, and then this: async function...

I asked Claude for references to some ideas the ai had output earlier about a 16th century cardinal (Sadoleto). It's response was, Let me analyze the letter, and then this:

async function searchForSadoletoLetter() {
  try {
    const files = ['sadoleto_letter.txt', 'sadoleto.txt', 'sadoleto_to_geneva.txt', 'letter_to_geneva.txt'];
    
    for (const filename of files) {
      try {
        const content = await window.fs.readFile(filename, { encoding: 'utf8' });
        console.log(`Found file: ${filename}`);
        console.log("First 500 characters:");
        console.log(content.substring(0, 500));
        return content;
      } catch (err) {
        // File not found, try the next one
      }
    }
    console.log("Could not find any files containing Sadoleto's letter");
    return null;
  } catch (err) {
    console.error("Error searching for files:", err);
    return null;
  }
}

const content = await searchForSadoletoLetter();

// Since we don't have the letter available as a file, let's search for published versions online
console.log("The letter isn't available as a file. We would need to rely on scholarly knowledge about Sadoleto's letter to Geneva (1539).");
console.log("Sadoleto's letter was published in 'A Reformation Debate: Sadoleto's Letter to the Genevans and Calvin's Reply' (ed. John C. Olin).");

This strikes me as odd? Did Claude write a script to generate its own response? Have coders put something like this in as a gaurdrail?

edit: details about earlier convo

15 votes

Overfitting to theories of overfitting
~science
- mathematics
- statistics
Article 1064 words
10 comments

argmin.net

February 15

10 votes
What trustworthy resources are you using for AI/LLMs/ML education?

~tech Ask (recommendations)

Every company is trying to shoehorn AI into every product, and many online materials provide a general snake oil vibe, making it increasingly difficult to parse. So far, my primary sources have...

Every company is trying to shoehorn AI into every product, and many online materials provide a general snake oil vibe, making it increasingly difficult to parse. So far, my primary sources have been GitHub, Medium, and some YouTube.

My goal is to better understand the underlying technology so that I can manipulate it better, train models, and use it most effectively. This goes beyond just experimenting with prompts and trying to overcome guardrails. It includes running local, like Ollama on my M1 Max, which I'm not opposed to.

5 comments

GreasyGoose

January 24

8 votes
Task-Specific LLM Evals that Do & Don't Work

~comp Article 6254 words, published Mar 31 2024

2 comments

eugeneyan.com

December 9, 2024

4 votes
Someone made a dataset of one million Bluesky posts for 'machine learning research'
~tech
- social media
Link
30 comments

404media.co

November 27, 2024

20 votes
When Machine Learning Tells the Wrong Story
~comp
- security
- hardware
Article 7485 words
1 comment

jackcook.com

November 10, 2024

6 votes
Real-time speech-to-speech translation
~comp
- open source
Ask (recommendations)
Has anyone used a free, offline, open-source, real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)? There are a few libraries that written that...

Has anyone used a free, offline, open-source, real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)? There are a few libraries that written that purportedly can do or help with local speech-to-speech:
I'm looking for a simple app that can listen for English, translate into Korean (and other languages), then perform speech synthesis on the translation. Although real-time would be great, a short delay would work.

RTranslator is awkward (couldn't get it to perform speech-to-speech using a single phone). 3PO sprouts errors like dandelions and requires an online connection.

Any suggestions?
8 comments

DaveJarvis

October 25, 2024

6 votes
GSM-Symbolic: Understanding the limitations of mathematical reasoning in large language models

~tech Article 278 words

12 comments

apple.com

October 19, 2024

15 votes
On the path to delivering next generation UK weather forecasts

~tech Article 1297 words, published Sep 5 2024

2 comments

metoffice.gov.uk

September 11, 2024

7 votes
The LLMentalist effect: how chat-based large language models replicate the mechanisms of a psychic's con

~tech Article 4404 words, published Jul 4 2023

14 comments

softwarecrisis.dev

August 16, 2024

29 votes
Six distinct types of depression identified in Stanford Medicine-led study

~health.mental Article 886 words, published Jun 22 2023

26 comments

stanford.edu

July 17, 2024

51 votes
"Mechanistic interpretability" for LLMs, explained

~comp Article 3670 words

1 comment

Substack: Sean Trott

July 8, 2024

6 votes
Can I have some advice on the neural net I've been working on?
~comp
- programming.csharp
Ask (advice)
Apologies if this isn't an appropriate place to post this. Inspired by a paper I found a while back (https://publications.lib.chalmers.se/records/fulltext/215545/local_215545.pdf), I tried my hand...

Apologies if this isn't an appropriate place to post this.

Inspired by a paper I found a while back (https://publications.lib.chalmers.se/records/fulltext/215545/local_215545.pdf), I tried my hand at implementing a program (in C#) to create ASCII art from an image. It works pretty well, but like they observed in the paper, it's pretty slow to compare every tile to 90-some glyphs. In the paper, they make a decision tree to replicate this process at a faster speed.

Recently, I revisited this. I thought I'd try making a neural net, since I found the idea interesting. I've watched some videos on neural nets, and refreshed myself on my linear algebra, and I think I've gotten pretty close. That said, I feel like there's something I'm missing (especially given the fact that the loss isn't really decreasing). I think my problem is specifically during backpropagation.

Here is a link to the TrainAsync method in GitHub: https://github.com/bendstein/ImageToASCII/blob/1c2e2260f5d4cfb45443fac8737566141f5eff6e/LibI2A/Converter/NNConverter.cs#L164C59-L164C69. The forward and backward propagation methods are below it.

If anyone can give me any feedback or advice on what I might be missing, I'd really appreciate it.

6 comments

a_sharp_soprano_sax

July 7, 2024

14 votes
I will fucking piledrive you if you mention AI again

~comp Article 4269 words

32 comments

mataroa.blog

June 19, 2024

119 votes
Extracting interpretable features from Claude 3 Sonnet

~tech Article 219 words

5 comments

transformer-circuits.pub

May 22, 2024

13 votes
Hallucination-free RAG: Making LLMs safe for healthcare

~tech Article 2467 words, published Apr 21 2024

2 comments

mattyyeung.github.io

May 8, 2024

12 votes
Turning old maps into 3D digital models of lost neighborhoods

~tech Link

0 comments

osu.edu

April 20, 2024

9 votes
MDN’s AI Help and lucid lies
~comp
- web development
Article 1837 words
2 comments

seirdy.one

April 6, 2024

7 votes
Stability AI reportedly ran out of cash to pay its bills for rented cloudy GPUs
~tech
- amazon.web services
- google.cloud platform
Article 874 words
13 comments

theregister.com

April 4, 2024

28 votes
Noam Chomsky: The false promise of ChatGPT

~tech Article 1740 words, published Mar 8 2023

37 comments

The New York Times

March 31, 2024

30 votes
What useful tasks are possible with an LLM with only 3B parameters?

~comp Ask (advice)

Playing with Llama 7B and 13B, I found that the 13B model was capable of doing a simple task, rewriting titles in sentence case for Tildes submissions. The 7B model doesn't appear capable of the...

Playing with Llama 7B and 13B, I found that the 13B model was capable of doing a simple task, rewriting titles in sentence case for Tildes submissions. The 7B model doesn't appear capable of the same task, out of the box.

I heard about Android's new AICore available on a couple of new devices. But it sounds like Gemini Nano, which runs on-device, can only handle 2B or 3B parameters.

Is this size of model useful for real tasks? Does it only become useful after training on a specific domain? I'm a novice and wanting to learn a little bit about it. On-device AI is an appealing concept to me.

2 comments

talklittle

March 26, 2024

12 votes
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

~comp Article 2634 words

15 comments

arXiv

March 1, 2024

21 votes
Polymath - Toolkit to automatically segment music tracks and convert to MIDI

~comp Link

2 comments

GitHub: samim23

March 7, 2024

10 votes
What are some interesting machine learning research papers you found?

~tech Ask (survey)

Here's a place to share machine learning research papers that seem interesting to you. I'm no expert, but sometimes I skim them, and maybe there are some folks on Tilde who know more than I do?...

Here's a place to share machine learning research papers that seem interesting to you. I'm no expert, but sometimes I skim them, and maybe there are some folks on Tilde who know more than I do?

One paper per top-level post, and please link to arXiv (if relevant) and quote a bit of the abstract.

35 comments

skybrian

June 21, 2023

11 votes
Google Bard is now Gemini; Gemini Advanced launched
~tech
- google
Article 425 words
12 comments

blog.google

February 8, 2024

24 votes
Google's Gemini 1.5 Pro is a new, more efficient AI model
~tech
- google
Article 761 words
1 comment

Engadget

February 16, 2024

10 votes
Vesuvius Challenge 2023 Grand Prize awarded: we can read the first scroll!

~comp Article 4322 words

9 comments

scrollprize.org

February 7, 2024

34 votes
Why autonomous trucking is harder than autonomous rideshare

~transport Article 2273 words, published Jan 10 2024

25 comments

kevinchen.co

January 14, 2024

12 votes
"The AI revolution is rotten to the core"

~tech Video 1:18:39, published Sep 15 2023

34 comments

YouTube: Jimmy McGee

November 29, 2023

27 votes
Machine learning creates a massive map of smelly molecules
~science
- chemistry
Article 536 words
2 comments

scientificamerican.com

November 24, 2023

14 votes
The unstoppable rise of disposable ML frameworks

~comp Article 1613 words, published Oct 15 2023

5 comments

petewarden.com

November 14, 2023

10 votes
Return of the AI Megathread (#13) - news of chatbots, image generators, etc

~tech Text 22 words

I haven't done one of these since early July, but it seems like there's an uptick in news. Here's the previous one.

18 comments

skybrian

September 13, 2023

28 votes
Show Tildes: how I built the largest open database of Australian law
~comp
- open source
Article 1051 words
8 comments

umarbutler.com

October 29, 2023

28 votes
Jina AI releases first open source 8k embedding model

~comp Article 700 words

3 comments

jina.ai

October 28, 2023

8 votes
FedFingerprinting: A federated learning approach to website fingerprinting attacks in Tor networks
~tech
- internet
- security.cyber
Link
0 comments

ieee.org

August 3, 2023

6 votes
Meta is releasing AudioCraft: Generative AI for audio made simple and available to all
~tech
- facebook
Article 1543 words
23 comments

meta.com

August 2, 2023

34 votes
Megathread #12 for news/updates/discussion of AI chatbots and image generators

~tech Text 21 words

Haven't done one of these in a while, but there's a bit of news, so here's another. Here's the previous thread.

26 comments

skybrian

July 5, 2023

36 votes
A jargon-free explanation of how AI large language models work

~tech Article 538 words

12 comments

Ars Technica

July 31, 2023

40 votes
ChatGPT broke the Turing test but can't solve visual logic puzzles

~tech Article 3423 words

0 comments

Nature

July 27, 2023

11 votes
US federal aid is supercharging local Washington state police surveillance tech
~tech
- privacy
Link
0 comments

cascadepbs.org

July 26, 2023

11 votes
AI tools are designing entirely new proteins that could transform medicine
~science
- biology
Article 2627 words
2 comments

Nature

July 11, 2023

12 votes
Numerically Stable RWKV Language Model

~comp Article 814 words

4 comments

bolte.cc

June 25, 2023

11 votes
Anyone can Photoshop now, thanks to AI’s latest leap

~tech Article 1759 words

8 comments

The Washington Post

June 18, 2023

12 votes
Anyone know of research using GPTs for non-language tasks

~tech Ask

I've been a computer scientist in the field of AI for almost 15 years. Much of my time has been devoted to classical AI; things like planning, reasoning, clustering, induction, logic, etc. This...

I've been a computer scientist in the field of AI for almost 15 years. Much of my time has been devoted to classical AI; things like planning, reasoning, clustering, induction, logic, etc. This has included (but had rarely been my focus) machine learning tasks (lots of Case-Based Reasoning). For whatever reason though, the deep learning trend never really interested me until recently. It really just felt like they were claiming huge AI advancements when all they really found was an impressive way to store learned data (I know this is an understatement).

Over time my opinion on that has changed slightly, and I have been blown away with the boom that is happening with transformers (GPTs specifically) and large language models. Open source projects are creating models comparable to OpenAIs behemoths with far less training and parameters which is making me take another look into GPTs.

What I find surprising though is that they seem to have only experimented with language. As far as I understand the inputs/outputs, the language is tokenized into bytes before prediction anyway. Why does it seem like (or rather the community act like) the technology can only be used for LLMs?

For example, what about a planning domain? You can specify actions in a domain in such a manner that tokenization would be trivial, and have far fewer tokens then raw text. Similarly you could generate a near infinite amount of training data if you wanted via other planning algorithms or simulations. Is there some obvious flaw I'm not seeing? Other examples might include behavior and/or state prediction.

I'm not saying that out of the box a standard GPT architecture is a guaranteed success for plan learning/planning... But it seems like it should be viable and no one is trying?

10 comments

Beenrak

June 18, 2023

9 votes
Let's talk Local LLMs - So many questions

~tech Ask
Hello there (oh god, I am opening my first thread here - so exciting) I'd love to ask the people here about local LLMs. To be honest, I got interested in this topic, but am leaving reddit, where a...

Hello there
(oh god, I am opening my first thread here - so exciting)

I'd love to ask the people here about local LLMs.
To be honest, I got interested in this topic, but am leaving reddit, where a sub r/locallama exists.
I don't want to interact with that site anymore, so I am taking this here.

My questions, to start us off:
- Models are available on huggingface (among other places), but where do I get the underlying software? I read "oogabooga" somewhere, but honestly, I am lost.
- If I only want to USE a local model, what are the requirements, and how do I judge if I can use something from the values of "4bit / 8 bit" and "30B, 7B"??
- If I get crazy and want to TRAIN a LorA ... what then?
- Good resources / wiki pages, tutorials, etc?
25 comments

zielperson

June 14, 2023

21 votes
Megathread #11 for news/updates/discussion of AI chatbots and image generators

~tech Text 39 words

It's been six months since ChatGPT launched and about three months since I started posting these. I think it's getting harder to find new things to post about about AI, but here's another one...

It's been six months since ChatGPT launched and about three months since I started posting these. I think it's getting harder to find new things to post about about AI, but here's another one anyway.

Here's the previous thread.

40 comments

skybrian

June 3, 2023

27 votes