Activity

Votes

Comments

New

All activity

Showing only topics in ~comp with the tag "machine learning". Back to normal view / Search all groups

The spy who came in from the WiFi: Beware of radio network surveillance!
- security.cyber
- privacy
Article 740 words
18 comments

alphagalileo.org

November 13

27 votes
Can AI tell if I'm writing AI slop? A machine learning journey

Article 1708 words

3 comments

mattsayar.com

September 15

21 votes
Counting all of the yurts in Mongolia using machine learning

Article 3450 words

3 comments

monroeclinton.com

June 18

15 votes
Visualising how AI training avoids getting stuck in local minima

Video 22:57

8 comments

YouTube: Welch Labs

May 11

18 votes

Funny output from Claude.ai: a codeblock in a question about an early rennaissance theological work

programming

Ask

I asked Claude for references to some ideas the ai had output earlier about a 16th century cardinal (Sadoleto). It's response was, Let me analyze the letter, and then this: async function...

I asked Claude for references to some ideas the ai had output earlier about a 16th century cardinal (Sadoleto). It's response was, Let me analyze the letter, and then this:

async function searchForSadoletoLetter() {
  try {
    const files = ['sadoleto_letter.txt', 'sadoleto.txt', 'sadoleto_to_geneva.txt', 'letter_to_geneva.txt'];
    
    for (const filename of files) {
      try {
        const content = await window.fs.readFile(filename, { encoding: 'utf8' });
        console.log(`Found file: ${filename}`);
        console.log("First 500 characters:");
        console.log(content.substring(0, 500));
        return content;
      } catch (err) {
        // File not found, try the next one
      }
    }
    console.log("Could not find any files containing Sadoleto's letter");
    return null;
  } catch (err) {
    console.error("Error searching for files:", err);
    return null;
  }
}

const content = await searchForSadoletoLetter();

// Since we don't have the letter available as a file, let's search for published versions online
console.log("The letter isn't available as a file. We would need to rely on scholarly knowledge about Sadoleto's letter to Geneva (1539).");
console.log("Sadoleto's letter was published in 'A Reformation Debate: Sadoleto's Letter to the Genevans and Calvin's Reply' (ed. John C. Olin).");

This strikes me as odd? Did Claude write a script to generate its own response? Have coders put something like this in as a gaurdrail?

edit: details about earlier convo

15 votes

Task-Specific LLM Evals that Do & Don't Work

Article 6254 words, published Mar 31 2024

2 comments

eugeneyan.com

December 9, 2024

4 votes
When Machine Learning Tells the Wrong Story
- security
- hardware
Article 7485 words
1 comment

jackcook.com

November 10, 2024

6 votes
Real-time speech-to-speech translation
- open source
Ask (recommendations)
Has anyone used a free, offline, open-source, real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)? There are a few libraries that written that...

Has anyone used a free, offline, open-source, real-time speech-to-speech translation app on under-powered devices (i.e., older smart phones)? There are a few libraries that written that purportedly can do or help with local speech-to-speech:
I'm looking for a simple app that can listen for English, translate into Korean (and other languages), then perform speech synthesis on the translation. Although real-time would be great, a short delay would work.

RTranslator is awkward (couldn't get it to perform speech-to-speech using a single phone). 3PO sprouts errors like dandelions and requires an online connection.

Any suggestions?
8 comments

DaveJarvis

October 25, 2024

6 votes
"Mechanistic interpretability" for LLMs, explained

Article 3670 words

1 comment

Substack: Sean Trott

July 8, 2024

6 votes
Can I have some advice on the neural net I've been working on?
- programming.csharp
Ask (advice)
Apologies if this isn't an appropriate place to post this. Inspired by a paper I found a while back (https://publications.lib.chalmers.se/records/fulltext/215545/local_215545.pdf), I tried my hand...

Apologies if this isn't an appropriate place to post this.

Inspired by a paper I found a while back (https://publications.lib.chalmers.se/records/fulltext/215545/local_215545.pdf), I tried my hand at implementing a program (in C#) to create ASCII art from an image. It works pretty well, but like they observed in the paper, it's pretty slow to compare every tile to 90-some glyphs. In the paper, they make a decision tree to replicate this process at a faster speed.

Recently, I revisited this. I thought I'd try making a neural net, since I found the idea interesting. I've watched some videos on neural nets, and refreshed myself on my linear algebra, and I think I've gotten pretty close. That said, I feel like there's something I'm missing (especially given the fact that the loss isn't really decreasing). I think my problem is specifically during backpropagation.

Here is a link to the TrainAsync method in GitHub: https://github.com/bendstein/ImageToASCII/blob/1c2e2260f5d4cfb45443fac8737566141f5eff6e/LibI2A/Converter/NNConverter.cs#L164C59-L164C69. The forward and backward propagation methods are below it.

If anyone can give me any feedback or advice on what I might be missing, I'd really appreciate it.

6 comments

a_sharp_soprano_sax

July 7, 2024

14 votes
I will fucking piledrive you if you mention AI again

Article 4269 words

32 comments

mataroa.blog

June 19, 2024

119 votes
MDN’s AI Help and lucid lies

Article 1837 words

2 comments

seirdy.one

April 6, 2024

7 votes
What useful tasks are possible with an LLM with only 3B parameters?

Ask (advice)

Playing with Llama 7B and 13B, I found that the 13B model was capable of doing a simple task, rewriting titles in sentence case for Tildes submissions. The 7B model doesn't appear capable of the...

Playing with Llama 7B and 13B, I found that the 13B model was capable of doing a simple task, rewriting titles in sentence case for Tildes submissions. The 7B model doesn't appear capable of the same task, out of the box.

I heard about Android's new AICore available on a couple of new devices. But it sounds like Gemini Nano, which runs on-device, can only handle 2B or 3B parameters.

Is this size of model useful for real tasks? Does it only become useful after training on a specific domain? I'm a novice and wanting to learn a little bit about it. On-device AI is an appealing concept to me.

2 comments

talklittle

March 26, 2024

12 votes
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Article 2634 words

15 comments

arXiv

March 1, 2024

21 votes
Polymath - Toolkit to automatically segment music tracks and convert to MIDI

Link

2 comments

GitHub: samim23

March 7, 2024

10 votes
Vesuvius Challenge 2023 Grand Prize awarded: we can read the first scroll!

Article 4322 words

9 comments

scrollprize.org

February 7, 2024

34 votes
The unstoppable rise of disposable ML frameworks

Article 1613 words, published Oct 15 2023

5 comments

petewarden.com

November 14, 2023

10 votes
Show Tildes: how I built the largest open database of Australian law
- open source
Article 1051 words
8 comments

umarbutler.com

October 29, 2023

28 votes
Jina AI releases first open source 8k embedding model

Article 700 words

3 comments

jina.ai

October 28, 2023

8 votes
Numerically Stable RWKV Language Model

Article 814 words

4 comments

bolte.cc

June 25, 2023

11 votes
GradIEEEnt half decent: The hidden power of imprecise lines

Video 55:02

0 comments

YouTube: suckerpinch

May 2, 2023

9 votes
Will Floating Point 8 Solve AI/ML Overhead?
- hardware
Article 2930 words
1 comment

semiengineering.com

January 14, 2023

6 votes
Infinite AI Array

Article 433 words

0 comments

ianbicking.org

January 3, 2023

3 votes
Introducing Whisper (OpenAI speech recognition model)

Article 357 words

16 comments

openai.com

September 21, 2022

16 votes
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

Article published Jun 20 2020

1 comment

arXiv

September 20, 2022

5 votes
An experiment to test GitHub Copilot's legality
- open source
Article 1301 words
22 comments

seirdy.one

July 2, 2022

11 votes
GitHub Copilot - Your AI pair programmer
- programming
Link
11 comments

GitHub

June 30, 2021

20 votes
Uppestcase and Lowestcase Letters [advances in derp learning]

Video 24:13, published Apr 1 2021

1 comment

YouTube: suckerpinch

April 7, 2021

11 votes
Exploiting machine learning models distributed as Python pickle files, and introducing Fickling: a new tool for analyzing and modifying pickle bytecode
- security
Article 1798 words
0 comments

trailofbits.com

March 15, 2021

3 votes
Nx (Numerical Elixir) is now publicly available
- programming languages
Article 1804 words, published Feb 18 2021
1 comment

dashbit.co

February 21, 2021

7 votes
Researching the potential of using machine learning to predict random number generation
- security
Article 6572 words
0 comments

airza.net

November 10, 2020

11 votes
Musings on Typicality

Article 4169 words, published Aug 31 2020

1 comment

benanne.github.io

September 24, 2020

3 votes
Neuroevolution of Self-Interpretable Agents

Article 8718 words, published Mar 18 2020

1 comment

attentionagent.github.io

April 5, 2020

4 votes
AutoML-Zero: Evolving Machine Learning Algorithms From Scratch

Article

1 comment

arXiv

March 12, 2020

5 votes
A new model and dataset for long-range memory

Article 2956 words, published Jan 15 2020

2 comments

deepmind.com

February 11, 2020

7 votes
When artificial intelligence lost in translation is
- nsfw.nudity
Article 1783 words
1 comment

The Correspondent

January 24, 2020

9 votes
Play Chess against GPT-2
- programming
Tweet
@theshawwn: I am preparing to release a notebook where you can play chess vs GPT-2. If anyone wants to help beta test it: 1. visit https://t.co/CpWrFvtnY2 2. open in playground mode 3. click Runtime -> Run All 4. Scroll to the bottommost cell and wait 6 minutes If you get stuck, tell me.

4 comments

Twitter: theshawwn

January 7, 2020

5 votes
A Look at Cerebras Wafer-Scale Engine: Half Square Foot Silicon Chip

Article 1032 words, published Nov 16 2019

1 comment

wikichip.org

November 29, 2019

6 votes
OpenAI releases the largest version (1.5B parameters) of their GPT-2 language model, along with code and model weights

Article 746 words

2 comments

openai.com

November 5, 2019

11 votes
OpenAI Plays Hide and Seek…and Breaks The Game!

Video 6:08

12 comments

YouTube: Two Minute Papers

October 22, 2019

19 votes
GPT-2 is not as dangerous as OpenAI thought

Article 1426 words

0 comments

tumblr.com

September 10, 2019

5 votes
Specification Gaming Examples in AI

Link

1 comment

google.com

August 24, 2019

10 votes
Puffer, a machine learning research study by Stanford University which allows you to stream live TV in your browser

Link

2 comments

stanford.edu

July 22, 2019

13 votes
Ludwig: Uber open sourced a config-based deep learning tool
- open source
Article 2302 words, published Feb 11 2019
1 comment

uber.com

July 15, 2019

4 votes
Facebook and Carnegie Mellon's "Pluribus", the first AI to defeat professionals in 6-player poker

Article 3844 words

0 comments

facebook.com

July 11, 2019

8 votes
Generative Adversarial Networks - The story so far

Article 4204 words, published Jun 21 2019

0 comments

floydhub.com

June 24, 2019

6 votes
Generating YouTube Titles Using Image Captioning

Article 1030 words, published May 11 2019

1 comment

darshancrout.ai

May 24, 2019

4 votes
Few-Shot Adversarial Learning of Realistic Neural Talking Head Models

Video 5:34

1 comment

YouTube: Egor Zakharov

May 23, 2019

4 votes
Synthetic Sensors: Towards General-Purpose Sensing

PDF

1 comment

gierad.com

March 21, 2019

4 votes
Tutorial on Automatic Machine Learning (NeurIPS2018)

Link

1 comment

videoken.com

February 13, 2019

5 votes