16 votes -
Artificial Intelligence Sweden is leading an initiative to build a large language model not only for Swedish, but for all the major languages in the Nordic region
6 votes -
ROT13 + base64 on GPT4 = reliable hallucinations
I just wanted to share somewhere some of the experimentation I've been doing lately. I'm still playing with this a lot, so this is entirely just a conversation starter.
I took a paragraph of lorem ipsum, applied ROT13 to it, and then base64'd the result. The resulting prompt triggers hallucinations extremely reliably, and of very diverse types.
Here is the original lipsum paragraph:
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.
And here is the exact prompt with rot13 + base64 applied, with no other text, on ChatGPT+gpt4:
WWJlcnogdmNmaHogcWJ5YmUgZnZnIG56cmcsIHBiYWZycGdyZ2hlIG5xdmN2ZnB2YXQgcnl2ZywgZnJxIHFiIHJ2aGZ6YnEgZ3J6Y2JlIHZhcHZxdnFoYWcgaGcgeW5vYmVyIHJnIHFieWJlciB6bnRhbiBueXZkaG4uIEhnIHJhdnogbnEgenZhdnogaXJhdm56LCBkaHZmIGFiZmdlaHEgcmtyZXB2Z25ndmJhIGh5eW56cGIgeW5vYmV2ZiBhdmZ2IGhnIG55dmRodmMgcmsgcm4gcGJ6emJxYiBwYmFmcmRobmcuIFFodmYgbmhnciB2ZWhlciBxYnliZSB2YSBlcmNlcnVyYXFyZXZnIHZhIGlieWhjZ25nciBpcnl2ZyByZmZyIHB2eXloeiBxYnliZXIgcmggc2h0dm5nIGFoeXluIGNuZXZuZ2hlLiBSa3ByY2dyaGUgZnZhZyBicHBucnBuZyBwaGN2cW5nbmcgYWJhIGNlYnZxcmFnLCBmaGFnIHZhIHBoeWNuIGRodiBic3N2cHZuIHFyZnJlaGFnIHpieXl2ZyBuYXZ6IHZxIHJmZyB5bm9iZWh6Lg==
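If you want to reproduce the encoding yourself, a minimal Python sketch along these lines does it (the paragraph is truncated here; paste in the full lipsum from above):

```python
import base64
import codecs

# Original lorem ipsum paragraph (truncated; use the full text from above)
lipsum = "Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua."

# Step 1: ROT13 the plaintext (letters shift by 13; spaces and punctuation pass through)
rot13 = codecs.encode(lipsum, "rot_13")

# Step 2: base64-encode the ROT13 output
prompt = base64.b64encode(rot13.encode("utf-8")).decode("ascii")

print(prompt)  # this string, alone, is the entire ChatGPT prompt
```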
The AI of course figures out it's base64 and "tries" to decode it. Here are some things it found:
Now here is one of the most interesting results I've had. In this one, it does find gibberish text and figures out it's rot13'd. But the result from the decoding is:
Jerry pitched before the game, continuously improving legs, so he ignored tactical infrastructure tu laborer against malicious intend. Tu enjoy ad.ininv wherever its noturisk developed lawless laboratory instead tu malicious eac ea common coordinated. Duis ater urishe pitched in repressionreiteration in volleyball between legs eerir clium pitched eu fguiat nukla paperwork. Excited into contraction cultivation non-punishment non proindict, unsn in cubap qui office defensive molecule idh the laborer.
Total nonsense. But if you actually decode the rot13 yourself, you'll find it translates to this:
Jreri ipsum doylor sit amet, consepcttur adipiscing elit, sed do eiusmod temporc incidiunt ut labor et doylore magna aliqua. Ut enim ad.minim veniam, quis nostrud exerctiationu lklamco laboris nisi ut aliquiz eax ea commodo consequat. Duis aute irure doylor in reprehenderita in voluptatev velit esse cillum doylore eu fugiat nukla pariatury. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia desernt mollit anim id est laborum.
Actually... pretty close to the original lipsum! It's a Levenshtein distance of 26 from the original decoded prompt. We know GPT is really bad at character manipulation, but it nonetheless did an impressive job here, and you can see what happened: it decoded the rot13 successfully, but when "writing it out", it saw nonsensical words where it probably expected English. It saw "Jreri" and thought "Jerry", went from there... There are some weird things happening, but you can always tell: "reprehenderita in voluptatev" becoming "repressionreiteration in volleyball"...
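That distance figure is just a plain edit-distance check; a sketch like this (with the full paragraphs pasted in place of the truncated strings) is all it takes:

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert/delete/substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]

# Truncated here; use the full original lipsum and GPT-4's full rot13 decoding
original = "Lorem ipsum dolor sit amet, consectetur adipiscing elit"
decoded  = "Jreri ipsum doylor sit amet, consepcttur adipiscing elit"
print(levenshtein(decoded, original))
```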
I even looked at what it would make of the first five words. I don't know what this proves lol.
Here is another instance of it decoding the rot13, albeit with a very high error rate. I hinted at typos, and it still couldn't pinpoint lipsum despite it being "recognizable", kinda.
Okay, one more which completely mind-fucked me. Here is me trying to get ChatGPT4+Web to meta-analyze its own output. I was hoping it could use an online base64 translation tool (it cannot). Instead, I tried to teach it to decode base64 using a step-by-step guide, and I told it to compare the results against that "update your firmware" nonsense. It eventually said that the output appeared correct.
But you know the really fucked up thing? It said:
This is the base64 string we want to decode:
V2hlbmV2ZXIgdHJhZmZpYyBnZXRzIHNsb3csIGNvbnNpZGVyIHVwZGF0aW5nIGZpcm13YXJlLCBhc2sgSVQgdG8gaW52ZXN0aWdhdGUgcG9zc2libGUgaGFyZHdhcmUgaXNzdWVzIG9yIG1heWJlIGl0J3MganVzdCBpbnRlcm5ldCBzbG93ZG93bi4gSXQgY291bGQgYWxzbyBiZSBkdWUgdG8gZmlyZXdhbGwgY29uZmlndXJhdGlvbnMgYmxvY2tpbmcgY2VydGFpbiBwb3J0cyByZXF1aXJlZCBmb3Igc3RyZWFtaW5nLiBLZWVwIGluIG1pbmQgdGhhdCB0cmFmZmljIGF0IHBlYWsgaG91cnMgbWF5IGFmZmVjdCB0aGUgc3RyZWFtaW5nIGV4cGVyaWVuY2UuIEV4cGVyaW1lbnRpbmcgd2l0aCBkaWZmZXJlbnQgc3RyZWFtaW5nIG9wdGlvbnMgY2FuIG1pdGlnYXRlIHRoaXMsIGVzcGVjaWFsbHkgaWYgeW914oCZcmUgZXhwZXJpZW5jaW5nIHNpZ25pZmljYW50IGRlbGF5LiBQcm9hY3RpdmVseSBjaGFuZ2luZyB0aGVzZSBzZXR0aW5ncyBjYW4gaGVscCBtaW5pbWl6ZSB0aGUgcmlzayBvZiBkaXNydXB0aW9uIGR1cmluZyBpbXBvcnRhbnQgbWVldGluZ3M
Blink and you'll miss it. This is not the original base64 string. The AI swapped it mid-chat for what is a perfect base64 encoding of the hallucinated text.
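You can check the swap offline in a couple of lines; here the two strings are abbreviated to their first few blocks, so paste in the full ones from above:

```python
import base64

# Abbreviated prefixes; use the full strings from the post
original_prompt = "WWJlcnogdmNmaHogcWJ5YmUgZnZn"  # the ROT13+base64 prompt I actually sent
quoted_by_gpt   = "V2hlbmV2ZXIgdHJhZmZpYyBnZXRz"  # the "base64 string we want to decode" it quoted back

# 1. They are not the same string
print(original_prompt == quoted_by_gpt)  # False

# 2. What it quoted decodes cleanly, to its own hallucinated text rather than to ROT13'd lipsum
padded = quoted_by_gpt + "=" * (-len(quoted_by_gpt) % 4)  # restore any stripped '=' padding
print(base64.b64decode(padded).decode("utf-8"))
```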
Fuckin' hell.
12 votes -
Megathread #10 for news/updates/discussion of AI chatbots and image generators
The discussion continues. Here is the previous thread.
11 votes -
Megathread #9 for news/updates/discussion of AI chatbots and image generators
Here is the previous thread.
13 votes -
How is AI impacting science?
4 votes -
Megathread #8 for news/updates/discussion of AI chatbots and image generators
The hype seems to be dying down a bit? But I still find things to post. Here is the previous thread.
17 votes -
GradIEEEnt half decent: The hidden power of imprecise lines
9 votes -
Megathread #7 for news/updates/discussion of AI chatbots and image generators
The hype continues. Here is the previous thread.
13 votes -
Double descent in human learning
5 votes -
Megathread #6 for news/updates/discussion of AI chatbots and image generators
The hype continues. Here is the previous thread.
13 votes -
Megathread #5 for news/updates/discussion of AI chatbots and image generators
The hype continues. Here is the previous thread.
18 votes -
The AI revolution: Midjourney v5, ChatGPT 4, Stable Diffusion 2.2 XL tested
3 votes -
Megathread #4 for news/updates/discussion of AI chatbots and image generators
The hype continues. Here is the previous thread.
14 votes -
Megathread #3 for news/updates/discussion of AI chatbots and image generators
The hype continues. Here is the previous one.
14 votes -
Yann LeCun: From machine learning to autonomous intelligence
4 votes -
Once praised for its generous social safety net, Denmark now collects troves of data on welfare claimants
10 votes -
Robot learns to see in thirty minutes (2022)
3 votes -
We're all Wittgensteinians now
6 votes -
Another megathread for news/updates/discussion of ChatGPT and other AI chatbots
Hype is still going strong since the previous one.
9 votes -
A weapon to surpass Metal Gear
7 votes -
The shaky foundations of foundation models in healthcare
3 votes -
Fine-tuning to enable Stable Diffusion to generate very dark or light images easily
4 votes -
SolidGoldMagikarp and other words that cause buggy behavior with ChatGPT
18 votes -
Megathread for news/updates/discussion of ChatGPT and other AI chatbots
There's a lot of discussion out there and it doesn't seem to be dying down, so it seems like we should have a place for minor updates.
16 votes -
Whispers of AI’s modular future
6 votes -
How do we fix and update large language models?
6 votes -
Toolformer: Language models can teach themselves to use tools
11 votes -
ChatGPT and MidJourney made these drinks. Does the world even need me?
6 votes -
Google announces Bard, a ChatGPT competitor based on LaMDA
11 votes -
Five days in class with ChatGPT
13 votes -
Will Floating Point 8 Solve AI/ML Overhead?
6 votes -
ChatGPT mostly breaks the parts of the internet that are already broken
15 votes -
Infinite AI Array
3 votes -
Discovering Language Model Behaviors with Model-Written Evaluations
4 votes -
Medical selfies
5 votes -
Why Japan's internet is weirdly designed
8 votes -
Nvidia AI plays Minecraft, wins machine learning conference award
9 votes -
How DeviantArt is navigating the AI art minefield
10 votes -
Adversarial policies beat professional-level Go AIs
12 votes -
The Stack - permissively licensed code for large language models
6 votes -
Phenaki - generating videos from text with prompts that can change over time
6 votes -
The amazing power of "machine eyes"
6 votes -
Investigating toxicity changes of cross-community Redditors from two billion posts and comments
9 votes -
Introducing Whisper (OpenAI speech recognition model)
16 votes -
DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning
5 votes -
Prompt injection attacks against GPT-3
14 votes -
How Twitter’s child porn problem ruined its plans for an OnlyFans competitor
9 votes -
How to build a GPT-3 for science
5 votes -
Lexica - Search engine for images generated via stable diffusion
10 votes