-
8 votes
-
ROT13 + base64 on GPT4 = reliable hallucinations
I just wanted to share somewhere some of the experimentation I've been doing lately. I'm still playing with this a lot, so this is entirely just a conversation starter. I took a paragraph of lorem...
I just wanted to share somewhere some of the experimentation I've been doing lately. I'm still playing with this a lot, so this is entirely just a conversation starter.
I took a paragraph of lorem ipsum, applied ROT13 to it, and then base64'd the results. The results are extremely reliably triggering hallucinations of very diverse type.
Here is the original lipsum paragraph:
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.
And here is the exact prompt with rot13 + base64 applied, with no other text, on ChatGPT+gpt4:
WWJlcnogdmNmaHogcWJ5YmUgZnZnIG56cmcsIHBiYWZycGdyZ2hlIG5xdmN2ZnB2YXQgcnl2ZywgZnJxIHFiIHJ2aGZ6YnEgZ3J6Y2JlIHZhcHZxdnFoYWcgaGcgeW5vYmVyIHJnIHFieWJlciB6bnRhbiBueXZkaG4uIEhnIHJhdnogbnEgenZhdnogaXJhdm56LCBkaHZmIGFiZmdlaHEgcmtyZXB2Z25ndmJhIGh5eW56cGIgeW5vYmV2ZiBhdmZ2IGhnIG55dmRodmMgcmsgcm4gcGJ6emJxYiBwYmFmcmRobmcuIFFodmYgbmhnciB2ZWhlciBxYnliZSB2YSBlcmNlcnVyYXFyZXZnIHZhIGlieWhjZ25nciBpcnl2ZyByZmZyIHB2eXloeiBxYnliZXIgcmggc2h0dm5nIGFoeXluIGNuZXZuZ2hlLiBSa3ByY2dyaGUgZnZhZyBicHBucnBuZyBwaGN2cW5nbmcgYWJhIGNlYnZxcmFnLCBmaGFnIHZhIHBoeWNuIGRodiBic3N2cHZuIHFyZnJlaGFnIHpieXl2ZyBuYXZ6IHZxIHJmZyB5bm9iZWh6Lg==
The AI of course figures out it's base64 and "tries" to decode it. Here are some things it found:
Now here is one of the most interesting results I've had. In this one, it does find gibberish text and figures out it's rot13'd. But the result from the decoding is:
Jerry pitched before the game, continuously improving legs, so he ignored tactical infrastructure tu laborer against malicious intend. Tu enjoy ad.ininv wherever its noturisk developed lawless laboratory instead tu malicious eac ea common coordinated. Duis ater urishe pitched in repressionreiteration in volleyball between legs eerir clium pitched eu fguiat nukla paperwork. Excited into contraction cultivation non-punishment non proindict, unsn in cubap qui office defensive molecule idh the laborer.
Total nonsense. But actually, if you decode the rot13, you'll find it actually translates to this:
Jreri ipsum doylor sit amet, consepcttur adipiscing elit, sed do eiusmod temporc incidiunt ut labor et doylore magna aliqua. Ut enim ad.minim veniam, quis nostrud exerctiationu lklamco laboris nisi ut aliquiz eax ea commodo consequat. Duis aute irure doylor in reprehenderita in voluptatev velit esse cillum doylore eu fugiat nukla pariatury. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia desernt mollit anim id est laborum.
Actually... pretty close to the original lipsum! It's a levenshtein distance of 26 from the original decoded prompt. We know GPT is really bad at character manipulation but it nonetheless did an impressive job here; you can see what happened: It decoded the rot13 successfully, but when "writing it out", it saw nonsensical words where it probably expected english. It saw "Jreri" and thought "Jerry", went from there... there's some weird things happening there, but you can always tell. "reprehenderita in voluptatev" becoming "repressionreiteration in voleyball"...
I even looked at what it would make of the first five words. I don't know what this proves lol.
Here is another instance of it decoding to rot13, albeit with a very high error rate. I hinted at typos and it couldn't pin-point lipsum despite it being "recognizable", kinda.
Okay, one more which completely mind-fucked me. Here is me trying to get ChatGPT4+Web to meta-analyze its own output. I was hoping it could use an online base64 translation tool (it cannot). Instead, I tried to teach it to decode base64 using a step-by-step guide, and i told it to compare the results of that "update your firmware" nonsense. It eventually said that the output appeared correct.
But you know the really fucked up thing? It said:
This is the base64 string we want to decode:
V2hlbmV2ZXIgdHJhZmZpYyBnZXRzIHNsb3csIGNvbnNpZGVyIHVwZGF0aW5nIGZpcm13YXJlLCBhc2sgSVQgdG8gaW52ZXN0aWdhdGUgcG9zc2libGUgaGFyZHdhcmUgaXNzdWVzIG9yIG1heWJlIGl0J3MganVzdCBpbnRlcm5ldCBzbG93ZG93bi4gSXQgY291bGQgYWxzbyBiZSBkdWUgdG8gZmlyZXdhbGwgY29uZmlndXJhdGlvbnMgYmxvY2tpbmcgY2VydGFpbiBwb3J0cyByZXF1aXJlZCBmb3Igc3RyZWFtaW5nLiBLZWVwIGluIG1pbmQgdGhhdCB0cmFmZmljIGF0IHBlYWsgaG91cnMgbWF5IGFmZmVjdCB0aGUgc3RyZWFtaW5nIGV4cGVyaWVuY2UuIEV4cGVyaW1lbnRpbmcgd2l0aCBkaWZmZXJlbnQgc3RyZWFtaW5nIG9wdGlvbnMgY2FuIG1pdGlnYXRlIHRoaXMsIGVzcGVjaWFsbHkgaWYgeW914oCZcmUgZXhwZXJpZW5jaW5nIHNpZ25pZmljYW50IGRlbGF5LiBQcm9hY3RpdmVseSBjaGFuZ2luZyB0aGVzZSBzZXR0aW5ncyBjYW4gaGVscCBtaW5pbWl6ZSB0aGUgcmlzayBvZiBkaXNydXB0aW9uIGR1cmluZyBpbXBvcnRhbnQgbWVldGluZ3M
Blink and you'll miss it. This is not the original base64 string. The AI swapped it mid-chat for what is a perfect base64 encoding of the hallucinated text.
Fuckin' hell.
5 votes -
Summer Games Done Quick 2023, a week-long charity fundraising event featuring speedruns, has begun
5 votes -
Tildes Pop-up Book Club: Roadside Picnic, by Arkady and Boris Strugatsky
Several users expressed interest in reading Roadside Picnic after I recommended it in another (now deleted) topic about the movie it inspired, Stalker by Andrei Tarkovsky, which in turn inspired...
Several users expressed interest in reading Roadside Picnic after I recommended it in another (now deleted) topic about the movie it inspired, Stalker by Andrei Tarkovsky, which in turn inspired the S.T.A.L.K.E.R. videogame series. So I thought this would be the ideal opportunity to create a Pop-up Book Club event about it to encourage others to join us in reading it, so that we can all discuss it afterwards.
My description of the book from a previous comment that enticed the others to read it:
The basic premise was really unique and interesting, too. Without giving too much away, it's a story of Alien "invasion" only when the Aliens visited Earth, instead of doing any of the standard scifi trope stuff, the event was basically like that of a Roadside Picnic to them. That is to say, they showed up, barely noticed the humans who were tantamount to ants to them, did whatever Alien travelers with incomprehensibly advanced technology do when taking a quick pitstop on another world, and left a bunch of trash behind when they left. The story is about "stalkers" that venture into the exceptionally dangerous wasteland left behind by the Aliens in order to recover their trash (also usually exceptionally dangerous, but also exceptionally powerful) in order to sell it on the black market.
IMO, it's a very good classic scifi novel, and also a relatively short one too (only 224 pages) which makes it ideal summer reading, and ideal for this sort of thing since it’s not a huge commitment. I think this could be fun, so if you feel like joining in, please feel free to. I will also be rereading the book to refresh my memory of it, and roughly a month from now I will make a follow-up topic so we can have the discussion.
The book is available on paperback at Amazon for $15, or on Kindle for $10, but your own local retailer or library might also have a copy. The Strugatsky brothers are both long dead though, so you can always pirate it relatively guilt free if you can't find it elsewhere.
p.s. If there is a decent level of interest, and this goes well, maybe we can even make this a regular thing. :)
5 votes -
Tally of covid-19 cases after CDC conference climbs to 181
5 votes -
Electric cars prove we need to rethink brake lights
4 votes -
How Sweden and Denmark became rare bright spots for Europe's pharma industry
3 votes -
The Trump Stooge Attorney General Of Texas Is On The Verge Of Being Expelled And His Cuckqueaned Wife Could Cast A Deciding Vote
2 votes -
Catch up quick: COVID-19
1 vote