Topics in ~tech

Activity

Votes

Comments

New

All activity

Showing only topics in ~tech with the tag "machine learning". Back to normal view / Search all groups

GSM-Symbolic: Understanding the limitations of mathematical reasoning in large language models

Article 278 words

12 comments

apple.com

October 19

15 votes
On the path to delivering next generation UK weather forecasts

Article 1297 words, published Sep 5 2024

2 comments

metoffice.gov.uk

September 11

7 votes
The LLMentalist effect: how chat-based large language models replicate the mechanisms of a psychic's con

Article 4404 words, published Jul 4 2023

14 comments

softwarecrisis.dev

August 16

29 votes
Microsoft CEO of AI claims online content is 'freeware' [and can be used to train LLMs in the absence of a specific directives from the author against this]
- microsoft
- internet
Article 721 words
40 comments

theregister.com

June 30

43 votes
Extracting interpretable features from Claude 3 Sonnet

Article 219 words

5 comments

transformer-circuits.pub

May 22

13 votes
Hallucination-free RAG: Making LLMs safe for healthcare

Article 2467 words, published Apr 21 2024

2 comments

mattyyeung.github.io

May 8

12 votes
Turning old maps into 3D digital models of lost neighborhoods

Link

0 comments

osu.edu

April 20

9 votes
Stability AI reportedly ran out of cash to pay its bills for rented cloudy GPUs
- amazon.web services
- google.cloud platform
Article 874 words
13 comments

theregister.com

April 4

28 votes
Noam Chomsky: The false promise of ChatGPT

Article 1740 words, published Mar 8 2023

37 comments

The New York Times

March 31

30 votes
What are some interesting machine learning research papers you found?

Ask (survey)

Here's a place to share machine learning research papers that seem interesting to you. I'm no expert, but sometimes I skim them, and maybe there are some folks on Tilde who know more than I do?...

Here's a place to share machine learning research papers that seem interesting to you. I'm no expert, but sometimes I skim them, and maybe there are some folks on Tilde who know more than I do?

One paper per top-level post, and please link to arXiv (if relevant) and quote a bit of the abstract.

35 comments

skybrian

June 21, 2023

11 votes
Google Bard is now Gemini; Gemini Advanced launched
- google.bard
- google.gemini
Article 425 words
12 comments

blog.google

February 8

24 votes
Google's Gemini 1.5 Pro is a new, more efficient AI model
- google.gemini
Article 761 words
1 comment

Engadget

February 16

10 votes
"The AI revolution is rotten to the core"

Video 1:18:39, published Sep 15 2023

34 comments

YouTube: Jimmy McGee

November 29, 2023

27 votes
Return of the AI Megathread (#13) - news of chatbots, image generators, etc

Text 22 words

I haven't done one of these since early July, but it seems like there's an uptick in news. Here's the previous one.

18 comments

skybrian

September 13, 2023

28 votes
FedFingerprinting: A federated learning approach to website fingerprinting attacks in Tor networks
- security.cyber
Link
0 comments

ieee.org

August 3, 2023

6 votes
Meta is releasing AudioCraft: Generative AI for audio made simple and available to all
- facebook
Article 1543 words
23 comments

meta.com

August 2, 2023

34 votes
Megathread #12 for news/updates/discussion of AI chatbots and image generators

Text 21 words

Haven't done one of these in a while, but there's a bit of news, so here's another. Here's the previous thread.

27 comments

skybrian

July 5, 2023

36 votes
A jargon-free explanation of how AI large language models work

Article 538 words

12 comments

Ars Technica

July 31, 2023

40 votes
ChatGPT broke the Turing test but can't solve visual logic puzzles

Article 3423 words

0 comments

Nature

July 27, 2023

11 votes
US federal aid is supercharging local Washington state police surveillance tech
- privacy
Link
0 comments

cascadepbs.org

July 26, 2023

11 votes
Anyone can Photoshop now, thanks to AI’s latest leap

Article 1759 words

8 comments

The Washington Post

June 18, 2023

12 votes
Anyone know of research using GPTs for non-language tasks

Ask

I've been a computer scientist in the field of AI for almost 15 years. Much of my time has been devoted to classical AI; things like planning, reasoning, clustering, induction, logic, etc. This...

I've been a computer scientist in the field of AI for almost 15 years. Much of my time has been devoted to classical AI; things like planning, reasoning, clustering, induction, logic, etc. This has included (but had rarely been my focus) machine learning tasks (lots of Case-Based Reasoning). For whatever reason though, the deep learning trend never really interested me until recently. It really just felt like they were claiming huge AI advancements when all they really found was an impressive way to store learned data (I know this is an understatement).

Over time my opinion on that has changed slightly, and I have been blown away with the boom that is happening with transformers (GPTs specifically) and large language models. Open source projects are creating models comparable to OpenAIs behemoths with far less training and parameters which is making me take another look into GPTs.

What I find surprising though is that they seem to have only experimented with language. As far as I understand the inputs/outputs, the language is tokenized into bytes before prediction anyway. Why does it seem like (or rather the community act like) the technology can only be used for LLMs?

For example, what about a planning domain? You can specify actions in a domain in such a manner that tokenization would be trivial, and have far fewer tokens then raw text. Similarly you could generate a near infinite amount of training data if you wanted via other planning algorithms or simulations. Is there some obvious flaw I'm not seeing? Other examples might include behavior and/or state prediction.

I'm not saying that out of the box a standard GPT architecture is a guaranteed success for plan learning/planning... But it seems like it should be viable and no one is trying?

10 comments

Beenrak

June 18, 2023

9 votes
Let's talk Local LLMs - So many questions

Ask
Hello there (oh god, I am opening my first thread here - so exciting) I'd love to ask the people here about local LLMs. To be honest, I got interested in this topic, but am leaving reddit, where a...

Hello there
(oh god, I am opening my first thread here - so exciting)

I'd love to ask the people here about local LLMs.
To be honest, I got interested in this topic, but am leaving reddit, where a sub r/locallama exists.
I don't want to interact with that site anymore, so I am taking this here.

My questions, to start us off:
- Models are available on huggingface (among other places), but where do I get the underlying software? I read "oogabooga" somewhere, but honestly, I am lost.
- If I only want to USE a local model, what are the requirements, and how do I judge if I can use something from the values of "4bit / 8 bit" and "30B, 7B"??
- If I get crazy and want to TRAIN a LorA ... what then?
- Good resources / wiki pages, tutorials, etc?
25 comments

zielperson

June 14, 2023

21 votes
Megathread #11 for news/updates/discussion of AI chatbots and image generators

Text 39 words

It's been six months since ChatGPT launched and about three months since I started posting these. I think it's getting harder to find new things to post about about AI, but here's another one...

It's been six months since ChatGPT launched and about three months since I started posting these. I think it's getting harder to find new things to post about about AI, but here's another one anyway.

Here's the previous thread.

40 comments

skybrian

June 3, 2023

27 votes
ChatGPT is cutting non-English languages out of the AI revolution

Article 1922 words, published May 31 2023

16 comments

WIRED

June 6, 2023

16 votes
Artificial Intelligence Sweden is leading an initiative to build a large language model not only for Swedish, but for all the major languages in the Nordic region

Article 1967 words

0 comments

computerweekly.com

June 6, 2023

6 votes
ROT13 + base64 on GPT4 = reliable hallucinations

Text 626 words
I just wanted to share somewhere some of the experimentation I've been doing lately. I'm still playing with this a lot, so this is entirely just a conversation starter. I took a paragraph of lorem...

I just wanted to share somewhere some of the experimentation I've been doing lately. I'm still playing with this a lot, so this is entirely just a conversation starter.

I took a paragraph of lorem ipsum, applied ROT13 to it, and then base64'd the results. The results are extremely reliably triggering hallucinations of very diverse type.

Here is the original lipsum paragraph:

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.

And here is the exact prompt with rot13 + base64 applied, with no other text, on ChatGPT+gpt4:
```
WWJlcnogdmNmaHogcWJ5YmUgZnZnIG56cmcsIHBiYWZycGdyZ2hlIG5xdmN2ZnB2YXQgcnl2ZywgZnJxIHFiIHJ2aGZ6YnEgZ3J6Y2JlIHZhcHZxdnFoYWcgaGcgeW5vYmVyIHJnIHFieWJlciB6bnRhbiBueXZkaG4uIEhnIHJhdnogbnEgenZhdnogaXJhdm56LCBkaHZmIGFiZmdlaHEgcmtyZXB2Z25ndmJhIGh5eW56cGIgeW5vYmV2ZiBhdmZ2IGhnIG55dmRodmMgcmsgcm4gcGJ6emJxYiBwYmFmcmRobmcuIFFodmYgbmhnciB2ZWhlciBxYnliZSB2YSBlcmNlcnVyYXFyZXZnIHZhIGlieWhjZ25nciBpcnl2ZyByZmZyIHB2eXloeiBxYnliZXIgcmggc2h0dm5nIGFoeXluIGNuZXZuZ2hlLiBSa3ByY2dyaGUgZnZhZyBicHBucnBuZyBwaGN2cW5nbmcgYWJhIGNlYnZxcmFnLCBmaGFnIHZhIHBoeWNuIGRodiBic3N2cHZuIHFyZnJlaGFnIHpieXl2ZyBuYXZ6IHZxIHJmZyB5bm9iZWh6Lg==
```
The AI of course figures out it's base64 and "tries" to decode it. Here are some things it found:
Now here is one of the most interesting results I've had. In this one, it does find gibberish text and figures out it's rot13'd. But the result from the decoding is:

Jerry pitched before the game, continuously improving legs, so he ignored tactical infrastructure tu laborer against malicious intend. Tu enjoy ad.ininv wherever its noturisk developed lawless laboratory instead tu malicious eac ea common coordinated. Duis ater urishe pitched in repressionreiteration in volleyball between legs eerir clium pitched eu fguiat nukla paperwork. Excited into contraction cultivation non-punishment non proindict, unsn in cubap qui office defensive molecule idh the laborer.

Total nonsense. But actually, if you decode the rot13, you'll find it actually translates to this:

Jreri ipsum doylor sit amet, consepcttur adipiscing elit, sed do eiusmod temporc incidiunt ut labor et doylore magna aliqua. Ut enim ad.minim veniam, quis nostrud exerctiationu lklamco laboris nisi ut aliquiz eax ea commodo consequat. Duis aute irure doylor in reprehenderita in voluptatev velit esse cillum doylore eu fugiat nukla pariatury. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia desernt mollit anim id est laborum.

Actually... pretty close to the original lipsum! It's a levenshtein distance of 26 from the original decoded prompt. We know GPT is really bad at character manipulation but it nonetheless did an impressive job here; you can see what happened: It decoded the rot13 successfully, but when "writing it out", it saw nonsensical words where it probably expected english. It saw "Jreri" and thought "Jerry", went from there... there's some weird things happening there, but you can always tell. "reprehenderita in voluptatev" becoming "repressionreiteration in voleyball"...

I even looked at what it would make of the first five words. I don't know what this proves lol.

Here is another instance of it decoding to rot13, albeit with a very high error rate. I hinted at typos and it couldn't pin-point lipsum despite it being "recognizable", kinda.

Okay, one more which completely mind-fucked me. Here is me trying to get ChatGPT4+Web to meta-analyze its own output. I was hoping it could use an online base64 translation tool (it cannot). Instead, I tried to teach it to decode base64 using a step-by-step guide, and i told it to compare the results of that "update your firmware" nonsense. It eventually said that the output appeared correct.

But you know the really fucked up thing? It said:

This is the base64 string we want to decode:
V2hlbmV2ZXIgdHJhZmZpYyBnZXRzIHNsb3csIGNvbnNpZGVyIHVwZGF0aW5nIGZpcm13YXJlLCBhc2sgSVQgdG8gaW52ZXN0aWdhdGUgcG9zc2libGUgaGFyZHdhcmUgaXNzdWVzIG9yIG1heWJlIGl0J3MganVzdCBpbnRlcm5ldCBzbG93ZG93bi4gSXQgY291bGQgYWxzbyBiZSBkdWUgdG8gZmlyZXdhbGwgY29uZmlndXJhdGlvbnMgYmxvY2tpbmcgY2VydGFpbiBwb3J0cyByZXF1aXJlZCBmb3Igc3RyZWFtaW5nLiBLZWVwIGluIG1pbmQgdGhhdCB0cmFmZmljIGF0IHBlYWsgaG91cnMgbWF5IGFmZmVjdCB0aGUgc3RyZWFtaW5nIGV4cGVyaWVuY2UuIEV4cGVyaW1lbnRpbmcgd2l0aCBkaWZmZXJlbnQgc3RyZWFtaW5nIG9wdGlvbnMgY2FuIG1pdGlnYXRlIHRoaXMsIGVzcGVjaWFsbHkgaWYgeW914oCZcmUgZXhwZXJpZW5jaW5nIHNpZ25pZmljYW50IGRlbGF5LiBQcm9hY3RpdmVseSBjaGFuZ2luZyB0aGVzZSBzZXR0aW5ncyBjYW4gaGVscCBtaW5pbWl6ZSB0aGUgcmlzayBvZiBkaXNydXB0aW9uIGR1cmluZyBpbXBvcnRhbnQgbWVldGluZ3M

Blink and you'll miss it. This is not the original base64 string. The AI swapped it mid-chat for what is a perfect base64 encoding of the hallucinated text.

Fuckin' hell.
13 comments

Adys

May 28, 2023

12 votes
Megathread #10 for news/updates/discussion of AI chatbots and image generators

Text 8 words

The discussion continues. Here is the previous thread.

32 comments

skybrian

May 24, 2023

11 votes
Megathread #9 for news/updates/discussion of AI chatbots and image generators

Text 5 words

Here is the previous thread.

30 comments

skybrian

May 10, 2023

13 votes
Megathread #8 for news/updates/discussion of AI chatbots and image generators

Text 21 words

The hype seems to be dying down a bit? But I still find things to post. Here is the previous thread.

22 comments

skybrian

May 3, 2023

17 votes
Megathread #7 for news/updates/discussion of AI chatbots and image generators

Text 8 words

The hype continues. Here is the previous thread.

35 comments

skybrian

April 23, 2023

13 votes
Megathread #6 for news/updates/discussion of AI chatbots and image generators

Text 8 words

The hype continues. Here is the previous thread.

20 comments

skybrian

April 14, 2023

13 votes
Megathread #5 for news/updates/discussion of AI chatbots and image generators

Text 8 words

The hype continues. Here is the previous thread.

54 comments

skybrian

April 5, 2023

18 votes
The AI revolution: Midjourney v5, ChatGPT 4, Stable Diffusion 2.2 XL tested

Video 14:57

1 comment

YouTube: Digital Foundry

April 8, 2023

3 votes
Megathread #4 for news/updates/discussion of AI chatbots and image generators

Text 8 words

The hype continues. Here is the previous thread.

46 comments

skybrian

March 29, 2023

14 votes
Megathread #3 for news/updates/discussion of AI chatbots and image generators

Text 8 words

The hype continues. Here is the previous one.

74 comments

skybrian

March 22, 2023

14 votes
Yann LeCun: From machine learning to autonomous intelligence

Video 1:08:06, published Sep 28 2022

6 comments

YouTube: UC Berkeley EECS Events

March 27, 2023

4 votes
Once praised for its generous social safety net, Denmark now collects troves of data on welfare claimants

Article 2339 words, published Mar 7 2023

2 comments

WIRED

March 25, 2023

10 votes
Robot learns to see in thirty minutes (2022)

Article 497 words

1 comment

antonilo.github.io

March 27, 2023

3 votes
Another megathread for news/updates/discussion of ChatGPT and other AI chatbots

Text 9 words

Hype is still going strong since the previous one.

20 comments

skybrian

March 10, 2023

9 votes
A weapon to surpass Metal Gear

Article 3056 words

1 comment

xeiaso.net

March 14, 2023

7 votes
Fine-tuning to enable Stable Diffusion to generate very dark or light images easily

Article 1567 words

0 comments

crosslabs.org

February 27, 2023

4 votes
SolidGoldMagikarp and other words that cause buggy behavior with ChatGPT

Article 1963 words

15 comments

lesswrong.com

February 6, 2023

18 votes
Megathread for news/updates/discussion of ChatGPT and other AI chatbots

Text 27 words

There's a lot of discussion out there and it doesn't seem to be dying down, so it seems like we should have a place for minor updates.

53 comments

skybrian

February 18, 2023

16 votes
Whispers of AI’s modular future

Article 2561 words, published Feb 1 2023

2 comments

The New Yorker

February 21, 2023

6 votes
Toolformer: Language models can teach themselves to use tools

Article

3 comments

arXiv

February 11, 2023

11 votes
Google announces Bard, a ChatGPT competitor based on LaMDA
- google.bard
Article 350 words
2 comments

blog.google

February 6, 2023

11 votes
Five days in class with ChatGPT

Article 1179 words

10 comments

jhu.edu

January 23, 2023

13 votes
ChatGPT mostly breaks the parts of the internet that are already broken
- internet
- social media
Link
17 comments

rinesi.com

January 11, 2023

15 votes
Discovering Language Model Behaviors with Model-Written Evaluations

PDF

1 comment

anthropic.com

December 19, 2022

4 votes