Topics in ~tech

Activity

Votes

Comments

New

All activity

Showing only topics in ~tech with the tag "language models.large". Back to normal view / Search all groups

Animals versus ghosts

Article 1776 words, published Oct 1 2025

3 comments

bearblog.dev

11 hours ago

4 votes
GPT-5 has come a long way in mathematics

Article 3904 words

24 comments

ritchot.me

November 23

21 votes
LLMs are bullshitters. But that doesn't mean they're not useful.

Article 1919 words, published Nov 19 2025

3 comments

kagi.com

November 23

19 votes
Is trying to become an author insane in times of LLMs?

Ask

A simple question. I know LLMs are currently not a replacement for authors. Will that remain true in 5 to 10 years? EDIT: No. I never expected to earn a living either mostly or exclusively by...

A simple question. I know LLMs are currently not a replacement for authors. Will that remain true in 5 to 10 years?

EDIT: No. I never expected to earn a living either mostly or exclusively by selling books. There are however many "side gigs" in my country that can greatly benefit from being published by a real company. Ultimately though, I'm not in it primarily for the money. But I wonder what the future holds for fiction as a whole.

16 comments

lou

November 16

21 votes
The worlds on fire. So lets just make AI porn.

Article 5247 words

21 comments

itstoday.site

November 20

23 votes
Part of me wishes it wasn't true but: AI coding is legit

Ask

I stay current on tech for both personal and professional reasons but I also really hate hype. As a result I've been skeptical of AI claims throughout the historic hype cycle we're currently in....

I stay current on tech for both personal and professional reasons but I also really hate hype. As a result I've been skeptical of AI claims throughout the historic hype cycle we're currently in. Note that I'm using AI here as shorthand for frontier LLMs.

So I'm sort of a late adopter when it comes to LLMs. At each new generation of models I've spent enough time playing with them to feel like I understand where the technology is and can speak about its viability for different applications. But I haven't really incorporated it into my own work/life in any serious way.

That changed recently when I decided to lean all the way in to agent assisted coding for a project after getting some impressive boilerplate out of one of the leading models (I don't remember which one). That AI can do a competent job on basic coding tasks like writing boilerplate code is nothing new, and that wasn't the part that impressed me. What impressed me was the process, especially the degree to which it modified its behavior in practical ways based on feedback. In previous tests it was a lot harder to get the model to go against patterns that featured heavily in the training data, and then get it to stay true to the new patterns for the rest of the session. That's not true anymore.

Long story short, add me to the long list of people whose minds have been blown by coding agents. You can find plenty of articles and posts about what that process looks like so I won't rehash all the details. I'll only say that the comparisons to having your own dedicated junior or intern who is at once highly educated and dumb are apt. Maybe an even better comparison would be to having a team of tireless, emotionless, junior developers willing to respond to your requests at warp speed 24/7 for the price of 1/100th of one developer. You need the team comparison to capture the speed.

You've probably read, or experienced, that AI is good at basic tasks, boilerplate, writing tests, finding bugs and so on. And that it gets progressively worse as things get more complicated and the LoCs start to stack up. That's all true but one part that has changed, in more recent models, is the definition of "basic".

The bit that's difficult to articulate, and I think leads to the "having a nearly free assistant" comparisons, is what it feels like to have AI as a coding companion. I'm not going to try to capture it here, I'll just say it's remarkable.

The usual caveats apply, if you rely on agents to do extensive coding, or handle complex problems, you'll end up regretting it unless you go over every line with a magnifying glass. They will cheerfully introduce subtle bugs that are hard to catch and harder to fix when you finally do stumble across them. And that's assuming they can do the thing you're asking then to do at all. Beyond the basics they still abjectly fail a lot of the time. They'll write humorously bad code, they'll break unrelated code for no apparent reason, they'll freak out and get stuck in loops (that one suprised me in 2025). We're still a long way from agents that can actually write software on their own, despite the hype.

But wow, it's liberating to have an assistant that can do 100's of basic tasks you'd rather not be distracted by, answer questions accurately and knowledgeably, scan and report clearly about code, find bugs you might have missed and otherwise soften the edges of countless engineering pain points. And brainstorming! A pseudo-intelligent partner with an incomprehensibly wide knowledge base and unparalled pattern matching abilities is guaranteed to surface things you wouldn't have considered.

AI coding agents are no joke.

I still agree with the perspectives of many skeptics. Execs and middle managers are still out of their minds when they convince themselves that they can fire 90% of their teams and just have a few seniors do all the work with AI. I will read gleefully about the failures of that strategy over the coming months and years. The failure of their short sightedness and the cost to their organizations won't make up for the human cost of their decisions, but at least there will be consequences.

When it comes to AI in general I have all the mixed feelings. As an artist, I feel the weight of what AI is doing, and will do, to creative work. As a human I'm concerned about AI becoming another tool to funnel ever more wealth to the top. I'm concerned about it ruining the livelihoods of huge swaths of people living in places where there aren't systems that can handle the load of taking care of them. Or aren't even really designed to try. There are a lot of legitimate dystopian outcomes to be worried about.

Despite all that, actually using the technology is pretty exciting, which is the ultimate point of this post: What's your experience? Are you using agents for coding in practical ways? What works and what doesn't? What's your setup? What does it feel like? What do you love/hate about it?

67 comments

post_below

November 16

50 votes
Duck Duck Go search AI curiously cited Tildes

Ask

I was trying to find out why Lidarr wasn't matching my copy of The Cure's Greatest Hits. Found out I've got some bootleg Russian release that's catalogued on discogs (I eventually found the...

I was trying to find out why Lidarr wasn't matching my copy of The Cure's Greatest Hits. Found out I've got some bootleg Russian release that's catalogued on discogs (I eventually found the musicbrainz release and updated my profile to include bootlegs). So I search "Lidarr use specific discogs release" and the duck duck go search assist spat out some text about Lidarr not using discogs and cited this Tildes post.

It's curious because that post is 3yrs old and doesn't talk about discogs integration in Lidarr, just one mention of discogs in the post and some folks talking about Lidarr in the comments (It did cite a relevant GitHub issue about it though). The AI response mentioned that some users track new releases with Lidarr and downloads disabled, while covered in the post, it seems fairly tangential to my query.

I'm curious why it decided to check or cite a tildes post. No tildes posts came up in the first couple pages of search results. I use tildes from the same location, though on my phone where this query was on my desktop, and have done a couple DDG queries using "site:tildes.net" on my phone.

Has anyone else seen a search assist cite an unexpected site? Not unexpected as in irrelevant, that's all too common, but small and specific sources.

7 comments

Carrow

November 15

29 votes
How has AI positively impacted your life?

Ask (survey)

I've been trying to get a more rounded understanding of the impacts that "AI" has had since ChatGPT went viral back in 2022. I've found it easy to gather a list of negative impacts, but have...

I've been trying to get a more rounded understanding of the impacts that "AI" has had since ChatGPT went viral back in 2022.

I've found it easy to gather a list of negative impacts, but have struggled to point to many positives.

I was curious if there were folks who have used any of these AI tools, and would willing to share any positive impacts those tools have had in their lives. I'm particularly interested in the text, audio, image, and video generation tools that have appeared since ChatGPT went viral, but please share anything else that you think fits.

80 comments

zoroa

November 10

50 votes
Researchers isolate memorization from problem-solving in AI neural networks

Article 1447 words

0 comments

Ars Technica

November 12

12 votes
Anthropic to bring its AI to hundreds of teachers in Iceland with pilot scheme – aim of helping them with lesson planning, classroom materials, and administrative work

Article 414 words

2 comments

euronews.com

November 6

7 votes
Signs of introspection in large language models

Article 3454 words

17 comments

anthropic.com

October 31

28 votes
Who’s making these AI copies of my work?

Video 18:43

3 comments

YouTube: Christophe

October 30

17 votes
AI slop is killing our channel
- social media
Video 12:14
8 comments

YouTube: Kurzgesagt – In a Nutshell

October 8

36 votes
Why do LLMs freak out over the seahorse emoji?

Article 1542 words

32 comments

vgel.me

October 6

50 votes
Merriam-Webster has unveiled their latest and greatest LLM to date

Link

10 comments

bsky.app

October 3

67 votes
Why language models hallucinate

Link

21 comments

openai.com

September 6

27 votes
Is it possible to easily finetune an LLM for free?

Ask (advice)

so Google's AI Studio used to have an option to finetune gemini flash for free by simply uploading a csv file. but it seems they have removed that option, so I'm looking for something similar. I...

so Google's AI Studio used to have an option to finetune gemini flash for free by simply uploading a csv file. but it seems they have removed that option, so I'm looking for something similar. I know models can be finetuned on colab but the problem with that is it's way too complicated for me, I want something simpler. I think I know enough python to be able to prepare a dataset so that shouldn't be a problem.

7 comments

cuteFox

August 28

21 votes
Deep Think with Confidence

Link

3 comments

jiaweizzhao.github.io

August 24

9 votes
AI tokens are getting more expensive

Article 1988 words

3 comments

Substack: Ethan Ding

August 20

10 votes
Claude Opus 4 and 4.1 can now end a rare subset of conversations

Article 504 words

17 comments

anthropic.com

August 15

15 votes
Social media probably can’t be fixed
- social media
- internet
Article 3458 words
34 comments

Ars Technica

August 13

38 votes
Evaluating GPT5's reasoning ability using the Only Connect game show

Article 618 words

9 comments

ingram.tech

August 13

18 votes
Is chain-of-thought reasoning of LLMs a mirage? A data distribution lens.

Article

8 comments

arXiv

August 10

28 votes
Reddit will block the Internet Archive
- internet
- social media
Article 682 words
26 comments

The Verge

August 11

58 votes
Question - how would you best explain how an LLM functions to someone who has never taken a statistics class?

Ask

My understanding of how large language models work is rooted in my knowledge of statistics. However a significant number of people have never been to college and statistics is a required course...

My understanding of how large language models work is rooted in my knowledge of statistics. However a significant number of people have never been to college and statistics is a required course only for some degree programs.

How should chatgpt etc be explained to the public at large to avoid the worst problems that are emerging from widespread use?

39 comments

boxer_dogs_dance

August 9

37 votes
AI industry horrified to face largest copyright class action ever certified

Article 422 words

39 comments

Ars Technica

August 9

63 votes
The great LLM scrape
- internet
Article 766 words, published Mar 26 2025
4 comments

bearblog.dev

August 4

24 votes
Persona vectors: monitoring and controlling character traits in language models

Article 1381 words

1 comment

anthropic.com

August 2

13 votes
OpenAI can rehabilitate AI models that develop a “bad boy persona”

Article 990 words

5 comments

MIT Technology Review

June 18

14 votes
The future of forums is lies, I guess
- internet
- social media
Article 1559 words
49 comments

aphyr.com

July 8

63 votes
No, of course I can! Refusal mechanisms can be exploited using harmless fine-tuning data.
- security
Article published Feb 14 2025
1 comment

arXiv

July 14

9 votes
AI coding tools make developers slower but they think they're faster, study finds

Article 724 words

11 comments

theregister.com

July 13

40 votes
Pay up or stop scraping: Cloudflare program charges bots for each crawl
- internet
Article 319 words
13 comments

Ars Technica

July 1

46 votes
Cats confuse reasoning LLM: Query-agnostic adversarial triggers for reasoning models

Article published Mar 4 2025

5 comments

arXiv

July 4

24 votes
TikTok is being flooded with racist AI videos generated by Google’s Veo 3
- google
- social media
Article 364 words
25 comments

Ars Technica

July 3

35 votes
Your brain on ChatGPT: Accumulation of cognitive debt when using an AI assistant for essay writing task

Article 875 words

22 comments

mit.edu

June 20

54 votes
Echo Chamber: A context-poisoning jailbreak that bypasses LLM guardrails

Article 1820 words

10 comments

neuraltrust.ai

June 24

34 votes
Is pop culture a form of "model collapse?"

Ask
Disclaimer: I do not like LLMs. I am not going to fight you on if you say LLMs are shit. One of the things I find interesting about conversations on LLMs is when have a critique about them, and...

Disclaimer: I do not like LLMs. I am not going to fight you on if you say LLMs are shit.

One of the things I find interesting about conversations on LLMs is when have a critique about them, and someone says, "Well, it's no different than people." People are only as good as their training data, people misremember / misspeak / make mistakes all the time, people will listen to you and affirm you as you think terrible things. My thought is that not being reliably consistent is a verifiable issue for automation. Still, I think it's excellent food for thought.

I was looking for new music venues the other day. I happened upon several, and as I looked at their menu and layout, it occurred to me that I had eaten there before. Not there, but in my city, and in others. The Stylish-Expensive-Small-Plates-Record-Bar was an international phenomenon. And more than that, I couldn't help but shake that it was a perversion of the original, alluring concept-- to be in a somewhat secretive record bar in Tokyo where you'll be glared into the ground if you speak over the music.

It's not a bad idea. And what's wrong with evoking a good idea, especially if the similarity is just unintentional? Isn't it helpful to be able to signal to people that you're like-that-thing instead of having to explain to people how you're different? Still, the idea of going just made me assume it'd be not simply like something I had experienced before, but played out and "fake." We're not in Tokyo, and people do talk over the music. And even if they didn't, they have silverware and such clanging. It makes me wonder if this permutation is a lossy estimation of the original concept, just chewed up, spat out, slurped, regurgitated, and expensively funded.

other forms of conceptual perversion:
- Matters of Body Image - is it a sort of collapse when we go from wanting 'conventional beauty' to frankensteining features onto ourselves? Think fox eye surgeries, buccal fat removal, etc. Rather than wanting to be conventionally attractive, we aim for the related concept of looking like people who are famous.
- (still thinking)
13 comments

vingtcinqunvingtcinq

June 19

15 votes
Disney files landmark case against AI image generator

Video 21:59

3 comments

YouTube: LegalEagle

June 21

16 votes
The Common Pile v0.1: An 8TB dataset of public domain and openly licensed text

Article 432 words

19 comments

arXiv

June 10

26 votes
Six-month-old, solo-owned vibe coder Base44 sells to Wix for $80M cash

Article 608 words

4 comments

TechCrunch

June 20

13 votes
Is the AI bubble about to burst?
- microsoft
- google
Article 2879 words
42 comments

versobooks.com

June 16

35 votes
OpenAI featured chatbot is pushing extreme surgeries to “subhuman” men

Article 1364 words, published May 31 2025

42 comments

citationneeded.news

June 6

35 votes
LLMs and privacy
- privacy
Ask
Hello to everyone who's reading this post :) Now LLMs are increasingly so useful (of course after careful review of their generated answers), but I'm concerned about sharing my data, especially...

Hello to everyone who's reading this post :)

Now LLMs are increasingly so useful (of course after careful review of their generated answers), but I'm concerned about sharing my data, especially very personal questions and my thought process to these large tech giants who seem to be rather sketchy in terms of their privacy policy.

What are some ways I can keep my data private but still harness this amazing LLM technology? Also what are some legitimate and active forums for discussions on this topic? I have looked at reddit but haven't found it genuinely useful or trustworthy so far.

I am excited to hear your thoughts on this!

34 comments

hexagonsun

June 4

33 votes
Which translation tools are LLM free? Will they remain LLM free?

Ask

Looking at the submission rules for Clarkesworld Magazine, I found the following: Statement on the Use of “AI” writing tools such as ChatGPT We will not consider any submissions translated,...

Looking at the submission rules for Clarkesworld Magazine, I found the following:

Statement on the Use of “AI” writing tools such as ChatGPT

We will not consider any submissions translated, written, developed, or assisted by these tools. Attempting to submit these works may result in being banned from submitting works in the future.

EDIT: I assume that Clarkesworld means a popular, non-technical understanding of AI meaning post-chatGPT LLMs specifically and not a broader definition of AI that is more academic or pertinent the computer science field.

I imagine that other magazines and website have similar rules. As someone who does not write directly in English, that is concerning. I have never translated without assistance in my life. In the past I used both Google Translate and Google Translator Toolkit (which no longer exist).

Of course, no machine translation is perfect, that was only a first pass that I would change, adapt and fix extensively and intensely. In the past I have used the built-in translation feature from Google Docs. However, now that Gemini is integrated in Google Docs, I suspected that it uses AI instead for translation. So I asked Gemini, and it said that it does. I am not sure if Gemini is correct, but, if it doesn't use AI now it probably will in the future.

That poses a problem for me, since, in the event that I wish to submit a story to English speaking magazines or websites, I will have to find a tool that is guaranteed to be dumb. I am sure they exist, but for how long? Will I be forced to translate my stories like a cave men? Is anyone concerned with keeping non-AI translation tools available, relevant, and updated? How can I even be sure that a translation tool does not use AI?

71 comments

lou

June 2

28 votes
Duolingo is replacing human workers with AI

Article 366 words, published Apr 30 2025

34 comments

VICE

May 28

34 votes
Large Language Models are more persuasive than incentivized human persuaders

Article 459 words

2 comments

arXiv

May 26

14 votes
Some ChatGPT users are developing delusional beliefs that are reinforced by the large language model

Article 603 words

73 comments

futurism.com

May 7

53 votes
When ChatGPT broke an entire field: An oral history

Article 5324 words

0 comments

Quanta Magazine

May 1

14 votes
State Bar of California admits it used AI to develop exam questions, triggering new furor

Article 1583 words

4 comments

Los Angeles Times

April 24

25 votes