• Activity
  • Votes
  • Comments
  • New
  • All activity
    1. What's an atypical thing you do that you'd recommend to others?

      You do it, and it's against the grain -- outside the norm. But you like it, or think it's worthwhile. In fact, you'd recommend that more people do it, so that it can shift the grain or become the...

      You do it, and it's against the grain -- outside the norm.

      But you like it, or think it's worthwhile.

      In fact, you'd recommend that more people do it, so that it can shift the grain or become the norm.

      What is it, and why do you recommend it?

      65 votes
    2. The ARC-AGI-2 benchmark could help reframe the conversation about AI performance in a more constructive way

      The popular online discourse on Large Language Models’ (LLMs’) capabilities is often polarized in a way I find annoying and tiresome. On one end of the spectrum, there is nearly complete dismissal...

      The popular online discourse on Large Language Models’ (LLMs’) capabilities is often polarized in a way I find annoying and tiresome.

      On one end of the spectrum, there is nearly complete dismissal of LLMs: an LLM is just a slightly fancier version of the autocomplete on your phone’s keyboard, there’s nothing to see here, move on (dot org).

      This dismissive perspective overlooks some genuinely interesting novel capabilities of LLMs. For example, I can come up with a new joke and ask ChatGPT to explain why it’s funny or come up with a new reasoning problem and ask ChatGPT to solve it. My phone’s keyboard can’t do that.

      On the other end of the spectrum, there are eschatological predictions: human-level or superhuman artificial general intelligence (AGI) will likely be developed within 10 years or even within 5 years, and skepticism toward such predictions is “AI denialism”, analogous to climate change denial. Just listen to the experts!

      There are inconvenient facts for this narrative, such as that the majority of AI experts give much more conservative timelines for AGI when asked in surveys and disagree with the idea that scaling up LLMs could lead to AGI.

      The ARC Prize is an attempt by prominent AI researcher François Chollet (with help from Mike Knoop, who apparently does AI stuff at Zapier) to introduce some scientific rigour into the conversation. There is a monetary prize for open source AI systems that can perform well on a benchmark called ARC-AGI-2, which recently superseded the ARC-AGI benchmark. (“ARC” stands for “Abstract and Reasoning Corpus”.)

      ARC-AGI-2 is not a test of whether an AI is an AGI or not. It’s intended to test whether AI systems are making incremental progress toward AGI. The tasks the AI is asked to complete are colour-coded visual puzzles like you might find in a tricky puzzle game. (Example.) The intention is to design tasks that are easy for humans to solve and hard for AI to solve.

      The current frontier AI models score less than 5% on ARC-AGI-2. Humans score 60% on average and 100% of tasks have been solved by at least two humans in two attempts or less.

      For me, this helps the conversation about AI capabilities because it gives a rigorous test and quantitative measure to my casual, subjective observations that LLMs routinely fail at tasks that are easy for humans.

      François Chollet was impressed when OpenAI’s o3 model scored 75.7% on ARC-AGI (the older version of the benchmark). He emphasizes the concept of “fluid intelligence”, which he seems to define as the ability to adapt to new situations and solve novel problems. Chollet thinks that o3 is the first AI system to demonstrate fluid intelligence, although it’s still a low level of fluid intelligence. (o3 also required thousands of dollars’ worth of computation to achieve this result.)

      This is the sort of distinction that can’t be teased out by the polarized popular discourse. It’s the sort of nuanced analysis I’ve been seeking out, but which has been drowned out by extreme positions on LLMs that ignore inconvenient facts.

      I would like to see more benchmarks that try to do what AGI-AGI-2 does: find problems that humans can easily solve and frontier AI models can’t solve. These sort of benchmarks can help us measure AGI progress much more usefully than the typical benchmarks, which play to LLMs’ strengths (e.g. massive-scale memorization) and don’t challenge them on their weaknesses (e.g. reasoning).

      I long to see AGI within my lifetime. But the super short timeframes given by some people in the AI industry feel to me like they border on mania or psychosis. The discussion is unrigorous, with people pulling numbers out of thin air based on gut feeling.

      It’s clear that there are many things humans are good at doing that AI can’t do at all (where the humans vs. AI success rate is ~100% vs. ~0%). It serves no constructive purpose to ignore this truth and it may serve AI research to develop rigorous benchmarks around it.

      Such benchmarks will at least improve the quality of discussion around AI capabilities, insofar as people pay attention to them.


      Update (2024-04-11 at 19:16 UTC): François Chollet has a new 20-minute talk on YouTube that I recommend. I've watched a few videos of Chollet talking about ARC-AGI or ARC-AGI-2, and this one is beautifully succinct: https://www.youtube.com/watch?v=TWHezX43I-4

      10 votes
    3. Do you have games that you play (almost) exclusively?

      I was reading the recent post about strategy games, and I'm still astonished to see for how many hours (at least hundreds, often 1000+) people are playing these. I'm guessing that in these cases,...

      I was reading the recent post about strategy games, and I'm still astonished to see for how many hours (at least hundreds, often 1000+) people are playing these. I'm guessing that in these cases, all your gaming time is exclusively taken by that single game.

      So, do you have (or did you have) games, or series, like that? Do you play solo or multi? What compels you to spend so much time on a single game? How do you feel about it?

      39 votes
    4. Fitness Weekly Discussion

      What have you been doing lately for your own fitness? Try out any new programs or exercises? Have any questions for others about your training? Want to vent about poor behavior in the gym? Started...

      What have you been doing lately for your own fitness? Try out any new programs or exercises? Have any questions for others about your training? Want to vent about poor behavior in the gym? Started a new diet or have a new recipe you want to share? Anything else health and wellness related?

      2 votes
    5. Introductions | March 2025

      The previous introductions thread was waaaaay back in June of 2023, figured it might be time for a new one, eh? This is a place for new and existing users to post an introduction with a few fun...

      The previous introductions thread was waaaaay back in June of 2023, figured it might be time for a new one, eh?

      This is a place for new and existing users to post an introduction with a few fun facts about themselves. You will find the post box at the bottom the page. Maybe say "Hi!" to someone else you see while scrolling down?

      If you like, you can also write something about yourself in your profile. See "Edit your user bio" on the settings page. Anyone who clicks on your username will see it in your profile. (It appears on the right side of the page.)

      You can find out more about how to use Tildes in this topic: "New users: Ask your questions about Tildes here!.

      Some sample questions you *could* answer (but not required!)
      • How long have you been on Tildes? How did you find out about us?
      • How did you choose your username?
      • What are your interests? (This could be music, tech, art, video games, board games, books, anything!)
      • A/S/L (the standard old school intro for an old school kind of forum, but not required!, we value our pseudonymity around here!)
        • for those born post 1998: age/(sex|gender|identifier|pronouns)/location
          • Example: 27/nb trans woman (she/her)/USA or 54/M/USA or 907/Timelord/Gallifrey
        • You don't have to follow the structure, or include it at all!
      • What do you do? This could be in your spare time, for work, your passions.
      • Do you want other users to PM you from this thread?
      • Give us a fun fact or link, if there is anything to know about tilderinos, we value knowledge sharing!
      Here is a template if you need something to kickstart your intro
      **How long have you been on Tildes? How did you find out about us?**
      
      **How did you choose your username?** 
      
      **What are your interests?** 
      
      **A/S/L (age/(gender|pronouns|identifier)/location)**
      
      **What do you do? This could be in your spare time, for work, your passions.**
      
      **Do you want other users to PM/DM you from this thread?**
      
      **Give us a fun fact (or a link!)! If there is anything to know about tilderinos, it's that we value knowledge sharing!**
      
      
      49 votes
    6. Humble Choice - April 2025

      April 2025's Humble Choice is now available with the following eight Steam games. Steam Page Opencritic Steam Recent/All Operating Systems Steam Deck ProtonDB Tomb Raider I-III Remastered 76 90 /...

      April 2025's Humble Choice is now available with the following eight Steam games.

      Steam Page Opencritic Steam Recent/All Operating Systems Steam Deck ProtonDB
      Tomb Raider I-III Remastered 76 90 / 85 Win ✅ Verified 🎖️ Platinum
      DREDGE 82 96 / 95 Win, Mac ✅ Verified 🎖️ Platinum
      Aliens: Dark Descent 89 86 / 88 Win ❌ Unsupported 🎖️ Platinum
      1000xRESIST 89 97 / 96 Win 🟨 Playable 🎖️ Platinum
      Nova Lands 80 84 / 91 Win ✅ Verified 🎖️ Platinum
      Diplomacy is Not an Option -- 85 / 85 Win 🟨 Playable 🎖️ Platinum
      Distant Worlds 2 81 -- / 88 Win ❌ Unsupported ⬜ Silver
      Nomad Survival -- 88 / 95 Win ✅ Verified 🟨 Gold

      Does anyone have experience with any of the games and, if so, would you recommend them? Is there anything in here that you're particularly excited to play?

      14 votes
    7. What programming/technical projects have you been working on?

      This is a recurring post to discuss programming or other technical projects that we've been working on. Tell us about one of your recent projects, either at work or personal projects. What's...

      This is a recurring post to discuss programming or other technical projects that we've been working on. Tell us about one of your recent projects, either at work or personal projects. What's interesting about it? Are you having trouble with anything?

      7 votes
    8. Request recommendation: temp controlled teapot

      Hello, as the title implies, I am looking for a temp controlled teapot. I own an ember mug and have to say, it is one of my favorite purchases/splurges. For those that do not want to click on the...

      Hello, as the title implies, I am looking for a temp controlled teapot.

      I own an ember mug and have to say, it is one of my favorite purchases/splurges. For those that do not want to click on the link, this is a temperature controlled mug that holds a hot temp for liquids in a mug wirelessly. It does this with quite good precision IMO, have not whipped out a thermometer to check accuracy.

      I have been looking for some time for a teapot that could do the same. The requirements I have are that no plastic or otherwise health adverse heat volatile materials come into contact with the heated water, and I would like it to be a teapot, because I like the ritual of pouring the tea out of the pot. I want to be able to steep and pour the tea from the same temp controlled vessel(I don't mind heating water and then pouring it into the temp controlled teapot), so I'm not looking for a temp controlled kettle, necessarily.

      I think last time I looked I ended up on pages showing things like this set up, which I'm open to, but would like some guidance or reviews, if possible.

      Thank you!

      Bonus, what are your favorite mint teas or other teas that have a sensory experience outside of temp?

      EDIT: On the same website I linked, I forgot to direct people here. Has anyone ever used a teapot warmer like the ones on that website?

      14 votes
    9. Megathread: April Fools' Day 2025 on the internet

      Over the next day or so, the internet will be filled with jokes, pranks, fake "announcements" from companies, fun interactive activities, games, and so on. A lot of these can be quite clever and...

      Over the next day or so, the internet will be filled with jokes, pranks, fake "announcements" from companies, fun interactive activities, games, and so on. A lot of these can be quite clever and interesting so I think posting about them in general is fine, but in the interest of preventing them from completely taking over Tildes, let's try to keep as many of them restricted to this thread as possible. Ideally, a separate top-level comment for each individual item would be good.

      If something particularly discussion-worthy comes up (like an ARG or activity that a lot of people want to talk about), a separate thread is reasonable, but please make sure it has the "april fools day" tag. That way, if anyone wants to avoid seeing the April Fools' Day threads, they can use the topic tag filters and filter that tag out.

      I'm going to use the "official" styling for this topic (that's usually only for ~tildes.official topics) to make it stand out more to try to encourage people to notice it. If you notice people making individual topics for April Fools' Day things that don't really warrant their own topic, please (nicely) encourage them to delete and post in here instead.

      109 votes
    10. Midweek Movie Free Talk

      Warning: this post may contain spoilers

      Have you watched any movies recently you want to discuss? Any films you want to recommend or are hyped about? Feel free to discuss anything here.

      Please just try to provide fair warning of spoilers if you can.

      9 votes
    11. You get to choose your favorite director's next project. What is it, and why?

      I think about this from time to time. Good directors often switch gears. They abandon beloved themes, franchises, or entire genres. Sometimes they become producers for the series they initiated...

      I think about this from time to time. Good directors often switch gears. They abandon beloved themes, franchises, or entire genres. Sometimes they become producers for the series they initiated and the quality takes a dip. In any event, for whatever reason, talented movie directors sometimes decide to dedicate their careers to something we are not as fond of. Perhaps you read a book that would be perfect for them! This post is an opportunity to put them on the right track! (in our imaginations at least...)

      27 votes
    12. Hey parents, how many of you read vs. tell stories before bedtime for your kids?

      My son loves reading time before bed, but he’s only 3.5 so the books have mostly been picture books until now. Lately though he’s been getting more into stories with plots and an extended...

      My son loves reading time before bed, but he’s only 3.5 so the books have mostly been picture books until now. Lately though he’s been getting more into stories with plots and an extended narrative, but entirely in the form of movies. There aren’t a lot of kid’s books to go around with the sorts of dramatic stories he likes, they’re more like “caterpillar eats food” and “train engine climbs a hill with grit and determination” type stuff. And whenever I’ve tried to have him just lay down and listen to me read a story without any pictures to stare at he has absolutely no interest. He really likes having pretty visuals to look at.

      I know when I was a small child these sorts of board/picture books weren’t really a thing in India. The pre-sleep ritual was usually “storytime” instead, where my parents would tell us stories. I’m a little bit concerned that my kid has been so accustomed to always having visual cues presented to him that it’s stunting his imagination a bit, like failing to exercise his capacity to visualize ideas and concepts for himself without being anchored by some artist’s depiction.

      So I’m curious to hear from other parents or caregivers/educators (@kfwyre?). Did you find there was a natural transition point between going from picture books to telling/reading stories? Was there any sort of work you had to do to enable it? Are there “exercises” I can work on to help my son exercise his imagination? I have been working with him to have him tell me stories about his day, which he does pretty well. But his stories are always quite grounded and he’s usually telling me what he’s actually done and seen. When my nephews and nieces were his age they tended to spin out a lot of random stories that pretty obviously did not happen, and I assume this is because they had more experience being told stories themselves rather than just factual reporting about the happenings around them.

      25 votes
    13. Graduated and moving to an apartment in a major city in the US, advice/tips?

      I graduated in the spring and spent the last few months at home with family. I'm hoping to find an apartment/job in a major US city working as a legal assistant or paralegal. I've never rented...

      I graduated in the spring and spent the last few months at home with family. I'm hoping to find an apartment/job in a major US city working as a legal assistant or paralegal.

      I've never rented before, so I have questions, but due to circumstances related to the ones pushing me away from staying at home any longer, I don't have any parents or older siblings to ask for help, so I'm hoping to crowdsource wisdom here. I have specific questions, but also happy to hear any general advice for someone renting an apartment for the first time.

      Some relevant context:

      • Studio apartment is nonnegotiable. I don't know anyone I trust enough to be my roommate. If that weren't enough, I'm one of the only people still wearing a mask and trying not to get COVID, so it's gotta be a studio for me to be able to relax.
      • I'm looking for a walkable neighborhood and good public transit in addition to affordability, so really looking at Chicago and Philly right now.
      • I love to cook, but I know studios often have really lackluster kitchens. Hoping to find one with a usable amount of counterspace.

      Those questions I had:

      1. If you offer to pay a landlord the cost of the lease in full, is it typical for the landlord to waive the requirement that you provide proof of employment, or if you asked them to waive that, would they be likely to say yes? I would really like to take some time off, and coming home was intended to be that, but toxic family means I've just been stressed the whole time, so being able to get a place without needing a job right off the bat would be a fantastic setup to make sure this new start goes well. (If I save up part-time earnings for the next 2-3 months, I'll have enough to pay a lease on a $1200 studio in full, furnish it with the necessities, and feed myself.)

      2. Recently I realized that when apartments are listed as unfurnished, that means no bed or mattress either. (I kind of see now why futons are a thing.) How do people usually address this issue? Do you buy a mattress and bed frame with the expectation that you'll take it with you whenever you move out to the next apartment?

      3. Since I don't live in the city I'll be moving to, I most likely won't be able to see the apartment in person before committing. Any particular ideas on how I could handle this or what I should be wary of? I've heard of services where you can pay someone to show up to a tour and video call you so you can follow along (https://www.gandertour.com/). $50 doesn't seem too steep, but I don't know if that specific service is trustworthy, or if there's a cheaper or simpler way to deal with this.

      Those are my questions, and like I said above, any advice not 100% related to these questions is welcome too!

      38 votes