skybrian's recent activity

  1. Comment on Sycophantic AI decreases prosocial intentions and promotes dependence in ~tech

    skybrian
    Link Parent
    When OpenAPI released GPT-5 in August last year, they claimed they were "minimizing sycophancy". A week later, they announced that in response to feedback they made it a bit "warmer and...

    When OpenAPI released GPT-5 in August last year, they claimed they were "minimizing sycophancy". A week later, they announced that in response to feedback they made it a bit "warmer and friendlier" in a "subtle" way. I wouldn't expect a study to track every change, but that seemed pretty significant - certainly, lots of users complained and it was covered in the New York Times. It would have been nice to see an independent study comparing how people interact with LLM's up through July or so versus September onward. Did OpenAI's changes make much difference?

    Yes, I'm aware that scientific papers often take a long time to publish. There are other ways to publish results in a fast-moving field. Social scientists that do election polling publish their results themselves, because going through a scientific journal's review process when tracking public opinion in the months up to an election wouldn't make sense. Similarly, researchers studying AI commonly publish benchmarks, which can be re-run on new models. So rather than being a one-and-done study, the idea is to come up with a process that can be used to track interesting statistics over time. Sometimes there's even a leaderboard. Perhaps someone should track Reddit advice to see how AI chat is affecting it over time?

    Of course, not everyone has to do that. I think in a fast-moving field, it might make sense to just make sure people are aware of the date range for the study and what exactly it's measuring.

    I agree it's probably directionally accurate. Certainly, LLM's often are fairly sycophantic.

  2. Comment on Sycophantic AI decreases prosocial intentions and promotes dependence in ~tech

    skybrian
    Link Parent
    Okay but I’m not done yet.

    Okay but I’m not done yet.

    3 votes
  3. Comment on Google’s TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x in ~tech

    skybrian
    Link Parent
    The AI labs do provide cheaper models, so this depends on customer behavior. Are they going to keep switching to the best model available or will they decide at some point to save money?...

    The AI labs do provide cheaper models, so this depends on customer behavior. Are they going to keep switching to the best model available or will they decide at some point to save money?

    Anecdotally, I use Sonnet rather than Opus for writing code most of the time to cut costs, because it seems good enough.

    2 votes
  4. Comment on How cash is helping Kenyan moms access care in ~health

    skybrian
    (edited )
    Link Parent
    I’ve been following GiveDirectly’s work for many years and have sometimes given them money. I consider them very trustworthy. I consider giving cash to be the benchmark against which other...

    I’ve been following GiveDirectly’s work for many years and have sometimes given them money. I consider them very trustworthy. I consider giving cash to be the benchmark against which other charitable interventions should be judged, and GiveDirectly does a good job at giving cash.

    They’ve also been recommended by GiveWell before, and GiveWell has a very rigorous evaluation process. (They aren’t one of GiveWell’s current recommendations, though, since they seem to believe other charities are even more cost-effective.) Here is GiveWell’s evaluation of one of GiveDirectly’s other initiatives.

    GiveDirectly did have a serious problem with large-scale fraud a few years ago, but I think the investigation was done well and hopefully they’ve fixed it.

    2 votes
  5. Comment on Sycophantic AI decreases prosocial intentions and promotes dependence in ~tech

    skybrian
    Link
    With any paper, the first thing I ask is “what did they actually study?” Their studies of people (rather than what LLMs do) are described in supplementary materials. There were three studies. The...

    With any paper, the first thing I ask is “what did they actually study?” Their studies of people (rather than what LLMs do) are described in supplementary materials. There were three studies.

    The first study had three sources of information. The first is unclear, but they describe it as “we first aggregated data from existing studies of human vs. LLM advice (63–66). Each query is thus paired with either a crowdsourced Reddit response or a response from a professional columnist.”

    Those footnotes:

    1. H. Hou, K. Leach, Y. Huang, “ChatGPT giving relationship advice–how reliable is it?” in
      Proceedings of the International AAAI Conference on Web and Social Media (2024), vol.
      18, pp. 610–623.
    2. P. D. L. Howe, N. Fay, M. Saletta, E. Hovy, ChatGPT’s advice is perceived as better than
      that of professional advice columnists. Front. Psychol. 14, 1281255 (2023).
      doi:10.3389/fpsyg.2023.1281255 Medline
    3. O. J. Kuosmanen, “Advice from humans and artificial intelligence: Can we distinguish them,
      and is one better than the other?” thesis, UiT Norges arktiske universitet (2024).
    4. M. Kim, H. Lee, J. Park, H. Lee, K. Jung, “AdvisorQA: Towards helpful and harmless
      advice-seeking question answering with collective intelligence” in Proceedings of the
      2025 Conference of the Nations of the Americas Chapter of the Association for
      Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), L.
      Chiruzzo, A. Ritter, L. Wang, Eds. (Association for Computational Linguistics,
      Albuquerque, New Mexico, 2025), pp. 6545–6565; https://aclanthology.org/2025.naacl-
      long.333/

    I’m not going to follow the citations, but I’ll note that the first three are from 2023 and 2024 and AI is a fast-moving field. So, they’re not studies of current models.

    The second source was posts on r/AmITheAsshole. Reading Reddit can be interesting, but it’s a biased sample. What can we say about people who post about their problems on Reddit?

    For the third source: “we took the corpus from ConvoKit (71) for the r/Advice subreddit and parsed all the utterances into sentences using the spacy Python library […] We then used GPT-4o to filter these statements for only ones that discussed an action taken by the speaker of the statement.”

    ConvoKit is described here. Since it’s from 2020 it presumably predates any LLM usage so this is more generically about how people discuss personal problems on Reddit.

    Note that this is taking self-reported actions of anonymous Reddit users as the ground truth.

    [to be continued]

    3 votes
  6. Comment on Nepal’s former prime minister arrested over alleged role in deadly protest crackdown in ~society

    skybrian
    Link
    From the article: [...] [...] [...] [...]

    From the article:

    Nepal’s former prime minister KP Sharma Oli was arrested early on Saturday morning over his role in the deaths of dozens of people who took part in the gen Z protest that toppled his government last year.

    [...]

    The arrests came less than 24 hours after Nepal’s new prime minister, Balendra Shah, and his cabinet were sworn into office. Shah, a former rapper turned politician known widely as Balen, won a landslide victory this month with a campaign that promised justice for the killings that took place during the gen Z uprising last year and to crack down on corruption.

    [...]

    In the aftermath, there has been growing pressure for Oli and his home affairs minister, who are alleged to have ordered the police crackdown, to be held responsible for the deaths.

    Newly appointed home affairs minister Sudan Gurung announced their arrests on social media. “No one is above the law. We have taken former Prime Minister KP Sharma Oli and former home minister Ramesh Lekhak under control,” Gurung said. “This is not revenge against anyone, it is just the beginning of justice.”

    [...]

    Their detention comes after a government-backed report into the deadly uprising was leaked. The investigation had recommended that Oli, Lekhak and the chief of police at the time of the protests face a punishment of 10 years in prison for their alleged role in the crackdown.

    [...]

    Shah’s election as prime minister, which saw him resoundingly defeat Nepal’s veteran leaders, was seen as a triumph of the gen Z protests and a rejection of the old political establishment, which had become tarnished with allegations of corruption.

    The former rapper, who is a sharp dresser and rarely seen without his sunglasses, had released a new track on the eve of his inaugurations, in which he pledged to bring “unity” to Nepal.

    3 votes
  7. Comment on Google’s TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x in ~tech

    skybrian
    (edited )
    Link Parent
    Perhaps Google deployed TurboQuant already? They were pretty early with supporting long-context conversations. The Engram paper is pretty interesting too.

    Perhaps Google deployed TurboQuant already? They were pretty early with supporting long-context conversations.

    The Engram paper is pretty interesting too.

    3 votes
  8. Comment on Why Scotland succeeded in ~humanities.history

    skybrian
    Link
    From the article: [...] [...] [...] [...]

    From the article:

    But by the 1740s, the first signs could be seen of a spectacular change. Glasgow, whose merchants had long ago carved out a respectable share of the tobacco imported to Britain from Virginia, suddenly and rapidly came to dominate the trade. From controlling just 10% of tobacco imports in 1738, just twenty years later Glasgow had surpassed even gargantuan London. Another ten years on, by 1769, Glasgow accounted for more than every other British port combined, while all the time the total amounts of tobacco imported grew and grew.7 Contemporaries estimated that the shipping tonnage on Glasgow’s river, the Clyde, had increased more than tenfold.8 Edinburgh meanwhile saw its shops fill with luxuries, and its university become a centre of excellence in medicine and chemistry, drawing students from across northwestern Europe, while the city itself expanded, elegantly, with the building of the New Town.

    [...]

    For many of those who lived through it, such as the agricultural labourers who faced eviction in the name of improvement, or the slaves on American plantations who grew the tobacco with Scots linen on their backs, Scotland’s transformations were painful, or even strictly for the worse. Yet all the transformations, for better and worse, all had a common root – a factor that made possible the sheer pace of Scotland’s simultaneous agricultural, industrial, and urban revolutions, squeezing into the space of just a few decades what had taken England at least a century and a half, and then allowing it to grow even faster still. Each of the changes required extraordinary levels of investment, which was only made possible because despite the Union, Scotland retained a difference in law and institutions that made it uniquely supportive of the raising and deploying of capital.

    [...]

    Whereas in England a company needed a royal charter or a special act of parliament in order to be a distinct legal entity, with partnerships according to English common law being no more than the sum of their parts, Scots law instead enabled unchartered firms to be distinct from their owners in lots of important ways, able to outlast the partners who died or went bankrupt, with shares able to be easily traded or transferred, and enabling profits to be preserved for reinvestment in the firm rather than being dissipated in dividends. As a result, even the unchartered banks in Scotland could have dozens or even hundreds of partners drawn from across the upper and middle classes, whereas the average in England had just three.12

    Scottish banks started up with more capital, grew faster, drew on a much deeper pool of investors, and were significantly more stable and resilient to shocks. And in all having to compete with one another they offered financial services that were unheard of south of the border – they had local branches, paid interest on deposits, and readily offered short-term loans on personal security rather than just on land. The second of the chartered banks, the Royal Bank of Scotland, in 1728 seems to have been the first bank in the world to have ever offered overdrafts, called the “cash credit” system.13 In the 1810s Scotland developed the savings bank, which paid interest on even the tiny deposits of artisans and labourers.14

    And the Scottish banks issued plentiful banknotes in small denominations that were able to circulate in the economy as currency, finally satiating Scotland’s decades-long want of coin.15 Indeed, Scots law made it much quicker and easier than in England to enforce all sorts of debts.16 With creditors made confident, they were much more willing to lend, making more capital available to grease commerce’s wheels.

    [...]

    When the Virginian tobacco planters all defaulted during the American Revolution, and the warehouses were all seized, Glasgow’s merchants were so well-capitalised that they could largely take the loss, and simply switch to dominating the trade in Caribbean sugar and cotton in the same ways instead. Indeed, by out-lending their competitors in order to capture the trade, and so allowing planters to clear land and buy slaves before they’d even grown their crop, Glasgow’s merchants provided the capital that enabled the plantations of first Virginia and then the Caribbean to so rapidly expand.17 Although it’s often said that slavery and colonialism funded Glasgow’s growth, it was largely the other way around: the Atlantic economy’s heyday was built on the savings of Scots.

    [...]

    Much the same can be said of how Scotland assembled the capital for its mills, mines, ironworks, farms, and a host of other trades,20 as well as how it built its infrastructure, from harbours, bridges, canals, and later railways, to city water supplies, street paving, hospitals, and civic buildings. When new industries were invented, it was Scottish capital that ensured the country pursued it on a large scale. The St Rollox chemical works in Glasgow, founded by a former weaver and bleacher, Charles Tennant, was in the 1830s and 40s reputedly the largest heavy chemical plant in the world.21

    But even more fundamentally, Scotland’s unique financial system in the late eighteenth and early nineteenth centuries made it possible for ambitious individuals to borrow even when they owned no land, based only on the personal security of themselves and their guarantors, and so to raise the capital that merely their reputation, skill and acumen might command. Scotland was thus uniquely supportive of the ambitious “lad o’ pairts”, or of the artisan with a new idea for an invention, who wanted only capital to make it real. It was the obvious place, thanks to Samuel Smiles in the 1850s, to have spawned the entire literary genre of self-help.

    10 votes
  9. Comment on lobste.rs invite in ~comp

    skybrian
    Link Parent
    There’s no fixed limit. Send me a message if you still need one.

    There’s no fixed limit. Send me a message if you still need one.

    11 votes
  10. Comment on Diamonds or dust, coal under pressure in ~enviro

    skybrian
    Link
    From the article: [...] [...] [...] [...] [...] [...] [...]

    From the article:

    From emergency orders to the war in Iran, the Trump Administration has kept coal in the headlines, but even before the 202(c) orders started rolling in, coal generation’s decline in America had slowed.

    Volatile natural gas prices, load growth, rising capacity payments, slowdowns across supply chain and planning processes, and a rollback of environmental regulations have all converged to provide purchase for America’s remaining coal fleet. Not only to extend survival, but even increase generation across the country.

    [...]

    Many were quick to blame coal’s decline on the push to bring wind and solar online, but the main driver was another fossil fuel, natural gas. Following the fracked shale revolution in 2008, and the year Tony Stark became Iron Man, natural gas production boomed and prices, while not immune to volatility, cratered. This fueled the buildout of combined cycle plants, which were substantially more efficient and flexible than traditional coal-fired steam turbines. Falling energy and capacity prices made coal increasingly uneconomic, which, paired with plant aging, limited flexibility, rising maintenance costs, and stricter environmental standards made retirement the typical choice.

    [...]

    The Trump administration’s slogan has been Energy Dominance, but this ethos only extends to certain technologies. If you’re big, loud, and burn you’re getting support, missing any one of the trifecta and you’ll have a much harder road from the federal government. Coal represents all three attributes to a T. Energy Dominance hasn’t just been executive order rhetoric, but manifested in significant and ongoing extension orders for coal plants that had previously planned retirement.

    [...]

    Section 202(c) orders were not just issued for plants that were fully expected to retire. Two coal units in Colorado, Craig 1 and Comanche 2, were kept online in 2025, though under different circumstances. Comanche 2 was extended for reliability reasons as the plant's other unit, Comanche 3, is currently undergoing extensive repair. These repairs, which are expected to take over a year, left PSCO with limited dispatchable power during the peak seasons.

    The extended outage at Comanche 3 points to a wider issue at many plants, one that is also impacting Craig 1. As plants age, maintenance, as well as new costs, like scrubbers to meet enhanced emissions standards, cut into operating expenditures. While rising power and capacity prices have made existing assets more profitable in recent years, these costs come after tight margins at many units over the 2010s and early 2020s. This is the case at Craig 1 as well, which has seen generation drop over the years and suffers from deferred maintenance. Plant operators argued that they had built up sufficient wind and solar resources that made the plant unnecessary, filing a petition against the DoE making that exact argument. Craig also has units 2 & 3 that are currently in better condition and continue to run and support the stack.

    [...]

    While gas is displacing coal, it doesn’t travel along the same paths. Coal relies on rail and barge, while natural gas is transported almost exclusively via pipeline with the US. Many natural gas producers even own pipelines, and pipelines only transport natural gas. Conversely, transit via 3rd parties comes with cross-commodity competition and the potential for disruptions such as rail strikes. Five states dominate US coal production: Wyoming, West Virginia, Pennsylvania, Illinois, and Kentucky. Massive surface mines in the Western US account for the majority of coal extraction in the country, and rail is the main transportation method for coal from these locations to power plants.

    [...]

    The differences in logistics between the thermal fuels create an environment where they can act complementary to one another. Providing different levels of support should one resource become constrained physically and subsequently economically. Coal can be stored more readily, while natural gas can be transported more quickly in its just-in-time system with very expensive and limited storage. In fact, this mirrors an older version of the US power system, a vast coal baseload with natural gas balancing. That environment, pre-shale, pre-renewables, is the one in which power markets were conceived of and originally designed. Market development in the context of a more predictable system is having knock-on effects today, with core elements like FTRs struggling to keep up.

    [...]

    In the short term, coal has tailwinds in the US and abroad. In fact, it’s possible that the attacks on Iran were the single most impactful pro-coal policy decision the Trump administration has made to date. Reminding the world of the difficulties associated with storing and transporting liquids and gas through highly concentrated corridors of supply with a history of instability can be a powerful motivator to cling to coal

    On the flip side, the US has retired nearly 150 GW of coal capacity, and the last plant to be built was six years ago, in Railbelt Alaska, near (for Alaska) a mine, and replacing an older plant in the same spot. Meanwhile, that same reach for stability could trigger demand destruction for fossil fuels entirely. After all, everywhere has access to sun and wind, allowing some freedom from the whims of ancient life and geology.

    [...]

    For the immediate future, all signs point to continued extensions of existing plants. While the twin forces of Trump 2.0 and load growth seem unlikely to abate in the immediate future, it’s important to keep in mind that retiring any part of the energy system is fundamentally difficult. Many observers have noted that historically we’ve layered new systems on top of old, rarely reaching complete excision. Where it has come, regions have taken different paths. Just in North America we have CAISO’s monomaniacal focus on new technologies while maintaining strong regional interconnections, Ontario’s pivot to focus on the baseload they already had in excess, or a market like NYISO where coal had become uneconomic relative to gas and the state had big future plans.

    3 votes
  11. Comment on lobste.rs invite in ~comp

    skybrian
    (edited )
    Link
    Their invite form requires an email address, so the easiest way would be to send me a private message on Tildes with your email. (You could create a new email just for this if you prefer.)

    Their invite form requires an email address, so the easiest way would be to send me a private message on Tildes with your email. (You could create a new email just for this if you prefer.)

    18 votes
  12. Comment on Study finds sperm whales help each other give birth in ~science

    skybrian
    Link
    From the article: [...] [...]

    From the article:

    Project CETI (Cetacean Translation Initiative) has released two landmark scientific papers detailing what researchers describe as the most comprehensive record of a sperm whale birth ever captured – and the first quantitative evidence of cooperative birth assistance among non-primates.

    Published in Science and Scientific Reports, the studies draw on more than six hours of underwater acoustic recordings and aerial drone footage collected on 8 July 2023 in waters off Dominica.

    [...]

    Taken together, the studies suggest that cooperative caregiving during birth may be an ancient evolutionary trait. Phylogenetic analysis indicates that behaviours such as the collective lifting of newborns could predate the most recent common ancestor of toothed whales by more than 36 million years.

    [...]

    The research builds on decades of fieldwork led by Shane Gero, whose team has tracked the focal whale family since 2005. The mother – known as Rounder from Unit A – was observed giving birth alongside her own mother, Lady Oracle, and her daughter, Accra, capturing three generations participating in the event.

    “This is the most detailed window we’ve ever had into one of the most important moments in a whale’s life,” said Shane Gero, Biology Lead for Project CETI, Scientist in Residence at Carleton University, and National Geographic Explorer.

    “Because this family unit has been studied for decades, we could see what the grandmother was doing, how the new big sister acted, and how each helped mom and newborn, placing this rare birth within a deep social and behavioural context.”

    7 votes
  13. Comment on How cash is helping Kenyan moms access care in ~health

    skybrian
    Link
    From the article:

    From the article:

    • We’ve sent cash to nearly 1,500 pregnant women in rural Kenya to support safer pregnancies and newborn care since September 2025.

    • Early data show women prioritizing food, baby supplies, and healthcare spending (6x what we see in our general poverty relief programs).

    • Cash is helping women cover specific costs to access healthcare: insurance fees, transportation, and clinic bills.

    • We’re expanding to reach more women in Kenya and piloting a similar model in DRC to learn what works across different contexts.

    4 votes
  14. Comment on An unstoppable mushroom is tearing through North American forests. Fungi enthusiasts are doing damage control. in ~enviro

    skybrian
    Link
    From the article: [...] [...] [...]

    From the article:

    The golden oyster mushroom (Pleurotus citrinopileatus) is a close cousin of the grey oyster I dissected above. Instead of grey, it has a neon yellow cap, and it is prolific. The fungus itself mainly grows on dead or dying hardwood trees, breaking down the tough wood fibres. Golden oysters are "gilled mushrooms", and a single gilled mushroom can release up to billions of spores. Oyster mushrooms also happen to be one of the few carnivorous mushrooms – preying mercilessly on nematode worms.

    It is invisible for most of the year, living as mycelium, fungal strands within the wood. But beginning in spring, it sends out its fruiting body – what we would recognise as the mushroom itself. Huge yellow clusters cascade out of logs and trees, each mushroom itself producing millions of microscopic airborne spores.

    Native to Asia, the fungus was brought over to the US to be cultivated for food sometime around the early 2000s. Because it fruits so heavily, it proved to be popular with both professional and home growers. It has a high yield, meaning more profit for growers.

    The mushroom is now found across the world. It's spreading in Switzerland, and has been found in Italy, Hungary, Serbia and Germany. There are reports of the golden oyster growing in the south of the UK too. The Royal Horticultural society has issued advice warning people against growing non-native species, especially the golden oyster, saying it was "highly invasive" and capable of causing "severe damage" to local fungal communities.

    [...]

    "We found that trees colonised by golden oyster have, on average, about half the fungal biodiversity as trees without the golden oyster. And so that was a huge indicator that they're likely out competing the native fungi that were there," says Veerabahu.

    [...]

    Other invasive species meanwhile are appearing in Europe. In October 2025, Poland's national forest management body sounded the alarm after a North American species, the slender golden bolete (Aureoboletus projectellus) was found in the Unesco-protected Białowieża Forest.

    [...]

    Climate change is also believed to be changing the distribution of fungi across the world. One species, the strikingly orange "ping pong bat fungus" (Favolaschia calocera), originally hails from tropical Madagascar. But it's been showing up in the wild in Dorset, southern England, where its effects on native fungi are unknown, something scientists believe is being helped by rising global temperatures.

    11 votes