FLAKE-Bench: Outsourcing Awkwardness in the Age of AI — LessWrong
Published on April 1, 2025 5:08 PM GMTIntroductionA key part of modern social dynamics is flaking...
Introducing Deepgeek — LessWrong
Published on April 1, 2025 4:41 PM GMTTL;DR: We present Deepgeek, a new AI language model...
Comments on Karma systems — LessWrong
Published on April 1, 2025 12:53 PM GMTA Karma system is a resource sharing mechanism based...
LessWrong has been acquired by EA — LessWrong
Published on April 1, 2025 1:09 PM GMTHey Everyone,It is with a sense of... considerable cognitive...
Insect Suffering Is The Biggest Issue: What To Do About It — LessWrong
Published on April 1, 2025 12:51 PM GMT1 IntroductionCrosspost from my blog. A beetle lay crushed in...
Organisation-Level Lock-In Risk Interventions — LessWrong
Published on April 1, 2025 12:42 PM GMTEpistemic status: my own thoughts and opinions after thinking...
Keltham's Lectures in Project Lawful — LessWrong
Published on April 1, 2025 10:39 AM GMT(Not an April fools' joke)If anyone wants to return...
Was the historical Jesus talking about evolution? (You might be surprised) — LessWrong
Published on April 1, 2025 10:32 AM GMTOut of all the research rabbit holes I've ever...
Follow me on TikTok — LessWrong
Published on April 1, 2025 8:22 AM GMTFor more than five years, I've posted an average...
New Cause Area Proposal — LessWrong
Published on April 1, 2025 7:12 AM GMTEpistemic status - statistically verified. I'm writing this post to...
Why do many people who care about AI Safety not clearly endorse PauseAI? — LessWrong
Published on March 30, 2025 6:06 PM GMTtl;dr:From my current understanding, one of the following two...
Extracting proper nouns a model "knows" using entity-detection neurons. — LessWrong
Published on March 30, 2025 4:58 PM GMTIntroductionResearch on Sparse Autoencoders (SAEs) has identified "known entity"...
The g-Zombie Formal Argument — LessWrong
Published on March 30, 2025 1:16 PM GMTNote: I created an entry highlighting the formal attack...
Memory Persistence within Conversation Threads with Multimodal LLMS — LessWrong
Published on March 30, 2025 7:16 AM GMTIn neuroscience, we learned about foveated vision — our...
How I talk to those above me — LessWrong
Published on March 30, 2025 6:54 AM GMTNow and then, at work, we’ll have a CEO...
I, G(Zombie) — LessWrong
Published on March 30, 2025 1:24 AM GMTThere is no such thing as philosophy-free science; there...
Exercising Rationality — LessWrong
Published on March 29, 2025 7:08 PM GMTOr: Why thinking about blue tentacle arms is not...
Climbing the Hill of Experiments — LessWrong
Published on March 29, 2025 8:37 PM GMTA better anything can be achieved with simple tests...
Does the AI control agenda broadly rely on no FOOM being possible? — LessWrong
Published on March 29, 2025 7:38 PM GMTFor the purposes of FOOM, I'm defining it as...
Yeshua's Basilisk — LessWrong
Published on March 29, 2025 6:11 PM GMTSuppose you’re an AI researcher trying to make AIs...
40 - Jason Gross on Compact Proofs and Interpretability — LessWrong
Published on March 28, 2025 6:40 PM GMTYouTube link How do we figure out whether interpretability...
AI x Bio Workshop — LessWrong
Published on March 28, 2025 5:21 PM GMTMay 9 & 10Lighthaven, Berkeley Hosted by The Longevity...
How many times faster can the AGI advance the science than humans do? — LessWrong
Published on March 28, 2025 3:16 PM GMTI hope that the reasoning in my two posts...
Gemini 2.5 is the New SoTA — LessWrong
Published on March 28, 2025 2:20 PM GMTGemini 2.5 Pro Experimental is America’s next top large...
Will the Need to Retrain AI Models from Scratch Block a Software Intelligence Explosion? — LessWrong
Published on March 28, 2025 2:12 PM GMTTl;dr: no.This is a rough research note – we’re...
How We Might All Die in A Year — LessWrong
Published on March 28, 2025 1:22 PM GMTIlya thought back again to when he’d overheard that...
The vision of Bill Thurston — LessWrong
Published on March 28, 2025 11:45 AM GMTPDF version. berkeleygenomics.org. X.com. Bluesky. William Thurston was a...
What Uniparental Disomy Tells Us About Improper Imprinting in Humans — LessWrong
Published on March 28, 2025 11:24 AM GMTTLDRWe do not yet understand all the genetic imprints...
Explaining British Naval Dominance During the Age of Sail — LessWrong
Published on March 28, 2025 5:47 AM GMTThe other day I discussed how high monitoring costs...
Will the AGIs be able to run the civilisation? — LessWrong
Published on March 28, 2025 4:50 AM GMTEven an AGI "aligned" to a purpose which doesn't...
Will AI R&D Automation Cause a Software Intelligence Explosion? — LessWrong
Published on March 26, 2025 6:12 PM GMTEmpirical evidence suggests that, if AI automates AI research,...
Why Does Unemployment Happen? — LessWrong
Published on March 26, 2025 6:02 PM GMTAnd specifically, what does this imply for AI? There...
Apply to become a Futurekind AI Facilitator or Mentor (deadline: April 10) — LessWrong
Published on March 26, 2025 3:47 PM GMTWe are accepting applications for up to 12 paid...
Finding Emergent Misalignment — LessWrong
Published on March 26, 2025 5:33 PM GMTWe've recently published a paper on Emergent Misalignment, where...
Center on Long-Term Risk: Summer Research Fellowship 2025 - Apply Now — LessWrong
Published on March 26, 2025 5:29 PM GMTSummary: CLR is hiring for our Summer Research Fellowship....
Eukaryote Skips Town - Why I'm leaving DC — LessWrong
Published on March 26, 2025 5:16 PM GMTI’ve spent the past 7 years living in the...
Language and My Frustration Continue in Our RSI — LessWrong
Published on March 26, 2025 2:13 PM GMTWhat's this post about?I make some rants and recommendations...
Would it be effective to learn a language to improve cognition? — LessWrong
Published on March 26, 2025 10:17 AM GMTo1 has shown a strange behavior where it thinks...
New AI safety treaty paper out! — LessWrong
Published on March 26, 2025 9:29 AM GMTLast year, we (the Existential Risk Observatory) published a Time...
Map of all 40 copyright suits v. AI in U.S. — LessWrong
Published on March 26, 2025 7:57 AM GMTDownload the latest PDF with links to court dockets...
AI "Deep Research" Tools Reviewed — LessWrong
Published on March 24, 2025 6:40 PM GMTMidjourney: “an artificially intelligent researcher, library, posthuman archivist, mapping...
Notes on countermeasures for exploration hacking (aka sandbagging) — LessWrong
Published on March 24, 2025 6:39 PM GMTIf we naively apply RL to a scheming AI,...
Subversion Strategy Eval: Can language models statelessly strategize to subvert control protocols? — LessWrong
Published on March 24, 2025 5:55 PM GMTWe recently released Subversion Strategy Eval: Can language models statelessly...
From Loops to Klein Bottles: Uncovering Hidden Topology in High Dimensional Data — LessWrong
Published on March 24, 2025 5:09 PM GMTMotivationDimensionality reduction is vital to the analysis of high...
Will Jesus Christ return in an election year? — LessWrong
Published on March 24, 2025 4:50 PM GMTThanks to Jesse Richardson for discussion.Polymarket asks: will Jesus...
Sentinel's Global Risks Weekly Roundup #12/2025: Famine in Gaza, H7N9 outbreak, US geopolitical leadership weakening. — LessWrong
Published on March 24, 2025 4:46 PM GMTExecutive summaryForecasters believe there’s an 18% chance (range: 4%-50%)...
Delicious Boy Slop - Boring Diet, Effortless Weightloss — LessWrong
Published on March 24, 2025 3:01 PM GMTYour beloved 34 year old author is never hungryI...
More on Various AI Action Plans — LessWrong
Published on March 24, 2025 1:10 PM GMTLast week I covered Anthropic’s relatively strong submission, and...
Emergent scaling effects on the functional hierarchies within LLMs — LessWrong
Published on March 24, 2025 1:03 PM GMTI have been poking around with LLMs, and I...
Recommender Alignment for Lock-In Risk — LessWrong
Published on March 24, 2025 12:56 PM GMTEpistemic status: my own research and reasoning about lock-in...