How Democratic Is Effective Altruism — Really? — LessWrong
Published on April 25, 2025 4:02 PM GMTIntroductionEffective Altruism (EA) is a social movement that aims...
Will Programmer Compensation Decouple from Productivity? — LessWrong
Published on April 25, 2025 3:32 PM GMTSince the 1970s, productivity has outpaced wage growth in...
Zstd Window Size — LessWrong
Published on April 25, 2025 2:40 PM GMT At work we've recently been using zstd as...
List of petitions against OpenAI's for-profit move — LessWrong
Published on April 25, 2025 10:03 AM GMTLetters to attorney generals, etc, to block OpenAI from...
A review of "Why Did Environmentalism Become Partisan?" — LessWrong
Published on April 25, 2025 5:12 AM GMTI was recently encouraged to read Jeffrey Heninger's report...
LLM Pareto Frontier But Live — LessWrong
Published on April 24, 2025 9:22 PM GMTTLDR: I really like the graph where they show...
Modifying LLM Beliefs with Synthetic Document Finetuning — LessWrong
Published on April 24, 2025 9:15 PM GMTIn this post, we study whether we can modify...
This prompt (sometimes) makes ChatGPT think about terrorist organisations — LessWrong
Published on April 24, 2025 9:15 PM GMTYesterday, I couldn't wrap my head around some programming...
Token and Taboo — LessWrong
Published on April 24, 2025 8:17 PM GMTWhat in retrospect seem like serious moral crimes were...
Trouble at Miningtown: Prologue — LessWrong
Published on April 24, 2025 7:09 PM GMTIn late 2019 I wrote a TTRPG.The theme was...
Putting up Bumpers — LessWrong
Published on April 23, 2025 4:05 PM GMTtl;dr: Even if we can't solve alignment, we can...
The AI Belief-Consistency Letter — LessWrong
Published on April 23, 2025 12:01 PM GMTDear policymakers,We demand that the AI alignment budget be...
Jaan Tallinn's 2024 Philanthropy Overview — LessWrong
Published on April 23, 2025 11:06 AM GMTto follow up my philantropic pledge from 2020, i've...
Fish and Faces — LessWrong
Published on April 23, 2025 3:35 AM GMTWhat would it take to convince you to come...
Are we "being poisoned"? — LessWrong
Published on April 23, 2025 5:11 AM GMTI would like to revisit some of the concepts...
To Understand History, Keep Former Population Distributions In Mind — LessWrong
Published on April 23, 2025 4:51 AM GMTGuillaume Blanc has a piece in Works in Progress...
Is alignment reducible to becoming more coherent? — LessWrong
Published on April 22, 2025 11:47 PM GMTEpistemic status: Like all alignment ideas, this one is...
The EU Is Asking for Feedback on Frontier AI Regulation (Open to Global Experts)—This Post Breaks Down What’s at Stake for AI Safety — LessWrong
Published on April 22, 2025 8:39 PM GMTThe European AI Office is currently writing the rules...
Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games — LessWrong
Published on April 22, 2025 7:25 PM GMTSummary:Traditional LLMs outperform reasoning models in cooperative Public Goods...
Alignment from equivariance II - language equivariance as a way of figuring out what an AI "means" — LessWrong
Published on April 22, 2025 7:04 PM GMTI recently had the privilege of having my idea...
There is no Red Line — LessWrong
Published on April 22, 2025 6:28 PM GMTThere will be no single moment, no dramatic cinematic...
Manifund 2025 Regrants — LessWrong
Published on April 22, 2025 5:36 PM GMTEach year, Manifund partners with regrantors: experts in the...
AISN#52: An Expert Virology Benchmark — LessWrong
Published on April 22, 2025 5:08 PM GMTWelcome to the AI Safety Newsletter by the Center...
Problems with Bayesianism: A Socratic Dialogue — LessWrong
Published on April 22, 2025 2:09 PM GMTCrossposted from my blog In this fictional dialogue between a...
Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt — LessWrong
Published on April 22, 2025 1:21 PM GMTJoel Z. Leibo [1], Alexander Sasha Vezhnevets [1], William A. Cunningham...
You Better Mechanize — LessWrong
Published on April 22, 2025 1:10 PM GMTOr you had better not. The question is which...
Experimental testing: can I treat myself as a random sample? — LessWrong
Published on April 22, 2025 12:34 PM GMTTL;DR: Several experiments show that I can extract useful...
Family-line selection optimizer — LessWrong
Published on April 22, 2025 7:16 AM GMTO3 and Claude 3.7 are terribly dishonest creatures. Gemini...
Accountability Sinks — LessWrong
Published on April 22, 2025 5:00 AM GMTThis is a cross-post from https://250bpm.substack.com/p/accountability-sinksBack in the 1990s,...
Most AI value will come from broad automation, not from R&D — LessWrong
Published on April 22, 2025 3:22 AM GMTThis is a linkpost to an article by Ege...
Q2 AI Forecasting Benchmark: $30,000 in Prizes — LessWrong
Published on April 21, 2025 5:29 PM GMTDiscuss
Crime and Punishment #1 — LessWrong
Published on April 21, 2025 3:30 PM GMTThis seemed like a good next topic to spin...
Improving CNNs with Klein Networks: A Topological Approach to AI — LessWrong
Published on April 21, 2025 3:21 PM GMTIn our earlier post, we described how one could...
Eulogy to the Obits — LessWrong
Published on April 21, 2025 2:10 PM GMTBy Xander BalwitWith death all but obsolete, Jamie’s life...
Not All Beliefs Are Created Equal: Diagnosing Toxic Ideologies — LessWrong
Published on April 21, 2025 3:18 AM GMTEpistemic status: exploratory but confident. This essay presents a...
Research Notes: Running Claude 3.7, Gemini 2.5 Pro, and o3 on Pokémon Red — LessWrong
Published on April 21, 2025 3:52 AM GMTDisclaimer: this post was not written by me, but...
AI 2027 is a Bet Against Amdahl's Law — LessWrong
Published on April 21, 2025 3:09 AM GMTAI 2027 lies at a Pareto frontier – it...
Severance and the Ethics of the Conscious Agents — LessWrong
Published on April 21, 2025 2:21 AM GMT***Severance Spoilers!***Nick Bostrom talks about coherent, extrapolated ethics as...
March-April 2025 Progress in Guaranteed Safe AI — LessWrong
Published on April 20, 2025 7:00 PM GMTSay hi at ICSE in Ottawa, I’ll be at...
How to end credentialism — LessWrong
Published on April 20, 2025 6:50 PM GMTThe current University System is bad. Very bad. Half...
How Close We Are to a Complete List of Imprinted Genes — LessWrong
Published on April 19, 2025 6:37 PM GMTThis post summarizes some of the research I have...
AI, Alignment & the Art of Relationship Design — LessWrong
Published on April 19, 2025 12:47 AM GMTWe don’t always know what we’re looking for until...
Novel Idea Generation in LLMs: Judgment as Bottleneck — LessWrong
Published on April 19, 2025 3:37 PM GMTIn the face of any hard problem—reversing climate change,...
Why Should I Assume CCP AGI is Worse Than USG AGI? — LessWrong
Published on April 19, 2025 2:47 PM GMTThough, given my doomerism, I think the natsec framing...
An Introduction to SAEs and their Variants for Mech Interp — LessWrong
Published on April 19, 2025 2:09 PM GMTI aim to cover a lot of ground, but...
AI Advances and Detection Strategy — LessWrong
Published on April 19, 2025 11:40 AM GMT Cross-posted from my NAO Notebook. This is an...
Emotional Theory for a Technical Manual on How Not to Freeze Completely — LessWrong
Published on April 19, 2025 9:12 AM GMTThe ambulance screeched to a halt with the flair...
SecureDrop review — LessWrong
Published on April 19, 2025 4:29 AM GMTThis is a living document. Crosspost below may not...
o3 Will Use Its Tools For You — LessWrong
Published on April 18, 2025 9:20 PM GMTOpenAI has finally introduced us to the full o3...
AI Control Methods Literature Review — LessWrong
Published on April 18, 2025 9:15 PM GMTAI Control is a subfield of AI Safety research...