How can we prevent AGI value drift? — LessWrong
Published on November 20, 2024 6:19 PM GMTI am grateful to Noosphere89 for prompting me to...
China Hawks are Manufacturing an AI Arms Race — LessWrong
Published on November 20, 2024 6:17 PM GMTDiscuss
Why I Think All The Species Of Significantly Debate Consciousness Are Conscious And Suffer Intensely — LessWrong
Published on November 20, 2024 4:48 PM GMTCrosspost of this on my blog. 1 My basic view 'Cause...
Zvi’s Thoughts on His 2nd Round of SFF — LessWrong
Published on November 20, 2024 1:40 PM GMTPreviously: Long-Term Charities: Apply For SFF Funding, Zvi’s Thoughts...
A Little Depth Goes a Long Way: the Expressive Power of Log-Depth Transformers — LessWrong
Published on November 20, 2024 11:48 AM GMTAuthors: Anonymous (I'm not one of them).Abstract:Most analysis of...
What changes should happen in the HHS? — LessWrong
Published on November 20, 2024 11:04 AM GMTCurrently, it looks like Robert F. Kennedy Jr. will...
Garrison Lovely: China Hawks are Manufacturing an AI Arms Race — LessWrong
Published on November 20, 2024 10:13 AM GMTSummary thread: https://x.com/GarrisonLovely/status/1859022323799699474. An influential congressional commission is calling for...
What are the good rationality films? — LessWrong
Published on November 20, 2024 6:04 AM GMTI run a weekly sequences-reading meetup with some friends,...
Valence Need Not Be Bounded; Utility Need Not Synthesize — LessWrong
Published on November 20, 2024 1:37 AM GMTAs I relayed in my last post, in "Theory...
Programmers, How Bad Is It out There? — LessWrong
Published on November 20, 2024 12:57 AM GMTI work in a niche company in a niche...
Social events with plausible deniability — LessWrong
Published on November 18, 2024 6:25 PM GMTWe wanted to run an event where controversial opinions...
Ethical Implications of the Quantum Multiverse — LessWrong
Published on November 18, 2024 4:00 PM GMTWhat kinds of ethical implications should we expect from...
Reducing x-risk might be actively harmful — LessWrong
Published on November 18, 2024 2:25 PM GMTGreat. Another crucial consideration I missed. I was convinced...
How likely is brain preservation to work? — LessWrong
Published on November 18, 2024 4:58 PM GMTPeople often ask me this. It’s a good question....
Why imperfect adversarial robustness doesn't doom AI control — LessWrong
Published on November 18, 2024 4:05 PM GMT(thanks to Alex Mallen, Cody Rushing, Zach Stein-Perlman, Hoagy...
Monthly Roundup #24: November 2024 — LessWrong
Published on November 18, 2024 1:20 PM GMTThis is your monthly roundup. Let’s get right to...
A Straightforward Explanation of the Good Regulator Theorem — LessWrong
Published on November 18, 2024 12:45 PM GMTThis post was written during the agent foundations fellowship...
The Choice Transition — LessWrong
Published on November 18, 2024 12:30 PM GMTOn the emergence of history's reinsOne general law, leading...
Proposal to increase fertility: University parent clubs — LessWrong
Published on November 18, 2024 4:21 AM GMTFertility rates in the developed world are too low...
Small improvement to Wikipedia page on Pareto Efficiency — LessWrong
Published on November 18, 2024 2:13 AM GMTNote: I would have done this as a Quick...
What (if anything) made your p(doom) go down in 2024? — LessWrong
Published on November 16, 2024 4:46 PM GMTDiscuss
Gwerns — LessWrong
Published on November 16, 2024 2:31 PM GMTAt every turn there was a Gwern, Within this...
Which evals resources would be good? — LessWrong
Published on November 16, 2024 2:24 PM GMTI want to make a serious effort to create...
OpenAI Email Archives (from Musk v. Altman) — LessWrong
Published on November 16, 2024 6:38 AM GMTAs part of the court case between Elon Musk...
Using Dangerous AI, But Safely? — LessWrong
Published on November 16, 2024 4:29 AM GMTRob Miles has released a new video, this time...
Ayn Rand’s model of “living money”; and an upside of burnout — LessWrong
Published on November 16, 2024 2:59 AM GMTEpistemic status: Toy model. Oversimplified, but has been anecdotally...
Fundamental Uncertainty: Epilogue — LessWrong
Published on November 16, 2024 12:57 AM GMTI wrote a whole book! What's next?I'm currently doing...
Making a conservative case for alignment — LessWrong
Published on November 15, 2024 6:55 PM GMTTrump and the Republican party will yield broad governmental...
Win/continue/lose scenarios and execute/replace/audit protocols — LessWrong
Published on November 15, 2024 3:47 PM GMTIn this post, I’ll make a technical point that...
Proposing the Conditional AI Safety Treaty (linkpost TIME) — LessWrong
Published on November 15, 2024 1:59 PM GMTTechnological progress can excite us, politics can infuriate us,...
Seven lessons I didn't learn from election day — LessWrong
Published on November 14, 2024 6:39 PM GMTI spent most of my election day -- 3pm...
Effects of Non-Uniform Sparsity on Superposition in Toy Models — LessWrong
Published on November 14, 2024 4:59 PM GMTAbstractThis post summarises my findings on the effects of...
The Early Christian Strategy — LessWrong
Published on November 14, 2024 5:02 PM GMTScott Alexander's latest today discusses Robert Axelrod's Prisoner’s Dilemma...
'Estimat - Values and Data’s For Starters'- A Necessary Proposal? — LessWrong
Published on November 14, 2024 2:37 PM GMT1. PROBLEM In today’s digital era, teenagers face a dual...
AI #90: The Wall — LessWrong
Published on November 14, 2024 2:10 PM GMTAs the Trump transition continues and we try to...
Evolutionary prompt optimization for SAE feature visualization — LessWrong
Published on November 14, 2024 1:06 PM GMTTLDR:Fluent dreaming for language models is an algorithm based on...
AXRP Episode 38.0 - Zhijing Jin on LLMs, Causality, and Multi-Agent Systems — LessWrong
Published on November 14, 2024 7:00 AM GMTYouTube link Do language models understand the causal structure...
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI — LessWrong
Published on November 14, 2024 6:13 AM GMTFrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in...
Concrete Methods for Heuristic Estimation on Neural Networks — LessWrong
Published on November 14, 2024 5:07 AM GMTThanks to Erik Jenner for helpful comments and discussion(Epistemic...
Heresies in the Shadow of the Sequences — LessWrong
Published on November 14, 2024 5:01 AM GMTReligions are collections of cherished but mistaken principles. So...
Current Attitudes Toward AI Provide Little Data Relevant to Attitudes Toward AGI — LessWrong
Published on November 12, 2024 6:23 PM GMTEpistemic status: Sudden public attitude shift seems quite possible,...
Basics of Handling Disagreements with People — LessWrong
Published on November 12, 2024 5:55 PM GMTEpistemic Status: This is a collection of useful heuristics...
Registrations Open for 2024 NYC Secular Solstice & Megameetup — LessWrong
Published on November 12, 2024 5:50 PM GMTOn December 14th, New York City will have a...
2024 NYC Secular Solstice & Megameetup — LessWrong
Published on November 12, 2024 5:46 PM GMTSecular Solstice is a celebration of hope in darkness....
2025 Q1 Pivotal Research Fellowship (Technical & Policy) — LessWrong
Published on November 12, 2024 10:56 AM GMTWe’re excited to announce that applications are now open...
Theories With Mentalistic Atoms Are As Validly Called Theories As Theories With Only Non-Mentalistic Atoms — LessWrong
Published on November 12, 2024 6:45 AM GMT[ This is supposed to be a didactic post....
The lying p value — LessWrong
Published on November 12, 2024 6:12 AM GMTQuick check: do you agree or disagree with the...
The Packaging and the Payload — LessWrong
Published on November 12, 2024 3:07 AM GMTI.As I've run and studied meetups, there's a useful...
Consider tabooing "I think" — LessWrong
Published on November 12, 2024 2:00 AM GMTPeople say "I think" a lot. Here are some...
Festival Stats 2024 — LessWrong
Published on November 12, 2024 2:00 AM GMT Each year ( 2014, 2015, 2016, 2017, 2018,...