Bookmarks (704)

  • screenshot

    How Democratic Is Effective Altruism — Really? — LessWrong

    Published on April 25, 2025 4:02 PM GMTIntroductionEffective Altruism (EA) is a social movement that aims...

  • screenshot

    Will Programmer Compensation Decouple from Productivity? — LessWrong

    Published on April 25, 2025 3:32 PM GMTSince the 1970s, productivity has outpaced wage growth in...

  • screenshot

    Zstd Window Size — LessWrong

    Published on April 25, 2025 2:40 PM GMT At work we've recently been using zstd as...

  • screenshot

    List of petitions against OpenAI's for-profit move — LessWrong

    Published on April 25, 2025 10:03 AM GMTLetters to attorney generals, etc, to block OpenAI from...

  • screenshot

    A review of "Why Did Environmentalism Become Partisan?" — LessWrong

    Published on April 25, 2025 5:12 AM GMTI was recently encouraged to read Jeffrey Heninger's report...

  • screenshot

    LLM Pareto Frontier But Live — LessWrong

    Published on April 24, 2025 9:22 PM GMTTLDR: I really like the graph where they show...

  • screenshot

    Modifying LLM Beliefs with Synthetic Document Finetuning — LessWrong

    Published on April 24, 2025 9:15 PM GMTIn this post, we study whether we can modify...

  • screenshot

    This prompt (sometimes) makes ChatGPT think about terrorist organisations — LessWrong

    Published on April 24, 2025 9:15 PM GMTYesterday, I couldn't wrap my head around some programming...

  • screenshot

    Token and Taboo — LessWrong

    Published on April 24, 2025 8:17 PM GMTWhat in retrospect seem like serious moral crimes were...

  • screenshot

    Trouble at Miningtown: Prologue — LessWrong

    Published on April 24, 2025 7:09 PM GMTIn late 2019 I wrote a TTRPG.The theme was...

  • screenshot

    Putting up Bumpers — LessWrong

    Published on April 23, 2025 4:05 PM GMTtl;dr: Even if we can't solve alignment, we can...

  • screenshot

    The AI Belief-Consistency Letter — LessWrong

    Published on April 23, 2025 12:01 PM GMTDear policymakers,We demand that the AI alignment budget be...

  • screenshot

    Jaan Tallinn's 2024 Philanthropy Overview — LessWrong

    Published on April 23, 2025 11:06 AM GMTto follow up my philantropic pledge from 2020, i've...

  • screenshot

    Fish and Faces — LessWrong

    Published on April 23, 2025 3:35 AM GMTWhat would it take to convince you to come...

  • screenshot

    Are we "being poisoned"? — LessWrong

    Published on April 23, 2025 5:11 AM GMTI would like to revisit some of the concepts...

  • screenshot

    To Understand History, Keep Former Population Distributions In Mind — LessWrong

    Published on April 23, 2025 4:51 AM GMTGuillaume Blanc has a piece in Works in Progress...

  • screenshot

    Is alignment reducible to becoming more coherent? — LessWrong

    Published on April 22, 2025 11:47 PM GMTEpistemic status: Like all alignment ideas, this one is...

  • screenshot

    The EU Is Asking for Feedback on Frontier AI Regulation (Open to Global Experts)—This Post Breaks Down What’s at Stake for AI Safety — LessWrong

    Published on April 22, 2025 8:39 PM GMTThe European AI Office is currently writing the rules...

  • screenshot

    Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games — LessWrong

    Published on April 22, 2025 7:25 PM GMTSummary:Traditional LLMs outperform reasoning models in cooperative Public Goods...

  • screenshot

    Alignment from equivariance II - language equivariance as a way of figuring out what an AI "means" — LessWrong

    Published on April 22, 2025 7:04 PM GMTI recently had the privilege of having my idea...

  • screenshot

    There is no Red Line — LessWrong

    Published on April 22, 2025 6:28 PM GMTThere will be no single moment, no dramatic cinematic...

  • screenshot

    Manifund 2025 Regrants — LessWrong

    Published on April 22, 2025 5:36 PM GMTEach year, Manifund partners with regrantors: experts in the...

  • screenshot

    AISN#52: An Expert Virology Benchmark — LessWrong

    Published on April 22, 2025 5:08 PM GMTWelcome to the AI Safety Newsletter by the Center...

  • screenshot

    Problems with Bayesianism: A Socratic Dialogue — LessWrong

    Published on April 22, 2025 2:09 PM GMTCrossposted from my blog In this fictional dialogue between a...

  • screenshot

    Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt — LessWrong

    Published on April 22, 2025 1:21 PM GMTJoel Z. Leibo [1], Alexander Sasha Vezhnevets [1], William A. Cunningham...

  • screenshot

    You Better Mechanize — LessWrong

    Published on April 22, 2025 1:10 PM GMTOr you had better not. The question is which...

  • screenshot

    Experimental testing: can I treat myself as a random sample? — LessWrong

    Published on April 22, 2025 12:34 PM GMTTL;DR: Several experiments show that I can extract useful...

  • screenshot

    Family-line selection optimizer — LessWrong

    Published on April 22, 2025 7:16 AM GMTO3 and Claude 3.7 are terribly dishonest creatures. Gemini...

  • screenshot

    Accountability Sinks — LessWrong

    Published on April 22, 2025 5:00 AM GMTThis is a cross-post from https://250bpm.substack.com/p/accountability-sinksBack in the 1990s,...

  • screenshot

    Most AI value will come from broad automation, not from R&D — LessWrong

    Published on April 22, 2025 3:22 AM GMTThis is a linkpost to an article by Ege...

  • screenshot

    Q2 AI Forecasting Benchmark: $30,000 in Prizes — LessWrong

    Published on April 21, 2025 5:29 PM GMTDiscuss

  • screenshot

    Crime and Punishment #1 — LessWrong

    Published on April 21, 2025 3:30 PM GMTThis seemed like a good next topic to spin...

  • screenshot

    Improving CNNs with Klein Networks: A Topological Approach to AI — LessWrong

    Published on April 21, 2025 3:21 PM GMTIn our earlier post, we described how one could...

  • screenshot

    Eulogy to the Obits — LessWrong

    Published on April 21, 2025 2:10 PM GMTBy Xander BalwitWith death all but obsolete, Jamie’s life...

  • screenshot

    Not All Beliefs Are Created Equal: Diagnosing Toxic Ideologies — LessWrong

    Published on April 21, 2025 3:18 AM GMTEpistemic status: exploratory but confident. This essay presents a...

  • screenshot

    Research Notes: Running Claude 3.7, Gemini 2.5 Pro, and o3 on Pokémon Red — LessWrong

    Published on April 21, 2025 3:52 AM GMTDisclaimer: this post was not written by me, but...

  • screenshot

    AI 2027 is a Bet Against Amdahl's Law — LessWrong

    Published on April 21, 2025 3:09 AM GMTAI 2027 lies at a Pareto frontier – it...

  • screenshot

    Severance and the Ethics of the Conscious Agents — LessWrong

    Published on April 21, 2025 2:21 AM GMT***Severance Spoilers!***Nick Bostrom talks about coherent, extrapolated ethics as...

  • screenshot

    March-April 2025 Progress in Guaranteed Safe AI — LessWrong

    Published on April 20, 2025 7:00 PM GMTSay hi at ICSE in Ottawa, I’ll be at...

  • screenshot

    How to end credentialism — LessWrong

    Published on April 20, 2025 6:50 PM GMTThe current University System is bad. Very bad. Half...

  • screenshot

    How Close We Are to a Complete List of Imprinted Genes — LessWrong

    Published on April 19, 2025 6:37 PM GMTThis post summarizes some of the research I have...

  • screenshot

    AI, Alignment & the Art of Relationship Design — LessWrong

    Published on April 19, 2025 12:47 AM GMTWe don’t always know what we’re looking for until...

  • screenshot

    Novel Idea Generation in LLMs: Judgment as Bottleneck — LessWrong

    Published on April 19, 2025 3:37 PM GMTIn the face of any hard problem—reversing climate change,...

  • screenshot

    Why Should I Assume CCP AGI is Worse Than USG AGI? — LessWrong

    Published on April 19, 2025 2:47 PM GMTThough, given my doomerism, I think the natsec framing...

  • screenshot

    An Introduction to SAEs and their Variants for Mech Interp — LessWrong

    Published on April 19, 2025 2:09 PM GMTI aim to cover a lot of ground, but...

  • screenshot

    AI Advances and Detection Strategy — LessWrong

    Published on April 19, 2025 11:40 AM GMT Cross-posted from my NAO Notebook. This is an...

  • screenshot

    Emotional Theory for a Technical Manual on How Not to Freeze Completely — LessWrong

    Published on April 19, 2025 9:12 AM GMTThe ambulance screeched to a halt with the flair...

  • screenshot

    SecureDrop review — LessWrong

    Published on April 19, 2025 4:29 AM GMTThis is a living document. Crosspost below may not...

  • screenshot

    o3 Will Use Its Tools For You — LessWrong

    Published on April 18, 2025 9:20 PM GMTOpenAI has finally introduced us to the full o3...

  • screenshot

    AI Control Methods Literature Review — LessWrong

    Published on April 18, 2025 9:15 PM GMTAI Control is a subfield of AI Safety research...