~www_lesswrong_com | Bookmarks (706)
-
Systematic Sandbagging Evaluations on Claude 3.5 Sonnet — LessWrong
Published on February 14, 2025 1:22 AM GMTThis was the project I worked on during BlueDot...
-
Notes on the Presidential Election of 1836 — LessWrong
Published on February 13, 2025 11:40 PM GMTIn 1836, Andrew Jackson had served two terms. In...
-
I'm making a ttrpg about life in an intentional community during the last year before the Singularity — LessWrong
Published on February 13, 2025 9:54 PM GMTHi there! I'm Thomas Eliot. You may remember me...
-
The Paris AI Anti-Safety Summit — LessWrong
Published on February 12, 2025 2:00 PM GMTIt doesn’t look good. What used to be the...
-
Inside the dark forests of the internet — LessWrong
Published on February 12, 2025 10:20 AM GMTThis is the second part of a series on...
-
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs — LessWrong
Published on February 12, 2025 9:15 AM GMTDiscuss
-
Why you maybe should lift weights, and How to. — LessWrong
Published on February 12, 2025 5:15 AM GMTWho this post is for? Someone who either:Wonders if...
-
If Neuroscientists Succeed — LessWrong
Published on February 11, 2025 3:33 PM GMTIntroductionIn the Spring of 2022, Stuart Russell wrote an...
-
Where Would Good Forecasts Most Help AI Governance Efforts? — LessWrong
Published on February 11, 2025 6:15 PM GMTThanks to Josh Rosenberg for comments and discussion.IntroductionOne of...
-
AI Safety at the Frontier: Paper Highlights, January '25 — LessWrong
Published on February 11, 2025 4:14 PM GMTThis is the selection of AI safety papers from...
-
The News is Never Neglected — LessWrong
Published on February 11, 2025 2:59 PM GMTDear Lsusr,I am inspired by your stories about Effective...
-
The AI Safety Approach in the Era of Open-Source AI — LessWrong
Published on February 11, 2025 2:01 PM GMTOpen-Source AI Undermines Traditional AI Safety ApproachIn the past...
-
What About The Horses? — LessWrong
Published on February 11, 2025 1:59 PM GMTIn a previous post, I argued that AGI would...
-
On Deliberative Alignment — LessWrong
Published on February 11, 2025 1:00 PM GMTNot too long ago, OpenAI presented a paper on...
-
Detecting AI Agent Failure Modes in Simulations — LessWrong
Published on February 11, 2025 11:10 AM GMTAI agents have become significantly more common in the...
-
World Citizen Assembly about AI - Announcement — LessWrong
Published on February 11, 2025 10:51 AM GMTDiscuss
-
Visual Reference for Frontier Large Language Models — LessWrong
Published on February 11, 2025 5:14 AM GMTHopefully this can be a helpful visual reference for...
-
Why Did Elon Musk Just Offer to Buy Control of OpenAI for $100 Billion? — LessWrong
Published on February 11, 2025 12:20 AM GMTDiscuss
-
Forecasting newsletter #2/2025: Forecasting meetup network — LessWrong
Published on February 9, 2025 6:07 PM GMTHighlightsForecasting meetup network (a) looking for volunteers. If you...
-
How identical twin sisters feel about nieces vs their own daughters — LessWrong
Published on February 9, 2025 5:36 PM GMT(cross posted from https://mugwumpery.com/how-identical-twin-sisters-feel-about-nieces-vs-their-own-daughters/)It seems to be generally assumed...
-
Two hemispheres - I do not think it means what you think it means — LessWrong
Published on February 9, 2025 3:33 PM GMTI am going to address some misconceptions about brain...
-
The Structure of Professional Revolutions — LessWrong
Published on February 9, 2025 1:23 PM GMTAn expert is not merely someone who has memorized...
-
Gary Marcus now saying AI can't do things it can already do — LessWrong
Published on February 9, 2025 12:24 PM GMTJanuary 2020, Gary Marcus wrote GPT-2 And The Nature...
-
How do you make a 250x better vaccine at 1/10 the cost? Develop it in India. — LessWrong
Published on February 9, 2025 3:53 AM GMT(I made a vaccinology/policy-based podcast! A very long one!...