~www_lesswrong_com | Bookmarks (664)
-
Generic advice caveats — LessWrong
Published on October 30, 2024 9:03 PM GMT You were (probably) linked here from some advice. Unfortunately,...
-
I turned decision theory problems into memes about trolleys — LessWrong
Published on October 30, 2024 8:13 PM GMT I hope it has some educational, memetic or at...
-
The Alignment Trap: AI Safety as Path to Power — LessWrong
Published on October 29, 2024 3:21 PM GMT Recent discussions about artificial intelligence safety have focused heavily...
-
Housing Roundup #10 — LessWrong
Published on October 29, 2024 1:50 PM GMT There’s more campaign talk about housing. The talk of...
-
[Intuitive self-models] 7. Hearing Voices, and Other Hallucinations — LessWrong
Published on October 29, 2024 1:36 PM GMT 7.1 Post summary / Table of contents. This is the...
-
Review: “The Case Against Reality” — LessWrong
Published on October 29, 2024 1:13 PM GMT This is not a red stop sign: For one thing,...
-
A Poem Is All You Need: Jailbreaking ChatGPT, Meta & More — LessWrong
Published on October 29, 2024 12:41 PM GMT This project report was created in September 2024 as...
-
Searching for phenomenal consciousness in LLMs: Perceptual reality monitoring and introspective confidence — LessWrong
Published on October 29, 2024 12:16 PM GMT To update our credence on whether or not LLMs...
-
AI #87: Staying in Character — LessWrong
Published on October 29, 2024 7:10 AM GMT The big news of the week was the release...
-
A path to human autonomy — LessWrong
Published on October 29, 2024 3:02 AM GMT "Each one of us, and also us as the...
-
D&D.Sci Coliseum: Arena of Data Evaluation and Ruleset — LessWrong
Published on October 29, 2024 1:21 AM GMT This is a follow-up to last week's D&D.Sci scenario:...
-
Gwern: Why So Few Matt Levines? — LessWrong
Published on October 29, 2024 1:07 AM GMT Matt Levine is the most well-known newslettrist (“Money Stuff”)...
-
Hiring a writer to co-author with me (Spencer Greenberg for ClearerThinking.org) — LessWrong
Published on October 27, 2024 5:34 PM GMT
-
Dario Amodei's "Machines of Loving Grace" sound incredibly dangerous, for Humans — LessWrong
Published on October 27, 2024 5:05 AM GMT What Dario lays out as a "best-case scenario" in...
-
Interview with Bill O’Rourke - Russian Corruption, Putin, Applied Ethics, and More — LessWrong
Published on October 27, 2024 5:11 PM GMT This is cross-posted from my blog and I interviewed...
-
On Shifgrethor — LessWrong
Published on October 27, 2024 3:30 PM GMT A small number of terms are elevated from the...
-
The hostile telepaths problem — LessWrong
Published on October 27, 2024 3:26 PM GMT Epistemic status: model-building based on observation, with a few...
-
What are some good ways to form opinions on controversial subjects in the current and upcoming era? — LessWrong
Published on October 27, 2024 2:33 PM GMT Take a random political issue with two sides A...
-
Video lectures on the learning-theoretic agenda — LessWrong
Published on October 27, 2024 12:01 PM GMT This is a YouTube playlist of recorded lectures on...
-
Electrostatic Airships? — LessWrong
Published on October 27, 2024 4:32 AM GMT Airships are pretty dang cool. Airplanes need a continuous...
-
A suite of Vision Sparse Autoencoders — LessWrong
Published on October 27, 2024 4:05 AM GMT CLIP-Scope? Inspired by Gemma-Scope. We trained 8 Sparse Autoencoders each...
-
Ways to think about alignment — LessWrong
Published on October 27, 2024 1:40 AM GMT I’m listing some “ways to think about alignment”. I’m...
-
Is there a CFAR handbook audio option? — LessWrong
Published on October 26, 2024 5:08 PM GMT I've gotten spoiled by AI readings, and curious if...
-
A superficially plausible promising alternate Earth without lockstep — LessWrong
Published on October 26, 2024 4:04 PM GMT [ Context re dath ilan: - [Keltham reflects on the...