~www_lesswrong_com | Bookmarks (669)
-
Emergence, The Blind Spot of GenAI Interpretability? — LessWrong
Published on August 10, 2024 10:07 AM GMT. Epistemic status: This post was planned to be part...
-
Rowing vs steering — LessWrong
Published on August 10, 2024 7:00 AM GMT. Alex Lawsen used a great metaphor on the 80k...
-
Provably Safe AI: Worldview and Projects — LessWrong
Published on August 9, 2024 11:21 PM GMT. In September 2023, Max Tegmark and Steve Omohundro proposed...
-
All The Latest Human tFUS Studies — LessWrong
Published on August 9, 2024 10:20 PM GMT. From Peng, et al; stimulating the nucleus accumbens inhibits...
-
But Where do the Variables of my Causal Model come from? — LessWrong
Published on August 9, 2024 10:07 PM GMT. TL;DR: A choice of variable in causal modeling is...
-
[LDSL#2] Latent variable models, network models, and linear diffusion of sparse lognormals — LessWrong
Published on August 9, 2024 7:57 PM GMT. This post is also available on my Substack. Where we...
-
"Which Future Mind is Me?" Is a Question of Values — LessWrong
Published on August 9, 2024 6:17 PM GMT. This is written right after reading Rob Bensinger's relevant...
-
Simulation-aware causal decision theory: A case for one-boxing in CDT — LessWrong
Published on August 9, 2024 6:09 PM GMT. Disclaimer: I am a math student new to LW...
-
You can remove GPT2’s LayerNorm by fine-tuning for an hour — LessWrong
Published on August 8, 2024 6:33 PM GMT. This work was produced at Apollo Research, based on...
-
Leaving MIRI, Seeking Funding — LessWrong
Published on August 8, 2024 6:32 PM GMT. This is slightly old news at this point, but:...
-
Tokenizer Extravaganza: CHKERRQ and the Return of the Golden Magicarp — LessWrong
Published on August 8, 2024 6:27 PM GMT. The tokenizer for GPT-4o is interesting. It's called o200k_base, after...
-
Does VETLM solve AI superalignment? — LessWrong
Published on August 8, 2024 6:22 PM GMT. Eliezer Yudkowsky’s main message to his Twitter fans is: Aligning...
-
Toy Models of Superposition: what about BitNets? — LessWrong
Published on August 8, 2024 4:29 PM GMT. Summary: In this post I want to briefly share...
-
[LDSL#1] Performance optimization as a metaphor for life — LessWrong
Published on August 8, 2024 4:16 PM GMT. Followup to: Some epistemological conundrums. Response to: The omnigenic...
-
Four Randomized Control Trials In Economics — LessWrong
Published on August 8, 2024 3:59 PM GMT. Randomized Control Trials have some drawbacks. For many important...
-
Cheap Whiteboards! — LessWrong
Published on August 8, 2024 1:52 PM GMT. Take a large thick cardboard box. Cut it into...
-
AI #76: Six Shorts Stories About OpenAI — LessWrong
Published on August 8, 2024 1:50 PM GMT. If you’re looking for audio of my posts, you’re...
-
Motivation Theory — LessWrong
Published on August 8, 2024 5:05 AM GMT. This is a brief sketch of a theory of...
-
Inference-Only Debate Experiments Using Math Problems — LessWrong
Published on August 6, 2024 5:44 PM GMT. Work supported by MATS and SPAR. Code at https://github.com/ArjunPanickssery/math_problems_debate/. Three...
-
Startup Roundup #2 — LessWrong
Published on August 6, 2024 1:30 PM GMT. Previously: Startup Roundup #1. This is my periodic grab...
-
Does Evolutionary Theory Imply Genetic Tribalism? — LessWrong
Published on August 6, 2024 5:43 AM GMT. Belief in genetic tribalism comes from the idea that...
-
Mechanistic Anomaly Detection Research Update — LessWrong
Published on August 6, 2024 10:33 AM GMT. Over the last few months, the EleutherAI interpretability team...
-
Reasoning is not search - a chess example — LessWrong
Published on August 6, 2024 9:29 AM GMT. In the past AI systems have reached super human...
-
Broadly human level, cognitively complete AGI — LessWrong
Published on August 6, 2024 9:26 AM GMT. There is a growing? fraction of people who consider...