~www_lesswrong_com | Bookmarks (669)
-
LLMs Look Increasingly Like General Reasoners — LessWrong
Published on November 8, 2024 11:47 PM GMTSummaryFour months after my post 'LLM Generality is a...
-
overengineered air filter shelving — LessWrong
Published on November 8, 2024 10:04 PM GMTLet's consider air purifier design a bit. preface This...
-
Bigger Livers? — LessWrong
Published on November 8, 2024 9:50 PM GMTMy husband, Andrew Rettek, has a blog you should...
-
New UChicago Rationality Group — LessWrong
Published on November 8, 2024 9:20 PM GMTHey y'all! I just started a rationality group on the...
-
Fundamental Uncertainty: Chapter 9 - How do we live with uncertainty? — LessWrong
Published on November 7, 2024 6:15 PM GMTN.B. This is a chapter in a planned book...
-
AI #89: Trump Card — LessWrong
Published on November 7, 2024 4:30 PM GMTA lot happened in AI this week, but most...
-
Quantum Immortality: A Perspective if AI Doomers are Probably Right — LessWrong
Published on November 7, 2024 4:06 PM GMTEpistemic status: This text presents a thought experiment suggested...
-
Targeted Manipulation and Deception Emerge when Optimizing LLMs for User Feedback — LessWrong
Published on November 7, 2024 3:39 PM GMTProduced as part of MATS 6.0 and 6.1.Key takeaways:Training...
-
In the Name of All That Needs Saving — LessWrong
Published on November 7, 2024 3:26 PM GMTThere is some difference between despotic and cosmopolitan agents....
-
Agency overhang as a proxy for Sharp left turn — LessWrong
Published on November 7, 2024 12:14 PM GMTI've been accepted as a mentor for the next...
-
What are the primary drivers that caused selection pressure for intelligence in humans? — LessWrong
Published on November 7, 2024 9:40 AM GMTI just read the wikipedia article on the evolution...
-
The Logistics of Distribution of Meaning — LessWrong
Published on November 7, 2024 5:27 AM GMTThis is an excerpt from the Introductions section to...
-
SAEs are highly dataset dependent: a case study on the refusal direction — LessWrong
Published on November 7, 2024 5:22 AM GMTThis is an interim report sharing preliminary results. We...
-
Should CA, TX, OK, and LA merge into a giant swing state, just for elections? — LessWrong
Published on November 6, 2024 11:01 PM GMTAs Americans know, the electoral college gives disproportionate influence...
-
Why Recursion Pharmaceuticals abandoned cell painting for brightfield imaging — LessWrong
Published on November 5, 2024 2:51 PM GMTNote: thank you to Brita Belli, senior communications manager...
-
Winning isn't enough — LessWrong
Published on November 5, 2024 11:37 AM GMTIn our jobs as AI safety researchers, we think...
-
Anthropic - The case for targeted regulation — LessWrong
Published on November 5, 2024 7:07 AM GMTThe first two sections are below: Increasingly powerful AI...
-
ML4Good (AI Safety Bootcamp) - Experience report — LessWrong
Published on November 5, 2024 1:18 AM GMTIntroductionThis is a short summary of my experience attending...
-
The Shallow Bench — LessWrong
Published on November 5, 2024 5:07 AM GMTCross posting from my personal blog: https://spiralprogress.com/2024/10/28/the-shallow-bench/Spoilers for "Project...
-
Catastrophic Cyber Capabilities Benchmark (3CB): Robustly Evaluating LLM Agent Cyber Offense Capabilities — LessWrong
Published on November 5, 2024 1:01 AM GMTThis blog was published by Jonathan Ng, Andrey Anurin,...
-
Could orcas be (trained to be) smarter than humans? — LessWrong
Published on November 4, 2024 11:29 PM GMT(Btw everything I write here about orcas also applies...
-
Metastatic Cancer Treatment Since 2010: The Success Stories — LessWrong
Published on November 4, 2024 10:50 PM GMTMidjourney, “metastatic cancer”Metastatic Cancer Is Usually DeadlyWhen my mom...
-
Bay Winter Solstice 2024: Speech Auditions — LessWrong
Published on November 4, 2024 10:31 PM GMTHello! I'm looking for community members to read speeches...
-
Empathy/Systemizing Quotient is a poor/biased model for the autism/sex link — LessWrong
Published on November 4, 2024 9:11 PM GMTThank you to Justis Millis for providing feedback and...