Bookmarks (704)

screenshot

How Democratic Is Effective Altruism — Really? — LessWrong

lesswrong.com

screenshot

1

screenshot

Will Programmer Compensation Decouple from Productivity? — LessWrong

lesswrong.com

screenshot

1

screenshot

Zstd Window Size — LessWrong

lesswrong.com

screenshot

1

screenshot

List of petitions against OpenAI's for-profit move — LessWrong

lesswrong.com

screenshot

1

screenshot

A review of "Why Did Environmentalism Become Partisan?" — LessWrong

lesswrong.com

screenshot

1

screenshot

LLM Pareto Frontier But Live — LessWrong

lesswrong.com

screenshot

1

screenshot

Modifying LLM Beliefs with Synthetic Document Finetuning — LessWrong

lesswrong.com

screenshot

1

screenshot

This prompt (sometimes) makes ChatGPT think about terrorist organisations — LessWrong

lesswrong.com

screenshot

1

screenshot

Token and Taboo — LessWrong

lesswrong.com

screenshot

1

screenshot

Trouble at Miningtown: Prologue — LessWrong

lesswrong.com

screenshot

1

screenshot

Putting up Bumpers — LessWrong

lesswrong.com

screenshot

1

screenshot

The AI Belief-Consistency Letter — LessWrong

lesswrong.com

screenshot

1

screenshot

Jaan Tallinn's 2024 Philanthropy Overview — LessWrong

lesswrong.com

screenshot

1

screenshot

Fish and Faces — LessWrong

lesswrong.com

screenshot

1

screenshot

Are we "being poisoned"? — LessWrong

lesswrong.com

screenshot

1

screenshot

To Understand History, Keep Former Population Distributions In Mind — LessWrong

lesswrong.com

screenshot

1

screenshot

Is alignment reducible to becoming more coherent? — LessWrong

lesswrong.com

screenshot

1

screenshot

The EU Is Asking for Feedback on Frontier AI Regulation (Open to Global Experts)—This Post Breaks Down What’s at Stake for AI Safety — LessWrong

lesswrong.com

screenshot

1

screenshot

Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games — LessWrong

lesswrong.com

screenshot

1

screenshot

Alignment from equivariance II - language equivariance as a way of figuring out what an AI "means" — LessWrong

lesswrong.com

screenshot

1

screenshot

There is no Red Line — LessWrong

lesswrong.com

screenshot

1

screenshot

Manifund 2025 Regrants — LessWrong

lesswrong.com

screenshot

1

screenshot

AISN#52: An Expert Virology Benchmark — LessWrong

lesswrong.com

screenshot

1

screenshot

Problems with Bayesianism: A Socratic Dialogue — LessWrong

lesswrong.com

screenshot

1

screenshot

Societal and technological progress as sewing an ever-growing, ever-changing, patchy, and polychrome quilt — LessWrong

lesswrong.com

screenshot

1

screenshot

You Better Mechanize — LessWrong

lesswrong.com

screenshot

1

screenshot

Experimental testing: can I treat myself as a random sample? — LessWrong

lesswrong.com

screenshot

1

screenshot

Family-line selection optimizer — LessWrong

lesswrong.com

screenshot

1

screenshot

Accountability Sinks — LessWrong

lesswrong.com

screenshot

1

screenshot

Most AI value will come from broad automation, not from R&D — LessWrong

lesswrong.com

screenshot

1

screenshot

Q2 AI Forecasting Benchmark: $30,000 in Prizes — LessWrong

lesswrong.com

Published on April 21, 2025 5:29 PM GMTDiscuss

screenshot

1

screenshot

Crime and Punishment #1 — LessWrong

lesswrong.com

screenshot

1

screenshot

Improving CNNs with Klein Networks: A Topological Approach to AI — LessWrong

lesswrong.com

screenshot

1

screenshot

Eulogy to the Obits — LessWrong

lesswrong.com

screenshot

1

screenshot

Not All Beliefs Are Created Equal: Diagnosing Toxic Ideologies — LessWrong

lesswrong.com

screenshot

1

screenshot

Research Notes: Running Claude 3.7, Gemini 2.5 Pro, and o3 on Pokémon Red — LessWrong

lesswrong.com

screenshot

1

screenshot

AI 2027 is a Bet Against Amdahl's Law — LessWrong

lesswrong.com

screenshot

1

screenshot

Severance and the Ethics of the Conscious Agents — LessWrong

lesswrong.com

screenshot

1

screenshot

March-April 2025 Progress in Guaranteed Safe AI — LessWrong

lesswrong.com

screenshot

1

screenshot

How to end credentialism — LessWrong

lesswrong.com

screenshot

1

screenshot

How Close We Are to a Complete List of Imprinted Genes — LessWrong

lesswrong.com

screenshot

1

screenshot

AI, Alignment & the Art of Relationship Design — LessWrong

lesswrong.com

screenshot

1

screenshot

Novel Idea Generation in LLMs: Judgment as Bottleneck — LessWrong

lesswrong.com

screenshot

1

screenshot

Why Should I Assume CCP AGI is Worse Than USG AGI? — LessWrong

lesswrong.com

screenshot

1

screenshot

An Introduction to SAEs and their Variants for Mech Interp — LessWrong

lesswrong.com

screenshot

1

screenshot

AI Advances and Detection Strategy — LessWrong

lesswrong.com

screenshot

1

screenshot

Emotional Theory for a Technical Manual on How Not to Freeze Completely — LessWrong

lesswrong.com

screenshot

1

screenshot

SecureDrop review — LessWrong

lesswrong.com

screenshot

1

screenshot

o3 Will Use Its Tools For You — LessWrong

lesswrong.com

screenshot

1

screenshot

AI Control Methods Literature Review — LessWrong

lesswrong.com

screenshot

1