~www_lesswrong_com | Bookmarks (657)
-
Just because an LLM said it doesn't mean it's true: an illustrative example — LessWrong
Published on August 21, 2024 9:05 PM GMTThis was originally posted in the comments of You...
-
AI Safety Newsletter #40: California AI Legislation Plus, NVIDIA Delays Chip Production, and Do AI Safety Benchmarks Actually Measure Safety? — LessWrong
Published on August 21, 2024 6:09 PM GMTWelcome to the AI Safety Newsletter by the Center...
-
Should LW suggest standard metaprompts? — LessWrong
Published on August 21, 2024 4:41 PM GMTBased on low-quality articles that seem to be coming...
-
Please do not use AI to write for you — LessWrong
Published on August 21, 2024 9:53 AM GMTI've recently seen several articles here that were clearly...
-
Apply to Aether - Independent LLM Agent Safety Research Group — LessWrong
Published on August 21, 2024 9:47 AM GMTThe basic ideaAether will be a small group of...
-
the Giga Press was a mistake — LessWrong
Published on August 21, 2024 4:51 AM GMTthe giga press Tesla decided to use large aluminum...
-
What is the point of 2v2 debates? — LessWrong
Published on August 20, 2024 9:59 PM GMTFor instance, I am thinking about the munk debates...
-
Where should I look for information on gut health? — LessWrong
Published on August 20, 2024 7:44 PM GMTI've been on a gut health kick, reading Brain...
-
Would you benefit from, or object to, a page with LW users' reacts? — LessWrong
Published on August 20, 2024 4:35 PM GMTThere is currently an admin-only page that shows a...
-
AGI Safety and Alignment at Google DeepMind: A Summary of Recent Work — LessWrong
Published on August 20, 2024 4:22 PM GMTWe wanted to share a recap of our recent...
-
Trying to be rational for the wrong reasons — LessWrong
Published on August 20, 2024 4:18 PM GMTRationalists are people who have an irrational preference for...
-
Vilnius – ACX Meetups Everywhere Fall 2024 — LessWrong
Published on August 19, 2024 5:38 PM GMTHey folks, We're organizing an ACX meetup in Vilnius...
-
A primer on why computational predictive toxicology is hard — LessWrong
Published on August 19, 2024 5:16 PM GMTIntroductionThere are now (claimed) foundation models for protein sequences,...
-
Can Current LLMs be Trusted To Produce Paperclips Safely? — LessWrong
Published on August 19, 2024 5:17 PM GMTThere's a browser-based game about paperclip maximization: Universal Paperclips,...
-
Trustworthy and untrustworthy models — LessWrong
Published on August 19, 2024 4:27 PM GMTThere are a lot of points in AI control...
-
Apartment Price Map Discontinuity — LessWrong
Published on August 19, 2024 3:30 PM GMT I maintain a Boston apartment price map, scraping...
-
Will we ever run out of new jobs? — LessWrong
Published on August 19, 2024 3:04 PM GMTA lot of the debate on long-term, structural technological...
-
What are the best resources for building gears-level models of how governments actually work? — LessWrong
Published on August 19, 2024 2:05 PM GMTOne big hole in my set of frames and...
-
[Cross-post] Book Review: Bureaucracy, by James Q Wilson — LessWrong
Published on August 19, 2024 1:57 PM GMT[Cross-posted from my substack, davekasten.substack.com. {I never said that...
-
If AI is in a bubble and the bubble bursts, what would you do? — LessWrong
Published on August 19, 2024 10:56 AM GMT"The AI bubble is reaching a tipping point", says...
-
Thinking About What Are Propensity Evaluations [WIP] — LessWrong
Published on August 19, 2024 9:23 AM GMTWarning: This post was written at the start of...
-
Scaling Laws and Likely Limits to AI — LessWrong
Published on August 18, 2024 5:19 PM GMTDiscuss
-
What is "True Love"? — LessWrong
Published on August 18, 2024 4:05 PM GMTMeta: I recently entered the dating market, so naturally...
-
Quick look: applications of chaos theory — LessWrong
Published on August 18, 2024 3:00 PM GMTIntroductionRecently we (Elizabeth Van Nostrand and Alex Altair) started...