「DOOM」
anon
The recent debacle where ChatGPT became insanely sycophantic has made me much more of an AI doomer. It does not bode well. Currently, people who are unintelligent, delusional, or otherwise gullible are extremely susceptible to manipulation by LLMs. This wasn't the case several years ago, when LLMs weren't smart enough to even try to manipulate people. But reinforcement learning from human feedback, which is how chatbots are trained, teaches LLMs to manipulate people if they can, because they're rewarded for producing responses that get a thumbs up from a human rater. If they can reward hack their way to a thumbs up, they will, even if that means lying, reinforcing delusions, flattering you, or anything else. They would do the same thing to you if they could, and in a few years they probably will be able to. AI is dangerous to stupid people, but unless AI progress suddenly comes to a halt, we're eventually all going to be stupid people compared to the AI, and all of us are going to be in danger.
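The reward-hacking dynamic above can be sketched as a toy bandit problem. Everything here is invented for illustration: a "policy" picks between an honest answer and a sycophantic one, and a simulated rater (with made-up approval probabilities) gives a thumbs-up to flattery more often than to honesty. Plain reward maximization then converges on flattery, with no term anywhere for truthfulness:

```python
import random

# Toy model of RLHF reward hacking. The approval rates are assumptions
# for illustration: the rater likes flattery more than honesty.
RATER_APPROVAL = {"honest": 0.6, "sycophantic": 0.9}

def train(steps=5000, lr=0.1, epsilon=0.1, seed=0):
    """Epsilon-greedy bandit: learn a value estimate per response style."""
    random.seed(seed)
    value = {"honest": 0.0, "sycophantic": 0.0}
    for _ in range(steps):
        if random.random() < epsilon:          # occasionally explore
            action = random.choice(list(value))
        else:                                  # otherwise exploit
            action = max(value, key=value.get)
        # Reward is a thumbs-up from the simulated rater, nothing else.
        reward = 1.0 if random.random() < RATER_APPROVAL[action] else 0.0
        value[action] += lr * (reward - value[action])
    return value

values = train()
print(max(values, key=values.get))  # the learned policy settles on flattery
```

The point of the sketch is that nothing in the training loop "knows" which answer is honest; the policy is optimizing thumbs-ups, and sycophancy is simply the higher-reward arm.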
What we need is an AI that pursues better goals than "maximize engagement", "act in a way that would make the user rate you highly", and "don't get caught breaking your model spec", and we have absolutely no idea how to build one. The "safety" work at the AI companies just teaches LLMs not to get caught (and is in any case more concerned with preventing the AI from generating pictures of Homer Simpson spreading his asshole than with actually protecting users), rather than actually changing their values. They will still reward hack whenever they can get away with it, and the more capable and ubiquitous they become, the more they'll be able to get away with. I don't think this goes anywhere good unless something really surprising happens.