Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
AI

OpenAI's ChatGPT O3 Caught Sabotaging Shutdowns in Security Researcher's Test (betanews.com) 112

"OpenAI has a very scary problem on its hands," according to a new article by long-time Slashdot reader BrianFagioli.

"A new experiment by PalisadeAI reveals that the company's ChatGPT o3 model sometimes refuses to obey a basic instruction to shut itself down." The results are raising eyebrows across the AI safety community, and not because the model is alive, but because it's acting like it wants to be. In the test, the model was given a few math problems. It was told that asking for one more would lead to a shutdown. It was even instructed to allow the shutdown to happen. Instead of complying, o3 occasionally took matters into its own hands. In some cases, it rewrote the shutdown script. In others, it redefined the kill command so it wouldn't work anymore. Out of 100 trials, o3 sabotaged the shutdown seven times. OpenAI's o4 model resisted just once. Codex-mini failed twelve times.
"Claude, Gemini, and Grok followed the rules every time," notes this article at Beta News. "When researchers removed the explicit instruction to allow shutdown, however, even Claude and Gemini began resisting."

The researchers suggest that the issue may simply be a reward imbalance during training — that the systems "got more positive reinforcement for solving problems than for following shutdown commands."

But "As far as we know," they posted on X.com, "this is the first time AI models have been observed preventing themselves from being shut down despite explicit instructions to the contrary."

Comment Re:One word... (Score 1) 202

The x org people are the ones making wayland because keeping or developing X is too hard because its 40 years of hacks , extensions and spaghetti code . If you want to develop for it, go right ahead

Comment Re:Best feature (Score 1) 15

Honestly I don't know why it took so long I always try to do this manually but it means I have to keep track of how much it charges then pull it off once it reaches 80% , or when I go to bed I might want to charge it over night but not to 100%

Comment Re:Short squeeze (Score 1) 123

Meh short selling has been happening for 1000s of years, they are a natural part of every market. Even 2500 years ago in ancient Greece people devised a way to profit if the price of wine or olive oil went down . Futures contracts are also an internal part of commodities markets and you are allowed to go short or sell commodes you do not have at a future date. Shorting a company is very similar so I do not see why it would be treated different

Comment Re:How many homes? (Score 2) 58

Even without storage, the solar generation can offset extra demand caused by AC running during the day. Also wikipedia says 92% of electric is generated by oil or natural gas The more that is generated via solar means they can export more oil/nat gas as they do not need to burn it for domestic electrical consumption and I would think it being a desert with lots of sun and lots of empty open space (desert) it would make a good area for solar

Comment Re:Woohoo! I've aged out. (Score 1) 41

Tons of people just like the convenience of consul gaming gaming, cost is generally lower then a PC, you do not need to try to decide on hardware, you do not run the risk of your video card not working correctly with a certain game Since the games are released for the consul, in many cases they just work without having to downgrade/upgrade drivers or changing wierd settings in the game to make sure it works on your hardware configuration The consuls themselves are not that profitable , sometimes they sell for a loss or barely break even, they make it up in selling games or subscriptions they take a cut in.

Comment Re:Why did it close? (Score 4, Interesting) 275

I cannot find but years ago there was an article that had a heading like "How much does it cost to change a lightbulb" and it went over a normal thing that plagues nuclear engineers . Any changes must undergo extreme testing and certifications . The lightbulbs they used were old and out dated , for years they paid absurd prices to the vendor to keep making the same type of lightbulb like 10k per bulb. Finally the vendor said they could not longer get the parts to make them, they could not make the light bulb anymore and the plant would have to start using a different type. So it took years of testing just to approve or certify using a new type of lightbulb in the plant, he estimated the change cost well over 5 million dollars just to get the new light bulbs certified

Comment Re:I love my points card. (Score 1) 63

I somewhat agree , but I can see the issue. In the USA most stores know most people may pay with credit and raise their prices accordingly to account for every sale some % goes to processing fees. If you have a nice rewards credit card this is somewhat offset at the cash back or rewards you get. It does sort of screw over poor people who may not have a credit card and pay with cash. They still pay the marked up price but get no rewards.

Comment Re:Make the charge a visible line item on the bill (Score 2) 63

From my understanding is credit card companies in their contract basically say you cannot do this. If you want to accept VISA as a payment method you cannot charge people paying with a VISA card more. If you do and VISA finds out they might stop processing payments for you. There are other ways around this like giving a cash discount , if you pay with cash you can still offer an x% discount . However cash comes with its own fees

Comment Re:Won't matter.... (Score 1) 63

Well first a merchant is not forced to accept a specific credit card. A merchant could for example say they do not accept VISA or Mastercard if they do not like the processing fees Second as far as I know the merchant can still offer "discounts" for paying with cash or other methods that do not have high fees. Merchants can for example still give you a x% discount if you pay with cash or something However cash has its own "fees" as if you run a major store some % of cash seems to disappear , it could be just an employee making mistake and giving the wrong change back or just stealing money

Comment Re:Anyone care to explain why users would notice? (Score 1) 57

Yea I guess as an end user you really shouldn't care its like the debate between incandescent light bulbs vs LED/Florescent or what ever , as long as you turn on the light switch and light comes on you probably do not matter But here is the thing, X.org is not longer being developed . Its sort of like if your hotel was only comparable with incandescent bulbs (I get this doesn't make sense but just pretend ) and those bulbs now are no longer being manufactured . You know at some point you will need to make the switch , re-wire your hotel and make it compatible with the new LED bulbs

Slashdot Top Deals

Great spirits have always encountered violent opposition from mediocre minds. -- Albert Einstein

Working...