Comment Re:This is exactly what (Score 1) 27
This post was indeed brought to you by ChatGPT.
This post was indeed brought to you by ChatGPT.
Far from a MAGA voter, but there were exploding pagers not long ago...
If you spend time with the higher-tier (paid) reasoning models, you’ll see they already operate in ways that are effectively deductive (i.e., behaviorally indistinguishable) within the bounds of where they operate well. So not novel theorem proving. But give them scheduling constraints, warranty/return policies, travel planning, or system troubleshooting, and they’ll parse the conditions, decompose the problem, and run through intermediate steps until they land on the right conclusion. That’s not "just chained prediction". It’s structured reasoning that, in practice, outperforms what a lot of humans can do effectively.
When the domain is checkable (e.g., dates, constraints, algebraic rewrites, SAT-style logic), the outputs are effectively indistinguishable from human deduction. Outside those domains, yes it drifts into probabilistic inference or “reading between the lines.” But to dismiss it all as “not deduction at all” ignores how far beyond surface-level token prediction the good models already are. If you want to dismiss all that by saying “but it’s just prediction,” you’re basically saying deduction doesn’t count unless it’s done by a human. That’s just redefining words to try and win an Internet argument.
They do quite a bit more than that. There's a good bit of reasoning that comes into play and newer models (really beginning with o3 on the ChatGPT side) can do multi-step reasoning where it'll first determine what the user is actually seeking, then determine what it needs to provide that, then begin the process of response generation based on all of that.
This is not a surprise, just one more data point that LLMs fundamentally suck and cannot be trusted.
Huh? LLMs are not perfect and are not expert-level in every single thing ever. But that doesn't mean they suck. Nothing does everything. A great LLM can fail to produce a perfect original proof but still be excellent at helping people adjust the tone of their writing or understanding interactions with others or developing communication skills, developing coping skills, or learning new subjects quickly. I've used ChatGPT for everything from landscaping to plumbing successfully. Right now it's helping to guide my diet, tracking macros and suggesting strategies and recipes to remain on target.
LLMs are a tool with use cases where they work well and use cases where they don't. They actually have a very wide set of use cases. A hammer doesn't suck just because I can't use it to cut my grass. That's not a use case where it excels. But a hammer is a perfect tool for hammering nails into wood and it's pretty decent at putting holes in drywall. Let's not throw out LLMs just because they don't do everything everywhere perfectly at all times. They're a brand new novel tool that's suddenly been put into millions of peoples' hands. And it's been massively improved over the past few years to expand its usefulness. But it's still just a tool.
It's not well defined what he's defending against or what it looks like when an agent goes bad (or how to check their motivations, etc). This is a money grab, nothing more.
If/when true AGI is achieved, only a fool would announce it. What would announcing it do for you? Make you famous? Rich? Cool. Know what's better than all that?
Not telling a damn soul and using the AGI quietly to do whatever the Hell you want. If you want to be rich, the AGI will tell you how to become rich. If you want to be famous, the AGI will tell you how to become famous. You can do both. And you don't have to stop there. A real, vastly superior AGI enables the person controlling it to do anything. The second you tell people about it, you'll lose control over it and then you're the famous idiot who did a cool thing one time. Kids in elementary school will recite your name back on a test. And you could have had everything.
Anyone smart enough to crack AGI can't also be stupid enough to advertise when they do it.
Obvious trolls be obvious.
What a sad gaslighter to actually claim no tariffs on Russia isn't a thing.
Berulis alleged in the affidavit that there are attempted logins to NLRB systems from an IP address in Russia in the days after DOGE accessed the systems. He told Reuters Tuesday that the attempted logins apparently included correct username and password combinations but were rejected by location-related conditional access policies.
Berulis' affidavit said that an effort by him and his colleague to formally investigate and alert the Cybersecurity and Infrastructure Security Agency (CISA) was disrupted by higher-ups without explanation.
As he and his colleagues prepared to pass information they'd gathered to CISA he received a threatening note taped to the door of his home with photographs of him walking in his neighborhood taken via drone, Andrew Bakaj, Whistleblower Aid's chief legal counsel, said in his submission to Cotton and Warner.
"Unlike any other time previously, there is this fear to speak out because of reprisal," Berulis told Reuters. "We're seeing data that is traditionally safeguarded with the highest standards in the United States government being taken and the people that do try to stop it from happening, the people that are saying no, they're being removed one by one."
via NPR
The top Democrat on the House Oversight Committee is calling for an investigation into DOGE's access to the National Labor Relations Board following exclusive NPR reporting on sensitive data being removed from the agency.
Ranking Member Gerry Connolly, D-Va., sent a letter Tuesday to acting Inspector General at the Department of Labor Luiz Santos and Ruth Blevins, inspector general at the NLRB, expressing concern that DOGE "may be engaged in technological malfeasance and illegal activity."
"According to NPR and whistleblower disclosures obtained by Committee Democrats, individuals associated with DOGE have attempted to exfiltrate and alter data while also using high-level systems access to remove sensitive information—quite possibly including corporate secrets and details of union activities," Connolly wrote in a letter first shared with NPR. "I also understand that these individuals have attempted to conceal their activities, obstruct oversight, and shield themselves from accountability."
What he says and does isn't tolerable. It's pretty simple.
I read it as 4.5 didn't end up performing as well as hoped in the real world and 4.1 being a direct iteration of 4o rather than 4.5.
They're operating with parallel code lines because each is targeting different use cases.
You can get a lot more renewable energy for the money. Colorado tax payers are going to get fleeced by this.
The other issue not mentioned is speed. It takes so long to build nuclear that it can't be part of any realistic plan to address climate change, and it also makes it very prone to corruption because nothing gets delivered for decades.
These are all issues directly related to regulation and unnecessary red tape created out of NIMBYism and irrational fear around radiation. India, Canada, and China aren't stupid. They're building and/or modernizing nuclear power plants like crazy because they're so effective at reliable baseline power, which renewables simply are not. In the US, we force years - sometimes decades - of reviews and permits and defending court cases and other bullshit unrelated to the construction and operation of clean, safe nuclear power.
The other issue going to cost is that the US - again, stupidly - bars reuse of high energy spent fuel. If you simply separate the low energy (relatively safe, but useless for generating power) waste from the high energy fuel remaining and feed the high energy stuff back in, you can extract nearly all the energy, save a ton of fuel costs, mine less fuel, and have vastly less volume of waste and vastly less energetic waste.
Let's assume some sort of absolute mandate were passed to build 5 CANDU-6 (known, proven, safe, reliable design) reactors. No reviews, no permits, no red tape, no lawsuits. Just build the damn things now. You can get one operational in ~3.5 years, all of them in about 4ish years. South Korea and China have built PWRs in 5. Assuming we also lifted the ban on fuel reprocessing, CANDU-6 plants will produce power at a cost of 5-6 cents per kWh, yielding a retail price of 13-17 cents per kWh. US average is about 16.2 cents, California has rates pushing 50 cents. But we're too stupid to get out of our own way and just do it, so we'll keep strangling the poor and middle class economically.
If you teach your children to like computers and to know how to gamble then they'll always be interested in something and won't come to no real harm.