I'm afraid you do not understand what a large language model is.
Given that very obvious fact, were I you, I'd discard every opinion you have on the matter until you've rectified that gap.
Something like a book is not "stored" in an LLM.
It is torn into a billion sentence fragments, and the model's weights are adjusted toward accurately predicting how each fragment continues, based on a roughly 1000-dimensional embedding of that fragment's tokens.
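To make that concrete, here's a minimal sketch of a next-token-prediction training step in PyTorch. The vocabulary size, embedding width, and data are made up for illustration, and a real model conditions on the whole preceding context via attention rather than on one token's embedding as this toy does:

```python
import torch
import torch.nn as nn

# Toy sizes: real models use ~50k+ token vocabularies and embeddings
# of roughly 1000+ dimensions; these numbers are illustrative only.
vocab_size, embed_dim = 100, 32

model = nn.Sequential(
    nn.Embedding(vocab_size, embed_dim),  # token -> dense vector
    nn.Linear(embed_dim, vocab_size),     # vector -> scores for the next token
)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# A "sentence fragment": each token's target is simply the token after it.
tokens = torch.randint(0, vocab_size, (1, 16))
inputs, targets = tokens[:, :-1], tokens[:, 1:]

logits = model(inputs)  # (batch, seq, vocab) scores for the next token
loss = nn.functional.cross_entropy(
    logits.reshape(-1, vocab_size), targets.reshape(-1)
)
loss.backward()   # gradients nudge the weights toward better predictions
optimizer.step()  # this, repeated over billions of fragments, is training
```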
The goal is, in fact, to "memorize" as little as possible during training. If you're memorizing, you're not generalizing; you're wasting those 1000-dimensional vectors. After all, as you said, such data is trivial to store and recall accurately if that's your goal.
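For a rough sense of scale (the figures below are ballpark assumptions, not measurements): storing a book verbatim costs almost nothing next to a model's weights, which is exactly why spending weights on verbatim copies would be a waste of capacity:

```python
# Ballpark figures only: ~77k words for a typical novel, ~6 bytes per
# word of plain text, and a 7B-parameter model stored in 16-bit floats.
book_words = 77_000
book_bytes = book_words * 6              # ~0.46 MB of plain text
model_bytes = 7_000_000_000 * 2          # ~14 GB of weights

print(f"book:  {book_bytes / 1e6:.2f} MB")
print(f"model: {model_bytes / 1e9:.1f} GB")
print(f"ratio: ~{model_bytes // book_bytes:,}x")
# Lossless storage and recall of text is trivial; that's what a file is.
# Training is pointed at something else entirely.
```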
The actual goal of the training is that, in learning to predict how those sentences finish, the model learns the semantic associations between words. I.e., it learns to "understand" them: what they mean when they're used the way they're being used.
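As an illustration of what those "semantic associations" look like in practice, here's a sketch comparing word vectors by cosine similarity. The vectors are invented for the example (real ones are ~1000-dimensional and learned, not hand-written), but in a trained model, words used in similar contexts really do end up with nearby embeddings:

```python
import numpy as np

# Invented 4-dimensional "embeddings", chosen so the related pair
# lands closer together; purely illustrative.
embeddings = {
    "wizard":  np.array([0.9, 0.1, 0.3, 0.0]),
    "witch":   np.array([0.8, 0.2, 0.4, 0.1]),
    "toaster": np.array([0.0, 0.9, 0.0, 0.8]),
}

def cosine(a, b):
    # 1.0 = same direction (similar usage), ~0.0 = unrelated
    return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

print(cosine(embeddings["wizard"], embeddings["witch"]))    # high: similar contexts
print(cosine(embeddings["wizard"], embeddings["toaster"]))  # low: different contexts
```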
Could you recall 42% of the first book of Harry Potter?