OpenAI's o3 Model Beats Master-Level Geoguessr Player 32

Posted by BeauHD on Tuesday April 29, 2025 @06:10PM from the not-too-shabby dept.

In a blog post yesterday, Master I-ranked human GeoGuessr player Sam Patterson said that OpenAI's o3 model outscored him in a head-to-head match, "correctly identifying all five countries and twice landing within a few hundred meters." Geoguessing is a game -- most popularly known through the platform GeoGuessr -- where players are dropped into a random location in Google Street View and must figure out where in the world they are using only visual clues from the environment. With the release of its newest AI models, o3 and o4-mini, OpenAI now does a surprisingly good job of analyzing uploaded images to determine their locations using nothing but subtle visual clues.

"Even when I embedded fake GPS coordinates in the image EXIF, the model ignored the spoof and still pinpointed the real locations, showing its performance comes from visual reasoning and on-the-fly web sleuthing -- not hidden metadata," says Patterson. From the post: I notice that it often does a lot of unnecessary and repetitive cropping, and will sometimes spend way too much time on something unimportant. A human is very good at knowing what matters, and o3 is less knowledgeable about what things it should focus on. It got distracted by advertising multiple times. However, most of what it says about things like signs and road lines appears to be accurate, or at least close enough to truth that they meaningfully add up. Given the end result of these excellent guesses, it seems to arrive at the guesses from that information.

If it's using other information to arrive at the guess, then it's not metadata from the files, but instead web search. It seems likely that in the Austria round, the web search was meaningful, since it mentioned the website named the town itself. It appeared less meaningful in the Ireland round. It was still very capable in the rounds without search.

So to put a bow on this:
- The o3 model isn't smoke and mirrors, tricking us by only using EXIF data. It's at a comparable Geoguessr skill level to Master I or better players now (at least according to my own ~20 or so rounds of testing).
- Humans still hold a big edge in decision time -- most of my guesses were 4 min.
- Spoofing EXIF data doesn't throw off the model.

Whether you view this as dystopian or as a technological marvel -- or both -- you can't claim it's a parlor trick.

OpenAI's o3 Model Beats Master-Level Geoguessr Player

Post Load All Comments

Search 32 Comments Log In/Create an Account

Comments Filter:

WTF (Score:1)

by Valgrus Thunderaxe ( 8769977 ) writes:

is a "Geoguessr" player? Do you play "Geoguessr". I bet you don't and you don't give a shit about it, at all.
- Re: (Score:3, Funny)
  
  by Anonymous Coward writes:
  
  You can think of it like Slashdot, where you are dropped into a random summary and are expected to figure out what the fuck it concerns.
  - Re: (Score:3)
    
    by martin-boundary ( 547041 ) writes:
    
    Wait, I know this one!
    "DUPE!!!"
- Re: (Score:2)
  
  by GrumpySteen ( 1250194 ) writes:
  
  I play it occasionally. It's kind of fun and you get to see and explore places you'll never visit IRL.
- Re: (Score:2)
  
  by ET3D ( 1169851 ) writes:
  
  You know, what a GeoGuessr player is was explained in the excerpt, so, you know, RTFM.
  There are probably more Geoguesser players than Slashdot readers. :)
  - Re: (Score:2)
    
    by blue trane ( 110704 ) writes:
    
    Are you saying that throttling posts and adding advertising has made the site lose market share since back in the old days when slashdotting was a thing?
  - Re: WTF (Score:2)
    
    by ThurstonMoore ( 605470 ) writes:
    
    They should have explained it at the beginning.
- Re: WTF (Score:2)
  
  by ThurstonMoore ( 605470 ) writes:
  
  It took me a bit to figure it out too. I thought they were talking about a chess program.
- Re: (Score:2)
  
  by dunkelfalke ( 91624 ) writes:
  
  I have played it many years ago, was actually pretty good at it. But didn't know it was called "geoguessr".
- Re: Comment Subject: (Score:2)
  
  by grahamsz ( 150076 ) writes:
  
  I've played with it a little and it's amazing. I submitted a photo when I was charging my car with no real words or signs. It clocked a Colorado license plate, a Wyoming plate, identified the charging station, then identified the red brick and blue awnings typical of a mid 2000s front range Walmart. It guessed I was one town over, but it's an honest mistake and I don't think I could have told the difference
Alt headline (Score:4, Insightful)

by cascadingstylesheet ( 140919 ) writes: on Tuesday April 29, 2025 @07:09PM (#65341245) Journal

"Computers better than people at remembering and sifting through gigantic amounts of data"

Reply to This Share
Flag as Inappropriate
- Re: (Score:2)
  
  by ToasterMonkey ( 467067 ) writes:
  
  "Computers better than people at remembering and sifting through gigantic amounts of data"
  You just took a _very_ hard computing problem, beating humans at geoguessr, using just image classification and a reasoning LLM, and dismissed it as ... computers fast.
  So every advancement in computing could similarly be dismissed as computer fast, it's expected.
  You're saying AI is innately superior to humans.
Yay~ (Score:1)

by Inyu ( 919458 ) writes:

Another point against AI-naysayers.
Oh good (Score:3)

by commodore73 ( 967172 ) writes: on Wednesday April 30, 2025 @01:19AM (#65341699)

It can beat us at games. Huge value to society.

Reply to This Share
Flag as Inappropriate
- Re: (Score:2)
  
  by AmiMoJo ( 196126 ) writes:
  
  Geoguesser is a game where you are shown a photo and have to figure out where it was taken just by looking at it. Things like the type of road signs, the landscape, country specific building regulations and so forth are used, as well as more obvious stuff like the language of any text that is visible.
  The fact that AI is good at this has some practical uses. Law enforcement will probably be interested in the ability to locate any random photo taken outdoors. Consumers may like to have that feature to locate
  - Re: (Score:2)
    
    by commodore73 ( 967172 ) writes:
    
    Sounds more like something like ML/pattern recognition than what most people would call AI. But AI is not my field of expertise and never will be. I feel that we will always refer to AI as anything we haven't yet programmed, and the next day we'll realize it's just data and algorithms.
  - Re: (Score:2)
    
    by commodore73 ( 967172 ) writes:
    
    Sorry for a second comment; I've settled down since reading yesterday's news and haven't started reading the news yet today. Thanks for your response. I'm processing so much information these days that I barely have time to read the headlines, and so I neglected these details.
    
    I don't know the resource requirements to achieve this result or speculated future results. I still question the value to society relative to the social and environmental impact of "AI" in general. Who will have access to which of t
- Re: (Score:2)
  
  by thegarbz ( 1787294 ) writes:
  
  Performing image analysis and coming to correct conclusions based on the content of an image is indeed a huge value of society, especially when it outperforms people.
  Today: we're talking about someone determining a location on the world better than a human player.
  Tomorrow: we're talking about a computer determining whether a shadow on a CT scan is a cancer better than a human player.
  Calling this "games" rather than fundamentally the principle of the game is a very low-IQ approach to this story.
  - Re: (Score:2)
    
    by commodore73 ( 967172 ) writes:
    
    > very low-IQ approach
    
    I am not sure how IQ relates. I have never measured my IQ and don't concern myself with your assessment of mine. IQ is simply one measure of intelligence. There are many other forms that we do not understand and for which we have no ability to measure. The human mind is a bit more complex than you seem to perceive, which may be one reason that you are a proponent of AI. I also believe that, in general, human beings are more valuable than machines.
    
    I am sharing my perspective, w
Nobody fucking cares (Score:2)

by paul_engr ( 6280294 ) writes:

Nfc
But LLMs only regurgitate (Score:1)

by CalgaryD ( 9235067 ) writes:

But LLMs only regurgitate what was put in them, how is it possible? Or do they?
Shh!! (Score:1)

by Kekke ( 236130 ) writes:

The Robot Lord actually knows where you live.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

OpenAI's o3 Model Beats Master-Level Geoguessr Player 32

OpenAI's o3 Model Beats Master-Level Geoguessr Player More | Reply Login

OpenAI's o3 Model Beats Master-Level Geoguessr Player

WTF (Score:1)

Re: (Score:3, Funny)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: WTF (Score:2)

Re: WTF (Score:2)

Re: (Score:2)

Re: Comment Subject: (Score:2)

Alt headline (Score:4, Insightful)

Re: (Score:2)

Yay~ (Score:1)

Oh good (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Nobody fucking cares (Score:2)

But LLMs only regurgitate (Score:1)

Shh!! (Score:1)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot