Amd's biggest problem is getting people to use their software so they can get a foot in the door and take over from cuda.
This is indeed AMD's biggest problem.
However the main problem with AMD is... AMD.
AMD desparately want to replace CUDA so people can buy an AMD GPU instead of an NVidia one and run their existing code on AMD GPUs. What they really care about is the expensive, high margin datacentre or pro GPUs.
This is deeply stupid for a number of reasons.
First let's accept the obvious truth that currently ML is the lion's share of the market. Sure there's a long tail of other stuff out there, but really to be available for a substantial fraction
Here's how you do the common thing on an NVidia GPU.
1. Get a GPU. Really doesn't matter how. But a 1650 second hand from CEX. Use the 40whatever in your laptop. Hire an H200 on a cloud machine. Get a Jetson. Literally anything in between. Doesn't matter.
2. pip install torch
3. go to town
Here's how you do it on AMD:
1. Get a GPU.
2. pip ins... wait docker what?
3. Find it doesn't work.
4. Check compatibility lists.
5. Find only GPUs way out of your price range for a quick experiment will work. Oh also fuck you if you have a laptop with an AMD GPU or even better an AI chip? Want to do AI on an AI chip? Fuck you.
6. Find maybe your chip might work because it's the same microarchitecture stupid code name
7. Fuck with docker some more because you hate yourself
8. Discover that your chip was supported, but suffered from the turbo speed deprecation cycle
9. Try an older version.
10. Fuck with docker some more
11. Cry.
12. Yeet your AMD gpu into the sun
13. Get an NVidia GPU
All that shit makes perfect sense from AMD's perspective. Why bother targeting gaming GPUs? That's not where the big money is. And since you're only doing pro ones, there's no harm in pro level difficulty, and limited hardware support. Why not just sell nothing but heaps of high margin MIbullshit to big people. They've got the resources to make the most of them.
What they can't seem to grasp is NVidia provide the low effort on ramp to their shit. You can go from a notion to an idea to a thought to a hack to a slightly bigger hack, to a project, to a business, to a large business easily. Or alternatively people flow out of universities into companies taking new found ML education with them which is all on NVidia because that's accessible to them and they use it because it's no risk to use what you know.
AMD are purposely missing the big base which feeds the upper levels. Plus millions of small projects with millions of hackers will hammer on their ecosystem and reveal all the annoying bugs and tough cases which will help it also not be a bit of a buggy mess (pytorch I love you but you suck).
It's crazy: AMD are demanding people jump through inane hoops to use their stuff from a position of being completely behind in the market.