My 50 cents on the 5000-series
My 50 cents on the 5000-series
Posted in r/StableDiffusion by u/VirusCharacter • 43 points and 50 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/VirusCharacter on 2025-01-08 15:56:53+00:00.
Nvidia has done it again! They are producing comparisons that greatly comfuse users and I'm not the only one noticing this. For example I found this text on their blog:
"With a GeForce RTX 4090 with FP16, the FLUX.1 [dev] model can generate images in 15 seconds with 30 steps. With a GeForce RTX 5090 with FP4, images can be generated in just over five seconds."
Comparing apples to oranges much? 🤨
Nvidia have added hardware acceleration for FP4 on their 5000-series of cards. They have done so to be able to fit bigger AI-models into less VRAM. Comparing hardware accelerated FP4 FLUX.1[Dev] generation on a 5090 with a normal FP16 FLUX.1[Dev] generation with a 4090 is rather misleading though. FP4 is not directly comparable with FP16. Sure FP4 can be good too, but this claim is like dangling a candy cane in front of hobby-users wanting to generate Anime characters on their VRAM-limited 4070 or 4080 GPU's. This claim stings in the eyes of everyone with a 3090 or a 4090 whos only upgrade path has now become a $2000 GPU 😓
Because of VRAM limitations in the 4070 and 4080, there is absolutely no reason whatsoever for anyone with a 4090 card to look any other way than to a 5090 and that is really sneaky move of Nvidia. Those of us on 3090 may hopefully be able to get some cheap second hand 4090's though.
I do understand why Nvidia have created the cards they have with 12, 16 and 32GB VRAM, but it's not very nice of them. Other than for gamers who will probably have an interesting end of January fighting over the 5000-series leftovers after the miners, scalpers and businesses have cleaned the house.
Below I have put together a little table showing all Nvidia cards from 3060 up to 5090 and I have chosen to sort it by FP32 Compute which is one way to show the "actual performance" of the card without all the fancy bells and whistles like DLSS and hardware accelerated FP4. Funny enough 4090 comes second best in this list, so if you have one. Don't be so quick to upgrade. If you're a gamer... Don't look at this list. You need to find one of Nvidias more gamer-directed lists including different DLSS-versions, different technologies and FPS.
For us who work with AI this is probably the best list we can get right now...