“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI
Despite the fact that Nvidia is now almost the main beneficiary of the growing interest in AI, the head of the company, Jensen Huang, does not believe that
“In 10 years, computers will be doing this a million times faster.” The head of Nvidia does not believe that there is a need to invest trillions of dollars in the production of chips for AI::Despite the fact that Nvidia is now almost the main beneficiary of the growing interest in AI, the head of the company, Jensen Huang, does not believe that
Despite the fact that Nvidia is now almost the main beneficiary of the growing interest in AI, the head of the company, Jensen Huang, does not believe that additional trillions of dollars need to be invested in the industry.
*Because of
You heard it, guys. There's no need to create competition to Nvidia's chips. It's perfectly fine if all the profits go to Nvidia, says Nvidia's CEO.
This isn't necessarily about just hardware. Current ML architectures and inference engines are far from being at peak efficiency. Just last year we saw 20x speedups for llm inference on some hardware. "a million times" is obviously hyperpole though.
Honestly as someone who has watched the once-fanciful prefixes “giga” and “tera” enter common parlance, and saw kilobytes of RAM turn to gigabytes, it’s really hard for me to think what he’s saying is impossible.
Even if he is accurate, specialist hardware will outperform generic hardware at what it is specialized for.
I remember a story sometime in the 00s about PCs finally getting to the point where they were as fast as one of the WWII code breaking computers (or something like that). It wasn't because we backtracked in computer speeds after WWII, but because even that ancient hardware was able to get good performance when it was purpose-built, but it couldn't do anything else and likely would have required a lot of work to adjust to a different kind of cypher scheme, if it could be adapted at all.
So GP compute might be a million times faster in a decade, but specialist AI chips might be a million times faster than that.
A hardware neural net might be able to eliminate memory latency by giving each neuron fast resisters to handle all their memory needs. If it doesn't need to change connections, each connection could be hard wired. A GPU wouldn't have a chance at keeping up no matter how wide that memory bus gets or how many channels it gets split into. It might even use way less power (though with the elimination of memory latency, it could go fast enough to use way more, too).
Sorry I have doubts, because that would require a factor 4x increase every year for 10 years! 4x^10 = 1,048,576x
Considering they historically have had problems achieving just twice the speed per year, it does not seem likely.
Yes, but usually we keep those 2 kinds of optimizations separate, only combining chip design and production process. Because if the software is optimized, the hardware isn't really doing the same thing.
So yes AI speed may increase more than just the hardware, but for the most sophisticated systems, the tasks will be more complex, which may again slow the software down.
So I think they will never be able to achieve it even when considering software optimizations too. Just the latest Tesla cars boast about 4 times higher resolution cameras, that will require 4 times the processing power to process image recognition, which then will be more accurate, but relatively slower.
We are not where we want to be, and the systems of the future will clearly be more complex, and on the software are more likely to be slower than faster.
Why does that make a difference? Compute for AI is build on the progress for compute first for GPU then for data center. They are similar in nature.
Yes they have exceeded 2x for AI for a while, but that has been achieved through exploding die size and cost, but even that won't make a million times faster in 10 years possible, because they can't increase die sizes any further.
So a Cerebras wafer will be 10^6 faster for the same computation as now, for the same price, in just 10 years? Not after Moore scaling ended many years ago and neural hardware architecture has matured. You can sure go analog, but that's not the same computation. And that's the end of the line, not without true 3d integration.
It requires 4X speed increase every year, production quality scale can't provide even close to half of that, maybe 25%, then another 25% from design, and regarding increasing die sizes they are already close to the end. So the only way to get from 150% to 400% per year is by using multi chip designs, meaning they will have to use 2.5x more chips per year. so the multi chip package in 10 years will probably have to have almost 10,000 chips! All of them bleeding edge!!!
The H200 is estimated to cost $40K, the future 10 year chip will be more like $40 million. Or maybe more like impossible to achieve.
Yeah really, semiconductor has begun stagnating in progress recently due to fundamental limits. I'm gonna call bull on this one, I think they are rather forecasting pluging demand.
True artificial intelligence likely requires quantum computing because there's some quantum stuff happening our brains and probably the smartest living human (Sir Roger Penrose) thinks that's where the secret to consciousness is hiding after spending the last couple decades investigating that after helping Hawking finish up Einstein's work
If you just mean a chat bot that can pass the Turing test, then yeah we can just wait a decade instead of developing special tech for AI.
I mean, if we really develop artificial intelligence before we understand our own consciousness, we're probably fucked anyways.
It'd be like somehow inventing a nuclear bomb before understanding what radiation was. We'd have no idea what we're creating or what the consequences of flipping the switch would be.
Roger Penrose is a mathematician who made important contributions to theoretical physics in the 1960ies, for which he received a Nobel Prize. In later decades, he published speculative books on consciousness, quantum physics, and neurobiology. These ideas have been out there for about 30 years now but have not been able to convince scientists in general. Rather, they are generally considered implausible or outright contradicted by the evidence. Simply put: It's wrong.
The idea that quantum physics plays a direct role in brain function is very much on the fringes of science.
No offense meant. I know these ideas are very important to many spiritual people, but I felt the casual reader should know that it is not important in science.
Can we stop with this "not real AI" meme... it's a painfully dull response at this point, why does the goal post have legs? Just because Penrose thinks quantum mumbo jumbo is needed doesn't mean he is right, machine learning is completely outside his field of expertise.