OpenAI, a non-profit AI company that will lose anywhere from $4 billion to $5 billion this year, will at some point in the next six or so months convert into a for-profit AI company, at which point it will continue to lose money in exactly the same way. Shortly after
Then as you ask "provide sources.", it says simply "Source: Tech Review Websites". If this came from an actual person I would genuinely ask it "do you take me for gullible trash?".
It's still somewhat useful, due to Google Search crumbling away into nothingness, if you ask "link me five sites with info about [topic]".
Your experience highlights what current iterations of LLMs are not well suited for, so I understand if that's what you were hoping to achieve, why you were left wanting, or disillusioned.
There's a lot of things that LLMs are really good at, or incredibly useful for, such as ingesting large bodies of text, and then analyzing them based on your ability to create well thought out prompts.
This can save you hours and hours, of reading time, and it's something that you can verify the answer on relatively quickly, to double check the LLMs response accuracy.
They're also good at doing something Google used to be good at, but sucks at now. Which enabling you to describe process, simple or complicated, short or long, that you either can't recall the name of, or aren't even sure where it's called, and letting you know exactly what it is. Also, easily verifiable.
There's plenty of other things too, but just remember that they are tools, not magic, or sentient intelligence.
The models are not real time, but there are tricks to figure out it's most recent dates of ingestion, such as asking topical entertainment or news questions, but don't go looking for a real-time information.
Also, I have yet to find a model that can provide an actual URL and specific source for anything it generates, which is why it's a good practice to use them to do tasks, or get information, that would take you longer to do, or get, manually, but that can be easily verified once you receive it.
But if any research source cannot be used without verification, is it really useful? I agree, we should verfiy crucial information but when its wrong often, but confidently so, using natural language is a barrier not a benefit.
There's a lot of things that LLMs are really good at, or incredibly useful for, such as ingesting large bodies of text, and then analyzing them based on your ability to create well thought out prompts.
That's the story people tell at least. The weasel phrase at the end is fun, I guess. Leaves a massive backdoor excuse when it doesn't actually work.
But in practice, LLMs are falling down even at this job. They seem to have some yse in academic qualitaruve coding, but for summarizing novel or extended bodies of text, they struggle to actually tell people what they want to know.
Most people do not give a shit if text contains a reference to X. And if they do, they can generally just CTRL+F "X".
"You'll fucking know when I'm swearing at you," was my reply to that shit the last time I gave it a spin (after it regurgitated nonsense after many prompts specifically asking for not nonsense).
I hope sonething better comes along because google ruined their search engine a decade ago. stract.com is probabky the closest to what google used to be.
As for chatgpt, it is not an index. It cannot refer you back to infornation it was trained on because it doesn't build a massive indexed internet database.
It has some method of probable relations and conglomerarion of input.
It is why it "hallucinates" information output, because it doesn't "know" what is wrong or right info, it just fetches data based on probabilities of connections.
It is good at suggesting new music or movies based on your list of media you like, but it is terrible with actual factual info
The funniest part is that all the AI hype is focused on all the wrong things. There are absolutely great AI tools that get very little mention.
For example, I'm visually impaired and use AI tools A fair bit to help me get around the internet and such. Especially when it comes to using AI I to generate descriptions of images.
Ed is getting good at lobbing these darts at hype bubbles.
The thing that this writeup ignores is that the object isn't to show short-term revenue, but to put all competitors out of business, be the last one standing, and create a monopoly. Either that or get bought out so the investors can move on to the next thing. But at $150B valuation, only MSFT or Nvidia can afford to buy them outright.
Google, Meta, and Amazon burned through cash for years, but they eventually outran all competition and then monetized the users who had nowhere else to go.
See that it's never going to make money, go public, hand the keys over to someone else, and then try again with a wallet full of cash and a reputation for making billion dollar businesses.
That cost-per-user doesn’t decrease as you add more customers. You need more servers. More GPUs.
This is assuming constant use, which is not the case. If I have a server handling LLM prompt requests, and for illustrative purposes each request uses 100% of the single discrete GPU in it, and I only have 1 customer, but that one customer only uses it 5% of the day (which would actually be pretty high in real terms), I can still add additional customers without needing to buy additional servers. The question is whether the given revenue of a single server outweighs its cost to run.
And when it comes to training, that is an upfront cost, that you could (if you get a model to where you want it) stop having to pay whenever you want. I'm pretty surprised they haven't been really leaning into training models for medical diagnoses, because once you have a model that can e.g. spot a type of tumor with n% accuracy beyond a human, you don't really have to refine it further if you don't want to (after all, it's not like the humans can choose to do it better themselves at that point, like they can with writing prompts).
I'd say they've probably long reached the point where they have enough customers around the world to hold the load on their servers fairly constant. The example with one user only taking 5% of a servers load only works for low customer counts, similar to how you can't count on one wind turbine or solar plant to provide all of your energy but if you have enough of them you can provide a base line of fairly constant energy