Xbox plans for using AI to create scripts, dialogue trees, quest lines

Microsoft is bringing AI characters to Xbox

From reading an earlier article, it sounds like there are at least two things being done:
I can believe that #1 is technically-doable today, but I'm not clear whether there's a need for it.
And #2 I can believe a need for, but I'm not clear how technically-doable it is.
For #1, people already do use LLMs to generate text. But...it's generally not very high quality text.
In most video games I see, the limiting factor isn't the amount of text to produce -- I generally do very little reading in video games. Rather, it's how much text people are willing to read. There are maybe some niche video games that have a lot of text, like the stuff that Choice of Games put out, where one is basically doing a choose-your-own-adventure type book. But for those, the game lives and dies on writing quality -- which moving to an LLM does not, today, seem likely to improve -- and I'm not sure whether the cost is a big deal; my question is really more whether I'd enjoy the time I put into playing the game. I mean, the games are pretty inexpensive.
Choice of Robots, one popular example, is $6.99.
https://keytowriting.com/guides/ideal-length-different-books-novels-non-fiction-short-stories/375/
So, already, you're talking about the equivalent of three novels there, if you play through it enough times to hit all the content. (checks) The Lord of the Rings trilogy is only about 500k words. That's a lot of player time for a few dollars.
But, okay. Let's say that we aim at having an LLM reduce the writer's workload by 50%. To do that, it's going to have to generate more than 50% of the text, because the writer is probably going to have to do a pass over the text and touch it up. The vendor is probably getting something like 30% of the sale price. I don't know what the rest of the breakdown is, but let's assume that the writer gets all of the remaining 70% and that the LLM does half of all the work, all of which seem like pretty generous assumptions. Then we're talking about maybe selling the game for $4.54 instead of $6.99. I don't know how much that's going to impact my purchasing decision; there are a lot of Choice of Games games that I don't really think are all that worth playing, and a bigger factor is whether it's worth my time to read the text.
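To sanity-check that number, here's a quick sketch of the arithmetic in Python. The function and parameter names are just mine for illustration, and the 30% vendor cut and the writer getting the whole remainder are the same guesses as above, not real figures.

```python
def discounted_price(price, vendor_share, work_saved):
    """Price if the writer's share shrinks in proportion to the work saved.

    All inputs are guesses from the comment above, not real sales data.
    """
    # Whatever the vendor doesn't take is assumed to go to the writer.
    writer_cut = price * (1 - vendor_share)
    # The LLM is assumed to remove this fraction of the writer's cost.
    savings = writer_cut * work_saved
    return round(price - savings, 2)


# $6.99 game, 30% vendor cut, LLM does half of the writing work.
print(discounted_price(6.99, 0.30, 0.50))  # 4.54
```

So even with generous assumptions, the saving is about $2.45 per copy.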
The inputs that I do care about are how appealing the plot and writing are, whether it's worth my time. Today, LLMs are not competitive with humans at that, and in any event, as long as they're just using an approach of being trained on and aiming to replicate the style of human works, are not super-likely to exceed human quality. I can very much believe that it's possible to create software that writes more-engaging stuff than a human does, but I don't think that doing that is near-term commercially practical.
So it's kind of hard for me to see how compelling this is gonna be as an aid to writers.
Maybe it'd be possible to help a human ghostwriter imitate a writing style -- LLMs seem to be pretty impressive at imitating graphical styles of human artists. But I don't think that human writers have as much of a recognizable style as human graphical artists.
Okay, how about #2?
There, I completely believe that there's value in access to dynamically-generated conversation. It would be great if a game could dynamically interact with the player and deal with many permutations of their actions.
But from what I've seen, looking at KoboldAI (or TavernAI or similar), I'm skeptical that the performance or writing quality is there. For those who haven't poked at those, there are several problems:
Oh, and a #3 -- in this article, they have a reference to doing voice synthesis. I can definitely see voice synth being used to speak arbitrary text. Stuff like Tortoise TTS is pretty good, can very easily generate a voice from samples or combinations of samples, and its short output snippet constraint is no big deal for the type of text that comes up in many video games. It's a great fit for static generation of speech for mods, where the original voice actors aren't available and maybe one doesn't require the absolute best acting. But...again, it's slow and VRAM-hungry, so I don't know how practical it is for dynamic, in-game speech synth at runtime. And speech synthesizers aren't new; there's nothing really specific to gaming or characters there, unless maybe one wants ease of generating new voices. If one uses it to generate static speech snippets, sure, that could be done...but then, what's game-specific here? You're just using an ordinary old speech synthesis engine.
Maybe combining animated lip movements with static, generated speech? There are tools for that, but maybe it's possible to do a better job of automatically generating facial and hand gestures to fit with speech.
I just don't see how we're at a point today where it's really possible to take a lot of commercial advantage of this in video games.
I can definitely believe that there are non-writing applications, right now, for LLMs in video games, like in generating character art and animations. But it's less-clear to me where the large opportunities are when it comes to writing and speech.