Skip to main content

Nvidia turns simple text prompts into game-ready 3D models

A colorful collage of images generated by Nvidia's LATTE3D.
Nvidia

Nvidia just unveiled its new generative AI model, dubbed Latte3D, during GTC 2024. Latte3D appears to be ChatGPT on extreme steroids. I’s a text-to-3D model that accepts simple, short text prompts and turns them into 3D objects and animals within a second. Much faster than its older counterparts, Latte3D works like a virtual 3D printe that could come in handy for creators across many industries.

Latte3D was made to simplify the creation of 3D models for many types of creators, such as those working on video games, design projects, marketing, or even machine learning and training for robotics. In Nvidia’s demo of the model, it appears super simple to use. Following a quick text prompt, the AI generates a 3D model and shortly after finishes it off with much more detail. While the end result is nowhere near as lifelike as OpenAI’s Sora, it’s not meant to be — this is a way to speed up creating assets instead of having to build them from the ground up.

The model generates several different options for the user to choose from, and Nvidia says that these shapes can be “optimized for higher quality within a few minutes.” The designs can then be exported to different platforms, such as Nvidia’s Omniverse, and can be tweaked to match the desired end result. Nvidia trained Latte3D by using its Ada A100 Tensor Core GPUs and supported the training with ChatGPT prompts to ready it for interacting with real users.

Get your weekly teardown of the tech behind PC gaming
Check your inbox!

As of right now, Latte3D can only generate objects and animals. To that end, it appears to do a solid job of discerning different animals, textures, and object types. Nvidia showed off these capabilities by presenting objects such as an amigurumi (crochet) common crane or an origami sphynx cat. The model was taught to recognize various species and thus can tell the difference between an Italian greyhound and a Shiba Inu.

LATTE3D Text to 3D Generative AI Model from NVIDIA Research

Creators who want to use Latte3D to do more can train it on a different dataset, be it plants or household objects, and later use it for their own purposes. Nvidia brings up some interesting use cases here, such as training personal assistant robots before deploying them. It’s easy to imagine that Latte3D will come in handy for game devs, but the potential goes far beyond just gaming scenarios.

Sanja Fidler, vice president of AI research at Nvidia, remarked on how much faster Latte3D is compared to its predecessors: “A year ago, it took an hour for AI models to generate 3D visuals of this quality — and the current state of the art is now around 10 to 12 seconds. We can now produce results an order of magnitude faster,” said Fidler.

The recent announcements related to using AI in game development are all pretty groundbreaking, and Nvidia’s Latte3D joins a growing list of tools that may one day completely change the process of creating a game. For instance, Nvidia just recently unveiled non-player characters (NPCs) with dialogue entirely generated by AI. Meanwhile, Unreal Engine’s latest update can generate film-quality visuals in games in real time, all with the help of machine learning.

Editors' Recommendations

Monica J. White
Monica is a UK-based freelance writer and self-proclaimed geek. A firm believer in the "PC building is just like expensive…
What is AMD 3D V-Cache? Extra gaming performance unlocked
The AMD Ryzen 9 7950X3D installed in a motherboard.

AMD launched the Ryzen 7 5800X3D in 2022, bringing the world's first processor with 3D V-Cache to market. It remains one of the best gaming CPUs available in 2023, offering credible competition to AMD's Ryzen 7000 series and Intel's Raptor Lake processors. It's no longer alone, though, with newer Ryzen 7000-series 3D VCache CPUs poised to offer even greater gaming performance.

With AMD's latest Ryzen 7000 generation now launched, here's everything you need to know about 3D V-Cache.
What is AMD 3D V-Cache?

Read more
I brought ChatGPT to the board game world. Is it ready for game night?
chatgpt board game night chatgptboard03

We all know that ChatGPT is great at speeding up mundane tasks. What could be drier than explaining the rules of a complicated game at board game night?

There's no substitute for just knowing the game, but being able to reach for AI instead of the rulebook could make things a whole lot easier. Nothing derails a great game of Twilight Imperium like breaking out the Living Rules and endlessly scrolling. So, if I were to bring ChatGPT to board game night, I could definitely see it coming in handy. But before I subjected my friends to a robot reading them the rules, I decided to test it out with some basic questions to see if it was up to snuff.
A defined ruleset sounds ideal
I have no idea why ChatGPT knows the rules of so many games, but it does. Or at least thinks it does. While it might be tricky to find an online manual for some of my collection, ChatGPT seems to have it all -- some of those billions of data points it was trained on reportedly included the errata for something as obscure as the third Battlestar Galactica expansion.

Read more
Nvidia built a massive dual GPU to power models like ChatGPT
Nvidia's H100 NVL being installed in a server.

Nvidia's semi-annual GPU Technology Conference (GTC) usually focuses on advancements in AI, but this year, Nvidia is responding to the massive rise of ChatGPT with a slate of new GPUs. Chief among them is the H100 NVL, which stitches two of Nvidia's H100 GPUs together to deploy Large Language Models (LLM) like ChatGPT.

The H100 isn't a new GPU. Nvidia announced it a year ago at GTC, sporting its Hopper architecture and promising to speed up AI inference in a variety of tasks. The new NVL model with its massive 94GB of memory is said to work best when deploying LLMs at scale, offering up to 12 times faster inference compared to last-gen's A100.

Read more