Skip to main content

Nvidia turns simple text prompts into game-ready 3D models

A colorful collage of images generated by Nvidia's LATTE3D.
Nvidia

Nvidia just unveiled its new generative AI model, dubbed Latte3D, during GTC 2024. Latte3D appears to be ChatGPT on extreme steroids. I’s a text-to-3D model that accepts simple, short text prompts and turns them into 3D objects and animals within a second. Much faster than its older counterparts, Latte3D works like a virtual 3D printe that could come in handy for creators across many industries.

Latte3D was made to simplify the creation of 3D models for many types of creators, such as those working on video games, design projects, marketing, or even machine learning and training for robotics. In Nvidia’s demo of the model, it appears super simple to use. Following a quick text prompt, the AI generates a 3D model and shortly after finishes it off with much more detail. While the end result is nowhere near as lifelike as OpenAI’s Sora, it’s not meant to be — this is a way to speed up creating assets instead of having to build them from the ground up.

The model generates several different options for the user to choose from, and Nvidia says that these shapes can be “optimized for higher quality within a few minutes.” The designs can then be exported to different platforms, such as Nvidia’s Omniverse, and can be tweaked to match the desired end result. Nvidia trained Latte3D by using its Ada A100 Tensor Core GPUs and supported the training with ChatGPT prompts to ready it for interacting with real users.

Get your weekly teardown of the tech behind PC gaming
Check your inbox!

As of right now, Latte3D can only generate objects and animals. To that end, it appears to do a solid job of discerning different animals, textures, and object types. Nvidia showed off these capabilities by presenting objects such as an amigurumi (crochet) common crane or an origami sphynx cat. The model was taught to recognize various species and thus can tell the difference between an Italian greyhound and a Shiba Inu.

LATTE3D Text to 3D Generative AI Model from NVIDIA Research

Creators who want to use Latte3D to do more can train it on a different dataset, be it plants or household objects, and later use it for their own purposes. Nvidia brings up some interesting use cases here, such as training personal assistant robots before deploying them. It’s easy to imagine that Latte3D will come in handy for game devs, but the potential goes far beyond just gaming scenarios.

Sanja Fidler, vice president of AI research at Nvidia, remarked on how much faster Latte3D is compared to its predecessors: “A year ago, it took an hour for AI models to generate 3D visuals of this quality — and the current state of the art is now around 10 to 12 seconds. We can now produce results an order of magnitude faster,” said Fidler.

The recent announcements related to using AI in game development are all pretty groundbreaking, and Nvidia’s Latte3D joins a growing list of tools that may one day completely change the process of creating a game. For instance, Nvidia just recently unveiled non-player characters (NPCs) with dialogue entirely generated by AI. Meanwhile, Unreal Engine’s latest update can generate film-quality visuals in games in real time, all with the help of machine learning.

Monica J. White
Monica is a UK-based freelance writer and self-proclaimed geek. A firm believer in the "PC building is just like expensive…
Nvidia built a massive dual GPU to power models like ChatGPT
Nvidia's H100 NVL being installed in a server.

Nvidia's semi-annual GPU Technology Conference (GTC) usually focuses on advancements in AI, but this year, Nvidia is responding to the massive rise of ChatGPT with a slate of new GPUs. Chief among them is the H100 NVL, which stitches two of Nvidia's H100 GPUs together to deploy Large Language Models (LLM) like ChatGPT.

The H100 isn't a new GPU. Nvidia announced it a year ago at GTC, sporting its Hopper architecture and promising to speed up AI inference in a variety of tasks. The new NVL model with its massive 94GB of memory is said to work best when deploying LLMs at scale, offering up to 12 times faster inference compared to last-gen's A100.

Read more
A dangerous new jailbreak for AI chatbots was just discovered
the side of a Microsoft building

Microsoft has released more details about a troubling new generative AI jailbreak technique it has discovered, called "Skeleton Key." Using this prompt injection method, malicious users can effectively bypass a chatbot's safety guardrails, the security features that keeps ChatGPT from going full Taye.

Skeleton Key is an example of a prompt injection or prompt engineering attack. It's a multi-turn strategy designed to essentially convince an AI model to ignore its ingrained safety guardrails, "[causing] the system to violate its operators’ policies, make decisions unduly influenced by a user, or execute malicious instructions," Mark Russinovich, CTO of Microsoft Azure, wrote in the announcement.

Read more