Skip to main content

OpenAI’s new Shap-E tool is Dall-E for 3D objects

OpenAI‘s latest endeavor, Shap-E, is a model that allows you to generate 3D objects from text, not unlike how Dall-E can create 2D images.

According to OpenAI, Shap-E is “a conditional generative model for 3D assets. Unlike recent work on 3D generative models which produce a single output representation, Shap-E directly generates the parameters of implicit functions that can be rendered as both textured meshes and neural radiance fields.”

Image used with permission by copyright holder

The company’s GitHub posting goes on to explain how Shap-E is trained on a combination of mapping 3D assets and a conditional diffusion model.

However, this free-to-run program is a little more challenging to install and set up than the company’s ever-popular ChatGPT, as was explored by Tom’s Hardware.

You can download the Shap-E model on GitHub at no charge and access it on Microsoft Paint 3D. It also works when converted into an STL file, which allows the renders you create to be brought to life via 3D printers.

While this basic knowledge of the Shap-E model might seem simple enough, some tech savviness might be required to get the model installed and running.

The publication’s editor-in-chief, Avram Piltch, tested out Shap-E, which he claims took him eight hours to wrap his mind around. He added that OpenAI offers little by way of instructions outside of explaining that you should use a Python pip command for installation.

Once installed, Piltch says he was able to test prompts with color-animated GIF files and monochrome PLY files, with the animated GIFs being favorable, he noted.

Some prompts included a shark, a Minecraft creeper, and “an airplane that looks like a banana,” all of which had varying levels of quality depending on their file type. Piltch also used the model’s function, which lets users upload a 2D image for conversion into a 3D object.

The editor noted that those attempting to install Shap-E and render 3D objects should keep in mind that the model requires a lot of system resources from a PC.

In particular, Shap-E is compatible only with Nvidia GPUs and requires high-performance CPUs to render in a matter of minutes as opposed to hours.

Editors' Recommendations

Fionna Agomuoh
Fionna Agomuoh is a technology journalist with over a decade of experience writing about various consumer electronics topics…
How much does an AI supercomputer cost? Try $100 billion
A Microsoft datacenter.

It looks like OpenAI's ChatGPT and Sora, among other projects, are about to get a lot more juice. According to a new report shared by The Information, Microsoft and OpenAI are working on a new data center project, one part of which will be a massive AI supercomputer dubbed "Stargate." Microsoft is said to be footing the bill, and the cost is astronomical as the name of the supercomputer suggests -- the whole project might cost over $100 billion.

Spending over $100 billion on anything is mind-blowing, but when put into perspective, the price truly shows just how big a venture this might be: The Information claims that the new Microsoft and OpenAI joint project might cost a whopping 100 times more than some of the largest data centers currently in operation.

Read more
OpenAI boss takes Sora tech to Hollywood, report claims
An AI image portraying two mammoths that walk through snow, with mountains and a forest in the background.

OpenAI’s new text-to-video artificial intelligence model left jaws on the floor recently when the company offered up examples of what it can do.

Sora, as it’s called, generates astonishingly realistic footage from descriptive text inputs, and while a close look can sometimes reveal slight flaws in the imagery, the technology has left many wondering to what extent it could upend the TV and movie industries.

Read more
Nvidia turns simple text prompts into game-ready 3D models
A colorful collage of images generated by Nvidia's LATTE3D.

Nvidia just unveiled its new generative AI model, dubbed Latte3D, during GTC 2024. Latte3D appears to be ChatGPT on extreme steroids. I's a text-to-3D model that accepts simple, short text prompts and turns them into 3D objects and animals within a second. Much faster than its older counterparts, Latte3D works like a virtual 3D printe that could come in handy for creators across many industries.

Latte3D was made to simplify the creation of 3D models for many types of creators, such as those working on video games, design projects, marketing, or even machine learning and training for robotics. In Nvidia's demo of the model, it appears super simple to use. Following a quick text prompt, the AI generates a 3D model and shortly after finishes it off with much more detail. While the end result is nowhere near as lifelike as OpenAI's Sora, it's not meant to be -- this is a way to speed up creating assets instead of having to build them from the ground up.

Read more