Google just made its Show and Tell AI system open source on TensorFlow

Artificial intelligence keeps getting more intelligent.

Two years ago, the Google Brain team began employing machine learning techniques to teach a computer how to interpret and caption images. Sure, it won’t win any humor contests for being punny or particularly clever, but if you’re looking for a literal description of what’s in an image, Google’s AI system has you covered.

On Thursday, the internet giant announced that it had made “the latest version of our image captioning system available as an open source model in TensorFlow.” The most recent iteration of its AI “contains significant improvements to the computer vision component of the captioning system, is much faster to train, and produces more detailed and accurate descriptions compared to the original system,” Google said.

Called “Show and Tell,” the system relies on a vision component that can recognize objects in imagery with an impressive 93.9 percent accuracy rate. That’s quite the improvement from just two years ago, when the AI was still scoring in the B-range, identifying images correctly just 89.6 percent of the time. So what’s changed? In essence, Google’s tool is now trained to describe objects rather than simply classify them.

“For example, an image classification model will tell you that a dog, grass and a Frisbee are in the image,” Google noted, “but a natural description should also tell you the color of the grass and how the dog relates to the Frisbee.”

While you may not need Google to tell you what you’re looking at on a daily basis, these machine learning capabilities could be used to help those with visual impairments, and further the work of other AI researchers. “We hope that sharing this model in TensorFlow will help push forward image captioning research and applications, and will also allow interested people to learn and have fun,” Google said.

For a full description of Google’s latest algorithm, check out “Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge,” published in IEEE Transactions on Pattern Analysis and Machine Intelligence.

Lulu Chang
Former Digital Trends Contributor