Skip to main content

His granddad lost the ability to read, so he built a DIY text-to-speech rig

text to speech reader
Bennie Affleck

Every birthday, Bennie Affleck (no, not that one) buys his grandfather the best bottle of Chilean red wine he can find. But when his grandad, also named Ben, was due to turn 100 this year, his 40-year-old British software engineer grandson decided the occasion warranted a special present.

“He’s a very inspirational man,” Affleck, who runs a U.K. software consultancy called Ionium Design, said of Ben Sr. “Still cheerful despite outliving two wives and both his children.”

For his grandfather’s centenary, a bottle of wine — or even a couple of them — simply wasn’t going to cut it. So Affleck built him an A.I.-powered digital text-to-speech reading machine for helping his grandad, who has macular degeneration, to be able to read again.

Reading machines

“My grandfather began losing his sight around 10 years ago,” Affleck told Digital Trends. “He is an extremely determined man and wants to stay independent, living at home, and taking an active part in the world. Losing his ability to drive was a blow, but he replaced that with taxis. Losing his ability to read has been harder.”

text to speech reader
Bennie Affleck

Despite the best available vision aids, Affleck said his grandfather regularly took over an hour to decipher a single A4 letter. “Although there are many smartphone and tablet-based [tools] available, they are unsuitable for people unfamiliar with these devices, or whose vision, touch, or hearing makes touch screens unusable,” he said.

What Affleck designed for his grandfather as an alternative was a DIY digital reading assistant, made using off-the-shelf components and software. It allows users to place a document on an illuminated platform, where it is scanned by a camera, deciphered by text recognition algorithms, and ultimately read out in a natural-sounding voice.

To make the interface easier for someone with diminishing eyesight, the machine is kitted out with chunky colored buttons. Hitting the blue one scans a document. Green starts reading the most recently scanned document. The left yellow button skips back six seconds, the right yellow skips forward six seconds, and the red button pauses and unpauses. For security reasons, Affleck said that the machine does not store any of the scanned documents locally or in the cloud.

Designed for ease-of-use

Affleck said that he usually works in a cabin in his garden. However, he had to piece together his grandfather’s gift on his kitchen table, due to a lack of outside heating. (He’s a snowboarder as a hobby and used to the cold, but everyone has their limits!)

“The entire device was constructed in three weeks,” he said. “Building a physical enclosure could have been daunting, but I had a brainwave to repurpose an old 3M portable overhead projector. All other parts came from small U.K. businesses and [a big electronic components company.] My neighbor kindly machined the metalwork for a sturdy keypad of my design, that I fitted with arcade buttons.”

text to speech reader
Bennie Affleck

The reading machine’s software is custom Python with Google’s Cloud Vision and Wavenet text-to-speech software. It’s powered via a Raspberry Pi 3B with a Pi V2 camera.

“The complete system works surprisingly well,” Affleck said, describing this as a “testament” to Google’s high-quality A.I. tools. “Printed text is read with amazing accuracy, even accounting for rotations, distortions, [and other challenges]. The voice is also very listenable. In testing, I had the machine reading pages from The Lion The Witch and The Wardrobe, and I found myself getting engaged in the story.”

An amusing, but poignant, moment occurred when Affleck gave his grandfather the gift. “After setting it up for him, grandad said, ‘Now I can use this to read the instructions for my digital magnifier,” Affleck said. “It was funny and sad that he couldn’t operate another device he’d bought to help him because he couldn’t read its instructions.”

Helping more people

The only difficulty with the machine at present, Affleck said, is with spatially structured data like bank statements. “I will be adding heuristics and some of my own A.I. to allow these to be read in a more human-like manner,” he noted.

Affleck said that building this device has given him a new appreciation of the challenges people with limited eyesight face. “As I began developing the concept, it became apparent that many other people have similar problems. I realized there is demand for a much better device, so I built the best prototype I could and am building more units so I can run field trials with volunteers in February.”

If these tests go well, Affleck said that he would consider turning this into a product — complete with additional features and more suitable casing. For now, though, he’s built a game-changing device his grandfather can use on a daily basis — and, really, that’s just what he set out to do.

Luke Dormehl
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
Digital Trends’ Top Tech of CES 2023 Awards
Best of CES 2023 Awards Our Top Tech from the Show Feature

Let there be no doubt: CES isn’t just alive in 2023; it’s thriving. Take one glance at the taxi gridlock outside the Las Vegas Convention Center and it’s evident that two quiet COVID years didn’t kill the world’s desire for an overcrowded in-person tech extravaganza -- they just built up a ravenous demand.

From VR to AI, eVTOLs and QD-OLED, the acronyms were flying and fresh technologies populated every corner of the show floor, and even the parking lot. So naturally, we poked, prodded, and tried on everything we could. They weren’t all revolutionary. But they didn’t have to be. We’ve watched enough waves of “game-changing” technologies that never quite arrive to know that sometimes it’s the little tweaks that really count.

Read more
Digital Trends’ Tech For Change CES 2023 Awards
Digital Trends CES 2023 Tech For Change Award Winners Feature

CES is more than just a neon-drenched show-and-tell session for the world’s biggest tech manufacturers. More and more, it’s also a place where companies showcase innovations that could truly make the world a better place — and at CES 2023, this type of tech was on full display. We saw everything from accessibility-minded PS5 controllers to pedal-powered smart desks. But of all the amazing innovations on display this year, these three impressed us the most:

Samsung's Relumino Mode
Across the globe, roughly 300 million people suffer from moderate to severe vision loss, and generally speaking, most TVs don’t take that into account. So in an effort to make television more accessible and enjoyable for those millions of people suffering from impaired vision, Samsung is adding a new picture mode to many of its new TVs.
[CES 2023] Relumino Mode: Innovation for every need | Samsung
Relumino Mode, as it’s called, works by adding a bunch of different visual filters to the picture simultaneously. Outlines of people and objects on screen are highlighted, the contrast and brightness of the overall picture are cranked up, and extra sharpness is applied to everything. The resulting video would likely look strange to people with normal vision, but for folks with low vision, it should look clearer and closer to "normal" than it otherwise would.
Excitingly, since Relumino Mode is ultimately just a clever software trick, this technology could theoretically be pushed out via a software update and installed on millions of existing Samsung TVs -- not just new and recently purchased ones.

Read more
AI turned Breaking Bad into an anime — and it’s terrifying
Split image of Breaking Bad anime characters.

These days, it seems like there's nothing AI programs can't do. Thanks to advancements in artificial intelligence, deepfakes have done digital "face-offs" with Hollywood celebrities in films and TV shows, VFX artists can de-age actors almost instantly, and ChatGPT has learned how to write big-budget screenplays in the blink of an eye. Pretty soon, AI will probably decide who wins at the Oscars.

Within the past year, AI has also been used to generate beautiful works of art in seconds, creating a viral new trend and causing a boon for fan artists everywhere. TikTok user @cyborgism recently broke the internet by posting a clip featuring many AI-generated pictures of Breaking Bad. The theme here is that the characters are depicted as anime characters straight out of the 1980s, and the result is concerning to say the least. Depending on your viewpoint, Breaking Bad AI (my unofficial name for it) shows how technology can either threaten the integrity of original works of art or nurture artistic expression.
What if AI created Breaking Bad as a 1980s anime?
Playing over Metro Boomin's rap remix of the famous "I am the one who knocks" monologue, the video features images of the cast that range from shockingly realistic to full-on exaggerated. The clip currently has over 65,000 likes on TikTok alone, and many other users have shared their thoughts on the art. One user wrote, "Regardless of the repercussions on the entertainment industry, I can't wait for AI to be advanced enough to animate the whole show like this."

Read more