Skip to main content

A.I. detects skin cancer better than dermatologists in international study

Skin cancer detection won’t be turned over to machines anytime soon, but artificial intelligence detected skin cancer more accurately than a large group of international dermatologists in controlled testing, Agence France Presse reports.

In an academic study and clinical trial published in Annals of Oncology, the study’s lead author, Professor Holger A. Haenssle, of the University of Heidelberg Department of Dermatology, wrote, “Most dermatologists were outperformed by the CNN. Regardless of any physician’s level of experience, they may benefit from assistance by a CNN’s image classification.”

Man versus machine

The study pitted 58 dermatologists from 17 countries against a deep learning convolutional neural network (CNN).

Prior to the test, researchers from Germany, France, and the U.S.  taught the CNN to differentiate benign skin lesions from dangerous melanomas. In the process, the team showed more than 100,000 images of correctly identified skin cancers to the neural network, which was designed with Google’s Inception v4 CNN architecture.

The 58 dermatologists were divided into three self-identified groups: beginners with less than two years of experience, skilled with two to five years, and experts with more than five years of experience. There were 19 beginners, 11 skilled, and 30 experts among the group.

Two tests were run. In one test the dermatologists were shown 100 dermoscopic images with no other information. They were asked to indicate whether the cancer was a melanoma or benign. In addition, the doctors were asked whether they would recommend excision, short-term follow-up, or no action. Four weeks later the dermatologists were shown the same images again, this time with additional clinical information about the patients plus close-up images.

The results

The CNN scored higher than the overall group of dermatologists on both tests, with and without extra information. The dermatologists accurately identified an average of 86.5 percent of the skin cancers on the image-only test. In the second test, with more information, the doctors averaged 88.9 percent accuracy.  The CNN, however, correctly detected the types of cancers 95 percent of the time based on images only.

Rated by experience group, none of the three groups of dermatologists was as accurate as the neural network. The team did report, however, that 18 of the dermatologists scored higher than the CNN.

“The CNN missed fewer melanomas, meaning it had a higher sensitivity than the dermatologists,” Haenssle said. It also “misdiagnosed fewer benign moles as malignant melanoma … this would result in less unnecessary surgery.”

According to the authors of the study, the test does not mean machines will replace doctors. One issue is that melanomas can be difficult to recognize or image in some parts of the body such as the toes and scalp. The study calls for repeated, large-sized clinical tests.

The test does show, however, that dermatologists at all skill levels could benefit from A.I. assistance in skin cancer classification.

Editors' Recommendations

Bruce Brown
Digital Trends Contributing Editor Bruce Brown is a member of the Smart Homes and Commerce teams. Bruce uses smart devices…
Algorithmic architecture: Should we let A.I. design buildings for us?
Generated Venice cities

Designs iterate over time. Architecture designed and built in 1921 won’t look the same as a building from 1971 or from 2021. Trends change, materials evolve, and issues like sustainability gain importance, among other factors. But what if this evolution wasn’t just about the types of buildings architects design, but was, in fact, key to how they design? That’s the promise of evolutionary algorithms as a design tool.

While designers have long since used tools like Computer Aided Design (CAD) to help conceptualize projects, proponents of generative design want to go several steps further. They want to use algorithms that mimic evolutionary processes inside a computer to help design buildings from the ground up. And, at least when it comes to houses, the results are pretty darn interesting.
Generative design
Celestino Soddu has been working with evolutionary algorithms for longer than most people working today have been using computers. A contemporary Italian architect and designer now in his mid-70s, Soddu became interested in the technology’s potential impact on design back in the days of the Apple II. What interested him was the potential for endlessly riffing on a theme. Or as Soddu, who is also professor of generative design at the Polytechnic University of Milan in Italy, told Digital Trends, he liked the idea of “opening the door to endless variation.”

Read more
Emotion-sensing A.I. is here, and it could be in your next job interview
man speaking into phone

I vividly remember witnessing speech recognition technology in action for the first time. It was in the mid-1990s on a Macintosh computer in my grade school classroom. The science fiction writer Arthur C. Clarke once wrote that “any sufficiently advanced technology is indistinguishable from magic” -- and this was magical all right, seeing spoken words appearing on the screen without anyone having to physically hammer them out on a keyboard.

Jump forward another couple of decades, and now a large (and rapidly growing) number of our devices feature A.I. assistants like Apple’s Siri or Amazon’s Alexa. These tools, built using the latest artificial intelligence technology, aren’t simply able to transcribe words -- they are able to make sense of their contents to carry out actions.

Read more
Language supermodel: How GPT-3 is quietly ushering in the A.I. revolution
Profile of head on computer chip artificial intelligence.

OpenAI’s GPT-2 text-generating algorithm was once considered too dangerous to release. Then it got released -- and the world kept on turning.

In retrospect, the comparatively small GPT-2 language model (a puny 1.5 billion parameters) looks paltry next to its sequel, GPT-3, which boasts a massive 175 billion parameters, was trained on 45 TB of text data, and cost a reported $12 million (at least) to build.

Read more