Microsoft's CaptionBot describes pictures so you don't have to

Reading time icon 2 min. read


Readers help support Windows Report. When you make a purchase using links on our site, we may earn an affiliate commission. Tooltip Icon

Read the affiliate disclosure page to find out how can you help Windows Report effortlessly and without spending any money. Read more

Microsoft has launched an image recognition tool that attempts to describe the content of a picture. This artificial intelligence tool is still in the development stage and is constantly learning from pictures uploaded by users.

As far as accuracy is concerned, sometimes the description is fairly accurate while sometimes CaptionBot offers descriptions that have nothing to do with what is being depicted. There are also cases when the app cannot provide a description at all, owing to its rough-around-the-edges state.

Microsoft designed CaptionBot to learn with more experience, with the expectation that its captions will become more accurate over time. The more pictures users upload, the better the app becomes, as CaptionBot describes itself:

I can understand the content of any image and I’ll try to describe it as well as any human. I’m still learning so I’ll hold onto your photo but no personal info.

CaptionBot uses three technologies to describe what is being depicted in a picture: Microsoft’s Computer Vision, Emotion and Bing Image. The Computer Vision API extracts rich information from images to categorize and process visual data alongside identifying and extracting text from an image. The Emotion API, as its name suggests, analyze faces to detect a range of feelings, everything from anger, contempt, disgust, fear, happiness, neutrality, sadness and surprise. Bing Image searches the web for images.

We tested CaptionBot and the results were accurate in 50% of cases. For example, we uploaded two pictures: one depicting a gaming mouse, the other a stack of card. In both cases, the tool suggested it was a cell phone. On the other hand, CaptionBot accurately detected humans and faces.

Apparently, CaptionBot has an obsession with cellphones. One Twitter user reported the app thought Michelle Obama was a cell phone. For more CaptionBot funny captions, check out this Twitter page.

You can also test CaptionBot here. Do give it a try: you’ll either help the tool improve or you’ll have a good laugh!

RELATED STORIES YOU NEED TO CHECK OUT: