Microsoft’s CaptionBot describes pictures so you don’t have to
Microsoft has launched an image recognition tool that attempts to describe the content of a picture. This artificial intelligence tool is still in the development stage and is constantly learning from pictures uploaded by users.
Accuracy varies: some descriptions are fairly accurate, while others have nothing to do with what is depicted. There are also cases where the app cannot provide a description at all, owing to its rough-around-the-edges state.
Microsoft designed CaptionBot to learn from experience, with the expectation that its captions will become more accurate over time. The more pictures users upload, the better the app should become. As CaptionBot describes itself:
I can understand the content of any image and I’ll try to describe it as well as any human. I’m still learning so I’ll hold onto your photo but no personal info.
CaptionBot uses three technologies to describe what is depicted in a picture: Microsoft’s Computer Vision, Emotion and Bing Image. The Computer Vision API extracts rich information from images to categorize and process visual data, and can also identify and extract text from an image. The Emotion API, as its name suggests, analyzes faces to detect a range of feelings: anger, contempt, disgust, fear, happiness, neutrality, sadness and surprise. Bing Image searches the web for images.
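To make the division of labor concrete, here is a minimal Python sketch of how the output of a vision step and an emotion step might be merged into one caption. It is not Microsoft's implementation and it calls no real API: `VisionResult`, `dominant_emotion`, `compose_caption`, the confidence thresholds and the wording of the captions are all hypothetical, and the per-face scores are assumed to arrive as plain dictionaries keyed by the eight emotions listed above.

```python
from dataclasses import dataclass

# The eight feelings the article lists for the Emotion API.
EMOTIONS = ("anger", "contempt", "disgust", "fear",
            "happiness", "neutrality", "sadness", "surprise")


@dataclass
class VisionResult:
    """Hypothetical output of the vision step (not a real API type)."""
    caption: str       # e.g. "a person holding a cell phone"
    confidence: float  # assumed to lie between 0.0 and 1.0


def dominant_emotion(scores: dict) -> str:
    """Return the highest-scoring emotion for one face."""
    return max(EMOTIONS, key=lambda name: scores.get(name, 0.0))


def compose_caption(vision: VisionResult, faces: list,
                    min_confidence: float = 0.5) -> str:
    """Merge the vision caption with per-face emotions (illustrative only)."""
    # Below the assumed threshold, give up instead of guessing.
    if vision.confidence < min_confidence:
        return "I can't describe this picture."

    # Hedge the wording when the vision step is not very sure.
    prefix = "It's" if vision.confidence >= 0.9 else "I think it's"
    text = f"{prefix} {vision.caption}"

    # Append the distinct dominant emotions, skipping faces with no scores.
    moods = sorted({dominant_emotion(face) for face in faces if face})
    if moods:
        text += " (detected: " + ", ".join(moods) + ")"
    return text + "."


if __name__ == "__main__":
    result = VisionResult("a woman smiling at the camera", 0.8)
    print(compose_caption(result, [{"happiness": 0.95, "neutrality": 0.05}]))
```

In this sketch, a low-confidence vision result produces a refusal rather than a guess, which mirrors the cases where the app provides no description at all.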
We tested CaptionBot and the results were accurate in 50% of cases. For example, we uploaded two pictures: one depicting a gaming mouse, the other a stack of cards. In both cases, the tool suggested the object was a cell phone. On the other hand, CaptionBot accurately detected humans and faces.
CaptionBot appears to have an obsession with cell phones. One Twitter user reported that the app thought Michelle Obama was a cell phone. For more funny CaptionBot captions, check out this Twitter page.
You can also test CaptionBot here. Do give it a try: you’ll either help the tool improve or you’ll have a good laugh!