MIT's newest computer vision algorithm identifies images down to the pixel

Post Views: 397

For humans, identifying items in a scene — whether that’s an avocado or an Aventador, a pile of mashed potatoes or an alien mothership — is as simple as looking at them. But for artificial intelligence and computer vision systems, developing a high-fidelity understanding of their surroundings takes a bit more effort. Well, a lot more effort. Around 800 hours of hand-labeling training images effort, if we’re being specific. To help machines better see the way people do, a team of researchers at MIT CSAIL in collaboration with Cornell University and Microsoft have developed STEGO, an algorithm able to identify images down to the individual pixel.

Whereas a labeled box would have the object plus other items in the surrounding pixels within the boxed-in boundary, semantic segmentation labels every pixel in the object, but only the pixels that comprise the object — you get just dog pixels, not dog pixels plus some grass too. It’s the machine learning equivalent of using the Smart Lasso in Photoshop versus the Rectangular Marquee tool.

The problem with this technique is one of scope. Conventional multi-shot supervised systems often demand thousands, if not hundreds of thousands, of labeled images with which to train the algorithm. Multiply that by the 65,536 individual pixels that make up even a single 256×256 image, all of which now need to be individually labeled as well, and the workload required quickly spirals into impossibility.

Instead, “STEGO looks for similar objects that appear throughout a dataset,” the CSAIL team wrote in a press release Thursday. “It then associates these similar objects together to construct a consistent view of the world across all of the images it learns from.”

Trained on a wide variety of image domains — from home interiors to high altitude aerial shots — STEGO doubled the performance of previous semantic segmentation schemes, closely aligning with the image appraisals of the human control. What’s more, “when applied to driverless car datasets, STEGO successfully segmented out roads, people, and street signs with much higher resolution and granularity than previous systems. On images from space, the system broke down every single square foot of the surface of the Earth into roads, vegetation, and buildings,” the MIT CSAIL team wrote.

“In making a general tool for understanding potentially complicated data sets, we hope that this type of an algorithm can automate the scientific process of object discovery from images,” Hamilton said. “There's a lot of different domains where human labeling would be prohibitively expensive, or humans simply don’t even know the specific structure, like in certain biological and astrophysical domains. We hope that future work enables application to a very broad scope of data sets. Since you don't need any human labels, we can now start to apply ML tools more broadly.”

source

Tags: Buy Gadgets Online, digital technologies, Emerging Technology, environmental technology, High Technology, New Vehicle Technology, Winter Gadgets

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

MIT's newest computer vision algorithm identifies images down to the pixel

Other Press Releases

CBDC Ensures Nigeria Remains Competitive in Increasingly Digital World — Central Bank Governor

The Olive Corporation Could Be the Next Big Thing in Crypto, the Metaverse and the Food Sector

These are the most useless car tech features

Author Infomation

Alexander Proud

Featured Image

Recent Press Releases

Guest Post

SparxWorks Unveils DOME – A Dynamic Omni Media Experience Reimagining Digital Content Delivery

Pro-line Trailers Bringing Exclusive Display to Hyper-Fest 2025

Pro-line Trailers Declares March 3, 2025, as “Trailer Day 2025”

BrightFunded Reports Exceptional Q1 2025 Growth, Setting the Stage for Record-Breaking Year

Guest Post

SparxWorks Unveils DOME – A Dynamic Omni Media Experience Reimagining Digital Content Delivery

Pro-line Trailers Bringing Exclusive Display to Hyper-Fest 2025

Pro-line Trailers Declares March 3, 2025, as “Trailer Day 2025”

BrightFunded Reports Exceptional Q1 2025 Growth, Setting the Stage for Record-Breaking Year

Follow Us

Office Address

Contact

PR Distribution

Releases by Industry

Our Services

Newswires

Help/Support

© 2024 Copyright All Rights Reserved. Prwires.com.