April 30, 2019

For better deep neural network vision, just add feedback (loops)

by Sabbi Lall, Massachusetts Institute of Technology

Your ability to recognize objects is remarkable. If you see a cup under unusual lighting or from unexpected directions, there's a good chance that your brain will still compute that it is a cup. Such precise object recognition is one holy grail for artificial intelligence developers, such as those improving self-driving car navigation.

While modeling primate object recognition in the visual cortex has revolutionized artificial visual recognition systems, current deep learning systems are simplified, and fail to recognize some objects that are child's play for primates such as humans.

In findings published in Nature Neuroscience, McGovern Institute investigator James DiCarlo and colleagues have found evidence that feedback improves recognition of hard-to-recognize objects in the primate brain, and that adding feedback circuitry also improves the performance of artificial neural network systems used for vision applications.

Deep convolutional neural networks (DCNN) are currently the most successful models for accurately recognizing objects on a fast timescale (less than 100 milliseconds) and have a general architecture inspired by the primate ventral visual stream, cortical regions that progressively build an accessible and refined representation of viewed objects. Most DCNNs are simple in comparison to the primate ventral stream, however.

"For a long period of time, we were far from an model-based understanding. Thus our field got started on this quest by modeling visual recognition as a feedforward process," explains senior author DiCarlo, who is also the head of MIT's Department of Brain and Cognitive Sciences and research co-leader in the Center for Brains, Minds, and Machines (CBMM). "However, we know there are recurrent anatomical connections in brain regions linked to object recognition."

Think of feedforward DCNNs, and the portion of the visual system that first attempts to capture objects, as a subway line that runs forward through a series of stations. The extra, recurrent brain networks are instead like the streets above, interconnected and not unidirectional. Because it only takes about 200 ms for the brain to recognize an object quite accurately, it was unclear if these recurrent interconnections in the brain had any role at all in core object recognition. Perhaps those recurrent connections are only in place to keep the visual system in tune over long periods of time. For example, the return gutters of the streets help slowly clear it of water and trash, but are not strictly needed to quickly move people from one end of town to the other. DiCarlo, along with lead author and CBMM postdoc Kohitij Kar, set out to test whether a subtle role of recurrent operations in rapid visual object recognition was being overlooked.

Challenging recognition

The authors first needed to identify objects that are trivially decoded by the primate brain, but are challenging for artificial systems. Rather than trying to guess why deep learning was having problems recognizing an object (is it due to clutter in the image? a misleading shadow?), the authors took an unbiased approach that turned out to be critical.

Kar explains further that "we realized that AI models actually don't have problems with every image where an object is occluded or in clutter. Humans trying to guess why AI models were challenged turned out to be holding us back."

Instead, the authors presented the deep learning system, as well as monkeys and humans, with images, homing in on "challenge images" where the primates could easily recognize the objects in those images, but a feedforward DCNN ran into problems. When they, and others, added appropriate recurrent processing to these DCNNs, object recognition in challenge images suddenly became a breeze.

Processing times

Kar used neural recording methods with very high spatial and temporal precision to determine whether these images were really so trivial for primates. Remarkably, they found that although challenge images had initially appeared to be child's play to the human brain, they actually involve extra neural processing time (about an additional 30 ms), suggesting that recurrent loops operate in our brain, too.

"What the computer vision community has recently achieved by stacking more and more layers onto artificial neural networks, evolution has achieved through a brain architecture with recurrent connections," says Kar.

Diane Beck, professor of psychology and co-chair of the Intelligent Systems Theme at the Beckman Institute and not an author on the study, explains further. "Since entirely feedforward deep convolutional nets are now remarkably good at predicting primate brain activity, it raised questions about the role of feedback connections in the primate brain. This study shows that, yes, feedback connections are very likely playing a role in object recognition after all."

What does this mean for a self-driving car? It shows that deep learning architectures involved in object recognition need recurrent components if they are to match the primate brain, and also indicates how to operationalize this procedure for the next generation of intelligent machines.

"Recurrent models offer predictions of neural activity and behavior over time," says Kar. "We may now be able to model more involved tasks. Perhaps one day, the systems will not only recognize an object, such as a person, but also perform cognitive tasks that the human brain so easily manages, such as understanding the emotions of other people."

More information: Kohitij Kar et al. Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior, Nature Neuroscience (2019). DOI: 10.1038/s41593-019-0392-5

Journal information: Nature Neuroscience

Provided by Massachusetts Institute of Technology

This story is republished courtesy of MIT News (web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.

Citation: For better deep neural network vision, just add feedback (loops) (2019, April 30) retrieved 19 April 2024 from https://medicalxpress.com/news/2019-04-deep-neural-network-vision-feedback.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Newest computer neural networks can identify visual objects as well as the primate brain

302 shares

Feedback to editors

Engineered peptides open new avenue for immunotherapy drug development

2 hours ago

Signs of multiple sclerosis show up in blood years before symptoms, study finds

2 hours ago

Study of cancer-induced liver inflammation finds a promising therapeutic target

2 hours ago

Researchers discover new therapeutic target for non-small cell lung cancer

14 hours ago

Immune cells carry a long-lasting 'memory' of early-life pain

15 hours ago

Cannabis legalization and rising sales have not contributed to increase in substance abuse, study finds

15 hours ago

No negative impact from prolonged eye patching on child's development or family stress levels

15 hours ago

COVID-19 booster immunity lasts much longer than primary series alone, study shows

15 hours ago

Study finds that human neuron signals flow in one direction

16 hours ago

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

17 hours ago

Load comments (0)

For better deep neural network vision, just add feedback (loops)

Challenging recognition

Processing times

Engineered peptides open new avenue for immunotherapy drug development

Signs of multiple sclerosis show up in blood years before symptoms, study finds

Study of cancer-induced liver inflammation finds a promising therapeutic target

Researchers discover new therapeutic target for non-small cell lung cancer

Immune cells carry a long-lasting 'memory' of early-life pain

Cannabis legalization and rising sales have not contributed to increase in substance abuse, study finds

No negative impact from prolonged eye patching on child's development or family stress levels

COVID-19 booster immunity lasts much longer than primary series alone, study shows

Study finds that human neuron signals flow in one direction

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

Newest computer neural networks can identify visual objects as well as the primate brain

Research identifies key weakness in modern computer vision systems

Neural net activations are aligned with gamma band activity of the human visual cortex

Study validates monkey model of visual perception

An integrated visual and semantic neural network model explains human object recognition in the brain

Recognizing the partially seen

Immune cells carry a long-lasting 'memory' of early-life pain

Study finds that human neuron signals flow in one direction

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

Scientists identify airway cells that sense aspirated water and acid reflux

Environment may influence metacognitive abilities more than genetics

Perfect balance: How the brain fine-tunes its sensitivity

Phys.org

Tech Xplore

Science X

For better deep neural network vision, just add feedback (loops)

Challenging recognition

Processing times

Engineered peptides open new avenue for immunotherapy drug development

Signs of multiple sclerosis show up in blood years before symptoms, study finds

Study of cancer-induced liver inflammation finds a promising therapeutic target

Researchers discover new therapeutic target for non-small cell lung cancer

Immune cells carry a long-lasting 'memory' of early-life pain

Cannabis legalization and rising sales have not contributed to increase in substance abuse, study finds

No negative impact from prolonged eye patching on child's development or family stress levels

COVID-19 booster immunity lasts much longer than primary series alone, study shows

Study finds that human neuron signals flow in one direction

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

Related Stories

Newest computer neural networks can identify visual objects as well as the primate brain

Research identifies key weakness in modern computer vision systems

Neural net activations are aligned with gamma band activity of the human visual cortex

Study validates monkey model of visual perception

An integrated visual and semantic neural network model explains human object recognition in the brain

Recognizing the partially seen

Recommended for you

Immune cells carry a long-lasting 'memory' of early-life pain

Study finds that human neuron signals flow in one direction

A common pathway in the brain that enables addictive drugs to hijack natural reward processing identified

Scientists identify airway cells that sense aspirated water and acid reflux

Environment may influence metacognitive abilities more than genetics

Perfect balance: How the brain fine-tunes its sensitivity

Newsletter sign up

Donate and enjoy an ad-free experience