Using DNNs to understand the primate vision: A shortcut or a distraction?

Yaoda Xu; Maryam Vaziri-Pashkam

doi:10.1017/S0140525X23001528

Using DNNs to understand the primate vision: A shortcut or a distraction?

Published online by Cambridge University Press: 06 December 2023

Yaoda Xu

and

Maryam Vaziri-Pashkam

Show author details

Yaoda Xu: Affiliation:
Department of Psychology, Yale University, New Haven, CT, USA yaoda.xu@yale.edu, https://sites.google.com/view/yaodaxu/home
Maryam Vaziri-Pashkam: Affiliation:
National Institute of Mental Health, Bethesda, MD, USA maryam.vaziri-pashkam@nih.gov, https://mvaziri.github.io/Homepage/Bio.html

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Bowers et al. bring forward critical issues in the current use of deep neural networks (DNNs) to model primate vision. Our own research further reveals fundamentally different algorithms utilized by DNNs for visual processing compared to the brain. It is time to reemphasize the value of basic vision research and put more resources and effort on understanding the primate brain itself.

Type: Open Peer Commentary
Information: Behavioral and Brain Sciences , Volume 46 , 2023 , e413

DOI: https://doi.org/10.1017/S0140525X23001528 [Opens in a new window]
Copyright: Copyright © The Author(s), 2023. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Bakhtiari, S., Mineault, P., Lillicrap, T., Pack, C., & Richards, B. (2021). The functional specialization of visual cortex emerges from training parallel pathways with self-supervised predictive learning. Advances in Neural Information Processing Systems, 34, 25164–25178.Google Scholar

Blauch, N. M., Behrmann, M., & Plaut, D. C. (2022). A connectivity-constrained computational account of topographic organization in primate high-level visual cortex. Proceedings of the National Academy of Sciences of the United States of America, 119, e2112566119.CrossRef Google Scholar PubMed

DiCarlo, J. J., & Cox, D. D. (2007). Untangling invariant object recognition. Trends in Cognitive Science, 11, 333–341.CrossRef Google Scholar PubMed

DiCarlo, J. J., Zoccolan, D., & Rust, R. C. (2012). How does the brain solve visual object recognition? Neuron, 73, 415–434.CrossRef Google Scholar PubMed

Jeong, S. K., & Xu, Y. (2017). Task-context dependent linear representation of multiple visual objects in human parietal cortex. Journal of Cognitive Neuroscience, 29, 1778–1789.CrossRef Google Scholar PubMed

Kanwisher, N., Khosla, M., & Dobs, K. (2023). Using artificial neural networks to ask ‘why’ questions of minds and brains. Trends in Neuroscience, 46, 240–254.CrossRef Google Scholar PubMed

Kay, K. N. (2018). Principles for models of neural information processing. NeuroImage, 180, 101–109.CrossRef Google Scholar PubMed

Mocz, V., Jeong, S. K., Chun, M., & Xu, Y. (2023). The representation of multiple visual objects in human ventral visual areas and in convolutional neural networks. Scientific Reports, 13, 9088.CrossRef Google Scholar

Serre, T. (2019). Deep learning: the good, the bad, and the ugly. Annual Review of Vision Science, 5, 399–426.CrossRef Google Scholar PubMed

Tacchetti, A., Isik, L., & Poggio, T. A. (2018). Invariant recognition shapes neural representations of visual input. Annual Review of Vision Science, 4, 403–422.CrossRef Google Scholar PubMed

Tang, K., Chin, M., Chun, M., & Xu, Y. (2022). The contribution of object identity and configuration to scene representation in convolutional neural networks. PLoS ONE, 17, e0270667.CrossRef Google Scholar PubMed

Taylor, J., & Xu, Y. (2021). Joint representation of color and shape in convolutional neural networks: A stimulus-rich network perspective. PLoS ONE, 16, e0253442.CrossRef Google Scholar

Vaziri-Pashkam, M., Taylor, J., & Xu, Y. (2019). Spatial frequency tolerant visual object representations in the human ventral and dorsal visual processing pathways. Journal of Cognitive Neuroscience, 31, 49–63.CrossRef Google Scholar PubMed

Vaziri-Pashkam, M., & Xu, Y. (2019). An information-driven 2-pathway characterization of occipitotemporal and posterior parietal visual object representations. Cerebral Cortex, 29, 2034–2050.CrossRef Google Scholar PubMed

Xu, Y., & Vaziri-Pashkam, M. (2021a). Limited correspondence in visual representation between the human brain and convolutional neural networks. Nature Communications, 12, 2065.CrossRef Google Scholar PubMed

Xu, Y., & Vaziri-Pashkam, M. (2021b). The coding of object identity and nonidentity features in human occipito-temporal cortex and convolutional neural networks. Journal of Neuroscience, 41, 4234–4252.CrossRef Google Scholar PubMed

Xu, Y., & Vaziri-Pashkam, M. (2022). Understanding transformation tolerant visual object representations in the human brain and convolutional neural networks. NeuroImage, 263, 119635.CrossRef Google Scholar PubMed

Explananda and explanantia in deep neural network models of neurological network functions

Mihnea Moldoveanu Mihnea Moldoveanu

Behavioral and Brain Sciences , Volume 46

A deep new look at color

Jelmer Philip de Vries Jelmer Philip de Vries ,

Alban Flachot Alban Flachot ,

Takuma Morimoto Takuma Morimoto and

Karl R. Gegenfurtner Karl R. Gegenfurtner

Behavioral and Brain Sciences , Volume 46

Beyond the limitations of any imaginable mechanism: Large language models and psycholinguistics

Conor Houghton Conor Houghton ,

Nina Kazanina Nina Kazanina and

Priyanka Sukumaran Priyanka Sukumaran

Behavioral and Brain Sciences , Volume 46

Comprehensive assessment methods are key to progress in deep learning

Michael W. Spratling

Behavioral and Brain Sciences , Volume 46

Deep neural networks are not a single hypothesis but a language for expressing computational hypotheses

Behavioral and Brain Sciences , Volume 46

Even deeper problems with neural network models of language

Thomas G. Bever Thomas G. Bever , Noam Chomsky , Sandiway Fong and Massimo Piattelli-Palmarini

Behavioral and Brain Sciences , Volume 46

Fixing the problems of deep neural networks will require better training data and learning algorithms

Drew Linsley and

Thomas Serre Thomas Serre

Behavioral and Brain Sciences , Volume 46

For deep networks, the whole equals the sum of the parts

Philip J. Kellman Philip J. Kellman , Nicholas Baker , Patrick Garrigan , Austin Phillips and Hongjing Lu

Behavioral and Brain Sciences , Volume 46

For human-like models, train on human-like tasks

Katherine Hermann Katherine Hermann ,

Aran Nayebi Aran Nayebi ,

Sjoerd van Steenkiste Sjoerd van Steenkiste and

Matt Jones Matt Jones

Behavioral and Brain Sciences , Volume 46

Going after the bigger picture: Using high-capacity models to understand mind and brain

Hans Op de Beeck Hans Op de Beeck and Stefania Bracci

Behavioral and Brain Sciences , Volume 46

Implications of capacity-limited, generative models for human vision

Joseph Scott German and

Robert A. Jacobs Robert A. Jacobs

Behavioral and Brain Sciences , Volume 46

Let's move forward: Image-computable models and a common model evaluation scheme are prerequisites for a scientific understanding of human vision

James J. DiCarlo James J. DiCarlo , Daniel L. K. Yamins , Michael E. Ferguson , Evelina Fedorenko , Matthias Bethge , Tyler Bonnen and Martin Schrimpf

Behavioral and Brain Sciences , Volume 46

Modelling human vision needs to account for subjective experience

Marcin Koculak Marcin Koculak and

Michał Wierzchoń Michał Wierzchoń

Behavioral and Brain Sciences , Volume 46

Models of vision need some action

Constantin Rothkopf Constantin Rothkopf , Frank Bremmer , Katja Fiehler , Katharina Dobs and Jochen Triesch

Behavioral and Brain Sciences , Volume 46

My pet pig won't fly and I want a refund

Michael J. Tarr Michael J. Tarr

Behavioral and Brain Sciences , Volume 46

Neither hype nor gloom do DNNs justice

Felix A. Wichmann Felix A. Wichmann ,

Simon Kornblith Simon Kornblith and

Robert Geirhos Robert Geirhos

Behavioral and Brain Sciences , Volume 46

Neural networks need real-world behavior

Aedan Y. Li Aedan Y. Li and

Marieke Mur Marieke Mur

Behavioral and Brain Sciences , Volume 46

Neural networks, AI, and the goals of modeling

Walter Veit Walter Veit and

Heather Browning Heather Browning

Behavioral and Brain Sciences , Volume 46

Perceptual learning in humans: An active, top-down-guided process

Heleen A. Slagter Heleen A. Slagter

Behavioral and Brain Sciences , Volume 46

Psychophysics may be the game-changer for deep neural networks (DNNs) to imitate the human vision

Keerthi S. Chandran Keerthi S. Chandran , Amrita Mukherjee Paul , Avijit Paul and

Kuntal Ghosh Kuntal Ghosh

Behavioral and Brain Sciences , Volume 46

Statistical prediction alone cannot identify good models of behavior

Nisheeth Srivastava Nisheeth Srivastava , Anjali Sifar and Narayanan Srinivasan

Behavioral and Brain Sciences , Volume 46

The model-resistant richness of human visual experience

Jianghao Liu Jianghao Liu and Paolo Bartolomeo

Behavioral and Brain Sciences , Volume 46

The scientific value of explanation and prediction

Hause Lin Hause Lin

Behavioral and Brain Sciences , Volume 46

There is a fundamental, unbridgeable gap between DNNs and the visual cortex

Moshe Gur Moshe Gur