Ming Bo Cai

Learning 3D object-centric representation through prediction

Mar 06, 2024

John Day, Tushar Arora, Jirui Liu, Li Erran Li, Ming Bo Cai

Abstract: As part of human core knowledge, the representation of objects is the building block of mental representation that supports high-level concepts and symbolic reasoning. While humans develop the ability to perceive objects situated in 3D environments without supervision, models that learn the same set of abilities under constraints similar to those faced by human infants are lacking. Towards this end, we developed a novel network architecture that simultaneously learns to 1) segment objects from discrete images, 2) infer their 3D locations, and 3) perceive depth, all while using only information directly available to the brain as training data, namely: sequences of images and self-motion. The core idea is treating objects as latent causes of visual input which the brain uses to make efficient predictions of future scenes. This results in object representations being learned as an essential byproduct of learning to predict.

* 21 pages, 11 figures. Project webpage can be found at https://jday54.github.io/opple_site/

Incorporating structured assumptions with probabilistic graphical models in fMRI data analysis

May 29, 2020

Ming Bo Cai, Michael Shvartsman, Anqi Wu, Hejia Zhang, Xia Zhu

Abstract: With the wide adoption of functional magnetic resonance imaging (fMRI) by cognitive neuroscience researchers, large volumes of brain imaging data have been accumulated in recent years. Aggregating these data to derive scientific insights often faces the challenge that fMRI data are high-dimensional, heterogeneous across people, and noisy. These challenges demand the development of computational tools that are tailored both for the neuroscience questions and for the properties of the data. We review a few recently developed algorithms in various domains of fMRI research: fMRI in naturalistic tasks, analyzing full-brain functional connectivity, pattern classification, inferring representational similarity and modeling structured residuals. These algorithms all tackle the challenges in fMRI similarly: they start by making clear statements of assumptions about neural data and existing domain knowledge, incorporating those assumptions and domain knowledge into probabilistic graphical models, and using those models to estimate properties of interest or latent structures in the data. Such approaches can avoid erroneous findings, reduce the impact of noise, better utilize known properties of the data, and better aggregate data across groups of subjects. With these successful cases, we advocate wider adoption of explicit model construction in cognitive neuroscience. Although we focus on fMRI, the principle illustrated here is generally applicable to brain data of other modalities.

* Neuropsychologia, 107500 (2020)
* Updated with the version accepted by Neuropsychologia
