Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kazushige Okayasu

Pre-training without Natural Images

Jan 21, 2021

Hirokatsu Kataoka, Kazushige Okayasu, Asato Matsumoto, Eisuke Yamagata, Ryosuke Yamada, Nakamasa Inoue, Akio Nakamura, Yutaka Satoh

Figure 1 for Pre-training without Natural Images

Figure 2 for Pre-training without Natural Images

Figure 3 for Pre-training without Natural Images

Figure 4 for Pre-training without Natural Images

Abstract:Is it possible to use convolutional neural networks pre-trained without any natural images to assist natural image understanding? The paper proposes a novel concept, Formula-driven Supervised Learning. We automatically generate image patterns and their category labels by assigning fractals, which are based on a natural law existing in the background knowledge of the real world. Theoretically, the use of automatically generated images instead of natural images in the pre-training phase allows us to generate an infinite scale dataset of labeled images. Although the models pre-trained with the proposed Fractal DataBase (FractalDB), a database without natural images, does not necessarily outperform models pre-trained with human annotated datasets at all settings, we are able to partially surpass the accuracy of ImageNet/Places pre-trained models. The image representation with the proposed FractalDB captures a unique feature in the visualization of convolutional layers and attentions.

* ACCV 2020 Best Paper Honorable Mention Award, Codes are publicly available: https://github.com/hirokatsukataoka16/FractalDB-Pretrained-ResNet-PyTorch

Via

Access Paper or Ask Questions

cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey

Jul 20, 2017

Hirokatsu Kataoka, Soma Shirakabe, Yun He, Shunya Ueta, Teppei Suzuki, Kaori Abe, Asako Kanezaki, Shin'ichiro Morita, Toshiyuki Yabe, Yoshihiro Kanehara(+7 more)

Figure 1 for cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey

Figure 2 for cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey

Figure 3 for cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey

Figure 4 for cvpaper.challenge in 2016: Futuristic Computer Vision through 1,600 Papers Survey

Abstract:The paper gives futuristic challenges disscussed in the cvpaper.challenge. In 2015 and 2016, we thoroughly study 1,600+ papers in several conferences/journals such as CVPR/ICCV/ECCV/NIPS/PAMI/IJCV.

Via

Access Paper or Ask Questions

Could you guess an interesting movie from the posters?: An evaluation of vision-based features on movie poster database

Apr 07, 2017

Yuta Matsuzaki, Kazushige Okayasu, Takaaki Imanari, Naomichi Kobayashi, Yoshihiro Kanehara, Ryousuke Takasawa, Akio Nakamura, Hirokatsu Kataoka

Figure 1 for Could you guess an interesting movie from the posters?: An evaluation of vision-based features on movie poster database

Figure 2 for Could you guess an interesting movie from the posters?: An evaluation of vision-based features on movie poster database

Figure 3 for Could you guess an interesting movie from the posters?: An evaluation of vision-based features on movie poster database

Figure 4 for Could you guess an interesting movie from the posters?: An evaluation of vision-based features on movie poster database

Abstract:In this paper, we aim to estimate the Winner of world-wide film festival from the exhibited movie poster. The task is an extremely challenging because the estimation must be done with only an exhibited movie poster, without any film ratings and box-office takings. In order to tackle this problem, we have created a new database which is consist of all movie posters included in the four biggest film festivals. The movie poster database (MPDB) contains historic movies over 80 years which are nominated a movie award at each year. We apply a couple of feature types, namely hand-craft, mid-level and deep feature to extract various information from a movie poster. Our experiments showed suggestive knowledge, for example, the Academy award estimation can be better rate with a color feature and a facial emotion feature generally performs good rate on the MPDB. The paper may suggest a possibility of modeling human taste for a movie recommendation.

* 4 pages, 4 figures

Via

Access Paper or Ask Questions