Mason Swofford

DANTE: Deep Affinity Network for Clustering Conversational Interactants

Jul 24, 2019

Mason Swofford, John Charles Peruzzi, Marynel Vázquez, Roberto Martín-Martín, Silvio Savarese

Abstract: We propose a data-driven approach to visually detect conversational groups by identifying spatial arrangements typical of these focused social encounters. Our approach uses a novel Deep Affinity Network (DANTE) to predict the likelihood that two individuals in a scene are part of the same conversational group, considering contextual information such as the position and orientation of other nearby individuals. The predicted pairwise affinities are then used in a graph clustering framework to identify both small groups (e.g., dyads) and larger ones. The results of our evaluation on two standard benchmarks suggest that combining powerful deep learning methods with classical clustering techniques can improve the detection of conversational groups compared with prior approaches. Our technique has a wide range of applications, from visual scene understanding (e.g., for surveillance) to social robotics.
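To illustrate how pairwise affinities might be turned into groups, here is a minimal sketch. It does not reproduce the paper's clustering framework: it swaps in a much simpler technique, thresholding the affinities and taking connected components with a union-find structure. The function name `cluster_by_affinity`, the 0.5 threshold, and the example affinity values are all assumptions made for illustration, not outputs of DANTE.

```python
from typing import Dict, List


def cluster_by_affinity(affinity: List[List[float]], threshold: float = 0.5) -> List[List[int]]:
    """Group people by thresholding pairwise affinities (illustrative only).

    affinity[i][j] is an assumed score in [0, 1] that persons i and j
    belong to the same conversational group.
    """
    n = len(affinity)
    parent = list(range(n))  # union-find forest, one node per person

    def find(i: int) -> int:
        # Follow parent links to the root, compressing the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            # Average both directions in case the scores are not symmetric.
            score = 0.5 * (affinity[i][j] + affinity[j][i])
            if score >= threshold:
                root_i, root_j = find(i), find(j)
                if root_i != root_j:
                    parent[max(root_i, root_j)] = min(root_i, root_j)

    groups: Dict[int, List[int]] = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical affinities for five people: a dyad (0, 1) and a triad (2, 3, 4).
example_affinity = [
    [1.0, 0.9, 0.1, 0.2, 0.1],
    [0.9, 1.0, 0.2, 0.1, 0.1],
    [0.1, 0.2, 1.0, 0.8, 0.7],
    [0.2, 0.1, 0.8, 1.0, 0.9],
    [0.1, 0.1, 0.7, 0.9, 1.0],
]

print(cluster_by_affinity(example_affinity))  # [[0, 1], [2, 3, 4]]
```

Connected components will merge two groups whenever a single pair crosses the threshold, so a real system would likely need a clustering step that is more robust to one spurious high affinity.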

* 6 pages

Image Completion on CIFAR-10

Oct 07, 2018

Mason Swofford

Abstract: This project performed image completion on CIFAR-10, a dataset of 60,000 32x32 RGB images, using three different neural network architectures: fully convolutional networks, convolutional networks with fully connected layers, and encoder-decoder convolutional networks. The best-performing model was a deep fully convolutional network, which achieved a mean squared error of 0.015 between the original and predicted pixel values. This network also produced in-painted images that appeared real to the human eye.
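For reference, the mean squared error metric mentioned above can be sketched as follows. This is only an illustration of the metric itself: the abstract does not state how pixel values were scaled, so the [0, 1] range used here is an assumption, and the function name and the four example values are hypothetical. A full 32x32 RGB image would contribute 32 * 32 * 3 = 3072 values per image.

```python
from typing import Sequence


def mean_squared_error(original: Sequence[float], predicted: Sequence[float]) -> float:
    """Average squared difference between two equally sized pixel sequences."""
    if len(original) != len(predicted):
        raise ValueError("original and predicted must have the same length")
    if len(original) == 0:
        raise ValueError("inputs must not be empty")
    return sum((o - p) ** 2 for o, p in zip(original, predicted)) / len(original)


# Hypothetical flattened pixel values, assumed to be scaled to [0, 1].
example_original = [0.0, 0.5, 1.0, 0.25]
example_predicted = [0.1, 0.5, 0.8, 0.25]

# Squared errors are about 0.01, 0, 0.04, and 0, so the mean is approximately 0.0125.
print(mean_squared_error(example_original, example_predicted))
```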

* 6 pages, 4 figures

Conversational Group Detection With Deep Convolutional Networks

Oct 07, 2018

Mason Swofford, John Peruzzi

Abstract: Detection of interacting and conversational groups from images has applications in video surveillance and social robotics. In this paper, we build on prior attempts to find conversational groups by detecting social gathering spaces called o-spaces, which are used to assign people to groups. As our contributions to the task, this is the first work to incorporate features extracted from the room layout image, and the first to use a deep network to generate an image representation of the proposed o-spaces. Specifically, this novel network builds on the PointNet architecture, which allows unordered inputs of variable sizes. We present accuracies that rival and sometimes exceed those of the best models, but because of a data imbalance issue, we do not yet outperform existing models in our test results.

* 6 pages
