Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mohamed Saleh

Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

Apr 28, 2023

Hendrik Sommerhoff, Shashank Agnihotri, Mohamed Saleh, Michael Moeller, Margret Keuper, Andreas Kolb

Figure 1 for Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

Figure 2 for Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

Figure 3 for Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

Figure 4 for Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

Abstract:The success of deep learning is frequently described as the ability to train all parameters of a network on a specific application in an end-to-end fashion. Yet, several design choices on the camera level, including the pixel layout of the sensor, are considered as pre-defined and fixed, and high resolution, regular pixel layouts are considered to be the most generic ones in computer vision and graphics, treating all regions of an image as equally important. While several works have considered non-uniform, \eg, hexagonal or foveated, pixel layouts in hardware and image processing, the layout has not been integrated into the end-to-end learning paradigm so far. In this work, we present the first truly end-to-end trained imaging pipeline that optimizes the size and distribution of pixels on the imaging sensor jointly with the parameters of a given neural network on a specific task. We derive an analytic, differentiable approach for the sensor layout parameterization that allows for task-specific, local varying pixel resolutions. We present two pixel layout parameterization functions: rectangular and curvilinear grid shapes that retain a regular topology. We provide a drop-in module that approximates sensor simulation given existing high-resolution images to directly connect our method with existing deep learning models. We show that network predictions benefit from learnable pixel layouts for two different downstream tasks, classification and semantic segmentation.

Via

Access Paper or Ask Questions

Cross-Country Skiing Gears Classification using Deep Learning

Jun 27, 2017

Aliaa Rassem, Mohammed El-Beltagy, Mohamed Saleh

Figure 1 for Cross-Country Skiing Gears Classification using Deep Learning

Figure 2 for Cross-Country Skiing Gears Classification using Deep Learning

Figure 3 for Cross-Country Skiing Gears Classification using Deep Learning

Figure 4 for Cross-Country Skiing Gears Classification using Deep Learning

Abstract:Human Activity Recognition has witnessed a significant progress in the last decade. Although a great deal of work in this field goes in recognizing normal human activities, few studies focused on identifying motion in sports. Recognizing human movements in different sports has high impact on understanding the different styles of humans in the play and on improving their performance. As deep learning models proved to have good results in many classification problems, this paper will utilize deep learning to classify cross-country skiing movements, known as gears, collected using a 3D accelerometer. It will also provide a comparison between different deep learning models such as convolutional and recurrent neural networks versus standard multi-layer perceptron. Results show that deep learning is more effective and has the highest classification accuracy.

* 15 pages, 8 figures, 1 table

Via

Access Paper or Ask Questions