Dotan Kaufman

Balancing Specialization, Generalization, and Compression for Detection and Tracking

Sep 25, 2019

Dotan Kaufman, Koby Bibas, Eran Borenstein, Michael Chertok, Tal Hassner

Abstract: We propose a method for specializing deep detectors and trackers to restricted settings. Our approach is designed with the following goals in mind: (a) improving accuracy in restricted domains; (b) preventing overfitting to new domains and forgetting of generalized capabilities; (c) aggressive model compression and acceleration. To this end, we propose a novel loss that balances compression and acceleration of a deep learning model against loss of generalization capabilities. We apply our method to existing tracker and detector models. We report detection results on the VIRAT and CAVIAR data sets. These results show that our method offers unprecedented compression rates along with improved detection. We apply our loss for tracker compression at test time, as the tracker processes each video. Our tests on the OTB2015 benchmark show that applying compression at test time actually improves tracking performance.

* Accepted to BMVC 2019

Temporal Tessellation: A Unified Approach for Video Analysis

Apr 14, 2017

Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf

Abstract: We present a general approach to video understanding, inspired by semantic transfer techniques that have been successfully used for 2D image analysis. Our method considers a video to be a 1D sequence of clips, each one associated with its own semantics. The nature of these semantics -- natural language captions or other labels -- depends on the task at hand. A test video is processed by forming correspondences between its clips and the clips of reference videos with known semantics, after which the reference semantics can be transferred to the test video. We describe two matching methods, both designed to ensure that (a) reference clips appear similar to test clips and (b) taken together, the semantics of the selected reference clips are consistent and maintain temporal coherence. We use our method for video captioning on the LSMDC'16 benchmark, video summarization on the SumMe and TVSum benchmarks, Temporal Action Detection on the Thumos2014 benchmark, and sound prediction on the Greatest Hits benchmark. Our method not only surpasses the state of the art in four out of five benchmarks but, importantly, is the only single method we know of that was successfully applied to such a diverse range of tasks.
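
The matching idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function name `tessellate`, the squared-distance costs, and the `coherence_weight` parameter are all assumptions made for illustration. It picks one reference clip per test clip with a Viterbi-style dynamic program, trading off appearance similarity (goal a) against a stand-in coherence term that penalizes jumps between dissimilar reference semantics (goal b).

```python
import numpy as np

def tessellate(test_feats, ref_feats, ref_sem_feats, coherence_weight=1.0):
    """Choose one reference clip index per test clip (hypothetical sketch).

    test_feats:    (T, d) appearance features of the test clips
    ref_feats:     (R, d) appearance features of the reference clips
    ref_sem_feats: (R, k) embeddings of the reference semantics
    """
    # Unary cost: squared appearance distance between test and reference clips.
    unary = ((test_feats[:, None, :] - ref_feats[None, :, :]) ** 2).sum(-1)
    # Pairwise cost (an assumed stand-in for temporal coherence): squared
    # distance between the semantics of consecutively selected reference clips.
    pairwise = ((ref_sem_feats[:, None, :] - ref_sem_feats[None, :, :]) ** 2).sum(-1)
    pairwise = coherence_weight * pairwise

    num_test, num_ref = unary.shape
    cost = unary[0].copy()
    back = np.zeros((num_test, num_ref), dtype=int)
    for t in range(1, num_test):
        # total[i, j]: best cost ending at reference i, then moving to j.
        total = cost[:, None] + pairwise
        back[t] = total.argmin(axis=0)
        cost = total.min(axis=0) + unary[t]

    # Backtrack from the cheapest final state to recover the full selection.
    path = [int(cost.argmin())]
    for t in range(num_test - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]
```

Given the returned indices, transferring semantics is a lookup, for example `[ref_captions[i] for i in path]` for a hypothetical list `ref_captions`. With `coherence_weight=0` the sketch reduces to independent nearest-neighbor matching per clip; larger values favor selections whose semantics change smoothly over time.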
