Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mahmoud Gamal

Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Jun 06, 2021

Ahmed Badar, Arnav Varma, Adrian Staniec, Mahmoud Gamal, Omar Magdy, Haris Iqbal, Elahe Arani, Bahram Zonooz

Figure 1 for Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Figure 2 for Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Figure 3 for Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Figure 4 for Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Abstract:Convolutional neural networks (CNNs) have become commonplace in addressing major challenges in computer vision. Researchers are not only coming up with new CNN architectures but are also researching different techniques to improve the performance of existing architectures. However, there is a tendency to over-emphasize performance improvement while neglecting certain important variables such as simplicity, versatility, the fairness of comparisons, and energy efficiency. Overlooking these variables in architectural design and evaluation has led to research bias and a significantly negative environmental impact. Furthermore, this can undermine the positive impact of research in using deep learning models to tackle climate change. Here, we perform an extensive and fair empirical study of a number of proposed techniques to gauge the utility of each technique for segmentation and classification. Our findings restate the importance of favoring simplicity over complexity in model design (Occam's Razor). Furthermore, our results indicate that simple standardized practices can lead to a significant reduction in environmental impact with little drop in performance. We highlight that there is a need to rethink the design and evaluation of CNNs to alleviate the issue of research bias and carbon emissions.

Via

Access Paper or Ask Questions

Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Oct 17, 2018

Mennatullah Siam, Chen Jiang, Steven Lu, Laura Petrich, Mahmoud Gamal, Mohamed Elhoseiny, Martin Jagersand

Figure 1 for Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Figure 2 for Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Figure 3 for Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Figure 4 for Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Abstract:Video segmentation is a challenging task that has many applications in robotics. Learning segmentation from few examples on-line is important for robotics in unstructured environments. The total number of objects and their variation in the real world is intractable, but for a specific task the robot deals with a small subset. Our network is taught, by a human moving a hand-held object through different poses. A novel two-stream motion and appearance "teacher" network provides pseudo-labels. These labels are used to adapt an appearance "student" network. Segmentation can be used to support a variety of robot vision functionality, such as grasping or affordance segmentation. We propose different variants of motion adaptation training and extensively compare against the state-of-the-art methods. We collected a carefully designed dataset in the human robot interaction (HRI) setting. We denote our dataset as (L)ow-shot (O)bject (R)ecognition, (D)etection and (S)egmentation using HRI. Our dataset contains teaching videos of different hand-held objects moving in translation, scale and rotation. It contains kitchen manipulation tasks as well, performed by humans and robots. Our proposed method outperforms the state-of-the-art on DAVIS and FBMS with 7% and 1.2% in F-measure respectively. In our more challenging LORDS-HRI dataset, our approach achieves significantly better performance with 46.7% and 24.2% relative improvement in mIoU over the baseline.

* Submitted to ICRA'19

Via

Access Paper or Ask Questions