Abstract: Learning complex tasks from scratch is challenging, and often impossible, for humans as well as for artificial agents. A curriculum can be used instead, which decomposes a complex task (the target task) into a sequence of source tasks (the curriculum). Each source task is a simplified version of the next, so complexity increases along the sequence. Learning then proceeds gradually by training on each source task while reusing knowledge acquired from the curriculum's prior source tasks. In this study, we present a new algorithm that combines curriculum learning with Hindsight Experience Replay (HER) to learn sequential object manipulation tasks with multiple goals and sparse feedback. The algorithm exploits the recurrent structure inherent in many object manipulation tasks and implements the entire learning process in the original simulation, without adjusting it to each source task. We tested our algorithm on three challenging throwing tasks and show vast improvements compared to vanilla HER.
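The abstract does not fix a concrete interface, so the following is a minimal, self-contained sketch of such a curriculum loop, assuming source tasks are realized as goal ranges of increasing difficulty inside one simulation; the agent class, the difficulty values, and the promotion rule are illustrative assumptions, not the paper's implementation.

\begin{verbatim}
# Curriculum-over-HER training loop (sketch; all names are assumed).
import random

class HERAgentStub:
    """Stand-in for a goal-conditioned agent trained with HER."""
    skill = 0.0
    def rollout(self, difficulty):
        # Pretend competence grows with experience; return a success flag.
        self.skill += 0.001
        return random.random() < min(self.skill / difficulty, 1.0)

agent = HERAgentStub()
curriculum = [0.2, 0.5, 1.0]   # source-task difficulties, easy -> hard
for difficulty in curriculum:
    successes = 0
    for episode in range(1, 5001):
        successes += agent.rollout(difficulty)
        # Promote once the running success rate is high enough; the agent
        # (weights and replay buffer) carries over unchanged, so the
        # simulation itself is never adjusted per source task.
        if episode >= 100 and successes / episode > 0.9:
            break
\end{verbatim}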
Abstract: The development of autonomous robotic systems that can learn from human demonstrations to imitate a desired behavior, rather than being manually programmed, has huge technological potential. One major challenge in imitation learning is the correspondence problem: how to establish corresponding states and actions between the expert and the learner when the embodiments of the agents differ (morphology, dynamics, degrees of freedom, etc.). Many existing approaches in imitation learning circumvent the correspondence problem, for example by kinesthetic teaching or teleoperation, which are performed directly on the robot. In this work, we explicitly address the correspondence problem by introducing a distance measure between dissimilar embodiments. This measure is then used as a loss function for static pose imitation and as a feedback signal within a model-free deep reinforcement learning framework for dynamic movement imitation between two anthropomorphic robotic arms in simulation. We find that the measure is well suited both for describing the similarity between embodiments and for learning imitation policies by distance minimization.
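As a concrete illustration, one simple form such a measure could take (an assumption for illustration; the paper's exact definition may differ) is the mean distance between corresponding keypoints of the two kinematic chains after normalizing each chain by its total length, which makes arms of different sizes comparable:

\begin{verbatim}
# Sketch of a scale-invariant embodiment distance (illustrative only).
import numpy as np

def embodiment_distance(expert_pts, learner_pts):
    """expert_pts, learner_pts: (K, 3) base-relative keypoints of the
    two arms, matched by index (shoulder, elbow, wrist, ...)."""
    def normalize(pts):
        # Scale by total chain length so embodiment size cancels out.
        chain_len = np.linalg.norm(np.diff(pts, axis=0), axis=1).sum()
        return pts / max(chain_len, 1e-8)
    diff = normalize(expert_pts) - normalize(learner_pts)
    return float(np.mean(np.linalg.norm(diff, axis=1)))

expert  = np.array([[0, 0, 0], [0, 0, 0.40], [0, 0.30, 0.40]])
learner = np.array([[0, 0, 0], [0, 0, 0.55], [0, 0.40, 0.55]])
print(embodiment_distance(expert, learner))   # loss for pose imitation
# As an RL feedback signal: reward = -embodiment_distance(e, l)
\end{verbatim}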
Abstract: Learning optimal policies from sparse feedback is a known challenge in reinforcement learning. Hindsight Experience Replay (HER) is a multi-goal reinforcement learning algorithm designed to solve such tasks. The algorithm treats every failure as a success for an alternative (virtual) goal that was achieved in the episode and then generalizes from that virtual goal to real goals. HER has known flaws and is limited to relatively simple tasks. In this thesis, we present three algorithms, based on the existing HER algorithm, that improve its performance. First, we prioritize virtual goals from which the agent will learn more valuable information. We call this property the \textit{instructiveness} of the virtual goal and define it by a heuristic measure, which expresses how well the agent will be able to generalize from that virtual goal to actual goals. Second, we design a filtering process that detects and removes misleading samples that may induce bias throughout the learning process. Last, we enable the learning of complex, sequential tasks using a form of curriculum learning combined with HER; we call this algorithm \textit{Curriculum HER}. To test our algorithms, we built three challenging manipulation environments with sparse reward functions, each with three levels of complexity. Our empirical results show vast improvement in the final success rate and sample efficiency when compared to the original HER algorithm.
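To make the two replay-side ideas concrete, here is a minimal sketch of instructiveness-weighted virtual-goal selection with a filtering step; the particular heuristic (closeness of an achieved state to the real-goal distribution) and the filtering rule shown are illustrative assumptions, not the thesis's exact definitions.

\begin{verbatim}
# Prioritized virtual-goal sampling with filtering (sketch).
import numpy as np

def select_virtual_goals(achieved, real_goals, n, eps=1e-6):
    """achieved: (T, d) states reached in one episode.
    real_goals: (M, d) goals drawn from the actual task distribution."""
    # Instructiveness heuristic: achieved states close to real goals
    # should generalize better, so they get higher sampling weight.
    d = np.linalg.norm(achieved[:, None, :] - real_goals[None, :, :], axis=2)
    scores = 1.0 / (d.min(axis=1) + eps)
    # Filtering: drop misleading candidates, e.g. states indistinguishable
    # from the episode's initial state (assumed rule, for illustration).
    keep = np.linalg.norm(achieved - achieved[0], axis=1) > eps
    scores = np.where(keep, scores, 0.0)
    if scores.sum() == 0.0:
        scores = np.ones(len(achieved))          # fall back to uniform
    probs = scores / scores.sum()
    idx = np.random.choice(len(achieved), size=n, p=probs)
    return achieved[idx]
\end{verbatim}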
Abstract: Hindsight Experience Replay (HER) is a multi-goal reinforcement learning algorithm for sparse reward functions. The algorithm treats every failure as a success for an alternative (virtual) goal that was achieved in the episode. Virtual goals are selected randomly, irrespective of which are most instructive for the agent. In this paper, we present two improvements over the existing HER algorithm. First, we prioritize virtual goals from which the agent will learn more valuable information. We call this property the instructiveness of the virtual goal and define it by a heuristic measure, which expresses how well the agent will be able to generalize from that virtual goal to actual goals. Second, we reduce existing bias in HER by removing misleading samples. To test our algorithms, we built two challenging environments with sparse reward functions. Our empirical results in both environments show vast improvement in the final success rate and sample efficiency when compared to the original HER algorithm.
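For reference, the relabeling step that both improvements build on can be sketched as follows, using the common "future" strategy from the HER literature; the transition layout, the distance threshold, and the value of k are assumptions for illustration.

\begin{verbatim}
# HER relabeling with the "future" strategy (sketch).
import random

def her_relabel(episode, k=4, threshold=0.05):
    """episode: list of dicts with keys 'obs', 'action', 'achieved', 'goal'."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
    relabeled = []
    for t, step in enumerate(episode):
        for _ in range(k):
            # A state actually achieved later in the episode becomes the
            # virtual goal, so a failed transition turns into a success.
            virtual = random.choice(episode[t:])['achieved']
            reward = 0.0 if dist(step['achieved'], virtual) < threshold else -1.0
            relabeled.append({**step, 'goal': virtual, 'reward': reward})
    return relabeled
\end{verbatim}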