Picture for Di Fu

Di Fu

The 1st InterAI Workshop: Interactive AI for Human-centered Robotics

Add code
Sep 17, 2024
Viaarxiv icon

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Add code
Aug 30, 2024
Viaarxiv icon

Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer

Add code
Aug 30, 2024
Figure 1 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 2 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 3 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 4 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Viaarxiv icon

Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction

Add code
Aug 29, 2024
Viaarxiv icon

Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

Add code
Jul 20, 2024
Viaarxiv icon

Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models

Add code
May 07, 2024
Viaarxiv icon

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Apr 02, 2024
Figure 1 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 2 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 3 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 4 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Viaarxiv icon

Human Impression of Humanoid Robots Mirroring Social Cues

Add code
Jan 22, 2024
Viaarxiv icon

The Emotional Dilemma: Influence of a Human-like Robot on Trust and Cooperation

Add code
Jul 06, 2023
Viaarxiv icon

The Robot in the Room: Influence of Robot Facial Expressions and Gaze on Human-Human-Robot Collaboration

Add code
Mar 24, 2023
Viaarxiv icon