Picture for Di Fu

Di Fu

Advancing User-Voice Interaction: Exploring Emotion-Aware Voice Assistants Through a Role-Swapping Approach

Add code
Feb 21, 2025
Viaarxiv icon

The 1st InterAI Workshop: Interactive AI for Human-centered Robotics

Add code
Sep 17, 2024
Viaarxiv icon

Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer

Add code
Aug 30, 2024
Figure 1 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 2 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 3 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 4 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Viaarxiv icon

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Add code
Aug 30, 2024
Figure 1 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 2 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 3 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 4 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Viaarxiv icon

Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction

Add code
Aug 29, 2024
Viaarxiv icon

Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

Add code
Jul 20, 2024
Viaarxiv icon

Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models

Add code
May 07, 2024
Viaarxiv icon

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Apr 02, 2024
Figure 1 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 2 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 3 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 4 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Viaarxiv icon

Human Impression of Humanoid Robots Mirroring Social Cues

Add code
Jan 22, 2024
Viaarxiv icon

The Emotional Dilemma: Influence of a Human-like Robot on Trust and Cooperation

Add code
Jul 06, 2023
Viaarxiv icon