Picture for Xu Yang

Xu Yang

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Add code
Dec 12, 2024
Viaarxiv icon

DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline

Add code
Dec 02, 2024
Figure 1 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Figure 2 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Figure 3 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Figure 4 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Viaarxiv icon

RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks

Add code
Dec 02, 2024
Viaarxiv icon

BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning

Add code
Nov 28, 2024
Viaarxiv icon

Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens

Add code
Nov 23, 2024
Figure 1 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 2 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 3 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 4 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Viaarxiv icon

Number it: Temporal Grounding Videos like Flipping Manga

Add code
Nov 15, 2024
Viaarxiv icon

Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising

Add code
Nov 05, 2024
Figure 1 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Figure 2 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Figure 3 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Figure 4 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Viaarxiv icon

Enhancing DP-SGD through Non-monotonous Adaptive Scaling Gradient Weight

Add code
Nov 05, 2024
Viaarxiv icon

Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks

Add code
Oct 31, 2024
Figure 1 for Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
Figure 2 for Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
Figure 3 for Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
Figure 4 for Navigating the Unknown: A Chat-Based Collaborative Interface for Personalized Exploratory Tasks
Viaarxiv icon

Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping

Add code
Oct 18, 2024
Figure 1 for Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping
Figure 2 for Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping
Figure 3 for Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping
Figure 4 for Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping
Viaarxiv icon