Picture for Xu Yang

Xu Yang

STHFL: Spatio-Temporal Heterogeneous Federated Learning

Add code
Jan 10, 2025
Viaarxiv icon

VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception

Add code
Jan 06, 2025
Figure 1 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Figure 2 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Figure 3 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Figure 4 for VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception
Viaarxiv icon

Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark

Add code
Dec 12, 2024
Viaarxiv icon

RL2: Reinforce Large Language Model to Assist Safe Reinforcement Learning for Energy Management of Active Distribution Networks

Add code
Dec 02, 2024
Viaarxiv icon

DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline

Add code
Dec 02, 2024
Figure 1 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Figure 2 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Figure 3 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Figure 4 for DaDu-E: Rethinking the Role of Large Language Model in Robotic Computing Pipeline
Viaarxiv icon

BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning

Add code
Nov 28, 2024
Viaarxiv icon

Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens

Add code
Nov 23, 2024
Figure 1 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 2 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 3 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Figure 4 for Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
Viaarxiv icon

Number it: Temporal Grounding Videos like Flipping Manga

Add code
Nov 15, 2024
Viaarxiv icon

Enhancing DP-SGD through Non-monotonous Adaptive Scaling Gradient Weight

Add code
Nov 05, 2024
Viaarxiv icon

Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising

Add code
Nov 05, 2024
Figure 1 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Figure 2 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Figure 3 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Figure 4 for Gradient-Guided Conditional Diffusion Models for Private Image Reconstruction: Analyzing Adversarial Impacts of Differential Privacy and Denoising
Viaarxiv icon