Picture for Hang Li

Hang Li

NEC Corporation

GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation

Add code
Oct 08, 2024
Viaarxiv icon

FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models

Add code
Oct 07, 2024
Viaarxiv icon

ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection

Add code
Oct 06, 2024
Figure 1 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Figure 2 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Figure 3 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Figure 4 for ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Viaarxiv icon

A LLM-Powered Automatic Grading Framework with Human-Level Guidelines Optimization

Add code
Oct 03, 2024
Viaarxiv icon

Sub-graph Based Diffusion Model for Link Prediction

Add code
Sep 13, 2024
Viaarxiv icon

Knowledge Tagging with Large Language Model based Multi-Agent System

Add code
Sep 12, 2024
Viaarxiv icon

CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network

Add code
Aug 21, 2024
Viaarxiv icon

Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Add code
Jul 31, 2024
Viaarxiv icon

DexGANGrasp: Dexterous Generative Adversarial Grasping Synthesis for Task-Oriented Manipulation

Add code
Jul 24, 2024
Viaarxiv icon

Efffcient Sensing Parameter Estimation with Direct Clutter Mitigation in Perceptive Mobile Networks

Add code
Jul 24, 2024
Viaarxiv icon