Picture for Lu Wang

Lu Wang

CSSE, Shenzhen University

SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models

Add code
Dec 17, 2024
Viaarxiv icon

Large Action Models: From Inception to Implementation

Add code
Dec 13, 2024
Figure 1 for Large Action Models: From Inception to Implementation
Figure 2 for Large Action Models: From Inception to Implementation
Figure 3 for Large Action Models: From Inception to Implementation
Figure 4 for Large Action Models: From Inception to Implementation
Viaarxiv icon

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Add code
Dec 05, 2024
Viaarxiv icon

ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction

Add code
Dec 04, 2024
Viaarxiv icon

Visual Adversarial Attack on Vision-Language Models for Autonomous Driving

Add code
Nov 27, 2024
Figure 1 for Visual Adversarial Attack on Vision-Language Models for Autonomous Driving
Figure 2 for Visual Adversarial Attack on Vision-Language Models for Autonomous Driving
Figure 3 for Visual Adversarial Attack on Vision-Language Models for Autonomous Driving
Figure 4 for Visual Adversarial Attack on Vision-Language Models for Autonomous Driving
Viaarxiv icon

Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation

Add code
Nov 11, 2024
Figure 1 for Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation
Figure 2 for Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation
Figure 3 for Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation
Figure 4 for Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation
Viaarxiv icon

RuAG: Learned-rule-augmented Generation for Large Language Models

Add code
Nov 04, 2024
Figure 1 for RuAG: Learned-rule-augmented Generation for Large Language Models
Figure 2 for RuAG: Learned-rule-augmented Generation for Large Language Models
Figure 3 for RuAG: Learned-rule-augmented Generation for Large Language Models
Figure 4 for RuAG: Learned-rule-augmented Generation for Large Language Models
Viaarxiv icon

Token-level Proximal Policy Optimization for Query Generation

Add code
Nov 01, 2024
Viaarxiv icon

Self-Evolved Reward Learning for LLMs

Add code
Nov 01, 2024
Figure 1 for Self-Evolved Reward Learning for LLMs
Figure 2 for Self-Evolved Reward Learning for LLMs
Figure 3 for Self-Evolved Reward Learning for LLMs
Figure 4 for Self-Evolved Reward Learning for LLMs
Viaarxiv icon

FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation

Add code
Oct 29, 2024
Viaarxiv icon