Picture for Xuan He

Xuan He

Violet

Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?

Add code
Oct 27, 2024
Viaarxiv icon

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Add code
Oct 14, 2024
Viaarxiv icon

VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation

Add code
Jun 24, 2024
Figure 1 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 2 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 3 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Figure 4 for VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation
Viaarxiv icon

MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark

Add code
Jun 04, 2024
Figure 1 for MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Figure 2 for MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Figure 3 for MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Figure 4 for MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
Viaarxiv icon

LocMoE+: Enhanced Router with Token Feature Awareness for Efficient LLM Pre-Training

Add code
May 24, 2024
Viaarxiv icon

MANTIS: Interleaved Multi-Image Instruction Tuning

Add code
May 02, 2024
Viaarxiv icon

Real-time Transformer-based Open-Vocabulary Detection with Efficient Fusion Head

Add code
Mar 11, 2024
Viaarxiv icon

EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving

Add code
Feb 28, 2024
Viaarxiv icon

LocMoE: A Low-overhead MoE for Large Language Model Training

Add code
Jan 25, 2024
Viaarxiv icon

Beam-Delay Domain Channel Estimation for mmWave XL-MIMO Systems

Add code
Dec 10, 2023
Viaarxiv icon