Picture for Dongxu Li

Dongxu Li

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Add code
Nov 20, 2024
Figure 1 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 2 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 3 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Figure 4 for VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
Viaarxiv icon

Aria: An Open Multimodal Native Mixture-of-Experts Model

Add code
Oct 08, 2024
Figure 1 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Figure 2 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Figure 3 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Figure 4 for Aria: An Open Multimodal Native Mixture-of-Experts Model
Viaarxiv icon

EZSR: Event-based Zero-Shot Recognition

Add code
Jul 31, 2024
Figure 1 for EZSR: Event-based Zero-Shot Recognition
Figure 2 for EZSR: Event-based Zero-Shot Recognition
Figure 3 for EZSR: Event-based Zero-Shot Recognition
Figure 4 for EZSR: Event-based Zero-Shot Recognition
Viaarxiv icon

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

Add code
Jul 22, 2024
Viaarxiv icon

PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery

Add code
Jun 16, 2024
Figure 1 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 2 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 3 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 4 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Viaarxiv icon

Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario

Add code
Mar 25, 2024
Viaarxiv icon

Resonant Beam Communications: A New Design Paradigm and Challenges

Add code
Mar 25, 2024
Figure 1 for Resonant Beam Communications: A New Design Paradigm and Challenges
Figure 2 for Resonant Beam Communications: A New Design Paradigm and Challenges
Figure 3 for Resonant Beam Communications: A New Design Paradigm and Challenges
Figure 4 for Resonant Beam Communications: A New Design Paradigm and Challenges
Viaarxiv icon

Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario

Add code
Mar 25, 2024
Figure 1 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Figure 2 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Figure 3 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Figure 4 for Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario
Viaarxiv icon

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Add code
Jan 03, 2024
Figure 1 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Figure 2 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Figure 3 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Figure 4 for Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions
Viaarxiv icon

Fundamental Limitation of Semantic Communications: Neural Estimation for Rate-Distortion

Add code
Jan 02, 2024
Viaarxiv icon