Picture for Dongxu Li

Dongxu Li

Aria: An Open Multimodal Native Mixture-of-Experts Model

Add code
Oct 08, 2024
Viaarxiv icon

EZSR: Event-based Zero-Shot Recognition

Add code
Jul 31, 2024
Viaarxiv icon

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

Add code
Jul 22, 2024
Viaarxiv icon

PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery

Add code
Jun 16, 2024
Figure 1 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 2 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 3 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Figure 4 for PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery
Viaarxiv icon

Design and Performance of Resonant Beam Communications -- Part I: Quasi-Static Scenario

Add code
Mar 25, 2024
Viaarxiv icon

Design and Performance of Resonant Beam Communications -- Part II: Mobile Scenario

Add code
Mar 25, 2024
Viaarxiv icon

Resonant Beam Communications: A New Design Paradigm and Challenges

Add code
Mar 25, 2024
Viaarxiv icon

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Add code
Jan 03, 2024
Viaarxiv icon

Fundamental Limitation of Semantic Communications: Neural Estimation for Rate-Distortion

Add code
Jan 02, 2024
Viaarxiv icon

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning

Add code
Nov 30, 2023
Figure 1 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Figure 2 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Figure 3 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Figure 4 for X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning
Viaarxiv icon