Ming Li

University of Waterloo

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025

Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation

Feb 18, 2025

Text2World: Benchmarking Large Language Models for Symbolic World Model Generation

Feb 18, 2025

Out-of-Distribution Detection on Graphs: A Survey

Feb 12, 2025

Generative Adversarial Networks Bridging Art and Machine Intelligence

Feb 09, 2025

Target Detection in OFDM-ISAC Systems: A Multipath Exploitation Approach

Jan 14, 2025

Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models

Jan 14, 2025

Target Detection in ISAC Systems with Active RISs: A Multi-Perspective Observation Approach

Jan 11, 2025

Repeat-bias-aware Optimization of Beyond-accuracy Metrics for Next Basket Recommendation

Jan 10, 2025

Monotonic Learning in the PAC Framework: A New Perspective

Jan 09, 2025