Picture for Chong Zhang

Chong Zhang

Tony

A Novel ViDAR Device With Visual Inertial Encoder Odometry and Reinforcement Learning-Based Active SLAM Method

Add code
Jun 16, 2025
Viaarxiv icon

Attention-Based Map Encoding for Learning Generalized Legged Locomotion

Add code
Jun 11, 2025
Viaarxiv icon

OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment

Add code
Jun 11, 2025
Viaarxiv icon

Plug-and-Play Co-Occurring Face Attention for Robust Audio-Visual Speaker Extraction

Add code
May 27, 2025
Viaarxiv icon

Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models

Add code
May 22, 2025
Viaarxiv icon

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Add code
May 01, 2025
Viaarxiv icon

SegOTA: Accelerating Over-the-Air Federated Learning with Segmented Transmission

Add code
Apr 13, 2025
Viaarxiv icon

ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation

Add code
Mar 25, 2025
Viaarxiv icon

UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook

Add code
Feb 27, 2025
Viaarxiv icon

MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation

Add code
Feb 17, 2025
Viaarxiv icon