Hui Lu

Investigating Decoder-only Large Language Models for Speech-to-text Translation

Jul 03, 2024
Via arXiv

VideoMambaPro: A Leap Forward for Mamba in Video Understanding

Jun 27, 2024
Via arXiv

MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection

Jun 05, 2024
Via arXiv

Addressing Index Collapse of Large-Codebook Speech Tokenizer with Dual-Decoding Product-Quantized Variational Auto-Encoder

Jun 05, 2024
Via arXiv

Enhancing Video Transformers for Action Understanding with VLM-aided Training

Mar 24, 2024
Via arXiv

MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation

Mar 19, 2024
Via arXiv

An Empirical Study of Speech Language Models for Prompt-Conditioned Speech Synthesis

Mar 19, 2024
Via arXiv

TCNet: Continuous Sign Language Recognition from Trajectories and Correlated Regions

Mar 18, 2024
Via arXiv

Olapa-MCoT: Enhancing the Chinese Mathematical Reasoning Capability of LLMs

Dec 29, 2023
Via arXiv

Compensation Sampling for Improved Convergence in Diffusion Models

Dec 11, 2023
Via arXiv