Kai Liu

Refer to the report for detailed contributions.

UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection

Mar 15, 2025
Via arXiv

CondiQuant: Condition Number Based Low-Bit Quantization for Image Super-Resolution

Feb 21, 2025
Via arXiv

Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance

Feb 17, 2025
Via arXiv

SCDiar: A Streaming Diarization System Based on Speaker Change Detection and Speech Recognition

Jan 28, 2025
Via arXiv

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Jan 21, 2025
Via arXiv

UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery

Jan 03, 2025
Via arXiv

CAMEL: Cross-Attention Enhanced Mixture-of-Experts and Language Bias for Code-Switching Speech Recognition

Dec 17, 2024
Via arXiv

ACQ: A Unified Framework for Automated Programmatic Creativity in Online Advertising

Dec 09, 2024
Via arXiv

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Nov 05, 2024
Via arXiv

Learning Identifiable Factorized Causal Representations of Cellular Responses

Oct 29, 2024
Via arXiv