
Kai Liu

Refer to the report for detailed contributions.

Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance

Feb 17, 2025
Via arXiv

SCDiar: a streaming diarization system based on speaker change detection and speech recognition

Jan 28, 2025
Via arXiv

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Jan 21, 2025
Via arXiv

UAV-DETR: Efficient End-to-End Object Detection for Unmanned Aerial Vehicle Imagery

Jan 03, 2025
Via arXiv

CAMEL: Cross-Attention Enhanced Mixture-of-Experts and Language Bias for Code-Switching Speech Recognition

Dec 17, 2024
Via arXiv

ACQ: A Unified Framework for Automated Programmatic Creativity in Online Advertising

Dec 09, 2024
Via arXiv

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Nov 05, 2024
Via arXiv

Learning Identifiable Factorized Causal Representations of Cellular Responses

Oct 29, 2024
Via arXiv

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Oct 24, 2024
Via arXiv

Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds

Oct 23, 2024
Via arXiv