Kun Zhou

YuLan-Mini: An Open Data-efficient Language Model

Dec 24, 2024

Hierarchical Control of Emotion Rendering in Speech Synthesis

Dec 17, 2024

RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector

Dec 13, 2024

MaterialPicker: Multi-Modal Material Generation with Diffusion Transformers

Dec 04, 2024

Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models

Nov 27, 2024

ARM: Appearance Reconstruction Model for Relightable 3D Generation

Nov 16, 2024

Self-Calibrated Listwise Reranking with Large Language Models

Nov 07, 2024

Exploring the Design Space of Visual Context Representation in Video MLLMs

Oct 17, 2024

GS^3: Efficient Relighting with Triple Gaussian Splatting

Oct 15, 2024

Extracting and Transferring Abilities For Building Multi-lingual Ability-enhanced Large Language Models

Oct 10, 2024