Bowen Zhang

RareSpot: Spotting Small and Rare Wildlife in Aerial Imagery with Multi-Scale Consistency and Context-Aware Augmentation

Jun 23, 2025
Via arXiv

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Jun 18, 2025
Via arXiv

Abstract, Align, Predict: Zero-Shot Stance Detection via Cognitive Inductive Reasoning

Jun 16, 2025
Via arXiv

Automated evaluation of children's speech fluency for low-resource languages

May 26, 2025
Via arXiv

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

May 12, 2025
Via arXiv

CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass

May 01, 2025
Via arXiv

C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset

Apr 14, 2025
Via arXiv

PBR3DGen: A VLM-guided Mesh Generation with High-quality PBR Texture

Mar 14, 2025
Via arXiv

OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model

Mar 13, 2025
Via arXiv

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Mar 13, 2025
Via arXiv