Picture for Hongjie Chen

Hongjie Chen

Dolby Laboratories

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Add code
Mar 04, 2026
Viaarxiv icon

Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis

Add code
Feb 13, 2026
Viaarxiv icon

Segment Length Matters: A Study of Segment Lengths on Audio Fingerprinting Performance

Add code
Jan 25, 2026
Viaarxiv icon

A Unified Spoken Language Model with Injected Emotional-Attribution Thinking for Human-like Interaction

Add code
Jan 08, 2026
Viaarxiv icon

Measuring Time-Series Dataset Similarity using Wasserstein Distance

Add code
Jul 29, 2025
Viaarxiv icon

TELEVAL: A Dynamic Benchmark Designed for Spoken Language Models in Chinese Interactive Scenarios

Add code
Jul 24, 2025
Viaarxiv icon

BoSS: Beyond-Semantic Speech

Add code
Jul 23, 2025
Viaarxiv icon

A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality

Add code
Jul 09, 2025
Figure 1 for A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Figure 2 for A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
Viaarxiv icon

Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition

Add code
Jun 15, 2025
Viaarxiv icon

Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis

Add code
May 27, 2025
Viaarxiv icon