Picture for Jin Wang

Jin Wang

University of California, Los Angeles, USA

TAGRPO: Boosting GRPO on Image-to-Video Generation with Direct Trajectory Alignment

Add code
Jan 09, 2026
Viaarxiv icon

Rotate Your Character: Revisiting Video Diffusion Models for High-Quality 3D Character Generation

Add code
Jan 09, 2026
Viaarxiv icon

HearSay Benchmark: Do Audio LLMs Leak What They Hear?

Add code
Jan 07, 2026
Viaarxiv icon

ChemBART: A Pre-trained BART Model Assisting Organic Chemistry Analysis

Add code
Jan 06, 2026
Viaarxiv icon

Evaluating transfer learning strategies for improving dairy cattle body weight prediction in small farms using depth-image and point-cloud data

Add code
Jan 03, 2026
Viaarxiv icon

Vision-Language-Policy Model for Dynamic Robot Task Planning

Add code
Dec 22, 2025
Figure 1 for Vision-Language-Policy Model for Dynamic Robot Task Planning
Figure 2 for Vision-Language-Policy Model for Dynamic Robot Task Planning
Figure 3 for Vision-Language-Policy Model for Dynamic Robot Task Planning
Figure 4 for Vision-Language-Policy Model for Dynamic Robot Task Planning
Viaarxiv icon

Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation

Add code
Nov 19, 2025
Viaarxiv icon

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Add code
Nov 18, 2025
Viaarxiv icon

HCC-3D: Hierarchical Compensatory Compression for 98% 3D Token Reduction in Vision-Language Models

Add code
Nov 13, 2025
Viaarxiv icon

Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency

Add code
Nov 12, 2025
Figure 1 for Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
Figure 2 for Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
Figure 3 for Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
Figure 4 for Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
Viaarxiv icon