Picture for Jing Shi

Jing Shi

Trust-MARL: Trust-Based Multi-Agent Reinforcement Learning Framework for Cooperative On-Ramp Merging Control in Heterogeneous Traffic Flow

Add code
Jun 14, 2025
Viaarxiv icon

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

YoChameleon: Personalized Vision and Language Generation

Add code
Apr 29, 2025
Viaarxiv icon

Accelerating Multi-Objective Collaborative Optimization of Doped Thermoelectric Materials via Artificial Intelligence

Add code
Apr 11, 2025
Viaarxiv icon

Visual Persona: Foundation Model for Full-Body Human Customization

Add code
Mar 19, 2025
Viaarxiv icon

MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities

Add code
Jan 15, 2025
Viaarxiv icon

Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage

Add code
Dec 24, 2024
Viaarxiv icon

GUI Agents: A Survey

Add code
Dec 18, 2024
Viaarxiv icon

SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Add code
Dec 13, 2024
Viaarxiv icon

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

Add code
Nov 23, 2024
Figure 1 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Figure 2 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Figure 3 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Figure 4 for FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
Viaarxiv icon