Picture for Zeyi Huang

Zeyi Huang

MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Add code
May 26, 2025
Viaarxiv icon

T2I-ConBench: Text-to-Image Benchmark for Continual Post-training

Add code
May 22, 2025
Viaarxiv icon

Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Add code
Apr 22, 2025
Viaarxiv icon

HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting

Add code
Mar 25, 2025
Viaarxiv icon

Do Vision Models Develop Human-Like Progressive Difficulty Understanding?

Add code
Mar 17, 2025
Viaarxiv icon

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents

Add code
Feb 25, 2025
Figure 1 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Figure 2 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Figure 3 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Figure 4 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Viaarxiv icon

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

Add code
Jan 08, 2025
Figure 1 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Figure 2 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Figure 3 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Figure 4 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Viaarxiv icon

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation

Add code
Dec 03, 2024
Figure 1 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Figure 2 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Figure 3 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Figure 4 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Viaarxiv icon

Ascend HiFloat8 Format for Deep Learning

Add code
Sep 26, 2024
Figure 1 for Ascend HiFloat8 Format for Deep Learning
Figure 2 for Ascend HiFloat8 Format for Deep Learning
Figure 3 for Ascend HiFloat8 Format for Deep Learning
Figure 4 for Ascend HiFloat8 Format for Deep Learning
Viaarxiv icon