Picture for Changsheng Xu

Changsheng Xu

Unifying Perplexing Behaviors in Modified BP Attributions through Alignment Perspective

Add code
Mar 14, 2025
Viaarxiv icon

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

Add code
Feb 21, 2025
Viaarxiv icon

Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting

Add code
Jan 26, 2025
Figure 1 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Figure 2 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Figure 3 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Figure 4 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Viaarxiv icon

Towards Visual Grounding: A Survey

Add code
Dec 28, 2024
Viaarxiv icon

Do We Need to Design Specific Diffusion Models for Different Tasks? Try ONE-PIC

Add code
Dec 07, 2024
Viaarxiv icon

LumiSculpt: A Consistency Lighting Control Network for Video Generation

Add code
Oct 30, 2024
Figure 1 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 2 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 3 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 4 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Viaarxiv icon

Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models

Add code
Oct 29, 2024
Viaarxiv icon

Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models

Add code
Oct 11, 2024
Viaarxiv icon

OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling

Add code
Oct 10, 2024
Figure 1 for OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
Figure 2 for OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
Figure 3 for OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
Figure 4 for OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling
Viaarxiv icon

Revisiting Essential and Nonessential Settings of Evidential Deep Learning

Add code
Oct 01, 2024
Figure 1 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 2 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 3 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Figure 4 for Revisiting Essential and Nonessential Settings of Evidential Deep Learning
Viaarxiv icon