Chaoyi Zhang

DeepIcon: A Hierarchical Network for Layer-wise Icon Vectorization

Oct 21, 2024
Via arXiv

Learning to Synthesize Graphics Programs for Geometric Artworks

Oct 21, 2024
Via arXiv

Enhancing Advanced Visual Reasoning Ability of Large Language Models

Sep 21, 2024
Via arXiv

Multimodal Causal Reasoning Benchmark: Challenging Vision Large Language Models to Infer Causal Links Between Siamese Images

Aug 15, 2024
Via arXiv

Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights

Jul 16, 2024
Via arXiv

Enhancing Robustness to Noise Corruption for Point Cloud Model via Spatial Sorting and Set-Mixing Aggregation Module

Jul 15, 2024
Via arXiv

TractGraphFormer: Anatomically Informed Hybrid Graph CNN-Transformer Network for Classification from Diffusion MRI Tractography

Jul 11, 2024
Via arXiv

Exploiting Structural Consistency of Chest Anatomy for Unsupervised Anomaly Detection in Radiography Images

Mar 13, 2024
Via arXiv

MM-Narrator: Narrating Long-form Videos with Multimodal In-Context Learning

Nov 29, 2023
Via arXiv

Exploring Annotation-free Image Captioning with Retrieval-augmented Pseudo Sentence Generation

Jul 28, 2023
Via arXiv