Picture for Yongxin Wang

Yongxin Wang

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Add code
Jun 28, 2024
Figure 1 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Figure 2 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Figure 3 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Figure 4 for Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Viaarxiv icon

Prototype-Based Layered Federated Cross-Modal Hashing

Add code
Oct 27, 2022
Viaarxiv icon

Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval

Add code
Apr 12, 2022
Figure 1 for Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval
Figure 2 for Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval
Figure 3 for Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval
Figure 4 for Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval
Viaarxiv icon

ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator

Add code
Mar 24, 2022
Figure 1 for ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
Figure 2 for ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
Figure 3 for ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
Figure 4 for ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
Viaarxiv icon

Learning Hierarchical Graph Neural Networks for Image Clustering

Add code
Jul 17, 2021
Figure 1 for Learning Hierarchical Graph Neural Networks for Image Clustering
Figure 2 for Learning Hierarchical Graph Neural Networks for Image Clustering
Figure 3 for Learning Hierarchical Graph Neural Networks for Image Clustering
Figure 4 for Learning Hierarchical Graph Neural Networks for Image Clustering
Viaarxiv icon

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

Add code
Jul 06, 2021
Figure 1 for Semi-TCL: Semi-Supervised Track Contrastive Representation Learning
Figure 2 for Semi-TCL: Semi-Supervised Track Contrastive Representation Learning
Figure 3 for Semi-TCL: Semi-Supervised Track Contrastive Representation Learning
Figure 4 for Semi-TCL: Semi-Supervised Track Contrastive Representation Learning
Viaarxiv icon

MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences

Add code
Oct 22, 2020
Figure 1 for MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences
Figure 2 for MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences
Figure 3 for MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences
Figure 4 for MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences
Viaarxiv icon

Weakly-Supervised Online Hashing

Add code
Sep 16, 2020
Figure 1 for Weakly-Supervised Online Hashing
Figure 2 for Weakly-Supervised Online Hashing
Figure 3 for Weakly-Supervised Online Hashing
Figure 4 for Weakly-Supervised Online Hashing
Viaarxiv icon

Graph Neural Networks for 3D Multi-Object Tracking

Add code
Aug 20, 2020
Figure 1 for Graph Neural Networks for 3D Multi-Object Tracking
Figure 2 for Graph Neural Networks for 3D Multi-Object Tracking
Figure 3 for Graph Neural Networks for 3D Multi-Object Tracking
Viaarxiv icon

What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets

Add code
Jul 07, 2020
Figure 1 for What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets
Figure 2 for What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets
Figure 3 for What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets
Figure 4 for What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets
Viaarxiv icon