Picture for Guoli Song

Guoli Song

Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning

Add code
May 30, 2023
Viaarxiv icon

Position Embedding Needs an Independent Layer Normalization

Add code
Dec 22, 2022
Viaarxiv icon

Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

Add code
Nov 21, 2022
Viaarxiv icon

PCR: Pessimistic Consistency Regularization for Semi-Supervised Segmentation

Add code
Oct 16, 2022
Figure 1 for PCR: Pessimistic Consistency Regularization for Semi-Supervised Segmentation
Figure 2 for PCR: Pessimistic Consistency Regularization for Semi-Supervised Segmentation
Figure 3 for PCR: Pessimistic Consistency Regularization for Semi-Supervised Segmentation
Figure 4 for PCR: Pessimistic Consistency Regularization for Semi-Supervised Segmentation
Viaarxiv icon

Dynamic Clustering Network for Unsupervised Semantic Segmentation

Add code
Oct 12, 2022
Figure 1 for Dynamic Clustering Network for Unsupervised Semantic Segmentation
Figure 2 for Dynamic Clustering Network for Unsupervised Semantic Segmentation
Figure 3 for Dynamic Clustering Network for Unsupervised Semantic Segmentation
Figure 4 for Dynamic Clustering Network for Unsupervised Semantic Segmentation
Viaarxiv icon

Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering

Add code
Sep 21, 2022
Figure 1 for Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
Figure 2 for Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
Figure 3 for Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
Figure 4 for Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering
Viaarxiv icon

Locality Guidance for Improving Vision Transformers on Tiny Datasets

Add code
Jul 20, 2022
Figure 1 for Locality Guidance for Improving Vision Transformers on Tiny Datasets
Figure 2 for Locality Guidance for Improving Vision Transformers on Tiny Datasets
Figure 3 for Locality Guidance for Improving Vision Transformers on Tiny Datasets
Figure 4 for Locality Guidance for Improving Vision Transformers on Tiny Datasets
Viaarxiv icon

Difference in Euclidean Norm Can Cause Semantic Divergence in Batch Normalization

Add code
Jul 06, 2022
Figure 1 for Difference in Euclidean Norm Can Cause Semantic Divergence in Batch Normalization
Figure 2 for Difference in Euclidean Norm Can Cause Semantic Divergence in Batch Normalization
Figure 3 for Difference in Euclidean Norm Can Cause Semantic Divergence in Batch Normalization
Figure 4 for Difference in Euclidean Norm Can Cause Semantic Divergence in Batch Normalization
Viaarxiv icon

ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval

Add code
Mar 31, 2022
Figure 1 for ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
Figure 2 for ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
Figure 3 for ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
Figure 4 for ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
Viaarxiv icon

Harmonized Multimodal Learning with Gaussian Process Latent Variable Models

Add code
Aug 14, 2019
Figure 1 for Harmonized Multimodal Learning with Gaussian Process Latent Variable Models
Figure 2 for Harmonized Multimodal Learning with Gaussian Process Latent Variable Models
Figure 3 for Harmonized Multimodal Learning with Gaussian Process Latent Variable Models
Figure 4 for Harmonized Multimodal Learning with Gaussian Process Latent Variable Models
Viaarxiv icon