Picture for Sitong Wu

Sitong Wu

RoboCoder: Robotic Learning from Basic Skills to General Tasks with Large Language Models

Add code
Jun 06, 2024
Figure 1 for RoboCoder: Robotic Learning from Basic Skills to General Tasks with Large Language Models
Figure 2 for RoboCoder: Robotic Learning from Basic Skills to General Tasks with Large Language Models
Figure 3 for RoboCoder: Robotic Learning from Basic Skills to General Tasks with Large Language Models
Figure 4 for RoboCoder: Robotic Learning from Basic Skills to General Tasks with Large Language Models
Viaarxiv icon

Ensemble Quadratic Assignment Network for Graph Matching

Add code
Mar 11, 2024
Viaarxiv icon

Data Pruning via Moving-one-Sample-out

Add code
Oct 25, 2023
Viaarxiv icon

RegionBLIP: A Unified Multi-modal Pre-training Framework for Holistic and Regional Comprehension

Add code
Aug 03, 2023
Viaarxiv icon

AxWin Transformer: A Context-Aware Vision Transformer Backbone with Axial Windows

Add code
May 02, 2023
Viaarxiv icon

UniNeXt: Exploring A Unified Architecture for Vision Recognition

Add code
May 01, 2023
Viaarxiv icon

PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation

Add code
May 01, 2023
Viaarxiv icon

Semantic Diffusion Network for Semantic Segmentation

Add code
Feb 04, 2023
Viaarxiv icon

Demystify Transformers & Convolutions in Modern Image Deep Networks

Add code
Nov 10, 2022
Viaarxiv icon

CATrans: Context and Affinity Transformer for Few-Shot Segmentation

Add code
Apr 27, 2022
Figure 1 for CATrans: Context and Affinity Transformer for Few-Shot Segmentation
Figure 2 for CATrans: Context and Affinity Transformer for Few-Shot Segmentation
Figure 3 for CATrans: Context and Affinity Transformer for Few-Shot Segmentation
Figure 4 for CATrans: Context and Affinity Transformer for Few-Shot Segmentation
Viaarxiv icon