Picture for Pengwei Wang

Pengwei Wang

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning

Add code
Mar 27, 2025
Viaarxiv icon

Modeling Variants of Prompts for Vision-Language Models

Add code
Mar 11, 2025
Viaarxiv icon

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

Add code
Feb 28, 2025
Viaarxiv icon

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation

Add code
Feb 19, 2025
Viaarxiv icon

M2SE: A Multistage Multitask Instruction Tuning Strategy for Unified Sentiment and Emotion Analysis

Add code
Dec 11, 2024
Figure 1 for M2SE: A Multistage Multitask Instruction Tuning Strategy for Unified Sentiment and Emotion Analysis
Figure 2 for M2SE: A Multistage Multitask Instruction Tuning Strategy for Unified Sentiment and Emotion Analysis
Figure 3 for M2SE: A Multistage Multitask Instruction Tuning Strategy for Unified Sentiment and Emotion Analysis
Figure 4 for M2SE: A Multistage Multitask Instruction Tuning Strategy for Unified Sentiment and Emotion Analysis
Viaarxiv icon

Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation

Add code
Nov 27, 2024
Figure 1 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Figure 2 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Figure 3 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Figure 4 for Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation
Viaarxiv icon

Optimizing Medical Image Segmentation with Advanced Decoder Design

Add code
Oct 05, 2024
Figure 1 for Optimizing Medical Image Segmentation with Advanced Decoder Design
Figure 2 for Optimizing Medical Image Segmentation with Advanced Decoder Design
Figure 3 for Optimizing Medical Image Segmentation with Advanced Decoder Design
Figure 4 for Optimizing Medical Image Segmentation with Advanced Decoder Design
Viaarxiv icon

Beyond Gait: Learning Knee Angle for Seamless Prosthesis Control in Multiple Scenarios

Add code
Apr 10, 2024
Viaarxiv icon

More complex encoder is not all you need

Add code
Sep 21, 2023
Figure 1 for More complex encoder is not all you need
Figure 2 for More complex encoder is not all you need
Figure 3 for More complex encoder is not all you need
Figure 4 for More complex encoder is not all you need
Viaarxiv icon

History-Aware Hierarchical Transformer for Multi-session Open-domain Dialogue System

Add code
Feb 02, 2023
Viaarxiv icon