Long Bai

Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping

Jan 31, 2025

EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery

Jan 20, 2025

V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy

Dec 23, 2024

SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation

Dec 18, 2024

ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection

Nov 28, 2024

AlignXIE: Improving Multilingual Information Extraction by Cross-Lingual Alignment

Nov 07, 2024

Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis

Oct 24, 2024

CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection

Oct 10, 2024

GOPT: Generalizable Online 3D Bin Packing via Transformer-based Deep Reinforcement Learning

Sep 09, 2024

Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery

Aug 09, 2024