Picture for Weidong Cai

Weidong Cai

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm

Add code
Feb 18, 2025
Viaarxiv icon

Medical Image Registration Meets Vision Foundation Model: Prototype Learning and Contour Awareness

Add code
Feb 17, 2025
Viaarxiv icon

NVS-SQA: Exploring Self-Supervised Quality Representation Learning for Neurally Synthesized Scenes without References

Add code
Jan 11, 2025
Viaarxiv icon

Cross-View Consistency Regularisation for Knowledge Distillation

Add code
Dec 21, 2024
Viaarxiv icon

Gotta Hear Them All: Sound Source Aware Vision to Audio Generation

Add code
Nov 26, 2024
Figure 1 for Gotta Hear Them All: Sound Source Aware Vision to Audio Generation
Figure 2 for Gotta Hear Them All: Sound Source Aware Vision to Audio Generation
Figure 3 for Gotta Hear Them All: Sound Source Aware Vision to Audio Generation
Figure 4 for Gotta Hear Them All: Sound Source Aware Vision to Audio Generation
Viaarxiv icon

Cell as Point: One-Stage Framework for Efficient Cell Tracking

Add code
Nov 22, 2024
Viaarxiv icon

ORID: Organ-Regional Information Driven Framework for Radiology Report Generation

Add code
Nov 20, 2024
Viaarxiv icon

AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation

Add code
Nov 07, 2024
Figure 1 for AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation
Figure 2 for AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation
Figure 3 for AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation
Figure 4 for AMNCutter: Affinity-Attention-Guided Multi-View Normalized Cutter for Unsupervised Surgical Instrument Segmentation
Viaarxiv icon

TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds

Add code
Oct 29, 2024
Figure 1 for TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds
Figure 2 for TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds
Figure 3 for TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds
Figure 4 for TractShapeNet: Efficient Multi-Shape Learning with 3D Tractography Point Clouds
Viaarxiv icon

DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction

Add code
Oct 29, 2024
Figure 1 for DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction
Figure 2 for DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction
Figure 3 for DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction
Figure 4 for DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction
Viaarxiv icon