Pingchuan Ma

Does VLM Classification Benefit from LLM Description Semantics?

Dec 16, 2024
Via arXiv

ROICtrl: Boosting Instance Control for Visual Generation

Nov 27, 2024
Via arXiv

Learning Object Properties Using Robot Proprioception via Differentiable Robot-Object Interaction

Oct 04, 2024
Via arXiv

ADEPT-Z: Zero-Shot Automated Circuit Topology Search for Pareto-Optimal Photonic Tensor Cores

Oct 02, 2024
Via arXiv

WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

Sep 26, 2024
Via arXiv

KAN 2.0: Kolmogorov-Arnold Networks Meet Science

Aug 19, 2024
Via arXiv

Diffusion Models and Representation Learning: A Survey

Jun 30, 2024
Via arXiv

Dynamic Data Pruning for Automatic Speech Recognition

Jun 26, 2024
Via arXiv

MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization

Jun 25, 2024
Via arXiv

PIC2O-Sim: A Physics-Inspired Causality-Aware Dynamic Convolutional Neural Operator for Ultra-Fast Photonic Device FDTD Simulation

Jun 24, 2024
Via arXiv