Heng Wang

Fast Prompt Alignment for Text-to-Image Generation

Dec 11, 2024

Gotta Hear Them All: Sound Source Aware Vision to Audio Generation

Nov 26, 2024

Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism

Nov 04, 2024

DINeuro: Distilling Knowledge from 2D Natural Images via Deformable Tubular Transferring Strategy for 3D Neuron Reconstruction

Oct 29, 2024

DAOcc: 3D Object Detection Assisted Multi-Sensor Fusion for 3D Occupancy Prediction

Sep 30, 2024

BUPTCMCC-6G-CMG+: A GBSM-Based ISAC Channel Model Simulator

Sep 22, 2024

Enhancing Advanced Visual Reasoning Ability of Large Language Models

Sep 21, 2024

Explaining Datasets in Words: Statistical Models with Natural Language Parameters

Sep 13, 2024

VMAS: Video-to-Music Generation via Semantic Alignment in Web Music Videos

Sep 11, 2024

Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences

Sep 06, 2024