Honggang Zhang

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Oct 14, 2024

Unveiling and Mitigating Bias in Audio Visual Segmentation

Jul 23, 2024

Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes

Jul 15, 2024

Can Textual Semantics Mitigate Sounding Object Segmentation Preference?

Jul 15, 2024

Communication and Control Co-Design in 6G: Sequential Decision-Making with LLMs

Jul 06, 2024

We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?

Jul 01, 2024

Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach

Jun 06, 2024

Snake Learning: A Communication- and Computation-Efficient Distributed Learning Framework for 6G

May 06, 2024

More than Vanilla Fusion: a Simple, Decoupling-free, Attention Module for Multimodal Fusion Based on Signal Theory

Dec 12, 2023

Self-Critical Alternate Learning based Semantic Broadcast Communication

Dec 03, 2023