Picture for Junbo Zhang

Junbo Zhang

Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering

Add code
Mar 17, 2025
Viaarxiv icon

Improving Open-world Continual Learning under the Constraints of Scarce Labeled Data

Add code
Feb 28, 2025
Viaarxiv icon

Order-Robust Class Incremental Learning: Graph-Driven Dynamic Similarity Grouping

Add code
Feb 27, 2025
Viaarxiv icon

The ICME 2025 Audio Encoder Capability Challenge

Add code
Jan 25, 2025
Viaarxiv icon

UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method

Add code
Sep 08, 2024
Figure 1 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Figure 2 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Figure 3 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Figure 4 for UMOD: A Novel and Effective Urban Metro Origin-Destination Flow Prediction Method
Viaarxiv icon

Personalized Federated Continual Learning via Multi-granularity Prompt

Add code
Jun 27, 2024
Figure 1 for Personalized Federated Continual Learning via Multi-granularity Prompt
Figure 2 for Personalized Federated Continual Learning via Multi-granularity Prompt
Figure 3 for Personalized Federated Continual Learning via Multi-granularity Prompt
Figure 4 for Personalized Federated Continual Learning via Multi-granularity Prompt
Viaarxiv icon

Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding

Add code
Jun 19, 2024
Viaarxiv icon

Scaling up masked audio encoder learning for general audio classification

Add code
Jun 11, 2024
Figure 1 for Scaling up masked audio encoder learning for general audio classification
Figure 2 for Scaling up masked audio encoder learning for general audio classification
Figure 3 for Scaling up masked audio encoder learning for general audio classification
Figure 4 for Scaling up masked audio encoder learning for general audio classification
Viaarxiv icon

Bridging Language Gaps in Audio-Text Retrieval

Add code
Jun 11, 2024
Figure 1 for Bridging Language Gaps in Audio-Text Retrieval
Figure 2 for Bridging Language Gaps in Audio-Text Retrieval
Figure 3 for Bridging Language Gaps in Audio-Text Retrieval
Figure 4 for Bridging Language Gaps in Audio-Text Retrieval
Viaarxiv icon

Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion

Add code
Mar 12, 2024
Figure 1 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Figure 2 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Figure 3 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Figure 4 for Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion
Viaarxiv icon