Picture for Hao Ma

Hao Ma

State Key Laboratory of Information Engineering in Survering, Mapping and Remote Sensing, Wuhan University

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Add code
Oct 21, 2024
Viaarxiv icon

Preference Optimization with Multi-Sample Comparisons

Add code
Oct 16, 2024
Viaarxiv icon

Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning

Add code
Oct 08, 2024
Viaarxiv icon

The Perfect Blend: Redefining RLHF with Mixture of Judges

Add code
Sep 30, 2024
Figure 1 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 2 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 3 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 4 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Viaarxiv icon

Language-Queried Target Sound Extraction Without Parallel Training Data

Add code
Sep 14, 2024
Figure 1 for Language-Queried Target Sound Extraction Without Parallel Training Data
Figure 2 for Language-Queried Target Sound Extraction Without Parallel Training Data
Figure 3 for Language-Queried Target Sound Extraction Without Parallel Training Data
Figure 4 for Language-Queried Target Sound Extraction Without Parallel Training Data
Viaarxiv icon

ECAFormer: Low-light Image Enhancement using Cross Attention

Add code
Jun 19, 2024
Figure 1 for ECAFormer: Low-light Image Enhancement using Cross Attention
Figure 2 for ECAFormer: Low-light Image Enhancement using Cross Attention
Figure 3 for ECAFormer: Low-light Image Enhancement using Cross Attention
Figure 4 for ECAFormer: Low-light Image Enhancement using Cross Attention
Viaarxiv icon

Stochastic Online Optimization for Cyber-Physical and Robotic Systems

Add code
Apr 08, 2024
Figure 1 for Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Figure 2 for Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Figure 3 for Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Figure 4 for Stochastic Online Optimization for Cyber-Physical and Robotic Systems
Viaarxiv icon

CLAPSep: Leveraging Contrastive Pre-trained Models for Multi-Modal Query-Conditioned Target Sound Extraction

Add code
Feb 27, 2024
Figure 1 for CLAPSep: Leveraging Contrastive Pre-trained Models for Multi-Modal Query-Conditioned Target Sound Extraction
Figure 2 for CLAPSep: Leveraging Contrastive Pre-trained Models for Multi-Modal Query-Conditioned Target Sound Extraction
Figure 3 for CLAPSep: Leveraging Contrastive Pre-trained Models for Multi-Modal Query-Conditioned Target Sound Extraction
Figure 4 for CLAPSep: Leveraging Contrastive Pre-trained Models for Multi-Modal Query-Conditioned Target Sound Extraction
Viaarxiv icon

Poisoning Attacks against Recommender Systems: A Survey

Add code
Jan 14, 2024
Viaarxiv icon

Extending Whisper with prompt tuning to target-speaker ASR

Add code
Dec 13, 2023
Viaarxiv icon