Mohan Li

Federated Concept-Based Models: Interpretable models with distributed supervision

Feb 04, 2026

ICPO: Illocution-Calibrated Policy Optimization for Multi-Turn Conversation

Jan 20, 2026

Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks

Feb 18, 2025

Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction Attacks

Jan 16, 2025

A Survey on Federated Learning in Human Sensing

Jan 07, 2025

WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding

Aug 29, 2024

Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding

Jun 21, 2024

DiaLoc: An Iterative Approach to Embodied Dialog Localization

Mar 11, 2024

Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition

Apr 24, 2023

Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding

Apr 21, 2023