Picture for Liang Xie

Liang Xie

AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals

Add code
Jan 28, 2025
Figure 1 for AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals
Figure 2 for AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals
Figure 3 for AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals
Figure 4 for AVE Speech Dataset: A Comprehensive Benchmark for Multi-Modal Speech Recognition Integrating Audio, Visual, and Electromyographic Signals
Viaarxiv icon

LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition

Add code
Jan 08, 2025
Figure 1 for LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition
Figure 2 for LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition
Figure 3 for LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition
Figure 4 for LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition
Viaarxiv icon

Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

Add code
Dec 19, 2024
Viaarxiv icon

SciPIP: An LLM-based Scientific Paper Idea Proposer

Add code
Oct 30, 2024
Figure 1 for SciPIP: An LLM-based Scientific Paper Idea Proposer
Figure 2 for SciPIP: An LLM-based Scientific Paper Idea Proposer
Figure 3 for SciPIP: An LLM-based Scientific Paper Idea Proposer
Figure 4 for SciPIP: An LLM-based Scientific Paper Idea Proposer
Viaarxiv icon

Delving into the Reversal Curse: How Far Can Large Language Models Generalize?

Add code
Oct 24, 2024
Figure 1 for Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Figure 2 for Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Figure 3 for Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Figure 4 for Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
Viaarxiv icon

Instance-adaptive Zero-shot Chain-of-Thought Prompting

Add code
Sep 30, 2024
Figure 1 for Instance-adaptive Zero-shot Chain-of-Thought Prompting
Figure 2 for Instance-adaptive Zero-shot Chain-of-Thought Prompting
Figure 3 for Instance-adaptive Zero-shot Chain-of-Thought Prompting
Figure 4 for Instance-adaptive Zero-shot Chain-of-Thought Prompting
Viaarxiv icon

From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning

Add code
Sep 03, 2024
Viaarxiv icon

From Redundancy to Relevance: Enhancing Explainability in Multimodal Large Language Models

Add code
Jun 04, 2024
Viaarxiv icon

Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization

Add code
Mar 24, 2024
Figure 1 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Figure 2 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Figure 3 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Figure 4 for Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization
Viaarxiv icon

Safe-VLN: Collision Avoidance for Vision-and-Language Navigation of Autonomous Robots Operating in Continuous Environments

Add code
Nov 06, 2023
Viaarxiv icon