Zhengtao Yu

Multilingual Generative Retrieval via Cross-lingual Semantic Compression

Oct 09, 2025

FLEXI: Benchmarking Full-duplex Human-LLM Speech Interaction

Sep 26, 2025

SageLM: A Multi-aspect and Explainable Large Language Model for Speech Judgement

Aug 28, 2025

One Size Does Not Fit All: A Distribution-Aware Sparsification for More Precise Model Merging

Aug 08, 2025

Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation

May 21, 2025

Unsupervised Graph Clustering with Deep Structural Entropy

May 20, 2025

BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking

Feb 22, 2025

Phrase Decoupling Cross-Modal Hierarchical Matching and Progressive Position Correction for Visual Grounding

Oct 31, 2024

A Mixed-Language Multi-Document News Summarization Dataset and a Graphs-Based Extract-Generate Model

Oct 13, 2024

2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models

Sep 29, 2024