Zhifei Li

MyGram: Modality-aware Graph Transformer with Global Distribution for Multi-modal Entity Alignment

Jan 17, 2026
Via arXiv

MacVQA: Adaptive Memory Allocation and Global Noise Filtering for Continual Visual Question Answering

Jan 05, 2026
Via arXiv

Let the Barbarians In: How AI Can Accelerate Systems Performance Research

Dec 22, 2025
Via arXiv

KeenKT: Knowledge Mastery-State Disambiguation for Knowledge Tracing

Dec 21, 2025
Via arXiv

FrontierCS: Evolving Challenges for Evolving Intelligence

Dec 17, 2025
Via arXiv

Analyzing Planner Design Trade-offs for MAPF under Realistic Simulation

Dec 10, 2025
Via arXiv

LEANN: A Low-Storage Vector Index

Jun 09, 2025
Via arXiv

Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

Mar 03, 2025
Via arXiv

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Feb 06, 2025
Via arXiv

CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion

Dec 03, 2024
Via arXiv