Picture for Min Zhang

Min Zhang

Jake

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Add code
Dec 12, 2024
Viaarxiv icon

ZigZagkv: Dynamic KV Cache Compression for Long-context Modeling based on Layer Uncertainty

Add code
Dec 12, 2024
Viaarxiv icon

CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models

Add code
Dec 10, 2024
Viaarxiv icon

Multi-Level Correlation Network For Few-Shot Image Classification

Add code
Dec 04, 2024
Viaarxiv icon

Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark

Add code
Dec 03, 2024
Viaarxiv icon

Learning Monotonic Attention in Transducer for Streaming Generation

Add code
Nov 26, 2024
Viaarxiv icon

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization

Add code
Nov 21, 2024
Viaarxiv icon

Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM

Add code
Nov 20, 2024
Viaarxiv icon

Interpret the Internal States of Recommendation Model with Sparse Autoencoder

Add code
Nov 09, 2024
Viaarxiv icon

Beyond Utility: Evaluating LLM as Recommender

Add code
Nov 01, 2024
Figure 1 for Beyond Utility: Evaluating LLM as Recommender
Figure 2 for Beyond Utility: Evaluating LLM as Recommender
Figure 3 for Beyond Utility: Evaluating LLM as Recommender
Figure 4 for Beyond Utility: Evaluating LLM as Recommender
Viaarxiv icon