Jingbo Zhu

Foundations of Large Language Models

Jan 16, 2025
Via arXiv

Optimizing Speech Multi-View Feature Fusion through Conditional Computation

Jan 14, 2025
Via arXiv

Boosting Text-To-Image Generation via Multilingual Prompting in Large Multimodal Models

Jan 13, 2025
Via arXiv

SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment

Jan 07, 2025
Via arXiv

Early Exit Is a Natural Capability in Transformer-based Models: An Empirical Study on Early Exit without Joint Optimization

Dec 02, 2024
Via arXiv

Predictor-Corrector Enhanced Transformers with Exponential Moving Average Coefficient Learning

Nov 05, 2024
Via arXiv

Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-context Models

Oct 07, 2024
Via arXiv

LRHP: Learning Representations for Human Preferences via Preference Pairs

Oct 06, 2024
Via arXiv

A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation

Sep 24, 2024
Via arXiv

More Effective LLM Compressed Tokens with Uniformly Spread Position Identifiers and Compression Loss

Sep 22, 2024
Via arXiv