Yutao Zeng

Frac-Connections: Fractional Extension of Hyper-Connections

Mar 18, 2025
Via arXiv

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Mar 06, 2025
Via arXiv

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Feb 21, 2025
Via arXiv

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

Feb 18, 2025
Via arXiv

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Jan 28, 2025
Via arXiv

Ultra-Sparse Memory Network

Nov 19, 2024
Via arXiv

AlignXIE: Improving Multilingual Information Extraction by Cross-Lingual Alignment

Nov 07, 2024
Via arXiv

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Nov 06, 2024
Via arXiv

Hyper-Connections

Sep 29, 2024
Via arXiv

Self-Improvement Programming for Temporal Knowledge Graph Question Answering

Apr 02, 2024
Via arXiv