Picture for Xinrong Zhang

Xinrong Zhang

Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling

Add code
Oct 09, 2024
Figure 1 for Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling
Figure 2 for Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling
Figure 3 for Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling
Figure 4 for Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling
Viaarxiv icon

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

Add code
Jun 22, 2024
Viaarxiv icon

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Add code
Apr 09, 2024
Figure 1 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 2 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 3 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Figure 4 for MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Viaarxiv icon

$\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens

Add code
Feb 24, 2024
Viaarxiv icon

Unlock Predictable Scaling from Emergent Abilities

Add code
Oct 05, 2023
Figure 1 for Unlock Predictable Scaling from Emergent Abilities
Figure 2 for Unlock Predictable Scaling from Emergent Abilities
Figure 3 for Unlock Predictable Scaling from Emergent Abilities
Figure 4 for Unlock Predictable Scaling from Emergent Abilities
Viaarxiv icon

Phocus: Picking Valuable Research from a Sea of Citations

Add code
Jan 14, 2022
Figure 1 for Phocus: Picking Valuable Research from a Sea of Citations
Figure 2 for Phocus: Picking Valuable Research from a Sea of Citations
Figure 3 for Phocus: Picking Valuable Research from a Sea of Citations
Figure 4 for Phocus: Picking Valuable Research from a Sea of Citations
Viaarxiv icon