Picture for Bingheng Wu

Bingheng Wu

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Add code
Dec 16, 2024
Viaarxiv icon

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Add code
Jul 25, 2024
Viaarxiv icon

OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser

Add code
Jun 25, 2024
Figure 1 for OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser
Figure 2 for OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser
Figure 3 for OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser
Figure 4 for OTCE: Hybrid SSM and Attention with Cross Domain Mixture of Experts to construct Observer-Thinker-Conceiver-Expresser
Viaarxiv icon