Picture for Zhirong Yang

Zhirong Yang

Aalto University

MaTrRec: Uniting Mamba and Transformer for Sequential Recommendation

Add code
Jul 27, 2024
Viaarxiv icon

Self-Distillation Improves DNA Sequence Inference

Add code
May 14, 2024
Viaarxiv icon

NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian

Add code
Dec 03, 2023
Viaarxiv icon

ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths

Add code
Jun 12, 2022
Figure 1 for ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
Figure 2 for ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
Figure 3 for ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
Figure 4 for ChordMixer: A Scalable Neural Attention Model for Sequences with Different Lengths
Viaarxiv icon

Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention

Add code
Apr 22, 2022
Figure 1 for Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention
Figure 2 for Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention
Figure 3 for Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention
Figure 4 for Paramixer: Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention
Viaarxiv icon

Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks

Add code
Jan 06, 2022
Figure 1 for Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Figure 2 for Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Figure 3 for Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Figure 4 for Classification of Long Sequential Data using Circular Dilated Convolutional Neural Networks
Viaarxiv icon

T-SNE Is Not Optimized to Reveal Clusters in Data

Add code
Oct 06, 2021
Figure 1 for T-SNE Is Not Optimized to Reveal Clusters in Data
Figure 2 for T-SNE Is Not Optimized to Reveal Clusters in Data
Figure 3 for T-SNE Is Not Optimized to Reveal Clusters in Data
Figure 4 for T-SNE Is Not Optimized to Reveal Clusters in Data
Viaarxiv icon

Sparse Factorization of Large Square Matrices

Add code
Sep 16, 2021
Figure 1 for Sparse Factorization of Large Square Matrices
Figure 2 for Sparse Factorization of Large Square Matrices
Figure 3 for Sparse Factorization of Large Square Matrices
Figure 4 for Sparse Factorization of Large Square Matrices
Viaarxiv icon

Stochastic Cluster Embedding

Add code
Aug 18, 2021
Figure 1 for Stochastic Cluster Embedding
Figure 2 for Stochastic Cluster Embedding
Figure 3 for Stochastic Cluster Embedding
Figure 4 for Stochastic Cluster Embedding
Viaarxiv icon

Word Embedding based on Low-Rank Doubly Stochastic Matrix Decomposition

Add code
Dec 12, 2018
Figure 1 for Word Embedding based on Low-Rank Doubly Stochastic Matrix Decomposition
Figure 2 for Word Embedding based on Low-Rank Doubly Stochastic Matrix Decomposition
Figure 3 for Word Embedding based on Low-Rank Doubly Stochastic Matrix Decomposition
Viaarxiv icon