Wenyu Du

Learning from Failures in Multi-Attempt Reinforcement Learning

Mar 04, 2025
Via arXiv

Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Feb 27, 2025
Via arXiv

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis

Feb 17, 2025
Via arXiv

Unlocking Continual Learning Abilities in Language Models

Jun 25, 2024
Via arXiv

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

May 24, 2024
Via arXiv

m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers

Feb 26, 2024
Via arXiv

f-Divergence Minimization for Sequence-Level Knowledge Distillation

Jul 27, 2023
Via arXiv

Graphix-T5: Mixing Pre-Trained Transformers with Graph-Aware Layers for Text-to-SQL Parsing

Jan 18, 2023
Via arXiv

Optimizing Stock Option Forecasting with the Assembly of Machine Learning Models and Improved Trading Strategies

Nov 29, 2022
Via arXiv

Application of Convolutional Neural Networks with Quasi-Reversibility Method Results for Option Forecasting

Aug 25, 2022
Via arXiv