Picture for Jingjing Xu

Jingjing Xu

Enhancing Auto-regressive Chain-of-Thought through Loop-Aligned Reasoning

Add code
Feb 12, 2025
Viaarxiv icon

Teaching Language Models to Critique via Reinforcement Learning

Add code
Feb 05, 2025
Viaarxiv icon

Efficient Supernet Training with Orthogonal Softmax for Scalable ASR Model Compression

Add code
Jan 31, 2025
Viaarxiv icon

A Unified Hyperparameter Optimization Pipeline for Transformer-Based Time Series Forecasting Models

Add code
Jan 02, 2025
Figure 1 for A Unified Hyperparameter Optimization Pipeline for Transformer-Based Time Series Forecasting Models
Figure 2 for A Unified Hyperparameter Optimization Pipeline for Transformer-Based Time Series Forecasting Models
Figure 3 for A Unified Hyperparameter Optimization Pipeline for Transformer-Based Time Series Forecasting Models
Figure 4 for A Unified Hyperparameter Optimization Pipeline for Transformer-Based Time Series Forecasting Models
Viaarxiv icon

The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model

Add code
Dec 10, 2024
Figure 1 for The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model
Figure 2 for The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model
Figure 3 for The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model
Figure 4 for The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model
Viaarxiv icon

Why Does the Effective Context Length of LLMs Fall Short?

Add code
Oct 24, 2024
Figure 1 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 2 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 3 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 4 for Why Does the Effective Context Length of LLMs Fall Short?
Viaarxiv icon

FAN: Fourier Analysis Networks

Add code
Oct 03, 2024
Figure 1 for FAN: Fourier Analysis Networks
Figure 2 for FAN: Fourier Analysis Networks
Figure 3 for FAN: Fourier Analysis Networks
Figure 4 for FAN: Fourier Analysis Networks
Viaarxiv icon

Survey and Taxonomy: The Role of Data-Centric AI in Transformer-Based Time Series Forecasting

Add code
Jul 29, 2024
Viaarxiv icon

Let the Code LLM Edit Itself When You Edit the Code

Add code
Jul 03, 2024
Viaarxiv icon

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing

Add code
Mar 25, 2024
Figure 1 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 2 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 3 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Figure 4 for An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
Viaarxiv icon