Picture for Zhenghao Lin

Zhenghao Lin

Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability

Add code
Feb 02, 2026
Viaarxiv icon

Sigma-MoE-Tiny Technical Report

Add code
Dec 19, 2025
Figure 1 for Sigma-MoE-Tiny Technical Report
Figure 2 for Sigma-MoE-Tiny Technical Report
Figure 3 for Sigma-MoE-Tiny Technical Report
Figure 4 for Sigma-MoE-Tiny Technical Report
Viaarxiv icon

SIGMA: An AI-Empowered Training Stack on Early-Life Hardware

Add code
Dec 15, 2025
Figure 1 for SIGMA: An AI-Empowered Training Stack on Early-Life Hardware
Figure 2 for SIGMA: An AI-Empowered Training Stack on Early-Life Hardware
Figure 3 for SIGMA: An AI-Empowered Training Stack on Early-Life Hardware
Figure 4 for SIGMA: An AI-Empowered Training Stack on Early-Life Hardware
Viaarxiv icon

Generalized Category Discovery in Event-Centric Contexts: Latent Pattern Mining with LLMs

Add code
May 29, 2025
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

Collaborative Optimization in Financial Data Mining Through Deep Learning and ResNeXt

Add code
Dec 23, 2024
Figure 1 for Collaborative Optimization in Financial Data Mining Through Deep Learning and ResNeXt
Figure 2 for Collaborative Optimization in Financial Data Mining Through Deep Learning and ResNeXt
Viaarxiv icon

Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems

Add code
Dec 13, 2024
Figure 1 for Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems
Figure 2 for Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems
Figure 3 for Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems
Figure 4 for Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems
Viaarxiv icon

Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation

Add code
Sep 05, 2024
Figure 1 for Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation
Figure 2 for Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation
Figure 3 for Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation
Figure 4 for Revolutionizing Database Q&A with Large Language Models: Comprehensive Benchmark and Evaluation
Viaarxiv icon

Rho-1: Not All Tokens Are What You Need

Add code
Apr 11, 2024
Figure 1 for Rho-1: Not All Tokens Are What You Need
Figure 2 for Rho-1: Not All Tokens Are What You Need
Figure 3 for Rho-1: Not All Tokens Are What You Need
Figure 4 for Rho-1: Not All Tokens Are What You Need
Viaarxiv icon

Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models

Add code
Mar 23, 2024
Viaarxiv icon