Shuai Zhang

Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking

Feb 04, 2025
Via arXiv

DReSS: Data-driven Regularized Structured Streamlining for Large Language Models

Jan 29, 2025
Via arXiv

Is FISHER All You Need in The Multi-AUV Underwater Target Tracking Task?

Dec 05, 2024
Via arXiv

Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS

Nov 27, 2024
Via arXiv

Multimodal Instruction Tuning with Hybrid State Space Models

Nov 13, 2024
Via arXiv

Unraveling the Gradient Descent Dynamics of Transformers

Nov 12, 2024
Via arXiv

PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms

Oct 05, 2024
Via arXiv

CausalVE: Face Video Privacy Encryption via Causal Video Prediction

Sep 28, 2024
Via arXiv

Dynamic 2D Gaussians: Geometrically accurate radiance fields for dynamic objects

Sep 21, 2024
Via arXiv

LHQ-SVC: Lightweight and High Quality Singing Voice Conversion Modeling

Sep 13, 2024
Via arXiv