Wei Niu

WISE-Flow: Workflow-Induced Structured Experience for Self-Evolving Conversational Service Agents

Jan 13, 2026
Via arXiv

From Bits to Chips: An LLM-based Hardware-Aware Quantization Agent for Streamlined Deployment of LLMs

Jan 07, 2026
Via arXiv

OUSAC: Optimized Guidance Scheduling with Adaptive Caching for DiT Acceleration

Dec 16, 2025
Via arXiv

TSLA: A Task-Specific Learning Adaptation for Semantic Segmentation on Autonomous Vehicles Platform

Aug 17, 2025
Via arXiv

RCR-Router: Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory

Aug 06, 2025
Via arXiv

Structured Agent Distillation for Large Language Model

May 20, 2025
Via arXiv

QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge

Mar 20, 2025
Via arXiv

Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings

Mar 07, 2025
Via arXiv

RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation

Jan 08, 2025
Via arXiv

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Dec 17, 2024
Via arXiv