Picture for Wenqi Shao

Wenqi Shao

$\textbf{EMOS}$: $\textbf{E}$mbodiment-aware Heterogeneous $\textbf{M}$ulti-robot $\textbf{O}$perating $\textbf{S}$ystem with LLM Agents

Add code
Oct 30, 2024
Viaarxiv icon

TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts

Add code
Oct 23, 2024
Viaarxiv icon

ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification and KV Cache Compression

Add code
Oct 11, 2024
Viaarxiv icon

Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping

Add code
Oct 11, 2024
Figure 1 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 2 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 3 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Figure 4 for Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Viaarxiv icon

DCP: Learning Accelerator Dataflow for Neural Network via Propagation

Add code
Oct 09, 2024
Figure 1 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Figure 2 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Figure 3 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Figure 4 for DCP: Learning Accelerator Dataflow for Neural Network via Propagation
Viaarxiv icon

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Add code
Oct 07, 2024
Figure 1 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 2 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 3 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Figure 4 for PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs
Viaarxiv icon

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Add code
Oct 07, 2024
Viaarxiv icon

HRVMamba: High-Resolution Visual State Space Model for Dense Prediction

Add code
Oct 04, 2024
Viaarxiv icon

Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing

Add code
Aug 23, 2024
Viaarxiv icon

HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

Add code
Aug 18, 2024
Viaarxiv icon