Picture for Zili Wang

Zili Wang

Continuous Speculative Decoding for Autoregressive Image Generation

Add code
Nov 18, 2024
Viaarxiv icon

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Add code
Nov 07, 2024
Figure 1 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 2 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 3 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 4 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Viaarxiv icon

BoxMap: Efficient Structural Mapping and Navigation

Add code
Oct 08, 2024
Figure 1 for BoxMap: Efficient Structural Mapping and Navigation
Figure 2 for BoxMap: Efficient Structural Mapping and Navigation
Figure 3 for BoxMap: Efficient Structural Mapping and Navigation
Figure 4 for BoxMap: Efficient Structural Mapping and Navigation
Viaarxiv icon

Post-hoc Reward Calibration: A Case Study on Length Bias

Add code
Sep 25, 2024
Viaarxiv icon

Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis

Add code
Sep 10, 2024
Figure 1 for Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis
Figure 2 for Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis
Figure 3 for Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis
Figure 4 for Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis
Viaarxiv icon

Layerwise Recurrent Router for Mixture-of-Experts

Add code
Aug 13, 2024
Figure 1 for Layerwise Recurrent Router for Mixture-of-Experts
Figure 2 for Layerwise Recurrent Router for Mixture-of-Experts
Figure 3 for Layerwise Recurrent Router for Mixture-of-Experts
Figure 4 for Layerwise Recurrent Router for Mixture-of-Experts
Viaarxiv icon

AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation

Add code
Aug 03, 2024
Figure 1 for AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Figure 2 for AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Figure 3 for AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Figure 4 for AVESFormer: Efficient Transformer Design for Real-Time Audio-Visual Segmentation
Viaarxiv icon

DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training

Add code
Aug 01, 2024
Figure 1 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Figure 2 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Figure 3 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Figure 4 for DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training
Viaarxiv icon

R3D-AD: Reconstruction via Diffusion for 3D Anomaly Detection

Add code
Jul 15, 2024
Viaarxiv icon

A Closer Look into Mixture-of-Experts in Large Language Models

Add code
Jun 26, 2024
Figure 1 for A Closer Look into Mixture-of-Experts in Large Language Models
Figure 2 for A Closer Look into Mixture-of-Experts in Large Language Models
Figure 3 for A Closer Look into Mixture-of-Experts in Large Language Models
Figure 4 for A Closer Look into Mixture-of-Experts in Large Language Models
Viaarxiv icon