Picture for Zicheng Liu

Zicheng Liu

Agent Laboratory: Using LLM Agents as Research Assistants

Add code
Jan 08, 2025
Figure 1 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 2 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 3 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 4 for Agent Laboratory: Using LLM Agents as Research Assistants
Viaarxiv icon

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

Add code
Dec 14, 2024
Viaarxiv icon

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens

Add code
Dec 13, 2024
Figure 1 for B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
Figure 2 for B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
Figure 3 for B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
Figure 4 for B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens
Viaarxiv icon

Conditional Text-to-Image Generation with Reference Guidance

Add code
Nov 22, 2024
Viaarxiv icon

Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning

Add code
Oct 08, 2024
Figure 1 for Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Figure 2 for Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Figure 3 for Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Figure 4 for Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Viaarxiv icon

Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization

Add code
Oct 04, 2024
Figure 1 for Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Figure 2 for Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Figure 3 for Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Figure 4 for Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Viaarxiv icon

A Survey on Mixup Augmentations and Beyond

Add code
Sep 08, 2024
Figure 1 for A Survey on Mixup Augmentations and Beyond
Figure 2 for A Survey on Mixup Augmentations and Beyond
Figure 3 for A Survey on Mixup Augmentations and Beyond
Figure 4 for A Survey on Mixup Augmentations and Beyond
Viaarxiv icon

AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition

Add code
Aug 21, 2024
Figure 1 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 2 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 3 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Figure 4 for AutoDirector: Online Auto-scheduling Agents for Multi-sensory Composition
Viaarxiv icon

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Add code
Aug 01, 2024
Figure 1 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 2 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 3 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 4 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Viaarxiv icon

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Add code
Jul 15, 2024
Figure 1 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 2 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 3 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Figure 4 for IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation
Viaarxiv icon