Zuxuan Wu

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Oct 27, 2024

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation

Sep 11, 2024

GenRec: Unifying Video Generation and Recognition with Diffusion Models

Aug 27, 2024

Downstream Transfer Attack: Adversarial Attacks on Downstream Models with Pre-trained Vision Transformers

Aug 03, 2024

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Jun 17, 2024

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Jun 13, 2024

OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation

Jun 13, 2024

Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation

Jun 11, 2024

AutoTVG: A New Vision-language Pre-training Paradigm for Temporal Video Grounding

Jun 11, 2024

AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction

Jun 10, 2024