Picture for Jun Zhu

Jun Zhu

Tsinghua University

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Add code
Nov 14, 2024
Figure 1 for LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Figure 2 for LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Figure 3 for LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Figure 4 for LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models
Viaarxiv icon

MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue

Add code
Nov 06, 2024
Figure 1 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 2 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 3 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 4 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Viaarxiv icon

ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation

Add code
Nov 04, 2024
Figure 1 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Figure 2 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Figure 3 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Figure 4 for ManiBox: Enhancing Spatial Grasping Generalization via Scalable Simulation Data Generation
Viaarxiv icon

Localization, balance and affinity: a stronger multifaceted collaborative salient object detector in remote sensing images

Add code
Oct 31, 2024
Viaarxiv icon

Consistency Diffusion Bridge Models

Add code
Oct 31, 2024
Viaarxiv icon

Decentralized Hybrid Precoding for Massive MU-MIMO ISAC

Add code
Oct 21, 2024
Viaarxiv icon

FrameBridge: Improving Image-to-Video Generation with Bridge Models

Add code
Oct 20, 2024
Viaarxiv icon

Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation

Add code
Oct 17, 2024
Figure 1 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Figure 2 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Figure 3 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Figure 4 for Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
Viaarxiv icon

Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment

Add code
Oct 12, 2024
Figure 1 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Figure 2 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Figure 3 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Figure 4 for Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment
Viaarxiv icon

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Add code
Oct 10, 2024
Figure 1 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Figure 2 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Figure 3 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Figure 4 for RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
Viaarxiv icon