Picture for Zirui Wang

Zirui Wang

Learning Humanoid Locomotion with Perceptive Internal Model

Add code
Nov 21, 2024
Viaarxiv icon

HiMemFormer: Hierarchical Memory-Aware Transformer for Multi-Agent Action Anticipation

Add code
Nov 03, 2024
Viaarxiv icon

MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning

Add code
Sep 30, 2024
Viaarxiv icon

Quantum Machine Learning for Semiconductor Fabrication: Modeling GaN HEMT Contact Process

Add code
Sep 17, 2024
Figure 1 for Quantum Machine Learning for Semiconductor Fabrication: Modeling GaN HEMT Contact Process
Figure 2 for Quantum Machine Learning for Semiconductor Fabrication: Modeling GaN HEMT Contact Process
Figure 3 for Quantum Machine Learning for Semiconductor Fabrication: Modeling GaN HEMT Contact Process
Figure 4 for Quantum Machine Learning for Semiconductor Fabrication: Modeling GaN HEMT Contact Process
Viaarxiv icon

Semantic-Guided Multimodal Sentiment Decoding with Adversarial Temporal-Invariant Learning

Add code
Sep 11, 2024
Figure 1 for Semantic-Guided Multimodal Sentiment Decoding with Adversarial Temporal-Invariant Learning
Figure 2 for Semantic-Guided Multimodal Sentiment Decoding with Adversarial Temporal-Invariant Learning
Figure 3 for Semantic-Guided Multimodal Sentiment Decoding with Adversarial Temporal-Invariant Learning
Figure 4 for Semantic-Guided Multimodal Sentiment Decoding with Adversarial Temporal-Invariant Learning
Viaarxiv icon

Robust Temporal-Invariant Learning in Multimodal Disentanglement

Add code
Aug 30, 2024
Figure 1 for Robust Temporal-Invariant Learning in Multimodal Disentanglement
Figure 2 for Robust Temporal-Invariant Learning in Multimodal Disentanglement
Figure 3 for Robust Temporal-Invariant Learning in Multimodal Disentanglement
Figure 4 for Robust Temporal-Invariant Learning in Multimodal Disentanglement
Viaarxiv icon

CatFree3D: Category-agnostic 3D Object Detection with Diffusion

Add code
Aug 22, 2024
Figure 1 for CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Figure 2 for CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Figure 3 for CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Figure 4 for CatFree3D: Category-agnostic 3D Object Detection with Diffusion
Viaarxiv icon

GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting

Add code
Aug 20, 2024
Viaarxiv icon

ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities

Add code
Aug 08, 2024
Figure 1 for ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
Figure 2 for ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
Figure 3 for ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
Figure 4 for ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon