Picture for Zihao Wang

Zihao Wang

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Add code
Oct 23, 2024
Viaarxiv icon

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Add code
Oct 13, 2024
Figure 1 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 2 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 3 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 4 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Viaarxiv icon

Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models

Add code
Oct 10, 2024
Figure 1 for Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models
Figure 2 for Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models
Figure 3 for Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models
Figure 4 for Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models
Viaarxiv icon

Precise Interception Flight Targets by Image-based Visual Servoing of Multicopter

Add code
Sep 26, 2024
Viaarxiv icon

Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception

Add code
Sep 10, 2024
Figure 1 for Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Figure 2 for Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Figure 3 for Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Figure 4 for Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
Viaarxiv icon

MetaBGM: Dynamic Soundtrack Transformation For Continuous Multi-Scene Experiences With Ambient Awareness And Personalization

Add code
Sep 05, 2024
Viaarxiv icon

PSNE: Efficient Spectral Sparsification Algorithms for Scaling Network Embedding

Add code
Aug 05, 2024
Figure 1 for PSNE: Efficient Spectral Sparsification Algorithms for Scaling Network Embedding
Figure 2 for PSNE: Efficient Spectral Sparsification Algorithms for Scaling Network Embedding
Figure 3 for PSNE: Efficient Spectral Sparsification Algorithms for Scaling Network Embedding
Figure 4 for PSNE: Efficient Spectral Sparsification Algorithms for Scaling Network Embedding
Viaarxiv icon

A UAV-Enabled Time-Sensitive Data Collection Scheme for Grassland Monitoring Edge Networks

Add code
Jul 30, 2024
Viaarxiv icon

SaMoye: Zero-shot Singing Voice Conversion Based on Feature Disentanglement and Synthesis

Add code
Jul 11, 2024
Viaarxiv icon

MuDiT & MuSiT: Alignment with Colloquial Expression in Description-to-Song Generation

Add code
Jul 03, 2024
Viaarxiv icon