Zhihui Wang

Owl-AuraID 1.0: An Intelligent System for Autonomous Scientific Instrumentation and Scientific Data Analysis

Mar 31, 2026
Via arXiv

FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation

Mar 26, 2026
Via arXiv

Towards Governance-Oriented Low-Altitude Intelligence: A Management-Centric Multi-Modal Benchmark With Implicitly Coordinated Vision-Language Reasoning Framework

Jan 27, 2026
Via arXiv

Toward Multi-Fidelity Machine Learning Force Field for Cathode Materials

Nov 14, 2025
Via arXiv

Propagating Sparse Depth via Depth Foundation Model for Out-of-Distribution Depth Completion

Aug 07, 2025
Via arXiv

3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o

Mar 17, 2025
Via arXiv

Commenting Higher-level Code Unit: Full Code, Reduced Code, or Hierarchical Code Summarization

Mar 13, 2025
Via arXiv

SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model

Feb 27, 2025
Via arXiv

Towards Efficient and Intelligent Laser Weeding: Method and Dataset for Weed Stem Detection

Feb 10, 2025
Via arXiv

B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens

Dec 13, 2024
Via arXiv