Picture for Zun Wang

Zun Wang

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Add code
Jul 17, 2024
Viaarxiv icon

Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models

Add code
Jul 09, 2024
Viaarxiv icon

Infusing Self-Consistency into Density Functional Theory Hamiltonian Prediction via Deep Equilibrium Models

Add code
Jun 06, 2024
Viaarxiv icon

SE3Set: Harnessing equivariant hypergraph neural networks for molecular representation learning

Add code
May 26, 2024
Viaarxiv icon

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Add code
Mar 22, 2024
Viaarxiv icon

Self-Consistency Training for Hamiltonian Prediction

Add code
Mar 14, 2024
Viaarxiv icon

Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey

Add code
Mar 05, 2024
Viaarxiv icon

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Add code
Dec 03, 2023
Viaarxiv icon

Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields

Add code
Aug 11, 2023
Figure 1 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Figure 2 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Figure 3 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Figure 4 for Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields
Viaarxiv icon

Scaling Data Generation in Vision-and-Language Navigation

Add code
Aug 09, 2023
Viaarxiv icon