Picture for Mingkui Tan

Mingkui Tan

Nanyang Technological University

Generating Long-form Story Using Dynamic Hierarchical Outlining with Memory-Enhancement

Add code
Dec 18, 2024
Viaarxiv icon

Core Context Aware Attention for Long Context Language Modeling

Add code
Dec 17, 2024
Viaarxiv icon

Adversarial Purification by Consistency-aware Latent Space Optimization on Data Manifolds

Add code
Dec 11, 2024
Viaarxiv icon

Dynamic Ensemble Reasoning for LLM Experts

Add code
Dec 10, 2024
Viaarxiv icon

Towards Long Video Understanding via Fine-detailed Video Story Generation

Add code
Dec 09, 2024
Viaarxiv icon

LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences

Add code
Dec 02, 2024
Viaarxiv icon

Enhancing Perception Capabilities of Multimodal LLMs with Training-free Fusion

Add code
Dec 02, 2024
Viaarxiv icon

A Cross-Scene Benchmark for Open-World Drone Active Tracking

Add code
Dec 01, 2024
Viaarxiv icon

Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation

Add code
Nov 19, 2024
Figure 1 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Figure 2 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Figure 3 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Figure 4 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Viaarxiv icon

Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs

Add code
Sep 27, 2024
Viaarxiv icon