Picture for Mingkui Tan

Mingkui Tan

Nanyang Technological University

Generating Long-form Story Using Dynamic Hierarchical Outlining with Memory-Enhancement

Add code
Dec 18, 2024
Viaarxiv icon

Core Context Aware Attention for Long Context Language Modeling

Add code
Dec 17, 2024
Viaarxiv icon

Adversarial Purification by Consistency-aware Latent Space Optimization on Data Manifolds

Add code
Dec 11, 2024
Viaarxiv icon

Dynamic Ensemble Reasoning for LLM Experts

Add code
Dec 10, 2024
Figure 1 for Dynamic Ensemble Reasoning for LLM Experts
Figure 2 for Dynamic Ensemble Reasoning for LLM Experts
Figure 3 for Dynamic Ensemble Reasoning for LLM Experts
Figure 4 for Dynamic Ensemble Reasoning for LLM Experts
Viaarxiv icon

Towards Long Video Understanding via Fine-detailed Video Story Generation

Add code
Dec 09, 2024
Figure 1 for Towards Long Video Understanding via Fine-detailed Video Story Generation
Figure 2 for Towards Long Video Understanding via Fine-detailed Video Story Generation
Figure 3 for Towards Long Video Understanding via Fine-detailed Video Story Generation
Figure 4 for Towards Long Video Understanding via Fine-detailed Video Story Generation
Viaarxiv icon

LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences

Add code
Dec 02, 2024
Viaarxiv icon

Enhancing Perception Capabilities of Multimodal LLMs with Training-free Fusion

Add code
Dec 02, 2024
Viaarxiv icon

A Cross-Scene Benchmark for Open-World Drone Active Tracking

Add code
Dec 01, 2024
Viaarxiv icon

Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation

Add code
Nov 19, 2024
Figure 1 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Figure 2 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Figure 3 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Figure 4 for Robust 3D Semantic Occupancy Prediction with Calibration-free Spatial Transformation
Viaarxiv icon

Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs

Add code
Sep 27, 2024
Viaarxiv icon