Picture for Ji Ma

Ji Ma

Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models

Add code
Dec 09, 2024
Viaarxiv icon

Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games

Add code
Oct 28, 2024
Viaarxiv icon

Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution

Add code
Aug 10, 2024
Viaarxiv icon

AHMF: Adaptive Hybrid-Memory-Fusion Model for Driver Attention Prediction

Add code
Jul 24, 2024
Viaarxiv icon

CAMON: Cooperative Agents for Multi-Object Navigation with LLM-based Conversations

Add code
Jun 30, 2024
Viaarxiv icon

C3L: Content Correlated Vision-Language Instruction Tuning Data Generation via Contrastive Learning

Add code
May 21, 2024
Viaarxiv icon

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Add code
Apr 29, 2024
Viaarxiv icon

ASPIRe: An Informative Trajectory Planner with Mutual Information Approximation for Target Search and Tracking

Add code
Mar 04, 2024
Viaarxiv icon

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments

Add code
Feb 29, 2024
Viaarxiv icon

VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model

Add code
Jan 05, 2024
Viaarxiv icon