Picture for Mingda Li

Mingda Li

Quantum Measurement Group, Massachusetts Institute of Technology, Cambridge, MA, USA, Department of Nuclear Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA

Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey

Add code
Nov 14, 2024
Viaarxiv icon

Large Language Model-Guided Prediction Toward Quantum Materials Synthesis

Add code
Oct 28, 2024
Viaarxiv icon

Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue

Add code
Oct 21, 2024
Viaarxiv icon

TRACE: Temporal Grounding Video LLM via Causal Event Modeling

Add code
Oct 08, 2024
Figure 1 for TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Figure 2 for TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Figure 3 for TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Figure 4 for TRACE: Temporal Grounding Video LLM via Causal Event Modeling
Viaarxiv icon

Enhancing Long Video Understanding via Hierarchical Event-Based Memory

Add code
Sep 10, 2024
Figure 1 for Enhancing Long Video Understanding via Hierarchical Event-Based Memory
Figure 2 for Enhancing Long Video Understanding via Hierarchical Event-Based Memory
Figure 3 for Enhancing Long Video Understanding via Hierarchical Event-Based Memory
Figure 4 for Enhancing Long Video Understanding via Hierarchical Event-Based Memory
Viaarxiv icon

TC-LLaVA: Rethinking the Transfer from Image to Video Understanding with Temporal Considerations

Add code
Sep 05, 2024
Viaarxiv icon

Bridging the Language Gap: Enhancing Multilingual Prompt-Based Code Generation in LLMs via Zero-Shot Cross-Lingual Transfer

Add code
Aug 19, 2024
Viaarxiv icon

Structural Constraint Integration in Generative Model for Discovery of Quantum Material Candidates

Add code
Jul 05, 2024
Viaarxiv icon

Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models

Add code
Jun 04, 2024
Figure 1 for Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Figure 2 for Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Figure 3 for Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Figure 4 for Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models
Viaarxiv icon

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

Add code
May 22, 2024
Viaarxiv icon