Picture for Fandong Meng

Fandong Meng

Personalized Language Model Learning on Text Data Without User Identifiers

Add code
Jan 10, 2025
Viaarxiv icon

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Add code
Dec 23, 2024
Viaarxiv icon

PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension

Add code
Dec 16, 2024
Figure 1 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Figure 2 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Figure 3 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Figure 4 for PunchBench: Benchmarking MLLMs in Multimodal Punchline Comprehension
Viaarxiv icon

Retrieval-Augmented Machine Translation with Unstructured Knowledge

Add code
Dec 05, 2024
Viaarxiv icon

Extralonger: Toward a Unified Perspective of Spatial-Temporal Factors for Extra-Long-Term Traffic Forecasting

Add code
Oct 30, 2024
Viaarxiv icon

CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models

Add code
Oct 28, 2024
Figure 1 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Figure 2 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Figure 3 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Figure 4 for CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models
Viaarxiv icon

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Add code
Oct 22, 2024
Viaarxiv icon

On the token distance modeling ability of higher RoPE attention dimension

Add code
Oct 11, 2024
Figure 1 for On the token distance modeling ability of higher RoPE attention dimension
Figure 2 for On the token distance modeling ability of higher RoPE attention dimension
Figure 3 for On the token distance modeling ability of higher RoPE attention dimension
Figure 4 for On the token distance modeling ability of higher RoPE attention dimension
Viaarxiv icon

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

Add code
Oct 10, 2024
Figure 1 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 2 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 3 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Figure 4 for DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory
Viaarxiv icon

MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation

Add code
Sep 30, 2024
Viaarxiv icon