Xuezhe Ma

AIDE: Agentically Improve Visual Language Model with Domain Experts

Feb 13, 2025

MegaCOIN: Enhancing Medium-Grained Color Perception for Vision-Language Models

Dec 05, 2024

PatentEdits: Framing Patent Novelty as Textual Entailment

Nov 20, 2024

DecoPrompt: Decoding Prompts Reduces Hallucinations when Large Language Models Meet False Premises

Nov 12, 2024

LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems

Oct 18, 2024

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Aug 20, 2024

Towards Chapter-to-Chapter Context-Aware Literary Translation via Large Language Models

Jul 12, 2024

Light-weight Fine-tuning Method for Defending Adversarial Noise in Pre-trained Medical Vision-Language Models

Jul 02, 2024

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Apr 12, 2024

Evaluating Large Language Models on Controlled Generation Tasks

Oct 23, 2023