Picture for Yuliang Liu

Yuliang Liu

R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal Models

Add code
Oct 23, 2024
Viaarxiv icon

PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

Add code
Oct 08, 2024
Viaarxiv icon

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Add code
Sep 04, 2024
Viaarxiv icon

Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models

Add code
Aug 09, 2024
Viaarxiv icon

Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping

Add code
Aug 04, 2024
Viaarxiv icon

Multi-Prompting Decoder Helps Better Language Understanding

Add code
Jun 10, 2024
Viaarxiv icon

MoE Jetpack: From Dense Checkpoints to Adaptive Mixture of Experts for Vision Tasks

Add code
Jun 07, 2024
Viaarxiv icon

Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction

Add code
Jun 05, 2024
Figure 1 for Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
Figure 2 for Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
Figure 3 for Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
Figure 4 for Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction
Viaarxiv icon

Deciphering Oracle Bone Language with Diffusion Models

Add code
Jun 02, 2024
Viaarxiv icon

Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering

Add code
May 21, 2024
Viaarxiv icon