Picture for Hongxia Yang

Hongxia Yang

InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Add code
Jan 08, 2025
Viaarxiv icon

Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning

Add code
Jan 06, 2025
Viaarxiv icon

InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion

Add code
Jan 06, 2025
Figure 1 for InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion
Viaarxiv icon

Autoregressive Models in Vision: A Survey

Add code
Nov 08, 2024
Figure 1 for Autoregressive Models in Vision: A Survey
Figure 2 for Autoregressive Models in Vision: A Survey
Figure 3 for Autoregressive Models in Vision: A Survey
Figure 4 for Autoregressive Models in Vision: A Survey
Viaarxiv icon

DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Add code
Oct 24, 2024
Figure 1 for DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Figure 2 for DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Figure 3 for DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Figure 4 for DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
Viaarxiv icon

Unconstrained Model Merging for Enhanced LLM Reasoning

Add code
Oct 17, 2024
Figure 1 for Unconstrained Model Merging for Enhanced LLM Reasoning
Figure 2 for Unconstrained Model Merging for Enhanced LLM Reasoning
Figure 3 for Unconstrained Model Merging for Enhanced LLM Reasoning
Figure 4 for Unconstrained Model Merging for Enhanced LLM Reasoning
Viaarxiv icon

BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data

Add code
Oct 01, 2024
Figure 1 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 2 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 3 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Figure 4 for BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
Viaarxiv icon

Law of Vision Representation in MLLMs

Add code
Aug 29, 2024
Figure 1 for Law of Vision Representation in MLLMs
Figure 2 for Law of Vision Representation in MLLMs
Figure 3 for Law of Vision Representation in MLLMs
Figure 4 for Law of Vision Representation in MLLMs
Viaarxiv icon

Retrieval-Augmented Mixture of LoRA Experts for Uploadable Machine Learning

Add code
Jun 24, 2024
Viaarxiv icon

Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems

Add code
May 31, 2024
Viaarxiv icon