Picture for Jiaxing Huang

Jiaxing Huang

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Add code
Mar 17, 2025
Viaarxiv icon

Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence

Add code
Feb 21, 2025
Figure 1 for Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence
Figure 2 for Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence
Viaarxiv icon

Reasoning with Reinforced Functional Token Tuning

Add code
Feb 19, 2025
Viaarxiv icon

PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning

Add code
Feb 17, 2025
Viaarxiv icon

Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation

Add code
Jan 30, 2025
Figure 1 for Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Figure 2 for Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Figure 3 for Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Figure 4 for Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation
Viaarxiv icon

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Add code
Dec 24, 2024
Figure 1 for Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Figure 2 for Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Figure 3 for Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Figure 4 for Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Viaarxiv icon

SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing

Add code
Nov 28, 2024
Viaarxiv icon

A Survey on Vision Autoregressive Model

Add code
Nov 13, 2024
Viaarxiv icon

Historical Test-time Prompt Tuning for Vision Foundation Models

Add code
Oct 27, 2024
Figure 1 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 2 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 3 for Historical Test-time Prompt Tuning for Vision Foundation Models
Figure 4 for Historical Test-time Prompt Tuning for Vision Foundation Models
Viaarxiv icon

Open-Vocabulary Object Detection via Language Hierarchy

Add code
Oct 27, 2024
Figure 1 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 2 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 3 for Open-Vocabulary Object Detection via Language Hierarchy
Figure 4 for Open-Vocabulary Object Detection via Language Hierarchy
Viaarxiv icon