Picture for Li Shen

Li Shen

Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning

Add code
Dec 30, 2025
Viaarxiv icon

Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing

Add code
Nov 16, 2025
Figure 1 for Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing
Figure 2 for Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing
Figure 3 for Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing
Figure 4 for Reason-KE++: Aligning the Process, Not Just the Outcome, for Faithful LLM Knowledge Editing
Viaarxiv icon

Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering

Add code
Nov 14, 2025
Figure 1 for Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering
Figure 2 for Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering
Figure 3 for Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering
Figure 4 for Hindsight Distillation Reasoning with Knowledge Encouragement Preference for Knowledge-based Visual Question Answering
Viaarxiv icon

fastbmRAG: A Fast Graph-Based RAG Framework for Efficient Processing of Large-Scale Biomedical Literature

Add code
Nov 13, 2025
Viaarxiv icon

Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models

Add code
Nov 13, 2025
Figure 1 for Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models
Figure 2 for Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models
Figure 3 for Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models
Figure 4 for Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models
Viaarxiv icon

Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler

Add code
Oct 31, 2025
Viaarxiv icon

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

Add code
Oct 31, 2025
Figure 1 for Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
Figure 2 for Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
Figure 3 for Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
Figure 4 for Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
Viaarxiv icon

Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training

Add code
Oct 09, 2025
Viaarxiv icon

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

Add code
Sep 26, 2025
Figure 1 for Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
Figure 2 for Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
Figure 3 for Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
Figure 4 for Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives
Viaarxiv icon

Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

Add code
Sep 08, 2025
Viaarxiv icon