Mang Wang

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning

Nov 06, 2024

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Nov 05, 2024

Baichuan Alignment Technical Report

Oct 19, 2024

RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation

Jun 18, 2024

Make Continual Learning Stronger via C-Flat

Apr 01, 2024

Baichuan 2: Open Large-scale Language Models

Sep 20, 2023

Progressive Learning without Forgetting

Nov 28, 2022

Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation

Apr 05, 2022

Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics

Feb 01, 2022

Response-based Distillation for Incremental Object Detection

Oct 26, 2021