Picture for Kun Kuang

Kun Kuang

R3HF: Reward Redistribution for Enhancing Reinforcement Learning from Human Feedback

Add code
Nov 13, 2024
Viaarxiv icon

Causality for Large Language Models

Add code
Oct 20, 2024
Viaarxiv icon

Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs

Add code
Oct 02, 2024
Figure 1 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Figure 2 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Figure 3 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Figure 4 for Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs
Viaarxiv icon

Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering

Add code
Sep 24, 2024
Figure 1 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Figure 2 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Figure 3 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Figure 4 for Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering
Viaarxiv icon

RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU

Add code
Sep 09, 2024
Viaarxiv icon

IntOPE: Off-Policy Evaluation in the Presence of Interference

Add code
Aug 24, 2024
Figure 1 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Figure 2 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Figure 3 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Figure 4 for IntOPE: Off-Policy Evaluation in the Presence of Interference
Viaarxiv icon

Semantic Alignment for Multimodal Large Language Models

Add code
Aug 23, 2024
Figure 1 for Semantic Alignment for Multimodal Large Language Models
Figure 2 for Semantic Alignment for Multimodal Large Language Models
Figure 3 for Semantic Alignment for Multimodal Large Language Models
Figure 4 for Semantic Alignment for Multimodal Large Language Models
Viaarxiv icon

Xinyu: An Efficient LLM-based System for Commentary Generation

Add code
Aug 21, 2024
Figure 1 for Xinyu: An Efficient LLM-based System for Commentary Generation
Figure 2 for Xinyu: An Efficient LLM-based System for Commentary Generation
Figure 3 for Xinyu: An Efficient LLM-based System for Commentary Generation
Figure 4 for Xinyu: An Efficient LLM-based System for Commentary Generation
Viaarxiv icon

Leveraging Invariant Principle for Heterophilic Graph Structure Distribution Shifts

Add code
Aug 18, 2024
Viaarxiv icon

Causal Agent based on Large Language Model

Add code
Aug 13, 2024
Viaarxiv icon