Picture for Dawei Feng

Dawei Feng

Exploring structure diversity in atomic resolution microscopy with graph neural networks

Add code
Oct 23, 2024
Viaarxiv icon

AutoFeedback: An LLM-based Framework for Efficient and Accurate API Request Generation

Add code
Oct 09, 2024
Viaarxiv icon

Online Self-Preferring Language Models

Add code
May 23, 2024
Figure 1 for Online Self-Preferring Language Models
Figure 2 for Online Self-Preferring Language Models
Figure 3 for Online Self-Preferring Language Models
Figure 4 for Online Self-Preferring Language Models
Viaarxiv icon

IGOT: Information Gain Optimized Tokenizer on Domain Adaptive Pretraining

Add code
May 16, 2024
Viaarxiv icon

Optimistic Model Rollouts for Pessimistic Offline Policy Optimization

Add code
Jan 11, 2024
Viaarxiv icon

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles

Add code
Dec 30, 2023
Viaarxiv icon

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning

Add code
Aug 24, 2022
Figure 1 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Figure 2 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Figure 3 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Figure 4 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Viaarxiv icon

Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration

Add code
Aug 24, 2022
Figure 1 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Figure 2 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Figure 3 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Figure 4 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Viaarxiv icon

Nuclear Norm Maximization Based Curiosity-Driven Learning

Add code
May 28, 2022
Figure 1 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Figure 2 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Figure 3 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Figure 4 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Viaarxiv icon

FINT: Field-aware INTeraction Neural Network For CTR Prediction

Add code
Jul 30, 2021
Figure 1 for FINT: Field-aware INTeraction Neural Network For CTR Prediction
Figure 2 for FINT: Field-aware INTeraction Neural Network For CTR Prediction
Figure 3 for FINT: Field-aware INTeraction Neural Network For CTR Prediction
Figure 4 for FINT: Field-aware INTeraction Neural Network For CTR Prediction
Viaarxiv icon