Picture for Simin Fan

Simin Fan

HyperINF: Unleashing the HyperPower of the Schulz's Method for Data Influence Estimation

Add code
Oct 07, 2024
Viaarxiv icon

Dynamic Gradient Alignment for Online Data Mixing

Add code
Oct 03, 2024
Viaarxiv icon

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

Add code
Aug 07, 2024
Viaarxiv icon

Deep Grokking: Would Deep Neural Networks Generalize Better?

Add code
May 29, 2024
Viaarxiv icon

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

Add code
Nov 27, 2023
Figure 1 for MEDITRON-70B: Scaling Medical Pretraining for Large Language Models
Figure 2 for MEDITRON-70B: Scaling Medical Pretraining for Large Language Models
Figure 3 for MEDITRON-70B: Scaling Medical Pretraining for Large Language Models
Figure 4 for MEDITRON-70B: Scaling Medical Pretraining for Large Language Models
Viaarxiv icon

Irreducible Curriculum for Language Model Pretraining

Add code
Oct 23, 2023
Viaarxiv icon

DoGE: Domain Reweighting with Generalization Estimation

Add code
Oct 23, 2023
Viaarxiv icon

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs

Add code
Apr 30, 2022
Figure 1 for Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Figure 2 for Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Figure 3 for Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Figure 4 for Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Viaarxiv icon