Minlie Huang

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Oct 22, 2024
Via arXiv

BlackDAN: A Black-Box Multi-Objective Approach for Effective and Contextual Jailbreaking of Large Language Models

Oct 13, 2024
Via arXiv

Data Selection via Optimal Control for Language Models

Oct 09, 2024
Via arXiv

Seeker: Enhancing Exception Handling in Code with LLM-based Multi-Agent Approach

Oct 09, 2024
Via arXiv

LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models

Sep 05, 2024
Via arXiv

How Well Do Large Language Models Serve as End-to-End Secure Code Producers?

Aug 20, 2024
Via arXiv

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules

Jul 09, 2024
Via arXiv

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

Jul 04, 2024
Via arXiv

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Jul 03, 2024
Via arXiv

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Jun 24, 2024
Via arXiv