Picture for Mengdi Wang

Mengdi Wang

A First-order Generative Bilevel Optimization Framework for Diffusion Models

Add code
Feb 12, 2025
Viaarxiv icon

MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

Add code
Feb 10, 2025
Viaarxiv icon

ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates

Add code
Feb 10, 2025
Viaarxiv icon

ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization

Add code
Feb 06, 2025
Viaarxiv icon

RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework

Add code
Jan 05, 2025
Figure 1 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Figure 2 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Figure 3 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Figure 4 for RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework
Viaarxiv icon

On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures

Add code
Jan 03, 2025
Viaarxiv icon

LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds

Add code
Dec 06, 2024
Figure 1 for LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds
Figure 2 for LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds
Figure 3 for LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds
Figure 4 for LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds
Viaarxiv icon

Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment

Add code
Nov 27, 2024
Figure 1 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 2 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 3 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Figure 4 for Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
Viaarxiv icon

One-Layer Transformer Provably Learns One-Nearest Neighbor In Context

Add code
Nov 16, 2024
Viaarxiv icon

CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR

Add code
Nov 07, 2024
Viaarxiv icon