Carole-Jean Wu

Revisiting Reliability in Large-Scale Machine Learning Research Clusters

Oct 29, 2024
Via arXiv

Beyond Efficiency: Scaling AI Sustainably

Jun 08, 2024
Via arXiv

Is Flash Attention Stable?

May 05, 2024
Via arXiv

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Apr 29, 2024
Via arXiv

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Apr 18, 2024
Via arXiv

Croissant: A Metadata Format for ML-Ready Datasets

Mar 28, 2024
Via arXiv

CHAI: Clustered Head Attention for Efficient LLM Inference

Mar 12, 2024
Via arXiv

HeteroSwitch: Characterizing and Taming System-Induced Data Heterogeneity in Federated Learning

Mar 07, 2024
Via arXiv

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Dec 22, 2023
Via arXiv

Decoding Data Quality via Synthetic Corruptions: Embedding-guided Pruning of Code Data

Dec 05, 2023
Via arXiv