Chuan Wu

The University of Hong Kong

FedMoE-DA: Federated Mixture of Experts via Domain Aware Fine-grained Aggregation

Nov 04, 2024

Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration

Oct 22, 2024

ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom

Oct 18, 2024

QSpec: Speculative Decoding with Complementary Quantization Schemes

Oct 15, 2024

MoS: Unleashing Parameter Efficiency of Low-Rank Adaptation with Mixture of Shards

Oct 01, 2024

HybridFlow: A Flexible and Efficient RLHF Framework

Sep 28, 2024

How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models

Aug 29, 2024

ByteCheckpoint: A Unified Checkpointing System for LLM Development

Jul 29, 2024

QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices

Jul 02, 2024

Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting

Jun 24, 2024