Picture for Su Zhu

Su Zhu

Reducing Tool Hallucination via Reliability Alignment

Add code
Dec 05, 2024
Figure 1 for Reducing Tool Hallucination via Reliability Alignment
Figure 2 for Reducing Tool Hallucination via Reliability Alignment
Figure 3 for Reducing Tool Hallucination via Reliability Alignment
Figure 4 for Reducing Tool Hallucination via Reliability Alignment
Viaarxiv icon

Compressing KV Cache for Long-Context LLM Inference with Inter-Layer Attention Similarity

Add code
Dec 03, 2024
Viaarxiv icon

SciDFM: A Large Language Model with Mixture-of-Experts for Science

Add code
Sep 27, 2024
Figure 1 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Figure 2 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Figure 3 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Figure 4 for SciDFM: A Large Language Model with Mixture-of-Experts for Science
Viaarxiv icon

Evolving Subnetwork Training for Large Language Models

Add code
Jun 11, 2024
Viaarxiv icon

Sparsity-Accelerated Training for Large Language Models

Add code
Jun 03, 2024
Viaarxiv icon

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

Add code
Feb 28, 2024
Figure 1 for A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Figure 2 for A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Figure 3 for A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Figure 4 for A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
Viaarxiv icon

ChemDFM: Dialogue Foundation Model for Chemistry

Add code
Jan 26, 2024
Figure 1 for ChemDFM: Dialogue Foundation Model for Chemistry
Figure 2 for ChemDFM: Dialogue Foundation Model for Chemistry
Figure 3 for ChemDFM: Dialogue Foundation Model for Chemistry
Figure 4 for ChemDFM: Dialogue Foundation Model for Chemistry
Viaarxiv icon

On the Structural Generalization in Text-to-SQL

Add code
Jan 21, 2023
Viaarxiv icon

OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue

Add code
Sep 10, 2022
Figure 1 for OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue
Figure 2 for OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue
Figure 3 for OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue
Figure 4 for OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue
Viaarxiv icon

DialogZoo: Large-Scale Dialog-Oriented Task Learning

Add code
May 25, 2022
Figure 1 for DialogZoo: Large-Scale Dialog-Oriented Task Learning
Figure 2 for DialogZoo: Large-Scale Dialog-Oriented Task Learning
Figure 3 for DialogZoo: Large-Scale Dialog-Oriented Task Learning
Figure 4 for DialogZoo: Large-Scale Dialog-Oriented Task Learning
Viaarxiv icon