Picture for Dong Zhang

Dong Zhang

Parameter-efficient Fine-tuning for improved Convolutional Baseline for Brain Tumor Segmentation in Sub-Saharan Africa Adult Glioma Dataset

Add code
Dec 18, 2024
Viaarxiv icon

Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection

Add code
Dec 02, 2024
Viaarxiv icon

MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

Add code
Nov 05, 2024
Figure 1 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Figure 2 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Figure 3 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Figure 4 for MME-Finance: A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
Viaarxiv icon

A Survey on Bundle Recommendation: Methods, Applications, and Challenges

Add code
Nov 01, 2024
Figure 1 for A Survey on Bundle Recommendation: Methods, Applications, and Challenges
Figure 2 for A Survey on Bundle Recommendation: Methods, Applications, and Challenges
Figure 3 for A Survey on Bundle Recommendation: Methods, Applications, and Challenges
Figure 4 for A Survey on Bundle Recommendation: Methods, Applications, and Challenges
Viaarxiv icon

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Add code
Oct 31, 2024
Figure 1 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Figure 2 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Figure 3 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Figure 4 for BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
Viaarxiv icon

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

Add code
Oct 18, 2024
Figure 1 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 2 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 3 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Figure 4 for MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
Viaarxiv icon

IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities

Add code
Oct 09, 2024
Figure 1 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 2 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 3 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 4 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Viaarxiv icon

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

Add code
Sep 24, 2024
Viaarxiv icon

Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation

Add code
Sep 05, 2024
Figure 1 for Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation
Figure 2 for Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation
Figure 3 for Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation
Figure 4 for Labeled-to-Unlabeled Distribution Alignment for Partially-Supervised Multi-Organ Medical Image Segmentation
Viaarxiv icon

Aligning Medical Images with General Knowledge from Large Language Models

Add code
Aug 31, 2024
Figure 1 for Aligning Medical Images with General Knowledge from Large Language Models
Figure 2 for Aligning Medical Images with General Knowledge from Large Language Models
Figure 3 for Aligning Medical Images with General Knowledge from Large Language Models
Figure 4 for Aligning Medical Images with General Knowledge from Large Language Models
Viaarxiv icon