Picture for Bin Zhang

Bin Zhang

DIP: Diffusion Learning of Inconsistency Pattern for General DeepFake Detection

Add code
Oct 31, 2024
Viaarxiv icon

A Novel Method to Metigate Demographic and Expert Bias in ICD Coding with Causal Inference

Add code
Oct 18, 2024
Viaarxiv icon

SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding

Add code
Oct 15, 2024
Viaarxiv icon

IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities

Add code
Oct 09, 2024
Figure 1 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 2 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 3 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Figure 4 for IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities
Viaarxiv icon

Probing Causality Manipulation of Large Language Models

Add code
Aug 26, 2024
Viaarxiv icon

QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning

Add code
Aug 20, 2024
Viaarxiv icon

Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning

Add code
Aug 18, 2024
Figure 1 for Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for Beyond Local Views: Global State Inference with Diffusion Models for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Multi-modal Evidential Fusion Network for Trusted PET/CT Tumor Segmentation

Add code
Jun 26, 2024
Viaarxiv icon

A Refined 3D Gaussian Representation for High-Quality Dynamic Scene Reconstruction

Add code
May 28, 2024
Viaarxiv icon

M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models

Add code
May 24, 2024
Viaarxiv icon