Kun Xie

A Vision-and-Knowledge Enhanced Large Language Model for Generalizable Pedestrian Crossing Behavior Inference

Jan 02, 2026
Via arXiv

PriorRG: Prior-Guided Contrastive Pre-training and Coarse-to-Fine Decoding for Chest X-ray Report Generation

Aug 07, 2025
Via arXiv

Chi-Square Wavelet Graph Neural Networks for Heterogeneous Graph Anomaly Detection

May 25, 2025
Via arXiv

FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System

Mar 26, 2025
Via arXiv

Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation

Feb 27, 2025
Via arXiv

MCL: Multi-view Enhanced Contrastive Learning for Chest X-ray Report Generation

Nov 15, 2024
Via arXiv

Drone Data Analytics for Measuring Traffic Metrics at Intersections in High-Density Areas

Nov 04, 2024
Via arXiv

FireRedTTS: A Foundation Text-To-Speech Framework for Industry-Level Generative Speech Applications

Sep 05, 2024
Via arXiv

SoCodec: A Semantic-Ordered Multi-Stream Speech Codec for Efficient Language Model Based Text-to-Speech Synthesis

Sep 02, 2024
Via arXiv

KPG: Key Propagation Graph Generator for Rumor Detection based on Reinforcement Learning

May 21, 2024
Via arXiv