Picture for Yiming Sun

Yiming Sun

Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

Add code
Mar 19, 2025
Viaarxiv icon

Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning

Add code
Mar 14, 2025
Viaarxiv icon

Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models

Add code
Feb 05, 2025
Figure 1 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Figure 2 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Figure 3 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Figure 4 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Viaarxiv icon

Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent

Add code
Nov 08, 2024
Viaarxiv icon

Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion

Add code
Nov 07, 2024
Figure 1 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Figure 2 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Figure 3 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Figure 4 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Viaarxiv icon

ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model

Add code
Nov 04, 2024
Figure 1 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Figure 2 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Figure 3 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Figure 4 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Viaarxiv icon

Learning Multimodal Cues of Children's Uncertainty

Add code
Oct 17, 2024
Figure 1 for Learning Multimodal Cues of Children's Uncertainty
Figure 2 for Learning Multimodal Cues of Children's Uncertainty
Figure 3 for Learning Multimodal Cues of Children's Uncertainty
Figure 4 for Learning Multimodal Cues of Children's Uncertainty
Viaarxiv icon

Transfer Learning with Clinical Concept Embeddings from Large Language Models

Add code
Sep 20, 2024
Viaarxiv icon

Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation

Add code
Jul 13, 2024
Figure 1 for Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation
Figure 2 for Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation
Figure 3 for Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation
Figure 4 for Free-form Grid Structure Form Finding based on Machine Learning and Multi-objective Optimisation
Viaarxiv icon

One-shot Active Learning Based on Lewis Weight Sampling for Multiple Deep Models

Add code
May 23, 2024
Viaarxiv icon