Picture for Feifei Li

Feifei Li

VERTIGO: Visual Preference Optimization for Cinematic Camera Trajectory Generation

Add code
Apr 02, 2026
Viaarxiv icon

SafeRoPE: Risk-specific Head-wise Embedding Rotation for Safe Generation in Rectified Flow Transformers

Add code
Apr 02, 2026
Viaarxiv icon

SmartSight: Mitigating Hallucination in Video-LLMs Without Compromising Video Understanding via Temporal Attention Collapse

Add code
Dec 21, 2025
Viaarxiv icon

InfoCons: Identifying Interpretable Critical Concepts in Point Clouds via Information Theory

Add code
May 26, 2025
Viaarxiv icon

Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

Add code
Mar 19, 2025
Figure 1 for Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Figure 2 for Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Figure 3 for Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Figure 4 for Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization
Viaarxiv icon

PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression

Add code
Feb 07, 2025
Viaarxiv icon

Efficient MedSAMs: Segment Anything in Medical Images on Laptop

Add code
Dec 20, 2024
Figure 1 for Efficient MedSAMs: Segment Anything in Medical Images on Laptop
Figure 2 for Efficient MedSAMs: Segment Anything in Medical Images on Laptop
Figure 3 for Efficient MedSAMs: Segment Anything in Medical Images on Laptop
Viaarxiv icon

A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model

Add code
Nov 07, 2024
Figure 1 for A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
Figure 2 for A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
Figure 3 for A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
Figure 4 for A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model
Viaarxiv icon

CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation

Add code
Nov 01, 2024
Figure 1 for CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation
Figure 2 for CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation
Figure 3 for CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation
Figure 4 for CORAG: A Cost-Constrained Retrieval Optimization System for Retrieval-Augmented Generation
Viaarxiv icon

ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object

Add code
Oct 14, 2024
Figure 1 for ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object
Figure 2 for ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object
Figure 3 for ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object
Figure 4 for ROA-BEV: 2D Region-Oriented Attention for BEV-based 3D Object
Viaarxiv icon