Picture for Yang Zhou

Yang Zhou

Yahoo! Labs

RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment

Add code
Jan 13, 2025
Viaarxiv icon

SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving

Add code
Jan 08, 2025
Viaarxiv icon

Free-viewpoint Human Animation with Pose-correlated Reference Selection

Add code
Dec 23, 2024
Viaarxiv icon

DMesh++: An Efficient Differentiable Mesh for Complex Shapes

Add code
Dec 21, 2024
Viaarxiv icon

OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving

Add code
Dec 19, 2024
Viaarxiv icon

AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving

Add code
Dec 19, 2024
Figure 1 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Figure 2 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Figure 3 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Figure 4 for AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving
Viaarxiv icon

MotionBridge: Dynamic Video Inbetweening with Flexible Controls

Add code
Dec 17, 2024
Viaarxiv icon

ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries

Add code
Dec 17, 2024
Viaarxiv icon

Move-in-2D: 2D-Conditioned Human Motion Generation

Add code
Dec 17, 2024
Figure 1 for Move-in-2D: 2D-Conditioned Human Motion Generation
Figure 2 for Move-in-2D: 2D-Conditioned Human Motion Generation
Figure 3 for Move-in-2D: 2D-Conditioned Human Motion Generation
Figure 4 for Move-in-2D: 2D-Conditioned Human Motion Generation
Viaarxiv icon

VividFace: A Diffusion-Based Hybrid Framework for High-Fidelity Video Face Swapping

Add code
Dec 15, 2024
Viaarxiv icon