Haiyang Yu

UMIT: Unifying Medical Imaging Tasks via Vision-Language Models

Mar 20, 2025

EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models

Mar 06, 2025

Text2Scenario: Text-Driven Scenario Generation for Autonomous Driving Test

Mar 04, 2025

DeepSolution: Boosting Complex Engineering Solution Design via Tree-based Exploration and Bi-point Thinking

Feb 28, 2025

ChatReID: Open-ended Interactive Person Retrieval via Hierarchical Progressive Tuning for Vision Language Models

Feb 27, 2025

Learning to Discover Regulatory Elements for Gene Expression Prediction

Feb 19, 2025

AdvSwap: Covert Adversarial Perturbation with High Frequency Info-swapping for Autonomous Driving Perception

Feb 12, 2025

PreMixer: MLP-Based Pre-training Enhanced MLP-Mixers for Large-scale Traffic Forecasting

Dec 18, 2024

A Geometry-Aware Message Passing Neural Network for Modeling Aerodynamics over Airfoils

Dec 13, 2024

Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM

Dec 12, 2024