Picture for Jielin Qiu

Jielin Qiu

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Add code
Jun 06, 2024
Viaarxiv icon

Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition

Add code
Mar 19, 2024
Viaarxiv icon

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Add code
Mar 07, 2024
Figure 1 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 2 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 3 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 4 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Viaarxiv icon

Offline Reinforcement Learning with Imbalanced Datasets

Add code
Jul 29, 2023
Viaarxiv icon

Embodied Executable Policy Learning with Language-based Scene Summarization

Add code
Jun 09, 2023
Viaarxiv icon

MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

Add code
Jun 07, 2023
Viaarxiv icon

Multimodal Representation Learning of Cardiovascular Magnetic Resonance Imaging

Add code
Apr 16, 2023
Viaarxiv icon

Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding

Add code
Apr 13, 2023
Figure 1 for Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding
Figure 2 for Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding
Figure 3 for Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding
Figure 4 for Converting ECG Signals to Images for Efficient Image-text Retrieval via Encoding
Viaarxiv icon

Align and Attend: Multimodal Summarization with Dual Contrastive Losses

Add code
Mar 13, 2023
Viaarxiv icon

Interpolation for Robust Learning: Data Augmentation on Geodesics

Add code
Feb 07, 2023
Viaarxiv icon