Picture for Philipp Harzig

Philipp Harzig

Global Average Feature Augmentation for Robust Semantic Segmentation with Transformers

Add code
Dec 02, 2024
Viaarxiv icon

Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg

Add code
Dec 28, 2021
Figure 1 for Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg
Figure 2 for Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg
Figure 3 for Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg
Figure 4 for Extended Self-Critical Pipeline for Transforming Videos to Text (TRECVID-VTT Task 2021) -- Team: MMCUniAugsburg
Viaarxiv icon

Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation

Add code
Dec 28, 2021
Figure 1 for Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
Figure 2 for Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
Figure 3 for Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
Figure 4 for Synchronized Audio-Visual Frames with Fractional Positional Encoding for Transformers in Video-to-Text Translation
Viaarxiv icon

Addressing Data Bias Problems for Chest X-ray Image Report Generation

Add code
Aug 06, 2019
Figure 1 for Addressing Data Bias Problems for Chest X-ray Image Report Generation
Figure 2 for Addressing Data Bias Problems for Chest X-ray Image Report Generation
Figure 3 for Addressing Data Bias Problems for Chest X-ray Image Report Generation
Figure 4 for Addressing Data Bias Problems for Chest X-ray Image Report Generation
Viaarxiv icon

Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing

Add code
May 06, 2019
Figure 1 for Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Figure 2 for Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Figure 3 for Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Figure 4 for Image Captioning with Clause-Focused Metrics in a Multi-Modal Setting for Marketing
Viaarxiv icon

Multimodal Image Captioning for Marketing Analysis

Add code
Feb 06, 2018
Figure 1 for Multimodal Image Captioning for Marketing Analysis
Figure 2 for Multimodal Image Captioning for Marketing Analysis
Figure 3 for Multimodal Image Captioning for Marketing Analysis
Viaarxiv icon