Picture for Roger Zimmermann

Roger Zimmermann

TAIL: Text-Audio Incremental Learning

Add code
Mar 06, 2025
Viaarxiv icon

Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection

Add code
Jan 10, 2025
Viaarxiv icon

Improving Multimodal LLMs Ability In Geometry Problem Solving, Reasoning, And Multistep Scoring

Add code
Dec 01, 2024
Viaarxiv icon

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

Add code
Oct 14, 2024
Figure 1 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Figure 2 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Figure 3 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Figure 4 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Viaarxiv icon

Manifold-Aware Local Feature Modeling for Semi-Supervised Medical Image Segmentation

Add code
Oct 14, 2024
Viaarxiv icon

Grounding is All You Need? Dual Temporal Grounding for Video Dialog

Add code
Oct 08, 2024
Figure 1 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Figure 2 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Figure 3 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Figure 4 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Viaarxiv icon

DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Add code
Jul 22, 2024
Figure 1 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Figure 2 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Figure 3 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Figure 4 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Viaarxiv icon

Described Spatial-Temporal Video Detection

Add code
Jul 08, 2024
Figure 1 for Described Spatial-Temporal Video Detection
Figure 2 for Described Spatial-Temporal Video Detection
Figure 3 for Described Spatial-Temporal Video Detection
Figure 4 for Described Spatial-Temporal Video Detection
Viaarxiv icon

Do As I Do: Pose Guided Human Motion Copy

Add code
Jun 24, 2024
Figure 1 for Do As I Do: Pose Guided Human Motion Copy
Figure 2 for Do As I Do: Pose Guided Human Motion Copy
Figure 3 for Do As I Do: Pose Guided Human Motion Copy
Figure 4 for Do As I Do: Pose Guided Human Motion Copy
Viaarxiv icon

PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials

Add code
Jun 19, 2024
Figure 1 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Figure 2 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Figure 3 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Figure 4 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Viaarxiv icon