Picture for Roger Zimmermann

Roger Zimmermann

Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection

Add code
Jan 10, 2025
Viaarxiv icon

Improving Multimodal LLMs Ability In Geometry Problem Solving, Reasoning, And Multistep Scoring

Add code
Dec 01, 2024
Viaarxiv icon

Manifold-Aware Local Feature Modeling for Semi-Supervised Medical Image Segmentation

Add code
Oct 14, 2024
Viaarxiv icon

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

Add code
Oct 14, 2024
Figure 1 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Figure 2 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Figure 3 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Figure 4 for Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts
Viaarxiv icon

Grounding is All You Need? Dual Temporal Grounding for Video Dialog

Add code
Oct 08, 2024
Figure 1 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Figure 2 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Figure 3 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Figure 4 for Grounding is All You Need? Dual Temporal Grounding for Video Dialog
Viaarxiv icon

DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Add code
Jul 22, 2024
Figure 1 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Figure 2 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Figure 3 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Figure 4 for DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving
Viaarxiv icon

Described Spatial-Temporal Video Detection

Add code
Jul 08, 2024
Figure 1 for Described Spatial-Temporal Video Detection
Figure 2 for Described Spatial-Temporal Video Detection
Figure 3 for Described Spatial-Temporal Video Detection
Figure 4 for Described Spatial-Temporal Video Detection
Viaarxiv icon

Do As I Do: Pose Guided Human Motion Copy

Add code
Jun 24, 2024
Figure 1 for Do As I Do: Pose Guided Human Motion Copy
Figure 2 for Do As I Do: Pose Guided Human Motion Copy
Figure 3 for Do As I Do: Pose Guided Human Motion Copy
Figure 4 for Do As I Do: Pose Guided Human Motion Copy
Viaarxiv icon

PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials

Add code
Jun 19, 2024
Figure 1 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Figure 2 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Figure 3 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Figure 4 for PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials
Viaarxiv icon

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

Add code
May 29, 2024
Figure 1 for Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Figure 2 for Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Figure 3 for Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Figure 4 for Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach
Viaarxiv icon