Roger Zimmermann

Improving Multimodal LLMs Ability In Geometry Problem Solving, Reasoning, And Multistep Scoring

Dec 01, 2024

Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

Oct 14, 2024

Manifold-Aware Local Feature Modeling for Semi-Supervised Medical Image Segmentation

Oct 14, 2024

Grounding is All You Need? Dual Temporal Grounding for Video Dialog

Oct 08, 2024

DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving

Jul 22, 2024

Described Spatial-Temporal Video Detection

Jul 08, 2024

Do As I Do: Pose Guided Human Motion Copy

Jun 24, 2024

PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search with Supplementary Materials

Jun 19, 2024

Predicting Parking Availability in Singapore with Cross-Domain Data: A New Dataset and A Data-Driven Approach

May 29, 2024

Prompt-Enhanced Spatio-Temporal Graph Transfer Learning

May 21, 2024