
Efstratios Gavves

From MLP to NeoMLP: Leveraging Self-Attention for Neural Fields

Dec 11, 2024
Via arXiv

Any-Resolution AI-Generated Image Detection by Spectral Learning

Nov 28, 2024
Via arXiv

CaPo: Cooperative Plan Optimization for Efficient Embodied Multi-Agent Cooperation

Nov 07, 2024
Via arXiv

Language Agents Meet Causality -- Bridging LLMs and Causal World Models

Oct 25, 2024
Via arXiv

NARAIM: Native Aspect Ratio Autoregressive Image Models

Oct 13, 2024
Via arXiv

SIGMA: Sinkhorn-Guided Masked Video Modeling

Jul 22, 2024
Via arXiv

GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features

Jul 17, 2024
Via arXiv

VISA: Reasoning Video Object Segmentation via Large Language Models

Jul 16, 2024
Via arXiv

Mining Open Semantics from CLIP: A Relation Transition Perspective for Few-Shot Learning

Jun 17, 2024
Via arXiv

Grounding Continuous Representations in Geometry: Equivariant Neural Fields

Jun 11, 2024
Via arXiv