Zhixi Cai

NEUSIS: A Compositional Neuro-Symbolic Framework for Autonomous Perception, Reasoning, and Planning in Complex UAV Search Missions

Sep 16, 2024

MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing

Sep 11, 2024

1M-Deepfakes Detection Challenge

Sep 11, 2024

JRDB-Social: A Multifaceted Robotic Dataset for Understanding of Context and Dynamics of Human Interactions Within Social Groups

Apr 06, 2024

HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning

Mar 19, 2024

AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Nov 26, 2023

Pavlok-Nudge: A Feedback Mechanism for Atomic Behaviour Modification with Snoring Usecase

May 11, 2023

"Glitch in the Matrix!": A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization

May 05, 2023

MARLIN: Masked Autoencoder for facial video Representation LearnINg

Nov 12, 2022

Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization

Apr 13, 2022