Manan Suri

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Dec 14, 2024

x-RAGE: eXtended Reality -- Action & Gesture Events Dataset

Oct 25, 2024

DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding

Oct 21, 2024

Non-Invasive Qualitative Vibration Analysis using Event Camera

Oct 18, 2024

A Survey of Graph and Attention Based Hyperspectral Image Classification Methods for Remote Sensing Data

Oct 16, 2023

ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER

Jun 01, 2023

The Geometry of Multilingual Language Models: An Equality Lens

May 13, 2023

CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network

Mar 10, 2023

WADER at SemEval-2023 Task 9: A Weak-labelling framework for Data augmentation in tExt Regression Tasks

Mar 05, 2023

A novel multimodal dynamic fusion network for disfluency detection in spoken utterances

Nov 27, 2022