Sangdo Han

CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds

Jan 07, 2025

Keonwoo Kim, Yeongjae Cho, Taebaek Hwang, Minsoo Jo, Sangdo Han

Abstract: Recent research has demonstrated that Large Language Models (LLMs) are not limited to text-only tasks but can also function as multimodal models across various modalities, including audio, images, and videos. In particular, research on 3D Large Multimodal Models (3D LMMs) is making notable strides, driven by the potential of processing higher-dimensional data like point clouds. However, upon closer examination, we find that the visual and textual content within each sample of existing training datasets lacks both high informational granularity and clarity, which serves as a bottleneck for precise cross-modal understanding. To address these issues, we propose CL3DOR, Contrastive Learning for 3D large multimodal models via Odds ratio on high-Resolution point clouds, designed to ensure greater specificity and clarity in both visual and textual content. Specifically, we increase the density of point clouds per object and construct informative hard negative responses in the training dataset to penalize unwanted responses. To leverage hard negative responses, we incorporate the odds ratio as an auxiliary term for contrastive learning into the conventional language modeling loss. CL3DOR achieves state-of-the-art performance in 3D scene understanding and reasoning benchmarks. Additionally, we demonstrate the effectiveness of CL3DOR's key components through extensive experiments.
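The abstract does not give the loss formula, so the following is only a minimal sketch of how an odds-ratio term might be added to a language modeling loss. It assumes an ORPO-style formulation, where the auxiliary term is the negative log-sigmoid of the log odds ratio between the preferred response and the hard negative response. The function names, the use of length-averaged log-probabilities, and the `weight` value are illustrative assumptions, not details taken from the paper.

```python
import math


def log_odds(avg_log_prob: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_log_prob).

    avg_log_prob is assumed to be a length-averaged token log-probability,
    so it must be strictly negative (p < 1).
    """
    p = math.exp(avg_log_prob)
    return avg_log_prob - math.log1p(-p)


def odds_ratio_loss(pos_log_prob: float, neg_log_prob: float) -> float:
    """Assumed auxiliary term: -log(sigmoid(log-odds(pos) - log-odds(neg)))."""
    diff = log_odds(pos_log_prob) - log_odds(neg_log_prob)
    # Numerically stable softplus(-diff), which equals -log(sigmoid(diff)).
    return max(-diff, 0.0) + math.log1p(math.exp(-abs(diff)))


def combined_loss(pos_log_prob: float, neg_log_prob: float,
                  weight: float = 0.1) -> float:
    """Language modeling loss on the preferred response plus the weighted
    odds-ratio term. `weight` is a hypothetical hyperparameter."""
    nll = -pos_log_prob
    return nll + weight * odds_ratio_loss(pos_log_prob, neg_log_prob)


if __name__ == "__main__":
    # Preferred response is more likely than the hard negative: small penalty.
    print(combined_loss(math.log(0.6), math.log(0.2)))
    # Hard negative is more likely than the preferred response: larger penalty.
    print(combined_loss(math.log(0.2), math.log(0.6)))
```

Under these assumptions, the auxiliary term shrinks as the model assigns higher odds to the preferred response than to the hard negative, which is one way to read "penalize unwanted responses" in the abstract.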

Cross-lingual Transfer Learning for Fake News Detector in a Low-Resource Language

Aug 26, 2022

Sangdo Han

Abstract: Development of methods to detect fake news (FN) in low-resource languages has been impeded by a lack of training data. In this study, we address the problem by using only training data from a high-resource language. Our FN-detection system enables this strategy by applying adversarial learning that transfers detection knowledge across languages. To assist the knowledge transfer, our system judges the reliability of articles by exploiting source information, which is a cross-lingual feature that represents the credibility of the speaker. In experiments, our system achieved 3.71% higher accuracy than a system that uses a machine-translated training dataset. In addition, our suggested cross-lingual feature exploitation for fake news detection improved accuracy by 3.03%.

* I withdrew this paper from a journal during revision, for two reasons. First, the data was not sufficiently verified; additional data verification steps are required. Second, although the average accuracy was higher than the baseline, the results were not stable enough. However, I think the news embedding truly represents the credibility of the speakers. I hope that the knowledge I gained will help other researchers.
