Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nikita Andreev

Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Oct 03, 2024

Mikhail Persiianov, Arip Asadulaev, Nikita Andreev, Nikita Starodubcev, Dmitry Baranchuk, Anastasis Kratsios, Evgeny Burnaev, Alexander Korotin

Figure 1 for Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Figure 2 for Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Figure 3 for Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Figure 4 for Inverse Entropic Optimal Transport Solves Semi-supervised Learning via Data Likelihood Maximization

Abstract:Learning conditional distributions $\pi^*(\cdot|x)$ is a central problem in machine learning, which is typically approached via supervised methods with paired data $(x,y) \sim \pi^*$. However, acquiring paired data samples is often challenging, especially in problems such as domain translation. This necessitates the development of $\textit{semi-supervised}$ models that utilize both limited paired data and additional unpaired i.i.d. samples $x \sim \pi^*_x$ and $y \sim \pi^*_y$ from the marginal distributions. The usage of such combined data is complex and often relies on heuristic approaches. To tackle this issue, we propose a new learning paradigm that integrates both paired and unpaired data $\textbf{seamlessly}$ through the data likelihood maximization techniques. We demonstrate that our approach also connects intriguingly with inverse entropic optimal transport (OT). This finding allows us to apply recent advances in computational OT to establish a $\textbf{light}$ learning algorithm to get $\pi^*(\cdot|x)$. Furthermore, we demonstrate through empirical tests that our method effectively learns conditional distributions using paired and unpaired data simultaneously.

Via

Access Paper or Ask Questions

Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers

Jul 24, 2024

Nikita Andreev, Alexander Shirnin, Vladislav Mikhailov, Ekaterina Artemova

Figure 1 for Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers

Figure 2 for Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers

Figure 3 for Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers

Figure 4 for Papilusion at DAGPap24: Paper or Illusion? Detecting AI-generated Scientific Papers

Abstract:This paper presents Papilusion, an AI-generated scientific text detector developed within the DAGPap24 shared task on detecting automatically generated scientific papers. We propose an ensemble-based approach and conduct ablation studies to analyze the effect of the detector configurations on the performance. Papilusion is ranked 6th on the leaderboard, and we improve our performance after the competition ended, achieving 99.46 (+9.63) of the F1-score on the official test set.

* to appear in DAGPAP 2024 proceedings

Via

Access Paper or Ask Questions

AIpom at SemEval-2024 Task 8: Detecting AI-produced Outputs in M4

Mar 28, 2024

Alexander Shirnin, Nikita Andreev, Vladislav Mikhailov, Ekaterina Artemova

Abstract:This paper describes AIpom, a system designed to detect a boundary between human-written and machine-generated text (SemEval-2024 Task 8, Subtask C: Human-Machine Mixed Text Detection). We propose a two-stage pipeline combining predictions from an instruction-tuned decoder-only model and encoder-only sequence taggers. AIpom is ranked second on the leaderboard while achieving a Mean Absolute Error of 15.94. Ablation studies confirm the benefits of pipelining encoder and decoder models, particularly in terms of improved performance.

* 2nd place at SemEval-2024 Task 8, Subtask C, to appear in SemEval-2024 proceedings

Via

Access Paper or Ask Questions