Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vadim Porvatov

Sberbank, National University of Science and Technology MISIS

Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models

May 22, 2025

Menschikov Mikhail, Alexander Kharitonov, Maiia Kotyga, Vadim Porvatov, Anna Zhukovskaya, David Kagramanyan, Egor Shvetsov, Evgeny Burnaev

Abstract:Large language models exhibit positional bias -- systematic neglect of information at specific context positions -- yet its interplay with linguistic diversity remains poorly understood. We present a cross-linguistic study across five typologically distinct languages (English, Russian, German, Hindi, Vietnamese), examining how positional bias interacts with model uncertainty, syntax, and prompting. Key findings: (1) Positional bias is model-driven, with language-specific variations -- Qwen2.5-7B favors late positions, challenging assumptions of early-token bias; (2) Explicit positional guidance (e.g., correct context is at position X) reduces accuracy across languages, undermining prompt-engineering practices; (3) Aligning context with positional bias increases entropy, yet minimal entropy does not predict accuracy. (4) We further uncover that LLMs differently impose dominant word order in free-word-order languages like Hindi.

Via

Access Paper or Ask Questions

Revising deep learning methods in parking lot occupancy detection

Jun 08, 2023

Anastasia Martynova, Mikhail Kuznetsov, Vadim Porvatov, Vladislav Tishin, Andrey Kuznetsov, Natalia Semenova, Ksenia Kuznetsova

Abstract:Parking guidance systems have recently become a popular trend as a part of the smart cities' paradigm of development. The crucial part of such systems is the algorithm allowing drivers to search for available parking lots across regions of interest. The classic approach to this task is based on the application of neural network classifiers to camera records. However, existing systems demonstrate a lack of generalization ability and appropriate testing regarding specific visual conditions. In this study, we extensively evaluate state-of-the-art parking lot occupancy detection algorithms, compare their prediction quality with the recently emerged vision transformers, and propose a new pipeline based on EfficientNet architecture. Performed computational experiments have demonstrated the performance increase in the case of our model, which was evaluated on 5 different datasets.

* 15 pages, 7 figures

Via

Access Paper or Ask Questions

GCT-TTE: Graph Convolutional Transformer for Travel Time Estimation

Jun 07, 2023

Vladimir Mashurov, Vaagn Chopurian, Vadim Porvatov, Arseny Ivanov, Natalia Semenova

Abstract:This paper introduces a new transformer-based model for the problem of travel time estimation. The key feature of the proposed GCT-TTE architecture is the utilization of different data modalities capturing different properties of an input path. Along with the extensive study regarding the model configuration, we implemented and evaluated a sufficient number of actual baselines for path-aware and path-blind settings. The conducted computational experiments have confirmed the viability of our pipeline, which outperformed state-of-the-art models on both considered datasets. Additionally, GCT-TTE was deployed as a web service accessible for further experiments with user-defined routes.

Via

Access Paper or Ask Questions

Transformer-based classification of premise in tweets related to COVID-19

Sep 08, 2022

Vadim Porvatov, Natalia Semenova

Figure 1 for Transformer-based classification of premise in tweets related to COVID-19

Figure 2 for Transformer-based classification of premise in tweets related to COVID-19

Figure 3 for Transformer-based classification of premise in tweets related to COVID-19

Abstract:Automation of social network data assessment is one of the classic challenges of natural language processing. During the COVID-19 pandemic, mining people's stances from public messages have become crucial regarding understanding attitudes towards health orders. In this paper, the authors propose the predictive model based on transformer architecture to classify the presence of premise in Twitter texts. This work is completed as part of the Social Media Mining for Health (SMM4H) Workshop 2022. We explored modern transformer-based classifiers in order to construct the pipeline efficiently capturing tweets semantics. Our experiments on a Twitter dataset showed that RoBERTa is superior to the other transformer models in the case of the premise prediction task. The model achieved competitive performance with respect to ROC AUC value 0.807, and 0.7648 for the F1 score.

* Accepted at SMM4H Workshop of COLING'22

Via

Access Paper or Ask Questions

Logistics, Graphs, and Transformers: Towards improving Travel Time Estimation

Jul 12, 2022

Natalia Semenova, Vadim Porvatov, Vladislav Tishin, Artyom Sosedka, Vladislav Zamkovoy

Figure 1 for Logistics, Graphs, and Transformers: Towards improving Travel Time Estimation

Figure 2 for Logistics, Graphs, and Transformers: Towards improving Travel Time Estimation

Abstract:The problem of travel time estimation is widely considered as the fundamental challenge of modern logistics. The complex nature of interconnections between spatial aspects of roads and temporal dynamics of ground transport still preserves an area to experiment with. However, the total volume of currently accumulated data encourages the construction of the learning models which have the perspective to significantly outperform earlier solutions. In order to address the problems of travel time estimation, we propose a new method based on transformer architecture - TransTTE.

* 4 pages, 1 figure, 1 table. Accepted at PKDD'22 demonstration track

Via

Access Paper or Ask Questions

Citation network applications in a scientific co-authorship recommender system

Nov 22, 2021

Vladislav Tishin, Artyom Sosedka, Peter Ibragimov, Vadim Porvatov

Figure 1 for Citation network applications in a scientific co-authorship recommender system

Figure 2 for Citation network applications in a scientific co-authorship recommender system

Abstract:The problem of co-authors selection in the area of scientific collaborations might be a daunting one. In this paper, we propose a new pipeline that effectively utilizes citation data in the link prediction task on the co-authorship network. In particular, we explore the capabilities of a recommender system based on data aggregation strategies on different graphs. Since graph neural networks proved their efficiency on a wide range of tasks related to recommendation systems, we leverage them as a relevant method for the forecasting of potential collaborations in the scientific community.

* 7 pages

Via

Access Paper or Ask Questions

Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

Oct 08, 2021

Vadim Porvatov, Natalia Semenova, Andrey Chertok

Figure 1 for Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

Figure 2 for Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

Figure 3 for Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

Figure 4 for Hybrid Graph Embedding Techniques in Estimated Time of Arrival Task

Abstract:Recently, deep learning has achieved promising results in the calculation of Estimated Time of Arrival (ETA), which is considered as predicting the travel time from the start point to a certain place along a given path. ETA plays an essential role in intelligent taxi services or automotive navigation systems. A common practice is to use embedding vectors to represent the elements of a road network, such as road segments and crossroads. Road elements have their own attributes like length, presence of crosswalks, lanes number, etc. However, many links in the road network are traversed by too few floating cars even in large ride-hailing platforms and affected by the wide range of temporal events. As the primary goal of the research, we explore the generalization ability of different spatial embedding strategies and propose a two-stage approach to deal with such problems.

* Accepted in ICCNA 2021

Via

Access Paper or Ask Questions