Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Łukasz Garncarek

Arctic-TILT. Business Document Understanding at Sub-Billion Scale

Aug 08, 2024

Łukasz Borchmann, Michał Pietruszka, Wojciech Jaśkowski, Dawid Jurkiewicz, Piotr Halama, Paweł Józiak, Łukasz Garncarek, Paweł Liskowski, Karolina Szyndler, Andrzej Gretkowski(+6 more)

Abstract:The vast portion of workloads employing LLMs involves answering questions grounded on PDF or scan content. We introduce the Arctic-TILT achieving accuracy on par with models 1000$\times$ its size on these use cases. It can be fine-tuned and deployed on a single 24GB GPU, lowering operational costs while processing Visually Rich Documents with up to 400k tokens. The model establishes state-of-the-art results on seven diverse Document Understanding benchmarks, as well as provides reliable confidence scores and quick inference, which are essential for processing files in large-scale or time-sensitive enterprise environments.

Via

Access Paper or Ask Questions

A Wiener process perspective on local intrinsic dimension estimation methods

Jun 24, 2024

Piotr Tempczyk, Łukasz Garncarek, Dominik Filipiak, Adam Kurpisz

Figure 1 for A Wiener process perspective on local intrinsic dimension estimation methods

Abstract:Local intrinsic dimension (LID) estimation methods have received a lot of attention in recent years thanks to the progress in deep neural networks and generative modeling. In opposition to old non-parametric methods, new methods use generative models to approximate diffused dataset density and scale the methods to high-dimensional datasets like images. In this paper, we investigate the recent state-of-the-art parametric LID estimation methods from the perspective of the Wiener process. We explore how these methods behave when their assumptions are not met. We give an extended mathematical description of those methods and their error as a function of the probability density of the data.

Via

Access Paper or Ask Questions

LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

Jun 29, 2022

Piotr Tempczyk, Rafał Michaluk, Łukasz Garncarek, Przemysław Spurek, Jacek Tabor, Adam Goliński

Figure 1 for LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

Figure 2 for LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

Figure 3 for LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

Figure 4 for LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

Abstract:Most of the existing methods for estimating the local intrinsic dimension of a data distribution do not scale well to high-dimensional data. Many of them rely on a non-parametric nearest neighbors approach which suffers from the curse of dimensionality. We attempt to address that challenge by proposing a novel approach to the problem: Local Intrinsic Dimension estimation using approximate Likelihood (LIDL). Our method relies on an arbitrary density estimation method as its subroutine and hence tries to sidestep the dimensionality challenge by making use of the recent progress in parametric neural methods for likelihood estimation. We carefully investigate the empirical properties of the proposed method, compare them with our theoretical predictions, and show that LIDL yields competitive results on the standard benchmarks for this problem and that it scales to thousands of dimensions. What is more, we anticipate this approach to improve further with the continuing advances in the density estimation literature.

* ICML 2022

Via

Access Paper or Ask Questions

STable: Table Generation Framework for Encoder-Decoder Models

Jun 08, 2022

Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

Figure 1 for STable: Table Generation Framework for Encoder-Decoder Models

Figure 2 for STable: Table Generation Framework for Encoder-Decoder Models

Figure 3 for STable: Table Generation Framework for Encoder-Decoder Models

Figure 4 for STable: Table Generation Framework for Encoder-Decoder Models

Abstract:The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for text-to-table neural models applicable to problems such as extraction of line items, joint entity and relation extraction, or knowledge base population. The permutation-based decoder of our proposal is a generalized sequential method that comprehends information from all cells in the table. The training maximizes the expected log-likelihood for a table's content across all random permutations of the factorization order. During the content inference, we exploit the model's ability to generate cells in any order by searching over possible orderings to maximize the model's confidence and avoid substantial error accumulation, which other sequential models are prone to. Experiments demonstrate a high practical value of the framework, which establishes state-of-the-art results on several challenging datasets, outperforming previous solutions by up to 15%.

Via

Access Paper or Ask Questions

LAMBERT: Layout-Aware language Modeling using BERT for information extraction

Mar 06, 2020

Łukasz Garncarek, Rafał Powalski, Tomasz Stanisławek, Bartosz Topolski, Piotr Halama, Filip Graliński

Figure 1 for LAMBERT: Layout-Aware language Modeling using BERT for information extraction

Figure 2 for LAMBERT: Layout-Aware language Modeling using BERT for information extraction

Figure 3 for LAMBERT: Layout-Aware language Modeling using BERT for information extraction

Figure 4 for LAMBERT: Layout-Aware language Modeling using BERT for information extraction

Abstract:In this paper we introduce a novel approach to the problem of understanding documents where the local semantics is influenced by non-trivial layout. Namely, we modify the Transformer architecture in a way that allows it to use the graphical features defined by the layout, without the need to re-learn the language semantics from scratch, thanks to starting the training process from a model pretrained on classical language modeling tasks.

* v1: 9 pages; work in progress; this version of the paper was submitted to review on Dec 10, 2019, and subsequently withdrawn on Feb 17, 2020 v2: 17 pages

Via

Access Paper or Ask Questions