Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Karen Rafferty

SynFinTabs: A Dataset of Synthetic Financial Tables for Information and Table Extraction

Dec 05, 2024

Ethan Bradley, Muhammad Roman, Karen Rafferty, Barry Devereux

Abstract:Table extraction from document images is a challenging AI problem, and labelled data for many content domains is difficult to come by. Existing table extraction datasets often focus on scientific tables due to the vast amount of academic articles that are readily available, along with their source code. However, there are significant layout and typographical differences between tables found across scientific, financial, and other domains. Current datasets often lack the words, and their positions, contained within the tables, instead relying on unreliable OCR to extract these features for training modern machine learning models on natural language processing tasks. Therefore, there is a need for a more general method of obtaining labelled data. We present SynFinTabs, a large-scale, labelled dataset of synthetic financial tables. Our hope is that our method of generating these synthetic tables is transferable to other domains. To demonstrate the effectiveness of our dataset in training models to extract information from table images, we create FinTabQA, a layout large language model trained on an extractive question-answering task. We test our model using real-world financial tables and compare it to a state-of-the-art generative model and discuss the results. We make the dataset, model, and dataset generation code publicly available.

* 12 pages, 8 figures

Via

Access Paper or Ask Questions

Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach

Nov 15, 2024

Abdullah Abdullah, Fannya Ratana Sandjaja, Ayesha Abdul Majeed, Gyan Wickremasinghe, Karen Rafferty, Vishal Sharma

Figure 1 for Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach

Figure 2 for Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach

Figure 3 for Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach

Figure 4 for Uncertainty in Supply Chain Digital Twins: A Quantum-Classical Hybrid Approach

Abstract:This study investigates uncertainty quantification (UQ) using quantum-classical hybrid machine learning (ML) models for applications in complex and dynamic fields, such as attaining resiliency in supply chain digital twins and financial risk assessment. Although quantum feature transformations have been integrated into ML models for complex data tasks, a gap exists in determining their impact on UQ within their hybrid architectures (quantum-classical approach). This work applies existing UQ techniques for different models within a hybrid framework, examining how quantum feature transformation affects uncertainty propagation. Increasing qubits from 4 to 16 shows varied model responsiveness to outlier detection (OD) samples, which is a critical factor for resilient decision-making in dynamic environments. This work shows how quantum computing techniques can transform data features for UQ, particularly when combined with traditional methods.

Via

Access Paper or Ask Questions

Representing Camera Response Function by a Single Latent Variable and Fully Connected Neural Network

Sep 08, 2022

Yunfeng Zhao, Stuart Ferguson, Huiyu Zhou, Karen Rafferty

Figure 1 for Representing Camera Response Function by a Single Latent Variable and Fully Connected Neural Network

Figure 2 for Representing Camera Response Function by a Single Latent Variable and Fully Connected Neural Network

Figure 3 for Representing Camera Response Function by a Single Latent Variable and Fully Connected Neural Network

Figure 4 for Representing Camera Response Function by a Single Latent Variable and Fully Connected Neural Network

Abstract:Modelling the mapping from scene irradiance to image intensity is essential for many computer vision tasks. Such mapping is known as the camera response. Most digital cameras use a nonlinear function to map irradiance, as measured by the sensor to an image intensity used to record the photograph. Modelling of the response is necessary for the nonlinear calibration. In this paper, a new high-performance camera response model that uses a single latent variable and fully connected neural network is proposed. The model is produced using unsupervised learning with an autoencoder on real-world (example) camera responses. Neural architecture searching is then used to find the optimal neural network architecture. A latent distribution learning approach was introduced to constrain the latent distribution. The proposed model achieved state-of-the-art CRF representation accuracy in a number of benchmark tests, but is almost twice as fast as the best current models when performing the maximum likelihood estimation during camera response calibration due to the simple yet efficient model representation.

* 11 pages, 4 figures, 3 tables. Accepted by Signal, Image and Video Processing

Via

Access Paper or Ask Questions

Colour alignment for relative colour constancy via non-standard references

Dec 30, 2021

Yunfeng Zhao, Stuart Ferguson, Huiyu Zhou, Chris Elliott, Karen Rafferty

Figure 1 for Colour alignment for relative colour constancy via non-standard references

Figure 2 for Colour alignment for relative colour constancy via non-standard references

Figure 3 for Colour alignment for relative colour constancy via non-standard references

Figure 4 for Colour alignment for relative colour constancy via non-standard references

Abstract:Relative colour constancy is an essential requirement for many scientific imaging applications. However, most digital cameras differ in their image formations and native sensor output is usually inaccessible, e.g., in smartphone camera applications. This makes it hard to achieve consistent colour assessment across a range of devices, and that undermines the performance of computer vision algorithms. To resolve this issue, we propose a colour alignment model that considers the camera image formation as a black-box and formulates colour alignment as a three-step process: camera response calibration, response linearisation, and colour matching. The proposed model works with non-standard colour references, i.e., colour patches without knowing the true colour values, by utilising a novel balance-of-linear-distances feature. It is equivalent to determining the camera parameters through an unsupervised process. It also works with a minimum number of corresponding colour patches across the images to be colour aligned to deliver the applicable processing. Two challenging image datasets collected by multiple cameras under various illumination and exposure conditions were used to evaluate the model. Performance benchmarks demonstrated that our model achieved superior performance compared to other popular and state-of-the-art methods.

* 13 pages, 10 figures, 2 tables

Via

Access Paper or Ask Questions