Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mahesh Bhosale

ChartReformer: Natural Language-Driven Chart Image Editing

Mar 01, 2024

Pengyu Yan, Mahesh Bhosale, Jay Lal, Bikhyat Adhikari, David Doermann

Figure 1 for ChartReformer: Natural Language-Driven Chart Image Editing

Figure 2 for ChartReformer: Natural Language-Driven Chart Image Editing

Figure 3 for ChartReformer: Natural Language-Driven Chart Image Editing

Figure 4 for ChartReformer: Natural Language-Driven Chart Image Editing

Abstract:Chart visualizations are essential for data interpretation and communication; however, most charts are only accessible in image format and lack the corresponding data tables and supplementary information, making it difficult to alter their appearance for different application scenarios. To eliminate the need for original underlying data and information to perform chart editing, we propose ChartReformer, a natural language-driven chart image editing solution that directly edits the charts from the input images with the given instruction prompts. The key in this method is that we allow the model to comprehend the chart and reason over the prompt to generate the corresponding underlying data table and visual attributes for new charts, enabling precise edits. Additionally, to generalize ChartReformer, we define and standardize various types of chart editing, covering style, layout, format, and data-centric edits. The experiments show promising results for the natural language-driven chart image editing.

Via

Access Paper or Ask Questions

Player Re-Identification Using Body Part Appearences

Oct 23, 2023

Mahesh Bhosale, Abhishek Kumar, David Doermann

Abstract:We propose a neural network architecture that learns body part appearances for soccer player re-identification. Our model consists of a two-stream network (one stream for appearance map extraction and the other for body part map extraction) and a bilinear-pooling layer that generates and spatially pools the body part map. Each local feature of the body part map is obtained by a bilinear mapping of the corresponding local appearance and body part descriptors. Our novel representation yields a robust image-matching feature map, which results from combining the local similarities of the relevant body parts with the weighted appearance similarity. Our model does not require any part annotation on the SoccerNet-V3 re-identification dataset to train the network. Instead, we use a sub-network of an existing pose estimation network (OpenPose) to initialize the part substream and then train the entire network to minimize the triplet loss. The appearance stream is pre-trained on the ImageNet dataset, and the part stream is trained from scratch for the SoccerNet-V3 dataset. We demonstrate the validity of our model by showing that it outperforms state-of-the-art models such as OsNet and InceptionNet.

Via

Access Paper or Ask Questions

LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation

May 03, 2023

Jay Lal, Aditya Mitkari, Mahesh Bhosale, David Doermann

Figure 1 for LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation

Figure 2 for LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation

Figure 3 for LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation

Figure 4 for LineFormer: Rethinking Line Chart Data Extraction as Instance Segmentation

Abstract:Data extraction from line-chart images is an essential component of the automated document understanding process, as line charts are a ubiquitous data visualization format. However, the amount of visual and structural variations in multi-line graphs makes them particularly challenging for automated parsing. Existing works, however, are not robust to all these variations, either taking an all-chart unified approach or relying on auxiliary information such as legends for line data extraction. In this work, we propose LineFormer, a robust approach to line data extraction using instance segmentation. We achieve state-of-the-art performance on several benchmark synthetic and real chart datasets. Our implementation is available at https://github.com/TheJaeLal/LineFormer .

* Accepted to ICDAR 2023

Via

Access Paper or Ask Questions