Shailik Sarkar

Virginia Tech

Chasing the Timber Trail: Machine Learning to Reveal Harvest Location Misrepresentation

Feb 19, 2025

Shailik Sarkar, Raquib Bin Yousuf, Linhan Wang, Brian Mayer, Thomas Mortier, Victor Deklerck, Jakub Truszkowski, John C. Simeone, Marigold Norman, Jade Saunders(+2 more)

Abstract: Illegal logging poses a significant threat to global biodiversity and climate stability, and it depresses international prices for legal wood harvesting and the responsible forest products trade, affecting livelihoods and communities across the globe. Stable isotope ratio analysis (SIRA) is rapidly becoming an important tool for determining the harvest location of traded organic products. The spatial pattern in stable isotope ratio values depends on factors such as atmospheric and environmental conditions and can thus be used for geographical identification. We present the results of a deployed machine learning pipeline that leverages both isotope values and atmospheric variables to determine timber harvest location. The pipeline also incorporates uncertainty estimation to help analysts interpret harvest location determinations. We present our experiments on a collection of oak (Quercus spp.) tree samples from across the genus's global range. Our pipeline outperforms comparable state-of-the-art models for determining the geographic harvest origin of commercially traded wood products, and it has been used by European enforcement agencies to identify illicit Russian and Belarusian timber entering the EU market. We also identify opportunities for further advancement of our framework and discuss how it can be generalized to help identify the origin of falsely labeled organic products throughout the supply chain.

* 9 pages, 5 figures

Exposing LLM Vulnerabilities: Adversarial Scam Detection and Performance

Dec 01, 2024

Chen-Wei Chang, Shailik Sarkar, Shutonu Mitra, Qi Zhang, Hossein Salemi, Hemant Purohit, Fengxiu Zhang, Michin Hong, Jin-Hee Cho, Chang-Tien Lu

Abstract: Can we trust Large Language Models (LLMs) to accurately detect scams? This paper investigates the vulnerabilities of LLMs when they face adversarial scam messages in the scam detection task. We address this issue by creating a comprehensive dataset of scam messages with fine-grained labels, including both original and adversarial messages. The dataset extends the traditional binary classes of the scam detection task into more nuanced scam types. Our analysis shows how adversarial examples exploit the vulnerabilities of an LLM, leading to a high misclassification rate. We evaluate the performance of LLMs on these adversarial scam messages and propose strategies to improve their robustness.

* 4 pages, 2024 IEEE International Conference on Big Data workshop BigEACPS 2024

Deep diffusion-based forecasting of COVID-19 by incorporating network-level mobility information

Nov 09, 2021

Padmaksha Roy, Shailik Sarkar, Subhodip Biswas, Fanglan Chen, Zhiqian Chen, Naren Ramakrishnan, Chang-Tien Lu

Abstract: Modeling the spatiotemporal spread of infectious diseases can provide useful intuition about the time-varying aspect of disease spread and the underlying complex spatial dependency observed in people's mobility patterns. In addition, information from multiple related county-level time series can be leveraged to forecast an individual time series. Adding to this challenge, real-time data often deviate from the unimodal Gaussian distribution assumption and may show complex mixed patterns. Motivated by this, we develop a deep learning-based time-series model for probabilistic forecasting called Auto-regressive Mixed Density Dynamic Diffusion Network (ARM3Dnet), which treats both people's mobility and disease spread as a diffusion process on a dynamic directed graph. A Gaussian Mixture Model layer is used to capture the multimodal nature of the real-time data while learning from multiple related time series. We show that our model, when trained with the best combination of dynamic covariate features and mixture components, can outperform both traditional statistical and deep learning models in forecasting the number of COVID-19 deaths and cases at the county level in the United States.

* Published as conference paper at ASONAM 2021, Research Track
* 8 pages
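
As a rough illustration of why a Gaussian mixture output can suit multimodal data better than a single Gaussian, the sketch below computes the negative log-likelihood of one observation under a one-dimensional mixture. It is not the ARM3Dnet implementation: the function name `gaussian_mixture_nll` and the example parameters are hypothetical, and in a real model a network layer would produce the weights, means and standard deviations.

```python
import math


def gaussian_mixture_nll(y, weights, means, stds):
    """Negative log-likelihood of a scalar y under a 1-D Gaussian mixture.

    Hypothetical helper for illustration only. Assumes the weights are
    positive and sum to 1, and that every standard deviation is positive.
    """
    # Log of each weighted component density: log(w_k) + log N(y | m_k, s_k).
    log_terms = [
        math.log(w)
        - math.log(s)
        - 0.5 * math.log(2.0 * math.pi)
        - 0.5 * ((y - m) / s) ** 2
        for w, m, s in zip(weights, means, stds)
    ]
    # Log-sum-exp keeps the sum stable when individual densities are tiny.
    peak = max(log_terms)
    return -(peak + math.log(sum(math.exp(t - peak) for t in log_terms)))


# An observation near one of two modes (illustrative numbers, not real data).
single = gaussian_mixture_nll(3.0, [1.0], [0.0], [1.0])
mixture = gaussian_mixture_nll(3.0, [0.5, 0.5], [-3.0, 3.0], [1.0, 1.0])
print(single > mixture)  # the two-component mixture fits this point better
```

Minimizing this quantity over training observations is the usual way a mixture density output is fitted. The paper's actual loss and parameterization may differ.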
