Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Shlok Gilda

AI-Assisted Engineering Should Track the Epistemic Status and Temporal Validity of Architectural Decisions

Jan 28, 2026

Sankalp Gilda, Shlok Gilda

Abstract:This position paper argues that AI-assisted software engineering requires explicit mechanisms for tracking the epistemic status and temporal validity of architectural decisions. LLM coding assistants generate decisions faster than teams can validate them, yet no widely-adopted framework distinguishes conjecture from verified knowledge, prevents trust inflation through conservative aggregation, or detects when evidence expires. We propose three requirements for responsible AI-assisted engineering: (1) epistemic layers that separate unverified hypotheses from empirically validated claims, (2) conservative assurance aggregation grounded in the Gödel t-norm that prevents weak evidence from inflating confidence, and (3) automated evidence decay tracking that surfaces stale assumptions before they cause failures. We formalize these requirements as the First Principles Framework (FPF), ground its aggregation semantics in fuzzy logic, and define a quintet of invariants that any valid aggregation operator must satisfy. Our retrospective audit applying FPF criteria to two internal projects found that 20-25% of architectural decisions had stale evidence within two months, validating the need for temporal accountability. We outline research directions including learnable aggregation operators, federated evidence sharing, and SMT-based claim validation.

* 18 pages, 6 figures, 13 tables. Position paper

Via

Access Paper or Ask Questions

Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

Jun 07, 2021

Shlok Gilda, Mirela Silva, Luiz Giovanini, Daniela Oliveira

Figure 1 for Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

Figure 2 for Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

Figure 3 for Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

Figure 4 for Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

Abstract:This paper investigates the use of machine learning models for the classification of unhealthy online conversations containing one or more forms of subtler abuse, such as hostility, sarcasm, and generalization. We leveraged a public dataset of 44K online comments containing healthy and unhealthy comments labeled with seven forms of subtle toxicity. We were able to distinguish between these comments with a top micro F1-score, macro F1-score, and ROC-AUC of 88.76%, 67.98%, and 0.71, respectively. Hostile comments were easier to detect than other types of unhealthy comments. We also conducted a sentiment analysis which revealed that most types of unhealthy comments were associated with a slight negative sentiment, with hostile comments being the most negative ones.

Via

Access Paper or Ask Questions