Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kavana Venkatesh

PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling

Feb 05, 2026

Kavana Venkatesh, Yinhan He, Jundong Li, Jiaming Cui

Abstract:Large language model (LLM)-based multi-agent systems enable expressive agent reasoning but are expensive to scale and poorly calibrated for timestep-aligned state-transition simulation, while classical agent-based models (ABMs) offer interpretability but struggle to integrate rich individual-level signals and non-stationary behaviors. We propose PhysicsAgentABM, which shifts inference to behaviorally coherent agent clusters: state-specialized symbolic agents encode mechanistic transition priors, a multimodal neural transition model captures temporal and interaction dynamics, and uncertainty-aware epistemic fusion yields calibrated cluster-level transition distributions. Individual agents then stochastically realize transitions under local constraints, decoupling population inference from entity-level variability. We further introduce ANCHOR, an LLM agent-driven clustering strategy based on cross-contextual behavioral responses and a novel contrastive loss, reducing LLM calls by up to 6-8 times. Experiments across public health, finance, and social sciences show consistent gains in event-time accuracy and calibration over mechanistic, neural, and LLM baselines. By re-architecting generative ABM around population-level inference with uncertainty-aware neuro-symbolic fusion, PhysicsAgentABM establishes a new paradigm for scalable and calibrated simulation with LLMs.

Via

Access Paper or Ask Questions

Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization

Nov 06, 2025

Connor Dunlop, Matthew Zheng, Kavana Venkatesh, Pinar Yanardag

Abstract:Text-to-image (T2I) diffusion models have made remarkable strides in generating and editing high-fidelity images from text. Yet, these models remain fundamentally generic, failing to adapt to the nuanced aesthetic preferences of individual users. In this work, we present the first framework for personalized image editing in diffusion models, introducing Collaborative Direct Preference Optimization (C-DPO), a novel method that aligns image edits with user-specific preferences while leveraging collaborative signals from like-minded individuals. Our approach encodes each user as a node in a dynamic preference graph and learns embeddings via a lightweight graph neural network, enabling information sharing across users with overlapping visual tastes. We enhance a diffusion model's editing capabilities by integrating these personalized embeddings into a novel DPO objective, which jointly optimizes for individual alignment and neighborhood coherence. Comprehensive experiments, including user studies and quantitative benchmarks, demonstrate that our method consistently outperforms baselines in generating edits that are aligned with user preferences.

* Published at NeurIPS'25 Main Conference

Via

Access Paper or Ask Questions

CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models

Apr 07, 2025

Kavana Venkatesh, Connor Dunlop, Pinar Yanardag

Abstract:Creativity in AI imagery remains a fundamental challenge, requiring not only the generation of visually compelling content but also the capacity to add novel, expressive, and artistically rich transformations to images. Unlike conventional editing tasks that rely on direct prompt-based modifications, creative image editing demands an autonomous, iterative approach that balances originality, coherence, and artistic intent. To address this, we introduce CREA, a novel multi-agent collaborative framework that mimics the human creative process. Our framework leverages a team of specialized AI agents who dynamically collaborate to conceptualize, generate, critique, and enhance images. Through extensive qualitative and quantitative evaluations, we demonstrate that CREA significantly outperforms state-of-the-art methods in diversity, semantic alignment, and creative transformation. By structuring creativity as a dynamic, agentic process, CREA redefines the intersection of AI and art, paving the way for autonomous AI-driven artistic exploration, generative design, and human-AI co-creation. To the best of our knowledge, this is the first work to introduce the task of creative editing.

* Project URL: https://crea-diffusion.github.io

Via

Access Paper or Ask Questions

FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Dec 12, 2024

Yusuf Dalva, Kavana Venkatesh, Pinar Yanardag

Figure 1 for FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Figure 2 for FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Figure 3 for FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Figure 4 for FluxSpace: Disentangled Semantic Editing in Rectified Flow Transformers

Abstract:Rectified flow models have emerged as a dominant approach in image generation, showcasing impressive capabilities in high-quality image synthesis. However, despite their effectiveness in visual generation, rectified flow models often struggle with disentangled editing of images. This limitation prevents the ability to perform precise, attribute-specific modifications without affecting unrelated aspects of the image. In this paper, we introduce FluxSpace, a domain-agnostic image editing method leveraging a representation space with the ability to control the semantics of images generated by rectified flow transformers, such as Flux. By leveraging the representations learned by the transformer blocks within the rectified flow models, we propose a set of semantically interpretable representations that enable a wide range of image editing tasks, from fine-grained image editing to artistic creation. This work offers a scalable and effective image editing approach, along with its disentanglement capabilities.

* Project Page: https://fluxspace.github.io

Via

Access Paper or Ask Questions

Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Dec 12, 2024

Kavana Venkatesh, Yusuf Dalva, Ismini Lourentzou, Pinar Yanardag

Figure 1 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Figure 2 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Figure 3 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Figure 4 for Context Canvas: Enhancing Text-to-Image Diffusion Models with Knowledge Graph-Based RAG

Abstract:We introduce a novel approach to enhance the capabilities of text-to-image models by incorporating a graph-based RAG. Our system dynamically retrieves detailed character information and relational data from the knowledge graph, enabling the generation of visually accurate and contextually rich images. This capability significantly improves upon the limitations of existing T2I models, which often struggle with the accurate depiction of complex or culturally specific subjects due to dataset constraints. Furthermore, we propose a novel self-correcting mechanism for text-to-image models to ensure consistency and fidelity in visual outputs, leveraging the rich context from the graph to guide corrections. Our qualitative and quantitative experiments demonstrate that Context Canvas significantly enhances the capabilities of popular models such as Flux, Stable Diffusion, and DALL-E, and improves the functionality of ControlNet for fine-grained image editing tasks. To our knowledge, Context Canvas represents the first application of graph-based RAG in enhancing T2I models, representing a significant advancement for producing high-fidelity, context-aware multi-faceted images.

* Project Page: https://context-canvas.github.io/

Via

Access Paper or Ask Questions

Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

Sep 16, 2024

Kavana Venkatesh, Neethi M

Figure 1 for Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

Figure 2 for Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

Figure 3 for Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

Figure 4 for Fault Analysis And Predictive Maintenance Of Induction Motor Using Machine Learning

Abstract:Induction motors are one of the most crucial electrical equipment and are extensively used in industries in a wide range of applications. This paper presents a machine learning model for the fault detection and classification of induction motor faults by using three phase voltages and currents as inputs. The aim of this work is to protect vital electrical components and to prevent abnormal event progression through early detection and diagnosis. This work presents a fast forward artificial neural network model to detect some of the commonly occurring electrical faults like overvoltage, under voltage, single phasing, unbalanced voltage, overload, ground fault. A separate model free monitoring system wherein the motor itself acts like a sensor is presented and the only monitored signals are the input given to the motor. Limits for current and voltage values are set for the faulty and healthy conditions, which is done by a classifier. Real time data from a 0.33 HP induction motor is used to train and test the neural network. The model so developed analyses the voltage and current values given at a particular instant and classifies the data into no fault or the specific fault. The model is then interfaced with a real motor to accurately detect and classify the faults so that further necessary action can be taken.

* ICEECCOT-2018, Mysuru, India, 2018, pp. 1-6
* Presented at ICEECCOT-2018, Published in IEEE Xplore, 6 pages, 3 figures

Via

Access Paper or Ask Questions