Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xiaojing Zhang

Data-Driven Optical To Thermal Inference in Pool Boiling Using Generative Adversarial Networks

May 01, 2025

Qianxi Fu, Youngjoon Suh, Xiaojing Zhang, Yoonjin Won

Abstract:Phase change plays a critical role in thermal management systems, yet quantitative characterization of multiphase heat transfer remains limited by the challenges of measuring temperature fields in chaotic, rapidly evolving flow regimes. While computational methods offer spatiotemporal resolution in idealized cases, replicating complex experimental conditions remains prohibitively difficult. Here, we present a data-driven framework that leverages a conditional generative adversarial network (CGAN) to infer temperature fields from geometric phase contours in a canonical pool boiling configuration where advanced data collection techniques are restricted. Using high-speed imaging data and simulation-informed training, our model demonstrates the ability to reconstruct temperature fields with errors below 6%. We further show that standard data augmentation strategies are effective in enhancing both accuracy and physical plausibility of the predicted maps across both simulation and experimental datasets when precise physical constraints are not applicable. Our results highlight the potential of deep generative models to bridge the gap between observable multiphase phenomena and underlying thermal transport, offering a powerful approach to augment and interpret experimental measurements in complex two-phase systems.

* 17 pages, 5 figures, supplemental information

Via

Access Paper or Ask Questions

Interpreting and Improving Optimal Control Problems with Directional Corrections

Apr 01, 2025

Trevor Barron, Xiaojing Zhang

Abstract:Many robotics tasks, such as path planning or trajectory optimization, are formulated as optimal control problems (OCPs). The key to obtaining high performance lies in the design of the OCP's objective function. In practice, the objective function consists of a set of individual components that must be carefully modeled and traded off such that the OCP has the desired solution. It is often challenging to balance multiple components to achieve the desired solution and to understand, when the solution is undesired, the impact of individual cost components. In this paper, we present a framework addressing these challenges based on the concept of directional corrections. Specifically, given the solution to an OCP that is deemed undesirable, and access to an expert providing the direction of change that would increase the desirability of the solution, our method analyzes the individual cost components for their "consistency" with the provided directional correction. This information can be used to improve the OCP formulation, e.g., by increasing the weight of consistent cost components, or reducing the weight of - or even redesigning - inconsistent cost components. We also show that our framework can automatically tune parameters of the OCP to achieve consistency with a set of corrections.

* Paper accepted for publication at IEEE Robotics and Automation Letters (RA-L)

Via

Access Paper or Ask Questions

$\texttt{PatentAgent}$: Intelligent Agent for Automated Pharmaceutical Patent Analysis

Oct 25, 2024

Xin Wang, Yifan Zhang, Xiaojing Zhang, Longhui Yu, Xinna Lin, Jindong Jiang, Bin Ma, Kaicheng Yu

Abstract:Pharmaceutical patents play a vital role in biochemical industries, especially in drug discovery, providing researchers with unique early access to data, experimental results, and research insights. With the advancement of machine learning, patent analysis has evolved from manual labor to tasks assisted by automatic tools. However, there still lacks an unified agent that assists every aspect of patent analysis, from patent reading to core chemical identification. Leveraging the capabilities of Large Language Models (LLMs) to understand requests and follow instructions, we introduce the $\textbf{first}$ intelligent agent in this domain, $\texttt{PatentAgent}$, poised to advance and potentially revolutionize the landscape of pharmaceutical research. $\texttt{PatentAgent}$ comprises three key end-to-end modules -- $\textit{PA-QA}$, $\textit{PA-Img2Mol}$, and $\textit{PA-CoreId}$ -- that respectively perform (1) patent question-answering, (2) image-to-molecular-structure conversion, and (3) core chemical structure identification, addressing the essential needs of scientists and practitioners in pharmaceutical patent analysis. Each module of $\texttt{PatentAgent}$ demonstrates significant effectiveness with the updated algorithm and the synergistic design of $\texttt{PatentAgent}$ framework. $\textit{PA-Img2Mol}$ outperforms existing methods across CLEF, JPO, UOB, and USPTO patent benchmarks with an accuracy gain between 2.46% and 8.37% while $\textit{PA-CoreId}$ realizes accuracy improvement ranging from 7.15% to 7.62% on PatentNetML benchmark. Our code and dataset will be publicly available.

* 7 pages

Via

Access Paper or Ask Questions

BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science

Jun 29, 2024

Xinna Lin, Siqi Ma, Junjie Shan, Xiaojing Zhang, Shell Xu Hu, Tiannan Guo, Stan Z. Li, Kaicheng Yu

Abstract:Pursuing artificial intelligence for biomedical science, a.k.a. AI Scientist, draws increasing attention, where one common approach is to build a copilot agent driven by Large Language Models (LLMs). However, to evaluate such systems, people either rely on direct Question-Answering (QA) to the LLM itself, or in a biomedical experimental manner. How to precisely benchmark biomedical agents from an AI Scientist perspective remains largely unexplored. To this end, we draw inspiration from one most important abilities of scientists, understanding the literature, and introduce BioKGBench. In contrast to traditional evaluation benchmark that only focuses on factual QA, where the LLMs are known to have hallucination issues, we first disentangle "Understanding Literature" into two atomic abilities, i) "Understanding" the unstructured text from research papers by performing scientific claim verification, and ii) Ability to interact with structured Knowledge-Graph Question-Answering (KGQA) as a form of "Literature" grounding. We then formulate a novel agent task, dubbed KGCheck, using KGQA and domain-based Retrieval-Augmented Generation (RAG) to identify the factual errors of existing large-scale knowledge graph databases. We collect over two thousand data for two atomic tasks and 225 high-quality annotated data for the agent task. Surprisingly, we discover that state-of-the-art agents, both daily scenarios and biomedical ones, have either failed or inferior performance on our benchmark. We then introduce a simple yet effective baseline, dubbed BKGAgent. On the widely used popular knowledge graph, we discover over 90 factual errors which provide scenarios for agents to make discoveries and demonstrate the effectiveness of our approach. The code and data are available at https://github.com/westlake-autolab/BioKGBench.

Via

Access Paper or Ask Questions

a novel attention-based network for fast salient object detection

Dec 20, 2021

Bin Zhang, Yang Wu, Xiaojing Zhang, Ming Ma

Figure 1 for a novel attention-based network for fast salient object detection

Figure 2 for a novel attention-based network for fast salient object detection

Figure 3 for a novel attention-based network for fast salient object detection

Figure 4 for a novel attention-based network for fast salient object detection

Abstract:In the current salient object detection network, the most popular method is using U-shape structure. However, the massive number of parameters leads to more consumption of computing and storage resources which are not feasible to deploy on the limited memory device. Some others shallow layer network will not maintain the same accuracy compared with U-shape structure and the deep network structure with more parameters will not converge to a global minimum loss with great speed. To overcome all of these disadvantages, we proposed a new deep convolution network architecture with three contributions: (1) using smaller convolution neural networks (CNNs) to compress the model in our improved salient object features compression and reinforcement extraction module (ISFCREM) to reduce parameters of the model. (2) introducing channel attention mechanism in ISFCREM to weigh different channels for improving the ability of feature representation. (3) applying a new optimizer to accumulate the long-term gradient information during training to adaptively tune the learning rate. The results demonstrate that the proposed method can compress the model to 1/3 of the original size nearly without losing the accuracy and converging faster and more smoothly on six widely used datasets of salient object detection compared with the others models. Our code is published in https://gitee.com/binzhangbinzhangbin/code-a-novel-attention-based-network-for-fast-salient-object-detection.git

Via

Access Paper or Ask Questions

A Distributed Multi-Robot Coordination Algorithm for Navigation in Tight Environments

Jun 20, 2020

Roya Firoozi, Laura Ferranti, Xiaojing Zhang, Sebastian Nejadnik, Francesco Borrelli

Figure 1 for A Distributed Multi-Robot Coordination Algorithm for Navigation in Tight Environments

Figure 2 for A Distributed Multi-Robot Coordination Algorithm for Navigation in Tight Environments

Figure 3 for A Distributed Multi-Robot Coordination Algorithm for Navigation in Tight Environments

Figure 4 for A Distributed Multi-Robot Coordination Algorithm for Navigation in Tight Environments

Abstract:This work presents a distributed method for multi-robot coordination based on nonlinear model predictive control (NMPC) and dual decomposition. Our approach allows the robots to coordinate in tight spaces (e.g., highway lanes, parking lots, warehouses, canals, etc.) by using a polytopic description of each robot's shape and formulating the collision avoidance as a dual optimization problem. Our method accommodates heterogeneous teams of robots (i.e., robots with different polytopic shapes and dynamic models can be part of the same team) and can be used to avoid collisions in $n$-dimensional spaces. Starting from a centralized implementation of the NMPC problem, we show how to exploit the problem structure to allow the robots to cooperate (while communicating their intentions to the neighbors) and compute collision-free paths in a distributed way in real time. By relying on a bi-level optimization scheme, our design decouples the optimization of the robot states and of the collision-avoidance variables to create real time coordination strategies. Finally, we apply our method for the autonomous navigation of a platoon of connected vehicles on a simulation setting. We compare our design with the centralized NMPC design to show the computational benefits of the proposed distributed algorithm. In addition, we demonstrate our method for coordination of a heterogeneous team of robots (with different polytopic shapes).

Via

Access Paper or Ask Questions

Formation and Reconfiguration of Tight Multi-Lane Platoons

Mar 19, 2020

Roya Firoozi, Xiaojing Zhang, Francesco Borrelli

Figure 1 for Formation and Reconfiguration of Tight Multi-Lane Platoons

Figure 2 for Formation and Reconfiguration of Tight Multi-Lane Platoons

Figure 3 for Formation and Reconfiguration of Tight Multi-Lane Platoons

Figure 4 for Formation and Reconfiguration of Tight Multi-Lane Platoons

Abstract:Advances in vehicular communication technologies are expected to facilitate cooperative driving in the future. Connected and Automated Vehicles (CAVs) are able to collaboratively plan and execute driving maneuvers by sharing their perceptual knowledge and future plans. In this paper, we present an architecture for autonomous navigation of tight multi-lane platoons travelling on public roads. Using the proposed approach, CAVs are able to form single or multi-lane platoons of various geometrical configurations. They are able to reshape and adjust their configurations according to changes in the environment. The proposed architecture consists of three main components: an online decision-maker, an offline motion planner and an online path-follower. The decision-maker selects the desired platoon configuration based on real-time information about the surrounding traffic. The motion planner uses an optimization-based approach for cooperative formation and reconfiguration in tight spaces. The motion planner uses a Model Predictive Control scheme to plan smooth, dynamically feasible and collision-free trajectories for all the vehicles within the platoon. The paper addresses online computation limitations by employing a family of maneuvers pre-computed offline and stored on the vehicles' control units to be executed by a low-level path-following feedback controller in real-time based on the selected desired configuration. We demonstrate the effectiveness of our approach through simulations of three case studies: 1) formation reconfiguration 2) obstacle avoidance, and 3) bench-marking against behavior-based planning in which the desired formation is achieved using a sequence of motion primitives. Videos and software can be found online here https://github.com/RoyaFiroozi/Centralized-Planning.

Via

Access Paper or Ask Questions

Improving Neural Network Classifier using Gradient-based Floating Centroid Method

Jul 21, 2019

Mazharul Islam, Shuangrong Liu, Lin Wang, Xiaojing Zhang

Figure 1 for Improving Neural Network Classifier using Gradient-based Floating Centroid Method

Figure 2 for Improving Neural Network Classifier using Gradient-based Floating Centroid Method

Figure 3 for Improving Neural Network Classifier using Gradient-based Floating Centroid Method

Figure 4 for Improving Neural Network Classifier using Gradient-based Floating Centroid Method

Abstract:Floating centroid method (FCM) offers an efficient way to solve a fixed-centroid problem for the neural network classifiers. However, evolutionary computation as its optimization method restrains the FCM to achieve satisfactory performance for different neural network structures, because of the high computational complexity and inefficiency. Traditional gradient-based methods have been extensively adopted to optimize the neural network classifiers. In this study, a gradient-based floating centroid (GDFC) method is introduced to address the fixed centroid problem for the neural network classifiers optimized by gradient-based methods. Furthermore, a new loss function for optimizing GDFC is introduced. The experimental results display that GDFC obtains promising classification performance than the comparison methods on the benchmark datasets.

Via

Access Paper or Ask Questions

Safe and Near-Optimal Policy Learning for Model Predictive Control using Primal-Dual Neural Networks

Jun 19, 2019

Xiaojing Zhang, Monimoy Bujarbaruah, Francesco Borrelli

Figure 1 for Safe and Near-Optimal Policy Learning for Model Predictive Control using Primal-Dual Neural Networks

Figure 2 for Safe and Near-Optimal Policy Learning for Model Predictive Control using Primal-Dual Neural Networks

Abstract:In this paper, we propose a novel framework for approximating the explicit MPC law for linear parameter-varying systems using supervised learning. In contrast to most existing approaches, we not only learn the control policy, but also a "certificate policy", that allows us to estimate the sub-optimality of the learned control policy online, during execution-time. We learn both these policies from data using supervised learning techniques, and also provide a randomized method that allows us to guarantee the quality of each learned policy, measured in terms of feasibility and optimality. This in turn allows us to bound the probability of the learned control policy of being infeasible or suboptimal, where the check is performed by the certificate policy. Since our algorithm does not require the solution of an optimization problem during run-time, it can be deployed even on resource-constrained systems. We illustrate the efficacy of the proposed framework on a vehicle dynamics control problem where we demonstrate a speedup of up to two orders of magnitude compared to online optimization with minimal performance degradation.

* IEEE American Control Conference (ACC) 2019, July 9-12, Philadelphia, PA, USA

Via

Access Paper or Ask Questions

Optimization-Based Collision Avoidance

Jun 10, 2018

Xiaojing Zhang, Alexander Liniger, Francesco Borrelli

Figure 1 for Optimization-Based Collision Avoidance

Figure 2 for Optimization-Based Collision Avoidance

Figure 3 for Optimization-Based Collision Avoidance

Figure 4 for Optimization-Based Collision Avoidance

Abstract:This paper presents a novel method for reformulating non-differentiable collision avoidance constraints into smooth nonlinear constraints using strong duality of convex optimization. We focus on a controlled object whose goal is to avoid obstacles while moving in an n-dimensional space. The proposed reformulation does not introduce approximations, and applies to general obstacles and controlled objects that can be represented in an n-dimensional space as the finite union of convex sets. Furthermore, we connect our results with the notion of signed distance, which is widely used in traditional trajectory generation algorithms. Our method can be used in generic navigation and trajectory planning tasks, and the smoothness property allows the use of general-purpose gradient- and Hessian-based optimization algorithms. Finally, in case a collision cannot be avoided, our framework allows us to find "least-intrusive" trajectories, measured in terms of penetration. We demonstrate the efficacy of our framework on a quadcopter navigation and automated parking problem, and our numerical experiments suggest that the proposed methods enable real-time optimization-based trajectory planning problems in tight environments. Source code of our implementation is provided at https://github.com/XiaojingGeorgeZhang/OBCA.

* 27 pages, 9 figures, 2 tables

Via

Access Paper or Ask Questions