David Yang

Effect of nearby Metals on Electro-Quasistatic Human Body Communication

Oct 06, 2025

Samyadip Sarkar, Arunashish Datta, David Yang, Mayukh Nath, Shovan Maity, Shreyas Sen

Abstract: In recent decades, Human Body Communication (HBC) has emerged as a promising alternative to traditional radio wave communication, utilizing the body's conductive properties for low-power connectivity among wearables. This method harnesses the human body as an energy-efficient channel for data transmission within the electro-quasistatic (EQS) frequency range, enabling advancements in human-machine interaction. While prior work has noted the role of parasitic return paths in such capacitively coupled systems, the influence of surrounding metallic objects on these paths, which are critical for EQS wireless signaling, has not been fully explored. This paper fills that gap with a structured study of how various conducting objects, from non-grounded (floating) metals and grounded metals to enclosed metallic environments such as elevators and cars, affect the body-communication channel. We present a theoretical framework supported by finite element method simulations and experiments with wearable devices. Results show that metallic objects within 20 cm of devices can reduce transmission loss by about 10 dB. When a device ground connects to a grounded metallic object, channel gain can increase by at least 20 dB. Contact area during touch-based interactions with grounded metals produces contact-impedance-dependent high-pass channel characteristics. Proximity to metallic objects introduces variability within a critical distance, with grounded metals producing a larger overall effect than floating metals. These findings improve understanding of body-centric communication links and inform design for healthcare, consumer electronics, defense, and industrial applications.

* 18 pages, 25 figures, 2 tables, 5 appendices
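
To put the decibel figures in the abstract above into linear terms, the following minimal sketch converts a dB change into the corresponding ratio. The abstract does not say whether its numbers refer to power or to voltage, so both standard conversions are shown; the function names are hypothetical and chosen only for this illustration.

```python
def db_to_power_ratio(db: float) -> float:
    """Linear power ratio for a change of `db` decibels (10*log10 convention)."""
    return 10 ** (db / 10)


def db_to_amplitude_ratio(db: float) -> float:
    """Linear voltage/amplitude ratio for a change of `db` decibels (20*log10 convention)."""
    return 10 ** (db / 20)


# The two changes quoted in the abstract (about 10 dB and at least 20 dB).
for change_db in (10, 20):
    print(
        f"{change_db} dB -> power x{db_to_power_ratio(change_db):.2f}, "
        f"amplitude x{db_to_amplitude_ratio(change_db):.2f}"
    )
```

Under these conventions, a 20 dB gain corresponds to a tenfold amplitude ratio, or a hundredfold power ratio.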

Beyond the First Read: AI-Assisted Perceptual Error Detection in Chest Radiography Accounting for Interobserver Variability

Jun 16, 2025

Adhrith Vutukuri, Akash Awasthi, David Yang, Carol C. Wu, Hien Van Nguyen

Abstract: Chest radiography is widely used in diagnostic imaging. However, perceptual errors, especially overlooked but visible abnormalities, remain common and clinically significant. Current workflows and AI systems provide limited support for detecting such errors after interpretation and often lack meaningful human-AI collaboration. We introduce RADAR (Radiologist-AI Diagnostic Assistance and Review), a post-interpretation companion system. RADAR ingests finalized radiologist annotations and chest X-ray (CXR) images, then performs regional-level analysis to detect and refer potentially missed abnormal regions. The system supports a "second-look" workflow and offers suggested regions of interest (ROIs) rather than fixed labels to accommodate inter-observer variation. We evaluated RADAR on a simulated perceptual-error dataset derived from de-identified CXR cases, using F1 score and Intersection over Union (IoU) as primary metrics. RADAR achieved a recall of 0.78, precision of 0.44, and an F1 score of 0.56 in detecting missed abnormalities in the simulated perceptual-error dataset. Although precision is moderate, this reduces over-reliance on AI by encouraging radiologist oversight in human-AI collaboration. The median IoU was 0.78, with more than 90% of referrals exceeding 0.5 IoU, indicating accurate regional localization. RADAR effectively complements radiologist judgment, providing valuable post-read support for perceptual-error detection in CXR interpretation. Its flexible ROI suggestions and non-intrusive integration position it as a promising tool in real-world radiology workflows. To facilitate reproducibility and further evaluation, we release a fully open-source web implementation alongside a simulated error dataset. All code, data, demonstration videos, and the application are publicly available at https://github.com/avutukuri01/RADAR.

* 25 pages
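
The metrics quoted in the abstract above can be sanity-checked with a short sketch. It recomputes F1 from the reported precision and recall, and shows one common way to compute IoU. The helper names are hypothetical, and the assumption that ROIs are axis-aligned boxes given as `(x_min, y_min, x_max, y_max)` is made only for illustration; this is not taken from the RADAR code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def box_iou(box_a, box_b) -> float:
    """Intersection over Union of two axis-aligned boxes.

    Each box is (x_min, y_min, x_max, y_max); this box format is an
    assumption for the example, not a detail from the paper.
    """
    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    inter_h = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    intersection = inter_w * inter_h

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - intersection
    return intersection / union if union > 0 else 0.0


# Reported precision 0.44 and recall 0.78 give an F1 of about 0.56.
print(round(f1_score(0.44, 0.78), 2))

# Two made-up 10x10 boxes that overlap by half share one third of their union.
print(box_iou((0, 0, 10, 10), (5, 0, 15, 10)))
```

The recomputed F1 agrees with the 0.56 stated in the abstract, so the three reported figures are mutually consistent.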

Edge-boosted graph learning for functional brain connectivity analysis

Apr 21, 2025

David Yang, Mostafa Abdelmegeed, John Modl, Minjeong Kim

Abstract: Predicting disease states from functional brain connectivity is critical for the early diagnosis of severe neurodegenerative diseases such as Alzheimer's Disease and Parkinson's Disease. Existing studies commonly employ Graph Neural Networks (GNNs) to infer clinical diagnoses from node-based brain connectivity matrices generated through node-to-node similarities of regionally averaged fMRI signals. However, recent neuroscience studies found that such node-based connectivity does not accurately capture "functional connections" within the brain. This paper proposes a novel approach to brain network analysis that emphasizes edge functional connectivity (eFC), shifting the focus to inter-edge relationships. Additionally, we introduce a co-embedding technique to integrate edge functional connections effectively. Experimental results on the ADNI and PPMI datasets demonstrate that our method significantly outperforms state-of-the-art GNN methods in classifying functional brain networks.

* Accepted at IEEE International Symposium on Biomedical Imaging (ISBI) 2025, 4 pages

CompCap: Improving Multimodal Large Language Models with Composite Captions

Dec 06, 2024

Xiaohui Chen, Satya Narayan Shukla, Mahmoud Azab, Aashu Singh, Qifan Wang, David Yang, ShengYun Peng, Hanchao Yu, Shen Yan, Xuewen Zhang (+1 more)

Abstract: How well can Multimodal Large Language Models (MLLMs) understand composite images? Composite images (CIs) are synthetic visuals created by merging multiple visual elements, such as charts, posters, or screenshots, rather than being captured directly by a camera. While CIs are prevalent in real-world applications, recent MLLM developments have primarily focused on interpreting natural images (NIs). Our research reveals that current MLLMs face significant challenges in accurately understanding CIs, often struggling to extract information or perform complex reasoning based on these images. We find that existing training data for CIs are mostly formatted for question-answer tasks (e.g., in datasets like ChartQA and ScienceQA), while high-quality image-caption datasets, critical for robust vision-language alignment, are only available for NIs. To bridge this gap, we introduce Composite Captions (CompCap), a flexible framework that leverages Large Language Models (LLMs) and automation tools to synthesize CIs with accurate and detailed captions. Using CompCap, we curate CompCap-118K, a dataset containing 118K image-caption pairs across six CI types. We validate the effectiveness of CompCap-118K by supervised fine-tuning MLLMs of three sizes: xGen-MM-inst.-4B and LLaVA-NeXT-Vicuna-7B/13B. Empirical results show that CompCap-118K significantly enhances MLLMs' understanding of CIs, yielding average gains of 1.7%, 2.0%, and 2.9% across eleven benchmarks, respectively.

Practical Phase Retrieval Using Double Deep Image Priors

Nov 02, 2022

Zhong Zhuang, David Yang, Felix Hofmann, David Barmherzig, Ju Sun

Abstract: Phase retrieval (PR) concerns the recovery of complex phases from complex magnitudes. We identify the connection between the difficulty level and the number and variety of symmetries in PR problems. We focus on the most difficult far-field PR (FFPR), and propose a novel method using double deep image priors. In realistic evaluation, our method outperforms all competing methods by large margins. As a single-instance method, our method requires no training data and minimal hyperparameter tuning, and hence enjoys good practicality.

Object-Centric Unsupervised Image Captioning

Dec 02, 2021

Zihang Meng, David Yang, Xuefei Cao, Ashish Shah, Ser-Nam Lim

Abstract: Training an image captioning model in an unsupervised manner, without annotated image-caption pairs, is an important step towards tapping into a wider corpus of text and images. In the supervised setting, image-caption pairs are "well-matched": all objects mentioned in the sentence appear in the corresponding image. Such pairings are, however, not available in the unsupervised setting. A main line of research that has been shown to be effective in overcoming this constructs pairs from the images and texts in the training set according to their overlap of objects. Unlike in the supervised setting, these constructed pairings are not guaranteed to have a fully overlapping set of objects. Our work overcomes this by harvesting objects corresponding to a given sentence from the training set, even if they don't belong to the same image. When used as input to a transformer, such a mixture of objects enables larger, if not full, object coverage and, when supervised by the corresponding sentence, produces results that outperform current state-of-the-art unsupervised methods by a significant margin. Building upon this finding, we further show that (1) additional information on relationships between objects and on attributes of objects also helps boost performance; and (2) our method extends well to non-English image captioning, which usually suffers from scarcer annotations. Our findings are supported by strong empirical results.

Phase Retrieval using Single-Instance Deep Generative Prior

Jun 22, 2021

Kshitij Tayal, Raunak Manekar, Zhong Zhuang, David Yang, Vipin Kumar, Felix Hofmann, Ju Sun

Abstract: Several deep learning methods for phase retrieval exist, but most of them fail on realistic data without precise support information. We propose a novel method based on single-instance deep generative prior that works well on complex-valued crystal data.

Multimodal Fusion Refiner Networks

Apr 08, 2021

Sethuraman Sankaran, David Yang, Ser-Nam Lim

Abstract: Tasks that rely on multi-modal information typically include a fusion module that combines information from different modalities. In this work, we develop a Refiner Fusion Network (ReFNet) that enables fusion modules to combine strong unimodal representations with strong multimodal representations. ReFNet combines the fusion network with a decoding/defusing module, which imposes a modality-centric responsibility condition. This approach addresses a big gap in existing multimodal fusion frameworks by ensuring that both unimodal and fused representations are strongly encoded in the latent fusion space. We demonstrate that the Refiner Fusion Network can improve upon the performance of powerful baseline fusion modules such as multimodal transformers. The refiner network enables inducing graphical representations of the fused embeddings in the latent space, which we prove under certain conditions and which is supported by strong empirical results in the numerical experiments. These graph structures are further strengthened by combining the ReFNet with a Multi-Similarity contrastive loss function. The modular nature of the Refiner Fusion Network makes it easy to combine with different fusion architectures, and in addition, the refiner step can be applied for pre-training on unlabeled datasets, thus leveraging unsupervised data towards improving performance. We demonstrate the power of Refiner Fusion Networks on three datasets, and further show that they can maintain performance with only a small fraction of labeled data.

* 11 pages, 6 figures

Robust Deep Learning with Active Noise Cancellation for Spatial Computing

Nov 16, 2020

Li Chen, David Yang, Purvi Goel, Ilknur Kabul

Abstract: This paper proposes CANC, a Co-teaching Active Noise Cancellation method, applied in spatial computing to address deep learning trained with extremely noisy labels. Deep learning algorithms have been successful in spatial computing for land or building footprint recognition. However, a lot of noise exists in ground truth labels due to how labels are collected in spatial computing and satellite imagery. Existing methods to deal with extreme label noise conduct clean sample selection and do not utilize the remaining samples. Such techniques can be wasteful due to the cost of data retrieval. Our proposed CANC algorithm not only conserves high-cost training samples but also provides active label correction to improve robust deep learning with extremely noisy labels. We demonstrate the effectiveness of CANC for building footprint recognition for spatial computing.
