Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Mingzhe Wang

RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Dec 12, 2025

Rongcheng Wu, Hao Zhu, Shiying Zhang, Mingzhe Wang, Zhidong Li, Hui Li, Jianlong Zhou, Jiangtao Cui, Fang Chen, Pingyang Sun(+2 more)

Figure 1 for RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Figure 2 for RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Figure 3 for RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Figure 4 for RcAE: Recursive Reconstruction Framework for Unsupervised Industrial Anomaly Detection

Abstract:Unsupervised industrial anomaly detection requires accurately identifying defects without labeled data. Traditional autoencoder-based methods often struggle with incomplete anomaly suppression and loss of fine details, as their single-pass decoding fails to effectively handle anomalies with varying severity and scale. We propose a recursive architecture for autoencoder (RcAE), which performs reconstruction iteratively to progressively suppress anomalies while refining normal structures. Unlike traditional single-pass models, this recursive design naturally produces a sequence of reconstructions, progressively exposing suppressed abnormal patterns. To leverage this reconstruction dynamics, we introduce a Cross Recursion Detection (CRD) module that tracks inconsistencies across recursion steps, enhancing detection of both subtle and large-scale anomalies. Additionally, we incorporate a Detail Preservation Network (DPN) to recover high-frequency textures typically lost during reconstruction. Extensive experiments demonstrate that our method significantly outperforms existing non-diffusion methods, and achieves performance on par with recent diffusion models with only 10% of their parameters and offering substantially faster inference. These results highlight the practicality and efficiency of our approach for real-world applications.

* 19 pages, 7 figures, to be published in AAAI-26

Via

Access Paper or Ask Questions

When Fuzzing Meets LLMs: Challenges and Opportunities

Apr 25, 2024

Yu Jiang, Jie Liang, Fuchen Ma, Yuanliang Chen, Chijin Zhou, Yuheng Shen, Zhiyong Wu, Jingzhou Fu, Mingzhe Wang, ShanShan Li(+1 more)

Abstract:Fuzzing, a widely-used technique for bug detection, has seen advancements through Large Language Models (LLMs). Despite their potential, LLMs face specific challenges in fuzzing. In this paper, we identified five major challenges of LLM-assisted fuzzing. To support our findings, we revisited the most recent papers from top-tier conferences, confirming that these challenges are widespread. As a remedy, we propose some actionable recommendations to help improve applying LLM in Fuzzing and conduct preliminary evaluations on DBMS fuzzing. The results demonstrate that our recommendations effectively address the identified challenges.

Via

Access Paper or Ask Questions

Infinite Photorealistic Worlds using Procedural Generation

Jun 26, 2023

Alexander Raistrick, Lahav Lipson, Zeyu Ma, Lingjie Mei, Mingzhe Wang, Yiming Zuo, Karhan Kayan, Hongyu Wen, Beining Han, Yihan Wang(+5 more)

Figure 1 for Infinite Photorealistic Worlds using Procedural Generation

Figure 2 for Infinite Photorealistic Worlds using Procedural Generation

Figure 3 for Infinite Photorealistic Worlds using Procedural Generation

Figure 4 for Infinite Photorealistic Worlds using Procedural Generation

Abstract:We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from scratch via randomized mathematical rules, using no external source and allowing infinite variation and composition. Infinigen offers broad coverage of objects and scenes in the natural world including plants, animals, terrains, and natural phenomena such as fire, cloud, rain, and snow. Infinigen can be used to generate unlimited, diverse training data for a wide range of computer vision tasks including object detection, semantic segmentation, optical flow, and 3D reconstruction. We expect Infinigen to be a useful resource for computer vision research and beyond. Please visit https://infinigen.org for videos, code and pre-generated data.

* Accepted to CVPR 2023, Camera Ready Version. Update 06/26/23: Change the open-source license to BSD

Via

Access Paper or Ask Questions

Adaptive Task Offloading for Space Missions: A State-Graph-Based Approach

Nov 16, 2022

Jiaqi Cao, Shengli Zhang, Mingzhe Wang, Qingxia Chen, Houtian Wang, Naijin Liu

Abstract:Advances in space exploration have led to an explosion of tasks. Conventionally, these tasks are offloaded to ground servers for enhanced computing capability, or to adjacent low-earth-orbit satellites for reduced transmission delay. However, the overall delay is determined by both computation and transmission costs. The existing offloading schemes, while being highly-optimized for either costs, can be abysmal for the overall performance. The computation-transmission cost dilemma is yet to be solved. In this paper, we propose an adaptive offloading scheme to reduce the overall delay. The core idea is to jointly model and optimize the transmission-computation process over the entire network. Specifically, to represent the computation state migrations, we generalize graph nodes with multiple states. In this way, the joint optimization problem is transformed into a shortest path problem over the state graph. We further provide an extended Dijkstra's algorithm for efficient path finding. Simulation results show that the proposed scheme outperforms the ground and one-hop offloading schemes by up to 37.56% and 39.35% respectively on SpaceCube v2.0.

Via

Access Paper or Ask Questions

A Unified Framework of Surrogate Loss by Refactoring and Interpolation

Jul 27, 2020

Lanlan Liu, Mingzhe Wang, Jia Deng

Figure 1 for A Unified Framework of Surrogate Loss by Refactoring and Interpolation

Figure 2 for A Unified Framework of Surrogate Loss by Refactoring and Interpolation

Figure 3 for A Unified Framework of Surrogate Loss by Refactoring and Interpolation

Figure 4 for A Unified Framework of Surrogate Loss by Refactoring and Interpolation

Abstract:We introduce UniLoss, a unified framework to generate surrogate losses for training deep networks with gradient descent, reducing the amount of manual design of task-specific surrogate losses. Our key observation is that in many cases, evaluating a model with a performance metric on a batch of examples can be refactored into four steps: from input to real-valued scores, from scores to comparisons of pairs of scores, from comparisons to binary variables, and from binary variables to the final performance metric. Using this refactoring we generate differentiable approximations for each non-differentiable step through interpolation. Using UniLoss, we can optimize for different tasks and metrics using one unified framework, achieving comparable performance compared with task-specific losses. We validate the effectiveness of UniLoss on three tasks and four datasets. Code is available at https://github.com/princeton-vl/uniloss.

* Accepted to ECCV 2020

Via

Access Paper or Ask Questions

Learning to Prove Theorems by Learning to Generate Theorems

Feb 17, 2020

Mingzhe Wang, Jia Deng

Figure 1 for Learning to Prove Theorems by Learning to Generate Theorems

Figure 2 for Learning to Prove Theorems by Learning to Generate Theorems

Figure 3 for Learning to Prove Theorems by Learning to Generate Theorems

Figure 4 for Learning to Prove Theorems by Learning to Generate Theorems

Abstract:We consider the task of automated theorem proving, a key AI task. Deep learning has shown promise for training theorem provers, but there are limited human-written theorems and proofs available for supervised learning. To address this limitation, we propose to learn a neural generator that automatically synthesizes theorems and proofs for the purpose of training a theorem prover. Experiments on real-world tasks demonstrate that synthetic data from our approach improves the theorem prover and advances the state of the art of automated theorem proving in Metamath.

Via

Access Paper or Ask Questions

Speaker Naming in Movies

Sep 24, 2018

Mahmoud Azab, Mingzhe Wang, Max Smith, Noriyuki Kojima, Jia Deng, Rada Mihalcea

Abstract:We propose a new model for speaker naming in movies that leverages visual, textual, and acoustic modalities in an unified optimization framework. To evaluate the performance of our model, we introduce a new dataset consisting of six episodes of the Big Bang Theory TV show and eighteen full movies covering different genres. Our experiments show that our multimodal model significantly outperforms several competitive baselines on the average weighted F-score metric. To demonstrate the effectiveness of our framework, we design an end-to-end memory network model that leverages our speaker naming model and achieves state-of-the-art results on the subtitles task of the MovieQA 2017 Challenge.

Via

Access Paper or Ask Questions

Premise Selection for Theorem Proving by Deep Graph Embedding

Sep 28, 2017

Mingzhe Wang, Yihe Tang, Jian Wang, Jia Deng

Figure 1 for Premise Selection for Theorem Proving by Deep Graph Embedding

Figure 2 for Premise Selection for Theorem Proving by Deep Graph Embedding

Figure 3 for Premise Selection for Theorem Proving by Deep Graph Embedding

Figure 4 for Premise Selection for Theorem Proving by Deep Graph Embedding

Abstract:We propose a deep learning-based approach to the problem of premise selection: selecting mathematical statements relevant for proving a given conjecture. We represent a higher-order logic formula as a graph that is invariant to variable renaming but still fully preserves syntactic and semantic information. We then embed the graph into a vector via a novel embedding method that preserves the information of edge ordering. Our approach achieves state-of-the-art results on the HolStep dataset, improving the classification accuracy from 83% to 90.3%.

* Mingzhe Wang and Yihe Tang contributed equally

Via

Access Paper or Ask Questions

LINE: Large-scale Information Network Embedding

Mar 12, 2015

Jian Tang, Meng Qu, Mingzhe Wang, Ming Zhang, Jun Yan, Qiaozhu Mei

Figure 1 for LINE: Large-scale Information Network Embedding

Figure 2 for LINE: Large-scale Information Network Embedding

Figure 3 for LINE: Large-scale Information Network Embedding

Figure 4 for LINE: Large-scale Information Network Embedding

Abstract:This paper studies the problem of embedding very large information networks into low-dimensional vector spaces, which is useful in many tasks such as visualization, node classification, and link prediction. Most existing graph embedding methods do not scale for real world information networks which usually contain millions of nodes. In this paper, we propose a novel network embedding method called the "LINE," which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted. The method optimizes a carefully designed objective function that preserves both the local and global network structures. An edge-sampling algorithm is proposed that addresses the limitation of the classical stochastic gradient descent and improves both the effectiveness and the efficiency of the inference. Empirical experiments prove the effectiveness of the LINE on a variety of real-world information networks, including language networks, social networks, and citation networks. The algorithm is very efficient, which is able to learn the embedding of a network with millions of vertices and billions of edges in a few hours on a typical single machine. The source code of the LINE is available online.

* WWW 2015

Via

Access Paper or Ask Questions