Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sangjun Han

Data Intelligence Laboratory, LG AI Research

Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques

Feb 05, 2025

Sangjun Han, Taeil Hur, Youngmi Hur, Kathy Sangkyung Lee, Myungyoon Lee, Hyojae Lim

Abstract:The challenge of formal proof generation has a rich history, but with modern techniques, we may finally be at the stage of making actual progress in real-life mathematical problems. This paper explores the integration of ChatGPT and basic searching techniques to simplify generating formal proofs, with a particular focus on the miniF2F dataset. We demonstrate how combining a large language model like ChatGPT with a formal language such as Lean, which has the added advantage of being verifiable, enhances the efficiency and accessibility of formal proof generation. Despite its simplicity, our best-performing Lean-based model surpasses all known benchmarks with a 31.15% pass rate. We extend our experiments to include other datasets and employ alternative language models, showcasing our models' comparable performance in diverse settings and allowing for a more nuanced analysis of our results. Our findings offer insights into AI-assisted formal proof generation, suggesting a promising direction for future research in formal mathematical proof.

Via

Access Paper or Ask Questions

Meent: Differentiable Electromagnetic Simulator for Machine Learning

Jun 11, 2024

Yongha Kim, Anthony W. Jung, Sanmun Kim, Kevin Octavian, Doyoung Heo, Chaejin Park, Jeongmin Shin, Sunghyun Nam, Chanhyung Park, Juho Park(+5 more)

Figure 1 for Meent: Differentiable Electromagnetic Simulator for Machine Learning

Figure 2 for Meent: Differentiable Electromagnetic Simulator for Machine Learning

Figure 3 for Meent: Differentiable Electromagnetic Simulator for Machine Learning

Figure 4 for Meent: Differentiable Electromagnetic Simulator for Machine Learning

Abstract:Electromagnetic (EM) simulation plays a crucial role in analyzing and designing devices with sub-wavelength scale structures such as solar cells, semiconductor devices, image sensors, future displays and integrated photonic devices. Specifically, optics problems such as estimating semiconductor device structures and designing nanophotonic devices provide intriguing research topics with far-reaching real world impact. Traditional algorithms for such tasks require iteratively refining parameters through simulations, which often yield sub-optimal results due to the high computational cost of both the algorithms and EM simulations. Machine learning (ML) emerged as a promising candidate to mitigate these challenges, and optics research community has increasingly adopted ML algorithms to obtain results surpassing classical methods across various tasks. To foster a synergistic collaboration between the optics and ML communities, it is essential to have an EM simulation software that is user-friendly for both research communities. To this end, we present Meent, an EM simulation software that employs rigorous coupled-wave analysis (RCWA). Developed in Python and equipped with automatic differentiation (AD) capabilities, Meent serves as a versatile platform for integrating ML into optics research and vice versa. To demonstrate its utility as a research platform, we present three applications of Meent: 1) generating a dataset for training neural operator, 2) serving as an environment for the reinforcement learning of nanophotonic device optimization, and 3) providing a solution for inverse problems with gradient-based optimizers. These applications highlight Meent's potential to advance both EM simulation and ML methodologies. The code is available at https://github.com/kc-ml2/meent with the MIT license to promote the cross-polinations of ideas among academic researchers and industry practitioners.

* under review

Via

Access Paper or Ask Questions

Can We Utilize Pre-trained Language Models within Causal Discovery Algorithms?

Nov 19, 2023

Chanhui Lee, Juhyeon Kim, Yongjun Jeong, Juhyun Lyu, Junghee Kim, Sangmin Lee, Sangjun Han, Hyeokjun Choe, Soyeon Park, Woohyung Lim(+2 more)

Abstract:Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal relationships between variables utilizing data. Recently, there has been current research regarding a method that mimics causal discovery by aggregating the outcomes of repetitive causal reasoning, achieved through specifically designed prompts. It highlights the usefulness of PLMs in discovering cause and effect, which is often limited by a lack of data, especially when dealing with multiple variables. Conversely, the characteristics of PLMs which are that PLMs do not analyze data and they are highly dependent on prompt design leads to a crucial limitation for directly using PLMs in causal discovery. Accordingly, PLM-based causal reasoning deeply depends on the prompt design and carries out the risk of overconfidence and false predictions in determining causal relationships. In this paper, we empirically demonstrate the aforementioned limitations of PLM-based causal reasoning through experiments on physics-inspired synthetic data. Then, we propose a new framework that integrates prior knowledge obtained from PLM with a causal discovery algorithm. This is accomplished by initializing an adjacency matrix for causal discovery and incorporating regularization using prior knowledge. Our proposed framework not only demonstrates improved performance through the integration of PLM and causal discovery but also suggests how to leverage PLM-extracted prior knowledge with existing causal discovery algorithms.

Via

Access Paper or Ask Questions

Systematic Analysis of Music Representations from BERT

Jun 06, 2023

Sangjun Han, Hyeongrae Ihm, Woohyung Lim

Abstract:There have been numerous attempts to represent raw data as numerical vectors that effectively capture semantic and contextual information. However, in the field of symbolic music, previous works have attempted to validate their music embeddings by observing the performance improvement of various fine-tuning tasks. In this work, we directly analyze embeddings from BERT and BERT with contrastive learning trained on bar-level MIDI, inspecting their musical information that can be obtained from MIDI events. We observe that the embeddings exhibit distinct characteristics of information depending on the contrastive objectives and the choice of layers. Our code is available at https://github.com/sjhan91/MusicBERT.

Via

Access Paper or Ask Questions

Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model

Sep 05, 2022

Sangjun Han, Hyeongrae Ihm, DaeHan Ahn, Woohyung Lim

Figure 1 for Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model

Figure 2 for Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model

Figure 3 for Instrument Separation of Symbolic Music by Explicitly Guided Diffusion Model

Abstract:Similar to colorization in computer vision, instrument separation is to assign instrument labels (e.g. piano, guitar...) to notes from unlabeled mixtures which contain only performance information. To address the problem, we adopt diffusion models and explicitly guide them to preserve consistency between mixtures and music. The quantitative results show that our proposed model can generate high-fidelity samples for multitrack symbolic music with creativity.

* Submitted to NeurIPS 2022 Workshop on Machine Learning for Creativity and Design

Via

Access Paper or Ask Questions

Laplacian Pyramid-like Autoencoder

Aug 26, 2022

Sangjun Han, Taeil Hur, Youngmi Hur

Abstract:In this paper, we develop the Laplacian pyramid-like autoencoder (LPAE) by adding the Laplacian pyramid (LP) concept widely used to analyze images in Signal Processing. LPAE decomposes an image into the approximation image and the detail image in the encoder part and then tries to reconstruct the original image in the decoder part using the two components. We use LPAE for experiments on classifications and super-resolution areas. Using the detail image and the smaller-sized approximation image as inputs of a classification network, our LPAE makes the model lighter. Moreover, we show that the performance of the connected classification networks has remained substantially high. In a super-resolution area, we show that the decoder part gets a high-quality reconstruction image by setting to resemble the structure of LP. Consequently, LPAE improves the original results by combining the decoder part of the autoencoder and the super-resolution network.

* Intelligent Computing, SAI 2022. Lecture Notes in Networks and Systems, vol 507, pp 59-78
* 20 pages, 3 figures, 5 tables, Science and Information Conference 2022, Intelligent Computing

Via

Access Paper or Ask Questions

Symbolic Music Loop Generation with Neural Discrete Representations

Aug 11, 2022

Sangjun Han, Hyeongrae Ihm, Moontae Lee, Woohyung Lim

Figure 1 for Symbolic Music Loop Generation with Neural Discrete Representations

Figure 2 for Symbolic Music Loop Generation with Neural Discrete Representations

Figure 3 for Symbolic Music Loop Generation with Neural Discrete Representations

Figure 4 for Symbolic Music Loop Generation with Neural Discrete Representations

Abstract:Since most of music has repetitive structures from motifs to phrases, repeating musical ideas can be a basic operation for music composition. The basic block that we focus on is conceptualized as loops which are essential ingredients of music. Furthermore, meaningful note patterns can be formed in a finite space, so it is sufficient to represent them with combinations of discrete symbols as done in other domains. In this work, we propose symbolic music loop generation via learning discrete representations. We first extract loops from MIDI datasets using a loop detector and then learn an autoregressive model trained by discrete latent codes of the extracted loops. We show that our model outperforms well-known music generative models in terms of both fidelity and diversity, evaluating on random space. Our code and supplementary materials are available at https://github.com/sjhan91/Loop_VQVAE_Official.

* Camera-ready Version for ISMIR 2022

Via

Access Paper or Ask Questions

Symbolic Music Loop Generation with VQ-VAE

Nov 15, 2021

Sangjun Han, Hyeongrae Ihm, Woohyung Lim

Figure 1 for Symbolic Music Loop Generation with VQ-VAE

Figure 2 for Symbolic Music Loop Generation with VQ-VAE

Figure 3 for Symbolic Music Loop Generation with VQ-VAE

Figure 4 for Symbolic Music Loop Generation with VQ-VAE

Abstract:Music is a repetition of patterns and rhythms. It can be composed by repeating a certain number of bars in a structured way. In this paper, the objective is to generate a loop of 8 bars that can be used as a building block of music. Even considering musical diversity, we assume that music patterns familiar to humans can be defined in a finite set. With explicit rules to extract loops from music, we found that discrete representations are sufficient to model symbolic music sequences. Among VAE family, musical properties from VQ-VAE are better observed rather than other models. Further, to emphasize musical structure, we have manipulated discrete latent features to be repetitive so that the properties are more strengthened. Quantitative and qualitative experiments are extensively conducted to verify our assumptions.

Via

Access Paper or Ask Questions