Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Thong Q. Nguyen

Natural Language Commanding via Program Synthesis

Jun 06, 2023

Apurva Gandhi, Thong Q. Nguyen, Huitian Jiao, Robert Steen, Ameya Bhatawdekar

Figure 1 for Natural Language Commanding via Program Synthesis

Figure 2 for Natural Language Commanding via Program Synthesis

Figure 3 for Natural Language Commanding via Program Synthesis

Figure 4 for Natural Language Commanding via Program Synthesis

Abstract:We present Semantic Interpreter, a natural language-friendly AI system for productivity software such as Microsoft Office that leverages large language models (LLMs) to execute user intent across application features. While LLMs are excellent at understanding user intent expressed as natural language, they are not sufficient for fulfilling application-specific user intent that requires more than text-to-text transformations. We therefore introduce the Office Domain Specific Language (ODSL), a concise, high-level language specialized for performing actions in and interacting with entities in Office applications. Semantic Interpreter leverages an Analysis-Retrieval prompt construction method with LLMs for program synthesis, translating natural language user utterances to ODSL programs that can be transpiled to application APIs and then executed. We focus our discussion primarily on a research exploration for Microsoft PowerPoint.

Via

Access Paper or Ask Questions

Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Oct 05, 2020

Cheng Chen, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

Figure 1 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Figure 2 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Figure 3 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Figure 4 for Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

Abstract:We present a fast simulation application based on a Deep Neural Network, designed to create large analysis-specific datasets. Taking as an example the generation of W+jet events produced in sqrt(s)= 13 TeV proton-proton collisions, we train a neural network to model detector resolution effects as a transfer function acting on an analysis-specific set of relevant features, computed at generation level, i.e., in absence of detector effects. Based on this model, we propose a novel fast-simulation workflow that starts from a large amount of generator-level events to deliver large analysis-specific samples. The adoption of this approach would result in about an order-of-magnitude reduction in computing and storage requirements for the collision simulation workflow. This strategy could help the high energy physics community to face the computing challenges of the future High-Luminosity LHC.

* 15 pages, 12 figures

Via

Access Paper or Ask Questions

Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

May 04, 2020

Oliver Knapp, Guenther Dissertori, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

Figure 1 for Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

Figure 2 for Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

Figure 3 for Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

Figure 4 for Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

Abstract:We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton-proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb-1 of 8 TeV CMS Open Data, we show how a data-driven anomaly detection and characterization would work in real life, re-discovering the top quark by identifying the main features of the t-tbar experimental signature at the LHC.

* 16 pages, 9 figures

Via

Access Paper or Ask Questions

Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Dec 05, 2018

Olmo Cerri, Thong Q. Nguyen, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

Figure 1 for Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Figure 2 for Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Figure 3 for Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Figure 4 for Variational Autoencoders for New Physics Mining at the Large Hadron Collider

Abstract:Using variational autoencoders trained on known physics processes, we develop a one-side p-value test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure has a weak dependence on underlying assumptions about the nature of new physics. An event selection based on this algorithm would be complementary to classic LHC searches, typically based on model-dependent hypothesis testing. Such an algorithm would deliver a list of anomalous events, that the experimental collaborations could further scrutinize and even release as a catalog, similarly to what is typically done in other scientific domains. Repeated patterns in this dataset could motivate new scenarios for beyond-the-standard-model physics and inspire new searches, to be performed on future data with traditional supervised approaches. Running in the trigger system of the LHC experiments, such an application could identify anomalous events that would be otherwise lost, extending the scientific reach of the LHC.

* 17 pages, 9 figures, 4 tables

Via

Access Paper or Ask Questions

Topology classification with deep learning to improve real-time event selection at the LHC

Aug 01, 2018

Thong Q. Nguyen, Daniel Weitekamp III, Dustin Anderson, Roberto Castello, Olmo Cerri, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

Figure 1 for Topology classification with deep learning to improve real-time event selection at the LHC

Figure 2 for Topology classification with deep learning to improve real-time event selection at the LHC

Figure 3 for Topology classification with deep learning to improve real-time event selection at the LHC

Figure 4 for Topology classification with deep learning to improve real-time event selection at the LHC

Abstract:We show how event topology classification based on deep learning could be used to improve the purity of data samples selected in real time at at the Large Hadron Collider. We consider different data representations, on which different kinds of multi-class classifiers are trained. Both raw data and high-level features are utilized. In the considered examples, a filter based on the classifier's score can be trained to retain ~99% of the interesting events and reduce the false-positive rate by as much as one order of magnitude for certain background processes. By operating such a filter as part of the online event selection infrastructure of the LHC experiments, one could benefit from a more flexible and inclusive selection strategy while reducing the amount of downstream resources wasted in processing false positives. The saved resources could be translated into a reduction of the detector operation cost or into an effective increase of storage and processing capabilities, which could be reinvested to extend the physics reach of the LHC experiments.

* 15 pages, 9 figures, 2 tables

Via

Access Paper or Ask Questions