Showing 1–50 of 70 results for author: Nguyen, T Q

  1. arXiv:2406.19445  [pdf, other]

    hep-ph astro-ph.HE

    X-Ray Constraints on Dark Photon Tridents

    Authors: Tim Linden, Thong T. Q. Nguyen, Tim M. P. Tait

    Abstract: Dark photons that are sufficiently light and/or weakly-interacting represent a compelling vision of dark matter. Dark photon decay into three photons, which we call the dark photon trident, can be the dominant channel when the dark photon mass falls below the electron pair threshold and can produce a significant flux of x-rays. We use 16 years of data from INTEGRAL/SPI to constrain sub-MeV dark ph…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 4+3 pages, 4 figures. Comments are welcome!

  2. arXiv:2404.11152  [pdf, other]

    eess.IV cs.CV

    Multi-target and multi-stage liver lesion segmentation and detection in multi-phase computed tomography scans

    Authors: Abdullah F. Al-Battal, Soan T. M. Duong, Van Ha Tang, Quang Duc Tran, Steven Q. H. Truong, Chien Phan, Truong Q. Nguyen, Cheolhong An

    Abstract: Multi-phase computed tomography (CT) scans use contrast agents to highlight different anatomical structures within the body to improve the probability of identifying and detecting anatomical structures of interest and abnormalities such as liver lesions. Yet, detecting these lesions remains a challenging task as these lesions vary significantly in their size, shape, texture, and contrast with resp…

    Submitted 17 April, 2024; originally announced April 2024.

  3. arXiv:2402.01839  [pdf, other]

    hep-ph astro-ph.CO astro-ph.HE

    Indirect Searches for Dark Photon-Photon Tridents in Celestial Objects

    Authors: Tim Linden, Thong T. Q. Nguyen, Tim M. P. Tait

    Abstract: We model and constrain the unique indirect detection signature produced by dark matter particles that annihilate through a $U(1)$ gauge symmetry into dark photons that subsequently decay into three-photon final states. We focus on scenarios where the dark photon is long-lived, and show that $γ$-ray probes of celestial objects can set strong constraints on the dark matter/baryon scattering cross se…

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 12 pages, 7 figures (8 sub-figures), 3 tables

  4. arXiv:2402.01003  [pdf]

    stat.AP stat.ME

    Practical challenges in mediation analysis: A guide for applied researchers

    Authors: Megan S. Schuler, Donna L. Coffman, Elizabeth A. Stuart, Trang Q. Nguyen, Brian Vegetabile, Daniel F. McCaffrey

    Abstract: Mediation analysis is a statistical approach that can provide insights regarding the intermediary processes by which an intervention or exposure affects a given outcome. Mediation analyses rose to prominence, particularly in social science research, with the publication of the seminal paper by Baron and Kenny and is now commonly applied in many research disciplines, including health services resea…

    Submitted 1 February, 2024; originally announced February 2024.

  5. arXiv:2312.12292  [pdf, other]

    hep-ph astro-ph.CO astro-ph.HE

    Celestial Objects as Dark Matter Colliders

    Authors: Thong T. Q. Nguyen

    Abstract: In the vicinity of the Milky Way Galactic Center, celestial bodies, including neutron stars, reside within a dense dark matter environment. This study explores the accumulation of dark matter by neutron stars through dark matter-nucleon interactions, leading to increased internal dark matter density. Consequently, dark matter annihilation produces long-lived mediators that escape and decay into ne…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 3 pages, 2 figures, Proceedings for the "Window on the Universe" conference celebrating the 30th anniversary of "Rencontres de Vietnam", August 2023, Quy Nhon, Vietnam

  6. Identification of complier and noncomplier average causal effects in the presence of latent missing-at-random (LMAR) outcomes: a unifying view and choices of assumptions

    Authors: Trang Quynh Nguyen, Michelle C. Carlson, Elizabeth A. Stuart

    Abstract: The study of treatment effects is often complicated by noncompliance and missing data. In the one-sided noncompliance setting where of interest are the complier and noncomplier average causal effects (CACE and NACE), we address outcome missingness of the \textit{latent missing at random} type (LMAR, also known as \textit{latent ignorability}). That is, conditional on covariates and treatment assig…

    Submitted 18 December, 2023; originally announced December 2023.

    Journal ref: Biostatistics, 2024

  7. arXiv:2312.10785  [pdf, other]

    hep-ph astro-ph.CO

    Self-interacting Vectorial Dark Matter in a SM-like Dark Sector

    Authors: Van Que Tran, Thong T. Q. Nguyen, Tzu-Chiang Yuan

    Abstract: A $SU(2)_D \times U(1)_D$ gauge-Higgs sector, an exact dark copy of the Standard Model (SM) one, is proposed. It is demonstrated that the dark gauge bosons ${\cal W}^{(p,m)}$, in analogous to the SM $W^\pm$, can fulfill the role as a self-interacting vector dark matter candidate, solving the core versus cusp and missing satellites problems faced by the conventional paradigm of collisionless weakly…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 42 pages, 9 figures

  8. arXiv:2310.01413  [pdf]

    eess.IV cs.AI cs.CV

    A multi-institutional pediatric dataset of clinical radiology MRIs by the Children's Brain Tumor Network

    Authors: Ariana M. Familiar, Anahita Fathi Kazerooni, Hannah Anderson, Aliaksandr Lubneuski, Karthik Viswanathan, Rocky Breslow, Nastaran Khalili, Sina Bagheri, Debanjan Haldar, Meen Chul Kim, Sherjeel Arif, Rachel Madhogarhia, Thinh Q. Nguyen, Elizabeth A. Frenkel, Zeinab Helili, Jessica Harrison, Keyvan Farahani, Marius George Linguraru, Ulas Bagci, Yury Velichko, Jeffrey Stevens, Sarah Leary, Robert M. Lober, Stephani Campion, Amy A. Smith, et al. (15 additional authors not shown)

    Abstract: Pediatric brain and spinal cancers remain the leading cause of cancer-related death in children. Advancements in clinical decision-support in pediatric neuro-oncology utilizing the wealth of radiology imaging data collected through standard care, however, has significantly lagged other domains. Such data is ripe for use with predictive analytics such as artificial intelligence (AI) methods, which…

    Submitted 2 October, 2023; originally announced October 2023.

  9. arXiv:2309.17166  [pdf, other]

    cs.CV cs.AI

    Advances in Kidney Biopsy Lesion Assessment through Dense Instance Segmentation

    Authors: Zhan Xiong, Junling He, Pieter Valkema, Tri Q. Nguyen, Maarten Naesens, Jesper Kers, Fons J. Verbeek

    Abstract: Renal biopsies are the gold standard for diagnosis of kidney diseases. Lesion scores made by renal pathologists are semi-quantitative and exhibit high inter-observer variability. Automating lesion classification within segmented anatomical structures can provide decision support in quantification analysis and reduce the inter-observer variability. Nevertheless, classifying lesions in regions-of-in…

    Submitted 28 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: 16 pages, 15 figures, 6 tables, Journal

  10. arXiv:2306.03460  [pdf, other]

    cs.LG cs.CL cs.HC

    Natural Language Commanding via Program Synthesis

    Authors: Apurva Gandhi, Thong Q. Nguyen, Huitian Jiao, Robert Steen, Ameya Bhatawdekar

    Abstract: We present Semantic Interpreter, a natural language-friendly AI system for productivity software such as Microsoft Office that leverages large language models (LLMs) to execute user intent across application features. While LLMs are excellent at understanding user intent expressed as natural language, they are not sufficient for fulfilling application-specific user intent that requires more than t…

    Submitted 6 June, 2023; originally announced June 2023.

  11. arXiv:2305.18361  [pdf, other]

    eess.IV cs.CV

    Deep learning network to correct axial and coronal eye motion in 3D OCT retinal imaging

    Authors: Yiqian Wang, Alexandra Warter, Melina Cavichini, Varsha Alex, Dirk-Uwe G. Bartsch, William R. Freeman, Truong Q. Nguyen, Cheolhong An

    Abstract: Optical Coherence Tomography (OCT) is one of the most important retinal imaging technique. However, involuntary motion artifacts still pose a major challenge in OCT imaging that compromises the quality of downstream analysis, such as retinal layer segmentation and OCT Angiography. We propose deep learning based neural networks to correct axial and coronal motion artifacts in OCT based on a single…

    Submitted 26 May, 2023; originally announced May 2023.

  12. arXiv:2303.16299  [pdf, other]

    stat.ME stat.ML

    Comparison of Methods that Combine Multiple Randomized Trials to Estimate Heterogeneous Treatment Effects

    Authors: Carly Lupton Brantner, Trang Quynh Nguyen, Tengjie Tang, Congwen Zhao, Hwanhee Hong, Elizabeth A. Stuart

    Abstract: Individualized treatment decisions can improve health outcomes, but using data to make these decisions in a reliable, precise, and generalizable way is challenging with a single dataset. Leveraging multiple randomized controlled trials allows for the combination of datasets with unconfounded treatment assignment to better estimate heterogeneous treatment effects. This paper discusses several non-p…

    Submitted 15 November, 2023; v1 submitted 28 March, 2023; originally announced March 2023.

  13. arXiv:2303.14381  [pdf, other]

    cs.CV

    3D Facial Imperfection Regeneration: Deep learning approach and 3D printing prototypes

    Authors: Phuong D. Nguyen, Thinh D. Le, Duong Q. Nguyen, Thanh Q. Nguyen, Li-Wei Chou, H. Nguyen-Xuan

    Abstract: This study explores the potential of a fully convolutional mesh autoencoder model for regenerating 3D nature faces with the presence of imperfect areas. We utilize deep learning approaches in graph processing and analysis to investigate the capabilities model in recreating a filling part for facial scars. Our approach in dataset creation is able to generate a facial scar rationally in a virtual sp…

    Submitted 25 March, 2023; originally announced March 2023.

  14. arXiv:2303.05032  [pdf, other]

    stat.ME

    Sensitivity analysis for principal ignorability violation in estimating complier and noncomplier average causal effects

    Authors: Trang Quynh Nguyen, Elizabeth A. Stuart, Daniel O. Scharfstein, Elizabeth L. Ogburn

    Abstract: An important strategy for identifying principal causal effects, which are often used in settings with noncompliance, is to invoke the principal ignorability (PI) assumption. As PI is untestable, it is important to gauge how sensitive effect estimates are to its violation. We focus on this task for the common one-sided noncompliance setting where there are two principal strata, compliers and noncom…

    Submitted 28 March, 2024; v1 submitted 8 March, 2023; originally announced March 2023.

  15. arXiv:2302.13428  [pdf, ps, other]

    stat.ME

    Methods for Integrating Trials and Non-Experimental Data to Examine Treatment Effect Heterogeneity

    Authors: Carly Lupton Brantner, Ting-Hsuan Chang, Trang Quynh Nguyen, Hwanhee Hong, Leon Di Stefano, Elizabeth A. Stuart

    Abstract: Estimating treatment effects conditional on observed covariates can improve the ability to tailor treatments to particular individuals. Doing so effectively requires dealing with potential confounding, and also enough data to adequately estimate effect moderation. A recent influx of work has looked into estimating treatment effect heterogeneity using data from multiple randomized controlled trials…

    Submitted 28 March, 2023; v1 submitted 26 February, 2023; originally announced February 2023.

  16. Leptoquark search at the Forward Physics Facility

    Authors: Kingman Cheung, Thong T. Q. Nguyen, C. J. Ouseph

    Abstract: In this study, we calculate the sensitivity reach on the vector leptoquark (LQ) $U_1$ at the experiments proposed in Forward Physics Facility (FPF), including FASER$ν$, FASER$\nu2$, FLArE (10 tons), and FLArE (100 tons) using the neutrino-nucleon scattering ($νN \rightarrow νN'$ and $νN \rightarrow l N'$). We cover a wide mass range of $10^{-3}$ GeV $\leq M_{LQ}\leq 10^4$ GeV. The new result shows…

    Submitted 15 August, 2023; v1 submitted 10 February, 2023; originally announced February 2023.

    Comments: 21 pages, 10 figures. Adding two subfigures on the TeV mass LQ mass regime

    Journal ref: Phys. Rev. D 108 (2023) 3, 036014

  17. arXiv:2301.07066  [pdf, ps, other]

    stat.ME

    Multiple imputation for propensity score analysis with covariates missing at random: some clarity on within and across methods

    Authors: Trang Quynh Nguyen, Elizabeth A. Stuart

    Abstract: In epidemiology and social sciences, propensity score methods are popular for estimating treatment effects using observational data, and multiple imputation is popular for handling covariate missingness. However, how to appropriately use multiple imputation for propensity score analysis is not completely clear. This paper aims to bring clarity on the consistency (or lack thereof) of methods that h…

    Submitted 28 August, 2023; v1 submitted 17 January, 2023; originally announced January 2023.

  18. arXiv:2212.12547  [pdf, other]

    hep-ph astro-ph.CO astro-ph.HE

    Bounds on Long-lived Dark Matter Mediators from Neutron Stars

    Authors: Thong T. Q. Nguyen, Tim M. P. Tait

    Abstract: Neutron stars close to the Galactic center are expected to swim in a dense background of dark matter. For models in which the dark matter has efficient interactions with neutrons, they are expected to accumulate their own local cloud of dark matter, making them appealing targets for observations seeking signs of dark matter annihilation. For theories with very light mediators, the dark matter may…

    Submitted 15 June, 2023; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: 7 pages, 6 figures (2 sub-figures). Published version on PRD

    Journal ref: Phys. Rev. D 107 (2023) 115016

  19. arXiv:2208.10971  [pdf, other]

    hep-ph

    Obliquely Scrutinizing a Hidden SM-like Gauge Model

    Authors: Van Que Tran, Thong T. Q. Nguyen, Tzu-Chiang Yuan

    Abstract: In view of the recent high precision measurement of the Standard Model $W$ boson mass at the CDF II detector, we compute the contributions to the oblique parameters $S$, $T$ and $U$ coming from the two additional Higgs doublets (one inert and one hidden) as well as the hidden neutral dark gauge bosons and extra heavy fermions in the gauged two-Higgs-doublet model (G2HDM). While the effects from th…

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: 48 pages, 16 figures

  20. arXiv:2112.03946  [pdf]

    q-fin.ST cs.LG cs.NE

    Generative Adversarial Network (GAN) and Enhanced Root Mean Square Error (ERMSE): Deep Learning for Stock Price Movement Prediction

    Authors: Ashish Kumar, Abeer Alsadoon, P. W. C. Prasad, Salma Abdullah, Tarik A. Rashid, Duong Thu Hang Pham, Tran Quoc Vinh Nguyen

    Abstract: The prediction of stock price movement direction is significant in financial circles and academic. Stock price contains complex, incomplete, and fuzzy information which makes it an extremely difficult task to predict its development trend. Predicting and analysing financial data is a nonlinear, time-dependent problem. With rapid development in machine learning and deep learning, this task can be p…

    Submitted 30 November, 2021; originally announced December 2021.

    Comments: 18 pages. Multimed Tools Appl, 2021

  21. Low-temperature acanthite-like phase of Cu$_{2}$S: A first-principles study on electronic and transport properties

    Authors: Ho Ngoc Nam, Katsuhiro Suzuki, Tien Quang Nguyen, Akira Masago, Hikari Shinya, Tetsuya Fukushima, Kazunori Sato

    Abstract: The mobility and disorder in the lattice of Cu atoms as liquid-like behavior is an important characteristic affecting the thermoelectric properties of Cu$_{2}$S. In this study, using a theoretical model called acanthite-like structure for Cu$_{2}$S at a low-temperature range, we systematically investigate the electronic structure, intrinsic defect formation, and transport properties by first-princ…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: 10 pages

  22. arXiv:2108.03986  [pdf, other]

    physics.ins-det hep-ex

    Autoencoders on FPGAs for real-time, unsupervised new physics detection at 40 MHz at the Large Hadron Collider

    Authors: Ekaterina Govorkova, Ema Puljak, Thea Aarrestad, Thomas James, Vladimir Loncar, Maurizio Pierini, Adrian Alan Pol, Nicolò Ghielmetti, Maksymilian Graczyk, Sioni Summers, Jennifer Ngadiuba, Thong Q. Nguyen, Javier Duarte, Zhenbin Wu

    Abstract: In this paper, we show how to adapt and deploy anomaly detection algorithms based on deep autoencoders, for the unsupervised detection of new physics signatures in the extremely challenging environment of a real-time event selection system at the Large Hadron Collider (LHC). We demonstrate that new physics signatures can be enhanced by three orders of magnitude, while staying within the strict lat…

    Submitted 12 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Report number: FERMILAB-PUB-21-487-CMS

    Journal ref: Nature Machine Intelligence 4, 154 (2022)

  23. arXiv:2108.02929  [pdf]

    cs.CV

    VinaFood21: A Novel Dataset for Evaluating Vietnamese Food Recognition

    Authors: Thuan Trong Nguyen, Thuan Q. Nguyen, Dung Vo, Vi Nguyen, Ngoc Ho, Nguyen D. Vo, Kiet Van Nguyen, Khang Nguyen

    Abstract: Vietnam is such an attractive tourist destination with its stunning and pristine landscapes and its top-rated unique food and drink. Among thousands of Vietnamese dishes, foreigners and native people are interested in easy-to-eat tastes and easy-to-do recipes, along with reasonable prices, mouthwatering flavors, and popularity. Due to the diversity and almost all the dishes have significant simila…

    Submitted 5 August, 2021; originally announced August 2021.

  24. arXiv:2106.13849  [pdf, other]

    cs.CV eess.IV

    A CNN Segmentation-Based Approach to Object Detection and Tracking in Ultrasound Scans with Application to the Vagus Nerve Detection

    Authors: Abdullah F. Al-Battal, Yan Gong, Lu Xu, Timothy Morton, Chen Du, Yifeng Bu, Imanuel R Lerman, Radhika Madhavan, Truong Q. Nguyen

    Abstract: Ultrasound scanning is essential in several medical diagnostic and therapeutic applications. It is used to visualize and analyze anatomical features and structures that influence treatment plans. However, it is both labor intensive, and its effectiveness is operator dependent. Real-time accurate and robust automatic detection and tracking of anatomical structures while scanning would significantly…

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: 7 pages, 4 figures, submitted to the IEEE EMBC 2021 conference

  25. arXiv:2105.01691  [pdf, other]

    cs.CL

    Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution

    Authors: Toan Q. Nguyen, Kenton Murray, David Chiang

    Abstract: In this paper, we investigate the driving factors behind concatenation, a simple but effective data augmentation method for low-resource neural machine translation. Our experiments suggest that discourse context is unlikely the cause for the improvement of about +1 BLEU across four language pairs. Instead, we demonstrate that the improvement comes from three other factors unrelated to discourse: c…

    Submitted 2 July, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: Accepted at IWSLT 2021

  26. Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

    Authors: Julia Kreutzer, Isaac Caswell, Lisa Wang, Ahsan Wahab, Daan van Esch, Nasanbayar Ulzii-Orshikh, Allahsera Tapo, Nishant Subramani, Artem Sokolov, Claytone Sikasote, Monang Setyawan, Supheakmungkol Sarin, Sokhar Samb, Benoît Sagot, Clara Rivera, Annette Rios, Isabel Papadimitriou, Salomey Osei, Pedro Ortiz Suarez, Iroro Orife, Kelechi Ogueji, Andre Niyongabo Rubungo, Toan Q. Nguyen, Mathias Müller, André Müller, et al. (27 additional authors not shown)

    Abstract: With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large, web-mined text datasets covering hundreds of languages. We manually audit the quality of 205 language-specific corpora released with five major public datasets (CCAligned, ParaCrawl, WikiMatrix, OSCAR, mC4). Lower-resource corpora have system…

    Submitted 21 February, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted at TACL; pre-MIT Press publication version

    Journal ref: Transactions of the Association for Computational Linguistics (2022) 10: 50-72

  27. arXiv:2103.04447  [pdf, ps, other]

    cs.DM cs.DS math.CO

    Termination of Multipartite Graph Series Arising from Complex Network Modelling

    Authors: Matthieu Latapy, Thi Ha Duong Phan, Christophe Crespelle, Thanh Qui Nguyen

    Abstract: An intense activity is nowadays devoted to the definition of models capturing the properties of complex networks. Among the most promising approaches, it has been proposed to model these graphs via their clique incidence bipartite graphs. However, this approach has, until now, severe limitations resulting from its incapacity to reproduce a key property of this object: the overlapping nature of cli…

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: Published in LNCS, proceedings of the 4th International Conference on Combinatorial Optimization and Applications (COCOA), 2010

  28. Causal mediation analysis: From simple to more robust strategies for estimation of marginal natural (in)direct effects

    Authors: Trang Quynh Nguyen, Elizabeth L. Ogburn, Ian Schmid, Elizabeth B. Sarker, Noah Greifer, Ina M. Koning, Elizabeth A. Stuart

    Abstract: This paper aims to provide practitioners of causal mediation analysis with a better understanding of estimation options. We take as inputs two familiar strategies (weighting and model-based prediction) and a simple way of combining them (weighted models), and show how a range of estimators can be generated, with different modeling requirements and robustness properties. The primary goal is to help…

    Submitted 13 January, 2023; v1 submitted 11 February, 2021; originally announced February 2021.

    MSC Class: 62D20

    Journal ref: Statistics Surveys. 2023. 17:1-41

  29. arXiv:2102.04990  [pdf, other]

    cs.CV cs.CL

    In Defense of Scene Graphs for Image Captioning

    Authors: Kien Nguyen, Subarna Tripathi, Bang Du, Tanaya Guha, Truong Q. Nguyen

    Abstract: The mainstream image captioning models rely on Convolutional Neural Network (CNN) image features to generate captions via recurrent models. Recently, image scene graphs have been used to augment captioning models so as to leverage their structural semantics, such as object entities, relationships and attributes. Several studies have noted that the naive use of scene graphs from a black-box scene g…

    Submitted 17 August, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: Accepted to ICCV 2021

  30. Sensitivity analyses for effect modifiers not observed in the target population when generalizing treatment effects from a randomized controlled trial: Assumptions, models, effect scales, data scenarios, and implementation details

    Authors: Trang Quynh Nguyen, Benjamin Ackerman, Ian Schmid, Stephen R. Cole, Elizabeth A. Stuart

    Abstract: Background: Randomized controlled trials are often used to inform policy and practice for broad populations. The average treatment effect (ATE) for a target population, however, may be different from the ATE observed in a trial if there are effect modifiers whose distribution in the target population is different that from that in the trial. Methods exist to use trial data to estimate the target p…

    Submitted 25 November, 2020; originally announced November 2020.

    Journal ref: PLOS ONE. 2018. 13(12): e0208795

  31. Clarifying causal mediation analysis: Effect identification via three assumptions and five potential outcomes

    Authors: Trang Quynh Nguyen, Ian Schmid, Elizabeth L. Ogburn, Elizabeth A. Stuart

    Abstract: Causal mediation analysis is complicated with multiple effect definitions that require different sets of assumptions for identification. This paper provides a systematic explanation of such assumptions. We define five potential outcome types whose means are involved in various effect definitions. We tackle their mean/distribution's identification, starting with the one that requires the weakest as…

    Submitted 7 July, 2022; v1 submitted 18 November, 2020; originally announced November 2020.

    Journal ref: Journal of Causal Inference. 2022. 10:246-279

  32. arXiv:2010.01835  [pdf, other]

    physics.comp-ph cs.LG hep-ex hep-ph

    Data Augmentation at the LHC through Analysis-specific Fast Simulation with Deep Learning

    Authors: Cheng Chen, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

    Abstract: We present a fast simulation application based on a Deep Neural Network, designed to create large analysis-specific datasets. Taking as an example the generation of W+jet events produced in sqrt(s)= 13 TeV proton-proton collisions, we train a neural network to model detector resolution effects as a transfer function acting on an analysis-specific set of relevant features, computed at generation le…

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 15 pages, 12 figures

  33. arXiv:2005.01598  [pdf, other]

    hep-ex cs.LG hep-ph

    Adversarially Learned Anomaly Detection on CMS Open Data: re-discovering the top quark

    Authors: Oliver Knapp, Guenther Dissertori, Olmo Cerri, Thong Q. Nguyen, Jean-Roch Vlimant, Maurizio Pierini

    Abstract: We apply an Adversarially Learned Anomaly Detection (ALAD) algorithm to the problem of detecting new physics processes in proton-proton collisions at the Large Hadron Collider. Anomaly detection based on ALAD matches performances reached by Variational Autoencoders, with a substantial improvement in some cases. Training the ALAD algorithm on 4.4 fb-1 of 8 TeV CMS Open Data, we show how a data-driv…

    Submitted 3 October, 2020; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 16 pages, 9 figures

  34. First-principles calculation of electronic density of states and Seebeck coefficient in transition-metal-doped Si-Ge alloys

    Authors: Ryo Yamada, Akira Masago, Tetsuya Fukushima, Hikari Shinya, Tien Quang Nguyen, Kazunori Sato

    Abstract: High $ZT$ value and large Seebeck coefficient have been reported in the nanostructured Fe-doped Si-Ge alloys. In this work, the large Seebeck coefficient in Fe-doped Si-Ge systems is qualitatively reproduced from the computed electronic density of states, where a hybrid functional, HSE06, is used for an exchange-correlation functional, as well as a special quasi-random structure (SQS) for a disord…

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: 6 pages

  35. Particle Generative Adversarial Networks for full-event simulation at the LHC and their application to pileup description

    Authors: Jesus Arjona Martinez, Thong Q Nguyen, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: We investigate how a Generative Adversarial Network could be used to generate a list of particle four-momenta from LHC proton collisions, allowing one to define a generative model that could abstract from the irregularities of typical detector geometries. As an example of application, we show how such an architecture could be used as a generator of LHC parasitic collisions (pileup). We present two…

    Submitted 5 December, 2019; originally announced December 2019.

    Comments: 7 pages, 5 figures. To be appeared in Proceedings of the 19th International Workshop on Advanced Computing and Analysis Techniques in Physics Research

    Journal ref: J. Phys. : Conf. Ser. 1525 (2020) 012081

  36. arXiv:1910.14659  [pdf, other]

    cs.CL cs.LG eess.AS stat.ML

    Masked Language Model Scoring

    Authors: Julian Salazar, Davis Liang, Toan Q. Nguyen, Katrin Kirchhoff

    Abstract: Pretrained masked language models (MLMs) require finetuning for most NLP tasks. Instead, we evaluate MLMs out of the box via their pseudo-log-likelihood scores (PLLs), which are computed by masking tokens one by one. We show that PLLs outperform scores from autoregressive language models like GPT-2 in a variety of tasks. By rescoring ASR and NMT hypotheses, RoBERTa reduces an end-to-end LibriSpeec…

    Submitted 31 December, 2020; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: ACL 2020 camera-ready (presented July 2020)

    Journal ref: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020), 2699-2712

  37. arXiv:1910.06717  [pdf, other]

    cs.CL cs.LG stat.ML

    Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

    Authors: Kenton Murray, Jeffery Kinnison, Toan Q. Nguyen, Walter Scheirer, David Chiang

    Abstract: Neural sequence-to-sequence models, particularly the Transformer, are the state of the art in machine translation. Yet these neural networks are very sensitive to architecture and hyperparameter settings. Optimizing these settings by grid or random search is computationally expensive because it requires many training runs. In this paper, we incorporate architecture search into a single training ru…

    Submitted 1 October, 2019; originally announced October 2019.

    Comments: The 3rd Workshop on Neural Generation and Translation (WNGT 2019)

  38. arXiv:1910.05895  [pdf, other]

    cs.CL cs.LG stat.ML

    Transformers without Tears: Improving the Normalization of Self-Attention

    Authors: Toan Q. Nguyen, Julian Salazar

    Abstract: We evaluate three simple, normalization-centric changes to improve Transformer training. First, we show that pre-norm residual connections (PreNorm) and smaller initializations enable warmup-free, validation-based training with large learning rates. Second, we propose $\ell_2$ normalization with a single scale parameter (ScaleNorm) for faster training and better performance. Finally, we reaffirm t…

    Submitted 29 December, 2019; v1 submitted 13 October, 2019; originally announced October 2019.

    Comments: Accepted to IWSLT 2019 (oral); code is available at https://github.com/tnq177/transformers_without_tears

  39. arXiv:1910.05600  [pdf, other]

    stat.AP

    Partially Pooled Propensity Score Models for Average Treatment Effect Estimation with Multilevel Data

    Authors: Youjin Lee, Trang Q. Nguyen, Elizabeth A. Stuart

    Abstract: Causal inference analyses often use existing observational data, which in many cases has some clustering of individuals. In this paper we discuss propensity score weighting methods in a multilevel setting where within clusters individuals share unmeasured confounders that are related to treatment assignment and the potential outcomes. We focus in particular on settings where models with fixed clus…

    Submitted 22 December, 2020; v1 submitted 12 October, 2019; originally announced October 2019.

  40. Interaction networks for the identification of boosted $H\to b\overline{b}$ decays

    Authors: Eric A. Moreno, Thong Q. Nguyen, Jean-Roch Vlimant, Olmo Cerri, Harvey B. Newman, Avikar Periwal, Maria Spiropulu, Javier M. Duarte, Maurizio Pierini

    Abstract: We develop an algorithm based on an interaction network to identify high-transverse-momentum Higgs bosons decaying to bottom quark-antiquark pairs and distinguish them from ordinary jets that reflect the configurations of quarks and gluons at short distances. The algorithm's inputs are features of the reconstructed charged particles in a jet and the secondary vertices associated with them. Describ…

    Submitted 28 July, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: 20 pages, 8 figures, 6 tables, version published in PRD

    Report number: FERMILAB-PUB-19-492-CMS-E

    Journal ref: Phys. Rev. D 102, 012010 (2020)

  41. JEDI-net: a jet identification algorithm based on interaction networks

    Authors: Eric A. Moreno, Olmo Cerri, Javier M. Duarte, Harvey B. Newman, Thong Q. Nguyen, Avikar Periwal, Maurizio Pierini, Aidana Serikova, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: We investigate the performance of a jet identification algorithm based on interaction networks (JEDI-net) to identify all-hadronic decays of high-momentum heavy particles produced at the LHC and distinguish them from ordinary jets originating from the hadronization of quarks and gluons. The jet dynamics are described as a set of one-to-one interactions between the jet constituents. Based on a repr…

    Submitted 27 January, 2020; v1 submitted 14 August, 2019; originally announced August 2019.

    Comments: 16 pages, 9 figures, 7 tables

    Report number: FERMILAB-PUB-19-360-PPD

    Journal ref: Eur. Phys. J. C 80, 58 (2020)

  42. arXiv:1907.12709  [pdf, other]

    stat.ME

    Propensity score analysis with latent covariates: Measurement error bias correction using the covariate's posterior mean, aka the inclusive factor score

    Authors: Trang Quynh Nguyen, Elizabeth A. Stuart

    Abstract: We address measurement error bias in propensity score (PS) analysis due to covariates that are latent variables. In the setting where latent covariate $X$ is measured via multiple error-prone items $\mathbf{W}$, PS analysis using several proxies for $X$ -- the $\mathbf{W}$ items themselves, a summary score (mean/sum of the items), or the conventional factor score (cFS, i.e., predicted value of…

    Submitted 11 February, 2020; v1 submitted 29 July, 2019; originally announced July 2019.

  43. Clarifying causal mediation analysis for the applied researcher: Defining effects based on what we want to learn

    Authors: Trang Quynh Nguyen, Ian Schmid, Elizabeth A. Stuart

    Abstract: The incorporation of causal inference in mediation analysis has led to theoretical and methodological advancements -- effect definitions with causal interpretation, clarification of assumptions required for effect identification, and an expanding array of options for effect estimation. However, the literature on these results is fast-growing and complex, which may be confusing to researchers unfam…

    Submitted 15 May, 2020; v1 submitted 17 April, 2019; originally announced April 2019.

    Journal ref: Psychological Methods. 2021. 26(2):255-271

  44. arXiv:1901.07838  [pdf, other]

    cs.CV

    Toward Joint Image Generation and Compression using Generative Adversarial Networks

    Authors: Byeongkeun Kang, Subarna Tripathi, Truong Q. Nguyen

    Abstract: In this paper, we present a generative adversarial network framework that generates compressed images instead of synthesizing raw RGB images and compressing them separately. In the real world, most images and videos are stored and transferred in a compressed format to save storage capacity and data transfer bandwidth. However, since typical generative adversarial networks generate raw RGB images,…

    Submitted 23 January, 2019; originally announced January 2019.

  45. Random Forest with Learned Representations for Semantic Segmentation

    Authors: Byeongkeun Kang, Truong Q. Nguyen

    Abstract: In this work, we present a random forest framework that learns the weights, shapes, and sparsities of feature representations for real-time semantic segmentation. Typical filters (kernels) have predetermined shapes and sparsities and learn only weights. A few feature extraction methods fix weights and learn only shapes and sparsities. These predetermined constraints restrict learning and extractin…

    Submitted 23 January, 2019; originally announced January 2019.

  46. arXiv:1811.10276  [pdf, other]

    hep-ex cs.LG hep-ph

    Variational Autoencoders for New Physics Mining at the Large Hadron Collider

    Authors: Olmo Cerri, Thong Q. Nguyen, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: Using variational autoencoders trained on known physics processes, we develop a one-sided threshold test to isolate previously unseen processes as outlier events. Since the autoencoder training does not depend on any specific new physics signature, the proposed procedure doesn't make specific assumptions on the nature of new physics. An event selection based on this algorithm would be complementar…

    Submitted 13 June, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: 29 pages, 12 figures, 5 tables

    Journal ref: J. High Energ. Phys. (2019) 2019: 36

  47. arXiv:1807.00083  [pdf, other]

    hep-ex cs.LG hep-ph physics.data-an

    Topology classification with deep learning to improve real-time event selection at the LHC

    Authors: Thong Q. Nguyen, Daniel Weitekamp III, Dustin Anderson, Roberto Castello, Olmo Cerri, Maurizio Pierini, Maria Spiropulu, Jean-Roch Vlimant

    Abstract: We show how event topology classification based on deep learning could be used to improve the purity of data samples selected in real time at the Large Hadron Collider. We consider different data representations, on which different kinds of multi-class classifiers are trained. Both raw data and high-level features are utilized. In the considered examples, a filter based on the classifier's scor…

    Submitted 2 September, 2019; v1 submitted 29 June, 2018; originally announced July 2018.

    Comments: This is a pre-print of an article published in Computing and Software for Big Science. The final authenticated version is available online at: https://doi.org/10.1007/s41781-019-0028-1

    Journal ref: Comput Softw Big Sci (2019) 3: 12

  48. arXiv:1805.10558  [pdf, other]

    cs.CV

    DPW-SDNet: Dual Pixel-Wavelet Domain Deep CNNs for Soft Decoding of JPEG-Compressed Images

    Authors: Honggang Chen, Xiaohai He, Linbo Qing, Shuhua Xiong, Truong Q. Nguyen

    Abstract: JPEG is one of the widely used lossy compression methods. JPEG-compressed images usually suffer from compression artifacts including blocking and blurring, especially at low bit-rates. Soft decoding is an effective solution to improve the quality of compressed images without changing codec or introducing extra coding bits. Inspired by the excellent performance of the deep convolutional neural netw…

    Submitted 26 May, 2018; originally announced May 2018.

    Comments: CVPRW 2018

  49. arXiv:1803.04477  [pdf, other]

    cs.CV

    Correction by Projection: Denoising Images with Generative Adversarial Networks

    Authors: Subarna Tripathi, Zachary C. Lipton, Truong Q. Nguyen

    Abstract: Generative adversarial networks (GANs) transform low-dimensional latent vectors into visually plausible images. If the real dataset contains only clean images, then ostensibly, the manifold learned by the GAN should contain only clean images. In this paper, we propose to denoise corrupted images by finding the nearest point on the GAN manifold, recovering latent vectors by minimizing distances in…

    Submitted 12 March, 2018; originally announced March 2018.

  50. arXiv:1802.01458  [pdf, other]

    eess.IV cs.CV math.ST stat.ML

    Image denoising with generalized Gaussian mixture model patch priors

    Authors: Charles-Alban Deledalle, Shibin Parameswaran, Truong Q. Nguyen

    Abstract: Patch priors have become an important component of image restoration. A powerful approach in this category of restoration algorithms is the popular Expected Patch Log-Likelihood (EPLL) algorithm. EPLL uses a Gaussian mixture model (GMM) prior learned on clean image patches as a way to regularize degraded patches. In this paper, we show that a generalized Gaussian mixture model (GGMM) captures the…

    Submitted 11 June, 2018; v1 submitted 5 February, 2018; originally announced February 2018.