Skip to main content

Showing 1–20 of 20 results for author: Ng, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02745  [pdf, other

    cs.LG

    Measuring Stochastic Data Complexity with Boltzmann Influence Functions

    Authors: Nathan Ng, Roger Grosse, Marzyeh Ghassemi

    Abstract: Estimating the uncertainty of a model's prediction on a test point is a crucial part of ensuring reliability and calibration under distribution shifts. A minimum description length approach to this problem uses the predictive normalized maximum likelihood (pNML) distribution, which considers every possible label for a data point, and decreases confidence in a prediction if other labels are also co… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2402.08225  [pdf, other

    cs.LG

    Improving Black-box Robustness with In-Context Rewriting

    Authors: Kyle O'Brien, Nathan Ng, Isha Puri, Jorge Mendez, Hamid Palangi, Yoon Kim, Marzyeh Ghassemi, Thomas Hartvigsen

    Abstract: Machine learning models often excel on in-distribution (ID) data but struggle with unseen out-of-distribution (OOD) inputs. Most techniques for improving OOD robustness are not applicable to settings where the model is effectively a black box, such as when the weights are frozen, retraining is costly, or the model is leveraged via an API. Test-time augmentation (TTA) is a simple post-hoc technique… ▽ More

    Submitted 15 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  3. arXiv:2309.01670  [pdf, other

    q-bio.GN cs.LG

    Blind Biological Sequence Denoising with Self-Supervised Set Learning

    Authors: Nathan Ng, Ji Won Park, Jae Hyeon Lee, Ryan Lewis Kelly, Stephen Ra, Kyunghyun Cho

    Abstract: Biological sequence analysis relies on the ability to denoise the imprecise output of sequencing platforms. We consider a common setting where a short sequence is read out repeatedly using a high-throughput long-read platform to generate multiple subreads, or noisy observations of the same sequence. Denoising these subreads with alignment-based approaches often fails when too few subreads are avai… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  4. arXiv:2304.04598  [pdf

    cs.SD eess.AS eess.SP

    In-situ crack and keyhole pore detection in laser directed energy deposition through acoustic signal and deep learning

    Authors: Lequn Chen, Xiling Yao, Chaolin Tan, Weiyang He, **long Su, Fei Weng, Youxiang Chew, Nicholas Poh Huat Ng, Seung Ki Moon

    Abstract: Cracks and keyhole pores are detrimental defects in alloys produced by laser directed energy deposition (LDED). Laser-material interaction sound may hold information about underlying complex physical events such as crack propagation and pores formation. However, due to the noisy environment and intricate signal content, acoustic-based monitoring in LDED has received little attention. This paper pr… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 36 Pages, 16 Figures, accepted at journal Additive Manufacturing

  5. arXiv:2212.10387  [pdf, other

    cs.DB cs.DC

    Tuning the Tail Latency of Distributed Queries Using Replication

    Authors: Nathan Ng, Hung Le, Marco Serafini

    Abstract: Querying graph data with low latency is an important requirement in application domains such as social networks and knowledge graphs. Graph queries perform multiple hops between vertices. When data is partitioned and stored across multiple servers, queries executing at one server often need to hop to vertices stored by another server. Such distributed traversals represent a performance bottleneck… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: An earlier version of this paper was submitted in April 2022. Previous versions are available at https://marcoserafini.github.io/projects/latency-replication/index.html

  6. arXiv:2209.05364  [pdf, other

    cs.LG stat.ML

    If Influence Functions are the Answer, Then What is the Question?

    Authors: Juhan Bae, Nathan Ng, Alston Lo, Marzyeh Ghassemi, Roger Grosse

    Abstract: Influence functions efficiently estimate the effect of removing a single training data point on a model's learned parameters. While influence estimates align well with leave-one-out retraining for linear models, recent works have shown this alignment is often poor in neural networks. In this work, we investigate the specific factors that cause this discrepancy by decomposing it into five separate… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 28 pages, 6 figures

  7. arXiv:2207.02093  [pdf, other

    cs.LG stat.ML

    Predicting Out-of-Domain Generalization with Neighborhood Invariance

    Authors: Nathan Ng, Neha Hulkund, Kyunghyun Cho, Marzyeh Ghassemi

    Abstract: Develo** and deploying machine learning models safely depends on the ability to characterize and compare their abilities to generalize to new environments. Although recent work has proposed a variety of methods that can directly predict or theoretically bound the generalization capacity of a model, they rely on strong assumptions such as matching train/test distributions and access to model grad… ▽ More

    Submitted 17 July, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: 38 pages, 5 figures, 28 tables

  8. arXiv:2112.04963  [pdf, other

    cs.LG physics.ao-ph

    Model-Agnostic Hybrid Numerical Weather Prediction and Machine Learning Paradigm for Solar Forecasting in the Tropics

    Authors: Nigel Yuan Yun Ng, Harish Gopalan, Venugopalan S. G. Raghavan, Chin Chun Ooi

    Abstract: Numerical weather prediction (NWP) and machine learning (ML) methods are popular for solar forecasting. However, NWP models have multiple possible physical parameterizations, which requires site-specific NWP optimization. This is further complicated when regional NWP models are used with global climate models with different possible parameterizations. In this study, an alternative approach is prop… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  9. The Multiscenario Multienvironment BioSecure Multimodal Database (BMDB)

    Authors: Javier Ortega-Garcia, Julian Fierrez, Fernando Alonso-Fernandez, Javier Galbally, Manuel R Freire, Joaquin Gonzalez-Rodriguez, Carmen Garcia-Mateo, Jose-Luis Alba-Castro, Elisardo Gonzalez-Agulla, Enrique Otero-Muras, Sonia Garcia-Salicetti, Lorene Allano, Bao Ly-Van, Bernadette Dorizzi, Josef Kittler, Thirimachos Bourlai, Norman Poh, Farzin Deravi, Ming NR Ng, Michael Fairhurst, Jean Hennebert, Andreas Humm, Massimo Tistarelli, Linda Brodo, Jonas Richiardi , et al. (7 additional authors not shown)

    Abstract: A new multimodal biometric database designed and acquired within the framework of the European BioSecure Network of Excellence is presented. It is comprised of more than 600 individuals acquired simultaneously in three scenarios: 1) over the Internet, 2) in an office environment with desktop PC, and 3) in indoor/outdoor environments with mobile portable hardware. The three scenarios include a comm… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: Published at IEEE Transactions on Pattern Analysis and Machine Intelligence journal

  10. arXiv:2011.00136  [pdf, other

    cs.CL cs.LG

    Improving Dialogue Breakdown Detection with Semi-Supervised Learning

    Authors: Nathan Ng, Marzyeh Ghassemi, Narendran Thangarajan, Jiacheng Pan, Qi Guo

    Abstract: Building user trust in dialogue agents requires smooth and consistent dialogue exchanges. However, agents can easily lose conversational context and generate irrelevant utterances. These situations are called dialogue breakdown, where agent utterances prevent users from continuing the conversation. Building systems to detect dialogue breakdown allows agents to recover appropriately or avoid breakd… ▽ More

    Submitted 19 January, 2023; v1 submitted 30 October, 2020; originally announced November 2020.

    Comments: 6 pages, 1 figure, accepted at the NeurIPS Workshop on Human in the Loop Dialogue Systems

  11. arXiv:2009.10195  [pdf, other

    cs.CL cs.LG stat.ML

    SSMBA: Self-Supervised Manifold Based Data Augmentation for Improving Out-of-Domain Robustness

    Authors: Nathan Ng, Kyunghyun Cho, Marzyeh Ghassemi

    Abstract: Models that perform well on a training domain often fail to generalize to out-of-domain (OOD) examples. Data augmentation is a common method used to prevent overfitting and improve OOD generalization. However, in natural language, it is difficult to generate new examples that stay on the underlying data manifold. We introduce SSMBA, a data augmentation method for generating synthetic training exam… ▽ More

    Submitted 4 October, 2020; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: 16 pages, 8 figures, to be published in EMNLP 2020

  12. arXiv:1908.05731  [pdf, ps, other

    cs.CL

    Simple and Effective Noisy Channel Modeling for Neural Machine Translation

    Authors: Kyra Yee, Nathan Ng, Yann N. Dauphin, Michael Auli

    Abstract: Previous work on neural noisy channel modeling relied on latent variable models that incrementally process the source and target sentence. This makes decoding decisions based on partial source prefixes even though the full source is available. We pursue an alternative approach based on standard sequence to sequence models which utilize the entire source. These models perform remarkably well as cha… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

    Comments: EMNLP 2019

  13. arXiv:1907.06616  [pdf, ps, other

    cs.CL

    Facebook FAIR's WMT19 News Translation Task Submission

    Authors: Nathan Ng, Kyra Yee, Alexei Baevski, Myle Ott, Michael Auli, Sergey Edunov

    Abstract: This paper describes Facebook FAIR's submission to the WMT19 shared news translation task. We participate in two language pairs and four language directions, English <-> German and English <-> Russian. Following our submission from last year, our baseline systems are large BPE-based transformer models trained with the Fairseq sequence modeling toolkit which rely on sampled back-translations. This… ▽ More

    Submitted 15 July, 2019; originally announced July 2019.

    Comments: 7 pages; WMT

  14. arXiv:1904.04419  [pdf, other

    cs.CV cs.LG

    Embryo staging with weakly-supervised region selection and dynamically-decoded predictions

    Authors: Tingfung Lau, Nathan Ng, Julian Gingold, Nina Desai, Julian McAuley, Zachary C. Lipton

    Abstract: To optimize clinical outcomes, fertility clinics must strategically select which embryos to transfer. Common selection heuristics are formulas expressed in terms of the durations required to reach various developmental milestones, quantities historically annotated manually by experienced embryologists based on time-lapse EmbryoScope videos. We propose a new method for automatic embryo staging that… ▽ More

    Submitted 8 April, 2019; originally announced April 2019.

  15. Multiparty Session Type-safe Web Development with Static Linearity

    Authors: Jonathan King, Nicholas Ng, Nobuko Yoshida

    Abstract: Modern web applications can now offer desktop-like experiences from within the browser, thanks to technologies such as WebSockets, which enable low-latency duplex communication between the browser and the server. While these advances are great for the user experience, they represent a new responsibility for web developers who now need to manage and verify the correctness of more complex and potent… ▽ More

    Submitted 2 April, 2019; originally announced April 2019.

    Comments: In Proceedings PLACES 2019, arXiv:1904.00396

    Journal ref: EPTCS 291, 2019, pp. 35-46

  16. arXiv:1904.01038  [pdf, other

    cs.CL

    fairseq: A Fast, Extensible Toolkit for Sequence Modeling

    Authors: Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, Michael Auli

    Abstract: fairseq is an open-source sequence modeling toolkit that allows researchers and developers to train custom models for translation, summarization, language modeling, and other text generation tasks. The toolkit is based on PyTorch and supports distributed training across multiple GPUs and machines. We also support fast mixed-precision training and inference on modern GPUs. A demo video can be found… ▽ More

    Submitted 1 April, 2019; originally announced April 2019.

    Comments: NAACL 2019 Demo paper

  17. arXiv:1702.05386  [pdf, other

    stat.ML cs.LG cs.NE

    Predicting Surgery Duration with Neural Heteroscedastic Regression

    Authors: Nathan Ng, Rodney A Gabriel, Julian McAuley, Charles Elkan, Zachary C Lipton

    Abstract: Scheduling surgeries is a challenging task due to the fundamental uncertainty of the clinical environment, as well as the risks and costs associated with under- and over-booking. We investigate neural regression algorithms to estimate the parameters of surgery case durations, focusing on the issue of heteroscedasticity. We seek to simultaneously estimate the duration of each surgery, as well as a… ▽ More

    Submitted 12 July, 2017; v1 submitted 17 February, 2017; originally announced February 2017.

  18. arXiv:1610.08843  [pdf, other

    cs.PL cs.LO

    Fencing off Go: Liveness and Safety for Channel-based Programming (extended version)

    Authors: Julien Lange, Nicholas Ng, Bernardo Toninho, Nobuko Yoshida

    Abstract: Go is a production-level statically typed programming language whose design features explicit message-passing primitives and lightweight threads, enabling (and encouraging) programmers to develop concurrent systems where components interact through communication more so than by lock-based shared memory concurrency. Go can only detect global deadlocks at runtime, but provides no compile-time protec… ▽ More

    Submitted 28 February, 2017; v1 submitted 27 October, 2016; originally announced October 2016.

  19. arXiv:1312.2705  [pdf, other

    cs.DC cs.LO cs.PL

    Towards deductive verification of MPI programs against session types

    Authors: Eduardo R. B. Marques, Francisco Martins, Vasco T. Vasconcelos, Nicholas Ng, Nuno Martins

    Abstract: The Message Passing Interface (MPI) is the de facto standard message-passing infrastructure for develo** parallel applications. Two decades after the first version of the library specification, MPI-based applications are nowadays routinely deployed on super and cluster computers. These applications, written in C or Fortran, exhibit intricate message passing behaviours, making it hard to statical… ▽ More

    Submitted 10 December, 2013; originally announced December 2013.

    Comments: In Proceedings PLACES 2013, arXiv:1312.2218

    Journal ref: EPTCS 137, 2013, pp. 103-113

  20. arXiv:1305.5278  [pdf, other

    quant-ph cond-mat.stat-mech cs.IT

    The second laws of quantum thermodynamics

    Authors: Fernando G. S. L. Brandao, MichaƂ Horodecki, Nelly Huei Ying Ng, Jonathan Oppenheim, Stephanie Wehner

    Abstract: The second law of thermodynamics tells us which state transformations are so statistically unlikely that they are effectively forbidden. Its original formulation, due to Clausius, states that "Heat can never pass from a colder to a warmer body without some other change, connected therewith, occurring at the same time". The second law applies to systems composed of many particles interacting; howev… ▽ More

    Submitted 25 September, 2014; v1 submitted 22 May, 2013; originally announced May 2013.

    Comments: v3: 39 pages, 2 figures. Substantial expansion of the previous text, conditions in terms of generalised alpha free energies, addition on discussion about the role of zeroeth and first laws of thermodynamics, addition of two new figures

    Journal ref: PNAS 112, 3275 (2015)