Showing 1–23 of 23 results for author: Lam, J

Searching in archive cs.
  1. arXiv:2406.05189  [pdf, other]

    stat.AP cs.AI

    Analyzing the factors that are involved in length of inpatient stay at the hospital for diabetes patients

    Authors: Jorden Lam, Kunpeng Xu

    Abstract: The paper investigates the escalating concerns surrounding the surge in diabetes cases, exacerbated by the COVID-19 pandemic, and the subsequent strain on medical resources. The research aims to construct a predictive model quantifying factors influencing inpatient hospital stay durations for diabetes patients, offering insights to hospital administrators for improved patient management strategies…

    Submitted 7 June, 2024; originally announced June 2024.

  2. Semi-supervised Domain Adaptation on Graphs with Contrastive Learning and Minimax Entropy

    Authors: Jiaren Xiao, Quanyu Dai, Xiao Shen, Xiaochen Xie, Jing Dai, James Lam, Ka-Wai Kwok

    Abstract: Label scarcity in a graph is frequently encountered in real-world applications due to the high cost of data labeling. To this end, semi-supervised domain adaptation (SSDA) on graphs aims to leverage the knowledge of a labeled source graph to aid in node classification on a target graph with limited labels. SSDA tasks need to overcome the domain gap between the source and target graphs. However, to…

    Submitted 4 April, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Journal ref: Neurocomputing (2024)

  3. arXiv:2308.11596  [pdf, other]

    cs.CL

    SeamlessM4T: Massively Multilingual & Multimodal Machine Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Cora Meglioli, David Dale, Ning Dong, Paul-Ambroise Duquenne, Hady Elsahar, Hongyu Gong, Kevin Heffernan, John Hoffman, Christopher Klaiber, Pengwei Li, Daniel Licht, Jean Maillard, Alice Rakotoarison, Kaushik Ram Sadagopan, Guillaume Wenzek, Ethan Ye, Bapi Akula, Peng-Jen Chen, Naji El Hachem, Brian Ellis, Gabriel Mejia Gonzalez, Justin Haaheim, et al. (43 additional authors not shown)

    Abstract: What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded s…

    Submitted 24 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    ACM Class: I.2.7

  4. arXiv:2305.11746  [pdf, other]

    cs.CL

    HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation

    Authors: David Dale, Elena Voita, Janice Lam, Prangthip Hansanti, Christophe Ropers, Elahe Kalbassi, Cynthia Gao, Loïc Barrault, Marta R. Costa-jussà

    Abstract: Hallucinations in machine translation are translations that contain information completely unrelated to the input. Omissions are translations that do not include some of the input information. While both cases tend to be catastrophic errors undermining user trust, annotated data with these types of pathologies is extremely scarce and is limited to a few high-resource languages. In this work, we re…

    Submitted 5 December, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    ACM Class: I.2.7

    Journal ref: EMNLP 2023

  5. arXiv:2305.11553  [pdf, other]

    cs.CL

    Unsupervised Scientific Abstract Segmentation with Normalized Mutual Information

    Authors: Yingqiang Gao, Jessica Lam, Nianlong Gu, Richard H. R. Hahnloser

    Abstract: The abstracts of scientific papers consist of premises and conclusions. Structured abstracts explicitly highlight the conclusion sentences, whereas non-structured abstracts may have conclusion sentences at uncertain positions. This implicit nature of conclusion positions makes the automatic segmentation of scientific abstracts into premises and conclusions a challenging task. In this work, we empi…

    Submitted 19 May, 2023; originally announced May 2023.

  6. arXiv:2305.03270  [pdf, other]

    cs.RO

    Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

    Authors: Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham, et al. (15 additional authors not shown)

    Abstract: We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Published at Robotics: Science and Systems 2023

  7. arXiv:2209.15392  [pdf, other]

    quant-ph cs.CE cs.ET math.QA

    Improving the Efficiency of Payments Systems Using Quantum Computing

    Authors: Christopher McMahon, Donald McGillivray, Ajit Desai, Francisco Rivadeneyra, Jean-Paul Lam, Thomas Lo, Danica Marsden, Vladimir Skavysh

    Abstract: High-value payment systems (HVPSs) are typically liquidity-intensive as the payment requests are indivisible and settled on a gross basis. Finding the right order in which payments should be processed to maximize the liquidity efficiency of these systems is an $NP$-hard combinatorial optimization problem, which quantum algorithms may be able to tackle at meaningful scales. We developed an algorith…

    Submitted 17 January, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

  8. arXiv:2208.06322  [pdf, other]

    stat.ML cs.LG

    EEGNN: Edge Enhanced Graph Neural Network with a Bayesian Nonparametric Graph Model

    Authors: Yirui Liu, Xinghao Qiao, Liying Wang, Jessica Lam

    Abstract: Training deep graph neural networks (GNNs) poses a challenging task, as the performance of GNNs may suffer from the number of hidden message-passing layers. The literature has focused on the proposals of over-smoothing and under-reaching to explain the performance deterioration of deep GNNs. In this paper, we propose a new explanation for such deteriorated performance phenomenon, mis-simplifi…

    Submitted 23 February, 2023; v1 submitted 12 August, 2022; originally announced August 2022.

  9. arXiv:2207.04672  [pdf]

    cs.CL cs.AI

    No Language Left Behind: Scaling Human-Centered Machine Translation

    Authors: NLLB Team, Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loic Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, et al. (14 additional authors not shown)

    Abstract: Driven by the goal of eradicating language barriers on a global scale, machine translation has solidified itself as a key focus of artificial intelligence research today. However, such efforts have coalesced around a small subset of languages, leaving behind the vast majority of mostly low-resource languages. What does it take to break the 200 language barrier while ensuring safe, high quality res…

    Submitted 25 August, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 190 pages

    MSC Class: 68T50 ACM Class: I.2.7

  10. arXiv:2205.08533  [pdf, ps, other]

    cs.CL

    Consistent Human Evaluation of Machine Translation across Language Pairs

    Authors: Daniel Licht, Cynthia Gao, Janice Lam, Francisco Guzman, Mona Diab, Philipp Koehn

    Abstract: Obtaining meaningful quality scores for machine translation systems through human evaluation remains a challenge given the high variability between human evaluators, partly due to subjective expectations for translation quality for different language pairs. We propose a new metric called XSTS that is more focused on semantic equivalence and a cross-lingual calibration method that enables more cons…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 10 pages

  11. arXiv:2203.10323  [pdf, other]

    cs.CR eess.SY

    Differential Private Discrete Noise Adding Mechanism: Conditions, Properties and Optimization

    Authors: Shuying Qin, Jianping He, Chongrong Fang, James Lam

    Abstract: Differential privacy is a standard framework to quantify the privacy loss in the data anonymization process. To preserve differential privacy, a random noise adding mechanism is widely adopted, where the trade-off between data privacy level and data utility is of great concern. The privacy and utility properties for the continuous noise adding mechanism have been well studied. However, the related…

    Submitted 19 March, 2022; originally announced March 2022.

  12. Adversarially Regularized Graph Attention Networks for Inductive Learning on Partially Labeled Graphs

    Authors: Jiaren Xiao, Quanyu Dai, Xiaochen Xie, James Lam, Ka-Wai Kwok

    Abstract: The high cost of data labeling often results in node label shortage in real applications. To improve node classification accuracy, graph-based semi-supervised learning leverages the ample unlabeled nodes to train together with the scarce available labeled nodes. However, most existing methods require the information of all nodes, including those to be predicted, during model training, which is not…

    Submitted 13 March, 2023; v1 submitted 7 June, 2021; originally announced June 2021.

    Journal ref: Knowledge-Based Systems (2023)

  13. arXiv:2103.14587  [pdf, other]

    cs.LG cs.CY

    Deep-AIR: A Hybrid CNN-LSTM Framework for Air Quality Modeling in Metropolitan Cities

    Authors: Yang Han, Qi Zhang, Victor O. K. Li, Jacqueline C. K. Lam

    Abstract: Air pollution has long been a serious environmental health challenge, especially in metropolitan cities, where air pollutant concentrations are exacerbated by the street canyon effect and high building density. Whilst accurately monitoring and forecasting air pollution are highly crucial, existing data-driven models fail to fully address the complex interaction between air pollution and urban dyna…

    Submitted 25 March, 2021; originally announced March 2021.

  14. arXiv:2103.03993  [pdf, other]

    cs.RO physics.app-ph

    Modeling the locomotion of articulated soft robots in granular medium

    Authors: Yayun Du, Jacqueline Lam, Karunesh Sachanandani, Mohammad Khalid Jawed

    Abstract: Soft robots, in contrast to their rigid counterparts, have infinite degrees of freedom that are coupled with their interaction with the environment. We consider the locomotion of an untethered robot, in the granular medium, comprised of multiple flexible flagella that rotate about an axis by a motor. Drag from the grains causes the flagella to deform and the deformed shape generates a net forward…

    Submitted 5 March, 2021; originally announced March 2021.

    Comments: Supplementary Video: https://youtu.be/hppBMe-cEvk

  15. arXiv:2010.13686  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph

    Infrared spectra of neutral polycyclic aromatic hydrocarbons by machine learning

    Authors: Gaétan Laurens, Malalatiana Rabary, Julien Lam, Daniel Peláez, Abdul-Rahman Allouche

    Abstract: The interest in polycyclic aromatic hydrocarbons (PAHs) spans numerous fields and infrared spectroscopy is usually the method of choice to disentangle their molecular structure. In order to compute vibrational frequencies, numerous theoretical studies employ either quantum calculation methods, or empirical potentials, but it remains difficult to combine the accuracy of the first approach with the…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 13 pages, 6 figures

  16. arXiv:2004.09681  [pdf, other]

    cs.CV

    Facial Action Unit Intensity Estimation via Semantic Correspondence Learning with Dynamic Graph Convolution

    Authors: Yingruo Fan, Jacqueline C. K. Lam, Victor O. K. Li

    Abstract: The intensity estimation of facial action units (AUs) is challenging due to subtle changes in the person's facial appearance. Previous approaches mainly rely on probabilistic models or predefined rules for modeling co-occurrence relationships among AUs, leading to limited generalization. In contrast, we present a new learning framework that automatically learns the latent relationships of AUs via…

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted at AAAI 2020

  17. arXiv:2003.03474  [pdf, other]

    cs.CR cs.LG stat.ML

    Machine Learning based Anomaly Detection for 5G Networks

    Authors: Jordan Lam, Robert Abbas

    Abstract: Protecting the networks of tomorrow is set to be a challenging domain due to increasing cyber security threats and widening attack surfaces created by the Internet of Things (IoT), increased network heterogeneity, increased use of virtualisation technologies and distributed architectures. This paper proposes SDS (Software Defined Security) as a means to provide an automated, flexible and scalable…

    Submitted 6 March, 2020; originally announced March 2020.

  18. arXiv:2001.04508  [pdf, ps, other]

    stat.ML cs.LG

    CATVI: Conditional and Adaptively Truncated Variational Inference for Hierarchical Bayesian Nonparametric Models

    Authors: Yirui Liu, Xinghao Qiao, Jessica Lam

    Abstract: Current variational inference methods for hierarchical Bayesian nonparametric models can neither characterize the correlation structure among latent variables due to the mean-field setting, nor infer the true posterior dimension because of the universal truncation. To overcome these limitations, we propose the conditional and adaptively truncated variational inference method (CATVI) by maximizing…

    Submitted 5 April, 2022; v1 submitted 13 January, 2020; originally announced January 2020.

  19. arXiv:1810.09390  [pdf, other]

    stat.ML cs.LG

    A minimax near-optimal algorithm for adaptive rejection sampling

    Authors: Juliette Achdou, Joseph C. Lam, Alexandra Carpentier, Gilles Blanchard

    Abstract: Rejection Sampling is a fundamental Monte-Carlo method. It is used to sample from distributions admitting a probability density function which can be evaluated exactly at any given point, albeit at a high computational cost. However, without proper tuning, this technique implies a high rejection rate. Several methods have been explored to cope with this problem, based on the principle of adaptivel…

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: 32 pages, 4 figures. Submitted to ALT 2019

    MSC Class: 62D05; 62L12; 62G05 (Primary) 62L05; 62G07 (Secondary) ACM Class: G.3; I.2.6

  20. arXiv:1807.10575  [pdf]

    cs.CV cs.HC

    Multi-Region Ensemble Convolutional Neural Network for Facial Expression Recognition

    Authors: Yingruo Fan, Jacqueline C. K. Lam, Victor O. K. Li

    Abstract: Facial expressions play an important role in conveying the emotional states of human beings. Recently, deep learning approaches have been applied to the image recognition field due to the discriminative power of Convolutional Neural Networks (CNNs). In this paper, we first propose a novel Multi-Region Ensemble CNN (MRE-CNN) framework for facial expression recognition, which aims to enhance the learning…

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: 10 pages, 5 figures, Accepted by ICANN 2018

  21. arXiv:1801.03190  [pdf, other]

    cs.DS cs.DB cs.DM cs.SI

    Risk-Averse Matchings over Uncertain Graph Databases

    Authors: Charalampos E. Tsourakakis, Shreyas Sekar, Johnson Lam, Liu Yang

    Abstract: A large number of applications such as querying sensor networks, and analyzing protein-protein interaction (PPI) networks, rely on mining uncertain graph and hypergraph databases. In this work we study the following problem: given an uncertain, weighted (hyper)graph, how can we efficiently find a (hyper)matching with high expected reward, and low risk? This problem naturally arises in the contex…

    Submitted 9 January, 2018; originally announced January 2018.

    Comments: 25 pages

  22. arXiv:1609.08767  [pdf, ps, other]

    cs.DS

    The Subset Assignment Problem for Data Placement in Caches

    Authors: Shahram Ghandeharizadeh, Sandy Irani, Jenny Lam

    Abstract: We introduce the subset assignment problem in which items of varying sizes are placed in a set of bins with limited capacity. Items can be replicated and placed in any subset of the bins. Each (item, subset) pair has an associated cost. Not assigning an item to any of the bins is not free in general and can potentially be the most expensive option. The goal is to minimize the total cost of assigni…

    Submitted 1 October, 2016; v1 submitted 28 September, 2016; originally announced September 2016.

  23. arXiv:1605.09425  [pdf, other]

    cs.MM cs.DS

    Models and Algorithms for Graph Watermarking

    Authors: David Eppstein, Michael T. Goodrich, Jenny Lam, Nil Mamano, Michael Mitzenmacher, Manuel Torres

    Abstract: We introduce models and algorithmic foundations for graph watermarking. Our frameworks include security definitions and proofs, as well as characterizations when graph watermarking is algorithmically feasible, in spite of the fact that the general problem is NP-complete by simple reductions from the subgraph isomorphism or graph edit distance problems. In the digital watermarking of many types of…

    Submitted 30 May, 2016; originally announced May 2016.