Skip to main content

Showing 1–12 of 12 results for author: Ooi, H

Searching in archive cs. Search in all archives.
.
  1. Accelerating evolutionary exploration through language model-based transfer learning

    Authors: Maximilian Reissmann, Yuan Fang, Andrew S. H. Ooi, Richard D. Sandberg

    Abstract: Gene expression programming is an evolutionary optimization algorithm with the potential to generate interpretable and easily implementable equations for regression problems. Despite knowledge gained from previous optimizations being potentially available, the initial candidate solutions are typically generated randomly at the beginning and often only include features or terms based on preliminary… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2402.07827  [pdf, other

    cs.CL

    Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

    Authors: Ahmet Üstün, Viraat Aryabumi, Zheng-Xin Yong, Wei-Yin Ko, Daniel D'souza, Gbemileke Onilude, Neel Bhandari, Shivalika Singh, Hui-Lee Ooi, Amr Kayid, Freddie Vargus, Phil Blunsom, Shayne Longpre, Niklas Muennighoff, Marzieh Fadaee, Julia Kreutzer, Sara Hooker

    Abstract: Recent breakthroughs in large language models (LLMs) have centered around a handful of data-rich languages. What does it take to broaden access to breakthroughs beyond first-class citizen languages? Our work introduces Aya, a massively multilingual generative language model that follows instructions in 101 languages of which over 50% are considered as lower-resourced. Aya outperforms mT0 and BLOOM… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  3. arXiv:2302.11000  [pdf, other

    cs.LG cs.AI q-bio.QM

    CHA2: CHemistry Aware Convex Hull Autoencoder Towards Inverse Molecular Design

    Authors: Mohammad Sajjad Ghaemi, Hang Hu, Anguang Hu, Hsu Kiang Ooi

    Abstract: Optimizing molecular design and discovering novel chemical structures to meet certain objectives, such as quantitative estimates of the drug-likeness score (QEDs), is NP-hard due to the vast combinatorial design space of discrete molecular structures, which makes it near impossible to explore the entire search space comprehensively to exploit de novo structures with properties of interest. To addr… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  4. arXiv:2302.10952  [pdf

    cs.LG q-bio.BM

    Machine learning for the prediction of safe and biologically active organophosphorus molecules

    Authors: Hang Hu, Hsu Kiang Ooi, Mohammad Sajjad Ghaemi, Anguang Hu

    Abstract: Drug discovery is a complex process with a large molecular space to be considered. By constraining the search space, the fragment-based drug design is an approach that can effectively sample the chemical space of interest. Here we propose a framework of Recurrent Neural Networks (RNN) with an attention model to sample the chemical space of organophosphorus molecules using the fragment-based approa… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  5. arXiv:2204.02474  [pdf, other

    q-bio.BM cs.LG

    Generative Enriched Sequential Learning (ESL) Approach for Molecular Design via Augmented Domain Knowledge

    Authors: Mohammad Sajjad Ghaemi, Karl Grantham, Isaac Tamblyn, Yifeng Li, Hsu Kiang Ooi

    Abstract: Deploying generative machine learning techniques to generate novel chemical structures based on molecular fingerprint representation has been well established in molecular design. Typically, sequential learning (SL) schemes such as hidden Markov models (HMM) and, more recently, in the sequential deep learning context, recurrent neural network (RNN) and long short-term memory (LSTM) were used exten… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: 6 pages

  6. arXiv:2102.01011  [pdf, other

    cs.NE cs.AI

    Deep Evolutionary Learning for Molecular Design

    Authors: Yifeng Li, Hsu Kiang Ooi, Alain Tchagang

    Abstract: In this paper, we propose a deep evolutionary learning (DEL) process that integrates fragment-based deep generative model and multi-objective evolutionary computation for molecular design. Our approach enables (1) evolutionary operations in the latent space of the generative model, rather than the structural space, to generate novel promising molecular structures for the next evolutionary generati… ▽ More

    Submitted 27 December, 2020; originally announced February 2021.

  7. arXiv:2003.13644  [pdf, other

    cs.CV

    Supervised and Unsupervised Detections for Multiple Object Tracking in Traffic Scenes: A Comparative Study

    Authors: Hui-Lee Ooi, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: In this paper, we propose a multiple object tracker, called MF-Tracker, that integrates multiple classical features (spatial distances and colours) and modern features (detection labels and re-identification features) in its tracking framework. Since our tracker can work with detections coming either from unsupervised and supervised object detectors, we also investigated the impact of supervised a… ▽ More

    Submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted for ICIAR 2020

  8. The Rockerverse: Packages and Applications for Containerization with R

    Authors: Daniel Nüst, Dirk Eddelbuettel, Dom Bennett, Robrecht Cannoodt, Dav Clark, Gergely Daroczi, Mark Edmondson, Colin Fay, Ellis Hughes, Lars Kjeldgaard, Sean Lopp, Ben Marwick, Heather Nolis, Jacqueline Nolis, Hong Ooi, Karthik Ram, Noam Ross, Lori Shepherd, Péter Sólymos, Tyson Lee Swetnam, Nitesh Turaga, Charlotte Van Petegem, Jason Williams, Craig Willis, Nan Xiao

    Abstract: The Rocker Project provides widely used Docker images for R across different application scenarios. This article surveys downstream projects that build upon the Rocker Project images and presents the current state of R packages for managing Docker images and controlling containers. These use cases cover diverse topics such as package development, reproducible research, collaborative work, cloud-ba… ▽ More

    Submitted 17 August, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: Source code for article available at https://github.com/nuest/rockerverse-paper/ Updated version includes some new paragraphs and corrections throughout the text; full diff available at https://github.com/nuest/rockerverse-paper/compare/preprint.v2...preprint.v3

    MSC Class: 68N01 ACM Class: D.2.6; D.2.7; K.6.3

    Journal ref: The R Journal (2020), 12:1, pages 437-461

  9. arXiv:1905.06381  [pdf, other

    cs.CV

    Tracking in Urban Traffic Scenes from Background Subtraction and Object Detection

    Authors: Hui-Lee Ooi, Guillaume-Alexandre Bilodeau, Nicolas Saunier

    Abstract: In this paper, we propose to combine detections from background subtraction and from a multiclass object detector for multiple object tracking (MOT) in urban traffic scenes. These objects are associated across frames using spatial, colour and class label information, and trajectory prediction is evaluated to yield the final MOT outputs. The proposed method was tested on the Urban tracker dataset a… ▽ More

    Submitted 15 May, 2019; originally announced May 2019.

  10. arXiv:1904.06236  [pdf, other

    cs.CV

    Multimodal Machine Learning-based Knee Osteoarthritis Progression Prediction from Plain Radiographs and Clinical Data

    Authors: Aleksei Tiulpin, Stefan Klein, Sita M. A. Bierma-Zeinstra, Jérôme Thevenot, Esa Rahtu, Joyce van Meurs, Edwin H. G. Oei, Simo Saarakkala

    Abstract: Knee osteoarthritis (OA) is the most common musculoskeletal disease without a cure, and current treatment options are limited to symptomatic relief. Prediction of OA progression is a very challenging and timely issue, and it could, if resolved, accelerate the disease modifying drug development and ultimately help to prevent millions of total joint replacement surgeries performed annually. Here, we… ▽ More

    Submitted 6 May, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

  11. arXiv:1809.02073  [pdf, other

    cs.CV

    Multiple Object Tracking in Urban Traffic Scenes with a Multiclass Object Detector

    Authors: Hui-Lee Ooi, Guillaume-Alexandre Bilodeau, Nicolas Saunier, David-Alexandre Beaupré

    Abstract: Multiple object tracking (MOT) in urban traffic aims to produce the trajectories of the different road users that move across the field of view with different directions and speeds and that can have varying appearances and sizes. Occlusions and interactions among the different objects are expected and common due to the nature of urban road traffic. In this work, a tracking framework employing clas… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: 13th International Symposium on Visual Computing (ISVC)

  12. arXiv:1805.02262  [pdf, other

    cs.CL

    Construction of the Literature Graph in Semantic Scholar

    Authors: Waleed Ammar, Dirk Groeneveld, Chandra Bhagavatula, Iz Beltagy, Miles Crawford, Doug Downey, Jason Dunkelberger, Ahmed Elgohary, Sergey Feldman, Vu Ha, Rodney Kinney, Sebastian Kohlmeier, Kyle Lo, Tyler Murray, Hsu-Han Ooi, Matthew Peters, Joanna Power, Sam Skjonsberg, Lucy Lu Wang, Chris Wilhelm, Zheng Yuan, Madeleine van Zuylen, Oren Etzioni

    Abstract: We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery. The resulting literature graph consists of more than 280M nodes, representing papers, authors, entities and various interactions between them (e.g., authorships, citations, entity mentions). We reduce literature graph construction in… ▽ More

    Submitted 6 May, 2018; originally announced May 2018.

    Comments: To appear in NAACL 2018 industry track