Skip to main content

Showing 1–22 of 22 results for author: Teo, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.02329  [pdf

    cs.AR cs.AI cs.CL

    Digital ASIC Design with Ongoing LLMs: Strategies and Prospects

    Authors: Maoyang Xiang, Emil Goh, T. Hui Teo

    Abstract: The escalating complexity of modern digital systems has imposed significant challenges on integrated circuit (IC) design, necessitating tools that can simplify the IC design flow. The advent of Large Language Models (LLMs) has been seen as a promising development, with the potential to automate the generation of Hardware Description Language (HDL) code, thereby streamlining digital IC design. Howe… ▽ More

    Submitted 25 April, 2024; originally announced May 2024.

    Comments: 8 pages, 2 figures, 1 table

  2. arXiv:2405.00308  [pdf

    cs.CR stat.AP

    FPGA Digital Dice using Pseudo Random Number Generator

    Authors: Michael Lim Kee Hian, Ten Wei Lin, Zachary Wu Xuan, Stephanie-Ann Loy, Maoyang Xiang, T. Hui Teo

    Abstract: The goal of this project is to design a digital dice that displays dice numbers in real-time. The number is generated by a pseudo-random number generator (PRNG) using XORshift algorithm that is implemented in Verilog HDL on an FPGA. The digital dice is equipped with tilt sensor, display, power management circuit, and rechargeable battery hosted in a 3D printed dice casing. By shaking the digital d… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 15 pages, 5 figures

  3. arXiv:2404.19246  [pdf

    cs.CR cs.AR

    Logistic Map Pseudo Random Number Generator in FPGA

    Authors: Mateo Jalen Andrew Calderon, Lee Jun Lei Lucas, Syarifuddin Azhar Bin Rosli, Stephanie See Hui Ying, Jarell Lim En Yu, Maoyang Xiang, T. Hui Teo

    Abstract: This project develops a pseudo-random number generator (PRNG) using the logistic map, implemented in Verilog HDL on an FPGA and processes its output through a Central Limit Theorem (CLT) function to achieve a Gaussian distribution. The system integrates additional FPGA modules for real-time interaction and visualisation, including a clock generator, UART interface, XADC, and a 7-segment display dr… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures

  4. arXiv:2404.16504  [pdf

    cs.CR eess.SP

    Hardware Implementation of Double Pendulum Pseudo Random Number Generator

    Authors: Jarrod Lim, Tom Manuel Opalla Piccio, Chua Min Jie Michelle, Maoyang Xiang, T. Hui Teo

    Abstract: The objective of this project is to utilize an FPGA board which is the CMOD A7 35t to obtain a pseudo random number which can be used for encryption. We aim to achieve this by leveraging the inherent randomness present in environmental data captured by sensors. This data will be used as a seed to initialize an algorithm implemented on the CMOD A7 35t FPGA board. The project will focus on interfaci… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 15 pages, 12 figure

  5. arXiv:2403.10542  [pdf, other

    cs.AR cs.CV

    SF-MMCN: A Low Power Re-configurable Server Flow Convolution Neural Network Accelerator

    Authors: Huan-Ke Hsu, I-Chyn Wey, T. Hui Teo

    Abstract: Convolution Neural Network (CNN) accelerators have been developed rapidly in recent studies. There are lots of CNN accelerators equipped with a variety of function and algorithm which results in low power and high-speed performances. However, the scale of a PE array in traditional CNN accelerators is too big, which costs the most energy consumption while conducting multiply and accumulation (MAC)… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 16 pages, 16 figures

  6. arXiv:2403.07039  [pdf

    cs.AR cs.AI cs.CL

    From English to ASIC: Hardware Implementation with Large Language Model

    Authors: Emil Goh, Maoyang Xiang, I-Chyn Wey, T. Hui Teo

    Abstract: In the realm of ASIC engineering, the landscape has been significantly reshaped by the rapid development of LLM, paralleled by an increase in the complexity of modern digital circuits. This complexity has escalated the requirements for HDL coding, necessitating a higher degree of precision and sophistication. However, challenges have been faced due to the less-than-optimal performance of modern la… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 15 pages, 1 figure

  7. PEFA: Parameter-Free Adapters for Large-scale Embedding-based Retrieval Models

    Authors: Wei-Cheng Chang, Jyun-Yu Jiang, Jiong Zhang, Mutasem Al-Darabsah, Choon Hui Teo, Cho-Jui Hsieh, Hsiang-Fu Yu, S. V. N. Vishwanathan

    Abstract: Embedding-based Retrieval Models (ERMs) have emerged as a promising framework for large-scale text retrieval problems due to powerful large language models. Nevertheless, fine-tuning ERMs to reach state-of-the-art results can be expensive due to the extreme scale of data as well as the complexity of multi-stages pipelines (e.g., pre-training, fine-tuning, distillation). In this work, we propose th… ▽ More

    Submitted 5 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accept by WSDM 2024

  8. arXiv:2310.19297  [pdf, other

    cs.LG cs.CV cs.CY

    On Measuring Fairness in Generative Models

    Authors: Christopher T. H. Teo, Milad Abdollahzadeh, Ngai-Man Cheung

    Abstract: Recently, there has been increased interest in fair generative models. In this work, we conduct, for the first time, an in-depth study on fairness measurement, a critical component in gauging progress on fair generative models. We make three contributions. First, we conduct a study that reveals that the existing fairness measurement framework has considerable measurement errors, even when highly a… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted in NeurIPS23

  9. arXiv:2307.14397  [pdf, other

    cs.CV cs.LG

    A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot

    Authors: Milad Abdollahzadeh, Touba Malekzadeh, Christopher T. H. Teo, Keshigeyan Chandrasegaran, Guimeng Liu, Ngai-Man Cheung

    Abstract: In machine learning, generative modeling aims to learn to generate new data statistically similar to the training data distribution. In this paper, we survey learning generative models under limited data, few shots and zero shot, referred to as Generative Modeling under Data Constraint (GM-DC). This is an important topic when data acquisition is challenging, e.g. healthcare applications. We discus… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Technical Survey. Touba Malekzadeh, Christopher T.H. Teo, Keshigeyan Chandrasegaran contribute equally

  10. arXiv:2209.04378  [pdf, other

    cs.IR cs.CL cs.LG stat.ML

    MICO: Selective Search with Mutual Information Co-training

    Authors: Zhanyu Wang, Xiao Zhang, Hyokun Yun, Choon Hui Teo, Trishul Chilimbi

    Abstract: In contrast to traditional exhaustive search, selective search first clusters documents into several groups before all the documents are searched exhaustively by a query, to limit the search executed within one group or only a few groups. Selective search is designed to reduce the latency and computation in modern large-scale search systems. In this study, we propose MICO, a Mutual Information CO-… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Journal ref: Proceedings of the 29th International Conference on Computational Linguistics (COLING). 2022

  11. arXiv:2208.05663  [pdf, other

    cs.IR

    On the Value of Behavioral Representations for Dense Retrieval

    Authors: Nan Jiang, Dhivya Eswaran, Choon Hui Teo, Yexiang Xue, Yesh Dattatreya, Sujay Sanghavi, Vishy Vishwanathan

    Abstract: We consider text retrieval within dense representational space in real-world settings such as e-commerce search where (a) document popularity and (b) diversity of queries associated with a document have a skewed distribution. Most of the contemporary dense retrieval literature presents two shortcomings in these settings. (1) They learn an almost equal number of representations per document, agnost… ▽ More

    Submitted 11 August, 2022; originally announced August 2022.

  12. arXiv:2206.02164  [pdf, other

    cs.LG cs.AI stat.ME

    Estimating and Mitigating the Congestion Effect of Curbside Pick-ups and Drop-offs: A Causal Inference Approach

    Authors: Xiaohui Liu, Sean Qian, Hock-Hai Teo, Wei Ma

    Abstract: Curb space is one of the busiest areas in urban road networks. Especially in recent years, the rapid increase of ride-hailing trips and commercial deliveries has induced massive pick-ups/drop-offs (PUDOs), which occupy the limited curb space that was designed and built decades ago. These PUDOs could jam curbside utilization and disturb the mainline traffic flow, evidently leading to significant ne… ▽ More

    Submitted 2 January, 2024; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Accepted at Transportation Science

  13. arXiv:2110.06125  [pdf, other

    cs.IR cs.LG

    Embracing Structure in Data for Billion-Scale Semantic Product Search

    Authors: Vihan Lakshman, Choon Hui Teo, Xiaowen Chu, Priyanka Nigam, Abhinandan Patni, Pooja Maknikar, SVN Vishwanathan

    Abstract: We present principled approaches to train and deploy dyadic neural embedding models at the billion scale, focusing our investigation on the application of semantic product search. When training a dyadic model, one seeks to embed two different types of entities (e.g., queries and documents or users and movies) in a common vector space such that pairs with high relevance are positioned nearby. Durin… ▽ More

    Submitted 12 October, 2021; originally announced October 2021.

    Comments: 10 pages

  14. arXiv:2107.07754  [pdf, other

    cs.LG

    Measuring Fairness in Generative Models

    Authors: Christopher T. H Teo, Ngai-Man Cheung

    Abstract: Deep generative models have made much progress in improving training stability and quality of generated data. Recently there has been increased interest in the fairness of deep-generated data. Fairness is important in many applications, e.g. law enforcement, as biases will affect efficacy. Central to fair data generation are the fairness metrics for the assessment and evaluation of different gener… ▽ More

    Submitted 16 July, 2021; originally announced July 2021.

    Comments: Accepted in ICML 2021 Workshop - Machine Learning for Data: Automated Creation, Privacy, Bias

  15. arXiv:2008.07030  [pdf, other

    eess.IV cs.CV cs.LG

    Training CNN Classifiers for Semantic Segmentation using Partially Annotated Images: with Application on Human Thigh and Calf MRI

    Authors: Chun Kit Wong, Stephanie Marchesseau, Maria Kalimeri, Tiang Siew Yap, Serena S. H. Teo, Lingaraj Krishna, Alfredo Franco-Obregón, Stacey K. H. Tay, Chin Meng Khoo, Philip T. H. Lee, Melvin K. S. Leow, John J. Totman, Mary C. Stephenson

    Abstract: Objective: Medical image datasets with pixel-level labels tend to have a limited number of organ or tissue label classes annotated, even when the images have wide anatomical coverage. With supervised learning, multiple classifiers are usually needed given these partially annotated datasets. In this work, we propose a set of strategies to train one single classifier in segmenting all label classes… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    Comments: Submitted to IEEE Transactions on Medical Imaging (Special Issue on Annotation-Efficient Deep Learning for Medical Imaging)

  16. A Study of Context Dependencies in Multi-page Product Search

    Authors: Ke** Bi, Choon Hui Teo, Yesh Dattatreya, Vijai Mohan, W. Bruce Croft

    Abstract: In product search, users tend to browse results on multiple search result pages (SERPs) (e.g., for queries on clothing and shoes) before deciding which item to purchase. Users' clicks can be considered as implicit feedback which indicates their preferences and used to re-rank subsequent SERPs. Relevance feedback (RF) techniques are usually involved to deal with such scenarios. However, these metho… ▽ More

    Submitted 9 January, 2020; v1 submitted 9 September, 2019; originally announced September 2019.

    Comments: Accepted by CIKM 2019. arXiv admin note: substantial text overlap with arXiv:1909.02065

  17. arXiv:1909.02065  [pdf, other

    cs.IR

    Leverage Implicit Feedback for Context-aware Product Search

    Authors: Ke** Bi, Choon Hui Teo, Yesh Dattatreya, Vijai Mohan, W. Bruce Croft

    Abstract: Product search serves as an important entry point for online shop**. In contrast to web search, the retrieved results in product search not only need to be relevant but also should satisfy customers' preferences in order to elicit purchases. Previous work has shown the efficacy of purchase history in personalized product search. However, customers with little or no purchase history do not benefi… ▽ More

    Submitted 9 January, 2020; v1 submitted 4 September, 2019; originally announced September 2019.

    Comments: Presented at 2019 SIGIR Workshop on eCommerce (ECOM'19)

  18. arXiv:1907.00937  [pdf, other

    cs.IR cs.CL

    Semantic Product Search

    Authors: Priyanka Nigam, Yiwei Song, Vijai Mohan, Vihan Lakshman, Weitian, Ding, Ankit Shingavi, Choon Hui Teo, Hao Gu, Bing Yin

    Abstract: We study the problem of semantic matching in product search, that is, given a customer query, retrieve all semantically related products from the catalog. Pure lexical matching via an inverted index falls short in this respect due to several factors: a) lack of understanding of hypernyms, synonyms, and antonyms, b) fragility to morphological variants (e.g. "woman" vs. "women"), and c) sensitivity… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: 10 pages, 7 figures, KDD 2019 (Applied Data Science Track)

  19. arXiv:1905.13289  [pdf, other

    cs.LG stat.ML

    On the Accuracy of Influence Functions for Measuring Group Effects

    Authors: Pang Wei Koh, Kai-Siang Ang, Hubert H. K. Teo, Percy Liang

    Abstract: Influence functions estimate the effect of removing a training point on a model without the need to retrain. They are based on a first-order Taylor approximation that is guaranteed to be accurate for sufficiently small changes to the model, and so are commonly used to study the effect of individual points in large datasets. However, we often want to study the effects of large groups of training po… ▽ More

    Submitted 21 November, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

  20. arXiv:1810.01477  [pdf, other

    cs.IR cs.LG stat.ML

    Adaptive, Personalized Diversity for Visual Discovery

    Authors: Choon Hui Teo, Houssam Nassif, Daniel Hill, Sriram Srinavasan, Mitchell Goodman, Vijai Mohan, SVN Vishwanathan

    Abstract: Search queries are appropriate when users have explicit intent, but they perform poorly when the intent is difficult to express or if the user is simply looking to be inspired. Visual browsing systems allow e-commerce platforms to address these scenarios while offering the user an engaging shop** experience. Here we explore extensions in the direction of adaptive personalization and item diversi… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

    Comments: Best Paper Award

    Journal ref: Adaptive, Personalized Diversity for Visual Discovery. Teo CH, Nassif H, Hill D, Srinavasan S, Goodman M, Mohan V, and Vishwanathan SVN. ACM Conference on Recommender Systems (RecSys'16), Boston, pp. 35-38, 2016

  21. arXiv:1806.10751  [pdf, other

    cs.NI

    Design Considerations for Low Power Internet Protocols

    Authors: Hudson Ayers, Paul Crews, Hubert Teo, Conor McAvity, Amit Levy, Philip Levis

    Abstract: Over the past 10 years, low-power wireless networks have transitioned to supporting IPv6 connectivity through 6LoWPAN, a set of standards which specify how to aggressively compress IPv6 packets over low-power wireless links such as 802.15.4. We find that different low-power IPv6 stacks are unable to communicate using 6LoWPAN, and therefore IP, due to design tradeoffs between code size and energy… ▽ More

    Submitted 21 January, 2020; v1 submitted 27 June, 2018; originally announced June 2018.

  22. arXiv:cs/0306127  [pdf

    cs.MS

    Development of a Java Package for Matrix Programming

    Authors: Ngee-Peng Lim, Maurice HT Ling, Shawn YC Lim, Ji-Hee Choi, Henry BK Teo

    Abstract: We had assembled a Java package, known as MatrixPak, of four classes for the purpose of numerical matrix computation. The classes are matrix, matrix_operations, StrToMatrix, and MatrixToStr; all of which are inherited from java.lang.Object class. Class matrix defines a matrix as a two-dimensional array of float types, and contains the following mathematical methods: transpose, adjoint, determina… ▽ More

    Submitted 24 June, 2003; originally announced June 2003.

    Comments: Secondary school (high school) student project report. Foundation for JMaths project

    ACM Class: K.3.0; G.m