Search | arXiv e-print repository

Learning to Deliver: a Foundation Model for the Montreal Capacitated Vehicle Routing Problem

Authors: Samuel J. K. Chin, Matthias Winkenbach, Akash Srivastava

Abstract: In this paper, we present the Foundation Model for the Montreal Capacitated Vehicle Routing Problem (FM-MCVRP), a novel Deep Learning (DL) model that approximates high-quality solutions to a variant of the Capacitated Vehicle Routing Problem (CVRP) that characterizes many real-world applications. The so-called Montreal Capacitated Vehicle Routing Problem (MCVRP), first formally described by Bengio… ▽ More In this paper, we present the Foundation Model for the Montreal Capacitated Vehicle Routing Problem (FM-MCVRP), a novel Deep Learning (DL) model that approximates high-quality solutions to a variant of the Capacitated Vehicle Routing Problem (CVRP) that characterizes many real-world applications. The so-called Montreal Capacitated Vehicle Routing Problem (MCVRP), first formally described by Bengio et al. (2021), is defined on a fixed and finite graph, which is analogous to a city. Each MCVRP instance is essentially the sub-graph connecting a randomly sampled subset of the nodes in the fixed graph, which represent a set of potential addresses in a real-world delivery problem on a given day. Our work exploits this problem structure to frame the MCVRP as an analogous Natural Language Processing (NLP) task. Specifically, we leverage a Transformer architecture embedded in a Large Language Model (LLM) framework to train our model in a supervised manner on computationally inexpensive, sub-optimal MCVRP solutions obtained algorithmically. Through comprehensive computational experiments, we show that FM-MCVRP produces better MCVRP solutions than the training data and generalizes to larger sized problem instances not seen during training. Even when compared to near-optimal solutions from state-of-the-art heuristics, FM-MCVRP yields competitive results despite being trained on inferior data. For instance, for 400-customer problems, FM-MCVRP solutions on average fall within 2% of the benchmark. Our results further demonstrate that unlike prior works in the literature, FM-MCVRP is a unified model, which performs consistently and reliably on a range of problem instance sizes and parameter values such as the vehicle capacity. △ Less

Submitted 28 February, 2024; originally announced March 2024.

arXiv:2312.14871 [pdf, other]

BrainVis: Exploring the Bridge between Brain and Visual Signals via Image Reconstruction

Authors: Honghao Fu, Zhiqi Shen, **g Jih Chin, Hao Wang

Abstract: Analyzing and reconstructing visual stimuli from brain signals effectively advances understanding of the human visual system. However, the EEG signals are complex and contain a amount of noise. This leads to substantial limitations in existing works of visual stimuli reconstruction from EEG, such as difficulties in aligning EEG embeddings with the fine-grained semantic information and a heavy reli… ▽ More Analyzing and reconstructing visual stimuli from brain signals effectively advances understanding of the human visual system. However, the EEG signals are complex and contain a amount of noise. This leads to substantial limitations in existing works of visual stimuli reconstruction from EEG, such as difficulties in aligning EEG embeddings with the fine-grained semantic information and a heavy reliance on additional large self-collected dataset for training. To address these challenges, we propose a novel approach called BrainVis. Firstly, we divide the EEG signals into various units and apply a self-supervised approach on them to obtain EEG time-domain features, in an attempt to ease the training difficulty. Additionally, we also propose to utilize the frequency-domain features to enhance the EEG representations. Then, we simultaneously align EEG time-frequency embeddings with the interpolation of the coarse and fine-grained semantics in the CLIP space, to highlight the primary visual components and reduce the cross-modal alignment difficulty. Finally, we adopt the cascaded diffusion models to reconstruct images. Our proposed BrainVis outperforms state of the arts in both semantic fidelity reconstruction and generation quality. Notably, we reduce the training data scale to 10% of the previous work. △ Less

Submitted 22 December, 2023; originally announced December 2023.

arXiv:2311.07178 [pdf, other]

Game Solving with Online Fine-Tuning

Authors: Ti-Rong Wu, Hung Guei, Ting Han Wei, Chung-Chin Shih, Jui-Te Chin, I-Chen Wu

Abstract: Game solving is a similar, yet more difficult task than mastering a game. Solving a game typically means to find the game-theoretic value (outcome given optimal play), and optionally a full strategy to follow in order to achieve that outcome. The AlphaZero algorithm has demonstrated super-human level play, and its powerful policy and value predictions have also served as heuristics in game solving… ▽ More Game solving is a similar, yet more difficult task than mastering a game. Solving a game typically means to find the game-theoretic value (outcome given optimal play), and optionally a full strategy to follow in order to achieve that outcome. The AlphaZero algorithm has demonstrated super-human level play, and its powerful policy and value predictions have also served as heuristics in game solving. However, to solve a game and obtain a full strategy, a winning response must be found for all possible moves by the losing player. This includes very poor lines of play from the losing side, for which the AlphaZero self-play process will not encounter. AlphaZero-based heuristics can be highly inaccurate when evaluating these out-of-distribution positions, which occur throughout the entire search. To address this issue, this paper investigates applying online fine-tuning while searching and proposes two methods to learn tailor-designed heuristics for game solving. Our experiments show that using online fine-tuning can solve a series of challenging 7x7 Killall-Go problems, using only 23.54% of computation time compared to the baseline without online fine-tuning. Results suggest that the savings scale with problem size. Our method can further be extended to any tree search algorithm for problem solving. Our code is available at https://rlg.iis.sinica.edu.tw/papers/neurips2023-online-fine-tuning-solver. △ Less

Submitted 13 November, 2023; originally announced November 2023.

Comments: Accepted by the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2309.11681 [pdf]

doi 10.1145/3637365

Like My Aunt Dorothy: Effects of Conversational Styles on Perceptions, Acceptance and Metaphorical Descriptions of Voice Assistants during Later Adulthood

Authors: Jessie Chin, Smit Desai, Sheny Lin, Shannon Mejia

Abstract: Little research has investigated the design of conversational styles of voice assistants (VA) for adults in their later adulthood with varying personalities. In this Wizard of Oz experiment, 34 middle-aged (50 to 64 years old) and 24 older adults (65 to 80 years old) participated in a user study at a simulated home, interacting with a VA using either formal or informal language. Older adults with… ▽ More Little research has investigated the design of conversational styles of voice assistants (VA) for adults in their later adulthood with varying personalities. In this Wizard of Oz experiment, 34 middle-aged (50 to 64 years old) and 24 older adults (65 to 80 years old) participated in a user study at a simulated home, interacting with a VA using either formal or informal language. Older adults with higher agreeableness perceived VA as being more likeable than middle-aged adults. Middle-aged adults showed similar technology acceptance toward the informal and formal VA, and older adults preferred using informal VA, especially those with low agreeableness. Further, while both middle-aged and older adults frequently anthropomorphized VAs by using human metaphors for them, older adults compared formal VA with professionals (e.g., librarians, teachers) and informal VA with their close ones (e.g., spouses, relatives). Overall, the conversational style showed differential effects on the perceptions of middle-aged and older adults, suggesting personalized design implications. △ Less

Submitted 20 September, 2023; originally announced September 2023.

arXiv:2211.03506 [pdf, other]

CAN Bus: The Future of Additive Manufacturing (3D Printing)

Authors: Jun-Cheng Chin, Tyler Cultice, Himanshu Thapliyal

Abstract: Additive Manufacturing (AM) is gaining renewed popularity and attention due to low-cost fabrication systems proliferating the market. Current communication protocols used in AM limit the connection flexibility between the control board and peripherals; they are often complex in their wiring and thus restrict their avenue of expansion. Thus, the Controller Area Network (CAN) bus is an attractive pa… ▽ More Additive Manufacturing (AM) is gaining renewed popularity and attention due to low-cost fabrication systems proliferating the market. Current communication protocols used in AM limit the connection flexibility between the control board and peripherals; they are often complex in their wiring and thus restrict their avenue of expansion. Thus, the Controller Area Network (CAN) bus is an attractive pathway for inter-hardware connections due to its innate quality. However, the combination of CAN and AM is not well explored and documented in existing literature. This article aims to provide examples of CAN bus applications in AM. △ Less

Submitted 27 October, 2022; originally announced November 2022.

Comments: 6 pages

Journal ref: IEEE Consumer Electronics Magazine, 2022

arXiv:2203.00237 [pdf]

doi 10.1089/cyber.2022.0116

Mental Health Pandemic during the COVID-19 Outbreak: Social Media as a Window to Public Mental Health

Authors: Michelle Bak, Chungyi Chiu, Jessie Chin

Abstract: Intensified preventive measures during the COVID-19 pandemic, such as lockdown and social distancing, heavily increased the perception of social isolation (i.e., a discrepancy between one's social needs and the provisions of the social environment) among young adults. Social isolation is closely associated with situational loneliness (i.e., loneliness emerging from environmental change), a risk fa… ▽ More Intensified preventive measures during the COVID-19 pandemic, such as lockdown and social distancing, heavily increased the perception of social isolation (i.e., a discrepancy between one's social needs and the provisions of the social environment) among young adults. Social isolation is closely associated with situational loneliness (i.e., loneliness emerging from environmental change), a risk factor for depressive symptoms. Prior research suggested vulnerable young adults are likely to seek support from an online social platform such as Reddit, a perceived comfortable environment for lonely individuals to seek mental health help through anonymous communication with a broad social network. Therefore, this study aims to identify and analyze depression-related dialogues on loneliness subreddits during the COVID-19 outbreak, with the implications on depression-related infoveillance during the pandemic. Our study utilized logistic regression and topic modeling to classify and examine depression-related discussions on loneliness subreddits before and during the pandemic. Our results showed significant increases in the volume of depression-related discussions (i.e., topics related to mental health, social interaction, family, and emotion) where challenges were reported during the pandemic. We also found a switch in dominant topics emerging from depression-related discussions on loneliness subreddits, from dating (prepandemic) to online interaction and community (pandemic), suggesting the increased expressions or need of online social support during the pandemic. The current findings suggest the potential of social media to serve as a window for monitoring public mental health. Our future study will clinically validate the current approach, which has implications for designing a surveillance system during the crisis. △ Less

Submitted 25 April, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

arXiv:2110.11536 [pdf, other]

Neural-guided, Bidirectional Program Search for Abstraction and Reasoning

Authors: Simon Alford, Anshula Gandhi, Akshay Rangamani, Andrzej Banburski, Tony Wang, Sylee Dandekar, John Chin, Tomaso Poggio, Peter Chin

Abstract: One of the challenges facing artificial intelligence research today is designing systems capable of utilizing systematic reasoning to generalize to new tasks. The Abstraction and Reasoning Corpus (ARC) measures such a capability through a set of visual reasoning tasks. In this paper we report incremental progress on ARC and lay the foundations for two approaches to abstraction and reasoning not ba… ▽ More One of the challenges facing artificial intelligence research today is designing systems capable of utilizing systematic reasoning to generalize to new tasks. The Abstraction and Reasoning Corpus (ARC) measures such a capability through a set of visual reasoning tasks. In this paper we report incremental progress on ARC and lay the foundations for two approaches to abstraction and reasoning not based in brute-force search. We first apply an existing program synthesis system called DreamCoder to create symbolic abstractions out of tasks solved so far, and show how it enables solving of progressively more challenging ARC tasks. Second, we design a reasoning algorithm motivated by the way humans approach ARC. Our algorithm constructs a search graph and reasons over this graph structure to discover task solutions. More specifically, we extend existing execution-guided program synthesis approaches with deductive reasoning based on function inverse semantics to enable a neural-guided bidirectional search algorithm. We demonstrate the effectiveness of the algorithm on three domains: ARC, 24-Game tasks, and a 'double-and-add' arithmetic puzzle. △ Less

Submitted 26 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

Comments: Published as a conference paper at Complex Networks 2021

arXiv:2103.00032 [pdf]

doi 10.22152/programming-journal.org/2021/5/13

Finding Bugs with Specification-Based Testing is Easy!

Authors: Janice Chin, David Pearce

Abstract: Automated specification-based testing has a long history with several notable tools having emerged. For example, QuickCheck for Haskell focuses on testing against user-provided properties. Others, such as JMLUnit, use specifications in the form of pre- and post-conditions to drive testing. An interesting (and under-explored) question is how effective this approach is at finding bugs in practice… ▽ More Automated specification-based testing has a long history with several notable tools having emerged. For example, QuickCheck for Haskell focuses on testing against user-provided properties. Others, such as JMLUnit, use specifications in the form of pre- and post-conditions to drive testing. An interesting (and under-explored) question is how effective this approach is at finding bugs in practice. In general, one would assume automated testing is less effective at bug finding than static verification. But, how much less effective? To shed light on this question, we consider automated testing of programs written in Whiley -- a language with first-class support for specifications. Whilst originally designed with static verification in mind, we have anecdotally found automated testing for Whiley surprisingly useful and cost-effective. For example, when an error is detected with automated testing, a counterexample is always provided. This has motivated the more rigorous empirical examination presented in this paper. To that end, we provide a technical discussion of the implementation behind an automated testing tool for Whiley. Here, a key usability concern is the ability to parameterise the input space, and we present novel approaches for references and lambdas. We then report on several large experiments investigating the tool's effectiveness at bug finding using a range of benchmarks, including a suite of 1800+ mutants. The results indicate the automated testing is effective in many cases, and that sampling offers useful performance benefits with only modest reductions in bug-finding capability. Finally, we report on some real-world uses of the tool where it has proved effective at finding bugs (such as in the standard library). △ Less

Submitted 26 February, 2021; originally announced March 2021.

Journal ref: The Art, Science, and Engineering of Programming, 2021, Vol. 5, Issue 3, Article 13

arXiv:2002.05408 [pdf, other]

doi 10.1016/j.apenergy.2020.116075

Consumer Privacy Protection using Flexible Thermal Loads: Theoretical Limits and Practical Considerations

Authors: Jun-Xing Chin, Kyri Baker, Gabriela Hug

Abstract: The increasing adoption of smart meters introduces growing concerns about consumer privacy risks stemming from high resolution metering data. To counter these risks, there have been various works in actively sha** the grid-visible energy consumption profile using controllable loads such as energy storage systems (ESSs) and flexible consumer loads. In this paper, we compare the use of flexible th… ▽ More The increasing adoption of smart meters introduces growing concerns about consumer privacy risks stemming from high resolution metering data. To counter these risks, there have been various works in actively sha** the grid-visible energy consumption profile using controllable loads such as energy storage systems (ESSs) and flexible consumer loads. In this paper, we compare the use of flexible thermal-based consumer loads (FTLs) against ESSs for consumer privacy protection. By first assuming ideal conditions, and subsequently bringing them closer to reality, the limitations of using FTLs for privacy protection are identified. Through theoretical analyses and realistic simulations, it is shown that, due to the limitations in the operation of FTLs, without significant over-sizing of systems and sacrifices in consumer comfort, FTLs of much higher equivalent energy storage capacity are required to afford the same level of protection as ESSs. Nonetheless, given their increasing ubiquity, controllable FTLs should be considered for use in consumer privacy protection. △ Less

Submitted 28 October, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

Comments: This is the accepted version of the article published in Applied Energy, and is shared under the Creative Commons CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/

Journal ref: Applied Energy, vol. 281, 2021

arXiv:1612.05120 [pdf, other]

doi 10.1109/TSG.2017.2703158

Privacy-Protecting Energy Management Unit through Model-Distribution Predictive Control

Authors: Jun-Xing Chin, Tomas Tinoco De Rubira, Gabriela Hug

Abstract: The roll-out of smart meters in electricity networks introduces risks for consumer privacy due to increased measurement frequency and granularity. Through various Non-Intrusive Load Monitoring techniques, consumer behavior may be inferred from their metering data. In this paper, we propose an energy management method that reduces energy cost and protects privacy through the minimization of informa… ▽ More The roll-out of smart meters in electricity networks introduces risks for consumer privacy due to increased measurement frequency and granularity. Through various Non-Intrusive Load Monitoring techniques, consumer behavior may be inferred from their metering data. In this paper, we propose an energy management method that reduces energy cost and protects privacy through the minimization of information leakage. The method is based on a Model Predictive Controller that utilizes energy storage and local generation, and that predicts the effects of its actions on the statistics of the actual energy consumption of a consumer and that seen by the grid. Computationally, the method requires solving a Mixed-Integer Quadratic Program of manageable size whenever new meter readings are available. We simulate the controller on generated residential load profiles with different privacy costs in a two-tier time-of-use energy pricing environment. Results show that information leakage is effectively reduced at the expense of increased energy cost. The results also show that with the proposed controller the consumer load profile seen by the grid resembles a mixture between that obtained with Non-Intrusive Load Leveling and Lazy Step**. △ Less

Submitted 27 June, 2017; v1 submitted 15 December, 2016; originally announced December 2016.

Comments: Accepted for publication in IEEE Transactions on Smart Grid 2017, special issue on Distributed Control and Efficient Optimization Methods for Smart Grid

arXiv:1612.03975 [pdf, other]

ConceptNet 5.5: An Open Multilingual Graph of General Knowledge

Authors: Robyn Speer, Joshua Chin, Catherine Havasi

Abstract: Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited to be used with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and phrases of natural language with labeled edges.… ▽ More Machine learning about language can be improved by supplying it with specific knowledge and sources of external information. We present here a new version of the linked open data resource ConceptNet that is particularly well suited to be used with modern NLP techniques such as word embeddings. ConceptNet is a knowledge graph that connects words and phrases of natural language with labeled edges. Its knowledge is collected from many sources that include expert-created resources, crowd-sourcing, and games with a purpose. It is designed to represent the general knowledge involved in understanding language, improving natural language applications by allowing the application to better understand the meanings behind the words people use. When ConceptNet is combined with word embeddings acquired from distributional semantics (such as word2vec), it provides applications with understanding that they would not acquire from distributional semantics alone, nor from narrower resources such as WordNet or DBPedia. We demonstrate this with state-of-the-art results on intrinsic evaluations of word relatedness that translate into improvements on applications of word vectors, including solving SAT-style analogies. △ Less

Submitted 11 December, 2018; v1 submitted 12 December, 2016; originally announced December 2016.

ACM Class: I.2.7

Journal ref: AAAI 31 (2017) 4444-4451

arXiv:1609.05081 [pdf, other]

Asymptotic Analysis of Equivalences and Core-Structures in Kronecker-Style Graph Models

Authors: Alex J. Chin, Timothy D. Goodrich, Michael P. O'Brien, Felix Reidl, Blair D. Sullivan, Andrew van der Poel

Abstract: Growing interest in modeling large, complex networks has spurred significant research into generative graph models. Kronecker-style models (SKG and R-MAT) are often used due to their scalability and ability to mimic key properties of real-world networks. Although a few papers theoretically establish these models' behavior for specific parameters, many claims used to justify their use are supported… ▽ More Growing interest in modeling large, complex networks has spurred significant research into generative graph models. Kronecker-style models (SKG and R-MAT) are often used due to their scalability and ability to mimic key properties of real-world networks. Although a few papers theoretically establish these models' behavior for specific parameters, many claims used to justify their use are supported only empirically. In this work, we prove several results using asymptotic analysis which illustrate that empirical studies may not fully capture the true behavior of the models. Paramount to the widespread adoption of Kronecker-style models was the introduction of a linear-time edge-sampling variant (R-MAT), which existing literature typically treats as interchangeable with SKG. We prove that although several R-MAT formulations are asymptotically equivalent, their behavior diverges from that of SKG. Further, we show these results are observable even at relatively small graph sizes. Second, we consider a case where asymptotic analysis reveals unexpected behavior within a given model. △ Less

Submitted 16 September, 2016; originally announced September 2016.

arXiv:1604.01692 [pdf, other]

An Ensemble Method to Produce High-Quality Word Embeddings (2016)

Authors: Robyn Speer, Joshua Chin

Abstract: A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 2012) and PPDB (Ganitkevitch et al., 2013), merging… ▽ More A currently successful approach to computational semantics is to represent words as embeddings in a machine-learned vector space. We present an ensemble method that combines embeddings produced by GloVe (Pennington et al., 2014) and word2vec (Mikolov et al., 2013) with structured knowledge from the semantic networks ConceptNet (Speer and Havasi, 2012) and PPDB (Ganitkevitch et al., 2013), merging their information into a common representation with a large, multilingual vocabulary. The embeddings it produces achieve state-of-the-art performance on many word-similarity evaluations. Its score of $ρ= .596$ on an evaluation of rare words (Luong et al., 2013) is 16% higher than the previous best known system. △ Less

Submitted 19 December, 2019; v1 submitted 6 April, 2016; originally announced April 2016.

Comments: Corrected author name, revised reproducibility instructions that didn't work anymore. 12 pages, 3 figures

MSC Class: I.2.7 ACM Class: I.2.7

arXiv:cs/0501021 [pdf, ps, other]

doi 10.1098/rsta.2005.1618

Large-scale lattice Boltzmann simulations of complex fluids: advances through the advent of computational grids

Authors: J. Harting, J. Chin, M. Venturoli, P. V. Coveney

Abstract: During the last two years the RealityGrid project has allowed us to be one of the few scientific groups involved in the development of computational grids. Since smoothly working production grids are not yet available, we have been able to substantially influence the direction of software development and grid deployment within the project. In this paper we review our results from large scale thr… ▽ More During the last two years the RealityGrid project has allowed us to be one of the few scientific groups involved in the development of computational grids. Since smoothly working production grids are not yet available, we have been able to substantially influence the direction of software development and grid deployment within the project. In this paper we review our results from large scale three-dimensional lattice Boltzmann simulations performed over the last two years. We describe how the proactive use of computational steering and advanced job migration and visualization techniques enabled us to do our scientific work more efficiently. The projects reported on in this paper are studies of complex fluid flows under shear or in porous media, as well as large-scale parameter searches, and studies of the self-organisation of liquid cubic mesophases. Movies are available at http://www.ica1.uni-stuttgart.de/~jens/pub/05/05-PhilTransReview.html △ Less

Submitted 11 January, 2005; originally announced January 2005.

Comments: 18 pages, 9 figures, 4 movies available, accepted for publication in Phil. Trans. R. Soc. London Series A

Journal ref: Phil. Trans. R. Soc. London Series A 363 1895-1915 (2005)

arXiv:cs/0405015 [pdf, ps, other]

A High-Level Reconfigurable Computing Platform Software Frameworks

Authors: Darran Nathan, Kelvin Lim Mun Kit, Kelly Choo Hon Min, Philip Wong Jit Chin, Andreas Weisensee

Abstract: Reconfigurable computing refers to the use of processors, such as Field Programmable Gate Arrays (FPGAs), that can be modified at the hardware level to take on different processing tasks. A reconfigurable computing platform describes the hardware and software base on top of which modular extensions can be created, depending on the desired application. Such reconfigurable computing platforms can… ▽ More Reconfigurable computing refers to the use of processors, such as Field Programmable Gate Arrays (FPGAs), that can be modified at the hardware level to take on different processing tasks. A reconfigurable computing platform describes the hardware and software base on top of which modular extensions can be created, depending on the desired application. Such reconfigurable computing platforms can take on varied designs and implementations, according to the constraints imposed and features desired by the scope of applications. This paper introduces a PC-based reconfigurable computing platform software frameworks that is flexible and extensible enough to abstract the different hardware types and functionality that different PCs may have. The requirements of the software platform, architectural issues addressed, rationale behind the decisions made, and frameworks design implemented are discussed. △ Less

Submitted 4 May, 2004; originally announced May 2004.

Comments: 4 pages, 8 figures

ACM Class: D.2.11

Showing 1–15 of 15 results for author: Chin, J