-
Implantable silicon neural probes with nanophotonic phased arrays for single-lobe beam steering
Authors:
Fu-Der Chen,
Ankita Sharma,
Tianyuan Xue,
Youngho Jung,
Alperen Govdeli,
Jason C. C. Mak,
Homeira Moradi Chameh,
Mandana Movahed,
Michael G. K. Brunk,
Xianshu Luo,
Hongyao Chua,
Patrick Guo-Qiang Lo,
Taufik A Valiante,
Wesley D. Sacher,
Joyce K. S. Poon
Abstract:
In brain activity map** experiments using optogenetics, patterned illumination is crucial for deterministic and localized stimulation of neurons. However, due to optical scattering in brain tissue, light-emitting implantable devices are needed to bring precise patterned illumination to deep brain regions. A promising solution is silicon neural probes with integrated nanophotonic circuits that fo…
▽ More
In brain activity map** experiments using optogenetics, patterned illumination is crucial for deterministic and localized stimulation of neurons. However, due to optical scattering in brain tissue, light-emitting implantable devices are needed to bring precise patterned illumination to deep brain regions. A promising solution is silicon neural probes with integrated nanophotonic circuits that form tailored beam emission patterns without lenses. Here, we demonstrate neural probes with grating-based light emitters that generate a single steerable light beam across $> 60\%$ of the steering range with $\ge 4$ dB of background suppression for optogenetic photostimulation. The light emitters, optimized for blue or amber light, combine end-fire optical phased arrays with slab gratings to suppress higher-order sidelobes.
△ Less
Submitted 3 April, 2024;
originally announced April 2024.
-
Re2LLM: Reflective Reinforcement Large Language Model for Session-based Recommendation
Authors:
Ziyan Wang,
Yingpeng Du,
Zhu Sun,
Haoyan Chua,
Kaidong Feng,
Wenya Wang,
Jie Zhang
Abstract:
Large Language Models (LLMs) are emerging as promising approaches to enhance session-based recommendation (SBR), where both prompt-based and fine-tuning-based methods have been widely investigated to align LLMs with SBR. However, the former methods struggle with optimal prompts to elicit the correct reasoning of LLMs due to the lack of task-specific feedback, leading to unsatisfactory recommendati…
▽ More
Large Language Models (LLMs) are emerging as promising approaches to enhance session-based recommendation (SBR), where both prompt-based and fine-tuning-based methods have been widely investigated to align LLMs with SBR. However, the former methods struggle with optimal prompts to elicit the correct reasoning of LLMs due to the lack of task-specific feedback, leading to unsatisfactory recommendations. Although the latter methods attempt to fine-tune LLMs with domain-specific knowledge, they face limitations such as high computational costs and reliance on open-source backbones. To address such issues, we propose a Reflective Reinforcement Large Language Model (Re2LLM) for SBR, guiding LLMs to focus on specialized knowledge essential for more accurate recommendations effectively and efficiently. In particular, we first design the Reflective Exploration Module to effectively extract knowledge that is readily understandable and digestible by LLMs. To be specific, we direct LLMs to examine recommendation errors through self-reflection and construct a knowledge base (KB) comprising hints capable of rectifying these errors. To efficiently elicit the correct reasoning of LLMs, we further devise the Reinforcement Utilization Module to train a lightweight retrieval agent. It learns to select hints from the constructed KB based on the task-specific feedback, where the hints can serve as guidance to help correct LLMs reasoning for better recommendations. Extensive experiments on multiple real-world datasets demonstrate that our method consistently outperforms state-of-the-art methods.
△ Less
Submitted 19 April, 2024; v1 submitted 25 March, 2024;
originally announced March 2024.
-
Large Language Model with Graph Convolution for Recommendation
Authors:
Yingpeng Du,
Ziyan Wang,
Zhu Sun,
Haoyan Chua,
Hongzhi Liu,
Zhonghai Wu,
Yining Ma,
Jie Zhang,
Youchen Sun
Abstract:
In recent years, efforts have been made to use text information for better user profiling and item characterization in recommendations. However, text information can sometimes be of low quality, hindering its effectiveness for real-world applications. With knowledge and reasoning capabilities capsuled in Large Language Models (LLMs), utilizing LLMs emerges as a promising way for description improv…
▽ More
In recent years, efforts have been made to use text information for better user profiling and item characterization in recommendations. However, text information can sometimes be of low quality, hindering its effectiveness for real-world applications. With knowledge and reasoning capabilities capsuled in Large Language Models (LLMs), utilizing LLMs emerges as a promising way for description improvement. However, existing ways of prompting LLMs with raw texts ignore structured knowledge of user-item interactions, which may lead to hallucination problems like inconsistent description generation. To this end, we propose a Graph-aware Convolutional LLM method to elicit LLMs to capture high-order relations in the user-item graph. To adapt text-based LLMs with structured graphs, We use the LLM as an aggregator in graph processing, allowing it to understand graph-based information step by step. Specifically, the LLM is required for description enhancement by exploring multi-hop neighbors layer by layer, thereby propagating information progressively in the graph. To enable LLMs to capture large-scale graph information, we break down the description task into smaller parts, which drastically reduces the context length of the token input with each step. Extensive experiments on three real-world datasets show that our method consistently outperforms state-of-the-art methods.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Implantable Photonic Neural Probes with Out-of-Plane Focusing Grating Emitters
Authors:
Tianyuan Xue,
Andrei Stalmashonak,
Fu-Der Chen,
Peisheng Ding,
Xianshu Luo,
Hongyao Chua,
Guo-Qiang Lo,
Wesley D. Sacher,
Joyce K. S. Poon
Abstract:
We have designed, fabricated, and characterized implantable silicon neural probes with nanophotonic grating emitters that focus the emitted light at a specified distance above the surface of the probe for spatially precise optogenetic targeting of neurons. Using the holographic principle, we designed gratings for wavelengths of 488 and 594 nm, targeting the excitation spectra of the optogenetic ac…
▽ More
We have designed, fabricated, and characterized implantable silicon neural probes with nanophotonic grating emitters that focus the emitted light at a specified distance above the surface of the probe for spatially precise optogenetic targeting of neurons. Using the holographic principle, we designed gratings for wavelengths of 488 and 594 nm, targeting the excitation spectra of the optogenetic actuators Channelrhodopsin-2 and Chrimson, respectively. The measured optical emission pattern of these emitters in non-scattering medium and tissue matched well with simulations. To our knowledge, this is the first report of focused spots with the size scale of a neuron soma in brain tissue formed from implantable neural probes.
△ Less
Submitted 10 January, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
Room-temperature waveguide-coupled silicon single-photon avalanche diodes
Authors:
Alperen Govdeli,
John N. Straguzzi,
Zheng Yong,
Yiding Lin,
Xianshu Luo,
Hongyao Chua,
Guo-Qiang Lo,
Wesley D. Sacher,
Joyce K. S. Poon
Abstract:
Single photon detection is important for a wide range of low-light applications, including quantum information processing, spectroscopy, and light detection and ranging (LiDAR). A key challenge in these applications has been to integrate single-photon detection capability into photonic circuits for the realization of complex photonic microsystems. Short-wavelength ($λ$ < 1.1 $μ$m) integrated photo…
▽ More
Single photon detection is important for a wide range of low-light applications, including quantum information processing, spectroscopy, and light detection and ranging (LiDAR). A key challenge in these applications has been to integrate single-photon detection capability into photonic circuits for the realization of complex photonic microsystems. Short-wavelength ($λ$ < 1.1 $μ$m) integrated photonics platforms that use silicon (Si) as photodetectors offer the opportunity to achieve single-photon avalanche diodes (SPADs) that operate at or near room temperature. Here, we report the first waveguide-coupled Si SPAD. The device is monolithically integrated in a Si photonic platform and operates in the visible spectrum. The device exhibited a single photon detection efficiency of > 6% for wavelengths of 488 nm and 532 nm with an excess voltage less than 20% of the breakdown voltage. The dark count rate was below 100 kHz at room temperature, with the possibility of improving by approximately 35% by reducing the temperature to -5$^{\circ}$C.
△ Less
Submitted 25 January, 2024; v1 submitted 15 October, 2023;
originally announced October 2023.
-
MERLIon CCS Challenge Evaluation Plan
Authors:
Leibny Paola Garcia Perera,
Y. H. Victoria Chua,
Hexin Liu,
Fei Ting Woon,
Andy W. H. Khong,
Justin Dauwels,
Sanjeev Khudanpur,
Suzy J. Styles
Abstract:
This paper introduces the inaugural Multilingual Everyday Recordings- Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge, focused on develo** robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous code-switched, child-directed speech collected via Zoom. Aligning closely with Interspeech 2023 th…
▽ More
This paper introduces the inaugural Multilingual Everyday Recordings- Language Identification on Code-Switched Child-Directed Speech (MERLIon CCS) Challenge, focused on develo** robust language identification and language diarization systems that are reliable for non-standard, accented, spontaneous code-switched, child-directed speech collected via Zoom. Aligning closely with Interspeech 2023 theme, the main objectives of this inaugural challenge are to present a unique first-of-its-kind Zoom videocall dataset featuring English-Mandarin spontaneous code-switched child-directed speech, benchmark the current and novel language identification and language diarization systems in a code-switching scenario including extremely short utterances, and test the robustness of such systems under accented speech. The MERLIon CCS challenge features two task: language identification (Task 1) and language diarization (Task 2). Two tracks, open and closed, are available for each task, differing by the volume of data systems can be trained on. This paper describes the dataset, dataset annotation protocol, challenge tasks, open and closed tracks, evaluation metrics, and evaluation protocol.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Investigating model performance in language identification: beyond simple error statistics
Authors:
Suzy J. Styles,
Victoria Y. H. Chua,
Fei Ting Woon,
Hexin Liu,
Leibny Paola Garcia Perera,
Sanjeev Khudanpur,
Andy W. H. Khong,
Justin Dauwels
Abstract:
Language development experts need tools that can automatically identify languages from fluent, conversational speech, and provide reliable estimates of usage rates at the level of an individual recording. However, language identification systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics…
▽ More
Language development experts need tools that can automatically identify languages from fluent, conversational speech, and provide reliable estimates of usage rates at the level of an individual recording. However, language identification systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may therefore mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. In the current paper, we investigate how well a number of language identification systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge. The Challenge dataset features accented English-Mandarin code-switched child-directed speech.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Authors:
Victoria Y. H. Chua,
Hexin Liu,
Leibny Paola Garcia Perera,
Fei Ting Woon,
**yi Wong,
Xiangyu Zhang,
Sanjeev Khudanpur,
Andy W. H. Khong,
Justin Dauwels,
Suzy J. Styles
Abstract:
To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns. We present the MERLIon CCS challenge, featuring a first-of-its-kind Zoom video call dataset of parent-child sh…
▽ More
To enhance the reliability and robustness of language identification (LID) and language diarization (LD) systems for heterogeneous populations and scenarios, there is a need for speech processing models to be trained on datasets that feature diverse language registers and speech patterns. We present the MERLIon CCS challenge, featuring a first-of-its-kind Zoom video call dataset of parent-child shared book reading, of over 30 hours with over 300 recordings, annotated by multilingual transcribers using a high-fidelity linguistic transcription protocol. The audio corpus features spontaneous and in-the-wild English-Mandarin code-switching, child-directed speech in non-standard accents with diverse language-mixing patterns recorded in a variety of home environments. This report describes the corpus, as well as LID and LD results for our baseline and several systems submitted to the MERLIon CCS challenge using the corpus.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
Implantable Photonic Neural Probes with 3D-Printed Microfluidics and Applications to Uncaging
Authors:
Xin Mu,
Fu-Der Chen,
Ka My Dang,
Michael G. K. Brunk,
Jianfeng Li,
Hannes Wahn,
Andrei Stalmashonak,
Peisheng Ding,
Xianshu Luo,
Hongyao Chua,
Guo-Qiang Lo,
Joyce K. S. Poon,
Wesley D. Sacher
Abstract:
Advances in chip-scale photonic-electronic integration are enabling a new generation of foundry-manufacturable implantable silicon neural probes incorporating nanophotonic waveguides and microelectrodes for optogenetic stimulation and electrophysiological recording in neuroscience research. Further extending neural probe functionalities with integrated microfluidics is a direct approach to achieve…
▽ More
Advances in chip-scale photonic-electronic integration are enabling a new generation of foundry-manufacturable implantable silicon neural probes incorporating nanophotonic waveguides and microelectrodes for optogenetic stimulation and electrophysiological recording in neuroscience research. Further extending neural probe functionalities with integrated microfluidics is a direct approach to achieve neurochemical injection and sampling capabilities. In this work, we use two-photon polymerization 3D printing to integrate microfluidic channels onto photonic neural probes, which include silicon nitride nanophotonic waveguides and grating emitters. The customizability of 3D printing enables a unique geometry of microfluidics that conforms to the shape of each neural probe, enabling integration of microfluidics with a variety of existing neural probes while avoiding the complexities of monolithic microfluidics integration. We demonstrate the photonic and fluidic functionalities of the neural probes via fluorescein injection in agarose gel and photoloysis of caged fluorescein in solution and in flxed brain tissue.
△ Less
Submitted 25 April, 2023; v1 submitted 20 April, 2023;
originally announced April 2023.
-
Microcantilever-integrated photonic circuits for broadband laser beam scanning
Authors:
Saeed Sharif Azadeh,
Jason C. C. Mak,
Hong Chen,
Xianshu Luo,
Fu-Der Chen,
Hongyao Chua,
Frank Weiss,
Christopher Alexiev,
Andrei Stalmashonak,
Youngho Jung,
John N. Straguzzi,
Guo-Qiang Lo,
Wesley D. Sacher,
Joyce K. S. Poon
Abstract:
Laser beam scanning is central to many applications, including displays, microscopy, three-dimensional map**, and quantum information. Reducing the scanners to microchip form factors has spurred the development of very-large-scale photonic integrated circuits of optical phased arrays and focal plane switched arrays. An outstanding challenge remains to simultaneously achieve a compact footprint,…
▽ More
Laser beam scanning is central to many applications, including displays, microscopy, three-dimensional map**, and quantum information. Reducing the scanners to microchip form factors has spurred the development of very-large-scale photonic integrated circuits of optical phased arrays and focal plane switched arrays. An outstanding challenge remains to simultaneously achieve a compact footprint, broad wavelength operation, and low power consumption. Here, we introduce a laser beam scanner that meets these requirements. Using microcantilevers embedded with silicon nitride nanophotonic circuitry, we demonstrate broadband, one- and two-dimensional steering of light with wavelengths from 410 nm to 700 nm. The microcantilevers have ultracompact ~0.1 mm$^2$ areas, consume ~31 to 46 mW of power, are simple to control, and emit a single light beam. The microcantilevers are monolithically integrated in an active photonic platform on 200-mm silicon wafers. The microcantilever-integrated photonic circuits miniaturize and simplify light projectors to enable versatile, power-efficient, and broadband laser scanner microchips.
△ Less
Submitted 11 October, 2022; v1 submitted 25 July, 2022;
originally announced July 2022.
-
Power-Efficient Silicon Nitride Thermo-Optic Phase Shifters for Visible Light
Authors:
Zheng Yong,
Hong Chen,
Xianshu Luo,
Alperen Govdeli,
Hongyao Chua,
Saeed S. Azadeh,
Andrei Stalmashonak,
Guo-Qiang Lo,
Joyce K. S. Poon,
Wesley D. Sacher
Abstract:
We demonstrate power-efficient, thermo-optic, silicon nitride waveguide phase shifters for blue, green, and yellow wavelengths. The phase shifters operated with low power consumption due to a suspended structure and multi-pass waveguide design. The devices were fabricated on 200-mm silicon wafers using deep ultraviolet lithography as part of an active visible-light integrated photonics platform. T…
▽ More
We demonstrate power-efficient, thermo-optic, silicon nitride waveguide phase shifters for blue, green, and yellow wavelengths. The phase shifters operated with low power consumption due to a suspended structure and multi-pass waveguide design. The devices were fabricated on 200-mm silicon wafers using deep ultraviolet lithography as part of an active visible-light integrated photonics platform. The measured power consumption to achieve a $π$ phase shift (averaged over multiple devices) was 0.78, 0.93, 1.09, and 1.20 mW at wavelengths of 445, 488, 532, and 561 nm, respectively. The phase shifters were integrated into Mach-Zehnder interferometer switches, and $10- 90$\% rise(fall) times of about 570(590) $μ$s were measured.
△ Less
Submitted 15 November, 2021;
originally announced November 2021.
-
Towards Plug-and-Play Visual Graph Query Interfaces: Data-driven Canned Pattern Selection for Large Networks
Authors:
Zifeng Yuan,
Huey Eng Chua,
Sourav S Bhowmick,
Zekun Ye,
Wook-Shin Han,
Byron Choi
Abstract:
Canned patterns (i.e. small subgraph patterns) in visual graph query interfaces (a.k.a GUI) facilitate efficient query formulation by enabling pattern-at-a-time construction mode. However, existing GUIs for querying large networks either do not expose any canned patterns or if they do then they are typically selected manually based on domain knowledge. Unfortunately, manual generation of canned pa…
▽ More
Canned patterns (i.e. small subgraph patterns) in visual graph query interfaces (a.k.a GUI) facilitate efficient query formulation by enabling pattern-at-a-time construction mode. However, existing GUIs for querying large networks either do not expose any canned patterns or if they do then they are typically selected manually based on domain knowledge. Unfortunately, manual generation of canned patterns is not only labor intensive but may also lack diversity for supporting efficient visual formulation of a wide range of subgraph queries. In this paper, we present a novel generic and extensible framework called TATTOO that takes a data-driven approach to automatically selecting canned patterns for a GUI from large networks. Specifically, it first decomposes the underlying network into truss-infested and truss-oblivious regions. Then candidate canned patterns capturing different real-world query topologies are generated from these regions. Canned patterns based on a user-specified plug are then selected for the GUI from these candidates by maximizing coverage and diversity, and by minimizing the cognitive load of the pattern set. Experimental studies with real-world datasets demonstrate the benefits of TATTOO. Importantly, this work takes a concrete step towards realizing plug-and-play visual graph query interfaces for large networks.
△ Less
Submitted 21 July, 2021;
originally announced July 2021.
-
Audio Adversarial Examples: Attacks Using Vocal Masks
Authors:
Kai Yuan Tay,
Lynnette Ng,
Wei Han Chua,
Lucerne Loke,
Danqi Ye,
Melissa Chua
Abstract:
We construct audio adversarial examples on automatic Speech-To-Text systems . Given any audio waveform, we produce an another by overlaying an audio vocal mask generated from the original audio. We apply our audio adversarial attack to five SOTA STT systems: DeepSpeech, Julius, Kaldi, wav2letter@anywhere and CMUSphinx. In addition, we engaged human annotators to transcribe the adversarial audio. O…
▽ More
We construct audio adversarial examples on automatic Speech-To-Text systems . Given any audio waveform, we produce an another by overlaying an audio vocal mask generated from the original audio. We apply our audio adversarial attack to five SOTA STT systems: DeepSpeech, Julius, Kaldi, wav2letter@anywhere and CMUSphinx. In addition, we engaged human annotators to transcribe the adversarial audio. Our experiments show that these adversarial examples fool State-Of-The-Art Speech-To-Text systems, yet humans are able to consistently pick out the speech. The feasibility of this attack introduces a new domain to study machine and human perception of speech.
△ Less
Submitted 5 February, 2021; v1 submitted 4 February, 2021;
originally announced February 2021.
-
SIR Simulation of COVID-19 Pandemic in Malaysia: Will the Vaccination Program be Effective?
Authors:
W. K. Wong,
Filbert H. Juwono,
Tock H. Chua
Abstract:
Since the end of 2019, COVID-19 has significantly affected the lives of people around the world. Towards the end of 2020, several COVID-19 vaccine candidates with relatively high efficacy have been reported in the final phase of clinical trials. Vaccines have been considered as critical tools for opening up social and economic activities, thereby lessening the impact of this disease on the society…
▽ More
Since the end of 2019, COVID-19 has significantly affected the lives of people around the world. Towards the end of 2020, several COVID-19 vaccine candidates with relatively high efficacy have been reported in the final phase of clinical trials. Vaccines have been considered as critical tools for opening up social and economic activities, thereby lessening the impact of this disease on the society. This paper presents a simulation of COVID-19 spread using modified Susceptible-Infected-Removed (SIR) model under vaccine intervention in several localities of Malaysia, i.e. those cities or states with high relatively COVID-19 cases such as Kuala Lumpur, Penang, Sabah, and Sarawak. The results show that at different vaccine efficacy levels (0.75, 0.85, and 0.95), the curves of active infection vary slightly, indicating that vaccines with efficacy above 0.75 would produce the herd immunity required to level the curves. In addition, disparity is significant between implementing or not implementing a vaccination program. Simulation results also show that lowering the reproduction number, R0 is necessary to keep the infection curve flat despite vaccination. This is due to the assumption that vaccination is mostly carried out gradually at the assumed fixed rate. The statement is based on our simulation results with two values of R0: 1.1 and 1.2, indicative of reduction of R0 by social distancing. The lower R0 shows a smaller peak amplitude about half the value simulated with R0=1.2. In conclusion, the simulation model suggests a two-pronged strategy to combat the COVID-19 pandemic in Malaysia: vaccination and compliance with standard operating procedure issued by the World Health Organization (e.g. social distancing).
△ Less
Submitted 19 January, 2021;
originally announced January 2021.
-
Using Data Analytics to predict students score
Authors:
Nang Laik Ma,
Gim Hong Chua
Abstract:
Education is very important to Singapore, and the government has continued to invest heavily in our education system to become one of the world-class systems today. A strong foundation of Science, Technology, Engineering, and Mathematics (STEM) was what underpinned Singapore's development over the past 50 years. PISA is a triennial international survey that evaluates education systems worldwide by…
▽ More
Education is very important to Singapore, and the government has continued to invest heavily in our education system to become one of the world-class systems today. A strong foundation of Science, Technology, Engineering, and Mathematics (STEM) was what underpinned Singapore's development over the past 50 years. PISA is a triennial international survey that evaluates education systems worldwide by testing the skills and knowledge of 15-year-old students who are nearing the end of compulsory education. In this paper, the authors used the PISA data from 2012 and 2015 and developed machine learning techniques to predictive the students' scores and understand the inter-relationships among social, economic, and education factors. The insights gained would be useful to have fresh perspectives on education, useful for policy formulation.
△ Less
Submitted 19 November, 2020;
originally announced December 2020.
-
Semi-supervised and Unsupervised Methods for Heart Sounds Classification in Restricted Data Environments
Authors:
Balagopal Unnikrishnan,
Pranshu Ranjan Singh,
Xulei Yang,
Matthew Chin Heng Chua
Abstract:
Automated heart sounds classification is a much-required diagnostic tool in the view of increasing incidences of heart related diseases worldwide. In this study, we conduct a comprehensive study of heart sounds classification by using various supervised, semi-supervised and unsupervised approaches on the PhysioNet/CinC 2016 Challenge dataset. Supervised approaches, including deep learning and mach…
▽ More
Automated heart sounds classification is a much-required diagnostic tool in the view of increasing incidences of heart related diseases worldwide. In this study, we conduct a comprehensive study of heart sounds classification by using various supervised, semi-supervised and unsupervised approaches on the PhysioNet/CinC 2016 Challenge dataset. Supervised approaches, including deep learning and machine learning methods, require large amounts of labelled data to train the models, which are challenging to obtain in most practical scenarios. In view of the need to reduce the labelling burden for clinical practices, where human labelling is both expensive and time-consuming, semi-supervised or even unsupervised approaches in restricted data setting are desirable. A GAN based semi-supervised method is therefore proposed, which allows the usage of unlabelled data samples to boost the learning of data distribution. It achieves a better performance in terms of AUROC over the supervised baseline when limited data samples exist. Furthermore, several unsupervised methods are explored as an alternative approach by considering the given problem as an anomaly detection scenario. In particular, the unsupervised feature extraction using 1D CNN Autoencoder coupled with one-class SVM obtains good performance without any data labelling. The potential of the proposed semi-supervised and unsupervised methods may lead to a workflow tool in the future for the creation of higher quality datasets.
△ Less
Submitted 3 June, 2020;
originally announced June 2020.
-
TRACER: A Framework for Facilitating Accurate and Interpretable Analytics for High Stakes Applications
Authors:
Kai** Zheng,
Shaofeng Cai,
Horng Ruey Chua,
Wei Wang,
Kee Yuan Ngiam,
Beng Chin Ooi
Abstract:
In high stakes applications such as healthcare and finance analytics, the interpretability of predictive models is required and necessary for domain practitioners to trust the predictions. Traditional machine learning models, e.g., logistic regression (LR), are easy to interpret in nature. However, many of these models aggregate time-series data without considering the temporal correlations and va…
▽ More
In high stakes applications such as healthcare and finance analytics, the interpretability of predictive models is required and necessary for domain practitioners to trust the predictions. Traditional machine learning models, e.g., logistic regression (LR), are easy to interpret in nature. However, many of these models aggregate time-series data without considering the temporal correlations and variations. Therefore, their performance cannot match up to recurrent neural network (RNN) based models, which are nonetheless difficult to interpret. In this paper, we propose a general framework TRACER to facilitate accurate and interpretable predictions, with a novel model TITV devised for healthcare analytics and other high stakes applications such as financial investment and risk management. Different from LR and other existing RNN-based models, TITV is designed to capture both the time-invariant and the time-variant feature importance using a feature-wise transformation subnetwork and a self-attention subnetwork, for the feature influence shared over the entire time series and the time-related importance respectively. Healthcare analytics is adopted as a driving use case, and we note that the proposed TRACER is also applicable to other domains, e.g., fintech. We evaluate the accuracy of TRACER extensively in two real-world hospital datasets, and our doctors/clinicians further validate the interpretability of TRACER in both the patient level and the feature level. Besides, TRACER is also validated in a high stakes financial application and a critical temperature forecasting application. The experimental results confirm that TRACER facilitates both accurate and interpretable analytics for high stakes applications.
△ Less
Submitted 24 March, 2020;
originally announced March 2020.
-
BiRA-Net: Bilinear Attention Net for Diabetic Retinopathy Grading
Authors:
Ziyuan Zhao,
Kerui Zhang,
Xuejie Hao,
**g Tian,
Matthew Chin Heng Chua,
Li Chen,
Xin Xu
Abstract:
Diabetic retinopathy (DR) is a common retinal disease that leads to blindness. For diagnosis purposes, DR image grading aims to provide automatic DR grade classification, which is not addressed in conventional research methods of binary DR image classification. Small objects in the eye images, like lesions and microaneurysms, are essential to DR grading in medical imaging, but they could easily be…
▽ More
Diabetic retinopathy (DR) is a common retinal disease that leads to blindness. For diagnosis purposes, DR image grading aims to provide automatic DR grade classification, which is not addressed in conventional research methods of binary DR image classification. Small objects in the eye images, like lesions and microaneurysms, are essential to DR grading in medical imaging, but they could easily be influenced by other objects. To address these challenges, we propose a new deep learning architecture, called BiRA-Net, which combines the attention model for feature extraction and bilinear model for fine-grained classification. Furthermore, in considering the distance between different grades of different DR categories, we propose a new loss function, called grading loss, which leads to improved training convergence of the proposed approach. Experimental results are provided to demonstrate the superior performance of the proposed approach.
△ Less
Submitted 1 July, 2019; v1 submitted 15 May, 2019;
originally announced May 2019.
-
Multimedia-Video for Learning
Authors:
Kah Hean Chua,
Ming Yeo Oh,
Loo Kang Wee,
Ching Tan
Abstract:
Multimedia engages an audience through a combination of text, audio, still images, animation, video, or interactivity-based content formats. Along this vein, free platforms have been seen to allow budding enthusiasts to create multimedia content. For example, Google sites (Wee, 2012b) offer creative opportunities in website development that enable text insertion, still image, video and animation e…
▽ More
Multimedia engages an audience through a combination of text, audio, still images, animation, video, or interactivity-based content formats. Along this vein, free platforms have been seen to allow budding enthusiasts to create multimedia content. For example, Google sites (Wee, 2012b) offer creative opportunities in website development that enable text insertion, still image, video and animation embedding, along with audio and hyper-interactive links to simulations (Christian & Esquembre, 2012; Wee, 2013; Wee, Goh, & Chew, 2013; Wee, Goh, & Lim, 2013; Wee, Lee, Chew, Wong, & Tan, 2015). This chapter focuses on the video aspect of multimedia, which can be positioned as a component to any effective self-paced on-line lesson that would be available anytime, anywhere via computer or mobile devices. The multimedia video approach aims to help users overcome barriers in creating engaging, effective and meaningful content (Barron & Darling-Hammond, 2008) for teaching and learning in an online environment.
△ Less
Submitted 3 February, 2015;
originally announced February 2015.
-
Microbial community pattern detection in human body habitats via ensemble clustering framework
Authors:
Peng Yang,
Xiaoquan Su,
Le Ou-Yang,
Hon-Nian Chua,
Xiao-Li Li,
Kang Ning
Abstract:
The human habitat is a host where microbial species evolve, function, and continue to evolve. Elucidating how microbial communities respond to human habitats is a fundamental and critical task, as establishing baselines of human microbiome is essential in understanding its role in human disease and health. However, current studies usually overlook a complex and interconnected landscape of human mi…
▽ More
The human habitat is a host where microbial species evolve, function, and continue to evolve. Elucidating how microbial communities respond to human habitats is a fundamental and critical task, as establishing baselines of human microbiome is essential in understanding its role in human disease and health. However, current studies usually overlook a complex and interconnected landscape of human microbiome and limit the ability in particular body habitats with learning models of specific criterion. Therefore, these methods could not capture the real-world underlying microbial patterns effectively. To obtain a comprehensive view, we propose a novel ensemble clustering framework to mine the structure of microbial community pattern on large-scale metagenomic data. Particularly, we first build a microbial similarity network via integrating 1920 metagenomic samples from three body habitats of healthy adults. Then a novel symmetric Nonnegative Matrix Factorization (NMF) based ensemble model is proposed and applied onto the network to detect clustering pattern. Extensive experiments are conducted to evaluate the effectiveness of our model on deriving microbial community with respect to body habitat and host gender. From clustering results, we observed that body habitat exhibits a strong bound but non-unique microbial structural patterns. Meanwhile, human microbiome reveals different degree of structural variations over body habitat and host gender. In summary, our ensemble clustering framework could efficiently explore integrated clustering results to accurately identify microbial communities, and provide a comprehensive view for a set of microbial communities. Such trends depict an integrated biography of microbial communities, which offer a new insight towards uncovering pathogenic model of human microbiome.
△ Less
Submitted 4 January, 2015; v1 submitted 21 December, 2014;
originally announced December 2014.