Search | arXiv e-print repository

Hardware Realization of Neuromorphic Computing with a 4-Port Photonic Reservoir for Modulation Format Identification

Authors: Enes Şeker, Rijil Thomas, Guillermo von Hünefeld, Stephan Suckow, Mahdi Kaveh, Gregor Ronniger, Pooyan Safari, Isaac Sackey, David Stahl, Colja Schubert, Johannes Karl Fischer, Ronald Freund, Max C. Lemme

Abstract: The fields of machine learning and artificial intelligence drive researchers to explore energy-efficient, brain-inspired new hardware. Reservoir computing encompasses recurrent neural networks for sequential data processing and matches the performance of other recurrent networks with less training and lower costs. However, traditional software-based neural networks suffer from high energy consumpt… ▽ More The fields of machine learning and artificial intelligence drive researchers to explore energy-efficient, brain-inspired new hardware. Reservoir computing encompasses recurrent neural networks for sequential data processing and matches the performance of other recurrent networks with less training and lower costs. However, traditional software-based neural networks suffer from high energy consumption due to computational demands and massive data transfer needs. Photonic reservoir computing overcomes this challenge with energy-efficient neuromorphic photonic integrated circuits or NeuroPICs. Here, we introduce a reservoir NeuroPIC used for modulation format identification in C-band telecommunication network monitoring. It is built on a silicon-on-insulator platform with a 4-port reservoir architecture consisting of a set of physical nodes connected via delay lines. We comprehensively describe the NeuroPIC design and fabrication, experimentally demonstrate its performance, and compare it with simulations. The NeuroPIC incorporates non-linearity through a simple digital readout and achieves close to 100% accuracy in identifying several configurations of quadrature amplitude modulation formats transmitted over 20 km of optical fiber at 32 GBaud symbol rate. The NeuroPIC performance is robust against fabrication imperfections like waveguide propagation loss, phase randomization, etc. and delay line length variations. Furthermore, the experimental results exceeded numerical simulations, which we attribute to enhanced signal interference in the experimental NeuroPIC output. Our energy-efficient photonic approach has the potential for high-speed temporal data processing in a variety of applications. △ Less

Submitted 19 June, 2024; originally announced June 2024.

Comments: 32 pages, including supporting information

arXiv:2312.10093 [pdf]

doi 10.4126/FRL01-006461895

Verbesserung des Record Linkage für die Gesundheitsforschung in Deutschland

Authors: Timm Intemann, Knut Kaulke, Dennis-Kenji Kipker, Vanessa Lettieri, Christoph Stallmann, Carsten O. Schmidt, Lars Geidel, Martin Bialke, Christopher Hampf, Dana Stahl, Martin Lablans, Florens Rohde, Martin Franke, Klaus Kraywinkel, Joachim Kieschke, Sebastian Bartholomäus, Anatol-Fiete Näher, Galina Tremper, Mohamed Lambarki, Stefanie March, Fabian Prasser, Anna Christine Haber, Johannes Drepper, Irene Schlünder, Toralf Kirsten , et al. (5 additional authors not shown)

Abstract: Record linkage means linking data from multiple sources. This approach enables the answering of scientific questions that cannot be addressed using single data sources due to limited variables. The potential of linked data for health research is enormous, as it can enhance prevention, treatment, and population health policies. Due the sensitivity of health data, there are strict legal requirements… ▽ More Record linkage means linking data from multiple sources. This approach enables the answering of scientific questions that cannot be addressed using single data sources due to limited variables. The potential of linked data for health research is enormous, as it can enhance prevention, treatment, and population health policies. Due the sensitivity of health data, there are strict legal requirements to prevent potential misuse. However, these requirements also limit the use of health data for research, thereby hindering innovations in prevention and care. Also, comprehensive Record linkage in Germany is often challenging due to lacking unique personal identifiers or interoperable solutions. Rather, the need to protect data is often weighed against the importance of research aiming at healthcare enhancements: for instance, data protection officers may demand the informed consent of individual study participants for data linkage, even when this is not mandatory. Furthermore, legal frameworks may be interpreted differently on varying occasions. Given both, technical and legal challenges, record linkage for health research in Germany falls behind the standards of other European countries. To ensure successful record linkage, case-specific solutions must be developed, tested, and modified as necessary before implementation. This paper discusses limitations and possibilities of various data linkage approaches tailored to different use cases in compliance with the European General Data Protection Regulation. It further describes requirements for achieving a more research-friendly approach to linking health data records in Germany. Additionally, it provides recommendations to legislators. The objective of this work is to improve record linkage for health research in Germany. △ Less

Submitted 14 December, 2023; originally announced December 2023.

Comments: in German language

arXiv:2309.02237 [pdf]

Sample Size in Natural Language Processing within Healthcare Research

Authors: Jaya Chaturvedi, Diana Shamsutdinova, Felix Zimmer, Sumithra Velupillai, Daniel Stahl, Robert Stewart, Angus Roberts

Abstract: Sample size calculation is an essential step in most data-based disciplines. Large enough samples ensure representativeness of the population and determine the precision of estimates. This is true for most quantitative studies, including those that employ machine learning methods, such as natural language processing, where free-text is used to generate predictions and classify instances of text. W… ▽ More Sample size calculation is an essential step in most data-based disciplines. Large enough samples ensure representativeness of the population and determine the precision of estimates. This is true for most quantitative studies, including those that employ machine learning methods, such as natural language processing, where free-text is used to generate predictions and classify instances of text. Within the healthcare domain, the lack of sufficient corpora of previously collected data can be a limiting factor when determining sample sizes for new studies. This paper tries to address the issue by making recommendations on sample sizes for text classification tasks in the healthcare domain. Models trained on the MIMIC-III database of critical care records from Beth Israel Deaconess Medical Center were used to classify documents as having or not having Unspecified Essential Hypertension, the most common diagnosis code in the database. Simulations were performed using various classifiers on different sample sizes and class proportions. This was repeated for a comparatively less common diagnosis code within the database of diabetes mellitus without mention of complication. Smaller sample sizes resulted in better results when using a K-nearest neighbours classifier, whereas larger sample sizes provided better results with support vector machines and BERT models. Overall, a sample size larger than 1000 was sufficient to provide decent performance metrics. The simulations conducted within this study provide guidelines that can be used as recommendations for selecting appropriate sample sizes and class proportions, and for predicting expected performance, when building classifiers for textual healthcare data. The methodology used here can be modified for sample size estimates calculations with other datasets. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: Submitted to Journal of Biomedical Informatics

arXiv:2308.14835 [pdf, other]

AI ATAC 1: An Evaluation of Prominent Commercial Malware Detectors

Authors: Robert A. Bridges, Brian Weber, Justin M. Beaver, Jared M. Smith, Miki E. Verma, Savannah Norem, Kevin Spakes, Cory Watson, Jeff A. Nichols, Brian Jewell, Michael. D. Iannacone, Chelsey Dunivan Stahl, Kelly M. T. Huffer, T. Sean Oesch

Abstract: This work presents an evaluation of six prominent commercial endpoint malware detectors, a network malware detector, and a file-conviction algorithm from a cyber technology vendor. The evaluation was administered as the first of the Artificial Intelligence Applications to Autonomous Cybersecurity (AI ATAC) prize challenges, funded by / completed in service of the US Navy. The experiment employed 1… ▽ More This work presents an evaluation of six prominent commercial endpoint malware detectors, a network malware detector, and a file-conviction algorithm from a cyber technology vendor. The evaluation was administered as the first of the Artificial Intelligence Applications to Autonomous Cybersecurity (AI ATAC) prize challenges, funded by / completed in service of the US Navy. The experiment employed 100K files (50/50% benign/malicious) with a stratified distribution of file types, including ~1K zero-day program executables (increasing experiment size two orders of magnitude over previous work). We present an evaluation process of delivering a file to a fresh virtual machine donning the detection technology, waiting 90s to allow static detection, then executing the file and waiting another period for dynamic detection; this allows greater fidelity in the observational data than previous experiments, in particular, resource and time-to-detection statistics. To execute all 800K trials (100K files $\times$ 8 tools), a software framework is designed to choreographed the experiment into a completely automated, time-synced, and reproducible workflow with substantial parallelization. A cost-benefit model was configured to integrate the tools' recall, precision, time to detection, and resource requirements into a single comparable quantity by simulating costs of use. This provides a ranking methodology for cyber competitions and a lens through which to reason about the varied statistical viewpoints of the results. These statistical and cost-model results provide insights on state of commercial malware detection. △ Less

Submitted 28 August, 2023; originally announced August 2023.

arXiv:2306.10334 [pdf]

A Machine Learning Approach for Predicting Deterioration in Alzheimer's Disease

Authors: Henry Musto, Daniel Stamate, Ida Pu, Daniel Stahl

Abstract: This paper explores deterioration in Alzheimers Disease using Machine Learning. Subjects were split into two datasets based on baseline diagnosis (Cognitively Normal, Mild Cognitive Impairment), with outcome of deterioration at final visit (a binomial essentially yes/no categorisation) using data from the Alzheimers Disease Neuroimaging Initiative (demographics, genetics, CSF, imaging, and neurops… ▽ More This paper explores deterioration in Alzheimers Disease using Machine Learning. Subjects were split into two datasets based on baseline diagnosis (Cognitively Normal, Mild Cognitive Impairment), with outcome of deterioration at final visit (a binomial essentially yes/no categorisation) using data from the Alzheimers Disease Neuroimaging Initiative (demographics, genetics, CSF, imaging, and neuropsychological testing etc). Six machine learning models, including gradient boosting, were built, and evaluated on these datasets using a nested crossvalidation procedure, with the best performing models being put through repeated nested cross-validation at 100 iterations. We were able to demonstrate good predictive ability using CART predicting which of those in the cognitively normal group deteriorated and received a worse diagnosis (AUC = 0.88). For the mild cognitive impairment group, we were able to achieve good predictive ability for deterioration with Elastic Net (AUC = 0.76). △ Less

Submitted 17 June, 2023; originally announced June 2023.

arXiv:2306.10330 [pdf]

Predicting Risk of Dementia with Survival Machine Learning and Statistical Methods: Results on the English Longitudinal Study of Ageing Cohort

Authors: Daniel Stamate, Henry Musto, Olesya Ajnakina, Daniel Stahl

Abstract: Machine learning models that aim to predict dementia onset usually follow the classification methodology ignoring the time until an event happens. This study presents an alternative, using survival analysis within the context of machine learning techniques. Two survival method extensions based on machine learning algorithms of Random Forest and Elastic Net are applied to train, optimise, and valid… ▽ More Machine learning models that aim to predict dementia onset usually follow the classification methodology ignoring the time until an event happens. This study presents an alternative, using survival analysis within the context of machine learning techniques. Two survival method extensions based on machine learning algorithms of Random Forest and Elastic Net are applied to train, optimise, and validate predictive models based on the English Longitudinal Study of Ageing ELSA cohort. The two survival machine learning models are compared with the conventional statistical Cox proportional hazard model, proving their superior predictive capability and stability on the ELSA data, as demonstrated by computationally intensive procedures such as nested cross-validation and Monte Carlo validation. This study is the first to apply survival machine learning to the ELSA data, and demonstrates in this case the superiority of AI based predictive modelling approaches over the widely employed Cox statistical approach in survival analysis. Implications, methodological considerations, and future research directions are discussed. △ Less

Submitted 17 June, 2023; originally announced June 2023.

Comments: Henry Musto is joint first author

arXiv:2306.10326 [pdf]

Predicting Alzheimers Disease Diagnosis Risk over Time with Survival Machine Learning on the ADNI Cohort

Authors: Henry Musto, Daniel Stamate, Ida Pu, Daniel Stahl

Abstract: The rise of Alzheimers Disease worldwide has prompted a search for efficient tools which can be used to predict deterioration in cognitive decline leading to dementia. In this paper, we explore the potential of survival machine learning as such a tool for building models capable of predicting not only deterioration but also the likely time to deterioration. We demonstrate good predictive ability (… ▽ More The rise of Alzheimers Disease worldwide has prompted a search for efficient tools which can be used to predict deterioration in cognitive decline leading to dementia. In this paper, we explore the potential of survival machine learning as such a tool for building models capable of predicting not only deterioration but also the likely time to deterioration. We demonstrate good predictive ability (0.86 C-Index), lending support to its use in clinical investigation and prediction of Alzheimers Disease risk. △ Less

Submitted 17 June, 2023; originally announced June 2023.

arXiv:2104.08600 [pdf]

doi 10.21437/Interspeech.2021-1240

Remote smartphone-based speech collection: acceptance and barriers in individuals with major depressive disorder

Authors: Judith Dineley, Grace Lavelle, Daniel Leightley, Faith Matcham, Sara Siddi, Maria Teresa Peñarrubia-María, Katie M. White, Alina Ivan, Carolin Oetzmann, Sara Simblett, Erin Dawe-Lane, Stuart Bruce, Daniel Stahl, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Amos A. Folarin, Josep Maria Haro, Til Wykes, Richard J. B. Dobson, Vaibhav A. Narayan, Matthew Hotopf, Björn W. Schuller, Nicholas Cummins, The RADAR-CNS Consortium

Abstract: The ease of in-the-wild speech recording using smartphones has sparked considerable interest in the combined application of speech, remote measurement technology (RMT) and advanced analytics as a research and healthcare tool. For this to be realised, the acceptability of remote speech collection to the user must be established, in addition to feasibility from an analytical perspective. To understa… ▽ More The ease of in-the-wild speech recording using smartphones has sparked considerable interest in the combined application of speech, remote measurement technology (RMT) and advanced analytics as a research and healthcare tool. For this to be realised, the acceptability of remote speech collection to the user must be established, in addition to feasibility from an analytical perspective. To understand the acceptance, facilitators, and barriers of smartphone-based speech recording, we invited 384 individuals with major depressive disorder (MDD) from the Remote Assessment of Disease and Relapse - Central Nervous System (RADAR-CNS) research programme in Spain and the UK to complete a survey on their experiences recording their speech. In this analysis, we demonstrate that study participants were more comfortable completing a scripted speech task than a free speech task. For both speech tasks, we found depression severity and country to be significant predictors of comfort. Not seeing smartphone notifications of the scheduled speech tasks, low mood and forgetfulness were the most commonly reported obstacles to providing speech recordings. △ Less

Submitted 30 August, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

Comments: Accepted to Interspeech 2021. Formatting changes + minor language edits

ACM Class: H.1.2

Journal ref: Proc. Interspeech 2021, pp. 631-635

arXiv:1707.03354 [pdf]

doi 10.1007/s40473-017-0104-y

Computational Psychiatry in Borderline Personality Disorder

Authors: Sarah K Fineberg, Dylan Stahl, Philip Corlett

Abstract: Purpose of review: We review the literature on the use and potential use of computational psychiatry methods in Borderline Personality Disorder. Recent findings: Computational approaches have been used in psychiatry to increase our understanding of the molecular, circuit, and behavioral basis of mental illness. This is of particular interest in BPD, where the collection of ecologically valid dat… ▽ More Purpose of review: We review the literature on the use and potential use of computational psychiatry methods in Borderline Personality Disorder. Recent findings: Computational approaches have been used in psychiatry to increase our understanding of the molecular, circuit, and behavioral basis of mental illness. This is of particular interest in BPD, where the collection of ecologically valid data, especially in interpersonal settings, is becoming more common and more often subject to quantification. Methods that test learning and memory in social contexts, collect data from real-world settings, and relate behavior to molecular and circuit networks are yielding data of particular interest. Summary: Research in BPD should focus on collaborative efforts to design and interpret experiments with direct relevance to core BPD symptoms and potential for translation to the clinic. △ Less

Submitted 5 March, 2018; v1 submitted 11 July, 2017; originally announced July 2017.

Journal ref: Current Behavioral Neuroscience Reports, March 2017, Vol 4, Issue 1, pp31-40

arXiv:1705.09873 [pdf]

Individuals with Borderline Personality Disorder Show Larger Preferred Social Distance in Live Dyadic Interactions

Authors: Sarah K Fineberg, Jacob Leavitt, Christopher D Landry, Eli S Neustadter, Rebecca Lesser, Dylan Stahl, Sasha Deutsch-Link, Philip R Corlett

Abstract: Personal space (PS) regulation is a key component of effective social engagement. PS varies among individuals and is regulated by brain circuits involving the amygdala and the frontoparietal network. Others have reported that simulated PS intrusions suggest larger preferred interpersonal distance (PID) and a central role of amygdala hyperactivity in PS regulation in Borderline Personality Disorder… ▽ More Personal space (PS) regulation is a key component of effective social engagement. PS varies among individuals and is regulated by brain circuits involving the amygdala and the frontoparietal network. Others have reported that simulated PS intrusions suggest larger preferred interpersonal distance (PID) and a central role of amygdala hyperactivity in PS regulation in Borderline Personality Disorder (BPD). This study is the first report of live interpersonal distance preferences and relation to specific symptoms in BPD. We found a 2-fold larger PID in BPD than control (n=30, n=23). There were no significant differences in PID in BPD subject by medication status or pre-study diagnosis, and no significant correlations between PID and intensity of BPD, mood, anxiety, impulsive, or psychotic symptoms. In summary, PID is larger in BPD than control subjects. Unexpectedly, BPD subject PID did not differ in by medication status and did not correlate with intensity of any of the symptom types tested. We discuss these findings in context of severe attachment disturbances in BPD and the relationship between metaphoric social distance in the attachment framework. Future work is needed to identify neural circuits underlying PS regulation in BPD, individual differences in attachment, and relationship to symptom trajectory. △ Less

Submitted 27 May, 2017; originally announced May 2017.

Comments: pre-print

arXiv:1701.05461

Convergence and Divergence of Mersenne Variations of the $3x+1$ Function

Authors: Denver Stahl

Abstract: The Collatz problem is one of many names (the Collatz Problem, the Syracuse Problem, the Hailstone Problem, the 3x+1 problem). Most commonly, however, the problem goes by either the 3x+1 problem or the Collatz problem. In addition to having many names, the Collatz problem has many variations, such as those in the form introduced by Jeffrey Lagarias in 1985. This writing discusses several variation… ▽ More The Collatz problem is one of many names (the Collatz Problem, the Syracuse Problem, the Hailstone Problem, the 3x+1 problem). Most commonly, however, the problem goes by either the 3x+1 problem or the Collatz problem. In addition to having many names, the Collatz problem has many variations, such as those in the form introduced by Jeffrey Lagarias in 1985. This writing discusses several variations of the Collatz function which involve the Mersenne numbers. Following that, we observe the convergent cycles of these functions which we can then relate back to the original Collatz 3x+1 function. Lastly, we give a proof of the No Divergent Trajectories Theorem and show why the same cannot be shown for similar functions. △ Less

Submitted 3 May, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

Comments: Due to current developments, results of this paper have been proven incorrect

Showing 1–11 of 11 results for author: Stahl, D