-
Multi-target and multi-stage liver lesion segmentation and detection in multi-phase computed tomography scans
Authors:
Abdullah F. Al-Battal,
Soan T. M. Duong,
Van Ha Tang,
Quang Duc Tran,
Steven Q. H. Truong,
Chien Phan,
Truong Q. Nguyen,
Cheolhong An
Abstract:
Multi-phase computed tomography (CT) scans use contrast agents to highlight different anatomical structures within the body to improve the probability of identifying and detecting anatomical structures of interest and abnormalities such as liver lesions. Yet, detecting these lesions remains a challenging task as these lesions vary significantly in their size, shape, texture, and contrast with respect to surrounding tissue. Therefore, radiologists need extensive experience to be able to identify and detect these lesions. Segmentation-based neural networks can assist radiologists with this task. Current state-of-the-art lesion segmentation networks use the encoder-decoder design paradigm based on the UNet architecture where the multi-phase CT scan volume is fed to the network as a multi-channel input. Although this approach utilizes information from all the phases and outperforms single-phase segmentation networks, we demonstrate that its performance is not optimal and can be further improved by incorporating the learning from models trained on each single phase individually. Our approach comprises three stages. The first stage identifies the regions within the liver where there might be lesions at three different scales (4, 8, and 16 mm). The second stage includes the main segmentation model trained using all the phases as well as a segmentation model trained on each of the phases individually. The third stage uses the multi-phase CT volumes together with the predictions from each of the segmentation models to generate the final segmentation map. Overall, our approach improves relative liver lesion segmentation performance by 1.6% while reducing performance variability across subjects by 8% when compared to the current state-of-the-art models.
Submitted 17 April, 2024;
originally announced April 2024.
-
Searching for Intermediate Mass Black Holes in Globular Clusters Through Tidal Disruption Events
Authors:
Vivian L. Tang,
Piero Madau,
Elisa Bortolas,
Eric W. Peng
Abstract:
Intermediate mass black holes (IMBHs) may be the link between stellar mass holes and the supermassive variety in the nuclei of galaxies, and globular clusters (GCs) may be one of the most promising environments for their formation. Here we carry out a pilot study of the observability of tidal disruption events (TDEs) from 10^3 Msun < M_BH < 10^5 Msun IMBHs embedded in stellar cusps at the center of GCs. We model the long super-Eddington accretion phase and ensuing optical flare, and derive the disruption rate of main-sequence stars as a function of black hole mass and GC properties with the help of a 1D Fokker-Planck approach. The photospheric emission of the adiabatically expanding outflow dominates the observable radiation and peaks in the NUV/optical bands, outshining the brightness of the (old) stellar population of GCs in Virgo for a period of months to years. A search for TDE events in a sample of nearly 4,000 GCs observed at multiple epochs by the Next Generation Virgo Cluster Survey (NGVS) yields null results. Given our model predictions, this sample is too small to set stringent constraints on the present-day occupation fraction of GCs hosting IMBHs. Naturally, better simulations of the properties of the cluster central stellar distribution, TDE light curves and rates, together with larger surveys of GCs are all needed to gain deeper insights into the presence of IMBHs in GCs.
Submitted 19 December, 2023;
originally announced December 2023.
-
Synthesizing Affective Neurophysiological Signals Using Generative Models: A Review Paper
Authors:
Alireza F. Nia,
Vanessa Tang,
Gonzalo Maso Talou,
Mark Billinghurst
Abstract:
The integration of emotional intelligence in machines is an important step in advancing human-computer interaction. This demands the development of reliable end-to-end emotion recognition systems. However, the scarcity of public affective datasets presents a challenge. In this literature review, we emphasize the use of generative models to address this issue in neurophysiological signals, particularly Electroencephalogram (EEG) and Functional Near-Infrared Spectroscopy (fNIRS). We provide a comprehensive analysis of different generative models used in the field, examining their input formulation, deployment strategies, and methodologies for evaluating the quality of synthesized data. This review serves as a comprehensive overview, offering insights into the advantages, challenges, and promising future directions in the application of generative models in emotion recognition systems. Through this review, we aim to facilitate the progression of neurophysiological data augmentation, thereby supporting the development of more efficient and reliable emotion recognition systems.
Submitted 5 June, 2023;
originally announced June 2023.
-
Managed Geo-Distributed Feature Store: Architecture and System Design
Authors:
Anya Li,
Bhala Ranganathan,
Feng Pan,
Mickey Zhang,
Qianjun Xu,
Runhan Li,
Sethu Raman,
Shail Paragbhai Shah,
Vivienne Tang
Abstract:
Companies are using machine learning to solve real-world problems and are developing hundreds to thousands of features in the process. They are building feature engineering pipelines as part of the MLOps life cycle to transform data from various data sources and materialize the same for future consumption. Without feature stores, different teams across various business groups would maintain the above process independently, which can lead to conflicting and duplicated features in the system. Data scientists find it hard to search for and reuse existing features, and it is painful to maintain version control. Furthermore, feature correctness violations related to online (inferencing) versus offline (training) skews and data leakage are common. Although the machine learning community has extensively discussed the need for feature stores and their purpose, this paper aims to capture the core architectural components that make up a managed feature store and to share the design learning in building such a system.
Submitted 31 May, 2023;
originally announced May 2023.
-
FNetAR: Mixing Tokens with Autoregressive Fourier Transforms
Authors:
Tim Lou,
Michael Park,
Mohammad Ramezanali,
Vincent Tang
Abstract:
In this note we examine the autoregressive generalization of the FNet algorithm, in which self-attention layers from the standard Transformer architecture are substituted with a trivial sparse-uniform sampling procedure based on Fourier transforms. Using the Wikitext-103 benchmark, we demonstrate that FNetAR retains state-of-the-art performance (25.8 ppl) on the task of causal language modeling compared to a Transformer-XL baseline (24.2 ppl) with only half the number of self-attention layers, thus providing further evidence for the superfluity of deep neural networks with heavily compounded attention mechanisms. The autoregressive Fourier transform could likely be used for parameter reduction on most Transformer-based time-series prediction models.
Submitted 22 July, 2021;
originally announced July 2021.
-
Applying Machine Learning to Crowd-sourced Data from Earthquake Detective
Authors:
Omkar Ranadive,
Suzan van der Lee,
Vivian Tang,
Kevin Chao
Abstract:
Dynamically triggered earthquakes and tremor generate two classes of weak seismic signals whose detection, identification, and authentication traditionally call for laborious analyses. Machine learning (ML) has grown in recent years to be a powerful efficiency-boosting tool in geophysical analyses, including the detection of specific signals in time series. However, detecting weak signals that are buried in noise challenges ML algorithms, in part because ubiquitous training data is not always available. Under these circumstances, ML can be as ineffective as human experts are inefficient. At this intersection of effectiveness and efficiency, we leverage a third tool that has grown in popularity over the past decade: citizen science. The citizen science project Earthquake Detective leverages the eyes and ears of volunteers to detect and classify weak signals in seismograms from potentially dynamically triggered (PDT) events. Here, we present the Earthquake Detective data set, a crowd-sourced set of labels on PDT earthquakes and tremor. We apply ML to classify these PDT seismic events and explore the challenges faced in segregating and classifying such weak signals. We confirm that with an image- and wavelet-based algorithm, ML can detect signals from small earthquakes. In addition, we report that our ML algorithm can also detect signals from PDT tremor, which has not been previously demonstrated. The citizen science data set of classifications and ML code are available online.
Submitted 15 June, 2022; v1 submitted 4 November, 2020;
originally announced November 2020.
-
Towards defining reference materials for extracellular vesicle size, concentration, refractive index and epitope abundance
Authors:
Joshua A. Welsh,
Edwin van der Pol,
Britta A. Bettin,
David R. F. Carter,
An Hendrix,
Metka Lenassi,
Marc-André Langlois,
Alicia Llorente,
Arthur S. van de Nes,
Rienk Nieuwland,
Vera Tang,
Lili Wang,
Kenneth W. Witwer,
Jennifer C. Jones
Abstract:
Accurate characterization of extracellular vesicles (EVs) is critical to explore their diagnostic and therapeutic applications. As the EV research field has developed, so too have the techniques used to characterize them. The development of reference materials is required for the standardization of these techniques. This work, initiated from the ISEV 2017 Biomarker Workshop in Birmingham, UK, and with further discussion during the ISEV 2019 Standardization Workshop in Ghent, Belgium, sets out to elucidate which reference materials are required and which are currently available to standardize commonly used analysis platforms for characterizing EV size, concentration, refractive index, and epitope expression. Due to their predominant use, a particular focus is placed on the optical methods nanoparticle tracking analysis and flow cytometry.
Submitted 20 April, 2020;
originally announced April 2020.
-
Calibrated Intervention and Containment of the COVID-19 Pandemic
Authors:
Liang Tian,
Xuefei Li,
Fei Qi,
Qian-Yuan Tang,
Viola Tang,
Jiang Liu,
Zhiyuan Li,
Xingye Cheng,
Xuanxuan Li,
Yingchen Shi,
Haiguang Liu,
Lei-Han Tang
Abstract:
Within a short period of time, COVID-19 grew into a world-wide pandemic. Transmission by pre-symptomatic and asymptomatic viral carriers rendered intervention and containment of the disease extremely challenging. Based on reported infection case studies, we construct an epidemiological model that focuses on transmission around the symptom onset. The model is calibrated against incubation period and pairwise transmission statistics during the initial outbreaks of the pandemic outside Wuhan with minimal non-pharmaceutical interventions. Mathematical treatment of the model yields explicit expressions for the size of latent and pre-symptomatic subpopulations during the exponential growth phase, with the local epidemic growth rate as input. We then explore reduction of the basic reproduction number R_0 through specific disease control measures such as contact tracing, testing, social distancing, wearing masks and sheltering in place. When these measures are implemented in combination, their effects on R_0 multiply. We also compare our model behaviour to the first wave of COVID-19 spread in various affected regions and highlight generic and less generic features of the pandemic development.
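The statement that the effects of combined measures on R_0 multiply can be illustrated with a minimal sketch. It assumes each measure is summarized by a single fractional reduction in transmission; the function name `combined_r0`, the baseline value, and the per-measure reductions are hypothetical and are not taken from the paper.

```python
from math import prod


def combined_r0(r0, reductions):
    """Scale a baseline R_0 by the product of (1 - reduction) over all measures.

    Assumption: each measure acts as an independent multiplicative factor,
    expressed as a fractional reduction in [0, 1].
    """
    reductions = list(reductions)
    for r in reductions:
        if not 0.0 <= r <= 1.0:
            raise ValueError("each reduction must lie in [0, 1]")
    return r0 * prod(1.0 - r for r in reductions)


# Hypothetical values for illustration only (not from the paper).
baseline_r0 = 2.5
measures = {"contact_tracing": 0.2, "social_distancing": 0.5}
print(combined_r0(baseline_r0, measures.values()))
```

Under this assumption, two moderate measures together can bring the effective value close to the threshold of 1 even when neither does so alone.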
Submitted 17 November, 2020; v1 submitted 16 March, 2020;
originally announced March 2020.
-
Updated Results of a Solid-State Sensor Irradiation Study for ILC Extreme Forward Calorimetry
Authors:
George Courcoubetis,
Wyatt Crockett,
Vitaliy Fadeyev,
Thomas Kelley,
Forest Martinez-McKinney,
Bruce A. Schumm,
Edwin Spencer,
Vivian Tang,
Max Wilder
Abstract:
Detectors proposed for the International Linear Collider (ILC) incorporate a tungsten sampling calorimeter ('BeamCal') intended to reconstruct showers of electrons, positrons and photons that emerge from the interaction point of the collider with angles between 5 and 50 milliradians. For the innermost radius of this calorimeter, radiation doses at shower-max are expected to reach 100 MRad per year, primarily due to minimum-ionizing electrons and positrons that arise in the induced electromagnetic showers of e+e- 'beamstrahlung' pairs produced in the ILC beam-beam interaction. However, radiation damage to calorimeter sensors may be dominated by hadrons induced by nuclear interactions of shower photons, which are much more likely to contribute to the non-ionizing energy loss that has been observed to damage sensors exposed to hadronic radiation. We report here on the results of SLAC Experiment T-506, for which several different types of silicon diode and gallium-arsenide sensors were exposed to doses of radiation induced by showering electrons of energy 3.5-10.6 GeV. By embedding the sensor under irradiation within a tungsten radiator, the exposure incorporated hadronic species that would potentially contribute to the degradation of a sensor mounted in a precision sampling calorimeter. Depending on sensor technology, efficient charge collection was observed for doses as large as 220 MRad.
Submitted 25 March, 2015;
originally announced March 2015.