Showing 1–34 of 34 results for author: Loew, M

Searching in archive cs.
  1. arXiv:2405.03005  [pdf, other]

    cs.LG cs.AI

    Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints

    Authors: Siow Meng Low, Akshat Kumar

    Abstract: In safe Reinforcement Learning (RL), safety cost is typically defined as a function dependent on the immediate state and actions. In practice, safety constraints can often be non-Markovian due to the insufficient fidelity of state representation, and safety cost may not be known. We therefore address a general setting where safety labels (e.g., safe or unsafe) are associated with state-action traj…

    Submitted 5 May, 2024; originally announced May 2024.

  2. arXiv:2304.03081  [pdf, other]

    cs.LG cs.AI

    Safe MDP Planning by Learning Temporal Patterns of Undesirable Trajectories and Averting Negative Side Effects

    Authors: Siow Meng Low, Akshat Kumar, Scott Sanner

    Abstract: In safe MDP planning, a cost function based on the current state and action is often used to specify safety aspects. In the real world, often the state representation used may lack sufficient fidelity to specify such safety constraints. Operating based on an incomplete model can often produce unintended negative side effects (NSEs). To address these challenges, first, we associate safety signals w…

    Submitted 6 April, 2023; originally announced April 2023.

  3. arXiv:2303.04769  [pdf, other]

    cs.MS cs.PF

    SMaLL: A Software Framework for portable Machine Learning Libraries

    Authors: Upasana Sridhar, Nicholai Tukanov, Elliott Binder, Tze Meng Low, Scott McMillan, Martin D. Schatz

    Abstract: Interest in deploying Deep Neural Network (DNN) inference on edge devices has resulted in an explosion of the number and types of hardware platforms to use. While the high-level programming interface, such as TensorFlow, can be readily ported across different devices, high-performance inference implementations rely on a good mapping of the high-level interface to the target hardware platform. Comm…

    Submitted 8 March, 2023; originally announced March 2023.

    Comments: 14 pages, 12 figures

  4. CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects

    Authors: Ange Lou, Shuyue Guan, Murray Loew

    Abstract: Segmenting medical images accurately and reliably is important for disease diagnosis and treatment. It is a challenging task because of the wide variety of objects' sizes, shapes, and scanning modalities. Recently, many convolutional neural networks (CNN) have been designed for segmentation tasks and achieved great success. Few studies, however, have fully considered the sizes of objects, and thus…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2108.07368

    Journal ref: Journal of Medical Imaging 10(1), 014005 (18 February 2023)

  5. arXiv:2301.11272  [pdf, other]

    cs.CY stat.AP

    Location-based Activity Behavior Deviation Detection for Nursing Home using IoT Devices

    Authors: Billy Pik Lik Lau, Zann Koh, Yuren Zhou, Benny Kai Kiat Ng, Chau Yuen, Mui Lang Low

    Abstract: With the advancement of the Internet of Things (IoT) and pervasive computing applications, it provides a better opportunity to understand the behavior of the aging population. However, in a nursing home scenario, common sensors and techniques used to track an elderly living alone are not suitable. In this paper, we design a location-based tracking system for a four-story nursing home - The Salvatio…

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: 12 pages

  6. arXiv:2203.12679  [pdf, other]

    cs.AI cs.LG

    Sample-efficient Iterative Lower Bound Optimization of Deep Reactive Policies for Planning in Continuous MDPs

    Authors: Siow Meng Low, Akshat Kumar, Scott Sanner

    Abstract: Recent advances in deep learning have enabled optimization of deep reactive policies (DRPs) for continuous MDP planning by encoding a parametric policy as a deep neural network and exploiting automatic differentiation in an end-to-end model-based gradient descent framework. This approach has proven effective for optimizing DRPs in nonlinear continuous MDPs, but it requires a large number of sample…

    Submitted 23 March, 2022; originally announced March 2022.

  7. arXiv:2201.02771  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    A Sneak Attack on Segmentation of Medical Images Using Deep Neural Network Classifiers

    Authors: Shuyue Guan, Murray Loew

    Abstract: Instead of using current deep-learning segmentation models (like the UNet and variants), we approach the segmentation problem using trained Convolutional Neural Network (CNN) classifiers, which automatically extract important features from images for classification. Those extracted features can be visualized and formed into heatmaps using Gradient-weighted Class Activation Mapping (Grad-CAM). This…

    Submitted 27 January, 2022; v1 submitted 8 January, 2022; originally announced January 2022.

    Comments: 8 pages, 10 figures. Accepted by IEEE AIPR 2021 (Oral)

    Report number: 13

  8. arXiv:2110.01409  [pdf, other]

    cs.DC cs.PF

    Delayed Asynchronous Iterative Graph Algorithms

    Authors: Mark P. Blanco, Scott McMillan, Tze Meng Low

    Abstract: Iterative graph algorithms often compute intermediate values and update them as computation progresses. Updated output values are used as inputs for computations in current or subsequent iterations; hence the number of iterations required for values to converge can potentially reduce if the newest values are asynchronously made available to other updates computed in the same iteration. In a multi-…

    Submitted 29 September, 2021; originally announced October 2021.

    Comments: 6 pages, 6 figures, 2 tables, IEEE High Performance Extreme Computing (HPEC) Conference 2021

  9. arXiv:2109.05180  [pdf, other]

    cs.LG math.ST

    A Novel Intrinsic Measure of Data Separability

    Authors: Shuyue Guan, Murray Loew

    Abstract: In machine learning, the performance of a classifier depends on both the classifier model and the separability/complexity of datasets. To quantitatively measure the separability of datasets, we create an intrinsic measure -- the Distance-based Separability Index (DSI), which is independent of the classifier model. We consider the situation in which different classes of data are mixed in the same d…

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: 16 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2005.13120

  10. arXiv:2108.07368  [pdf]

    eess.IV cs.CV

    CaraNet: Context Axial Reverse Attention Network for Segmentation of Small Medical Objects

    Authors: Ange Lou, Shuyue Guan, Hanseok Ko, Murray Loew

    Abstract: Segmenting medical images accurately and reliably is important for disease diagnosis and treatment. It is a challenging task because of the wide variety of objects' sizes, shapes, and scanning modalities. Recently, many convolutional neural networks (CNN) have been designed for segmentation tasks and achieved great success. Few studies, however, have fully considered the sizes of objects, and thus…

    Submitted 13 January, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Accepted by SPIE Medical Imaging: Image Processing (oral presentation)

  11. arXiv:2106.09794  [pdf, other]

    cs.LG cs.CV cs.DM

    A Distance-based Separability Measure for Internal Cluster Validation

    Authors: Shuyue Guan, Murray Loew

    Abstract: To evaluate clustering results is a significant part of cluster analysis. Since there are no true class labels for clustering in typical unsupervised learning, many internal cluster validity indices (CVIs), which use predicted labels and data, have been created. Without true labels, to design an effective CVI is as difficult as to create a clustering method. And it is crucial to have more CVIs bec…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: It is an extended version of the paper: arXiv:2009.01328

  12. arXiv:2105.04075  [pdf]

    cs.CV eess.IV

    CFPNet-M: A Light-Weight Encoder-Decoder Based Network for Multimodal Biomedical Image Real-Time Segmentation

    Authors: Ange Lou, Shuyue Guan, Murray Loew

    Abstract: Currently, developments of deep learning techniques are providing instrumental to identify, classify, and quantify patterns in medical images. Segmentation is one of the important applications in medical image analysis. In this regard, U-Net is the predominant approach to medical image segmentation tasks. However, we found that those U-Net based models have limitations in several aspects, for exam…

    Submitted 30 May, 2021; v1 submitted 9 May, 2021; originally announced May 2021.

  13. arXiv:2103.12212  [pdf]

    cs.CV

    CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

    Authors: Ange Lou, Murray Loew

    Abstract: Real-time semantic segmentation is playing a more important role in computer vision, due to the growing demand for mobile devices and autonomous driving. Therefore, it is very important to achieve a good trade-off among performance, model size and inference speed. In this paper, we propose a Channel-wise Feature Pyramid (CFP) module to balance those factors. Based on the CFP module, we built CFPNe…

    Submitted 2 June, 2021; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: Accepted by ICIP 2021

  14. arXiv:2101.01386  [pdf, other]

    cs.CV cs.AI cs.LG

    Understanding the Ability of Deep Neural Networks to Count Connected Components in Images

    Authors: Shuyue Guan, Murray Loew

    Abstract: Humans can count very fast by subitizing, but slow substantially as the number of objects increases. Previous studies have shown a trained deep neural network (DNN) detector can count the number of objects in an amount of time that increases slowly with the number of objects. Such a phenomenon suggests the subitizing ability of DNNs, and unlike humans, it works equally well for large numbers. Many…

    Submitted 5 January, 2021; originally announced January 2021.

    Comments: 7 pages, 12 figures. Accepted by IEEE AIPR 2020 (Oral)

  15. arXiv:2011.00376  [pdf]

    eess.IV cs.CV

    Segmentation of Infrared Breast Images Using MultiResUnet Neural Network

    Authors: Ange Lou, Shuyue Guan, Nada Kamona, Murray Loew

    Abstract: Breast cancer is the second leading cause of death for women in the U.S. Early detection of breast cancer is key to higher survival rates of breast cancer patients. We are investigating infrared (IR) thermography as a noninvasive adjunct to mammography for breast cancer screening. IR imaging is radiation-free, pain-free, and non-contact. Automatic segmentation of the breast area from the acquired…

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: 6 pages. Accepted by IEEE AIPR 2019 (Oral)

  16. arXiv:2010.13380  [pdf, other]

    cs.LG cs.AI math.CO

    The training accuracy of two-layer neural networks: its estimation and understanding using random datasets

    Authors: Shuyue Guan, Murray Loew

    Abstract: Although the neural network (NN) technique plays an important role in machine learning, understanding the mechanism of NN models and the transparency of deep learning still require more basic research. In this study, we propose a novel theory based on space partitioning to estimate the approximate training accuracy for two-layer neural networks on random datasets without training. There appear to…

    Submitted 9 November, 2023; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: 11 pages, 6 figures. Accepted by IEEE AIPR 2023 (Oral)

  17. arXiv:2009.07974  [pdf]

    cs.LG cs.AI cs.CV stat.ML

    Analysis of Generalizability of Deep Neural Networks Based on the Complexity of Decision Boundary

    Authors: Shuyue Guan, Murray Loew

    Abstract: For supervised learning models, the analysis of generalization ability (generalizability) is vital because the generalizability expresses how well a model will perform on unseen data. Traditional generalization methods, such as the VC dimension, do not apply to deep neural network (DNN) models. Thus, new theories to explain the generalizability of DNNs are required. In this study, we hypothesize t…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 7 pages, 11 figures. Accepted by ICMLA 2020

    Journal ref: 19th IEEE International Conference on Machine Learning and Applications (ICMLA), 2020, pp. 101-106

  18. Towards an Objective Metric for the Performance of Exact Triangle Count

    Authors: Mark P. Blanco, Scott McMillan, Tze Meng Low

    Abstract: The performance of graph algorithms is often measured in terms of the number of traversed edges per second (TEPS). However, this performance metric is inadequate for a graph operation such as exact triangle counting. In triangle counting, execution times on graphs with a similar number of edges can be distinctly different as demonstrated by results from the past Graph Challenge entries. We discuss…

    Submitted 29 September, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

    Comments: 6 Pages, 2020 IEEE High Performance Extreme Computing Conference(HPEC)

  19. Exploration of Fine-Grained Parallelism for Load Balancing Eager K-truss on GPU and CPU

    Authors: Mark Blanco, Tze Meng Low, Kyungjoo Kim

    Abstract: In this work we present a performance exploration on Eager K-truss, a linear-algebraic formulation of the K-truss graph algorithm. We address performance issues related to load imbalance of parallel tasks in symmetric, triangular graphs by presenting a fine-grained parallel approach to executing the support computation. This approach also increases available parallelism, making it amenable to GPU…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: 2019 IEEE High Performance Extreme Computing Conference (HPEC)

  20. arXiv:2009.01328  [pdf]

    cs.LG cs.CV stat.ML

    An Internal Cluster Validity Index Using a Distance-based Separability Measure

    Authors: Shuyue Guan, Murray Loew

    Abstract: To evaluate clustering results is a significant part of cluster analysis. There are no true class labels for clustering in typical unsupervised learning. Thus, a number of internal evaluations, which use predicted labels and data, have been created. They are also named internal cluster validity indices (CVIs). Without true labels, to design an effective CVI is not simple because it is similar to c…

    Submitted 4 January, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

    Comments: 8 pages, 4 figures. Accepted by IEEE ICTAI 2020 (Long Paper & Oral Presentation)

    Journal ref: IEEE 32nd International Conference on Tools with Artificial Intelligence (ICTAI), 2020, pp. 827-834

  21. arXiv:2007.14638  [pdf, other]

    eess.IV cs.CV cs.LG

    COVID-19 CT Image Synthesis with a Conditional Generative Adversarial Network

    Authors: Yifan Jiang, Han Chen, Murray Loew, Hanseok Ko

    Abstract: Coronavirus disease 2019 (COVID-19) is an ongoing global pandemic that has spread rapidly since December 2019. Real-time reverse transcription polymerase chain reaction (rRT-PCR) and chest computed tomography (CT) imaging both play an important role in COVID-19 diagnosis. Chest CT imaging offers the benefits of quick reporting, a low cost, and high sensitivity for the detection of pulmonary infect…

    Submitted 2 December, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: Accepted by IEEE Journal of Biomedical and Health Informatics (J-BHI)

  22. arXiv:2006.00414  [pdf]

    eess.IV cs.CV

    DC-UNet: Rethinking the U-Net Architecture with Dual Channel Efficient CNN for Medical Images Segmentation

    Authors: Ange Lou, Shuyue Guan, Murray Loew

    Abstract: Recently, deep learning has become much more popular in computer vision area. The Convolution Neural Network (CNN) has brought a breakthrough in images segmentation areas, especially, for medical images. In this regard, U-Net is the predominant approach to medical image segmentation task. The U-Net not only performs well in segmenting multimodal medical images generally, but also in some tough cas…

    Submitted 30 May, 2020; originally announced June 2020.

  23. arXiv:2005.13120  [pdf]

    cs.LG stat.ML

    Data Separability for Neural Network Classifiers and the Development of a Separability Index

    Authors: Shuyue Guan, Murray Loew, Hanseok Ko

    Abstract: In machine learning, the performance of a classifier depends on both the classifier model and the dataset. For a specific neural network classifier, the training process varies with the training set used; some training data make training accuracy fast converged to high values, while some data may lead to slowly converged to lower accuracy. To quantify this phenomenon, we created the Distance-based…

    Submitted 28 May, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: 11 pages, 12 figures

  24. arXiv:2002.12345  [pdf, other]

    cs.CV cs.LG eess.IV

    A Novel Measure to Evaluate Generative Adversarial Networks Based on Direct Analysis of Generated Images

    Authors: Shuyue Guan, Murray Loew

    Abstract: The Generative Adversarial Network (GAN) is a state-of-the-art technique in the field of deep learning. A number of recent papers address the theory and applications of GANs in various fields of image processing. Fewer studies, however, have directly evaluated GAN outputs. Those that have been conducted focused on using classification performance, e.g., Inception Score (IS) and statistical metrics…

    Submitted 7 April, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: 16 pages, 11 figures. Accepted by the Neural Computing and Applications journal

    Report number: NCAA-D-20-03011

    Journal ref: Neural Comput & Applic 33, 13921-13936 (2021)

  25. arXiv:1912.08350  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Automating Vitiligo Skin Lesion Segmentation Using Convolutional Neural Networks

    Authors: Makena Low, Priyanka Raina

    Abstract: For several skin conditions such as vitiligo, accurate segmentation of lesions from skin images is the primary measure of disease progression and severity. Existing methods for vitiligo lesion segmentation require manual intervention. Unfortunately, manual segmentation is time and labor-intensive, as well as irreproducible between physicians. We introduce a convolutional neural network (CNN) that…

    Submitted 16 December, 2019; originally announced December 2019.

  26. Delta-stepping SSSP: from Vertices and Edges to GraphBLAS Implementations

    Authors: Upasana Sridhar, Mark Blanco, Rahul Mayuranath, Daniele G. Spampinato, Tze Meng Low, Scott McMillan

    Abstract: GraphBLAS is an interface for implementing graph algorithms. Algorithms implemented using the GraphBLAS interface are cast in terms of linear algebra-like operations. However, many graph algorithms are canonically described in terms of operations on vertices and/or edges. Despite the known duality between these two representations, the differences in the way algorithms are described using the two…

    Submitted 16 September, 2020; v1 submitted 15 November, 2019; originally announced November 2019.

    Comments: 10 pages, 4 figures, IPDPSW GRAPL 2019 Workshop

    Journal ref: IEEE International Parallel and Distributed Processing Symposium Workshops, 2019, pp 241 to 250

  27. arXiv:1909.00303  [pdf, other]

    cs.CL

    Higher-order Comparisons of Sentence Encoder Representations

    Authors: Mostafa Abdou, Artur Kulmizev, Felix Hill, Daniel M. Low, Anders Søgaard

    Abstract: Representational Similarity Analysis (RSA) is a technique developed by neuroscientists for comparing activity patterns of different measurement modalities (e.g., fMRI, electrophysiology, behavior). As a framework, RSA has several advantages over existing approaches to interpretation of language encoders based on probing or diagnostic classification: namely, it does not require large training sampl…

    Submitted 5 September, 2019; v1 submitted 31 August, 2019; originally announced September 2019.

    Comments: EMNLP 2019

  28. arXiv:1904.10119  [pdf, other]

    cs.MS cs.DC

    A Flexible Framework for Parallel Multi-Dimensional DFTs

    Authors: Doru Thom Popovici, Martin D. Schatz, Franz Franchetti, Tze Meng Low

    Abstract: Multi-dimensional discrete Fourier transforms (DFT) are typically decomposed into multiple 1D transforms. Hence, parallel implementations of any multi-dimensional DFT focus on parallelizing within or across the 1D DFT. Existing DFT packages exploit the inherent parallelism across the 1D DFTs and offer rigid frameworks, that cannot be extended to incorporate both forms of parallelism and various da…

    Submitted 22 December, 2019; v1 submitted 22 April, 2019; originally announced April 2019.

  29. arXiv:1903.01042  [pdf, other]

    cs.IT cs.DC cs.LG cs.PF

    CodeNet: Training Large Scale Neural Networks in Presence of Soft-Errors

    Authors: Sanghamitra Dutta, Ziqian Bai, Tze Meng Low, Pulkit Grover

    Abstract: This work proposes the first strategy to make distributed training of neural networks resilient to computing errors, a problem that has remained unsolved despite being first posed in 1956 by von Neumann. He also speculated that the efficiency and reliability of the human brain is obtained by allowing for low power but error-prone components with redundancy for error-resilience. It is surprising th…

    Submitted 3 March, 2019; originally announced March 2019.

    Comments: Currently under review

  30. A Unified Coded Deep Neural Network Training Strategy Based on Generalized PolyDot Codes for Matrix Multiplication

    Authors: Sanghamitra Dutta, Ziqian Bai, Haewon Jeong, Tze Meng Low, Pulkit Grover

    Abstract: This paper has two contributions. First, we propose a novel coded matrix multiplication technique called Generalized PolyDot codes that advances on existing methods for coded matrix multiplication under storage and communication constraints. This technique uses "garbage alignment," i.e., aligning computations in coded computing that are not a part of the desired output. Generalized PolyDot codes b…

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Presented in part at the IEEE International Symposium on Information Theory 2018 (Submission Date: Jan 12 2018); Currently under review at the IEEE Transactions on Information Theory

  31. arXiv:1809.10170  [pdf, other]

    cs.LG cs.DC stat.ML

    High Performance Zero-Memory Overhead Direct Convolutions

    Authors: Jiyuan Zhang, Franz Franchetti, Tze Meng Low

    Abstract: The computation of convolution layers in deep neural networks typically rely on high performance routines that trade space for time by using additional memory (either for packing purposes or required as part of the algorithm) to improve performance. The problems with such an approach are two-fold. First, these routines incur additional memory overhead which reduces the overall size of the network…

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: The 35th International Conference on Machine Learning (ICML 2018), camera ready

  32. arXiv:1805.09891  [pdf, other]

    cs.IT cs.DC

    Coded FFT and Its Communication Overhead

    Authors: Haewon Jeong, Tze Meng Low, Pulkit Grover

    Abstract: We propose a coded computing strategy and examine communication costs of coded computing algorithms to make distributed Fast Fourier Transform (FFT) resilient to errors during the computation. We apply maximum distance separable (MDS) codes to a widely used "Transpose" algorithm for parallel FFT. In the uncoded distributed FFT algorithm, the most expensive step is a single "all-to-all" communicati…

    Submitted 24 May, 2018; originally announced May 2018.

  33. arXiv:1611.08035  [pdf, other]

    cs.MS

    Automating the Last-Mile for High Performance Dense Linear Algebra

    Authors: Richard Michael Veras, Tze Meng Low, Tyler Michael Smith, Robert van de Geijn, Franz Franchetti

    Abstract: High performance dense linear algebra (DLA) libraries often rely on a general matrix multiply (Gemm) kernel that is implemented using assembly or with vector intrinsics. In particular, the real-valued Gemm kernels provide the overwhelming fraction of performance for the complex-valued Gemm kernels, along with the entire level-3 BLAS and many of the real and complex LAPACK routines. Thus, achieving…

    Submitted 28 April, 2017; v1 submitted 23 November, 2016; originally announced November 2016.

  34. arXiv:1301.7744  [pdf, ps, other]

    math.NA cs.MS

    Exploiting Symmetry in Tensors for High Performance: Multiplication with Symmetric Tensors

    Authors: Martin D. Schatz, Tze Meng Low, Robert A. van de Geijn, Tamara G. Kolda

    Abstract: Symmetric tensor operations arise in a wide variety of computations. However, the benefits of exploiting symmetry in order to reduce storage and computation is in conflict with a desire to simplify memory access patterns. In this paper, we propose a blocked data structure (Blocked Compact Symmetric Storage) wherein we consider the tensor by blocks and store only the unique blocks of a symmetric te…

    Submitted 9 April, 2014; v1 submitted 31 January, 2013; originally announced January 2013.

    MSC Class: 15-02 (Primary)

    Journal ref: SIAM Journal on Scientific Computing, Vol. 36, No. 5, pp. C453-C479, September 2014