Showing 1–17 of 17 results for author: Pattichis, M

  1. arXiv:2405.02317  [pdf, other]

    cs.CV eess.IV

    Long-term Human Participation Assessment In Collaborative Learning Environments Using Dynamic Scene Analysis

    Authors: Wenjing Shi, Phuong Tran, Sylvia Celedón-Pattichis, Marios S. Pattichis

    Abstract: The paper develops datasets and methods to assess student participation in real-life collaborative learning environments. In collaborative learning environments, students are organized into small groups where they are free to interact within their group. Thus, students can move around freely causing issues with strong pose variation, move out and re-enter the camera scene, or face away from the ca…

    Submitted 14 April, 2024; originally announced May 2024.

  2. arXiv:2402.00261  [pdf, other]

    cs.CV cs.LG

    Understanding Neural Network Systems for Image Analysis using Vector Spaces and Inverse Maps

    Authors: Rebecca Pattichis, Marios S. Pattichis

    Abstract: There is strong interest in developing mathematical methods that can be used to understand complex neural networks used in image analysis. In this paper, we introduce techniques from Linear Algebra to model neural network layers as maps between signal spaces. First, we demonstrate how signal spaces can be used to visualize weight spaces and convolutional layer kernels. We also demonstrate how resi…

    Submitted 31 January, 2024; originally announced February 2024.

  3. arXiv:2312.05352  [pdf, other]

    cs.CV cs.LG eess.IV

    A Review of Machine Learning Methods Applied to Video Analysis Systems

    Authors: Marios S. Pattichis, Venkatesh Jatla, Alvaro E. Ulloa Cerna

    Abstract: The paper provides a survey of the development of machine-learning techniques for video analysis. The survey provides a summary of the most popular deep learning methods used for human activity recognition. We discuss how popular architectures perform on standard datasets and highlight the differences from real-life datasets dominated by multiple activities performed by multiple participants over…

    Submitted 8 December, 2023; originally announced December 2023.

  4. The Importance of the Instantaneous Phase for classification using Convolutional Neural Networks

    Authors: Luis Sanchez Tapia, Marios S. Pattichis, Sylvia Celedon-Pattichis, Carlos Lopez Leiva

    Abstract: Large-scale training of Convolutional Neural Networks (CNN) is extremely demanding in terms of computational resources. Also, for specific applications, the standard use of transfer learning also tends to require far more resources than what may be needed. This work examines the impact of using AM-FM representations as input images for CNN classification applications. A comparison was made between…

    Submitted 1 July, 2022; originally announced July 2022.

  5. arXiv:2201.01380  [pdf, other]

    eess.IV astro-ph.SR cs.CV

    Image Processing Methods for Coronal Hole Segmentation, Matching, and Map Classification

    Authors: V. Jatla, M. S. Pattichis, C. N. Arge

    Abstract: The paper presents the results from a multi-year effort to develop and validate image processing methods for selecting the best physical models based on solar image observations. The approach consists of selecting the physical models based on their agreement with coronal holes extracted from the images. Ultimately, the goal is to use physical models to predict geomagnetic storms. We decompose the…

    Submitted 4 January, 2022; originally announced January 2022.

    Journal ref: IEEE Transactions on Image Processing 29 (2019): 1641-1653

  6. arXiv:2112.13463  [pdf, other]

    cs.SD eess.AS eess.IV

    Bilingual Speech Recognition by Estimating Speaker Geometry from Video Data

    Authors: Luis Sanchez Tapia, Antonio Gomez, Mario Esparza, Venkatesh Jatla, Marios Pattichis, Sylvia Celedón-Pattichis, Carlos LópezLeiva

    Abstract: Speech recognition is very challenging in student learning environments that are characterized by significant cross-talk and background noise. To address this problem, we present a bilingual speech recognition system that uses an interactive video analysis system to estimate the 3D speaker geometry for realistic audio simulations. We demonstrate the use of our system in generating a complex audio…

    Submitted 26 December, 2021; originally announced December 2021.

    Comments: 11 pages, 6 figures

    Journal ref: The 19th International Conference on Computer Analysis of Images and Patterns (CAIP), 2021

  7. arXiv:2112.13150  [pdf, other]

    cs.AR cs.CV cs.DC eess.IV eess.SP

    Fast 2D Convolutions and Cross-Correlations Using Scalable Architectures

    Authors: Cesar Carranza, Daniel Llamocca, Marios Pattichis

    Abstract: The manuscript describes fast and scalable architectures and associated algorithms for computing convolutions and cross-correlations. The basic idea is to map 2D convolutions and cross-correlations to a collection of 1D convolutions and cross-correlations in the transform domain. This is accomplished through the use of the Discrete Periodic Radon Transform (DPRT) for general kernels and the use of…

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: The paper develops the fastest known methods for computing 2D convolutions in hardware

    Journal ref: IEEE Transactions on Image Processing 26.5 (2017): 2230-2245

  8. arXiv:2112.13149  [pdf, other]

    cs.AR cs.CV cs.DC eess.IV eess.SP

    Fast and Scalable Computation of the Forward and Inverse Discrete Periodic Radon Transform

    Authors: Cesar Carranza, Daniel Llamocca, Marios Pattichis

    Abstract: The Discrete Periodic Radon Transform (DPRT) has been extensively used in applications that involve image reconstructions from projections. This manuscript introduces a fast and scalable approach for computing the forward and inverse DPRT that is based on the use of: (i) a parallel array of fixed-point adder trees, (ii) circular shift registers to remove the need for accessing external memory comp…

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: This paper has been published as follows: C. Carranza, D. Llamocca, and M. Pattichis. "Fast and scalable computation of the forward and inverse discrete periodic radon transform", IEEE Transactions on Image Processing, 25(1):119-133, Jan 2016

    Journal ref: IEEE Transactions on Image Processing, 25(1):119-133, Jan 2016

  9. arXiv:2112.12217  [pdf, other]

    eess.IV

    Person Detection in Collaborative Group Learning Environments Using Multiple Representations

    Authors: Wenjing Shi, Marios S. Pattichis, Sylvia Celedón-Pattichis, Carlos LópezLeiva

    Abstract: We introduce the problem of detecting a group of students from classroom videos. The problem requires the detection of students from different angles and the separation of the group from other groups in long videos (one to one and a half hours). We use multiple image representations to solve the problem. We use FM components to separate each group from background groups, AM-FM components for detec…

    Submitted 22 December, 2021; originally announced December 2021.

  10. arXiv:2110.13269  [pdf, other]

    cs.CV

    Facial Recognition in Collaborative Learning Videos

    Authors: Phuong Tran, Marios Pattichis, Sylvia Celedón-Pattichis, Carlos LópezLeiva

    Abstract: Face recognition in collaborative learning videos presents many challenges. In collaborative learning videos, students sit around a typical table at different positions to the recording camera, come and go, move around, get partially or fully occluded. Furthermore, the videos tend to be very long, requiring the development of fast and accurate methods. We develop a dynamic system of recognizing pa…

    Submitted 25 October, 2021; originally announced October 2021.

  11. arXiv:2110.07646  [pdf, other]

    cs.CV eess.IV

    Talking Detection In Collaborative Learning Environments

    Authors: Wenjing Shi, Marios S. Pattichis, Sylvia Celedón-Pattichis, Carlos LópezLeiva

    Abstract: We study the problem of detecting talking activities in collaborative learning videos. Our approach uses head detection and projections of the log-magnitude of optical flow vectors to reduce the problem to a simple classification of small projection images without the need for training complex, 3-D activity classification systems. The small projection images are then easily classified using a simp…

    Submitted 14 October, 2021; originally announced October 2021.

  12. arXiv:2110.07070  [pdf, other]

    cs.CV

    Fast Hand Detection in Collaborative Learning Environments

    Authors: Sravani Teeparthi, Venkatesh Jatla, Marios S. Pattichis, Sylvia Celedon Pattichis, Carlos LopezLeiva

    Abstract: Long-term object detection requires the integration of frame-based results over several seconds. For non-deformable objects, long-term detection is often addressed using object detection followed by video tracking. Unfortunately, tracking is inapplicable to objects that undergo dramatic changes in appearance from frame to frame. As a related example, we study hand detection over long video recordi…

    Submitted 13 October, 2021; originally announced October 2021.

  13. Adaptive Video Encoding For Different Video Codecs

    Authors: Gangadharan Esakki, Andreas Panayides, Venkatesh Jatla, Marios Pattichis

    Abstract: By 2022, we expect video traffic to reach 82% of the total internet traffic. Undoubtedly, the abundance of video-driven applications will likely lead internet video traffic percentage to a further increase in the near future, enabled by associate advances in video devices' capabilities. In response to this ever-growing demand, the Alliance for Open Media (AOM) and the Joint Video Experts Team (JVE…

    Submitted 17 May, 2021; originally announced May 2021.

    Comments: Video codecs, Video signal processing, Video coding, Video compression, Video quality, Video streaming, Adaptive video streaming, Versatile Video Coding, AV1, HEVC

    Journal ref: IEEE Access 2021

  14. arXiv:1911.04048  [pdf, other]

    stat.ML cs.LG eess.IV eess.SP stat.AP

    Multidataset Independent Subspace Analysis with Application to Multimodal Fusion

    Authors: Rogers F. Silva, Sergey M. Plis, Tulay Adali, Marios S. Pattichis, Vince D. Calhoun

    Abstract: In the last two decades, unsupervised latent variable models---blind source separation (BSS) especially---have enjoyed a strong reputation for the interpretable features they produce. Seldom do these models combine the rich diversity of information available in multiple datasets. Multidatasets, on the other hand, yield joint solutions otherwise unavailable in isolation, with a potential for pivota…

    Submitted 10 November, 2019; originally announced November 2019.

    Comments: For associated code, see https://github.com/rsilva8/MISA For associated data, see https://github.com/rsilva8/MISA-data Submitted to IEEE Transactions on Image Processing on Nov/7/2019: 13 pages, 8 figures Supplement: 16 pages, 5 figures

    ACM Class: G.1.6; G.2.1; G.3; H.1.1; J.3; I.5.1; I.2.6

  15. Estimating Total Open Heliospheric Magnetic Flux

    Authors: S. Wallace, C. N. Arge, M. Pattichis, R. A. Hock-Mysliwiec, C. J. Henney

    Abstract: Over the solar-activity cycle, there are extended periods where significant discrepancies occur between the spacecraft-observed total (unsigned) open magnetic flux and that determined from coronal models. In this article, the total open heliospheric magnetic flux is computed using two different methods and then compared with results obtained from in-situ interplanetary magnetic-field observations.…

    Submitted 29 March, 2019; originally announced March 2019.

    Comments: 20 pages, 6 figures

    Journal ref: Solar Phys (2019) 294:19

  16. arXiv:1901.08125  [pdf, other]

    cs.LG stat.ML

    Interpretable Neural Networks for Predicting Mortality Risk using Multi-modal Electronic Health Records

    Authors: Alvaro E. Ulloa Cerna, Marios Pattichis, David P. vanMaanen, Linyuan Jing, Aalpen A. Patel, Joshua V. Stough, Christopher M. Haggerty, Brandon K. Fornwalt

    Abstract: We present an interpretable neural network for predicting an important clinical outcome (1-year mortality) from multi-modal Electronic Health Record (EHR) data. Our approach builds on prior multi-modal machine learning models by now enabling visualization of how individual factors contribute to the overall outcome risk, assuming other factors remain constant, which was previously impossible. We…

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: Submitted to IEEE JBHI

    Journal ref: IEEE Journal of Biomedical and Health Informatics, 2019

  17. arXiv:1811.10553  [pdf]

    cs.LG cs.AI cs.CV q-bio.QM

    A deep neural network to enhance prediction of 1-year mortality using echocardiographic videos of the heart

    Authors: Alvaro Ulloa, Linyuan Jing, Christopher W Good, David P vanMaanen, Sushravya Raghunath, Jonathan D Suever, Christopher D Nevius, Gregory J Wehner, Dustin Hartzel, Joseph B Leader, Amro Alsaid, Aalpen A Patel, H Lester Kirchner, Marios S Pattichis, Christopher M Haggerty, Brandon K Fornwalt

    Abstract: Predicting future clinical events helps physicians guide appropriate intervention. Machine learning has tremendous promise to assist physicians with predictions based on the discovery of complex patterns from historical data, such as large, longitudinal electronic health records (EHR). This study is a first attempt to demonstrate such capabilities using raw echocardiographic videos of the heart. W…

    Submitted 14 May, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: We updated results with improved performance after dropout bug in tensorflow v1.12. We also added learning curves showing promise in video model with more samples