Skip to main content

Showing 1–28 of 28 results for author: Ajitesh

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17235  [pdf, other

    cs.CV cs.AI cs.DC

    Task-Agnostic Federated Learning

    Authors: Zhengtao Yao, Hong Nguyen, Ajitesh Srivastava, Jose Luis Ambite

    Abstract: In the realm of medical imaging, leveraging large-scale datasets from various institutions is crucial for develo** precise deep learning models, yet privacy concerns frequently impede data sharing. federated learning (FL) emerges as a prominent solution for preserving privacy while facilitating collaborative learning. However, its application in real-world scenarios faces several obstacles, such… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2401.03390  [pdf, other

    q-bio.PE cs.LG physics.soc-ph

    Dynamics-based Feature Augmentation of Graph Neural Networks for Variant Emergence Prediction

    Authors: Majd Al Aawar, Srikar Mutnuri, Mansooreh Montazerin, Ajitesh Srivastava

    Abstract: During the COVID-19 pandemic, a major driver of new surges has been the emergence of new variants. When a new variant emerges in one or more countries, other nations monitor its spread in preparation for its potential arrival. The impact of the new variant and the timings of epidemic peaks in a country highly depend on when the variant arrives. The current methods for predicting the spread of new… ▽ More

    Submitted 28 May, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  3. arXiv:2312.12774  [pdf, other

    cs.DC

    Data Extraction, Transformation, and Loading Process Automation for Algorithmic Trading Machine Learning Modelling and Performance Optimization

    Authors: Nassi Ebadifard, Ajitesh Parihar, Youry Khmelevsky, Gaetan Hains, Albert Wong, Frank Zhang

    Abstract: A data warehouse efficiently prepares data for effective and fast data analysis and modelling using machine learning algorithms. This paper discusses existing solutions for the Data Extraction, Transformation, and Loading (ETL) process and automation for algorithmic trading algorithms. Integrating the Data Warehouses and, in the future, the Data Lakes with the Machine Learning Algorithms gives eno… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  4. arXiv:2310.10902  [pdf, other

    cs.AR eess.SP

    Reuse Kernels or Activations? A Flexible Dataflow for Low-latency Spectral CNN Acceleration

    Authors: Yue Niu, Rajgopal Kannan, Ajitesh Srivastava, Viktor Prasanna

    Abstract: Spectral-domain CNNs have been shown to be more efficient than traditional spatial CNNs in terms of reducing computation complexity. However they come with a `kernel explosion' problem that, even after compression (pruning), imposes a high memory burden and off-chip bandwidth requirement for kernel access. This creates a performance gap between the potential acceleration offered by compression and… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 11 pages, 11 figures Accepted to ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA) 2020

  5. arXiv:2309.03579  [pdf, other

    cs.LG cs.AI

    DTW+S: Shape-based Comparison of Time-series with Ordered Local Trend

    Authors: Ajitesh Srivastava

    Abstract: Measuring distance or similarity between time-series data is a fundamental aspect of many applications including classification, clustering, and ensembling/alignment. Existing measures may fail to capture similarities among local trends (shapes) and may even produce misleading results. Our goal is to develop a measure that looks for similar trends occurring around similar times and is easily inter… ▽ More

    Submitted 29 November, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: 11 pages, 11 figures Update: Included barycenter averaging with DTW+S along with results

  6. arXiv:2309.01108  [pdf, other

    eess.AS cs.LG cs.SD

    Acoustic-to-articulatory inversion for dysarthric speech: Are pre-trained self-supervised representations favorable?

    Authors: Sarthak Kumar Maharana, Krishna Kamal Adidam, Shoumik Nandi, Ajitesh Srivastava

    Abstract: Acoustic-to-articulatory inversion (AAI) involves map** from the acoustic to the articulatory space. Signal-processing features like the MFCCs, have been widely used for the AAI task. For subjects with dysarthric speech, AAI is challenging because of an imprecise and indistinct pronunciation. In this work, we perform AAI for dysarthric speech using representations from pre-trained self-supervise… ▽ More

    Submitted 9 February, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE ICASSP Workshops 2024

  7. arXiv:2307.06643  [pdf, other

    cs.SI physics.soc-ph

    Nowcasting Temporal Trends Using Indirect Surveys

    Authors: Ajitesh Srivastava, Juan Marcos Ramírez, Sergio Díaz-Aranda, Jose Aguilar, Antonio Ortega, Antonio Fernández Anta, Rosa Elvira Lillo

    Abstract: Indirect surveys, in which respondents provide information about other people they know, have been proposed for estimating (nowcasting) the size of a \emph{hidden population} where privacy is important or the hidden population is hard to reach. Examples include estimating casualties in an earthquake, conditions among female sex workers, and the prevalence of drug use and infectious diseases. The N… ▽ More

    Submitted 14 December, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted at AAAI 2024

    ACM Class: G.3

  8. arXiv:2211.07360  [pdf, other

    q-bio.NC cs.CV cs.LG

    Spatio-Temporal Attention in Multi-Granular Brain Chronnectomes for Detection of Autism Spectrum Disorder

    Authors: James Orme-Rogers, Ajitesh Srivastava

    Abstract: The traditional methods for detecting autism spectrum disorder (ASD) are expensive, subjective, and time-consuming, often taking years for a diagnosis, with many children growing well into adolescence and even adulthood before finally confirming the disorder. Recently, graph-based learning techniques have demonstrated impressive results on resting-state functional magnetic resonance imaging (rs-fM… ▽ More

    Submitted 29 October, 2022; originally announced November 2022.

    Comments: 6 pages, 2 figures

  9. arXiv:2206.08967  [pdf, other

    cs.LG

    Random Forest of Epidemiological Models for Influenza Forecasting

    Authors: Majd Al Aawar, Ajitesh Srivastava

    Abstract: Forecasting the hospitalizations caused by the Influenza virus is vital for public health planning so that hospitals can be better prepared for an influx of patients. Many forecasting methods have been used in real-time during the Influenza seasons and submitted to the CDC for public communication. The forecasting models range from mechanistic models, and auto-regression models to machine learning… ▽ More

    Submitted 17 June, 2022; originally announced June 2022.

  10. arXiv:2205.14778  [pdf, other

    cs.AR cs.LG

    TransforMAP: Transformer for Memory Access Prediction

    Authors: Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Data Prefetching is a technique that can hide memory latency by fetching data before it is needed by a program. Prefetching relies on accurate memory access prediction, to which task machine learning based methods are increasingly applied. Unlike previous approaches that learn from deltas or offsets and perform one access prediction, we develop TransforMAP, based on the powerful Transformer model,… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

  11. Fine-Grained Address Segmentation for Attention-Based Variable-Degree Prefetching

    Authors: Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Machine learning algorithms have shown potential to improve prefetching performance by accurately predicting future memory accesses. Existing approaches are based on the modeling of text prediction, considering prefetching as a classification problem for sequence prediction. However, the vast and sparse memory address space leads to large vocabulary, which makes this modeling impractical. The numb… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

  12. Design and Implementation of Knowledge Base for Runtime Management of Software Defined Hardware

    Authors: Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor Prasanna

    Abstract: Runtime-reconfigurable software coupled with reconfigurable hardware is highly desirable as a means towards maximizing runtime efficiency without compromising programmability. Compilers for such software systems are extremely difficult to design as they must leverage different types of hardware at runtime. To address the need for static and dynamic compiler optimization of workflows matched to dyn… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: HPEC'19

  13. arXiv:2201.07858  [pdf, ps, other

    cs.LG cs.AI

    Decoupling the Depth and Scope of Graph Neural Networks

    Authors: Hanqing Zeng, Muhan Zhang, Yinglong Xia, Ajitesh Srivastava, Andrey Malevich, Rajgopal Kannan, Viktor Prasanna, Long **, Ren Chen

    Abstract: State-of-the-art Graph Neural Networks (GNNs) have limited scalability with respect to the graph and model sizes. On large graphs, increasing the model depth often means exponential expansion of the scope (i.e., receptive field). Beyond just a few layers, two fundamental challenges emerge: 1. degraded expressivity due to oversmoothing, and 2. expensive computation due to neighborhood explosion. We… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: Accepted to NeurIPS 2021

    Journal ref: Advances in Neural Information Processing Systems, 2021

  14. Accelerating Large Scale Real-Time GNN Inference using Channel Pruning

    Authors: Hongkuan Zhou, Ajitesh Srivastava, Hanqing Zeng, Rajgopal Kannan, Viktor Prasanna

    Abstract: Graph Neural Networks (GNNs) are proven to be powerful models to generate node embedding for downstream applications. However, due to the high computation complexity of GNN inference, it is hard to deploy GNNs for large-scale or real-time applications. In this paper, we propose to accelerate GNN inference by pruning the dimensions in each layer with negligible accuracy loss. Our pruning framework… ▽ More

    Submitted 10 May, 2021; originally announced May 2021.

  15. arXiv:2104.05596  [pdf

    cs.CL

    Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages

    Authors: Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Srihari Nagaraj, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra

    Abstract: We present Samanantar, the largest publicly available parallel corpora collection for Indic languages. The collection contains a total of 49.7 million sentence pairs between English and 11 Indic languages (from two language families). Specifically, we compile 12.4 million sentence pairs from existing, publicly-available parallel corpora, and additionally mine 37.4 million sentence pairs from the w… ▽ More

    Submitted 12 June, 2023; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to the Transactions of the Association for Computational Linguistics (TACL)

  16. arXiv:2102.02842  [pdf, other

    cs.LG cs.AI

    The EpiBench Platform to Propel AI/ML-based Epidemic Forecasting: A Prototype Demonstration Reaching Human Expert-level Performance

    Authors: Ajitesh Srivastava, Tianjian Xu, Viktor K. Prasanna

    Abstract: During the COVID-19 pandemic, a significant effort has gone into develo** ML-driven epidemic forecasting techniques. However, benchmarks do not exist to claim if a new AI/ML technique is better than the existing ones. The "covid-forecast-hub" is a collection of more than 30 teams, including us, that submit their forecasts weekly to the CDC. It is not possible to declare whether one method is bet… ▽ More

    Submitted 4 February, 2021; originally announced February 2021.

    Comments: 8 pages, 6 figures. Accepted at the 5th International Workshop on Health Intelligence in conjunction with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21)

  17. arXiv:2012.01380   

    cs.LG

    Deep Graph Neural Networks with Shallow Subgraph Samplers

    Authors: Hanqing Zeng, Muhan Zhang, Yinglong Xia, Ajitesh Srivastava, Andrey Malevich, Rajgopal Kannan, Viktor Prasanna, Long **, Ren Chen

    Abstract: While Graph Neural Networks (GNNs) are powerful models for learning representations on graphs, most state-of-the-art models do not have significant accuracy gain beyond two to three layers. Deep GNNs fundamentally need to address: 1). expressivity challenge due to oversmoothing, and 2). computation challenge due to neighborhood explosion. We propose a simple "deep GNN, shallow sampler" design prin… ▽ More

    Submitted 23 March, 2022; v1 submitted 2 December, 2020; originally announced December 2020.

    Comments: The complete version of this paper is accepted to NeurIPS 2021, available on arXiv under the new title "Decoupling the depth and scope of graph neural networks" (arXiv:2201.07858). This version, "Deep graph neural networks with shallow subgraph samplers", is a short version and we withdraw it to avoid confusion. Please always refer to arXiv:2201.07858

  18. Accurate, Efficient and Scalable Training of Graph Neural Networks

    Authors: Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor Prasanna

    Abstract: Graph Neural Networks (GNNs) are powerful deep learning models to generate node embeddings on graphs. When applying deep GNNs on large graphs, it is still challenging to perform training in an efficient and scalable way. We propose a novel parallel training framework. Through sampling small subgraphs as minibatches, we reduce training workload by orders of magnitude compared with state-of-the-art… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

    Comments: 43 pages, 8 figures. arXiv admin note: text overlap with arXiv:1810.11899

    Journal ref: Journal of Parallel and Distributed Computing, Volume 147, January 2021, Pages 166-183

  19. arXiv:2007.05180  [pdf, other

    q-bio.PE cs.LG physics.soc-ph

    Fast and Accurate Forecasting of COVID-19 Deaths Using the SIkJ$α$ Model

    Authors: Ajitesh Srivastava, Tianjian Xu, Viktor K. Prasanna

    Abstract: Forecasting the effect of COVID-19 is essential to design policies that may prepare us to handle the pandemic. Many methods have already been proposed, particularly, to forecast reported cases and deaths at country-level and state-level. Many of these methods are based on traditional epidemiological model which rely on simulations or Bayesian inference to simultaneously learn many parameters at a… ▽ More

    Submitted 12 July, 2020; v1 submitted 10 July, 2020; originally announced July 2020.

    Comments: Fixed a typo

  20. arXiv:2006.02127  [pdf, other

    q-bio.PE cs.LG

    Data-driven Identification of Number of Unreported Cases for COVID-19: Bounds and Limitations

    Authors: Ajitesh Srivastava, Viktor K. Prasanna

    Abstract: Accurate forecasts for COVID-19 are necessary for better preparedness and resource management. Specifically, deciding the response over months or several months requires accurate long-term forecasts which is particularly challenging as the model errors accumulate with time. A critical factor that can hinder accurate long-term forecasts, is the number of unreported/asymptomatic cases. While there h… ▽ More

    Submitted 9 July, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: Fixed a typo

  21. arXiv:2004.11372  [pdf, other

    q-bio.PE cs.LG q-bio.QM stat.ML

    Learning to Forecast and Forecasting to Learn from the COVID-19 Pandemic

    Authors: Ajitesh Srivastava, Viktor K. Prasanna

    Abstract: Accurate forecasts of COVID-19 is central to resource management and building strategies to deal with the epidemic. We propose a heterogeneous infection rate model with human mobility for epidemic modeling, a preliminary version of which we have successfully used during DARPA Grand Challenge 2014. By linearizing the model and using weighted least squares, our model is able to quickly adapt to chan… ▽ More

    Submitted 4 May, 2020; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: 12 pages, 8 figures. Added a figure

  22. arXiv:2003.07497  [pdf, other

    cs.PF cs.LG

    Towards High Performance, Portability, and Productivity: Lightweight Augmented Neural Networks for Performance Prediction

    Authors: Ajitesh Srivastava, Naifeng Zhang, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Writing high-performance code requires significant expertise in the programming language, compiler optimizations, and hardware knowledge. This often leads to poor productivity and portability and is inconvenient for a non-programmer domain-specialist such as a Physicist. More desirable is a high-level language where the domain-specialist simply specifies the workload in terms of high-level operati… ▽ More

    Submitted 30 August, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

  23. arXiv:1910.11103  [pdf, ps, other

    cs.CV eess.SP

    SPEC2: SPECtral SParsE CNN Accelerator on FPGAs

    Authors: Yue Niu, Hanqing Zeng, Ajitesh Srivastava, Kartik Lakhotia, Rajgopal Kannan, Yanzhi Wang, Viktor Prasanna

    Abstract: To accelerate inference of Convolutional Neural Networks (CNNs), various techniques have been proposed to reduce computation redundancy. Converting convolutional layers into frequency domain significantly reduces the computation complexity of the sliding window operations in space domain. On the other hand, weight pruning techniques address the redundancy in model parameters by converting dense co… ▽ More

    Submitted 10 October, 2023; v1 submitted 16 October, 2019; originally announced October 2019.

    Comments: This is a 10-page conference paper in 26TH IEEE International Conference On High Performance Computing, Data, and Analytics (HiPC)

  24. arXiv:1908.09070  [pdf, other

    cs.NI cs.PF eess.SY

    Optimizing Inter-Datacenter Tail Flow Completion Times using Best Worst-case Routing

    Authors: Max Noormohammadpour, Ajitesh Srivastava, Cauligi S. Raghavendra

    Abstract: Flow routing over inter-datacenter networks is a well-known problem where the network assigns a path to a newly arriving flow potentially according to the network conditions and the properties of the new flow. An essential system-wide performance metric for a routing algorithm is the flow completion times, which affect the performance of applications running across multiple datacenters. Current st… ▽ More

    Submitted 23 August, 2019; originally announced August 2019.

    Comments: Accepted for publication in the 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

  25. Hotel Recommendation System

    Authors: Aditi A. Mavalankar, Ajitesh Gupta, Chetan Gandotra, Rishabh Misra

    Abstract: One of the first things to do while planning a trip is to book a good place to stay. Booking a hotel online can be an overwhelming task with thousands of hotels to choose from, for every destination. Motivated by the importance of these situations, we decided to work on the task of recommending hotels to users. We used Expedia's hotel recommendation dataset, which has a variety of features that he… ▽ More

    Submitted 21 August, 2019; v1 submitted 20 August, 2019; originally announced August 2019.

    Comments: arXiv admin note: text overlap with arXiv:1703.02915 by other authors

  26. arXiv:1907.04931  [pdf, other

    cs.LG stat.ML

    GraphSAINT: Graph Sampling Based Inductive Learning Method

    Authors: Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor Prasanna

    Abstract: Graph Convolutional Networks (GCNs) are powerful models for learning representations of attributed graphs. To scale GCNs to large graphs, state-of-the-art methods use various layer sampling techniques to alleviate the "neighbor explosion" problem during minibatch training. We propose GraphSAINT, a graph sampling based inductive learning method that improves training efficiency and accuracy in a fu… ▽ More

    Submitted 15 February, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Comments: Published at ICLR 2020; Code release: github.com/GraphSAINT/GraphSAINT

  27. arXiv:1810.11899  [pdf, ps, other

    cs.LG cs.PF stat.ML

    Accurate, Efficient and Scalable Graph Embedding

    Authors: Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor Prasanna

    Abstract: The Graph Convolutional Network (GCN) model and its variants are powerful graph embedding tools for facilitating classification and clustering on graphs. However, a major challenge is to reduce the complexity of layered GCNs and make them parallelizable and scalable on very large graphs -- state-of the art techniques are unable to achieve scalability without losing accuracy and efficiency. In this… ▽ More

    Submitted 5 August, 2020; v1 submitted 28 October, 2018; originally announced October 2018.

    Comments: 10 pages. 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

  28. arXiv:1810.00169  [pdf, ps, other

    cs.NI cs.DS cs.PF eess.SY

    On Minimizing the Completion Times of Long Flows over Inter-Datacenter WAN

    Authors: Mohammad Noormohammadpour, Ajitesh Srivastava, Cauligi S. Raghavendra

    Abstract: Long flows contribute huge volumes of traffic over inter-datacenter WAN. The Flow Completion Time (FCT) is a vital network performance metric that affects the running time of distributed applications and the users' quality of experience. Flow routing techniques based on propagation or queuing latency or instantaneous link utilization are insufficient for minimization of the long flows' FCT. We pro… ▽ More

    Submitted 29 September, 2018; originally announced October 2018.

    Comments: Accepted for publication in IEEE Communications Letters

    Journal ref: IEEE Communications Letters 22 (2018) 2475 - 2478