
Showing 1–21 of 21 results for author: Chatterjee, N

Searching in archive cs.
  1. arXiv:2404.17212  [pdf]

    cs.ET cs.CV

    Scrutinizing Data from Sky: An Examination of Its Veracity in Area Based Traffic Contexts

    Authors: Yawar Ali, Krishnan K N, Debashis Ray Sarkar, K. Ramachandra Rao, Niladri Chatterjee, Ashish Bhaskar

    Abstract: Traffic data collection has been an overwhelming task for researchers as well as authorities over the years. With the advancement in technology and introduction of various tools for processing and extracting traffic data the task has been made significantly convenient. Data from Sky (DFS) is one such tool, based on image processing and artificial intelligence (AI), that provides output for macrosc…

    Submitted 26 April, 2024; originally announced April 2024.

  2. arXiv:2310.01593  [pdf, other]

    cs.LG cs.AI stat.AP

    Prescribed Fire Modeling using Knowledge-Guided Machine Learning for Land Management

    Authors: Somya Sharma Chatterjee, Kelly Lindsay, Neel Chatterjee, Rohan Patil, Ilkay Altintas De Callafon, Michael Steinbach, Daniel Giron, Mai H. Nguyen, Vipin Kumar

    Abstract: In recent years, the increasing threat of devastating wildfires has underscored the need for effective prescribed fire management. Process-based computer simulations have traditionally been employed to plan prescribed fires for wildfire prevention. However, even simplified process models like QUIC-Fire are too compute-intensive to be used for real-time decision-making, especially when weather cond…

    Submitted 2 October, 2023; originally announced October 2023.

  3. arXiv:2211.00184  [pdf, other]

    cs.LG

    FL Games: A Federated Learning Framework for Distribution Shifts

    Authors: Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

    Abstract: Federated learning aims to train predictive models for data that is distributed across clients, under the orchestration of a server. However, participating clients typically each hold data from a different distribution, which can yield to catastrophic generalization on data from a different client, which represents a new domain. In this work, we argue that in order to generalize better across non-…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: Accepted as ORAL at NeurIPS Workshop on Federated Learning: Recent Advances and New Challenges. arXiv admin note: text overlap with arXiv:2205.11101

  4. arXiv:2205.11101  [pdf, other]

    cs.LG cs.AI

    FL Games: A federated learning framework for distribution shifts

    Authors: Sharut Gupta, Kartik Ahuja, Mohammad Havaei, Niladri Chatterjee, Yoshua Bengio

    Abstract: Federated learning aims to train predictive models for data that is distributed across clients, under the orchestration of a server. However, participating clients typically each hold data from a different distribution, whereby predictive models with strong in-distribution generalization can fail catastrophically on unseen domains. In this work, we argue that in order to generalize better across n…

    Submitted 23 May, 2022; originally announced May 2022.

  5. arXiv:2204.09781  [pdf]

    cs.DL cs.CL cs.IR cs.LG

    Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations

    Authors: Qingyu Chen, Alexis Allot, Robert Leaman, Rezarta Islamaj Doğan, Jingcheng Du, Li Fang, Kai Wang, Shuo Xu, Yuefu Zhang, Parsa Bagherzadeh, Sabine Bergler, Aakash Bhatnagar, Nidhir Bhavsar, Yung-Chun Chang, Sheng-Jie Lin, Wentai Tang, Hongtong Zhang, Ilija Tavchioski, Senja Pollak, Shubo Tian, Jinfeng Zhang, Yulia Otmakhova, Antonio Jimeno Yepes, Hang Dong, Honghan Wu, et al. (14 additional authors not shown)

    Abstract: The COVID-19 pandemic has been severely impacting global society since December 2019. Massive research has been undertaken to understand the characteristics of the virus and design vaccines and drugs. The related findings have been reported in biomedical literature at a rate of about 10,000 articles on COVID-19 per month. Such rapid growth significantly challenges manual curation and interpretatio…

    Submitted 3 June, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

  6. arXiv:2203.17081  [pdf, other]

    cs.LG cs.AI cs.CL cs.IR

    Interpretation of Black Box NLP Models: A Survey

    Authors: Shivani Choudhary, Niladri Chatterjee, Subir Kumar Saha

    Abstract: An increasing number of machine learning models have been deployed in domains with high stakes such as finance and healthcare. Despite their superior performances, many models are black boxes in nature which are hard to explain. There are growing efforts for researchers to develop methods to interpret these black-box models. Post hoc explanations based on perturbations, such as LIME, are widely us…

    Submitted 31 March, 2022; originally announced March 2022.

  7. arXiv:2111.13296  [pdf, ps, other]

    cs.LG stat.CO stat.ML

    Approximate Bayesian Computation for Physical Inverse Modeling

    Authors: Neel Chatterjee, Somya Sharma, Sarah Swisher, Snigdhansu Chatterjee

    Abstract: Semiconductor device models are essential to understand the charge transport in thin film transistors (TFTs). Using these TFT models to draw inference involves estimating parameters used to fit to the experimental data. These experimental data can involve extracted charge carrier mobility or measured current. Estimating these parameters help us draw inferences about device performance. Fitting a T…

    Submitted 25 November, 2021; originally announced November 2021.

  8. arXiv:2104.02188  [pdf, other]

    cs.AR cs.DC cs.LG

    GPU Domain Specialization via Composable On-Package Architecture

    Authors: Yaosheng Fu, Evgeny Bolotin, Niladrish Chatterjee, David Nellans, Stephen W. Keckler

    Abstract: As GPUs scale their low precision matrix math throughput to boost deep learning (DL) performance, they upset the balance between math throughput and memory system capabilities. We demonstrate that converged GPU design trying to address diverging architectural requirements between FP32 (or larger) based HPC and FP16 (or smaller) based DL workloads results in sub-optimal configuration for either of…

    Submitted 5 April, 2021; originally announced April 2021.

  9. arXiv:2009.11796  [pdf]

    cs.IR

    Automatic Extraction of Agriculture Terms from Domain Text: A Survey of Tools and Techniques

    Authors: Niladri Chatterjee, Neha Kaushik

    Abstract: Agriculture is a key component in any country's development. Domain-specific knowledge resources serve to gain insight into the domain. Existing knowledge resources such as AGROVOC and NAL Thesaurus are developed and maintained by the domain experts. Population of terms into these knowledge resources can be automated by using automatic term extraction tools for processing unstructured agricultural…

    Submitted 24 September, 2020; originally announced September 2020.

  10. arXiv:2008.12552  [pdf, other]

    cs.LG cs.CL

    Probabilistic Random Indexing for Continuous Event Detection

    Authors: Yashank Singh, Niladri Chatterjee

    Abstract: The present paper explores a novel variant of Random Indexing (RI) based representations for encoding language data with a view to using them in a dynamic scenario where events are happening in a continuous fashion. As the size of the representations in the general method of onehot encoding grows linearly with the size of the vocabulary, they become non-scalable for online purposes with high volum…

    Submitted 9 December, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

    Comments: 8 pages, 12 figures

  11. arXiv:2008.01297  [pdf, other]

    cs.CL cs.DS

    An improved Bayesian TRIE based model for SMS text normalization

    Authors: Abhinava Sikdar, Niladri Chatterjee

    Abstract: Normalization of SMS text, commonly known as texting language, is being pursued for more than a decade. A probabilistic approach based on the Trie data structure was proposed in literature which was found to be better performing than HMM based approaches proposed earlier in predicting the correct alternative for an out-of-lexicon word. However, success of the Trie based approach depends largely on…

    Submitted 18 November, 2020; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: 7 pages, 8 figures, under review at Pattern Recognition Letters

  12. arXiv:2004.10560  [pdf]

    q-fin.ST cs.CE

    Examining Lead-Lag Relationships In-Depth, With Focus On FX Market As Covid-19 Crises Unfolds

    Authors: Kartikay Gupta, Niladri Chatterjee

    Abstract: The lead-lag relationship plays a vital role in financial markets. It is the phenomenon where a certain price-series lags behind and partially replicates the movement of leading time-series. The present research proposes a new technique which helps better identify the lead-lag relationship empirically. Apart from better identifying the lead-lag path, the technique also gives a measure for adjudgin…

    Submitted 9 May, 2020; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Suggestions are welcome. In the second version, a citation has been updated on request from the corresponding author

    MSC Class: 91G30; 68T10; ACM Class: G.3.3

  13. arXiv:2002.03259  [pdf]

    cs.CL cs.IR

    Rough Set based Aggregate Rank Measure & its Application to Supervised Multi Document Summarization

    Authors: Nidhika Yadav, Niladri Chatterjee

    Abstract: Most problems in Machine Learning cater to classification and the objects of universe are classified to a relevant class. Ranking of classified objects of universe per decision class is a challenging problem. We in this paper propose a novel Rough Set based membership called Rank Measure to solve to this problem. It shall be utilized for ranking the elements to a particular class. It differs from…

    Submitted 8 February, 2020; originally announced February 2020.

    Comments: The paper proposes a novel Rough Set based technique to compute rank in a decision system. This is further evaluated on the problem of Supervised Text Summarization. The paper contains 9 pages, illustrative examples, theoretical properties, and experimental evaluations on standard datasets

  14. arXiv:1908.00289  [pdf, other]

    cs.AR

    Runtime Mitigation of Packet Drop Attacks in Fault-tolerant Networks-on-Chip

    Authors: N Prasad, Navonil Chatterjee, Santanu Chattopadhyay, Indrajit Chakrabarti

    Abstract: Fault-tolerant routing (FTR) in Networks-on-Chip (NoCs) has become a common practice to sustain the performance of multi-core systems with an increasing number of faults on a chip. On the other hand, usage of third-party intellectual property blocks has made security a primary concern in modern day designs. This article presents a mechanism to mitigate a denial-of-service attack, namely packet dro…

    Submitted 1 August, 2019; originally announced August 2019.

    Comments: 23 pages, 17 figures

  15. Selecting stock pairs for pairs trading while incorporating lead-lag relationship

    Authors: Kartikay Gupta, Niladri Chatterjee

    Abstract: Pairs Trading is carried out in the financial market to earn huge profits from known equilibrium relation between pairs of stock. In financial markets, seldom it is seen that stock pairs are correlated at particular lead or lag. This lead-lag relationship has been empirically studied in various financial markets. Earlier research works have suggested various measures for identifying the best pairs…

    Submitted 31 December, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Better updated version in lots of ways to be uploaded soon

  16. DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis

    Authors: Sangkug Lym, Donghyuk Lee, Mike O'Connor, Niladrish Chatterjee, Mattan Erez

    Abstract: Training convolutional neural networks (CNNs) requires intense compute throughput and high memory bandwidth. Especially, convolution layers account for the majority of the execution time of CNN training, and GPUs are commonly used to accelerate these layer workloads. GPU design optimization for efficient CNN training acceleration requires the accurate modeling of how their performance improves whe…

    Submitted 2 April, 2019; originally announced April 2019.

  17. arXiv:1901.11013  [pdf]

    q-fin.GN cs.CE

    Top performing stocks recommendation strategy for portfolio

    Authors: Kartikay Gupta, Niladri Chatterjee

    Abstract: Stock return forecasting is of utmost importance in the business world. This has been the favourite topic of research for many academicians since decades. Recently, regularization techniques have reported to tremendously increase the forecast accuracy of the simple regression model. Still, this model cannot incorporate the effect of things like a major natural disaster, large foreign influence, et…

    Submitted 10 August, 2019; v1 submitted 30 January, 2019; originally announced January 2019.

    Comments: 20 pages, 9 Tables, 3 figures. Comments are invited. In the last version, Methodological details corrected at one point, results unchanged

    MSC Class: 68T37

  18. arXiv:1807.05102  [pdf, other]

    cs.AR

    What Your DRAM Power Models Are Not Telling You: Lessons from a Detailed Experimental Study

    Authors: Saugata Ghose, Abdullah Giray Yağlıkçı, Raghav Gupta, Donghyuk Lee, Kais Kudrolli, William X. Liu, Hasan Hassan, Kevin K. Chang, Niladrish Chatterjee, Aditya Agrawal, Mike O'Connor, Onur Mutlu

    Abstract: Main memory (DRAM) consumes as much as half of the total system power in a computer today, resulting in a growing need to develop new DRAM architectures and systems that consume less power. Researchers have long relied on DRAM power models that are based off of standardized current measurements provided by vendors, called IDD values. Unfortunately, we find that these models are highly inaccurate,…

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: presented at SIGMETRICS 2018

  19. arXiv:1805.03175  [pdf, other]

    cs.AR

    Voltron: Understanding and Exploiting the Voltage-Latency-Reliability Trade-Offs in Modern DRAM Chips to Improve Energy Efficiency

    Authors: Kevin K. Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O'Connor, Hasan Hassan, Onur Mutlu

    Abstract: This paper summarizes our work on experimental characterization and analysis of reduced-voltage operation in modern DRAM chips, which was published in SIGMETRICS 2017, and examines the work's significance and future potential. We take a comprehensive approach to understanding and exploiting the latency and reliability characteristics of modern DRAM when the DRAM supply voltage is lowered below t…

    Submitted 8 May, 2018; originally announced May 2018.

  20. Understanding Reduced-Voltage Operation in Modern DRAM Chips: Characterization, Analysis, and Mechanisms

    Authors: Kevin K. Chang, Abdullah Giray Yağlıkçı, Saugata Ghose, Aditya Agrawal, Niladrish Chatterjee, Abhijith Kashyap, Donghyuk Lee, Mike O'Connor, Hasan Hassan, Onur Mutlu

    Abstract: The energy consumption of DRAM is a critical concern in modern computing systems. Improvements in manufacturing process technology have allowed DRAM vendors to lower the DRAM supply voltage conservatively, which reduces some of the DRAM energy consumption. We would like to reduce the DRAM supply voltage more aggressively, to further reduce energy. Aggressive supply voltage reduction requires a tho…

    Submitted 29 May, 2017; originally announced May 2017.

    Comments: 25 pages, 25 figures, 7 tables, Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS)

  21. arXiv:1705.01626  [pdf, other]

    cs.LG cs.AR

    Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks

    Authors: Minsoo Rhu, Mike O'Connor, Niladrish Chatterjee, Jeff Pool, Stephen W. Keckler

    Abstract: Popular deep learning frameworks require users to fine-tune their memory usage so that the training data of a deep neural network (DNN) fits within the GPU physical memory. Prior work tries to address this restriction by virtualizing the memory usage of DNNs, enabling both CPU and GPU memory to be utilized for memory allocations. Despite its merits, virtualizing memory can incur significant perfor…

    Submitted 3 May, 2017; originally announced May 2017.