Skip to main content

Showing 1–50 of 201 results for author: Nitish

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06910  [pdf, other

    cs.IR cs.AI cs.LG

    Fine-grained large-scale content recommendations for MSX sellers

    Authors: Manpreet Singh, Ravdeep Pasricha, Ravi Prasad Kondapalli, Kiran R, Nitish Singh, Akshita Agarwalla, Manoj R, Manish Prabhakar, Laurent Boué

    Abstract: One of the most critical tasks of Microsoft sellers is to meticulously track and nurture potential business opportunities through proactive engagement and tailored solutions. Recommender systems play a central role to help sellers achieve their goals. In this paper, we present a content recommendation model which surfaces various types of content (technical documentation, comparison with competito… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 21, 2024

  2. arXiv:2407.04440  [pdf, ps, other

    cs.LG cs.NE

    Wavelet-based Temporal Attention Improves Traffic Forecasting

    Authors: Yash Jakhmola, Nitish Kumar Mishra, Kripabandhu Ghosh, Tanujit Chakraborty

    Abstract: Spatio-temporal forecasting of traffic flow data represents a typical problem in the field of machine learning, impacting urban traffic management systems. Traditional statistical and machine learning methods cannot adequately handle both the temporal and spatial dependencies in these complex traffic flow datasets. A prevalent approach in the field is to combine graph convolutional networks and mu… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  3. arXiv:2406.12158  [pdf, other

    cs.CL cs.AI

    LLMs Are Prone to Fallacies in Causal Inference

    Authors: Nitish Joshi, Abulhair Saparov, Yixin Wang, He He

    Abstract: Recent work shows that causal facts can be effectively extracted from LLMs through prompting, facilitating the creation of causal graphs for causal inference tasks. However, it is unclear if this success is limited to explicitly-mentioned causal facts in the pretraining data which the model can memorize. Thus, this work investigates: Can LLMs infer causal relations from other relational data in te… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  4. arXiv:2405.12995  [pdf, other

    cond-mat.mtrl-sci cs.CE

    High-fidelity level-set modeling of diffusive solid-state phase transformations for polycrystalline materials

    Authors: Nitish Chandrappa, Marc Bernacki

    Abstract: The formation of microstructures in metallic alloys during hot metal forming involves simultaneous metallurgical complex phenomena. Traditional high-fidelity numerical frameworks used on the polycrystalline scale tend to focus on single-phase microstructures or isolate phase transformations from grain boundary migration mechanisms. The level-set method is highlighted as effective in proposing a gl… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  5. arXiv:2405.09464  [pdf, other

    quant-ph cs.PF

    Scalable Scheduling Policies for Quantum Satellite Networks

    Authors: Albert Williams, Nitish K. Panigrahy, Andrew McGregor, Don Towsley

    Abstract: As Low Earth Orbit (LEO) satellite mega constellations continue to be deployed for satellite internet and recent successful experiments in satellite-based quantum entanglement distribution emerge, a natural question arises: How should we coordinate transmissions and design scalable scheduling policies for a quantum satellite internet? In this work, we consider the problem of transmission schedulin… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  6. arXiv:2404.16816  [pdf, other

    cs.CL

    IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages

    Authors: Harman Singh, Nitish Gupta, Shikhar Bharadwaj, Dinesh Tewari, Partha Talukdar

    Abstract: As large language models (LLMs) see increasing adoption across the globe, it is imperative for LLMs to be representative of the linguistic diversity of the world. India is a linguistically diverse country of 1.4 Billion people. To facilitate research on multilingual LLM evaluation, we release IndicGenBench - the largest benchmark for evaluating LLMs on user-facing generation tasks across a diverse… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  7. arXiv:2404.14248  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results

    Authors: Xiaoning Liu, Zongwei Wu, Ao Li, Florin-Alexandru Vasluianu, Yulun Zhang, Shuhang Gu, Le Zhang, Ce Zhu, Radu Timofte, Zhi **, Hongjun Wu, Chenxi Wang, Haitao Ling, Yuanhao Cai, Hao Bian, Yuxin Zheng, **g Lin, Alan Yuille, Ben Shao, ** Guo, Tianli Liu, Mohao Wu, Yixu Feng, Shuo Hou, Haotian Lin , et al. (87 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlig… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 Challenge Report

  8. arXiv:2403.00781  [pdf, other

    cs.IR cs.AI cs.LG cs.MM

    ChatDiet: Empowering Personalized Nutrition-Oriented Food Recommender Chatbots through an LLM-Augmented Framework

    Authors: Zhongqi Yang, Elahe Khatibi, Nitish Nagesh, Mahyar Abbasian, Iman Azimi, Ramesh Jain, Amir M. Rahmani

    Abstract: The profound impact of food on health necessitates advanced nutrition-oriented food recommendation services. Conventional methods often lack the crucial elements of personalization, explainability, and interactivity. While Large Language Models (LLMs) bring interpretability and explainability, their standalone use falls short of achieving true personalization. In this paper, we introduce ChatDiet,… ▽ More

    Submitted 16 March, 2024; v1 submitted 18 February, 2024; originally announced March 2024.

    Comments: Accepted by The IEEE/ACM international conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE) 2024

  9. arXiv:2402.10153  [pdf, other

    cs.CL

    Knowledge-Infused LLM-Powered Conversational Health Agent: A Case Study for Diabetes Patients

    Authors: Mahyar Abbasian, Zhongqi Yang, Elahe Khatibi, Pengfei Zhang, Nitish Nagesh, Iman Azimi, Ramesh Jain, Amir M. Rahmani

    Abstract: Effective diabetes management is crucial for maintaining health in diabetic patients. Large Language Models (LLMs) have opened new avenues for diabetes management, facilitating their efficacy. However, current LLM-based approaches are limited by their dependence on general sources and lack of integration with domain-specific knowledge, leading to inaccurate responses. In this paper, we propose a k… ▽ More

    Submitted 28 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 4 pages, 3 figures, and 2 tables, conference paper

  10. arXiv:2402.07411  [pdf, other

    cs.LG

    Potential-Based Reward Sha** For Intrinsic Motivation

    Authors: Grant C. Forbes, Nitish Gupta, Leonardo Villalobos-Arias, Colin M. Potts, Arnav Jhala, David L. Roberts

    Abstract: Recently there has been a proliferation of intrinsic motivation (IM) reward-sha** methods to learn in complex and sparse-reward environments. These methods can often inadvertently change the set of optimal policies in an environment, leading to suboptimal behavior. Previous work on mitigating the risks of reward sha**, particularly through potential-based reward sha** (PBRS), has not been ap… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Extended version of paper appearing in AAMAS 2024

    ACM Class: I.2.6

  11. arXiv:2401.17217  [pdf, other

    cs.HC cs.CV

    GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear

    Authors: Robert Konrad, Nitish Padmanaban, J. Gabriel Buckmaster, Kevin C. Boyle, Gordon Wetzstein

    Abstract: Multimodal large language models (LMMs) excel in world knowledge and problem-solving abilities. Through the use of a world-facing camera and contextual AI, emerging smart accessories aim to provide a seamless interface between humans and LMMs. Yet, these wearable computing systems lack an understanding of the user's attention. We introduce GazeGPT as a new user interaction paradigm for contextual… ▽ More

    Submitted 31 January, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Project video: https://youtu.be/AuDFHHTK_m8

  12. arXiv:2401.10823  [pdf, ps, other

    cs.NI quant-ph

    Reconfigurable Intelligent Surface (RIS)-Assisted Entanglement Distribution in FSO Quantum Networks

    Authors: Mahdi Chehimi, Mohamed Elhattab, Walid Saad, Gayane Vardoyan, Nitish K. Panigrahy, Chadi Assi, Don Towsley

    Abstract: Quantum networks (QNs) relying on free-space optical (FSO) quantum channels can support quantum applications in environments wherein establishing an optical fiber infrastructure is challenging and costly. However, FSO-based QNs require a clear line-of-sight (LoS) between users, which is challenging due to blockages and natural obstacles. In this paper, a reconfigurable intelligent surface (RIS)-as… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: 13 pages, 7 figures, 1 table

  13. arXiv:2401.04732  [pdf, other

    cs.IR cs.AI cs.LG

    A case study of Generative AI in MSX Sales Copilot: Improving seller productivity with a real-time question-answering system for content recommendation

    Authors: Manpreet Singh, Ravdeep Pasricha, Nitish Singh, Ravi Prasad Kondapalli, Manoj R, Kiran R, Laurent Boué

    Abstract: In this paper, we design a real-time question-answering system specifically targeted for hel** sellers get relevant material/documentation they can share live with their customers or refer to during a call. Taking the Seismic content repository as a relatively large scale example of a diverse dataset of sales material, we demonstrate how LLM embeddings of sellers' queries can be matched with the… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

  14. arXiv:2401.02412  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    LLM Augmented LLMs: Expanding Capabilities through Composition

    Authors: Rachit Bansal, Bidisha Samanta, Siddharth Dalmia, Nitish Gupta, Shikhar Vashishth, Sriram Ganapathy, Abhishek Bapna, Prateek Jain, Partha Talukdar

    Abstract: Foundational models with billions of parameters which have been trained on large corpora of data have demonstrated non-trivial skills in a variety of domains. However, due to their monolithic structure, it is challenging and expensive to augment them or impart new skills. On the other hand, due to their adaptation abilities, several new instances of these models are being trained towards new domai… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 17 pages, 2 figures, 8 tables

  15. arXiv:2311.02630  [pdf

    cs.CR cs.AI cs.CY cs.LG

    The New Frontier of Cybersecurity: Emerging Threats and Innovations

    Authors: Daksh Dave, Gauransh Sawhney, Pushkar Aggarwal, Nitish Silswal, Dhruv Khut

    Abstract: In today's digitally interconnected world, cybersecurity threats have reached unprecedented levels, presenting a pressing concern for individuals, organizations, and governments. This study employs a qualitative research approach to comprehensively examine the diverse threats of cybersecurity and their impacts across various sectors. Four primary categories of threats are identified and analyzed,… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 Tables

    Journal ref: 2023 29th International Conference on Telecommunications (ICT), pp. 1-6, 2023

  16. arXiv:2310.18168  [pdf, other

    cs.CL cs.AI cs.LG

    Personas as a Way to Model Truthfulness in Language Models

    Authors: Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He

    Abstract: Large language models (LLMs) are trained on vast amounts of text from the internet, which contains both factual and misleading information about the world. While unintuitive from a classic view of LMs, recent work has shown that the truth value of a statement can be elicited from the model's representations. This paper presents an explanation for why LMs appear to know the truth despite not being… ▽ More

    Submitted 6 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  17. arXiv:2310.11266  [pdf

    cs.CL cs.AI cs.NE

    Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models

    Authors: Khushboo Verma, Marina Moore, Stephanie Wottrich, Karla Robles López, Nishant Aggarwal, Zeel Bhatt, Aagamjit Singh, Bradford Unroe, Salah Basheer, Nitish Sachdeva, Prinka Arora, Harmanjeet Kaur, Tanupreet Kaur, Tevon Hood, Anahi Marquez, Tushar Varshney, Nanfu Deng, Azaan Ramani, Pawanraj Ishwara, Maimoona Saeed, Tatiana López Velarde Peña, Bryan Barksdale, Sushovan Guha, Satwant Kumar

    Abstract: In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to effectively quantify… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  18. arXiv:2310.10606  [pdf, other

    cs.RO cs.LG

    BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning

    Authors: Tianle Huang, Nitish Sontakke, K. Niranjan Kumar, Irfan Essa, Stefanos Nikolaidis, Dennis W. Hong, Sehoon Ha

    Abstract: Domain randomization (DR), which entails training a policy with randomized dynamics, has proven to be a simple yet effective algorithm for reducing the gap between simulation and the real world. However, DR often requires careful tuning of randomization parameters. Methods like Bayesian Domain Randomization (Bayesian DR) and Active Domain Randomization (Adaptive DR) address this issue by automatin… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

  19. arXiv:2310.09723  [pdf, other

    cs.IT eess.SP

    A generalization of the achievable rate of a MISO system using Bode-Fano wideband matching theory

    Authors: Nitish Deshpande, Miguel R. Castellanos, Saeed R. Khosravirad, **feng Du, Harish Viswanathan, Robert W. Heath Jr

    Abstract: Impedance-matching networks affect power transfer from the radio frequency (RF) chains to the antennas. Their design impacts the signal to noise ratio (SNR) and the achievable rate. In this paper, we maximize the information-theoretic achievable rate of a multiple-input-single-output (MISO) system with wideband matching constraints. Using a multiport circuit theory approach with frequency-selectiv… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  20. arXiv:2308.11673  [pdf, other

    eess.SP cs.LG

    WEARS: Wearable Emotion AI with Real-time Sensor data

    Authors: Dhruv Limbani, Daketi Yatin, Nitish Chaturvedi, Vaishnavi Moorthy, Pushpalatha M, Harichandana BSS, Sumit Kumar

    Abstract: Emotion prediction is the field of study to understand human emotions. Existing methods focus on modalities like text, audio, facial expressions, etc., which could be private to the user. Emotion can be derived from the subject's psychological data as well. Various approaches that employ combinations of physiological sensors for emotion recognition have been proposed. Yet, not all sensors are simp… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  21. arXiv:2308.11442  [pdf, other

    cs.CV

    SDeMorph: Towards Better Facial De-morphing from Single Morph

    Authors: Nitish Shukla

    Abstract: Face Recognition Systems (FRS) are vulnerable to morph attacks. A face morph is created by combining multiple identities with the intention to fool FRS and making it match the morph with multiple identities. Current Morph Attack Detection (MAD) can detect the morph but are unable to recover the identities used to create the morph with satisfactory outcomes. Existing work in de-morphing is mostly r… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

  22. arXiv:2307.06492  [pdf, other

    quant-ph cs.NI

    Universal Quantum Walk Control Plane for Quantum Networks

    Authors: Matheus Guedes de Andrade, Nitish K. Panigrahy, Wenhan Dai, Saikat Guha, Don Towsley

    Abstract: Quantum networks are complex systems formed by the interaction among quantum processors through quantum channels. Analogous to classical computer networks, quantum networks allow for the distribution of quantum operations among quantum processors. In this work, we describe a Quantum Walk Control Protocol (QWCP) to perform distributed quantum operations in a quantum network. We consider a generaliz… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 27 pages; 2 figures. A preliminary version of this work was presented at IEEE International Conference on Quantum Computing and Engineering 2021 (QCE21). arXiv admin note: text overlap with arXiv:2106.09839

  23. arXiv:2306.14846  [pdf, other

    cs.RO cs.CV cs.LG

    ViNT: A Foundation Model for Visual Navigation

    Authors: Dhruv Shah, Ajay Sridhar, Nitish Dashora, Kyle Stachowicz, Kevin Black, Noriaki Hirose, Sergey Levine

    Abstract: General-purpose pre-trained models ("foundation models") have enabled practitioners to produce generalizable solutions for individual machine learning problems with datasets that are significantly smaller than those required for learning from scratch. Such models are typically trained on large and diverse datasets with weak supervision, consuming much more training data than is available for any i… ▽ More

    Submitted 24 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted for oral presentation at CoRL 2023

  24. arXiv:2305.15269  [pdf, other

    cs.CL cs.AI

    Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples

    Authors: Abulhair Saparov, Richard Yuanzhe Pang, Vishakh Padmakumar, Nitish Joshi, Seyed Mehran Kazemi, Najoung Kim, He He

    Abstract: Given the intractably large size of the space of proofs, any model that is capable of general deductive reasoning must generalize to proofs of greater complexity. Recent studies have shown that large language models (LLMs) possess some abstract deductive reasoning ability given chain-of-thought prompts. However, they have primarily been tested on proofs using modus ponens or of a specific size, an… ▽ More

    Submitted 3 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  25. arXiv:2305.13299  [pdf, other

    cs.CL cs.AI cs.LG

    Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations

    Authors: Chenglei Si, Dan Friedman, Nitish Joshi, Shi Feng, Danqi Chen, He He

    Abstract: In-context learning (ICL) is an important paradigm for adapting large language models (LLMs) to new tasks, but the generalization behavior of ICL remains poorly understood. We investigate the inductive biases of ICL from the perspective of feature bias: which feature ICL is more likely to use given a set of underspecified demonstrations in which two features are equally predictive of the labels. F… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  26. XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

    Authors: Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson , et al. (2 additional authors not shown)

    Abstract: Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP re-search is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot;… ▽ More

    Submitted 24 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  27. arXiv:2305.08696  [pdf, other

    cs.NI quant-ph

    Scaling Limits of Quantum Repeater Networks

    Authors: Mahdi Chehimi, Shahrooz Pouryousef, Nitish K. Panigrahy, Don Towsley, Walid Saad

    Abstract: Quantum networks (QNs) are a promising platform for secure communications, enhanced sensing, and efficient distributed quantum computing. However, due to the fragile nature of quantum states, these networks face significant challenges in terms of scalability. In this paper, the scaling limits of quantum repeater networks (QRNs) are analyzed. The goal of this work is to maximize the overall length,… ▽ More

    Submitted 26 July, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 6 pages, 10 figures

  28. arXiv:2305.03317  [pdf, other

    cs.DC

    StarPlat: A Versatile DSL for Graph Analytics

    Authors: Nibedita Behera, Ashwina Kumar, Ebenezer Rajadurai T, Sai Nitish, Rajesh Pandian M, Rupesh Nasre

    Abstract: Graphs model several real-world phenomena. With the growth of unstructured and semi-structured data, parallelization of graph algorithms is inevitable. Unfortunately, due to inherent irregularity of computation, memory access, and communication, graph algorithms are traditionally challenging to parallelize. To tame this challenge, several libraries, frameworks, and domain-specific languages (DSLs)… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 30 pages, 21 figures

  29. arXiv:2305.03231  [pdf, other

    quant-ph cs.NI cs.PF

    Resource Management in Quantum Virtual Private Networks

    Authors: Shahrooz Pouryousef, Nitish K. Panigrahy, Monimoy Deb Purkayastha, Sabyasachi Mukhopadhyay, Gert Grammel, Domenico Di Mola, Don Towsley

    Abstract: In this study, we develop a resource management framework for a quantum virtual private network (qVPN), which involves the sharing of an underlying public quantum network by multiple organizations for quantum entanglement distribution. Our approach involves resolving the issue of link entanglement resource allocation in a qVPN by utilizing a centralized optimization framework. We provide insights… ▽ More

    Submitted 7 July, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  30. arXiv:2304.04386  [pdf, ps, other

    cs.LG cs.CR cs.CV

    Generating Adversarial Attacks in the Latent Space

    Authors: Nitish Shukla, Sudipta Banerjee

    Abstract: Adversarial attacks in the input (pixel) space typically incorporate noise margins such as $L_1$ or $L_{\infty}$-norm to produce imperceptibly perturbed data that confound deep learning networks. Such noise margins confine the magnitude of permissible noise. In this work, we propose injecting adversarial perturbations in the latent (feature) space using a generative adversarial network, removing t… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

  31. arXiv:2303.15331  [pdf, other

    cs.RO

    Learning a Single Policy for Diverse Behaviors on a Quadrupedal Robot using Scalable Motion Imitation

    Authors: Arnaud Klipfel, Nitish Sontakke, Ren Liu, Sehoon Ha

    Abstract: Learning various motor skills for quadrupedal robots is a challenging problem that requires careful design of task-specific mathematical models or reward descriptions. In this work, we propose to learn a single capable policy using deep reinforcement learning by imitating a large number of reference motions, including walking, turning, pacing, jum**, sitting, and lying. On top of the existing mo… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  32. arXiv:2303.13974   

    cs.LG

    Mixed-Type Wafer Classification For Low Memory Devices Using Knowledge Distillation

    Authors: Nitish Shukla, Anurima Dey, Srivatsan K

    Abstract: Manufacturing wafers is an intricate task involving thousands of steps. Defect Pattern Recognition (DPR) of wafer maps is crucial for determining the root cause of production defects, which may further provide insight for yield improvement in wafer foundry. During manufacturing, various defects may appear standalone in the wafer or may appear as different combinations. Identifying multiple defects… ▽ More

    Submitted 18 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Study is not relevant

  33. arXiv:2303.13827   

    cs.CV cs.LG

    Efficient Mixed-Type Wafer Defect Pattern Recognition Using Compact Deformable Convolutional Transformers

    Authors: Nitish Shukla

    Abstract: Manufacturing wafers is an intricate task involving thousands of steps. Defect Pattern Recognition (DPR) of wafer maps is crucial to find the root cause of the issue and further improving the yield in the wafer foundry. Mixed-type DPR is much more complicated compared to single-type DPR due to varied spatial features, the uncertainty of defects, and the number of defects present. To accurately pre… ▽ More

    Submitted 16 October, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: Study is not relevant

  34. arXiv:2303.11632   

    cs.CV cs.LG eess.IV

    An Embarrassingly Simple Approach for Wafer Feature Extraction and Defect Pattern Recognition

    Authors: Nitish Shukla

    Abstract: Identifying defect patterns in a wafer map during manufacturing is crucial to find the root cause of the underlying issue and provides valuable insights on improving yield in the foundry. Currently used methods use deep neural networks to identify the defects. These methods are generally very huge and have significant inference time. They also require GPU support to efficiently operate. All these… ▽ More

    Submitted 16 October, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: study is not relevant

  35. arXiv:2303.09597  [pdf, other

    cs.RO cs.AI

    Residual Physics Learning and System Identification for Sim-to-real Transfer of Policies on Buoyancy Assisted Legged Robots

    Authors: Nitish Sontakke, Hosik Chae, Sangjoon Lee, Tianle Huang, Dennis W. Hong, Sehoon Ha

    Abstract: The light and soft characteristics of Buoyancy Assisted Lightweight Legged Unit (BALLU) robots have a great potential to provide intrinsically safe interactions in environments involving humans, unlike many heavy and rigid robots. However, their unique and sensitive dynamics impose challenges to obtaining robust control policies in the real world. In this work, we demonstrate robust sim-to-real tr… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

  36. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  37. arXiv:2301.03996  [pdf, other

    eess.IV cs.IT

    Collaborative Semantic Communication for Edge Inference

    Authors: Wing Fei Lo, Nitish Mital, Haotian Wu, Deniz GĂĽndĂĽz

    Abstract: We study the collaborative image retrieval problem at the wireless edge, where multiple edge devices capture images of the same object from different angles and locations, which are then used jointly to retrieve similar images at the edge server over a shared multiple access channel (MAC). We propose two novel deep learning-based joint source and channel coding (JSCC) schemes for the task over bot… ▽ More

    Submitted 12 February, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    MSC Class: 94A24 ACM Class: E.4

  38. arXiv:2212.04549  [pdf, other

    cs.RO eess.SY

    Optimizing Real-Time Performances for Timed-Loop Racing under F1TENTH

    Authors: Nitish Gupta, Kurt Wilson, Zhishan Guo

    Abstract: Motion planning and control in autonomous car racing are one of the most challenging and safety-critical tasks due to high speed and dynamism. The lower-level control nodes are expected to be highly optimized due to resource constraints of onboard embedded processing units, although there are strict latency requirements. Some of these guarantees can be provided at the application level, such as us… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the 43rd IEEE Real-Time Systems Symposium (RTSS), Industry Challenge, Houston, US, Dec. 2022

  39. arXiv:2212.01694  [pdf, other

    quant-ph cs.NI cs.PF

    A Quantum Overlay Network for Efficient Entanglement Distribution

    Authors: Shahrooz Pouryousef, Nitish K. Panigrahy, Don Towsley

    Abstract: Distributing quantum entanglements over long distances is essential for the realization of a global scale quantum Internet. Most of the prior work and proposals assume an on-demand distribution of entanglements which may result in significant network resource under-utilization. In this work, we introduce Quantum Overlay Networks (QONs) for efficient entanglement distribution in quantum networks. W… ▽ More

    Submitted 3 December, 2022; originally announced December 2022.

  40. arXiv:2212.01463  [pdf, other

    quant-ph cs.NI cs.PF

    On the Capacity Region of a Quantum Switch with Entanglement Purification

    Authors: Nitish K. Panigrahy, Thirupathaiah Vasantam, Don Towsley, Leandros Tassiulas

    Abstract: Quantum switches are envisioned to be an integral component of future entanglement distribution networks. They can provide high quality entanglement distribution service to end-users by performing quantum operations such as entanglement swap** and entanglement purification. In this work, we characterize the capacity region of such a quantum switch under noisy channel transmissions and imperfect… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 10 pages, 4 figures, accepted for a talk at the IEEE International Conference on Computer Communications (INFOCOM), 2023

  41. arXiv:2210.14011  [pdf, other

    cs.CL cs.LG

    Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens

    Authors: Nitish Joshi, Xiang Pan, He He

    Abstract: The term `spurious correlations' has been used in NLP to informally denote any undesirable feature-label correlations. However, a correlation can be undesirable because (i) the feature is irrelevant to the label (e.g. punctuation in a review), or (ii) the feature's effect on the label depends on the context (e.g. negation words in a review), which is ubiquitous in language tasks. In case (i), we w… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: EMNLP 2022

  42. arXiv:2210.07313  [pdf, other

    cs.CL cs.LG

    Bootstrap** Multilingual Semantic Parsers using Large Language Models

    Authors: Abhijeet Awasthi, Nitish Gupta, Bidisha Samanta, Shachi Dave, Sunita Sarawagi, Partha Talukdar

    Abstract: Despite cross-lingual generalization demonstrated by pre-trained multilingual models, the translate-train paradigm of transferring English datasets across multiple languages remains to be a key mechanism for training task-specific multilingual models. However, for many low-resource languages, the availability of a reliable translation service entails significant amounts of costly human-annotated t… ▽ More

    Submitted 11 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL-23

  43. arXiv:2210.01302  [pdf, other

    cs.LG cs.CV

    Nuisances via Negativa: Adjusting for Spurious Correlations via Data Augmentation

    Authors: Aahlad Puli, Nitish Joshi, Yoav Wald, He He, Rajesh Ranganath

    Abstract: In prediction tasks, there exist features that are related to the label in the same way across different settings for that task; these are semantic features or semantics. Features with varying relationships to the label are nuisances. For example, in detecting cows from natural images, the shape of the head is semantic but because images of cows often have grass backgrounds but not always, the bac… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 October, 2022; originally announced October 2022.

  44. ReAct: A Review Comment Dataset for Actionability (and more)

    Authors: Gautam Choudhary, Natwar Modani, Nitish Maurya

    Abstract: Review comments play an important role in the evolution of documents. For a large document, the number of review comments may become large, making it difficult for the authors to quickly grasp what the comments are about. It is important to identify the nature of the comments to identify which comments require some action on the part of document authors, along with identifying the types of these c… ▽ More

    Submitted 2 October, 2022; originally announced October 2022.

    Comments: Published at WISE 2021

  45. arXiv:2209.02600  [pdf

    cs.CV

    Domain Engineering for Applied Monocular Reconstruction of Parametric Faces

    Authors: Igor Borovikov, Karine Levonyan, Jon Rein, Pawel Wrotek, Nitish Victor

    Abstract: Many modern online 3D applications and video games rely on parametric models of human faces for creating believable avatars. However, manually reproducing someone's facial likeness with a parametric model is difficult and time-consuming. Machine Learning solution for that task is highly desirable but is also challenging. The paper proposes a novel approach to the so-called Face-to-Parameters probl… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

    Comments: An extended SIPP 2022 conference paper. arXiv admin note: substantial text overlap with arXiv:2208.02935

    Journal ref: Signal and Image Processing: An International Journal, August 2022, Volume 13, No 2/3/4, pages 33-51

  46. arXiv:2208.09434  [pdf, other

    cs.CE cond-mat.mtrl-sci

    A level-set formulation to simulate diffusive solid/solid phase transformation in polycrystalline metallic materials -- Application to austenite decomposition in steels

    Authors: Nitish Chandrappa, Marc Bernacki

    Abstract: Numerous full-field numerical methods exist concerning the digital description of polycrystalline materials and the modeling of their evolution during thermomechanical treatments. However, these strategies are globally dedicated to the modeling of recrystallization and grain growth for single-phase materials, or to the modeling of phase transformations without considering recrystallization and rel… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

  47. arXiv:2208.03645  [pdf, other

    cs.IR cs.AI

    Generating Negative Samples for Sequential Recommendation

    Authors: Yongjun Chen, Jia Li, Zhiwei Liu, Nitish Shirish Keskar, Huan Wang, Julian McAuley, Caiming Xiong

    Abstract: To make Sequential Recommendation (SR) successful, recent works focus on designing effective sequential encoders, fusing side information, and mining extra positive self-supervision signals. The strategy of sampling negative items at each time step is less explored. Due to the dynamics of users' interests and model updates during training, considering randomly sampled items from a user's non-inter… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

  48. arXiv:2208.02935  [pdf

    cs.CV

    Applied monocular reconstruction of parametric faces with domain engineering

    Authors: Igor Borovikov, Karine Levonyan, Jon Rein, Pawel Wrotek, Nitish Victor

    Abstract: Many modern online 3D applications and videogames rely on parametric models of human faces for creating believable avatars. However, manual reproduction of someone's facial likeness with a parametric model is difficult and time-consuming. Machine Learning solution for that task is highly desirable but is also challenging. The paper proposes a novel approach to the so-called Face-to-Parameters prob… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 16 pages; SIPP 2022, 10th International Conference on Signal, Image Processing and Pattern Recognition (London, United Kingdom)

  49. arXiv:2207.08489  [pdf, other

    eess.IV cs.CV

    Neural Distributed Image Compression with Cross-Attention Feature Alignment

    Authors: Nitish Mital, Ezgi Ozyilkan, Ali Garjani, Deniz Gunduz

    Abstract: We consider the problem of compressing an information source when a correlated one is available as side information only at the decoder side, which is a special case of the distributed source coding problem in information theory. In particular, we consider a pair of stereo images, which have overlap** fields of view, and are captured by a synchronized and calibrated pair of cameras as correlated… ▽ More

    Submitted 5 January, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Comments: 16 pages, 15 figures, presented in WACV 2023

  50. arXiv:2207.00630  [pdf, other

    cs.AI

    QA Is the New KR: Question-Answer Pairs as Knowledge Bases

    Authors: Wenhu Chen, William W. Cohen, Michiel De Jong, Nitish Gupta, Alessandro Presta, Pat Verga, John Wieting

    Abstract: In this position paper, we propose a new approach to generating a type of knowledge base (KB) from text, based on question generation and entity linking. We argue that the proposed type of KB has many of the key advantages of a traditional symbolic KB: in particular, it consists of small modular components, which can be combined compositionally to answer complex queries, including relational queri… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.