Skip to main content

Showing 1–50 of 278 results for author: Ra, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14373  [pdf, other

    cs.AI cs.CL cs.CY cs.HC cs.MA

    Artificial Leviathan: Exploring Social Evolution of LLM Agents Through the Lens of Hobbesian Social Contract Theory

    Authors: Gordon Dai, Weijia Zhang, **han Li, Siqi Yang, Chidera Onochie lbe, Srihas Rao, Arthur Caetano, Misha Sra

    Abstract: The emergence of Large Language Models (LLMs) and advancements in Artificial Intelligence (AI) offer an opportunity for computational social science research at scale. Building upon prior explorations of LLM agent design, our work introduces a simulated agent society where complex social relationships dynamically form and evolve over time. Agents are imbued with psychological drives and placed in… ▽ More

    Submitted 1 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.13384  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra, Vinod Rathod

    Abstract: Deepfakes are a major security risk for biometric authentication. This technology creates realistic fake videos that can impersonate real people, fooling systems that rely on facial features and voice patterns for identification. Existing multimodal deepfake detectors rely on conventional fusion methods, such as majority rule and ensemble voting, which often struggle to adapt to changing data char… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  3. arXiv:2406.07559  [pdf, other

    cs.CR eess.SY

    After the Breach: Incident Response within Enterprises

    Authors: Sumanth Rao

    Abstract: Enterprises are constantly under attack from sophisticated adversaries. These adversaries use a variety of techniques to first gain access to the enterprise, then spread laterally inside its networks, establish persistence, and finally exfiltrate sensitive data, or hold it for ransom. While historically, enterprises have used different Incident Response systems that monitor hosts, servers, or netw… ▽ More

    Submitted 13 June, 2024; v1 submitted 30 April, 2024; originally announced June 2024.

  4. arXiv:2406.04482  [pdf, other

    cs.CL cs.AI cs.HC cs.SE

    Automatic Bug Detection in LLM-Powered Text-Based Games Using LLMs

    Authors: Claire **, Sudha Rao, Xiangyu Peng, Portia Botchway, Jessica Quaye, Chris Brockett, Bill Dolan

    Abstract: Advancements in large language models (LLMs) are revolutionizing interactive game design, enabling dynamic plotlines and interactions between players and non-player characters (NPCs). However, LLMs may exhibit flaws such as hallucinations, forgetfulness, or misinterpretations of prompts, causing logical inconsistencies and unexpected deviations from intended designs. Automated techniques for detec… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in Findings of the Association for Computational Linguistics: ACL 2024

  5. arXiv:2405.11070  [pdf, other

    cs.AI cs.CL cs.LG

    Jill Watson: A Virtual Teaching Assistant powered by ChatGPT

    Authors: Karan Taneja, Pratyusha Maiti, Sandeep Kakar, Pranav Guruprasad, Sanjeev Rao, Ashok K. Goel

    Abstract: Conversational AI agents often require extensive datasets for training that are not publicly released, are limited to social chit-chat or handling a specific domain, and may not be easily extended to accommodate the latest advances in AI technologies. This paper introduces Jill Watson, a conversational Virtual Teaching Assistant (VTA) leveraging the capabilities of ChatGPT. Jill Watson based on Ch… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  6. arXiv:2405.08927  [pdf, ps, other

    cs.DS

    Expanderizing Higher Order Random Walks

    Authors: Vedat Levi Alev, Shravas Rao

    Abstract: We study a variant of the down-up and up-down walks over an $n$-partite simplicial complex, which we call expanderized higher order random walks -- where the sequence of updated coordinates correspond to the sequence of vertices visited by a random walk over an auxiliary expander graph $H$. When $H$ is the clique, this random walk reduces to the usual down-up walk and when $H$ is the directed cycl… ▽ More

    Submitted 3 June, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  7. arXiv:2404.17027  [pdf, other

    cs.CL cs.AI

    Player-Driven Emergence in LLM-Driven Game Narrative

    Authors: Xiangyu Peng, Jessica Quaye, Sudha Rao, Weijia Xu, Portia Botchway, Chris Brockett, Nebojsa Jojic, Gabriel DesGarennes, Ken Lobb, Michael Xu, Jorge Leandro, Claire **, Bill Dolan

    Abstract: We explore how interaction with large language models (LLMs) can give rise to emergent behaviors, empowering players to participate in the evolution of game narratives. Our testbed is a text-adventure game in which players attempt to solve a mystery under a fixed narrative premise, but can freely interact with non-player characters generated by GPT-4, a large language model. We recruit 28 gamers t… ▽ More

    Submitted 3 June, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted at IEEE Conference on Games 2024

    Journal ref: IEEE Conference on Games 2024

  8. arXiv:2404.12679  [pdf, other

    cs.CV cs.CR

    MLSD-GAN -- Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement

    Authors: Aravinda Reddy PN, Raghavendra Ramachandra, Krothapalli Sreenivasa Rao, Pabitra Mitra

    Abstract: Face-morphing attacks are a growing concern for biometric researchers, as they can be used to fool face recognition systems (FRS). These attacks can be generated at the image level (supervised) or representation level (unsupervised). Previous unsupervised morphing attacks have relied on generative adversarial networks (GANs). More recently, researchers have used linear interpolation of StyleGAN-en… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  9. arXiv:2404.05959  [pdf

    physics.optics cs.AI

    Map Optical Properties to Subwavelength Structures Directly via a Diffusion Model

    Authors: Shijie Rao, Kaiyu Cui, Yidong Huang, Jiawei Yang, Yali Li, Sheng** Wang, Xue Feng, Fang Liu, Wei Zhang

    Abstract: Subwavelength photonic structures and metamaterials provide revolutionary approaches for controlling light. The inverse design methods proposed for these subwavelength structures are vital to the development of new photonic devices. However, most of the existing inverse design methods cannot realize direct map** from optical properties to photonic structures but instead rely on forward simulatio… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  10. arXiv:2404.03800  [pdf, other

    cs.LG cs.HC

    Learning Social Fairness Preferences from Non-Expert Stakeholder Opinions in Kidney Placement

    Authors: Mukund Telukunta, Sukruth Rao, Gabriella Stickney, Venkata Sriram Siddardh Nadendla, Casey Canfield

    Abstract: Modern kidney placement incorporates several intelligent recommendation systems which exhibit social discrimination due to biases inherited from training data. Although initial attempts were made in the literature to study algorithmic fairness in kidney placement, these methods replace true outcomes with surgeons' decisions due to the long delays involved in recording such outcomes reliably. Howev… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Submitted to CHIL (Conference on Health, Inference, and Learning) 2024

  11. arXiv:2403.14063  [pdf, other

    cs.LG cs.CE q-fin.CP q-fin.PM

    DiffSTOCK: Probabilistic relational Stock Market Predictions using Diffusion Models

    Authors: Divyanshu Daiya, Monika Yadav, Harshit Singh Rao

    Abstract: In this work, we propose an approach to generalize denoising diffusion probabilistic models for stock market predictions and portfolio management. Present works have demonstrated the efficacy of modeling interstock relations for market time-series forecasting and utilized Graph-based learning models for value prediction and portfolio management. Though convincing, these deterministic approaches st… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted for presentation to the 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), Seoul, Korea

  12. arXiv:2403.04370  [pdf, ps, other

    cs.MA

    Cooperative Task Execution in Multi-Agent Systems

    Authors: Karishma, Shrisha Rao

    Abstract: We propose a multi-agent system that enables groups of agents to collaborate and work autonomously to execute tasks. Groups can work in a decentralized manner and can adapt to dynamic changes in the environment. Groups of agents solve assigned tasks by exploring the solution space cooperatively based on the highest reward first. The tasks have a dependency structure associated with them. We rigoro… ▽ More

    Submitted 20 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: 14 pages, in LNCS format

    MSC Class: 68T42 ACM Class: I.2.11

  13. BOXREC: Recommending a Box of Preferred Outfits in Online Shop**

    Authors: Debopriyo Banerjee, Krothapalli Sreenivasa Rao, Shamik Sural, Niloy Ganguly

    Abstract: Over the past few years, automation of outfit composition has gained much attention from the research community. Most of the existing outfit recommendation systems focus on pairwise item compatibility prediction (using visual and text features) to score an outfit combination having several items, followed by recommendation of top-n outfits or a capsule wardrobe having a collection of outfits based… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Journal ref: ACM Trans. Intell. Syst. Technol. 11, 6, Article 69 (December 2020), pages 69:1-69:28

  14. arXiv:2402.13237  [pdf, other

    cs.LO cs.FL

    Continuous Pushdown VASS in One Dimension are Easy

    Authors: Guillermo A. Perez, Shrisha Rao

    Abstract: A pushdown vector addition system with states (PVASS) extends the model of vector addition systems with a pushdown stack. The algorithmic analysis of PVASS has applications such as static analysis of recursive programs manipulating integer variables. Unfortunately, reachability analysis, even for one-dimensional PVASS is not known to be decidable. We relax the model of one-dimensional PVASS to mak… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 2 tables, 6 figures, 12 pages

  15. arXiv:2402.03119  [pdf, other

    cs.CV cs.AI cs.LG

    Good Teachers Explain: Explanation-Enhanced Knowledge Distillation

    Authors: Amin Parchami-Araghi, Moritz Böhle, Sukrut Rao, Bernt Schiele

    Abstract: Knowledge Distillation (KD) has proven effective for compressing large teacher models into smaller student models. While it is well known that student models can achieve similar accuracies as the teachers, it has also been shown that they nonetheless often do not learn the same function. It is, however, often highly desirable that the student's and teacher's functions share similar properties such… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 21 pages, 12 figures

  16. arXiv:2401.12863  [pdf, other

    cs.CL cs.AI

    KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning

    Authors: Debjyoti Mondal, Suraj Modi, Subhadarshi Panda, Rituraj Singh, Godawari Sudhakar Rao

    Abstract: Large Language Models (LLMs) have demonstrated impressive performance in natural language processing tasks by leveraging chain of thought (CoT) that enables step-by-step thinking. Extending LLMs with multimodal capabilities is the recent interest, but incurs computational cost and requires substantial hardware resources. To address these challenges, we propose KAM-CoT a framework that integrates C… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: AAAI 2024

  17. arXiv:2401.05683  [pdf, other

    cs.GT cs.AI

    Deep Learning Meets Mechanism Design: Key Results and Some Novel Applications

    Authors: V. Udaya Sankar, Vishisht Srihari Rao, Y. Narahari

    Abstract: Mechanism design is essentially reverse engineering of games and involves inducing a game among strategic agents in a way that the induced game satisfies a set of desired properties in an equilibrium of the game. Desirable properties for a mechanism include incentive compatibility, individual rationality, welfare maximisation, revenue maximisation (or cost minimisation), fairness of allocation, et… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  18. arXiv:2401.05627  [pdf, other

    cs.DS

    Deterministic Near-Linear Time Minimum Cut in Weighted Graphs

    Authors: Monika Henzinger, Jason Li, Satish Rao, Di Wang

    Abstract: In 1996, Karger [Kar96] gave a startling randomized algorithm that finds a minimum-cut in a (weighted) graph in time $O(m\log^3n)$ which he termed near-linear time meaning linear (in the size of the input) times a polylogarthmic factor. In this paper, we give the first deterministic algorithm which runs in near-linear time for weighted graphs. Previously, the breakthrough results of Kawarabayash… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: SODA 2024, 60 pages

  19. arXiv:2401.05121  [pdf, other

    cs.ET cs.LG

    Photonics for Sustainable Computing

    Authors: Farbin Fayza, Satyavolu Papa Rao, Darius Bunandar, Udit Gupta, Ajay Joshi

    Abstract: Photonic integrated circuits are finding use in a variety of applications including optical transceivers, LIDAR, bio-sensing, photonic quantum computing, and Machine Learning (ML). In particular, with the exponentially increasing sizes of ML models, photonics-based accelerators are getting special attention as a sustainable solution because they can perform ML inferences with multiple orders of ma… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  20. arXiv:2401.01356  [pdf, other

    cs.IR

    Efficient Indexing of Meta-Data (Extracted from Educational Videos)

    Authors: Shalika Kumbham, Abhijit Debnath, Krothapalli Sreenivasa Rao

    Abstract: Video lectures are becoming more popular and in demand as online classroom teaching is becoming more prevalent. Massive Open Online Courses (MOOCs), such as NPTEL, have been creating high-quality educational content that is freely accessible to students online. A large number of colleges across the country are now using NPTEL videos in their classrooms. So more video lectures are being recorded, m… ▽ More

    Submitted 11 December, 2023; originally announced January 2024.

  21. arXiv:2312.17670  [pdf, other

    cs.CV cs.LG q-bio.QM q-bio.TO

    Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA

    Authors: Kaiyuan Yang, Fabio Musio, Yihui Ma, Norman Juchler, Johannes C. Paetzold, Rami Al-Maskari, Luciano Höher, Hongwei Bran Li, Ibrahim Ethem Hamamci, Anjany Sekuboyina, Suprosanna Shit, Hou**g Huang, Chinmay Prabhakar, Ezequiel de la Rosa, Diana Waldmannstetter, Florian Kofler, Fernando Navarro, Martin Menten, Ivan Ezhov, Daniel Rueckert, Iris Vos, Ynte Ruigrok, Birgitta Velthuis, Hugo Kuijf, Julien Hämmerli , et al. (59 additional authors not shown)

    Abstract: The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modaliti… ▽ More

    Submitted 29 April, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Comments: 24 pages, 11 figures, 9 tables. Summary Paper for the MICCAI TopCoW 2023 Challenge

  22. arXiv:2312.11561  [pdf, other

    cs.LG cs.AI

    COPD-FlowNet: Elevating Non-invasive COPD Diagnosis with CFD Simulations

    Authors: Aryan Tyagi, Aryaman Rao, Shubhanshu Rao, Raj Kumar Singh

    Abstract: Chronic Obstructive Pulmonary Disorder (COPD) is a prevalent respiratory disease that significantly impacts the quality of life of affected individuals. This paper presents COPDFlowNet, a novel deep-learning framework that leverages a custom Generative Adversarial Network (GAN) to generate synthetic Computational Fluid Dynamics (CFD) velocity flow field images specific to the trachea of COPD patie… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 2 pages 2 tables 3 figures

  23. arXiv:2312.04838  [pdf, other

    cs.CV

    Learning Generalizable Perceptual Representations for Data-Efficient No-Reference Image Quality Assessment

    Authors: Suhas Srinath, Shankhanil Mitra, Shika Rao, Rajiv Soundararajan

    Abstract: No-reference (NR) image quality assessment (IQA) is an important tool in enhancing the user experience in diverse visual applications. A major drawback of state-of-the-art NR-IQA techniques is their reliance on a large number of human annotations to train models for a target IQA application. To mitigate this requirement, there is a need for unsupervised learning of generalizable quality representa… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted to IEEE/CVF WACV 2024

  24. arXiv:2311.11720  [pdf, ps, other

    cs.RO eess.SY

    Design of Planar Collision-free Trochoidal Paths for a Multi-robot Swarm

    Authors: Adil Shiyas, Sachit Rao

    Abstract: In the literature, a distributed consensus protocol by which a connected swarm of agents can generate artistic patterns in 2-dimensional space is proposed. Motivated by this protocol, in this paper, we design the parameters of this protocol for a 3-agent swarm of non-holonomic robots of finite size that results in the generation of periodic trochoidal trajectories that satisfy a set of geometric a… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  25. arXiv:2311.09213  [pdf, other

    cs.CL

    GENEVA: GENErating and Visualizing branching narratives using LLMs

    Authors: Jorge Leandro, Sudha Rao, Michael Xu, Weijia Xu, Nebosja Jojic, Chris Brockett, Bill Dolan

    Abstract: Dialogue-based Role Playing Games (RPGs) require powerful storytelling. The narratives of these may take years to write and typically involve a large creative team. In this work, we demonstrate the potential of large generative text models to assist this process. \textbf{GENEVA}, a prototype tool, generates a rich narrative graph with branching and reconverging storylines that match a high-level n… ▽ More

    Submitted 5 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE Conference on Games 2024

  26. arXiv:2311.09010  [pdf, ps, other

    quant-ph cs.DS

    Analysis of sum-of-squares relaxations for the quantum rotor model

    Authors: Sujit Rao

    Abstract: The noncommutative sum-of-squares (ncSoS) hierarchy was introduced by Navascués-Pironio-Acín as a sequence of semidefinite programming relaxations for approximating values of noncommutative polynomial optimization problems, which were originally intended to generalize quantum values of nonlocal games. Recent work has started to analyze the hierarchy for approximating ground energies of local Hamil… ▽ More

    Submitted 29 February, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: 34 pages, appeared at QIP 2024

  27. arXiv:2311.07889  [pdf, other

    cs.IT cs.DS

    Optimal RIP Matrices with Slightly Less Randomness

    Authors: Shravas Rao

    Abstract: A matrix $Φ\in \mathbb{R}^{Q \times N}$ satisfies the restricted isometry property if $\|Φx\|_2^2$ is approximately equal to $\|x\|_2^2$ for all $k$-sparse vectors $x$. We give a construction of RIP matrices with the optimal $Q = O(k \log(N/k))$ rows using $O(k\log(N/k)\log(k))$ bits of randomness. The main technical ingredient is an extension of the Hanson-Wright inequality to $ε$-biased distribu… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  28. arXiv:2310.12736  [pdf, other

    cs.CV

    ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swap**

    Authors: Aravinda Reddy PN, K. Sreenivasa Rao, Raghavendra Ramachandra, Pabitra mitra

    Abstract: We present a novel face swap** method using the progressively growing structure of a pre-trained StyleGAN. Previous methods use different encoder decoder structures, embedding integration networks to produce high-quality results, but their quality suffers from entangled representation. We disentangle semantics by deriving identity and attribute features separately. By learning to map the concate… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  29. arXiv:2310.09370  [pdf, ps, other

    cs.CR cs.AI cs.DC

    Near-optimal Differentially Private Client Selection in Federated Settings

    Authors: Syed Eqbal Alam, Dhirendra Shukla, Shrisha Rao

    Abstract: We develop an iterative differentially private algorithm for client selection in federated settings. We consider a federated network wherein clients coordinate with a central server to complete a task; however, the clients decide whether to participate or not at a time step based on their preferences -- local computation and probabilistic intent. The algorithm does not require client-to-client inf… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: To appear in the proceedings of the 59th Annual Allerton Conference on Communication, Control, and Computing, September 2023, Monticello, Illinois, USA

  30. arXiv:2310.05972  [pdf, other

    cs.ET

    Normality of I-V Measurements Using ML

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Craig A. Bridges, Sheng Dai

    Abstract: Electrochemistry ecosystems are promising for accelerating the design and discovery of electrochemical systems for energy storage and conversion, by automating significant parts of workflows that combine synthesis and characterization experiments with computations. They require the integration of flow controllers, solvent containers, pumps, fraction collectors, and potentiostats, all connected to… ▽ More

    Submitted 28 September, 2023; originally announced October 2023.

    Comments: published at eScience 2023

    Journal ref: in 2023 IEEE 19th International Conference on e-Science (e-Science), Limassol, Cyprus, 2023 pp. 1-2

  31. arXiv:2309.07841  [pdf, other

    cs.CR cs.AI

    Two Timin': Repairing Smart Contracts With A Two-Layered Approach

    Authors: Abhinav Jain, Ehan Masud, Michelle Han, Rohan Dhillon, Sumukh Rao, Arya Joshi, Salar Cheema, Saurav Kumar

    Abstract: Due to the modern relevance of blockchain technology, smart contracts present both substantial risks and benefits. Vulnerabilities within them can trigger a cascade of consequences, resulting in significant losses. Many current papers primarily focus on classifying smart contracts for malicious intent, often relying on limited contract characteristics, such as bytecode or opcode. This paper propos… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: Submitted to the 2023 ICI Conference

  32. arXiv:2309.04433  [pdf, other

    cs.LG cs.AI

    Variations and Relaxations of Normalizing Flows

    Authors: Keegan Kelly, Lorena Piedras, Sukrit Rao, David Roth

    Abstract: Normalizing Flows (NFs) describe a class of models that express a complex target distribution as the composition of a series of bijective transformations over a simpler base distribution. By limiting the space of candidate transformations to diffeomorphisms, NFs enjoy efficient, exact sampling and density evaluation, enabling NFs to flexibly behave as both discriminative and generative models. The… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  33. arXiv:2309.01670  [pdf, other

    q-bio.GN cs.LG

    Blind Biological Sequence Denoising with Self-Supervised Set Learning

    Authors: Nathan Ng, Ji Won Park, Jae Hyeon Lee, Ryan Lewis Kelly, Stephen Ra, Kyunghyun Cho

    Abstract: Biological sequence analysis relies on the ability to denoise the imprecise output of sequencing platforms. We consider a common setting where a short sequence is read out repeatedly using a high-throughput long-read platform to generate multiple subreads, or noisy observations of the same sequence. Denoising these subreads with alignment-based approaches often fails when too few subreads are avai… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  34. arXiv:2308.16385  [pdf, other

    cs.LG cs.AI

    BenchTemp: A General Benchmark for Evaluating Temporal Graph Neural Networks

    Authors: Qiang Huang, Jiawei Jiang, Xi Susie Rao, Ce Zhang, Zhichao Han, Zitao Zhang, Xin Wang, Yongjun He, Quanqing Xu, Yang Zhao, Chuang Hu, Shuo Shang, Bo Du

    Abstract: To handle graphs in which features or connectivities are evolving over time, a series of temporal graph neural networks (TGNNs) have been proposed. Despite the success of these TGNNs, the previous TGNN evaluations reveal several limitations regarding four critical issues: 1) inconsistent datasets, 2) inconsistent evaluation pipelines, 3) lacking workload diversity, and 4) lacking efficient compari… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 28 pages, 23 figures, 27 tables. Submitted to the Conference on Neural Information Processing Systems 2023 Track on Datasets and Benchmarks

  35. arXiv:2308.05326  [pdf, other

    q-bio.BM cs.LG

    OpenProteinSet: Training data for structural biology at scale

    Authors: Gustaf Ahdritz, Nazim Bouatta, Sachin Kadyan, Lukas Jarosch, Daniel Berenberg, Ian Fisk, Andrew M. Watkins, Stephen Ra, Richard Bonneau, Mohammed AlQuraishi

    Abstract: Multiple sequence alignments (MSAs) of proteins encode rich biological information and have been workhorses in bioinformatic methods for tasks like protein design and protein structure prediction for decades. Recent breakthroughs like AlphaFold2 that use transformers to attend directly over large quantities of raw MSAs have reaffirmed their importance. Generation of MSAs is highly computationally… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  36. arXiv:2307.07931  [pdf, other

    cs.MS

    ProtoX: A First Look

    Authors: Het Mankad, Sanil Rao, Brian Van Straalen, Phillip Colella, Franz Franchetti

    Abstract: We present a first look at ProtoX, a code generation framework for stencil and pointwise operations that occur frequently in the numerical solution of partial differential equations. ProtoX has Proto as its library frontend and SPIRAL as the backend. Proto is a C++ based domain specific library which optimizes the algorithms used to compute the numerical solution of partial differential equations.… ▽ More

    Submitted 15 July, 2023; originally announced July 2023.

  37. arXiv:2307.06883  [pdf, other

    cs.OH physics.ins-det

    Cyber Framework for Steering and Measurements Collection Over Instrument-Computing Ecosystems

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Helia Zandi, Debangshu Mukherjee, Maxim Ziatdinov, Craig Bridges

    Abstract: We propose a framework to develop cyber solutions to support the remote steering of science instruments and measurements collection over instrument-computing ecosystems. It is based on provisioning separate data and control connections at the network level, and develo** software modules consisting of Python wrappers for instrument commands and Pyro server-client codes that make them available ac… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Paper accepted for presentation at IEEE SMARTCOMP 2023

  38. arXiv:2307.03968  [pdf, other

    cs.CE math.NA

    Multi-Level Power Series Solution for Large Surface and Volume Electric Field Integral Equation

    Authors: Y. K. Negi, N. Balakrishnan, S. M. Rao

    Abstract: In this paper, we propose a new multilevel power series solution method for solving a large surface and volume electric field integral equation based H-Matrix. The proposed solution method converges in a fixed number of iterations and is solved at each level of the H-Matrix computation.The solution method avoids the computation of a full matrix, as it can be solved independently at each level, sta… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: 8 pages. The Applied Computational Electromagnetics Society Journal (ACES) 2023

  39. arXiv:2306.14657  [pdf, other

    cs.RO eess.SY

    A Diversity Analysis of Safety Metrics Comparing Vehicle Performance in the Lead-Vehicle Interaction Regime

    Authors: Harnarayan Singh, Bowen Weng, Sughosh J. Rao, Devin Elsasser

    Abstract: Vehicle performance metrics analyze data sets consisting of subject vehicle's interactions with other road users in a nominal driving environment and provide certain performance measures as outputs. To the best of the authors' knowledge, the vehicle safety performance metrics research dates back to at least 1967. To date, there still does not exist a community-wide accepted metric or a set of metr… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: A modified manuscript of this preprint has been accepted to be published as a regular paper at IEEE Transactions on Intelligent Transportation Systems

  40. arXiv:2306.12360  [pdf, other

    q-bio.BM cs.LG

    Protein Discovery with Discrete Walk-Jump Sampling

    Authors: Nathan C. Frey, Daniel Berenberg, Karina Zadorozhny, Joseph Kleinhenz, Julien Lafrance-Vanasse, Isidro Hotzel, Yan Wu, Stephen Ra, Richard Bonneau, Kyunghyun Cho, Andreas Loukas, Vladimir Gligorijevic, Saeed Saremi

    Abstract: We resolve difficulties in training and sampling from a discrete generative model by learning a smoothed energy function, sampling from the smoothed data manifold with Langevin Markov chain Monte Carlo (MCMC), and projecting back to the true data manifold with one-step denoising. Our Discrete Walk-Jump Sampling formalism combines the contrastive divergence training of an energy-based model and imp… ▽ More

    Submitted 15 March, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: ICLR 2024 oral presentation, top 1.2% of submissions; {ICLR 2023 Physics for Machine Learning, NeurIPS 2023 GenBio, MLCB 2023} Spotlight

  41. 3HAN: A Deep Neural Network for Fake News Detection

    Authors: Sneha Singhania, Nigel Fernandez, Shrisha Rao

    Abstract: The rapid spread of fake news is a serious problem calling for AI solutions. We employ a deep learning based automated detector through a three level hierarchical attention network (3HAN) for fast, accurate detection of fake news. 3HAN has three levels, one each for words, sentences, and the headline, and constructs a news vector: an effective representation of an input news article, by processing… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at ICONIP 2017

  42. arXiv:2306.10701  [pdf

    physics.optics cs.ET

    Metasurface-based Spectral Convolutional Neural Network for Matter Meta-imaging

    Authors: Kaiyu Cui, Shijie Rao, Sheng Xu, Yidong Huang, Jiawei Yang, Jian Xiong, Chenxuan Wang, Xue Feng, Fang Liu, Wei Zhang, Yali Li, Sheng** Wang

    Abstract: Convolutional neural networks (CNNs) are representative models of artificial neural networks (ANNs), that form the backbone of modern computer vision. However, the considerable power consumption and limited computing speed of electrical computing platforms restrict further development of CNNs. Optical neural networks are considered the next-generation physical implementations of ANNs to break the… ▽ More

    Submitted 27 June, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

  43. arXiv:2306.07473  [pdf, other

    cs.LG q-bio.QM

    3D molecule generation by denoising voxel grids

    Authors: Pedro O. Pinheiro, Joshua Rackers, Joseph Kleinhenz, Michael Maser, Omar Mahmood, Andrew Martin Watkins, Stephen Ra, Vishnu Sresht, Saeed Saremi

    Abstract: We propose a new score-based approach to generate 3D molecules represented as atomic densities on regular grids. First, we train a denoising neural network that learns to map from a smooth distribution of noisy molecules to the distribution of real molecules. Then, we follow the neural empirical Bayes framework (Saremi and Hyvarinen, 19) and generate molecules in two steps: (i) sample noisy densit… ▽ More

    Submitted 8 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  44. arXiv:2306.00344  [pdf, other

    cs.LG stat.ML

    BOtied: Multi-objective Bayesian optimization with tied multivariate ranks

    Authors: Ji Won Park, Nataša Tagasovska, Michael Maser, Stephen Ra, Kyunghyun Cho

    Abstract: Many scientific and industrial applications require the joint optimization of multiple, potentially competing objectives. Multi-objective Bayesian optimization (MOBO) is a sample-efficient framework for identifying Pareto-optimal solutions. At the heart of MOBO is the acquisition function, which determines the next candidate to evaluate by navigating the best compromises among the objectives. In t… ▽ More

    Submitted 7 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 12 pages (+9 appendix), 13 figures. Accepted at ICML 2024

  45. arXiv:2305.12815  [pdf, other

    cs.CL

    Investigating Agency of LLMs in Human-AI Collaboration Tasks

    Authors: Ashish Sharma, Sudha Rao, Chris Brockett, Akanksha Malhotra, Nebojsa Jojic, Bill Dolan

    Abstract: Agency, the capacity to proactively shape events, is central to how humans interact and collaborate. While LLMs are being developed to simulate human behavior and serve as human-like agents, little attention has been given to the Agency that these models should possess in order to proactively manage the direction of interaction and collaboration. In this paper, we investigate Agency as a desirable… ▽ More

    Submitted 7 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: EACL 2024

  46. arXiv:2305.05739  [pdf, ps, other

    cs.LO cs.AI

    Graph-Based Reductions for Parametric and Weighted MDPs

    Authors: Kasper Engelen, Guillermo A. Pérez, Shrisha Rao

    Abstract: We study the complexity of reductions for weighted reachability in parametric Markov decision processes. That is, we say a state p is never worse than q if for all valuations of the polynomial indeterminates it is the case that the maximal expected weight that can be reached from p is greater than the same value from q. In terms of computational complexity, we establish that determining whether p… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  47. arXiv:2304.02048  [pdf

    cond-mat.mtrl-sci cs.LG

    Deep Learning for Automated Experimentation in Scanning Transmission Electron Microscopy

    Authors: Sergei V. Kalinin, Debangshu Mukherjee, Kevin M. Roccapriore, Ben Blaiszik, Ayana Ghosh, Maxim A. Ziatdinov, A. Al-Najjar, Christina Doty, Sarah Akers, Nageswara S. Rao, Joshua C. Agar, Steven R. Spurgeon

    Abstract: Machine learning (ML) has become critical for post-acquisition data analysis in (scanning) transmission electron microscopy, (S)TEM, imaging and spectroscopy. An emerging trend is the transition to real-time analysis and closed-loop microscope operation. The effective use of ML in electron microscopy now requires the development of strategies for microscopy-centered experiment workflow design and… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Review Article

  48. arXiv:2304.01510  [pdf, other

    cs.MA cs.CR cs.DC eess.SY

    A Communication-efficient Local Differentially Private Algorithm in Federated Optimization

    Authors: Syed Eqbal Alam, Dhirendra Shukla, Shrisha Rao

    Abstract: Federated optimization, wherein several agents in a network collaborate with a central server to achieve optimal social cost over the network with no requirement for exchanging information among agents, has attracted significant interest from the research community. In this context, agents demand resources based on their local computation. Due to the exchange of optimization parameters such as sta… ▽ More

    Submitted 19 October, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    ACM Class: I.2.11

    Journal ref: IEEE Access, vol. 11, pp. 58254-58268, 2023

  49. arXiv:2303.14334  [pdf, other

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney , et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan… ▽ More

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  50. arXiv:2303.11932  [pdf, other

    cs.CV cs.AI cs.LG

    Using Explanations to Guide Models

    Authors: Sukrut Rao, Moritz Böhle, Amin Parchami-Araghi, Bernt Schiele

    Abstract: Deep neural networks are highly performant, but might base their decision on spurious or background features that co-occur with certain classes, which can hurt generalization. To mitigate this issue, the usage of 'model guidance' has gained popularity recently: for this, models are guided to be "right for the right reasons" by regularizing the models' explanations to highlight the right features.… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 38 pages, 35 figures, 4 tables