Showing 1–24 of 24 results for author: Swamy, V

  1. arXiv:2405.20079  [pdf, other]

    cs.CL cs.CY cs.LG

    Student Answer Forecasting: Transformer-Driven Answer Choice Prediction for Language Learning

    Authors: Elena Grazia Gado, Tommaso Martorella, Luca Zunino, Paola Mejia-Domenzain, Vinitra Swamy, Jibril Frej, Tanja Käser

    Abstract: Intelligent Tutoring Systems (ITS) enhance personalized learning by predicting student answers to provide immediate and customized instruction. However, recent research has primarily focused on the correctness of the answer rather than the student's performance on specific answer choices, limiting insights into students' thought processes and potential misconceptions. To address this gap, we prese…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted as a poster paper at EDM 2024: 17th International Conference on Educational Data Mining in Atlanta, USA

  2. arXiv:2402.02933  [pdf, other]

    cs.LG cs.CY cs.HC

    InterpretCC: Intrinsic User-Centric Interpretability through Global Mixture of Experts

    Authors: Vinitra Swamy, Syrielle Montariol, Julian Blackwell, Jibril Frej, Martin Jaggi, Tanja Käser

    Abstract: Interpretability for neural networks is a trade-off between three key requirements: 1) faithfulness of the explanation (i.e., how perfectly it explains the prediction), 2) understandability of the explanation by humans, and 3) model performance. Most existing methods compromise one or more of these requirements; e.g., post-hoc approaches provide limited faithfulness, automatically identified featu…

    Submitted 29 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  3. arXiv:2311.16079  [pdf, other]

    cs.CL cs.AI cs.LG

    MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

    Authors: Zeming Chen, Alejandro Hernández Cano, Angelika Romanou, Antoine Bonnet, Kyle Matoba, Francesco Salvi, Matteo Pagliardini, Simin Fan, Andreas Köpf, Amirkeivan Mohtashami, Alexandre Sallinen, Alireza Sakhaeirad, Vinitra Swamy, Igor Krawczuk, Deniz Bayazit, Axel Marmet, Syrielle Montariol, Mary-Anne Hartley, Martin Jaggi, Antoine Bosselut

    Abstract: Large language models (LLMs) can potentially democratize access to medical knowledge. While many efforts have been made to harness and improve LLMs' medical knowledge and reasoning capacities, the resulting models are either closed-source (e.g., PaLM, GPT-4) or limited in scale (<= 13B parameters), which restricts their abilities. In this work, we improve access to large-scale medical LLMs by rele…

    Submitted 27 November, 2023; originally announced November 2023.

  4. arXiv:2311.03311  [pdf, other]

    cs.CL cs.CY

    Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance

    Authors: Thiemo Wambsganss, Xiaotian Su, Vinitra Swamy, Seyed Parsa Neshaei, Roman Rietsche, Tanja Käser

    Abstract: Large Language Models (LLMs) are increasingly utilized in educational tasks such as providing writing suggestions to students. Despite their potential, LLMs are known to harbor inherent biases which may negatively impact learners. Previous studies have investigated bias in models and data representations separately, neglecting the potential impact of LLM bias on human writing. In this paper, we in…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted as a full paper at EMNLP Findings 2023

  5. arXiv:2309.14118  [pdf, other]

    cs.LG

    MultiModN- Multimodal, Multi-Task, Interpretable Modular Networks

    Authors: Vinitra Swamy, Malika Satayeva, Jibril Frej, Thierry Bossy, Thijs Vogels, Martin Jaggi, Tanja Käser, Mary-Anne Hartley

    Abstract: Predicting multiple real-world tasks in a single model often requires a particularly diverse feature space. Multimodal (MM) models aim to extract the synergistic predictive potential of multiple data types to create a shared feature space with aligned semantic meaning across inputs of drastically varying sizes (i.e. images, text, sound). Most current MM architectures fuse these representations in…

    Submitted 6 November, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted as a full paper at NeurIPS 2023 in New Orleans, USA

  6. arXiv:2307.00364  [pdf, other]

    cs.LG cs.AI cs.CY cs.HC

    The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

    Authors: Vinitra Swamy, Jibril Frej, Tanja Käser

    Abstract: Explainable Artificial Intelligence (XAI) plays a crucial role in enabling human understanding and trust in deep learning systems. As models get larger, more ubiquitous, and pervasive in aspects of daily life, explainability is necessary to minimize adverse effects of model mistakes. Unfortunately, current approaches in human-centric XAI (e.g. predictive tasks in healthcare, education, or personal…

    Submitted 28 May, 2024; v1 submitted 1 July, 2023; originally announced July 2023.

    Comments: Viewpoint paper, under review at JAIR

  7. arXiv:2212.08955  [pdf, other]

    cs.CY cs.HC cs.LG

    Trusting the Explainers: Teacher Validation of Explainable Artificial Intelligence for Course Design

    Authors: Vinitra Swamy, Sijia Du, Mirko Marras, Tanja Käser

    Abstract: Deep learning models for learning analytics have become increasingly popular over the last few years; however, these approaches are still not widely adopted in real-world settings, likely due to a lack of trust and transparency. In this paper, we tackle this issue by implementing explainable AI methods for black-box neural networks. This work focuses on the context of online and blended learning a…

    Submitted 6 March, 2023; v1 submitted 17 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper (Best Paper nominee) at LAK 2023: The 13th International Learning Analytics and Knowledge Conference, March 13-17, 2023, Arlington, Texas, USA

  8. arXiv:2212.01133  [pdf, other]

    cs.LG cs.CY

    RIPPLE: Concept-Based Interpretation for Raw Time Series Models in Education

    Authors: Mohammad Asadi, Vinitra Swamy, Jibril Frej, Julien Vignoud, Mirko Marras, Tanja Käser

    Abstract: Time series is the most prevalent form of input data for educational prediction tasks. The vast majority of research using time series data focuses on hand-crafted features, designed by experts for predictive performance and interpretability. However, extracting these features is labor-intensive for humans and computers. In this paper, we propose an approach that utilizes irregular multivariate ti…

    Submitted 28 February, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted as a full paper at AAAI 2023: 37th AAAI Conference on Artificial Intelligence (EAAI: AI for Education Special Track), 7-14 of February 2023, Washington DC, USA

  9. arXiv:2209.10335  [pdf, other]

    cs.CL cs.CY

    Bias at a Second Glance: A Deep Dive into Bias for German Educational Peer-Review Data Modeling

    Authors: Thiemo Wambsganss, Vinitra Swamy, Roman Rietsche, Tanja Käser

    Abstract: Natural Language Processing (NLP) has become increasingly utilized to provide adaptivity in educational applications. However, recent research has highlighted a variety of biases in pre-trained language models. While existing studies investigate bias in different domains, they are limited in addressing fine-grained analysis on educational and multilingual corpora. In this work, we analyze bias acr…

    Submitted 22 September, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Comments: Accepted as a full paper at COLING 2022: The 29th International Conference on Computational Linguistics, 12-17 of October 2022, Gyeongju, Republic of Korea

  10. arXiv:2207.00551  [pdf, other]

    cs.LG cs.CY

    Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs

    Authors: Vinitra Swamy, Bahar Radmehr, Natasa Krco, Mirko Marras, Tanja Käser

    Abstract: Neural networks are ubiquitous in applied machine learning for education. Their pervasive success in predictive performance comes alongside a severe weakness, the lack of explainability of their decisions, especially relevant in human-centric fields. We implement five state-of-the-art methodologies for explaining black-box machine learning models (LIME, PermutationSHAP, KernelSHAP, DiCE, CEM) and…

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted as a full paper at EDM 2022: The 15th International Conference on Educational Data Mining, 24-27 of July 2022, Durham

  11. arXiv:2205.01064  [pdf, other]

    cs.CY cs.LG

    Meta Transfer Learning for Early Success Prediction in MOOCs

    Authors: Vinitra Swamy, Mirko Marras, Tanja Käser

    Abstract: Despite the increasing popularity of massive open online courses (MOOCs), many suffer from high dropout and low success rates. Early prediction of student success for targeted intervention is therefore essential to ensure no student is left behind in a course. There exists a large body of research in success prediction for MOOCs, focusing mainly on training models from scratch for individual cours…

    Submitted 25 April, 2022; originally announced May 2022.

    Comments: Accepted at the 2022 ACM Conference on Learning at Scale (L@S 2022)

  12. arXiv:2111.08546  [pdf, other]

    cs.LG cs.CL

    Interpreting Language Models Through Knowledge Graph Extraction

    Authors: Vinitra Swamy, Angelika Romanou, Martin Jaggi

    Abstract: Transformer-based language models trained on large text corpora have enjoyed immense popularity in the natural language processing community and are commonly used as a starting point for downstream tasks. While these models are undeniably useful, it is a challenge to quantify their performance beyond traditional accuracy metrics. In this paper, we compare BERT-based language models through snapsho…

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: Published at NeurIPS 2021: eXplainable AI for Debugging and Diagnosis Workshop

  13. arXiv:2110.07525  [pdf, other]

    cs.IT

    Connection Management xAPP for O-RAN RIC: A Graph Neural Network and Reinforcement Learning Approach

    Authors: Oner Orhan, Vasuki Narasimha Swamy, Thomas Tetzlaff, Marcel Nassar, Hosein Nikopour, Shilpa Talwar

    Abstract: Connection management is an important problem for any wireless network to ensure smooth and well-balanced operation throughout. Traditional methods for connection management (specifically user-cell association) consider sub-optimal and greedy solutions such as connection of each user to a cell with maximum receive power. However, network performance can be improved by leveraging machine learning (…

    Submitted 20 October, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: paper accepted to the IEEE International Conference on Machine Learning and Applications (ICMLA 2021)

  14. arXiv:1806.08777  [pdf, other]

    cs.IT

    Wireless Channel Dynamics and Robustness for Ultra-Reliable Low-Latency Communications

    Authors: Vasuki Narasimha Swamy, Paul Rigge, Gireeja Ranade, Borivoje Nikolic, Anant Sahai

    Abstract: Interactive, immersive and critical applications demand ultra-reliable low-latency communication (URLLC). To build wireless communication systems that can support these applications, understanding the characteristics of the wireless medium is paramount. Although wireless channel characteristics and dynamics have been extensively studied, it is important to revisit these concepts in the context of…

    Submitted 22 June, 2018; originally announced June 2018.

    Comments: Submitted to IEEE JSAC Special Issue on Ultra-Reliable Low-Latency Communications in Wireless Networks

  15. arXiv:1803.05143  [pdf, other]

    cs.IT

    Network Coding for Real-time Wireless Communication for Automation

    Authors: Vasuki Narasimha Swamy, Paul Rigge, Gireeja Ranade, Anant Sahai, Borivoje Nikolic

    Abstract: Real-time applications require latencies on the order of a millisecond with very high reliabilities, paralleling the requirements for high-performance industrial control. Current wireless technologies like WiFi, Bluetooth, LTE, etc. are unable to meet these stringent latency and reliability requirements, forcing the use of wired systems. This paper introduces a wireless communication protocol base…

    Submitted 14 March, 2018; originally announced March 2018.

    Comments: A preliminary version of this work appeared at IEEE WCNC 2016

  16. arXiv:1701.01894  [pdf, other]

    eess.SY

    Modeling Actuation Constraints for IoT Applications

    Authors: Bharathan Balaji, Brad Campbell, Amit Levy, Xiaozhou Li, Addison Mayberry, Nirupam Roy, Vasuki Narasimha Swamy, Longqi Yang, Victor Bahl, Ranveer Chandra, Ratul Mahajan

    Abstract: Internet of Things (IoT) promises to bring ease of monitoring, better efficiency and innovative services across many domains with connected devices around us. With information from critical parts of infrastructure and powerful cloud-based data analytics, many applications can be developed to gain insights about IoT systems as well as transform their capabilities. Actuation applications form an ess…

    Submitted 7 January, 2017; originally announced January 2017.

    Comments: Microsoft Research Student Summit - Internet of Things Working Group

  17. arXiv:1609.02968  [pdf, other]

    cs.IT eess.SY

    Real-time Cooperative Communication for Automation over Wireless

    Authors: Vasuki Narasimha Swamy, Sahaana Suri, Paul Rigge, Matthew Weiner, Gireeja Ranade, Anant Sahai, Borivoje Nikolic

    Abstract: High-performance industrial automation systems rely on tens of simultaneously active sensors and actuators and have stringent communication latency and reliability requirements. Current wireless technologies like WiFi, Bluetooth, and LTE are unable to meet these requirements, forcing the use of wired communication in industrial control systems. This paper introduces a wireless communication protoc…

    Submitted 23 January, 2017; v1 submitted 9 September, 2016; originally announced September 2016.

    Comments: A preliminary version of this work appeared at IEEE International Conference on Communications 2015

  18. arXiv:1505.05711  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Resistance minimum and electrical conduction mechanism in polycrystalline CoFeB thin films

    Authors: G. Venkat Swamy, P. K. Rout, Manju Singh, R. K. Rakshit

    Abstract: The temperature dependent resistance $R$($T$) of polycrystalline ferromagnetic CoFeB thin films of varying thickness are analyzed considering various electrical scattering processes. We observe a resistance minimum in $R$($T$) curves below $\simeq$ 29 K, which can be explained as an effect of intergranular Coulomb interaction in a granular system. The structural and Coulomb interaction related sca…

    Submitted 26 October, 2015; v1 submitted 21 May, 2015; originally announced May 2015.

    Journal ref: J. Phys. D Appl. Phys. 48, 475002 (2015)

  19. Low-Complexity Interactive Algorithms for Synchronization from Deletions, Insertions, and Substitutions

    Authors: Ramji Venkataramanan, Vasuki Narasimha Swamy, Kannan Ramchandran

    Abstract: Consider two remote nodes having binary sequences $X$ and $Y$, respectively. $Y$ is an edited version of ${X}$, where the editing involves random deletions, insertions, and substitutions, possibly in bursts. The goal is for the node with $Y$ to reconstruct $X$ with minimal exchange of information over a noiseless link. The communication is measured in terms of both the total number of bits exchang…

    Submitted 12 September, 2015; v1 submitted 8 October, 2013; originally announced October 2013.

    Journal ref: IEEE Transactions on Information Theory, vol. 61, no. 10, pp. 5670-5689, October 2015

  20. arXiv:1305.7335  [pdf, ps, other]

    cond-mat.mtrl-sci

    Effect of Thermal Annealing on Boron Diffusion, Micro-structural, Electrical and Magnetic properties of Laser Ablated CoFeB Thin Films

    Authors: G. Venkat Swamy, Himanshu Pandey, A. K. Srivastava, M. K. Dalai, K. K. Maurya, Rashmi, R. K. Rakshit

    Abstract: We report on Boron diffusion and subsequent crystallization of Co$_{40}$Fe$_{40}$B$_{20}$ (CoFeB) thin films on SiO$_2$/Si(001) substrate using pulsed laser deposition. Secondary ion mass spectroscopy reveals Boron diffusion at the interface in both amorphous and crystalline phase of CoFeB. High-resolution transmission electron microscopy reveals a small fraction of nano-crystallites embedded in t…

    Submitted 31 May, 2013; originally announced May 2013.

    Comments: 16 pages, 6 figures

    Journal ref: AIP Advances 3, 072129 (2013)

  21. arXiv:1210.3187  [pdf, ps, other]

    cs.IT cs.NI

    An asymptotically optimal push-pull method for multicasting over a random network

    Authors: Vasuki Narasimha Swamy, Srikrishna Bhashyam, Rajesh Sundaresan, Pramod Viswanath

    Abstract: We consider allcast and multicast flow problems where either all of the nodes or only a subset of the nodes may be in session. Traffic from each node in the session has to be sent to every other node in the session. If the session does not consist of all the nodes, the remaining nodes act as relays. The nodes are connected by undirected links whose capacities are independent and identically distri…

    Submitted 8 February, 2013; v1 submitted 11 October, 2012; originally announced October 2012.

    Comments: 13 pages, extended version of paper presented at the IEEE International Symposium on Information Theory (ISIT) 2012, minor revision to text to address review comments, to appear in IEEE Transactions on Information Theory

  22. Image Compression and Watermarking scheme using Scalar Quantization

    Authors: Kilari Veera Swamy, B. Chandra Mohan, Y. V. Bhaskar Reddy, S. Srinivas Kumar

    Abstract: This paper presents a new compression technique and image watermarking algorithm based on Contourlet Transform (CT). For image compression, an energy based quantization is used. Scalar quantization is explored for image watermarking. Double filter bank structure is used in CT. The Laplacian Pyramid (LP) is used to capture the point discontinuities, and then followed by a Directional Filter Bank (D…

    Submitted 29 March, 2010; originally announced March 2010.

    Comments: 11 Pages, IJNGN Journal 2010

    Journal ref: International Journal of Next-Generation Networks 2.1 (2010) 37-47

  23. arXiv:0907.1464  [pdf]

    cond-mat.mtrl-sci cond-mat.other

    Cotunnite-structured titanium dioxide: the hardest known oxide

    Authors: L. S. Dubrovinsky, N. A. Dubrovinskaia, V. Swamy, J. Muscat, N. M. Harrison, R. Ahuja, B. Holm

    Abstract: Despite great technological importance and many investigations, a material with measured hardness comparable to that of diamond or cubic boron nitride has yet to be identified. Combined theoretical and experimental investigations led to the discovery of a new polymorph of titanium dioxide with titanium nine-coordinated to oxygen in the cotunnite (PbCl2) structure. Hardness measurements on the co…

    Submitted 9 July, 2009; originally announced July 2009.

    Comments: This is the full version of the paper published as a Brief Communication in Nature, 410, 653-654

  24. Detection of Computer Generated Gravitational Waves in Numerical Cosmologies

    Authors: B. K. Berger, D. Garfinkle, V. Swamy

    Abstract: We propose to study the behavior of complicated numerical solutions to Einstein's equations for generic cosmologies by following the geodesic motion of a swarm of test particles. As an example, we consider a cylinder of test particles initially at rest in the plane symmetric Gowdy universe on $T^3 \times R$. For a circle of test particles in the symmetry plane, the geodesic equations predict evo…

    Submitted 27 May, 1994; originally announced May 1994.

    Comments: 15 pages Plain TeX, 9 pages of figures available on request by FAX or mail

    Journal ref: Gen.Rel.Grav. 27 (1995) 511-527