Skip to main content

Showing 1–8 of 8 results for author: Sheshadri, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14775  [pdf, other

    physics.ao-ph cs.LG physics.flu-dyn physics.geo-ph

    Machine Learning Global Simulation of Nonlocal Gravity Wave Propagation

    Authors: Aman Gupta, Aditi Sheshadri, Sujit Roy, Vishal Gaur, Manil Maskey, Rahul Ramachandran

    Abstract: Global climate models typically operate at a grid resolution of hundreds of kilometers and fail to resolve atmospheric mesoscale processes, e.g., clouds, precipitation, and gravity waves (GWs). Model representation of these processes and their sources is essential to the global circulation and planetary energy budget, but subgrid scale contributions from these processes are often only approximatel… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures, no tables

  2. arXiv:2402.11917  [pdf, other

    cs.LG

    A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

    Authors: Jannik Brinkmann, Abhay Sheshadri, Victor Levoso, Paul Swoboda, Christian Bartelt

    Abstract: Transformers demonstrate impressive performance on a range of reasoning benchmarks. To evaluate the degree to which these abilities are a result of actual reasoning, existing work has focused on develo** sophisticated benchmarks for behavioral studies. However, these studies do not provide insights into the internal mechanisms driving the observed capabilities. To improve our understanding of th… ▽ More

    Submitted 29 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2310.18417  [pdf, other

    cs.CL

    Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning

    Authors: Aditi Chaudhary, Arun Sampath, Ashwin Sheshadri, Antonios Anastasopoulos, Graham Neubig

    Abstract: One of the challenges in language teaching is how best to organize rules regarding syntax, semantics, or phonology in a meaningful manner. This not only requires content creators to have pedagogical skills, but also have that language's deep understanding. While comprehensive materials to develop such curricula are available in English and some broadly spoken languages, for many other languages, t… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP Findings 2023. arXiv admin note: substantial text overlap with arXiv:2206.05154

  4. arXiv:2305.14956  [pdf, other

    cs.CL

    Editing Common Sense in Transformers

    Authors: Anshita Gupta, Debanjan Mondal, Akshay Krishna Sheshadri, Wenlong Zhao, Xiang Lorraine Li, Sarah Wiegreffe, Niket Tandon

    Abstract: Editing model parameters directly in Transformers makes updating open-source transformer-based models possible without re-training (Meng et al., 2023). However, these editing methods have only been evaluated on statements about encyclopedic knowledge with a single correct answer. Commonsense knowledge with multiple correct answers, e.g., an apple can be green or red but not transparent, has not be… ▽ More

    Submitted 26 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP 2023 Main Conference. Anshita, Debanjan, Akshay are co-first authors. Code and datasets for all experiments are available at https://github.com/anshitag/memit_csk

  5. arXiv:2206.05154  [pdf, other

    cs.CL

    Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning

    Authors: Aditi Chaudhary, Arun Sampath, Ashwin Sheshadri, Antonios Anastasopoulos, Graham Neubig

    Abstract: One of the challenges of language teaching is how to organize the rules regarding syntax, semantics, or phonology of the language in a meaningful manner. This not only requires pedagogical skills, but also requires a deep understanding of that language. While comprehensive materials to develop such curricula are available in English and some broadly spoken languages, for many other languages, teac… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 18 pages

  6. arXiv:2101.05478  [pdf, other

    cs.CL cs.SD eess.AS

    WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm

    Authors: Akshay Krishna Sheshadri, Anvesh Rao Vij**i, Sukhdeep Kharbanda

    Abstract: Automatic Speech Recognition (ASR) systems are evaluated using Word Error Rate (WER), which is calculated by comparing the number of errors between the ground truth and the transcription of the ASR system. This calculation, however, requires manual transcription of the speech signal to obtain the ground truth. Since transcribing audio signals is a costly process, Automatic WER Evaluation (e-WER) m… ▽ More

    Submitted 13 February, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: Accepted Long Paper at EACL 2021

  7. arXiv:1904.07331  [pdf, other

    cs.CY

    Predicting Student Performance Based on Online Study Habits: A Study of Blended Courses

    Authors: Adithya Sheshadri, Niki Gitinabard, Collin F. Lynch, Tiffany Barnes, Sarah Heckman

    Abstract: Online tools provide unique access to research students' study habits and problem-solving behavior. In MOOCs, this online data can be used to inform instructors and to provide automatic guidance to students. However, these techniques may not apply in blended courses with face to face and online components. We report on a study of integrated user-system interaction logs from 3 computer science cour… ▽ More

    Submitted 15 April, 2019; originally announced April 2019.

    Comments: Published in the International Conference on Educational Data Mining (EDM 2018)

  8. arXiv:1806.00755  [pdf, other

    cs.IR

    Mix and Match: Collaborative Expert-Crowd Judging for Building Test Collections Accurately and Affordably

    Authors: Mucahid Kutlu, Tyler McDonnell, Aashish Sheshadri, Tamer Elsayed, Matthew Lease

    Abstract: Crowdsourcing offers an affordable and scalable means to collect relevance judgments for IR test collections. However, crowd assessors may show higher variance in judgment quality than trusted assessors. In this paper, we investigate how to effectively utilize both groups of assessors in partnership. We specifically investigate how agreement in judging is correlated with three factors: relevance c… ▽ More

    Submitted 9 June, 2018; v1 submitted 3 June, 2018; originally announced June 2018.