Showing 1–12 of 12 results for author: Mohebbi, H

  1. arXiv:2310.09925  [pdf, other]

    cs.CL cs.AI cs.LG

    Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers

    Authors: Hosein Mohebbi, Grzegorz Chrupała, Willem Zuidema, Afra Alishahi

    Abstract: Transformers have become a key architecture in speech processing, but our understanding of how they build up representations of acoustic and linguistic structure is limited. In this study, we address this gap by investigating how measures of 'context-mixing' developed for text models can be adapted and applied to models of spoken language. We identify a linguistic phenomenon that is ideal for such…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 (main)

  2. arXiv:2310.03686  [pdf, other]

    cs.CL

    DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers

    Authors: Anna Langedijk, Hosein Mohebbi, Gabriele Sarti, Willem Zuidema, Jaap Jumelet

    Abstract: In recent years, many interpretability methods have been proposed to help interpret the internal states of Transformer-models, at different levels of precision and complexity. Here, to analyze encoder-decoder Transformers, we propose a simple, new method: DecoderLens. Inspired by the LogitLens (for decoder-only Transformers), this method involves allowing the decoder to cross-attend representation…

    Submitted 3 April, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted to Findings of NAACL 2024

  3. arXiv:2301.12971  [pdf, other]

    cs.CL cs.LG

    Quantifying Context Mixing in Transformers

    Authors: Hosein Mohebbi, Willem Zuidema, Grzegorz Chrupała, Afra Alishahi

    Abstract: Self-attention weights and their transformed variants have been the main source of information for analyzing token-to-token interactions in Transformer-based models. But despite their ease of interpretation, these weights are not faithful to the models' decisions as they are only one part of an encoder, and other components in the encoder layer can have considerable impact on information mixing in…

    Submitted 8 February, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Accepted to EACL 2023 (main)

  4. arXiv:2203.08991  [pdf, other]

    cs.CL

    AdapLeR: Speeding up Inference by Adaptive Length Reduction

    Authors: Ali Modarressi, Hosein Mohebbi, Mohammad Taher Pilehvar

    Abstract: Pre-trained language models have shown stellar performance in various downstream tasks. But, this usually comes at the cost of high latency and computation, hindering their usage in resource-limited settings. In this work, we propose a novel approach for reducing the computational cost of BERT with minimal loss in downstream performance. Our method dynamically eliminates less contributing tokens t…

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022 (main conference)

  5. arXiv:2203.07445  [pdf, other]

    quant-ph cond-mat.supr-con

    Fluctuation Spectroscopy of Two-Level Systems in Superconducting Resonators

    Authors: J. H. Béjanin, Y. Ayadi, X. Xu, C. Zhu, H. R. Mohebbi, M. Mariantoni

    Abstract: Superconducting quantum computing is experiencing a tremendous growth. Although major milestones have already been achieved, useful quantum-computing applications are hindered by a variety of decoherence phenomena. Decoherence due to two-level systems (TLSs) hosted by amorphous dielectric materials is ubiquitous in planar superconducting devices. We use high-quality quasilumped element resonators…

    Submitted 14 March, 2022; originally announced March 2022.

    Comments: 20 two-column pages (including App. and Supplement), 12 figures, 3 tables

    Journal ref: Phys. Rev. Applied 18, 034009 (2022)

  6. arXiv:2109.05958  [pdf, other]

    cs.CL cs.AI

    Not All Models Localize Linguistic Knowledge in the Same Place: A Layer-wise Probing on BERToids' Representations

    Authors: Mohsen Fayyaz, Ehsan Aghazadeh, Ali Modarressi, Hosein Mohebbi, Mohammad Taher Pilehvar

    Abstract: Most of the recent works on probing representations have focused on BERT, with the presumption that the findings might be similar to the other models. In this work, we extend the probing studies to two other models in the family, namely ELECTRA and XLNet, showing that variations in the pre-training objectives or architectural choices can result in different behaviors in encoding linguistic informa…

    Submitted 15 September, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted to BlackboxNLP Workshop at EMNLP 2021

  7. arXiv:2104.01477  [pdf, other]

    cs.CL

    Exploring the Role of BERT Token Representations to Explain Sentence Probing Results

    Authors: Hosein Mohebbi, Ali Modarressi, Mohammad Taher Pilehvar

    Abstract: Several studies have been carried out on revealing linguistic features captured by BERT. This is usually achieved by training a diagnostic classifier on the representations obtained from different layers of BERT. The subsequent classification accuracy is then interpreted as the ability of the model in encoding the corresponding linguistic property. Despite providing insights, these studies have le…

    Submitted 11 September, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021 (main conference)

  8. arXiv:1910.01165  [pdf]

    stat.AP cs.CY

    Indicators of retention in remote digital health studies: A cross-study evaluation of 100,000 participants

    Authors: Abhishek Pratap, Elias Chaibub Neto, Phil Snyder, Carl Stepnowsky, Noémie Elhadad, Daniel Grant, Matthew H. Mohebbi, Sean Mooney, Christine Suver, John Wilbanks, Lara Mangravite, Patrick Heagerty, Pat Arean, Larsson Omberg

    Abstract: Digital technologies such as smartphones are transforming the way scientists conduct biomedical research using real-world data. Several remotely-conducted studies have recruited thousands of participants over a span of a few months. Unfortunately, these studies are hampered by substantial participant attrition, calling into question the representativeness of the collected data including generaliza…

    Submitted 2 October, 2019; originally announced October 2019.

  9. arXiv:1812.03227  [pdf, other]

    cond-mat.supr-con

    Magnetic hysteresis of a superconducting microstrip resonator with a high edge barrier

    Authors: Sangil Kwon, Yong-Chao Tang, Hamid R. Mohebbi, David G. Cory, Guo-Xing Miao

    Abstract: We investigate the magnetic hysteresis of a superconducting microstrip resonator with a high edge barrier. We measure the magnetic hysteresis while either sweeping a magnetic field or tuning the edge barrier by high microwave current. We show that the magnetic hysteresis of such a device is qualitatively different from that of one without an edge barrier and can be understood based on the generali…

    Submitted 7 December, 2018; originally announced December 2018.

  10. arXiv:1811.09170  [pdf, other]

    cond-mat.supr-con quant-ph

    Engineering Nonlinear Response of Superconducting Niobium Microstrip Resonators via Aluminum Cladding

    Authors: Sangil Kwon, Yong-Chao Tang, Hamid R. Mohebbi, Olaf W. B. Benningshof, David G. Cory, Guo-Xing Miao

    Abstract: In this work, we find that Al cladding on Nb microstrip resonators is an efficient way to suppress nonlinear responses induced by local Joule heating, resulting in improved microwave power handling capability. This improvement is likely due to the proximity effect between the Al and the Nb layers. The proximity effect is found to be controllable by tuning the thickness of the Al layer. We show tha…

    Submitted 15 October, 2019; v1 submitted 22 November, 2018; originally announced November 2018.

    Journal ref: J. Appl. Phys. 126, 173906 (2019)

  11. arXiv:1802.05183  [pdf, other]

    cond-mat.supr-con quant-ph

    Magnetic Field Dependent Microwave Losses in Superconducting Niobium Microstrip Resonators

    Authors: Sangil Kwon, Anita Fadavi Roudsari, Olaf W. B. Benningshof, Yong-Chao Tang, Hamid R. Mohebbi, Ivar A. J. Taminiau, Deler Langenberg, Shinyoung Lee, George Nichols, David G. Cory, Guo-Xing Miao

    Abstract: We describe an experimental protocol to characterize magnetic field dependent microwave losses in superconducting niobium microstrip resonators. Our approach provides a unified view that covers two well-known magnetic field dependent loss mechanisms: quasiparticle generation and vortex motion. We find that quasiparticle generation is the dominant loss mechanism for parallel magnetic fields. For pe…

    Submitted 26 June, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Journal ref: Journal of Applied Physics 124, 033903 (2018)

  12. A Learnable Despeckling Framework for Optical Coherence Tomography Images

    Authors: Saba Adabi, Elaheh Rashedi, Hamed Mohebbi, Xue-wen Chen, Silvia Conforto, Mohammad. R. Nasiriavanaki

    Abstract: Optical coherence tomography (OCT) is a prevalent, interferometric, high-resolution imaging method with broad biomedical applications. Nonetheless, OCT images suffer from an artifact, called speckle which degrades the image quality. Digital filters offer an opportunity for image improvement in clinical OCT devices where hardware modification to enhance images is expensive. To reduce speckle, a wid…

    Submitted 4 November, 2017; originally announced November 2017.

    Comments: under review

    Journal ref: Journal of Biomedical Optics (2018)