Showing 1–21 of 21 results for author: Amir, S

  1. arXiv:2406.14511

    cs.CL

    Investigating Mysteries of CoT-Augmented Distillation

    Authors: Somin Wadhwa, Silvio Amir, Byron C. Wallace

    Abstract: Eliciting "chain of thought" (CoT) rationales -- sequences of tokens that convey a "reasoning" process -- has been shown to consistently improve LLM performance on tasks like question answering. More recent efforts have shown that such rationales can also be used for model distillation: Including CoT sequences (elicited from a large "teacher" model) in addition to target labels when fine-tuning a s…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Draft; under review

  2. arXiv:2404.00152

    cs.CL

    On-the-fly Definition Augmentation of LLMs for Biomedical NER

    Authors: Monica Munnangi, Sergey Feldman, Byron C Wallace, Silvio Amir, Tom Hope, Aakanksha Naik

    Abstract: Despite their general capabilities, LLMs still struggle on biomedical NER tasks, which are difficult due to the presence of specialized terminology and lack of training data. In this work we set out to improve LLM performance on biomedical NER in limited data settings via a new knowledge augmentation approach which incorporates definitions of relevant concepts on-the-fly. During this process, to p…

    Submitted 23 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: To appear at NAACL 2024 (Main)

  3. arXiv:2401.00986

    cs.CV cs.AI

    Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning

    Authors: Syed Muhammad Aamir, Hongbin Ma, Malak Abid Ali Khan, Muhammad Aaqib

    Abstract: Detection of small, undetermined moving objects or objects in an occluded environment with a cluttered background is the main problem of computer vision. This greatly affects the detection accuracy of deep learning models. To overcome these problems, we concentrate on deep learning models for real-time detection of cars and tanks in an occluded environment with a cluttered background employing SSD…

    Submitted 1 January, 2024; originally announced January 2024.

  4. Disentangling Structure and Appearance in ViT Feature Space

    Authors: Narek Tumanyan, Omer Bar-Tal, Shir Amir, Shai Bagon, Tali Dekel

    Abstract: We present a method for semantically transferring the visual appearance of one natural image to another. Specifically, our goal is to generate an image in which objects in a source structure image are "painted" with the visual appearance of their semantically related objects in a target appearance image. To integrate semantic information into our framework, our key idea is to leverage a pre-traine…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to ACM Transactions on Graphics. arXiv admin note: substantial text overlap with arXiv:2201.00424

  5. arXiv:2309.04550

    cs.CL

    Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges

    Authors: Hiba Ahsan, Denis Jered McInerney, Jisoo Kim, Christopher Potter, Geoffrey Young, Silvio Amir, Byron C. Wallace

    Abstract: Unstructured data in Electronic Health Records (EHRs) often contains critical information -- complementary to imaging -- that could inform radiologists' diagnoses. But the large volume of notes often associated with patients together with time constraints renders manually identifying relevant evidence practically infeasible. In this work we propose and evaluate a zero-shot strategy for using LLMs…

    Submitted 10 June, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

  6. arXiv:2305.05003

    cs.CL

    Revisiting Relation Extraction in the era of Large Language Models

    Authors: Somin Wadhwa, Silvio Amir, Byron C. Wallace

    Abstract: Relation extraction (RE) is the core NLP task of inferring semantic relationships between entities from text. Standard supervised RE techniques entail training modules to tag tokens comprising entity spans and then predict the relationship between them. Recent work has instead treated the problem as a sequence-to-sequence task, linearizing relations between entities as target strings to be…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023

  7. arXiv:2305.03642

    cs.CL

    Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

    Authors: Somin Wadhwa, Jay DeYoung, Benjamin Nye, Silvio Amir, Byron C. Wallace

    Abstract: Results from Randomized Controlled Trials (RCTs) establish the comparative effectiveness of interventions, and are in turn critical inputs for evidence-based care. However, results from RCTs are presented in (often unstructured) natural language articles describing the design, execution, and outcomes of trials; clinicians must manually extract findings pertaining to interventions and outcomes of i…

    Submitted 17 July, 2023; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: Accepted to MLHC 2023

  8. arXiv:2210.09618

    cs.CV

    Object Recognition in Different Lighting Conditions at Various Angles by Deep Learning Method

    Authors: Imran Khan Mirani, Chen Tianhua, Malak Abid Ali Khan, Syed Muhammad Aamir, Waseef Menhaj

    Abstract: Existing computer vision and object detection methods strongly rely on neural networks and deep learning. This active research area is used for applications such as autonomous driving, aerial photography, protection, and monitoring. Futuristic object detection methods rely on rectangular, boundary boxes drawn over an object to accurately locate its location. The modern object recognition algorithm…

    Submitted 18 October, 2022; originally announced October 2022.

  9. arXiv:2210.06331

    cs.CL

    RedHOT: A Corpus of Annotated Medical Questions, Experiences, and Claims on Social Media

    Authors: Somin Wadhwa, Vivek Khetan, Silvio Amir, Byron Wallace

    Abstract: We present Reddit Health Online Talk (RedHOT), a corpus of 22,000 richly annotated social media posts from Reddit spanning 24 health conditions. Annotations include demarcations of spans corresponding to medical claims, personal experiences, and questions. We collect additional granular annotations on identified claims. Specifically, we mark snippets that describe patient Populations, Intervention…

    Submitted 7 February, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to EACL 2023

  10. arXiv:2206.05619

    cs.CV

    Deep Learning Models for Automated Classification of Dog Emotional States from Facial Expressions

    Authors: Tali Boneh-Shitrit, Shir Amir, Annika Bremhorst, Daniel S. Mills, Stefanie Riemer, Dror Fried, Anna Zamansky

    Abstract: Similarly to humans, facial expressions in animals are closely linked with emotional states. However, in contrast to the human domain, automated recognition of emotional states from facial expressions in animals is underexplored, mainly due to difficulties in data collection and establishment of ground truth concerning emotional states of non-verbal users. We apply recent deep learning techniques…

    Submitted 11 June, 2022; originally announced June 2022.

  11. arXiv:2112.05814

    cs.CV

    Deep ViT Features as Dense Visual Descriptors

    Authors: Shir Amir, Yossi Gandelsman, Shai Bagon, Tali Dekel

    Abstract: We study the use of deep features extracted from a pretrained Vision Transformer (ViT) as dense visual descriptors. We observe and empirically demonstrate that such features, when extracted from a self-supervised ViT model (DINO-ViT), exhibit several striking properties, including: (i) the features encode powerful, well-localized semantic information, at high spatial granularity, such as object par…

    Submitted 15 October, 2022; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: Revised version - high res figures

  12. arXiv:2104.06338

    cs.CL

    On the Impact of Random Seeds on the Fairness of Clinical Classifiers

    Authors: Silvio Amir, Jan-Willem van de Meent, Byron C. Wallace

    Abstract: Recent work has shown that fine-tuning large networks is surprisingly sensitive to changes in random seed(s). We explore the implications of this phenomenon for model fairness across demographic groups in clinical prediction tasks over electronic health records (EHR) in MIMIC-III -- the standard dataset in clinical NLP research. Apparent subgroup performance varies substantially for seeds that yie…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted for publication at NAACL 2021

  13. arXiv:2010.06472

    cs.CL

    Demographic Representation and Collective Storytelling in the Me Too Twitter Hashtag Activism Movement

    Authors: Aaron Mueller, Zach Wood-Doughty, Silvio Amir, Mark Dredze, Alicia L. Nobles

    Abstract: The #MeToo movement on Twitter has drawn attention to the pervasive nature of sexual harassment and violence. While #MeToo has been praised for providing support for self-disclosures of harassment or violence and shifting societal response, it has also been criticized for exemplifying how women of color have been discounted for their historical contributions to and excluded from feminist movements…

    Submitted 13 October, 2020; originally announced October 2020.

    Comments: 27 pages (incl. 5 for references). Submitted to CSCW 2021

  14. Demonstrating Advantages of Neuromorphic Computation: A Pilot Study

    Authors: Timo Wunderlich, Akos F. Kungl, Eric Müller, Andreas Hartel, Yannik Stradmann, Syed Ahmed Aamir, Andreas Grübl, Arthur Heimbrecht, Korbinian Schreiber, David Stöckel, Christian Pehle, Sebastian Billaudelle, Gerd Kiene, Christian Mauch, Johannes Schemmel, Karlheinz Meier, Mihai A. Petrovici

    Abstract: Neuromorphic devices represent an attempt to mimic aspects of the brain's architecture and dynamics with the aim of replicating its hallmark functional capabilities in terms of computational power, robust learning and energy efficiency. We employ a single-chip prototype of the BrainScaleS 2 neuromorphic system to implement a proof-of-concept demonstration of reward-modulated spike-timing-dependent…

    Submitted 8 March, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Comments: Added measurements with noise in NEST simulation, add notice about journal publication. Frontiers in Neuromorphic Engineering (2019)

  15. arXiv:1804.01906

    q-bio.NC cs.ET physics.bio-ph physics.comp-ph

    An Accelerated LIF Neuronal Network Array for a Large Scale Mixed-Signal Neuromorphic Architecture

    Authors: Syed Ahmed Aamir, Yannik Stradmann, Paul Müller, Christian Pehle, Andreas Hartel, Andreas Grübl, Johannes Schemmel, Karlheinz Meier

    Abstract: We present an array of leaky integrate-and-fire (LIF) neuron circuits designed for the second-generation BrainScaleS mixed-signal 65-nm CMOS neuromorphic hardware. The neuronal array is embedded in the analog network core of a scaled-down prototype HICANN-DLS chip. Designed as continuous-time circuits, the neurons are highly tunable and reconfigurable elements with accelerated dynamics. Each neuro…

    Submitted 23 May, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: 14 pages, 9 Figures, accepted for publication in IEEE Transactions on Circuits and Systems I

  16. arXiv:1804.01840

    q-bio.NC cs.ET physics.bio-ph

    A Mixed-Signal Structured AdEx Neuron for Accelerated Neuromorphic Cores

    Authors: Syed Ahmed Aamir, Paul Müller, Gerd Kiene, Laura Kriener, Yannik Stradmann, Andreas Grübl, Johannes Schemmel, Karlheinz Meier

    Abstract: Here we describe a multi-compartment neuron circuit based on the Adaptive-Exponential I&F (AdEx) model, developed for the second-generation BrainScaleS hardware. Based on an existing modular Leaky Integrate-and-Fire (LIF) architecture designed in 65 nm CMOS, the circuit features exponential spike generation, neuronal adaptation, inter-compartmental connections as well as a conductance-based reset.…

    Submitted 29 May, 2018; v1 submitted 5 April, 2018; originally announced April 2018.

    Comments: 11 pages, 17 figures (including author photographs)

  17. arXiv:1705.00335

    cs.CL cs.AI cs.SI

    Quantifying Mental Health from Social Media with Neural User Embeddings

    Authors: Silvio Amir, Glen Coppersmith, Paula Carvalho, Mário J. Silva, Byron C. Wallace

    Abstract: Mental illnesses adversely affect a significant proportion of the population worldwide. However, the methods traditionally used for estimating and characterizing the prevalence of mental health conditions are time-consuming and expensive. Consequently, best-available estimates concerning the prevalence of mental health conditions are often years out of date. Automated approaches to supplement thes…

    Submitted 30 April, 2017; originally announced May 2017.

  18. arXiv:1701.00145

    cs.CL

    Expanding Subjective Lexicons for Social Media Mining with Embedding Subspaces

    Authors: Silvio Amir, Ramón Astudillo, Wang Ling, Paula C. Carvalho, Mário J. Silva

    Abstract: Recent approaches for sentiment lexicon induction have capitalized on pre-trained word embeddings that capture latent semantic properties. However, embeddings obtained by optimizing performance of a given task (e.g. predicting contextual words) are sub-optimal for other applications. In this paper, we address this problem by exploiting task-specific representations, induced via embedding sub-space…

    Submitted 6 January, 2017; v1 submitted 31 December, 2016; originally announced January 2017.

  19. arXiv:1607.00976

    cs.CL cs.AI

    Modelling Context with User Embeddings for Sarcasm Detection in Social Media

    Authors: Silvio Amir, Byron C. Wallace, Hao Lyu, Paula Carvalho, Mário J. Silva

    Abstract: We introduce a deep neural network for automated sarcasm detection. Recent work has emphasized the need for models to capitalize on contextual features, beyond lexical and syntactic cues present in utterances. For example, different speakers will tend to employ sarcasm regarding different subjects and, thus, sarcasm detection models ought to encode such speaker information. Current methods have ac…

    Submitted 4 July, 2016; v1 submitted 4 July, 2016; originally announced July 2016.

    Comments: published as a conference paper at CONLL 2016

  20. POPmine: Tracking Political Opinion on the Web

    Authors: Pedro Saleiro, Sílvio Amir, Mário J. Silva, Carlos Soares

    Abstract: The automatic content analysis of mass media in the social sciences has become necessary and possible with the rise of social media and computational power. One particularly promising avenue of research concerns the use of opinion mining. We design and implement the POPmine system which is able to collect texts from web-based conventional media (news items in mainstream media sites) and social me…

    Submitted 29 November, 2015; originally announced November 2015.

    Comments: 2015 IEEE International Conference on Computer and Information Technology, Ubiquitous Computing and Communications

  21. arXiv:1508.02096

    cs.CL

    Finding Function in Form: Compositional Character Models for Open Vocabulary Word Representation

    Authors: Wang Ling, Tiago Luís, Luís Marujo, Ramón Fernandez Astudillo, Silvio Amir, Chris Dyer, Alan W. Black, Isabel Trancoso

    Abstract: We introduce a model for constructing vector representations of words by composing characters using bidirectional LSTMs. Relative to traditional word representation models that have independent vectors for each word type, our model requires only a single vector per character type and a fixed set of parameters for the compositional model. Despite the compactness of this model and, more importantly,…

    Submitted 23 May, 2016; v1 submitted 9 August, 2015; originally announced August 2015.