Skip to main content

Showing 1–14 of 14 results for author: Sekhon, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.02016  [pdf, other

    cs.CL cs.AI

    Improving Interpretability via Explicit Word Interaction Graph Layer

    Authors: Arshdeep Sekhon, Hanjie Chen, Aman Shrivastava, Zhe Wang, Yangfeng Ji, Yanjun Qi

    Abstract: Recent NLP literature has seen growing interest in improving model interpretability. Along this direction, we propose a trainable neural network layer that learns a global interaction graph between words and then selects more informative words using the learned word interactions. Our layer, we call WIGRAPH, can plug into any neural network-based NLP text classifiers right after its word embedding… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: 15 pages, AAAI 2023

    Journal ref: AAAI 2023

  2. arXiv:2205.05050  [pdf, other

    cs.CL cs.LG

    White-box Testing of NLP models with Mask Neuron Coverage

    Authors: Arshdeep Sekhon, Yangfeng Ji, Matthew B. Dwyer, Yanjun Qi

    Abstract: Recent literature has seen growing interest in using black-box strategies like CheckList for testing the behavior of NLP models. Research on white-box testing has developed a number of methods for evaluating how thoroughly the internal behavior of deep models is tested, but they are not applicable to NLP models. We propose a set of white-box testing methods that are customized for transformer-base… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Findings of NAACL 2022 submission, 12 pages

    Journal ref: Findings of NAACL 2022

  3. arXiv:2109.13305  [pdf, other

    cs.LG stat.ML

    ST-MAML: A Stochastic-Task based Method for Task-Heterogeneous Meta-Learning

    Authors: Zhe Wang, Jake Grigsby, Arshdeep Sekhon, Yanjun Qi

    Abstract: Optimization-based meta-learning typically assumes tasks are sampled from a single distribution - an assumption oversimplifies and limits the diversity of tasks that meta-learning can model. Handling tasks from multiple different distributions is challenging for meta-learning due to a so-called task ambiguity issue. This paper proposes a novel method, ST-MAML, that empowers model-agnostic meta-lea… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  4. arXiv:2108.04990  [pdf, other

    cs.CL

    Perturbing Inputs for Fragile Interpretations in Deep Natural Language Processing

    Authors: Sanchit Sinha, Hanjie Chen, Arshdeep Sekhon, Yangfeng Ji, Yanjun Qi

    Abstract: Interpretability methods like Integrated Gradient and LIME are popular choices for explaining natural language model predictions with relative word importance scores. These interpretations need to be robust for trustworthy NLP applications in high-stake areas like medicine or finance. Our paper demonstrates how interpretations can be manipulated by making simple word perturbations on an input text… ▽ More

    Submitted 15 September, 2021; v1 submitted 10 August, 2021; originally announced August 2021.

    Comments: EMNLP-BlackboxNLP, 2021

  5. arXiv:2106.09011  [pdf, other

    cs.CV cs.LG cs.NE

    Evolving Image Compositions for Feature Representation Learning

    Authors: Paola Cascante-Bonilla, Arshdeep Sekhon, Yanjun Qi, Vicente Ordonez

    Abstract: Convolutional neural networks for visual recognition require large amounts of training samples and usually benefit from data augmentation. This paper proposes PatchMix, a data augmentation method that creates new samples by composing patches from pairs of images in a grid-like pattern. These new samples are assigned label scores that are proportional to the number of patches borrowed from each ima… ▽ More

    Submitted 31 March, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted to BMVC 2021. Camera-Ready version. Project page: https://paolacascante.com/patchmix/index.html

  6. arXiv:2103.02405  [pdf, other

    cs.LG cs.AI stat.ML

    Relate and Predict: Structure-Aware Prediction with Jointly Optimized Neural DAG

    Authors: Arshdeep Sekhon, Zhe Wang, Yanjun Qi

    Abstract: Understanding relationships between feature variables is one important way humans use to make decisions. However, state-of-the-art deep learning studies either focus on task-agnostic statistical dependency learning or do not model explicit feature dependencies during prediction. We propose a deep neural network framework, dGAP, to learn neural dependency Graph and optimize structure-Aware target P… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

    Comments: 8 pages, 6 figures, version appeared in ICML Workshop 2020 Graph Representation Learning and Beyond (GRL+)

  7. arXiv:2004.11494  [pdf, other

    cs.LG stat.ML

    Beyond Data Samples: Aligning Differential Networks Estimation with Scientific Knowledge

    Authors: Arshdeep Sekhon, Zhe Wang, Yanjun Qi

    Abstract: Learning the differential statistical dependency network between two contexts is essential for many real-life applications, mostly in the high dimensional low sample regime. In this paper, we propose a novel differential network estimator that allows integrating various sources of knowledge beyond data samples. The proposed estimator is scalable to a large number of variables and achieves a sharp… ▽ More

    Submitted 21 April, 2022; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: 9 pages of main draft; 25 pages of Appendix; 5 Tables ; 14 Figures ; Learning of Structure Difference between Two Graphical Models

    Journal ref: The 25th International Conference on Artificial Intelligence and Statistics 2022

  8. arXiv:1904.08049  [pdf, other

    cs.LG cs.IR stat.ML

    Neural Message Passing for Multi-Label Classification

    Authors: Jack Lanchantin, Arshdeep Sekhon, Yanjun Qi

    Abstract: Multi-label classification (MLC) is the task of assigning a set of target labels for a given sample. Modeling the combinatorial label interactions in MLC has been a long-haul challenge. We propose Label Message Passing (LaMP) Neural Networks to efficiently model the joint prediction of multiple labels. LaMP treats labels as nodes on a label-interaction graph and computes the hidden representation… ▽ More

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: 19pages. We provide our code and datasets at https://github.com/QData/LaMP

  9. arXiv:1807.03878  [pdf, other

    cs.LG cs.CY stat.ML

    DeepDiff: Deep-learning for predicting Differential gene expression from histone modifications

    Authors: Arshdeep Sekhon, Ritambhara Singh, Yanjun Qi

    Abstract: Computational methods that predict differential gene expression from histone modification signals are highly desirable for understanding how histone modifications control the functional heterogeneity of cells through influencing differential gene regulation. Recent studies either failed to capture combinatorial effects on differential prediction or primarily only focused on cell type-specific anal… ▽ More

    Submitted 10 July, 2018; originally announced July 2018.

  10. arXiv:1806.00548  [pdf, other

    cs.LG stat.ML

    A Fast and Scalable Joint Estimator for Integrating Additional Knowledge in Learning Multiple Related Sparse Gaussian Graphical Models

    Authors: Beilun Wang, Arshdeep Sekhon, Yanjun Qi

    Abstract: We consider the problem of including additional knowledge in estimating sparse Gaussian graphical models (sGGMs) from aggregated samples, arising often in bioinformatics and neuroimaging applications. Previous joint sGGM estimators either fail to use existing knowledge or cannot scale-up to many tasks (large $K$) under a high-dimensional (large $p$) situation. In this paper, we propose a novel \un… ▽ More

    Submitted 16 April, 2019; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: ICML 2018; Proof and Design of W in Appendix; Available as R tool "jeek". This updated version correct a few equation errors

    Journal ref: International Conference on Machine Learning. 2018

  11. arXiv:1710.11238  [pdf, other

    cs.LG cs.AI stat.ML

    Prototype Matching Networks for Large-Scale Multi-label Genomic Sequence Classification

    Authors: Jack Lanchantin, Arshdeep Sekhon, Ritambhara Singh, Yanjun Qi

    Abstract: One of the fundamental tasks in understanding genomics is the problem of predicting Transcription Factor Binding Sites (TFBSs). With more than hundreds of Transcription Factors (TFs) as labels, genomic-sequence based TFBS prediction is a challenging multi-label classification task. There are two major biological mechanisms for TF binding: (1) sequence-specific binding patterns on genomes known as… ▽ More

    Submitted 10 November, 2017; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: 15 pages, 6 figures, 5 tables

  12. arXiv:1710.11223  [pdf, other

    cs.LG cs.AI stat.ML

    Fast and Scalable Learning of Sparse Changes in High-Dimensional Gaussian Graphical Model Structure

    Authors: Beilun Wang, Arshdeep Sekhon, Yanjun Qi

    Abstract: We focus on the problem of estimating the change in the dependency structures of two $p$-dimensional Gaussian Graphical models (GGMs). Previous studies for sparse change estimation in GGMs involve expensive and difficult non-smooth optimization. We propose a novel method, DIFFEE for estimating DIFFerential networks via an Elementary Estimator under a high-dimensional situation. DIFFEE is solved th… ▽ More

    Submitted 23 May, 2018; v1 submitted 30 October, 2017; originally announced October 2017.

    Comments: 20pages, 6 figures, 10 tables; at AISTAT 2018

  13. arXiv:1708.00339  [pdf, other

    cs.LG cs.AI cs.NE

    Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin

    Authors: Ritambhara Singh, Jack Lanchantin, Arshdeep Sekhon, Yanjun Qi

    Abstract: The past decade has seen a revolution in genomic technologies that enable a flood of genome-wide profiling of chromatin marks. Recent literature tried to understand gene regulation by predicting gene expression from large-scale chromatin measurements. Two fundamental challenges exist for such learning tasks: (1) genome-wide chromatin signals are spatially structured, high-dimensional and highly mo… ▽ More

    Submitted 7 November, 2017; v1 submitted 1 August, 2017; originally announced August 2017.

    Comments: 12 pages; At NIPS 2017

  14. arXiv:1704.07468  [pdf, other

    cs.LG cs.AI cs.CC cs.CL cs.DS

    GaKCo: a Fast GApped k-mer string Kernel using COunting

    Authors: Ritambhara Singh, Arshdeep Sekhon, Kamran Kowsari, Jack Lanchantin, Beilun Wang, Yanjun Qi

    Abstract: String Kernel (SK) techniques, especially those using gapped $k$-mers as features (gk), have obtained great success in classifying sequences like DNA, protein, and text. However, the state-of-the-art gk-SK runs extremely slow when we increase the dictionary size ($Σ$) or allow more mismatches ($M$). This is because current gk-SK uses a trie-based algorithm to calculate co-occurrence of mismatched… ▽ More

    Submitted 18 September, 2017; v1 submitted 24 April, 2017; originally announced April 2017.

    Comments: @ECML 2017