
Showing 1–14 of 14 results for author: Emami, M

Searching in archive cs.
  1. arXiv:2404.07376  [pdf, other]

    cs.CL

    LLMs in Biomedicine: A study on clinical Named Entity Recognition

    Authors: Masoud Monajatipoor, Jiaxin Yang, Joel Stremmel, Melika Emami, Fazlolah Mohaghegh, Mozhdeh Rouhsedaghat, Kai-Wei Chang

    Abstract: Large Language Models (LLMs) demonstrate remarkable versatility in various NLP tasks but encounter distinct challenges in biomedicine due to medical language complexities and data scarcity. This paper investigates the application of LLMs in the medical domain by exploring strategies to enhance their performance for the Named-Entity Recognition (NER) task. Specifically, our study reveals the import…

    Submitted 10 April, 2024; originally announced April 2024.

  2. arXiv:2403.04714  [pdf, other]

    cs.DC cs.AR

    Parendi: Thousand-Way Parallel RTL Simulation

    Authors: Mahyar Emami, Thomas Bourgeat, James Larus

    Abstract: Hardware development relies on simulations, particularly cycle-accurate RTL (Register Transfer Level) simulations, which consume significant time. As single-processor performance grows only slowly, conventional, single-threaded RTL simulation is becoming less practical for increasingly complex chips and systems. A solution is parallel RTL simulation, where ideally, simulators could run on thousand…

    Submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2312.07802  [pdf, other]

    cs.LG cs.IT stat.ML

    Estimation of embedding vectors in high dimensions

    Authors: Golara Ahmadi Azar, Melika Emami, Alyson Fletcher, Sundeep Rangan

    Abstract: Embeddings are a basic initial feature extraction step in many machine learning models, particularly in natural language processing. An embedding attempts to map data tokens to a low-dimensional space where similar tokens are mapped to vectors that are close to one another by some metric in the embedding space. A basic question is how well can such embedding be learned? To study this problem, we c…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 12 pages, 7 figures

  4. arXiv:2310.03752  [pdf, other]

    eess.SP cs.HC cs.LG cs.RO

    A Deep Learning Sequential Decoder for Transient High-Density Electromyography in Hand Gesture Recognition Using Subject-Embedded Transfer Learning

    Authors: Golara Ahmadi Azar, Qin Hu, Melika Emami, Alyson Fletcher, Sundeep Rangan, S. Farokh Atashzar

    Abstract: Hand gesture recognition (HGR) has gained significant attention due to the increasing use of AI-powered human-computer interfaces that can interpret the deep spatiotemporal dynamics of biosignals from the peripheral nervous system, such as surface electromyography (sEMG). These interfaces have a range of applications, including the control of extended reality, agile prosthetics, and exoskeletons.…

    Submitted 23 September, 2023; originally announced October 2023.

  5. Manticore: Hardware-Accelerated RTL Simulation with Static Bulk-Synchronous Parallelism

    Authors: Mahyar Emami, Sahand Kashani, Keisuke Kamahori, Mohammad Sepehr Pourghannad, Ritik Raj, James R. Larus

    Abstract: The demise of Moore's Law and Dennard Scaling has revived interest in specialized computer architectures and accelerators. Verification and testing of this hardware depend heavily upon cycle-accurate simulation of register-transfer-level (RTL) designs. The fastest software RTL simulators can simulate designs at 1–1000 kHz, i.e., more than three orders of magnitude slower than hardware. Improved s…

    Submitted 20 October, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

  6. arXiv:2201.08082  [pdf, other]

    stat.ML cs.LG

    Kernel Methods and Multi-layer Perceptrons Learn Linear Models in High Dimensions

    Authors: Mojtaba Sahraee-Ardakan, Melikasadat Emami, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Empirical observation of high dimensional phenomena, such as the double descent behaviour, has attracted a lot of interest in understanding classical techniques such as kernel methods, and their implications to explain generalization properties of neural networks. Many recent works analyze such models in a certain high-dimensional regime where the covariates are independent and the number of sampl…

    Submitted 20 January, 2022; originally announced January 2022.

  7. arXiv:2112.10950  [pdf, other]

    eess.AS cs.LG cs.SD

    Augmented Contrastive Self-Supervised Learning for Audio Invariant Representations

    Authors: Melikasadat Emami, Dung Tran, Kazuhito Koishida

    Abstract: Improving generalization is a major challenge in audio classification due to labeled data scarcity. Self-supervised learning (SSL) methods tackle this by leveraging unlabeled data to learn useful features for downstream classification tasks. In this work, we propose an augmented contrastive SSL framework to learn invariant representations from unlabeled data. Our method applies various perturbatio…

    Submitted 20 December, 2021; originally announced December 2021.

    Comments: 4 pages, 4 figures

  8. arXiv:2107.09333  [pdf, other]

    cs.AR cs.CL cs.PF

    StreamBlocks: A compiler for heterogeneous dataflow computing (technical report)

    Authors: Endri Bezati, Mahyar Emami, Jörn Janneck, James Larus

    Abstract: To increase performance and efficiency, systems use FPGAs as reconfigurable accelerators. A key challenge in designing these systems is partitioning computation between processors and an FPGA. An appropriate division of labor may be difficult to predict in advance and require experiments and measurements. When an investigation requires rewriting part of the system in a new language or with a new p…

    Submitted 20 July, 2021; originally announced July 2021.

    ACM Class: C.5; D.1.3; D.3.0; I.6.5; B.6.0; B.8.2; B.4.0

  9. arXiv:2101.07833  [pdf, ps, other]

    cs.LG cs.NE eess.SY stat.ML

    Implicit Bias of Linear RNNs

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: Contemporary wisdom based on empirical studies suggests that standard recurrent neural networks (RNNs) do not perform well on tasks requiring long-term memory. However, precise reasoning for this behavior is still unknown. This paper provides a rigorous explanation of this property in the special case of linear RNNs. Although this work is limited to linear RNNs, even these systems have traditional…

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: 30 pages, 4 figures

  10. arXiv:2005.05053  [pdf, other]

    q-bio.NC cs.LG cs.NE eess.SP stat.ML

    Low-Rank Nonlinear Decoding of $μ$-ECoG from the Primary Auditory Cortex

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Alyson K. Fletcher, Sundeep Rangan, Michael Trumpis, Brinnae Bent, Chia-Han Chiang, Jonathan Viventi

    Abstract: This paper considers the problem of neural decoding from parallel neural measurements systems such as micro-electrocorticography ($μ$-ECoG). In systems with large numbers of array elements at very high sampling rates, the dimension of the raw measurement data may be large. Learning neural decoders for this high-dimensional data can be challenging, particularly when the number of training samples i…

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: 4 pages, 3 figures

  11. arXiv:2005.00180  [pdf, other]

    cs.LG stat.ML

    Generalization Error of Generalized Linear Models in High Dimensions

    Authors: Melikasadat Emami, Mojtaba Sahraee-Ardakan, Parthe Pandit, Sundeep Rangan, Alyson K. Fletcher

    Abstract: At the heart of machine learning lies the question of generalizability of learned rules over previously unseen data. While over-parameterized models based on neural networks are now ubiquitous in machine learning applications, our understanding of their generalization capabilities is incomplete. This task is made harder by the non-convexity of the underlying learning problems. We provide a general…

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: 20 pages, 4 figures

  12. arXiv:1910.13672  [pdf, other]

    cs.LG stat.ML

    Input-Output Equivalence of Unitary and Contractive RNNs

    Authors: M. Emami, M. Sahraee-Ardakan, S. Rangan, A. K. Fletcher

    Abstract: Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This work shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice th…

    Submitted 30 October, 2019; originally announced October 2019.

  13. arXiv:1206.1984  [pdf]

    cs.DC

    Energy-Aware Scheduling using Dynamic Voltage-Frequency Scaling

    Authors: Masnida Emami, Yashar Ghiasi, Nasrin Jaberi

    Abstract: The energy consumption issue in distributed computing systems has become quite critical due to environmental concerns. In response to this, many energy-aware scheduling algorithms have been developed primarily by using the dynamic voltage-frequency scaling (DVFS) capability incorporated in recent commodity processors. The majority of these algorithms involve two passes: schedule generation and sla…

    Submitted 9 June, 2012; originally announced June 2012.

    Comments: arXiv admin note: text overlap with arXiv:1203.5160

  14. arXiv:1204.1225  [pdf]

    cs.DC physics.geo-ph

    Distributed computing of Seismic Imaging Algorithms

    Authors: Masnida Emami, Ali Setayesh, Nasrin Jaberi

    Abstract: The primary use of technical computing in the oil and gas industries is for seismic imaging of the earth's subsurface, driven by the business need for making well-informed drilling decisions during petroleum exploration and production. Since each oil/gas well in exploration areas costs several tens of millions of dollars, producing high-quality seismic images in a reasonable time can significantly…

    Submitted 5 April, 2012; originally announced April 2012.