Skip to main content

Showing 1–14 of 14 results for author: Pilanci, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.19328  [pdf, other

    cs.SD cs.LG eess.AS

    Subtractive Training for Music Stem Insertion using Latent Diffusion Models

    Authors: Ivan Villa-Renteria, Mason L. Wang, Zachary Shah, Zhe Li, Soohyun Kim, Neelesh Ramachandran, Mert Pilanci

    Abstract: We present Subtractive Training, a simple and novel method for synthesizing individual musical instrument stems given other instruments as context. This method pairs a dataset of complete music mixes with 1) a variant of the dataset lacking a specific stem, and 2) LLM-generated instructions describing how the missing stem should be reintroduced. We then fine-tune a pretrained text-to-audio diffusi… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.10254  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Towards Signal Processing In Large Language Models

    Authors: Prateek Verma, Mert Pilanci

    Abstract: This paper introduces the idea of applying signal processing inside a Large Language Model (LLM). With the recent explosion of generative AI, our work can help bridge two fields together, namely the field of signal processing and large language models. We draw parallels between classical Fourier-Transforms and Fourier Transform-like learnable time-frequency representations for every intermediate a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 3 figures

  3. arXiv:2406.08904  [pdf, other

    cs.LG cs.SD eess.AS

    AdaPTwin: Low-Cost Adaptive Compression of Product Twins in Transformers

    Authors: Emil Biju, Anirudh Sriram, Mert Pilanci

    Abstract: While large transformer-based models have exhibited remarkable performance in speaker-independent speech recognition, their large size and computational requirements make them expensive or impractical to use in resource-constrained settings. In this work, we propose a low-rank adaptive compression technique called AdaPTwin that jointly compresses product-dependent pairs of weight matrices in the t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 12 pages, 3 figures, submitted to NeurIPS 2024

  4. arXiv:2305.06482  [pdf, ps, other

    eess.IV

    Coil Sketching for computationally-efficient MR iterative reconstruction

    Authors: Julio A. Oscanoa, Frank Ong, Siddharth S. Iyer, Zhitao Li, Christopher M. Sandino, Batu Ozturkler, Daniel B. Ennis, Mert Pilanci, Shreyas S. Vasanawala

    Abstract: Purpose: Parallel imaging and compressed sensing reconstructions of large MRI datasets often have a prohibitive computational cost that bottlenecks clinical deployment, especially for 3D non-Cartesian acquisitions. One common approach is to reduce the number of coil channels actively used during reconstruction as in coil compression. While effective for Cartesian imaging, coil compression inherent… ▽ More

    Submitted 11 October, 2023; v1 submitted 10 May, 2023; originally announced May 2023.

    Comments: 19 pages, 7 figures, 3 tables

  5. arXiv:2302.13527  [pdf

    eess.AS cs.SD eess.SP

    Complex Clip** for Improved Generalization in Machine Learning

    Authors: Les Atlas, Nicholas Rasmussen, Felix Schwock, Mert Pilanci

    Abstract: For many machine learning applications, a common input representation is a spectrogram. The underlying representation for a spectrogram is a short time Fourier transform (STFT) which gives complex values. The spectrogram uses the magnitude of these complex values, a commonly used detector. Modern machine learning systems are commonly overparameterized, where possible ill-conditioning problems are… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Submitted to IEEE Signal Processing Letters

  6. arXiv:2207.08393  [pdf, other

    eess.IV cs.CV

    GLEAM: Greedy Learning for Large-Scale Accelerated MRI Reconstruction

    Authors: Batu Ozturkler, Arda Sahiner, Tolga Ergen, Arjun D Desai, Christopher M Sandino, Shreyas Vasanawala, John M Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Unrolled neural networks have recently achieved state-of-the-art accelerated MRI reconstruction. These networks unroll iterative optimization algorithms by alternating between physics-based consistency and neural-network based regularization. However, they require several iterations of a large neural network to handle high-dimensional imaging tasks such as 3D MRI. This limits traditional training… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  7. arXiv:2204.10436  [pdf, other

    eess.IV cs.CV cs.LG

    Scale-Equivariant Unrolled Neural Networks for Data-Efficient Accelerated MRI Reconstruction

    Authors: Beliz Gunel, Arda Sahiner, Arjun D. Desai, Akshay S. Chaudhari, Shreyas Vasanawala, Mert Pilanci, John Pauly

    Abstract: Unrolled neural networks have enabled state-of-the-art reconstruction performance and fast inference times for the accelerated magnetic resonance imaging (MRI) reconstruction task. However, these approaches depend on fully-sampled scans as ground truth data which is either costly or not possible to acquire in many clinical medical imaging applications; hence, reducing dependence on data is desirab… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  8. arXiv:2202.11277  [pdf, other

    cs.IT cs.LG eess.SP stat.ML

    Minimax Optimal Quantization of Linear Models: Information-Theoretic Limits and Efficient Algorithms

    Authors: Rajarshi Saha, Mert Pilanci, Andrea J. Goldsmith

    Abstract: High-dimensional models often have a large memory footprint and must be quantized after training before being deployed on resource-constrained edge devices for inference tasks. In this work, we develop an information-theoretic framework for the problem of quantizing a linear regressor learned from training data $(\mathbf{X}, \mathbf{y})$, for some underlying statistical relationship… ▽ More

    Submitted 30 August, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 50 pages, 31 figures, 9 tables

  9. arXiv:2201.08522  [pdf, other

    cs.IT cs.CR eess.SP math.NA

    Orthonormal Sketches for Secure Coded Regression

    Authors: Neophytos Charalambides, Hessam Mahdavifar, Mert Pilanci, Alfred O. Hero III

    Abstract: In this work, we propose a method for speeding up linear regression distributively, while ensuring security. We leverage randomized sketching techniques, and improve straggler resilience in asynchronous systems. Specifically, we apply a random orthonormal matrix and then subsample in \textit{blocks}, to simultaneously secure the information and reduce the dimension of the regression problem. In ou… ▽ More

    Submitted 22 February, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: 3 figures, 5 pages excluding appendices

    MSC Class: 65F10; 65F45; 68W15; 68W20; 68W25; 68P27; 68P30; ACM Class: E.3; E.4; G.1.2; G.1.3

  10. arXiv:2201.01669  [pdf, other

    eess.AS cs.LG cs.SD

    Using Deep Learning with Large Aggregated Datasets for COVID-19 Classification from Cough

    Authors: Esin Darici Haritaoglu, Nicholas Rasmussen, Daniel C. H. Tan, Jennifer Ranjani J., Jaclyn Xiao, Gunvant Chaudhari, Akanksha Rajput, Praveen Govindan, Christian Canham, Wei Chen, Minami Yamaura, Laura Gomezjurado, Aaron Broukhim, Amil Khanzada, Mert Pilanci

    Abstract: The Covid-19 pandemic has been one of the most devastating events in recent history, claiming the lives of more than 5 million people worldwide. Even with the worldwide distribution of vaccines, there is an apparent need for affordable, reliable, and accessible screening techniques to serve parts of the World that do not have access to Western medicine. Artificial Intelligence can provide a soluti… ▽ More

    Submitted 29 March, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

  11. arXiv:2107.05680  [pdf, other

    cs.LG cs.CV eess.IV math.OC stat.ML

    Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions

    Authors: Arda Sahiner, Tolga Ergen, Batu Ozturkler, Burak Bartan, John Pauly, Morteza Mardani, Mert Pilanci

    Abstract: Generative Adversarial Networks (GANs) are commonly used for modeling complex distributions of data. Both the generators and discriminators of GANs are often modeled by neural networks, posing a non-transparent optimization problem which is non-convex and non-concave over the generator and discriminator, respectively. Such networks are often heuristically optimized with gradient descent-ascent (GD… ▽ More

    Submitted 21 March, 2022; v1 submitted 12 July, 2021; originally announced July 2021.

    Comments: Published as paper in ICLR 2022. First two authors contributed equally to this work; 34 pages, 11 figures

  12. arXiv:2012.05169  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    Convex Regularization Behind Neural Reconstruction

    Authors: Arda Sahiner, Morteza Mardani, Batu Ozturkler, Mert Pilanci, John Pauly

    Abstract: Neural networks have shown tremendous potential for reconstructing high-resolution images in inverse problems. The non-convex and opaque nature of neural networks, however, hinders their utility in sensitive applications such as medical imaging. To cope with this challenge, this paper advocates a convex duality framework that makes a two-layer fully-convolutional ReLU denoising network amenable to… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

  13. Linear Predictive Coding for Acute Stress Prediction from Computer Mouse Movements

    Authors: Lawrence H. Kim, Rahul Goel, Jia Liang, Mert Pilanci, Pablo E. Paredes

    Abstract: Prior work demonstrated the potential of using the Linear Predictive Coding (LPC) filter to approximate muscle stiffness and dam** from computer mouse movements to predict acute stress levels of users. Theoretically, muscle stiffness and dam** in the arm can be estimated using a mass-spring-damper (MSD) biomechanical model. However, the dam** frequency (i.e., stiffness) and dam** ratio val… ▽ More

    Submitted 15 December, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: The first three authors contributed equally. 5 pages, 6 figures, 2 tables, published at EMBC'21

    Journal ref: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC)

  14. arXiv:2002.10674  [pdf, other

    cs.NE cs.LG eess.SP

    Separating the Effects of Batch Normalization on CNN Training Speed and Stability Using Classical Adaptive Filter Theory

    Authors: Elaina Chai, Mert Pilanci, Boris Murmann

    Abstract: Batch Normalization (BatchNorm) is commonly used in Convolutional Neural Networks (CNNs) to improve training speed and stability. However, there is still limited consensus on why this technique is effective. This paper uses concepts from the traditional adaptive filter domain to provide insight into the dynamics and inner workings of BatchNorm. First, we show that the convolution weight updates ha… ▽ More

    Submitted 1 June, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Presented at Asilomar Conference on Signals, Systems, and Computers, 2020