Skip to main content

Showing 1–46 of 46 results for author: Azizi, S

.
  1. arXiv:2406.14854  [pdf, other

    cs.CV cs.AI eess.IV

    PEANO-ViT: Power-Efficient Approximations of Non-Linearities in Vision Transformers

    Authors: Mohammad Erfan Sadeghi, Arash Fayyazi, Seyedarmin Azizi, Massoud Pedram

    Abstract: The deployment of Vision Transformers (ViTs) on hardware platforms, specially Field-Programmable Gate Arrays (FPGAs), presents many challenges, which are mainly due to the substantial computational and power requirements of their non-linear functions, notably layer normalization, softmax, and Gaussian Error Linear Unit (GELU). These critical functions pose significant obstacles to efficient hardwa… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2406.12832  [pdf, other

    cs.CL cs.AI cs.LG

    LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation

    Authors: Seyedarmin Azizi, Souvik Kundu, Massoud Pedram

    Abstract: Low-rank adaptation (LoRA) has become the default approach to fine-tune large language models (LLMs) due to its significant reduction in trainable parameters. However, trainable parameter demand for LoRA increases with increasing model embedding dimensions, leading to high compute costs. Additionally, its backward updates require storing high-dimensional intermediate activations and optimizer stat… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  3. arXiv:2406.06316  [pdf, other

    cs.CL cs.AI cs.CE cs.LG

    Tx-LLM: A Large Language Model for Therapeutics

    Authors: Juan Manuel Zambrano Chaves, Eric Wang, Tao Tu, Eeshit Dhaval Vaishnav, Byron Lee, S. Sara Mahdavi, Christopher Semturs, David Fleet, Vivek Natarajan, Shekoofeh Azizi

    Abstract: Develo** therapeutics is a lengthy and expensive process that requires the satisfaction of many different criteria, and AI models capable of expediting the process would be invaluable. However, the majority of current AI approaches address only a narrowly defined set of tasks, often circumscribed within a particular domain. To bridge this gap, we introduce Tx-LLM, a generalist large language mod… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2405.03162  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Advancing Multimodal Medical Capabilities of Gemini

    Authors: Lin Yang, Shawn Xu, Andrew Sellergren, Timo Kohlberger, Yuchen Zhou, Ira Ktena, Atilla Kiraly, Faruk Ahmed, Farhad Hormozdiari, Tiam Jaroensri, Eric Wang, Ellery Wulczyn, Fayaz Jamil, Theo Guidroz, Chuck Lau, Siyuan Qiao, Yun Liu, Akshay Goel, Kendall Park, Arnav Agharwal, Nick George, Yang Wang, Ryutaro Tanno, David G. T. Barrett, Wei-Hung Weng , et al. (22 additional authors not shown)

    Abstract: Many clinical tasks require an understanding of specialized data, such as medical images and genomics, which is not typically found in general-purpose large multimodal models. Building upon Gemini's multimodal models, we develop several models within the new Med-Gemini family that inherit core capabilities of Gemini and are optimized for medical use via fine-tuning with 2D and 3D radiology, histop… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  5. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  6. arXiv:2403.12025  [pdf, other

    cs.CY cs.CL cs.LG

    A Toolbox for Surfacing Health Equity Harms and Biases in Large Language Models

    Authors: Stephen R. Pfohl, Heather Cole-Lewis, Rory Sayres, Darlene Neal, Mercy Asiedu, Awa Dieng, Nenad Tomasev, Qazi Mamunur Rashid, Shekoofeh Azizi, Negar Rostamzadeh, Liam G. McCoy, Leo Anthony Celi, Yun Liu, Mike Schaekermann, Alanna Walton, Alicia Parrish, Chirag Nagpal, Preeti Singh, Akeiylah Dewitt, Philip Mansfield, Sushant Prakash, Katherine Heller, Alan Karthikesalingam, Christopher Semturs, Joelle Barral , et al. (5 additional authors not shown)

    Abstract: Large language models (LLMs) hold immense promise to serve complex health information needs but also have the potential to introduce harm and exacerbate health disparities. Reliably evaluating equity-related model failures is a critical step toward develo** systems that promote health equity. In this work, we present resources and methodologies for surfacing biases with potential to precipitate… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  7. arXiv:2402.11763  [pdf, other

    eess.SY

    Autonomous Hyperspectral Characterisation Station: Robotically Assisted Characterisation of Polymer Degradation

    Authors: Shayan Azizi, Ehsan Asadi, Shaun Howard, Benjamin W. Muir, Riley O'Shea, Alireza Bab-Hadiashar

    Abstract: This paper addresses the gap between the capabilities and utilisation of robotics and automation in laboratory settings and builds upon the concept of Self Driving Labs (SDL). %to significantly impact laboratory operations. We introduce an innovative approach to the temporal characterisation of materials. The article discusses the challenges posed by manual methods involving established laboratory… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 11 pages, 12 figures

  8. arXiv:2402.06004  [pdf, other

    cs.CV cs.AI stat.ML

    Memory-Efficient Vision Transformers: An Activation-Aware Mixed-Rank Compression Strategy

    Authors: Seyedarmin Azizi, Mahdi Nazemi, Massoud Pedram

    Abstract: As Vision Transformers (ViTs) increasingly set new benchmarks in computer vision, their practical deployment on inference engines is often hindered by their significant memory bandwidth and (on-chip) memory footprint requirements. This paper addresses this memory limitation by introducing an activation-aware model compression methodology that uses selective low-rank weight tensor approximations of… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  9. arXiv:2401.05654  [pdf, other

    cs.AI cs.CL cs.LG

    Towards Conversational Diagnostic AI

    Authors: Tao Tu, Anil Palepu, Mike Schaekermann, Khaled Saab, Jan Freyberg, Ryutaro Tanno, Amy Wang, Brenna Li, Mohamed Amin, Nenad Tomasev, Shekoofeh Azizi, Karan Singhal, Yong Cheng, Le Hou, Albert Webson, Kavita Kulkarni, S Sara Mahdavi, Christopher Semturs, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias, Alan Karthikesalingam, Vivek Natarajan

    Abstract: At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introdu… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

    Comments: 46 pages, 5 figures in main text, 19 figures in appendix

  10. arXiv:2312.02210  [pdf, other

    cs.LG cs.AI

    Low-Precision Mixed-Computation Models for Inference on Edge

    Authors: Seyedarmin Azizi, Mahdi Nazemi, Mehdi Kamal, Massoud Pedram

    Abstract: This paper presents a mixed-computation neural network processing approach for edge applications that incorporates low-precision (low-width) Posit and low-precision fixed point (FixP) number systems. This mixed-computation approach employs 4-bit Posit (Posit4), which has higher precision around zero, for representing weights with high sensitivity, while it uses 4-bit FixP (FixP4) for representing… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  11. arXiv:2312.00164  [pdf, other

    cs.CY cs.AI

    Towards Accurate Differential Diagnosis with Large Language Models

    Authors: Daniel McDuff, Mike Schaekermann, Tao Tu, Anil Palepu, Amy Wang, Jake Garrison, Karan Singhal, Yash Sharma, Shekoofeh Azizi, Kavita Kulkarni, Le Hou, Yong Cheng, Yun Liu, S Sara Mahdavi, Sushant Prakash, Anupam Pathak, Christopher Semturs, Shwetak Patel, Dale R Webster, Ewa Dominowska, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias , et al. (3 additional authors not shown)

    Abstract: An accurate differential diagnosis (DDx) is a cornerstone of medical care, often reached through an iterative process of interpretation that combines clinical history, physical examination, investigations and procedures. Interactive interfaces powered by Large Language Models (LLMs) present new opportunities to both assist and automate aspects of this process. In this study, we introduce an LLM op… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  12. arXiv:2311.18260  [pdf, other

    eess.IV cs.CL cs.CV cs.LG

    Consensus, dissensus and synergy between clinicians and specialist foundation models in radiology report generation

    Authors: Ryutaro Tanno, David G. T. Barrett, Andrew Sellergren, Sumedh Ghaisas, Sumanth Dathathri, Abigail See, Johannes Welbl, Karan Singhal, Shekoofeh Azizi, Tao Tu, Mike Schaekermann, Rhys May, Roy Lee, SiWai Man, Zahra Ahmed, Sara Mahdavi, Yossi Matias, Joelle Barral, Ali Eslami, Danielle Belgrave, Vivek Natarajan, Shravya Shetty, Pushmeet Kohli, Po-Sen Huang, Alan Karthikesalingam , et al. (1 additional authors not shown)

    Abstract: Radiology reports are an instrumental part of modern medicine, informing key clinical decisions such as diagnosis and treatment. The worldwide shortage of radiologists, however, restricts access to expert care and imposes heavy workloads, contributing to avoidable errors and delays in report delivery. While recent progress in automated report generation with vision-language models offer clear pote… ▽ More

    Submitted 20 December, 2023; v1 submitted 30 November, 2023; originally announced November 2023.

  13. arXiv:2308.06422  [pdf, other

    cs.LG cs.AI cs.NE stat.ML

    Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation

    Authors: Seyedarmin Azizi, Mahdi Nazemi, Arash Fayyazi, Massoud Pedram

    Abstract: As the complexity and computational demands of deep learning models rise, the need for effective optimization methods for neural network designs becomes paramount. This work introduces an innovative search mechanism for automatically selecting the best bit-width and layer-width for individual neural network layers. This leads to a marked enhancement in deep neural network efficiency. The search do… ▽ More

    Submitted 16 August, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

  14. arXiv:2307.14334  [pdf, other

    cs.CL cs.CV

    Towards Generalist Biomedical AI

    Authors: Tao Tu, Shekoofeh Azizi, Danny Driess, Mike Schaekermann, Mohamed Amin, Pi-Chuan Chang, Andrew Carroll, Chuck Lau, Ryutaro Tanno, Ira Ktena, Basil Mustafa, Aakanksha Chowdhery, Yun Liu, Simon Kornblith, David Fleet, Philip Mansfield, Sushant Prakash, Renee Wong, Sunny Virmani, Christopher Semturs, S Sara Mahdavi, Bradley Green, Ewa Dominowska, Blaise Aguera y Arcas, Joelle Barral , et al. (7 additional authors not shown)

    Abstract: Medicine is inherently multimodal, with rich data modalities spanning text, imaging, genomics, and more. Generalist biomedical artificial intelligence (AI) systems that flexibly encode, integrate, and interpret this data at scale can potentially enable impactful applications ranging from scientific discovery to care delivery. To enable the development of these models, we first curate MultiMedBench… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  15. arXiv:2305.09617  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Expert-Level Medical Question Answering with Large Language Models

    Authors: Karan Singhal, Tao Tu, Juraj Gottweis, Rory Sayres, Ellery Wulczyn, Le Hou, Kevin Clark, Stephen Pfohl, Heather Cole-Lewis, Darlene Neal, Mike Schaekermann, Amy Wang, Mohamed Amin, Sami Lachgar, Philip Mansfield, Sushant Prakash, Bradley Green, Ewa Dominowska, Blaise Aguera y Arcas, Nenad Tomasev, Yun Liu, Renee Wong, Christopher Semturs, S. Sara Mahdavi, Joelle Barral , et al. (6 additional authors not shown)

    Abstract: Recent artificial intelligence (AI) systems have reached milestones in "grand challenges" ranging from Go to protein-folding. The capability to retrieve medical knowledge, reason over it, and answer medical questions comparably to physicians has long been viewed as one such grand challenge. Large language models (LLMs) have catalyzed significant progress in medical question answering; Med-PaLM w… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  16. arXiv:2305.04526  [pdf, other

    cs.CV

    CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation

    Authors: Jung Hwan Heo, Seyedarmin Azizi, Arash Fayyazi, Massoud Pedram

    Abstract: Transfer learning has become a popular task adaptation method in the era of foundation models. However, many foundation models require large storage and computing resources, which makes off-the-shelf deployment impractical. Post-training compression techniques such as pruning and quantization can help lower deployment costs. Unfortunately, the resulting performance degradation limits the usability… ▽ More

    Submitted 8 July, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Preprint

  17. arXiv:2304.09218  [pdf, other

    cs.CV

    Generative models improve fairness of medical classifiers under distribution shifts

    Authors: Ira Ktena, Olivia Wiles, Isabela Albuquerque, Sylvestre-Alvise Rebuffi, Ryutaro Tanno, Abhijit Guha Roy, Shekoofeh Azizi, Danielle Belgrave, Pushmeet Kohli, Alan Karthikesalingam, Taylan Cemgil, Sven Gowal

    Abstract: A ubiquitous challenge in machine learning is the problem of domain generalisation. This can exacerbate bias against groups or labels that are underrepresented in the datasets used for model development. Model bias can lead to unintended harms, especially in safety-critical applications like healthcare. Furthermore, the challenge is compounded by the difficulty of obtaining labelled data due to hi… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  18. arXiv:2304.08466  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Synthetic Data from Diffusion Models Improves ImageNet Classification

    Authors: Shekoofeh Azizi, Simon Kornblith, Chitwan Saharia, Mohammad Norouzi, David J. Fleet

    Abstract: Deep generative models are becoming increasingly powerful, now generating diverse high fidelity photo-realistic samples given text prompts. Have they reached the point where models of natural images can be used for generative data augmentation, hel** to improve challenging discriminative tasks? We show that large-scale text-to image diffusion models can be fine-tuned to produce class conditional… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  19. arXiv:2304.07464  [pdf

    physics.ao-ph math.DS

    Non-Integer Dimension of Seasonal Land Surface Temperature (LST)

    Authors: Sepideh Azizi, Tahmineh Azizi

    Abstract: During few last years, climate change including global warming which is attributed to human activities and also its long-term adverse effects on the planet's functions have been identified as the most challenging discussion topics which have arisen many concerns and efforts to find the possible solutions. Since the warmth arising from Earth's landscapes affects the world's weather and climate patt… ▽ More

    Submitted 14 April, 2023; originally announced April 2023.

  20. arXiv:2303.02331  [pdf, other

    cs.CV cs.AI cs.LG

    Training-Free Acceleration of ViTs with Delayed Spatial Merging

    Authors: Jung Hwan Heo, Seyedarmin Azizi, Arash Fayyazi, Massoud Pedram

    Abstract: Token merging has emerged as a new paradigm that can accelerate the inference of Vision Transformers (ViTs) without any retraining or fine-tuning. To push the frontier of training-free acceleration in ViTs, we improve token merging by adding the perspectives of 1) activation outliers and 2) hierarchical representations. Through a careful analysis of the attention behavior in ViTs, we characterize… ▽ More

    Submitted 1 July, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: ICML 2024 ES-FoMo Workshop

  21. arXiv:2212.13138  [pdf, other

    cs.CL

    Large Language Models Encode Clinical Knowledge

    Authors: Karan Singhal, Shekoofeh Azizi, Tao Tu, S. Sara Mahdavi, Jason Wei, Hyung Won Chung, Nathan Scales, Ajay Tanwani, Heather Cole-Lewis, Stephen Pfohl, Perry Payne, Martin Seneviratne, Paul Gamble, Chris Kelly, Nathaneal Scharli, Aakanksha Chowdhery, Philip Mansfield, Blaise Aguera y Arcas, Dale Webster, Greg S. Corrado, Yossi Matias, Katherine Chou, Juraj Gottweis, Nenad Tomasev, Yun Liu , et al. (5 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, but the quality bar for medical and clinical applications is high. Today, attempts to assess models' clinical knowledge typically rely on automated evaluations on limited benchmarks. There is no standard to evaluate model predictions and reasoning across a breadth of tasks. To a… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

  22. arXiv:2212.12044  [pdf, other

    q-fin.ST cs.AI

    Design interpretable experience of dynamical feed forward machine learning model for forecasting NASDAQ

    Authors: Pouriya Khalilian, Sara Azizi, Mohammad Hossein Amiri, Javad T. Firouzjaee

    Abstract: National Association of Securities Dealers Automated Quotations(NASDAQ) is an American stock exchange based. It is one of the most valuable stock economic indices in the world and is located in New York City \cite{pagano2008quality}. The volatility of the stock market and the influence of economic indicators such as crude oil, gold, and the dollar in the stock market, and NASDAQ shares are also af… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: 21 pages, 5 figures

  23. arXiv:2209.06941  [pdf, other

    cs.CV cs.LG

    Joint Debiased Representation and Image Clustering Learning with Self-Supervision

    Authors: Shunjie-Fabian Zheng, JaeEun Nam, Emilio Dorigatti, Bernd Bischl, Shekoofeh Azizi, Mina Rezaei

    Abstract: Contrastive learning is among the most successful methods for visual representation learning, and its performance can be further improved by jointly performing clustering on the learned representations. However, existing methods for joint clustering and contrastive learning do not perform well on long-tailed data distributions, as majority classes overwhelm and distort the loss of minority classes… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

  24. arXiv:2205.09723  [pdf, other

    cs.CV cs.AI cs.LG

    Robust and Efficient Medical Imaging with Self-Supervision

    Authors: Shekoofeh Azizi, Laura Culp, Jan Freyberg, Basil Mustafa, Sebastien Baur, Simon Kornblith, Ting Chen, Patricia MacWilliams, S. Sara Mahdavi, Ellery Wulczyn, Boris Babenko, Megan Wilson, Aaron Loh, Po-Hsuan Cameron Chen, Yuan Liu, Pinal Bavishi, Scott Mayer McKinney, Jim Winkens, Abhijit Guha Roy, Zach Beaver, Fiona Ryan, Justin Krogue, Mozziyar Etemadi, Umesh Telang, Yun Liu , et al. (9 additional authors not shown)

    Abstract: Recent progress in Medical Artificial Intelligence (AI) has delivered systems that can reach clinical expert level performance. However, such systems tend to demonstrate sub-optimal "out-of-distribution" performance when evaluated in clinical settings different from the training environment. A common mitigation strategy is to develop separate systems for each clinical setting using site-specific d… ▽ More

    Submitted 3 July, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

  25. arXiv:2204.08144  [pdf, other

    gr-qc hep-th

    Hawking Temperature for 4D-Einstein-Gauss-Bonnet Black Holes from uncertainty principle

    Authors: Sara Azizi, Sareh Eslamzadeh, Javad T. Firouzjaee, Kourosh Nozari

    Abstract: Inspired by string theory, Heisenberg's uncertainty principle can be generalized to include the photon-electron gravitational interaction, which leads to the Generalized Uncertainty Principle (GUP). Although GUP considers gravitational uncertainty at the minimum fundamental length scale in physics, it does not consider the effects of spacetime curvature on quantum mechanical uncertainty relations.… ▽ More

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: 19 pages, 15 figures

  26. arXiv:2201.08018  [pdf, other

    cs.LG cs.AI

    Transfer Learning for Fault Diagnosis of Transmission Lines

    Authors: Fatemeh Mohammadi Shakiba, Milad Shojaee, S. Mohsen Azizi, Mengchu Zhou

    Abstract: Recent artificial intelligence-based methods have shown great promise in the use of neural networks for real-time sensing and detection of transmission line faults and estimation of their locations. The expansion of power systems including transmission lines with various lengths have made a fault detection, classification, and location estimation process more challenging. Transmission line dataset… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

  27. arXiv:2109.07455  [pdf, other

    cs.CV cs.AI cs.LG

    Deep Bregman Divergence for Contrastive Learning of Visual Representations

    Authors: Mina Rezaei, Farzin Soleymani, Bernd Bischl, Shekoofeh Azizi

    Abstract: Deep Bregman divergence measures divergence of data points using neural networks which is beyond Euclidean distance and capable of capturing divergence over distributions. In this paper, we propose deep Bregman divergences for contrastive learning of visual representation where we aim to enhance contrastive loss used in self-supervised learning by training additional networks based on functional B… ▽ More

    Submitted 22 November, 2021; v1 submitted 15 September, 2021; originally announced September 2021.

  28. arXiv:2104.13974  [pdf, other

    cs.DC math.OC

    Joint QoS-aware and Cost-efficient Task Scheduling for Fog-Cloud Resources in a Volunteer Computing System

    Authors: Farooq Hoseiny, Sadoon Azizi, Mohammad Shojafar, Rahim Tafazolli

    Abstract: Volunteer computing is an Internet-based distributed computing system in which volunteers share their extra available resources to manage large-scale tasks. However, computing devices in a Volunteer Computing System (VCS) are highly dynamic and heterogeneous in terms of their processing power, monetary cost, and data transferring latency. To ensure both the high Quality of Service (QoS) and low co… ▽ More

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: 21 pages, 6 figures, ACM Transactions on Internet Technology (TOIT)

    MSC Class: 52 ACM Class: C.2.1; G.1.6

  29. Non-adiabatic ionization with tailored laser pulses

    Authors: Sajad Azizi, Ulf Saalmann, Jan M Rost

    Abstract: Non-adiabatic photo-ionization is difficult to control as it relies on the derivatives of the envelope and not on phase-details of the short ionizing pulse. Here, we introduce a catalyzing state, whose presence render non-adiabatic ionization sensitive to phase-details of tailored pulses. Since a catalyzing state is in general easy to create, this opens a perspective for coherent control of ultra-… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: 9 pages, 5 figures, 2 appendices

  30. Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions

    Authors: Abhijit Guha Roy, Jie Ren, Shekoofeh Azizi, Aaron Loh, Vivek Natarajan, Basil Mustafa, Nick Pawlowski, Jan Freyberg, Yuan Liu, Zach Beaver, Nam Vo, Peggy Bui, Samantha Winter, Patricia MacWilliams, Greg S. Corrado, Umesh Telang, Yun Liu, Taylan Cemgil, Alan Karthikesalingam, Balaji Lakshminarayanan, Jim Winkens

    Abstract: We develop and rigorously evaluate a deep learning based system that can accurately classify skin conditions while detecting rare conditions for which there is not enough data available for training a confident classifier. We frame this task as an out-of-distribution (OOD) detection problem. Our novel approach, hierarchical outlier detection (HOD) assigns multiple abstention classes for each train… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: Under Review, 19 Pages

    Journal ref: Medical Image Analysis (2022)

  31. arXiv:2101.05224  [pdf, other

    eess.IV cs.CV cs.LG

    Big Self-Supervised Models Advance Medical Image Classification

    Authors: Shekoofeh Azizi, Basil Mustafa, Fiona Ryan, Zachary Beaver, Jan Freyberg, Jonathan Deaton, Aaron Loh, Alan Karthikesalingam, Simon Kornblith, Ting Chen, Vivek Natarajan, Mohammad Norouzi

    Abstract: Self-supervised pretraining followed by supervised fine-tuning has seen success in image recognition, especially when labeled examples are scarce, but has received limited attention in medical image analysis. This paper studies the effectiveness of self-supervised learning as a pretraining strategy for medical image classification. We conduct experiments on two distinct tasks: dermatology skin con… ▽ More

    Submitted 1 April, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

  32. arXiv:2005.09704  [pdf, other

    cs.CV cs.GR

    Contextual Residual Aggregation for Ultra High-Resolution Image Inpainting

    Authors: Zili Yi, Qiang Tang, Shekoofeh Azizi, Daesik Jang, Zhan Xu

    Abstract: Recently data-driven image inpainting methods have made inspiring progress, impacting fundamental image editing tasks such as object removal and damaged image repairing. These methods are more effective than classic approaches, however, due to memory limitations they can only handle low-resolution inputs, typically smaller than 1K. Meanwhile, the resolution of photos captured with mobile devices i… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

    Comments: CVPR 2020 oral paper. 22 pages, 11 figures

  33. arXiv:2001.10863  [pdf

    eess.SY

    The Voltage Regulation of Boost Converters Using Dual Heuristic Programming

    Authors: Sepehr Saadatmand, Mohammadamir Kavousi, Sima Azizi

    Abstract: In this paper, a dual heuristic programming controller is proposed to control a boost converter. Conventional controllers such as proportional integral derivative (PID) or proportional integral (PI) are designed based on the linearized small-signal model near the operating point. Therefore, the performance of the controller during start up, load change, or input voltage variation is not optimal si… ▽ More

    Submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted paper in: 2020 IEEE 10th Annual Computing and Communication Workshop and Conference (CCWC), 2020

  34. arXiv:2001.08841  [pdf

    cs.RO cs.LG stat.ML

    Autonomous Control of a Line Follower Robot Using a Q-Learning Controller

    Authors: Sepehr Saadatmand, Sima Azizi, Mohammadamir Kavousi, Donald Wunsch

    Abstract: In this paper, a MIMO simulated annealing SA based Q learning method is proposed to control a line follower robot. The conventional controller for these types of robots is the proportional P controller. Considering the unknown mechanical characteristics of the robot and uncertainties such as friction and slippery surfaces, system modeling and controller designing can be extremely challenging. The… ▽ More

    Submitted 23 January, 2020; originally announced January 2020.

    Comments: Accepted paper in IEEE CCWC 2020

  35. arXiv:1903.01015  [pdf, other

    cs.CV

    A Kernelized Manifold Map** to Diminish the Effect of Adversarial Perturbations

    Authors: Saeid Asgari Taghanaki, Kumar Abhishek, Shekoofeh Azizi, Ghassan Hamarneh

    Abstract: The linear and non-flexible nature of deep convolutional models makes them vulnerable to carefully crafted adversarial perturbations. To tackle this problem, we propose a non-linear radial basis convolutional feature map** by learning a Mahalanobis-like distance function. Our method then maps the convolutional features onto a linearly well-separated manifold, which prevents small adversarial per… ▽ More

    Submitted 8 May, 2019; v1 submitted 3 March, 2019; originally announced March 2019.

    Comments: Accepted to CVPR 2019. 10 pages, 6 figures

  36. arXiv:1803.05667  [pdf

    cs.IR cs.CL

    A Study of Recent Contributions on Information Extraction

    Authors: Parisa Naderi Golshan, HosseinAli Rahmani Dashti, Shahrzad Azizi, Leila Safari

    Abstract: This paper reports on modern approaches in Information Extraction (IE) and its two main sub-tasks of Named Entity Recognition (NER) and Relation Extraction (RE). Basic concepts and the most recent approaches in this area are reviewed, which mainly include Machine Learning (ML) based approaches and the more recent trend to Deep Learning (DL) based methods.

    Submitted 15 March, 2018; originally announced March 2018.

  37. arXiv:1709.03020  [pdf

    cs.MM

    Hierarchical Watermarking Framework Based on Analysis of Local Complexity Variations

    Authors: Majid Mohrekesh, Shekoofeh Azizi, Shahram Shirani, Nader Karimi, Shadrokh Samavi

    Abstract: Increasing production and exchange of multimedia content has increased the need for better protection of copyright by means of watermarking. Different methods have been proposed to satisfy the tradeoff between imperceptibility and robustness as two important characteristics in watermarking while maintaining proper data-embedding capacity. Many watermarking methods use image independent set of para… ▽ More

    Submitted 9 September, 2017; originally announced September 2017.

    Comments: 12 pages, 14 figures, 8 tables

  38. arXiv:1701.08843  [pdf

    cond-mat.mes-hall

    Dynamics of a capacitive electret-based microcantilever for energy harvesting

    Authors: Mahyar Ghavami Khabir, Saber Azizi, Mohammad Reza Ghazavi

    Abstract: In this paper, a novel electret-based capacitive energy harvesting device has been developed according to out-of-plane gap closing scheme. The device is composed of a micro cantilever and a substrate which form a variable capacitor and is in series with a resistance. An electret material is used to provide the bias voltage which is needed in capacitive energy harvesters in order to scavenge energy… ▽ More

    Submitted 27 January, 2017; originally announced January 2017.

  39. arXiv:1611.04105  [pdf

    cond-mat.mes-hall

    Nonlinear dynamics of a functionally graded piezoelectric micro-resonator in the vicinity of the primary resonance

    Authors: Meysam T. Chorsi, Saber Azizi, Firooz Bakhtiari-Nejad

    Abstract: This research is on the nonlinear dynamics of a two-sided electrostatically actuated capacitive micro-beam. The microresonator is composed of silicon and PZT as a piezoelectric material. PZT is functionally distributed along the height of the micro-beam according to the power law distribution. The micro-resonator is simultaneously subjected to DC piezoelectric and two-sided electrostatic actuation… ▽ More

    Submitted 13 November, 2016; originally announced November 2016.

    Comments: Journal of Vibration and Control (2015)

  40. arXiv:1610.09425  [pdf, other

    physics.atom-ph cond-mat.quant-gas

    Investigation of Confinement Induced Resonance in Atomic Waveguides with Different Geometries by Quantum Monte Carlo Methods

    Authors: Sajad Azizi, Shahpoor Saeidian

    Abstract: We have investigated the quantum dynamics of two ultracold bosons inside an atomic waveguide for two different confinement geometries (cigar-shaped and toroidal waveguides) by quantum Monte Carlo methods. For quasi-1D gases, the confining potential of the waveguide leads to the so-called confinement induced resonance (CIR), results in the phase transition of the gas to the impenetrable bosonic reg… ▽ More

    Submitted 28 October, 2016; originally announced October 2016.

  41. arXiv:1207.5574  [pdf, ps, other

    math.PR

    A Simple Proof of Berry-Esséen Bounds for the Quadratic Variation of the Subfractional Brownian Motion

    Authors: Soufiane Aazizi

    Abstract: We give a simple technic to derive the Berry-Esséen bounds for the quadratic variation of the subfractional Brownian motion (subfBm). Our approach has two main ingredients: ($i$) bounding from above the covariance of quadratic variation of subfBm by the covariance of the quadratic variation of fractional Brownian motion (fBm); and ($ii$) using the existing results on fBm in \cite{BN08,NP09,N12}. A… ▽ More

    Submitted 23 July, 2012; originally announced July 2012.

  42. arXiv:1206.3433  [pdf, ps, other

    math.PR

    Optimal switching problem and system of reflected multi-dimensional FBSDEs with random terminal time

    Authors: Soufiane Aazizi, Imade Fakhouri

    Abstract: In this paper, we study the solvability of a class of multi-dimensional forward backward stochastic differential equations (FBSDEs) with oblique reflection and unbounded stop** time. Under some mild assumptions on the coefficients in such FBSDE, the existence result of adapted solutions is done via a penalization method. The uniqueness is obtained by a verification theorem similarly to the one u… ▽ More

    Submitted 2 July, 2012; v1 submitted 15 June, 2012; originally announced June 2012.

    MSC Class: 60H10; 93E20

  43. arXiv:1203.2786  [pdf, ps, other

    math.PR

    Berry-Esséen bounds and almost sure CLT for the quadratic variation of the bifractional Brownian motion

    Authors: Soufiane Aazizi, Khalifa Es-Sebaiy

    Abstract: Let $B$ be a bifractional Brownian motion with parameters $H\in (0, 1)$ and $K\in(0,1]$. For any $n\geq1$, set $Z_n =\sum_{i=0}^{n-1}\big[n^{2HK}(B_{(i+1)/n}-B_{i/n})^2-\E((B_{i+1}-B_{i})^2)\big]$. We use the Malliavin calculus and the so-called Stein's method on Wiener chaos introduced by Nourdin and Peccati \cite{NP09} to derive, in the case when $0<HK\leq3/4$, Berry-Esséen-type bounds for the K… ▽ More

    Submitted 27 March, 2012; v1 submitted 13 March, 2012; originally announced March 2012.

  44. arXiv:1112.0255  [pdf, ps, other

    math.PR

    Strong envelope and strong supermartingale: application to reflected bsdes

    Authors: Soufiane Aazizi, Youssef Ouknine

    Abstract: We provide several characterizations to identify Strong envelop (for bounded measurable process) and Strong super-martingale (for non-negative right upper semi-continuous process of the class $\Dc$). As examples of application, we prove existence and uniqueness of reflected backward stochastic differential equation with lower barrier (RBSDB in short) in two cases: $i)$. the obstacle is a measurabl… ▽ More

    Submitted 4 January, 2016; v1 submitted 1 December, 2011; originally announced December 2011.

    MSC Class: 60H30; 60G40; 93E20

  45. arXiv:1110.5059  [pdf, ps, other

    math.PR

    Discrete time approximation of decoupled Forward-Backward SDE driven by pure jump Lévy-processes

    Authors: Soufiane Aazizi

    Abstract: We present a new algorithms to discretize a decoupled forward backward stochastic differential equations driven by pure jump Lévy process (FBSDEL in short). The method is built in two steps. Firstly, we approximate the FBSDEL by a forward backward stochastic differential equations driven by a Brownian motion and Poisson process (FBSDEBP in short), in which we replace the small jumps by a Brownian… ▽ More

    Submitted 23 October, 2011; originally announced October 2011.

    MSC Class: 60H35; 60H07; 60J75

  46. arXiv:1110.0422  [pdf, ps, other

    math.PR

    Skorohod equation and BSDE's with two reflecting barriers

    Authors: Soufiane Aazizi

    Abstract: We solve a class of doubly reflected backward stochastic differential equation whose generator depends on the resistance due to reflections, which extend the recent work of Qian and Xu on reflected BSDE with one barrier. We then obtain the existence and uniqueness and the continuous dependence theorem for this reflected BSDE.

    Submitted 3 October, 2011; originally announced October 2011.

    MSC Class: 60H10; 60H30; 60J45