-
Saturated absorption spectroscopy and frequency locking of DBR laser on the D2 transition of rubidium atoms
Authors:
Davood Razzaghi,
Ali MotazediFard,
Marzieh Akbari,
Seyed Ahmad Madani,
Masoud Yousefi,
Ali Allahi,
Ghazal Mehrabanpajooh,
Mohsen Shokrolahi,
Hamid Asgari,
Zafar Riazi
Abstract:
In this paper, we experimentally report the saturated absorption spectroscopy (SAS) and frequency locking (FL) of a narrow-band DBR laser with 0.5MHz linewidth on the LD2-transition of Rb atoms.
In this paper, we experimentally report the saturated absorption spectroscopy (SAS) and frequency locking (FL) of a narrow-band DBR laser with 0.5MHz linewidth on the LD2-transition of Rb atoms.
△ Less
Submitted 27 May, 2024; v1 submitted 23 May, 2024;
originally announced May 2024.
-
Towards Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion
Authors:
Alex Morehead,
Jeffrey Ruffolo,
Aadyot Bhatnagar,
Ali Madani
Abstract:
Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a genera…
▽ More
Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a generative model that jointly designs sequences and structures of nucleic acid and protein complexes, independently or in complex, using joint SE(3)-discrete diffusion noise. Such a model has important implications for emerging areas of macromolecular design including structure-based transcription factor design and design of noncoding RNA sequences. We demonstrate the utility of MMDiff through a rigorous new design benchmark for macromolecular complex generation that we introduce in this work. Our results demonstrate that MMDiff is able to successfully generate micro-RNA and single-stranded DNA molecules while being modestly capable of joint modeling DNA and RNA molecules in interaction with multi-chain protein complexes. Source code: https://github.com/Profluent-Internships/MMDiff.
△ Less
Submitted 21 December, 2023;
originally announced January 2024.
-
Distance-Preserving Graph Compression Techniques
Authors:
Amirali Madani,
Anil Maheshwari
Abstract:
We study the problem of distance-preserving graph compression for weighted paths and trees. The problem entails a weighted graph $G = (V, E)$ with non-negative weights, and a subset of edges $E^{\prime} \subset E$ which needs to be removed from G (with their endpoints merged as a supernode). The goal is to redistribute the weights of the deleted edges in a way that minimizes the error. The error i…
▽ More
We study the problem of distance-preserving graph compression for weighted paths and trees. The problem entails a weighted graph $G = (V, E)$ with non-negative weights, and a subset of edges $E^{\prime} \subset E$ which needs to be removed from G (with their endpoints merged as a supernode). The goal is to redistribute the weights of the deleted edges in a way that minimizes the error. The error is defined as the sum of the absolute differences of the shortest path lengths between different pairs of nodes before and after contracting $E^{\prime}$. Based on this error function, we propose optimal approaches for merging any subset of edges in a path and a single edge in a tree. Previous works on graph compression techniques aimed at preserving different graph properties (such as the chromatic number) or solely focused on identifying the optimal set of edges to contract. However, our focus in this paper is on achieving optimal edge contraction (when the contracted edges are provided as input) specifically for weighted trees and paths.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
QR-SACP: Quantitative Risk-based Situational Awareness Calculation and Projection through Threat Information Sharing
Authors:
Mahdieh Safarzadehvahed,
Farzaneh Abazari,
Afsaneh Madani,
Fatemeh Shabani
Abstract:
When a threat is observed, one of the most important challenges is to choose the most appropriate and adequate timely decisions in response to the current and near future situation in order to have the least consequences and costs. Making the appropriate and sufficient decisions requires knowing what situations the threat has engendered or may engender. In this paper, we propose a quantitative ris…
▽ More
When a threat is observed, one of the most important challenges is to choose the most appropriate and adequate timely decisions in response to the current and near future situation in order to have the least consequences and costs. Making the appropriate and sufficient decisions requires knowing what situations the threat has engendered or may engender. In this paper, we propose a quantitative risk-based method called QR-SACP to calculate and project situational awareness in a network based on threat information sharing. In this method, we investigate a threat from different aspects and evaluate the threat's effects through dependency weight among a network's services. We calculate the definite effect of a threat on a service and the cascading propagation of the threat's definite effect on other dependent services to that service. In addition, we project the probability of a threat propagation or recurrence of the threat in other network services in three ways: procedurally, network connections and similar infrastructure or services. Experimental results demonstrate that the QR-SACP method can calculate and project definite and probable threats' effects across the entire network and reveal more details about the threat's current and near future situations.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Understanding metric-related pitfalls in image analysis validation
Authors:
Annika Reinke,
Minu D. Tizabi,
Michael Baumgartner,
Matthias Eisenmann,
Doreen Heckmann-Nötzel,
A. Emre Kavur,
Tim Rädsch,
Carole H. Sudre,
Laura Acion,
Michela Antonelli,
Tal Arbel,
Spyridon Bakas,
Arriel Benis,
Matthew Blaschko,
Florian Buettner,
M. Jorge Cardoso,
Veronika Cheplygina,
Jianxu Chen,
Evangelia Christodoulou,
Beth A. Cimini,
Gary S. Collins,
Keyvan Farahani,
Luciana Ferrer,
Adrian Galdran,
Bram van Ginneken
, et al. (53 additional authors not shown)
Abstract:
Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibilit…
▽ More
Validation metrics are key for the reliable tracking of scientific progress and for bridging the current chasm between artificial intelligence (AI) research and its translation into practice. However, increasing evidence shows that particularly in image analysis, metrics are often chosen inadequately in relation to the underlying research problem. This could be attributed to a lack of accessibility of metric-related knowledge: While taking into account the individual strengths, weaknesses, and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multi-stage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides the first reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Focusing on biomedical image analysis but with the potential of transfer to other fields, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. To facilitate comprehension, illustrations and specific examples accompany each pitfall. As a structured body of information accessible to researchers of all levels of expertise, this work enhances global comprehension of a key topic in image analysis validation.
△ Less
Submitted 23 February, 2024; v1 submitted 3 February, 2023;
originally announced February 2023.
-
Robot-Assisted Drilling on Curved Surfaces with Haptic Guidance under Adaptive Admittance Control
Authors:
Alireza Madani,
Pouya P. Niaz,
Berk Guler,
Yusuf Aydin,
Cagatay Basdogan
Abstract:
Drilling a hole on a curved surface with a desired angle is prone to failure when done manually, due to the difficulties in drill alignment and also inherent instabilities of the task, potentially causing injury and fatigue to the workers. On the other hand, it can be impractical to fully automate such a task in real manufacturing environments because the parts arriving at an assembly line can hav…
▽ More
Drilling a hole on a curved surface with a desired angle is prone to failure when done manually, due to the difficulties in drill alignment and also inherent instabilities of the task, potentially causing injury and fatigue to the workers. On the other hand, it can be impractical to fully automate such a task in real manufacturing environments because the parts arriving at an assembly line can have various complex shapes where drill point locations are not easily accessible, making automated path planning difficult. In this work, an adaptive admittance controller with 6 degrees of freedom is developed and deployed on a KUKA LBR iiwa 7 cobot such that the operator is able to manipulate a drill mounted on the robot with one hand comfortably and open holes on a curved surface with haptic guidance of the cobot and visual guidance provided through an AR interface. Real-time adaptation of the admittance dam** provides more transparency when driving the robot in free space while ensuring stability during drilling. After the user brings the drill sufficiently close to the drill target and roughly aligns to the desired drilling angle, the haptic guidance module fine tunes the alignment first and then constrains the user movement to the drilling axis only, after which the operator simply pushes the drill into the workpiece with minimal effort. Two sets of experiments were conducted to investigate the potential benefits of the haptic guidance module quantitatively (Experiment I) and also the practical value of the proposed pHRI system for real manufacturing settings based on the subjective opinion of the participants (Experiment II).
△ Less
Submitted 28 July, 2022;
originally announced July 2022.
-
ProGen2: Exploring the Boundaries of Protein Language Models
Authors:
Erik Nijkamp,
Jeffrey Ruffolo,
Eli N. Weinstein,
Nikhil Naik,
Ali Madani
Abstract:
Attention-based models trained on protein sequences have demonstrated incredible success at classification and generation tasks relevant for artificial intelligence-driven protein design. However, we lack a sufficient understanding of how very large-scale models and data play a role in effective protein model development. We introduce a suite of protein language models, named ProGen2, that are sca…
▽ More
Attention-based models trained on protein sequences have demonstrated incredible success at classification and generation tasks relevant for artificial intelligence-driven protein design. However, we lack a sufficient understanding of how very large-scale models and data play a role in effective protein model development. We introduce a suite of protein language models, named ProGen2, that are scaled up to 6.4B parameters and trained on different sequence datasets drawn from over a billion proteins from genomic, metagenomic, and immune repertoire databases. ProGen2 models show state-of-the-art performance in capturing the distribution of observed evolutionary sequences, generating novel viable sequences, and predicting protein fitness without additional finetuning. As large model sizes and raw numbers of protein sequences continue to become more widely accessible, our results suggest that a growing emphasis needs to be placed on the data distribution provided to a protein sequence model. We release the ProGen2 models and code at https://github.com/salesforce/progen.
△ Less
Submitted 27 June, 2022;
originally announced June 2022.
-
Metrics reloaded: Recommendations for image analysis validation
Authors:
Lena Maier-Hein,
Annika Reinke,
Patrick Godau,
Minu D. Tizabi,
Florian Buettner,
Evangelia Christodoulou,
Ben Glocker,
Fabian Isensee,
Jens Kleesiek,
Michal Kozubek,
Mauricio Reyes,
Michael A. Riegler,
Manuel Wiesenfarth,
A. Emre Kavur,
Carole H. Sudre,
Michael Baumgartner,
Matthias Eisenmann,
Doreen Heckmann-Nötzel,
Tim Rädsch,
Laura Acion,
Michela Antonelli,
Tal Arbel,
Spyridon Bakas,
Arriel Benis,
Matthew Blaschko
, et al. (49 additional authors not shown)
Abstract:
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international ex…
▽ More
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. Particularly in automatic biomedical image analysis, chosen performance metrics often do not reflect the domain interest, thus failing to adequately measure scientific progress and hindering translation of ML techniques into practice. To overcome this, our large international expert consortium created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. The framework was developed in a multi-stage Delphi process and is based on the novel concept of a problem fingerprint - a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), data set and algorithm output. Based on the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as a classification task at image, object or pixel level, namely image-level classification, object detection, semantic segmentation, and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool, which also provides a point of access to explore weaknesses, strengths and specific recommendations for the most common validation metrics. The broad applicability of our framework across domains is demonstrated by an instantiation for various biological and medical image analysis use cases.
△ Less
Submitted 23 February, 2024; v1 submitted 3 June, 2022;
originally announced June 2022.
-
An adaptive admittance controller for collaborative drilling with a robot based on subtask classification via deep learning
Authors:
Berk Guler,
Pouya P. Niaz,
Alireza Madani,
Yusuf Aydin,
Cagatay Basdogan
Abstract:
In this paper, we propose a supervised learning approach based on an Artificial Neural Network (ANN) model for real-time classification of subtasks in a physical human-robot interaction (pHRI) task involving contact with a stiff environment. In this regard, we consider three subtasks for a given pHRI task: Idle, Driving, and Contact. Based on this classification, the parameters of an admittance co…
▽ More
In this paper, we propose a supervised learning approach based on an Artificial Neural Network (ANN) model for real-time classification of subtasks in a physical human-robot interaction (pHRI) task involving contact with a stiff environment. In this regard, we consider three subtasks for a given pHRI task: Idle, Driving, and Contact. Based on this classification, the parameters of an admittance controller that regulates the interaction between human and robot are adjusted adaptively in real time to make the robot more transparent to the operator (i.e. less resistant) during the Driving phase and more stable during the Contact phase. The Idle phase is primarily used to detect the initiation of task. Experimental results have shown that the ANN model can learn to detect the subtasks under different admittance controller conditions with an accuracy of 98% for 12 participants. Finally, we show that the admittance adaptation based on the proposed subtask classifier leads to 20% lower human effort (i.e. higher transparency) in the Driving phase and 25% lower oscillation amplitude (i.e. higher stability) during drilling in the Contact phase compared to an admittance controller with fixed parameters.
△ Less
Submitted 31 May, 2022; v1 submitted 28 May, 2022;
originally announced May 2022.
-
Deep Extrapolation for Attribute-Enhanced Generation
Authors:
Alvin Chan,
Ali Madani,
Ben Krause,
Nikhil Naik
Abstract:
Attribute extrapolation in sample generation is challenging for deep neural networks operating beyond the training distribution. We formulate a new task for extrapolation in sequence generation, focusing on natural language and proteins, and propose GENhance, a generative framework that enhances attributes through a learned latent space. Trained on movie reviews and a computed protein stability da…
▽ More
Attribute extrapolation in sample generation is challenging for deep neural networks operating beyond the training distribution. We formulate a new task for extrapolation in sequence generation, focusing on natural language and proteins, and propose GENhance, a generative framework that enhances attributes through a learned latent space. Trained on movie reviews and a computed protein stability dataset, GENhance can generate strongly-positive text reviews and highly stable protein sequences without being exposed to similar data during training. We release our benchmark tasks and models to contribute to the study of generative modeling extrapolation and data-driven design in biology and chemistry.
△ Less
Submitted 25 October, 2021; v1 submitted 6 July, 2021;
originally announced July 2021.
-
Common Limitations of Image Processing Metrics: A Picture Story
Authors:
Annika Reinke,
Minu D. Tizabi,
Carole H. Sudre,
Matthias Eisenmann,
Tim Rädsch,
Michael Baumgartner,
Laura Acion,
Michela Antonelli,
Tal Arbel,
Spyridon Bakas,
Peter Bankhead,
Arriel Benis,
Matthew Blaschko,
Florian Buettner,
M. Jorge Cardoso,
Jianxu Chen,
Veronika Cheplygina,
Evangelia Christodoulou,
Beth Cimini,
Gary S. Collins,
Sandy Engelhardt,
Keyvan Farahani,
Luciana Ferrer,
Adrian Galdran,
Bram van Ginneken
, et al. (68 additional authors not shown)
Abstract:
While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using spe…
▽ More
While the importance of automatic image analysis is continuously increasing, recent meta-research revealed major flaws with respect to algorithm validation. Performance metrics are particularly key for meaningful, objective, and transparent performance assessment and validation of the used automatic algorithms, but relatively little attention has been given to the practical pitfalls when using specific metrics for a given image analysis task. These are typically related to (1) the disregard of inherent metric properties, such as the behaviour in the presence of class imbalance or small target structures, (2) the disregard of inherent data set properties, such as the non-independence of the test cases, and (3) the disregard of the actual biomedical domain interest that the metrics should reflect. This living dynamically document has the purpose to illustrate important limitations of performance metrics commonly applied in the field of image analysis. In this context, it focuses on biomedical image analysis problems that can be phrased as image-level classification, semantic segmentation, instance segmentation, or object detection task. The current version is based on a Delphi process on metrics conducted by an international consortium of image analysis experts from more than 60 institutions worldwide.
△ Less
Submitted 6 December, 2023; v1 submitted 12 April, 2021;
originally announced April 2021.
-
High-precision Quantum Transmitometry of DNA and Methylene-Blue using a Frequency-Entangled Twin-Photon Beam in Type-I SPDC
Authors:
Ali Motazedifard,
S. A. Madani
Abstract:
Using the coincidence-count (CC) measurement of the generated frequency-entangled twin-photons beam (TWB) via the process of type-I spontaneous parametric-down conversion (SPDC) in BBO nonlinear crystal (NLC), we have precisely measured the transmittance of very diluted Rabbit- and Human-DNA, Methylene-Blue (MB), as a disinfectant, and thin-film multilayer at near IR wavelength 810nm with an accur…
▽ More
Using the coincidence-count (CC) measurement of the generated frequency-entangled twin-photons beam (TWB) via the process of type-I spontaneous parametric-down conversion (SPDC) in BBO nonlinear crystal (NLC), we have precisely measured the transmittance of very diluted Rabbit- and Human-DNA, Methylene-Blue (MB), as a disinfectant, and thin-film multilayer at near IR wavelength 810nm with an accuracy in order of $\% 0.01 $ due to the quantum correlation, while accuracy of classical-like measurement, single-count (SC), is in order of $\% 0.1 $ in our setup. Moreover, using quantum measurement of the transmittance, the different types of DNA with the same concentration, and also very diluted (in order of pg/$ μ$l) different concentrations of DNA and MB solutions are distinguished and detected with high-reliability. Interestingly, in case of Human-DNA samples in contrast to our classical-like measurement we could precisely detect and distinguish two very diluted concentrations $ 0.01\rm ng/μl $ and $ 0.1\rm ng/μl $ with high reliability while commercial standard spectrometer device of our DNA-manufacturer never could detect and distinguish them. Surprisingly, measurement on the thin-film multilayer illustrates that the introduced method in this work might be performed to cancer/brain tissues or Stem cells for cancer therapy, and may hopefully open a pave and platform for non-invasive quantum diagnosis in future.
△ Less
Submitted 18 March, 2021;
originally announced March 2021.
-
Nonlocal realism tests and quantum state tomography in Sagnac-based type-II polarization-entanglement SPDC-source
Authors:
Ali Motazedifard,
Seyed Ahmad Madani,
J. Jafari Dashkasan,
N. Sobhkhiz Vayaghan
Abstract:
We have experimentally created a robust, ultrabright and phase-stable polarization-entangled state close to maximally entangled Bell-state with $ \% 98 $-fidelity using the type-II spontaneous parametric down-conversion (SPDC) process in periodically-poled KTiOPO$ _4 $ (PPKTP) collinear crystal inside a Sagnac interferometer (SI). Bell inequality measurement, Freedman's test, as the different vers…
▽ More
We have experimentally created a robust, ultrabright and phase-stable polarization-entangled state close to maximally entangled Bell-state with $ \% 98 $-fidelity using the type-II spontaneous parametric down-conversion (SPDC) process in periodically-poled KTiOPO$ _4 $ (PPKTP) collinear crystal inside a Sagnac interferometer (SI). Bell inequality measurement, Freedman's test, as the different versions of CHSH inequality, and also visibility test which all can be seen as the nonlocal realism tests, imply that our created entangled state shows a strong violation from the classical physics or any hidden-variable theory. We have obtained very reliable and very strong Bell violation as $ S=2.78 \pm 0.01 $ with high brightness $ \mathcal{V}_{\rm HV}= \% (99.969 \pm 0.003) $ and $\mathcal{V}_{\rm DA}= \% (96.751 \pm 0.002) $ and very strong violation due to Freedman test as $ δ_{\rm F} = 0.01715 \pm 0.00001 $. Furthermore, using the tomographic reconstruction of quantum states together a maximum-likelihood-technique (MLT) as the numerical optimization, we obtain the physical non-negative definite density operator which shows the nonseparability and entanglement of our prepared state. By having the maximum likelihood density operator, we calculate some important entanglement-measures and entanglement entropies. The Sagnac configuration provides bidirectional crystal pum** yields to high-rate entanglement source which is very applicable in quantum communication, sensing and metrology as well as quantum information protocols, and has potential to be used in quantum illumination-based LIDAR and free-space quantum key distribution (QKD).
△ Less
Submitted 20 June, 2021; v1 submitted 4 December, 2020;
originally announced December 2020.
-
Measurement of entropy and quantum coherence properties of two type-I entangled photonic qubits
Authors:
Ali Motazedifard,
Seyed Ahmad Madani,
N. S. Vayaghan
Abstract:
Using the type-I SPDC process in BBO nonlinear crystal (NLC), we generate a polarization-entangled state near to the maximally-entangled Bell-state with high-visibility (high-brightness) $ 98.50 \pm 1.33 ~ \% $ ($ 87.71 \pm 4.45 ~ \% $) for HV (DA) basis. We calculate the CHSH version of the Bell inequality, as a nonlocal realism test, and find a strong violation from the classical physics or any…
▽ More
Using the type-I SPDC process in BBO nonlinear crystal (NLC), we generate a polarization-entangled state near to the maximally-entangled Bell-state with high-visibility (high-brightness) $ 98.50 \pm 1.33 ~ \% $ ($ 87.71 \pm 4.45 ~ \% $) for HV (DA) basis. We calculate the CHSH version of the Bell inequality, as a nonlocal realism test, and find a strong violation from the classical physics or any hidden variable theory (HVT), $ S= 2.71 \pm 0.10 $. Via measuring the coincidence count (CC) rate in the SPDC process, we obtain the quantum efficiency of single-photon detectors (SPDs) around $ (25.5\pm 3.4) \% $, which is in good agreement to their manufacturer company. As expected, we verify the linear dependency of the CC rate vs. pump power of input CW-laser, which may yield to find the effective second-order susceptibility crystal. Using the theory of the measurement of qubits, includes a tomographic reconstruction of quantum states due to the linear set of 16 polarization-measurement, together with a maximum-likelihood-technique (MLT), which is based on the numerical optimization, we calculate the physical non-negative definite density matrices, which implies on the non-separability and entanglement of prepared state. By having the maximum likelihood density operator, we calculate precisely the entanglement measures such as Concurrence, entanglement of formation, tangle, logarithmic negativity, and different entanglement entropies such as linear entropy, Von-Neumann entropy, and Renyi 2-entropy. Finally, this high-brightness and low-rate entangled photons source can be used for short-range quantum measurements in the Lab.
△ Less
Submitted 20 June, 2021; v1 submitted 4 December, 2020;
originally announced December 2020.
-
Profile Prediction: An Alignment-Based Pre-Training Task for Protein Sequence Models
Authors:
Pascal Sturmfels,
Jesse Vig,
Ali Madani,
Nazneen Fatema Rajani
Abstract:
For protein sequence datasets, unlabeled data has greatly outpaced labeled data due to the high cost of wet-lab characterization. Recent deep-learning approaches to protein prediction have shown that pre-training on unlabeled data can yield useful representations for downstream tasks. However, the optimal pre-training strategy remains an open question. Instead of strictly borrowing from natural la…
▽ More
For protein sequence datasets, unlabeled data has greatly outpaced labeled data due to the high cost of wet-lab characterization. Recent deep-learning approaches to protein prediction have shown that pre-training on unlabeled data can yield useful representations for downstream tasks. However, the optimal pre-training strategy remains an open question. Instead of strictly borrowing from natural language processing (NLP) in the form of masked or autoregressive language modeling, we introduce a new pre-training task: directly predicting protein profiles derived from multiple sequence alignments. Using a set of five, standardized downstream tasks for protein models, we demonstrate that our pre-training task along with a multi-task objective outperforms masked language modeling alone on all five tasks. Our results suggest that protein sequence models may benefit from leveraging biologically-inspired inductive biases that go beyond existing language modeling techniques in NLP.
△ Less
Submitted 30 November, 2020;
originally announced December 2020.
-
Surgical Data Science -- from Concepts toward Clinical Translation
Authors:
Lena Maier-Hein,
Matthias Eisenmann,
Duygu Sarikaya,
Keno März,
Toby Collins,
Anand Malpani,
Johannes Fallert,
Hubertus Feussner,
Stamatia Giannarou,
Pietro Mascagni,
Hirenkumar Nakawala,
Adrian Park,
Carla Pugh,
Danail Stoyanov,
Swaroop S. Vedula,
Kevin Cleary,
Gabor Fichtinger,
Germain Forestier,
Bernard Gibaud,
Teodor Grantcharov,
Makoto Hashizume,
Doreen Heckmann-Nötzel,
Hannes G. Kenngott,
Ron Kikinis,
Lars Mündermann
, et al. (25 additional authors not shown)
Abstract:
Recent developments in data science in general and machine learning in particular have transformed the way experts envision the future of surgery. Surgical Data Science (SDS) is a new research field that aims to improve the quality of interventional healthcare through the capture, organization, analysis and modeling of data. While an increasing number of data-driven approaches and clinical applica…
▽ More
Recent developments in data science in general and machine learning in particular have transformed the way experts envision the future of surgery. Surgical Data Science (SDS) is a new research field that aims to improve the quality of interventional healthcare through the capture, organization, analysis and modeling of data. While an increasing number of data-driven approaches and clinical applications have been studied in the fields of radiological and clinical data science, translational success stories are still lacking in surgery. In this publication, we shed light on the underlying reasons and provide a roadmap for future advances in the field. Based on an international workshop involving leading researchers in the field of SDS, we review current practice, key achievements and initiatives as well as available standards and tools for a number of topics relevant to the field, namely (1) infrastructure for data acquisition, storage and access in the presence of regulatory constraints, (2) data annotation and sharing and (3) data analytics. We further complement this technical perspective with (4) a review of currently available SDS products and the translational progress from academia and (5) a roadmap for faster clinical translation and exploitation of the full potential of SDS, based on an international multi-round Delphi process.
△ Less
Submitted 30 July, 2021; v1 submitted 30 October, 2020;
originally announced November 2020.
-
BERTology Meets Biology: Interpreting Attention in Protein Language Models
Authors:
Jesse Vig,
Ali Madani,
Lav R. Varshney,
Caiming Xiong,
Richard Socher,
Nazneen Fatema Rajani
Abstract:
Transformer architectures have proven to learn useful representations for protein classification and generation tasks. However, these representations present challenges in interpretability. In this work, we demonstrate a set of methods for analyzing protein Transformer models through the lens of attention. We show that attention: (1) captures the folding structure of proteins, connecting amino aci…
▽ More
Transformer architectures have proven to learn useful representations for protein classification and generation tasks. However, these representations present challenges in interpretability. In this work, we demonstrate a set of methods for analyzing protein Transformer models through the lens of attention. We show that attention: (1) captures the folding structure of proteins, connecting amino acids that are far apart in the underlying sequence, but spatially close in the three-dimensional structure, (2) targets binding sites, a key functional component of proteins, and (3) focuses on progressively more complex biophysical properties with increasing layer depth. We find this behavior to be consistent across three Transformer architectures (BERT, ALBERT, XLNet) and two distinct protein datasets. We also present a three-dimensional visualization of the interaction between attention and protein structure. Code for visualization and analysis is available at https://github.com/salesforce/provis.
△ Less
Submitted 28 March, 2021; v1 submitted 26 June, 2020;
originally announced June 2020.
-
ProGen: Language Modeling for Protein Generation
Authors:
Ali Madani,
Bryan McCann,
Nikhil Naik,
Nitish Shirish Keskar,
Namrata Anand,
Raphael R. Eguchi,
Po-Ssu Huang,
Richard Socher
Abstract:
Generative modeling for protein engineering is key to solving fundamental problems in synthetic biology, medicine, and material science. We pose protein engineering as an unsupervised sequence generation problem in order to leverage the exponentially growing set of proteins that lack costly, structural annotations. We train a 1.2B-parameter language model, ProGen, on ~280M protein sequences condit…
▽ More
Generative modeling for protein engineering is key to solving fundamental problems in synthetic biology, medicine, and material science. We pose protein engineering as an unsupervised sequence generation problem in order to leverage the exponentially growing set of proteins that lack costly, structural annotations. We train a 1.2B-parameter language model, ProGen, on ~280M protein sequences conditioned on taxonomic and keyword tags such as molecular function and cellular component. This provides ProGen with an unprecedented range of evolutionary sequence diversity and allows it to generate with fine-grained control as demonstrated by metrics based on primary sequence similarity, secondary structure accuracy, and conformational energy.
△ Less
Submitted 7 March, 2020;
originally announced April 2020.
-
The Generalisation of the DMCA Coefficient to Serve Distinguishing Between Hedge and Safe Haven Capabilities of the Gold
Authors:
Mohamed Arbi Madani,
Zied Ftiti
Abstract:
This paper aims to investigate the role of gold as a hedge and/or safe haven against oil price and currency market movements for medium (calm period) and large (extreme movement) fluctuations. In revisiting the role of gold, our study proposes new insights into the literature. First, our empirical design relaxes the assumption of homogeneous investors in favour of agents with different horizons. S…
▽ More
This paper aims to investigate the role of gold as a hedge and/or safe haven against oil price and currency market movements for medium (calm period) and large (extreme movement) fluctuations. In revisiting the role of gold, our study proposes new insights into the literature. First, our empirical design relaxes the assumption of homogeneous investors in favour of agents with different horizons. Second, we develop a new measure of correlation based on the fractal approach, called the q-detrending moving average cross-correlation coefficient. This allows us to measure the dependence for calm and extreme movements. The proposed measure is both time-varying and time-scale varying, taking into account the complex pattern of commodities and financial time series (chaotic, non-stationary, etc.). Using intraday data from May 2017 to March 2019, including 35608 observations for each variable, our results are as follows. First, we show a negative and significant average and tail dependence for all time scales between gold and USD exchange rates that is consistent with the gold's role as an effective hedge and safe-haven asset. Second, this study puts out average independence and positive and significant tail independence between gold and oil indicating that gold can be used by investors as a weak hedge but cannot be used as an effective safe-haven asset under exceptional market circumstances for all time scales. Third, we examine the hedging and stabilising benefits of gold over calm and turmoil periods for gold-oil futures and gold-currency portfolios by estimation of the optimal portfolio weights and the optimal hedge ratio. We confirm the usefulness of gold for hedging and safe havens at different investment horizons, which favors the inclusion of gold futures in oil futures and currency portfolios for risk management purposes.
△ Less
Submitted 29 December, 2019;
originally announced December 2019.
-
ProDyn0: Inferring calponin homology domain stretching behavior using graph neural networks
Authors:
Ali Madani,
Cyna Shirazinejad,
Jia Rui Ong,
Hengameh Shams,
Mohammad Mofrad
Abstract:
Graph neural networks are a quickly emerging field for non-Euclidean data that leverage the inherent graphical structure to predict node, edge, and global-level properties of a system. Protein properties can not easily be understood as a simple sum of their parts (i.e. amino acids), therefore, understanding their dynamical properties in the context of graphs is attractive for revealing how perturb…
▽ More
Graph neural networks are a quickly emerging field for non-Euclidean data that leverage the inherent graphical structure to predict node, edge, and global-level properties of a system. Protein properties can not easily be understood as a simple sum of their parts (i.e. amino acids), therefore, understanding their dynamical properties in the context of graphs is attractive for revealing how perturbations to their structure can affect their global function. To tackle this problem, we generate a database of 2020 mutated calponin homology (CH) domains undergoing large-scale separation in molecular dynamics. To predict the mechanosensitive force response, we develop neural message passing networks and residual gated graph convnets which predict the protein dependent force separation at 86.63 percent, 81.59 kJ/mol/nm MAE, 76.99 psec MAE for force mode classification, max force magnitude, max force time respectively-- significantly better than non-graph-based deep learning techniques. Towards uniting geometric learning techniques and biophysical observables, we premiere our simulation database as a benchmark dataset for further development/evaluation of graph neural network architectures.
△ Less
Submitted 21 October, 2019;
originally announced October 2019.
-
Bimodal network architectures for automatic generation of image annotation from text
Authors:
Mehdi Moradi,
Ali Madani,
Yaniv Gur,
Yufan Guo,
Tanveer Syeda-Mahmood
Abstract:
Medical image analysis practitioners have embraced big data methodologies. This has created a need for large annotated datasets. The source of big data is typically large image collections and clinical reports recorded for these images. In many cases, however, building algorithms aimed at segmentation and detection of disease requires a training dataset with markings of the areas of interest on th…
▽ More
Medical image analysis practitioners have embraced big data methodologies. This has created a need for large annotated datasets. The source of big data is typically large image collections and clinical reports recorded for these images. In many cases, however, building algorithms aimed at segmentation and detection of disease requires a training dataset with markings of the areas of interest on the image that match with the described anomalies. This process of annotation is expensive and needs the involvement of clinicians. In this work we propose two separate deep neural network architectures for automatic marking of a region of interest (ROI) on the image best representing a finding location, given a textual report or a set of keywords. One architecture consists of LSTM and CNN components and is trained end to end with images, matching text, and markings of ROIs for those images. The output layer estimates the coordinates of the vertices of a polygonal region. The second architecture uses a network pre-trained on a large dataset of the same image types for learning feature representations of the findings of interest. We show that for a variety of findings from chest X-ray images, both proposed architectures learn to estimate the ROI, as validated by clinical annotations. There is a clear advantage obtained from the architecture with pre-trained imaging network. The centroids of the ROIs marked by this network were on average at a distance equivalent to 5.1% of the image width from the centroids of the ground truth ROIs.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
Segmentation of Multiple Sclerosis lesion in brain MR images using Fuzzy C-Means
Authors:
Saba Heidari Gheshlaghi,
Abolfazl Madani,
AmirAbolfazl Suratgar,
Fardin Faraji
Abstract:
Magnetic resonance images (MRI) play an important role in supporting and substituting clinical information in the diagnosis of multiple sclerosis (MS) disease by presenting lesion in brain MR images. In this paper, an algorithm for MS lesion segmentation from Brain MR Images has been presented. We revisit the modification of properties of fuzzy -c means algorithms and the canny edge detection. By…
▽ More
Magnetic resonance images (MRI) play an important role in supporting and substituting clinical information in the diagnosis of multiple sclerosis (MS) disease by presenting lesion in brain MR images. In this paper, an algorithm for MS lesion segmentation from Brain MR Images has been presented. We revisit the modification of properties of fuzzy -c means algorithms and the canny edge detection. By changing and reformed fuzzy c-means clustering algorithms, and applying canny contraction principle, a relationship between MS lesions and edge detection is established. For the special case of FCM, we derive a sufficient condition and clustering parameters, allowing identification of them as (local) minima of the objective function.
△ Less
Submitted 10 April, 2018;
originally announced April 2018.
-
Fast and accurate classification of echocardiograms using deep learning
Authors:
Ali Madani,
Ramy Arnaout,
Mohammad Mofrad,
Rima Arnaout
Abstract:
Echocardiography is essential to modern cardiology. However, human interpretation limits high throughput analysis, limiting echocardiography from reaching its full clinical and research potential for precision medicine. Deep learning is a cutting-edge machine-learning technique that has been useful in analyzing medical images but has not yet been widely applied to echocardiography, partly due to t…
▽ More
Echocardiography is essential to modern cardiology. However, human interpretation limits high throughput analysis, limiting echocardiography from reaching its full clinical and research potential for precision medicine. Deep learning is a cutting-edge machine-learning technique that has been useful in analyzing medical images but has not yet been widely applied to echocardiography, partly due to the complexity of echocardiograms' multi view, multi modality format. The essential first step toward comprehensive computer assisted echocardiographic interpretation is determining whether computers can learn to recognize standard views. To this end, we anonymized 834,267 transthoracic echocardiogram (TTE) images from 267 patients (20 to 96 years, 51 percent female, 26 percent obese) seen between 2000 and 2017 and labeled them according to standard views. Images covered a range of real world clinical variation. We built a multilayer convolutional neural network and used supervised learning to simultaneously classify 15 standard views. Eighty percent of data used was randomly chosen for training and 20 percent reserved for validation and testing on never seen echocardiograms. Using multiple images from each clip, the model classified among 12 video views with 97.8 percent overall test accuracy without overfitting. Even on single low resolution images, test accuracy among 15 views was 91.7 percent versus 70.2 to 83.5 percent for board-certified echocardiographers. Confusional matrices, occlusion experiments, and saliency map** showed that the model finds recognizable similarities among related views and classifies using clinically relevant image features. In conclusion, deep neural networks can classify essential echocardiographic views simultaneously and with high accuracy. Our results provide a foundation for more complex deep learning assisted echocardiographic interpretation.
△ Less
Submitted 26 June, 2017;
originally announced June 2017.
-
A Qualitative Comparison of MPSoC Mobile and Embedded Virtualization Techniques
Authors:
Junaid Shuja,
Abdullah Gani,
Sajjad A. Madani
Abstract:
Virtualization is generally adopted in server and desktop environments to provide for fault tolerance, resource management, and energy efficiency. Virtualization enables parallel execution of multiple operating systems (OSs) while sharing the hardware resources. Virtualization was previously not deemed as feasible technology for mobile and embedded devices due to their limited processing and memor…
▽ More
Virtualization is generally adopted in server and desktop environments to provide for fault tolerance, resource management, and energy efficiency. Virtualization enables parallel execution of multiple operating systems (OSs) while sharing the hardware resources. Virtualization was previously not deemed as feasible technology for mobile and embedded devices due to their limited processing and memory resource. However, the enterprises are advocating Bring Your Own Device (BYOD) applications that enable co-existence of heterogeneous OSs on a single mobile device. Moreover, embedded device require virtualization for logical isolation of secure and general purpose OSs on a single device. In this paper, we investigate the processor architectures in the mobile and embedded space while examining their formal visualizability. We also compare the virtualization solutions enabling coexistence of multiple OSs in Multicore Processor System-on-Chip (MPSoC) mobile and embedded systems. We advocate that virtualization is necessary to manage resource in MPSoC designs and to enable BYOD, security, and logical isolation use cases.
△ Less
Submitted 4 May, 2016;
originally announced May 2016.
-
Vertical optical ring resonators fully integrated with nanophotonic waveguides on silicon-on-insulator substrates
Authors:
Abbas Madani,
Moritz Kleinert,
David Stolarek,
Lars Zimmermann,
Libo Ma,
Oliver G. Schmidt
Abstract:
We demonstrate full integration of vertical optical ring resonators with silicon nanophotonic waveguides on silicon-on-insulator substrates to accomplish a significant step towards 3D photonic integration. The on-chip integration is realized by rolling up 2D differentially strained TiO2 nanomembranes into 3D microtube cavities on a nanophotonic microchip. The integration configuration allows for o…
▽ More
We demonstrate full integration of vertical optical ring resonators with silicon nanophotonic waveguides on silicon-on-insulator substrates to accomplish a significant step towards 3D photonic integration. The on-chip integration is realized by rolling up 2D differentially strained TiO2 nanomembranes into 3D microtube cavities on a nanophotonic microchip. The integration configuration allows for out of plane optical coupling between the in-plane nanowaveguides and the vertical microtube cavities as a compact and mechanically stable optical unit, which could enable refined vertical light transfer in 3D stacks of multiple photonic layers. In this vertical transmission scheme, resonant filtering of optical signals at telecommunication wavelengths is demonstrated based on subwavelength thick walled microcavities. Moreover, an array of microtube cavities is prepared and each microtube cavity is integrated with multiple waveguides which opens up interesting perspectives towards parallel and multi-routing through a single cavity device as well as high-throughput optofluidic sensing schemes.
△ Less
Submitted 29 June, 2015;
originally announced June 2015.
-
Routing protocols for mobile sensor networks: a comparative study
Authors:
Shahzad Ali,
Sajjad A. Madani,
Atta ur Rehman Khan,
Imran Ali Khan
Abstract:
This paper presents a comparison of cluster-based position and non position-based routing protocols for mobile wireless sensor networks to outline design considerations of protocols for mobile environments. The selected protocols are compared on the basis of multiple parameters, which include packet delivery ratio, packet loss, network lifetime, and control overhead using variable number of nodes…
▽ More
This paper presents a comparison of cluster-based position and non position-based routing protocols for mobile wireless sensor networks to outline design considerations of protocols for mobile environments. The selected protocols are compared on the basis of multiple parameters, which include packet delivery ratio, packet loss, network lifetime, and control overhead using variable number of nodes and speeds. The extensive simulation and analysis of results show that position-based routing protocols incur less packet loss as compared to the non position based protocols. However, position-based protocols require localization mechanism or a GPS for the location information, which consumes energy and affects the network lifetime. Alternatively, non position-based protocols are more energy efficient and provide extended network lifetime.
△ Less
Submitted 13 March, 2014;
originally announced March 2014.
-
Bottom-up Graphene Nanoribbon Field-Effect Transistors
Authors:
Patrick B. Bennett,
Zahra Pedramrazi,
Ali Madani,
Yen-Chia Chen,
Dimas G. de Oteyza,
Chen Chen,
Felix R. Fischer,
Michael F. Crommie,
Jeffrey Bokor
Abstract:
Recently developed processes have enabled bottom-up chemical synthesis of graphene nanoribbons (GNRs) with precise atomic structure. These GNRs are ideal candidates for electronic devices because of their uniformity, extremely narrow width below 1 nm, atomically perfect edge structure, and desirable electronic properties. Here, we demonstrate nanoscale chemically synthesized GNR field-effect trans…
▽ More
Recently developed processes have enabled bottom-up chemical synthesis of graphene nanoribbons (GNRs) with precise atomic structure. These GNRs are ideal candidates for electronic devices because of their uniformity, extremely narrow width below 1 nm, atomically perfect edge structure, and desirable electronic properties. Here, we demonstrate nanoscale chemically synthesized GNR field-effect transistors, made possible by development of a new layer transfer process. We observe strong environmental sensitivity and unique transport behavior characteristic of sub-1nm width GNRs.
△ Less
Submitted 1 October, 2013;
originally announced October 2013.