-
Persian Pronoun Resolution: Leveraging Neural Networks and Language Models
Authors:
Hassan Haji Mohammadi,
Alireza Talebpour,
Ahmad Mahmoudi Aznaveh,
Samaneh Yazdani
Abstract:
Coreference resolution, critical for identifying textual entities referencing the same entity, faces challenges in pronoun resolution, particularly identifying pronoun antecedents. Existing methods often treat pronoun resolution as a separate task from mention detection, potentially missing valuable information. This study proposes the first end-to-end neural network system for Persian pronoun resolution, leveraging pre-trained Transformer models like ParsBERT. Our system jointly optimizes both mention detection and antecedent linking, achieving a 3.37 F1 score improvement over the previous state-of-the-art system (which relied on rule-based and statistical methods) on the Mehr corpus. This significant improvement demonstrates the effectiveness of combining neural networks with linguistic models, potentially marking a significant advancement in Persian pronoun resolution and paving the way for further research in this under-explored area.
Submitted 17 May, 2024;
originally announced May 2024.
-
A hybrid entity-centric approach to Persian pronoun resolution
Authors:
Hassan Haji Mohammadi,
Alireza Talebpour,
Ahmad Mahmoudi Aznaveh,
Samaneh Yazdani
Abstract:
Pronoun resolution is a challenging subset of an essential field in natural language processing called coreference resolution. Coreference resolution is about finding all entities in the text that refer to the same real-world entity. This paper presents a hybrid model combining multiple rule-based sieves with a machine-learning sieve for pronouns. For this purpose, seven high-precision rule-based sieves are designed for the Persian language. Then, a random forest classifier links pronouns to the previous partial clusters. The presented method demonstrates exemplary performance using pipeline design and combining the advantages of machine learning and rule-based methods. This method has solved some challenges in end-to-end models. In this paper, the authors develop a Persian coreference corpus called Mehr in the form of 400 documents. This corpus fixes some weaknesses of the previous corpora in the Persian language. Finally, the efficiency of the presented system compared to the earlier model in Persian is reported by evaluating the proposed method on the Mehr and Uppsala test sets.
Submitted 11 November, 2022;
originally announced November 2022.
-
Review of coreference resolution in English and Persian
Authors:
Hassan Haji Mohammadi,
Alireza Talebpour,
Ahmad Mahmoudi Aznaveh,
Samaneh Yazdani
Abstract:
Coreference resolution (CR) is one of the most challenging areas of natural language processing. This task seeks to identify all textual references to the same real-world entity. Research in this field is divided into coreference resolution and anaphora resolution. Due to its application in textual comprehension and its utility in other tasks such as information extraction systems, document summarization, and machine translation, this field has attracted considerable interest. Consequently, it has a significant effect on the quality of these systems. This article reviews the existing corpora and evaluation metrics in this field. Then, an overview of the coreference algorithms, from rule-based methods to the latest deep learning techniques, is provided. Finally, coreference resolution and pronoun resolution systems in Persian are investigated.
Submitted 8 November, 2022;
originally announced November 2022.
-
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds
Authors:
Weiwei Sun,
Daniel Rebain,
Renjie Liao,
Vladimir Tankovich,
Soroosh Yazdani,
Kwang Moo Yi,
Andrea Tagliasacchi
Abstract:
We introduce a method for instance proposal generation for 3D point clouds. Existing techniques typically directly regress proposals in a single feed-forward step, leading to inaccurate estimation. We show that this serves as a critical bottleneck, and propose a method based on iterative bilateral filtering with learned kernels. Following the spirit of bilateral filtering, we consider both the deep feature embeddings of each point, as well as their locations in the 3D space. We show via synthetic experiments that our method brings drastic improvements when generating instance proposals for a given point of interest. We further validate our method on the challenging ScanNet benchmark, achieving the best instance segmentation performance amongst the sub-category of top-down methods.
Submitted 20 July, 2022;
originally announced July 2022.
-
Forecasting of COVID-19 Cases, Using an Evolutionary Neural Architecture Search Approach
Authors:
Mahdi Rahbar,
Samaneh Yazdani
Abstract:
In late 2019, COVID-19, a severe respiratory disease, emerged, and since then, the world has been facing a deadly pandemic caused by it. This ongoing pandemic has had a significant effect on different aspects of societies. The uncertainty around the number of daily cases made it difficult for decision-makers to control the outbreak. Deep Learning models have proved that they can come in handy in many real-world problems such as healthcare ones. However, they require a lot of data to learn the features properly and output an acceptable solution. Since COVID-19 has been a lately emerged disease, there was not much data available, especially in the first stage of the pandemic, and this shortage of data makes it challenging to design an optimized model. To overcome these problems, we first introduce a new dataset with augmented features and then forecast COVID-19 cases with a new approach, using an evolutionary neural architecture search with Binary Bat Algorithm (BBA) to generate an optimized deep recurrent network. Finally, to show our approach's effectiveness, we conducted a comparative study on Iran's COVID-19 daily cases. The results prove our approach's capability to generate an accurate deep architecture to forecast the pandemic cases, even in the early stages with limited data.
Submitted 15 September, 2021;
originally announced September 2021.
-
Heterointerface control over lithium-induced phase transitions in MoS2 heterostructures
Authors:
Joshua V. Pondick,
Aakash Kumar,
Mengjing Wang,
Sajad Yazdani,
John M. Woods,
Diana Y. Qiu,
Judy J. Cha
Abstract:
Phase transitions of two-dimensional materials and their heterostructures enable many applications including electrochemical energy storage, catalysis, and memory; however, the nucleation pathways by which these transitions proceed remain underexplored, prohibiting engineering control for these applications. Here, we demonstrate that the lithium intercalation-induced 2H-1T' phase transition in MoS2 proceeds via nucleation of the 1T' phase at a heterointerface by monitoring the phase transition of MoS2/graphene and MoS2/hexagonal boron nitride (hBN) heterostructures with Raman spectroscopy in situ during intercalation. We observe that graphene-MoS2 heterointerfaces require an increase of 0.8 V in applied electrochemical potential to nucleate the 1T' phase in MoS2 compared to hBN-MoS2 heterointerfaces. The increased nucleation barrier at graphene-MoS2 heterointerfaces is due to the reduced charge transfer from lithium to MoS2 at the heterointerface as lithium also dopes graphene based on ab initio calculations. Further, we show that the growth of the 1T' domain propagates along the heterointerface, rather than through the interior of MoS2. Our results provide the first experimental observations of the heterogeneous nucleation and growth of intercalation-induced phase transitions in two-dimensional materials and heterointerface effects on their phase transition.
Submitted 5 July, 2021;
originally announced July 2021.
-
Deep Medial Fields
Authors:
Daniel Rebain,
Ke Li,
Vincent Sitzmann,
Soroosh Yazdani,
Kwang Moo Yi,
Andrea Tagliasacchi
Abstract:
Implicit representations of geometry, such as occupancy fields or signed distance fields (SDF), have recently re-gained popularity in encoding 3D solid shape in a functional form. In this work, we introduce medial fields: a field function derived from the medial axis transform (MAT) that makes available information about the underlying 3D geometry that is immediately useful for a number of downstream tasks. In particular, the medial field encodes the local thickness of a 3D shape, and enables O(1) projection of a query point onto the medial axis. To construct the medial field we require nothing but the SDF of the shape itself, thus allowing its straightforward incorporation in any application that relies on signed distance fields. Working in unison with the O(1) surface projection supported by the SDF, the medial field opens the door for an entirely new set of efficient, shape-aware operations on implicit representations. We present three such applications, including a modification to sphere tracing that renders implicit representations with better convergence properties, a fast construction method for memory-efficient rigid-body collision proxies, and an efficient approximation of ambient occlusion that remains stable with respect to viewpoint variations.
Submitted 7 June, 2021;
originally announced June 2021.
-
Canonical Capsules: Self-Supervised Capsules in Canonical Pose
Authors:
Weiwei Sun,
Andrea Tagliasacchi,
Boyang Deng,
Sara Sabour,
Soroosh Yazdani,
Geoffrey Hinton,
Kwang Moo Yi
Abstract:
We propose a self-supervised capsule architecture for 3D point clouds. We compute capsule decompositions of objects through permutation-equivariant attention, and self-supervise the process by training with pairs of randomly rotated objects. Our key idea is to aggregate the attention masks into semantic keypoints, and use these to supervise a decomposition that satisfies the capsule invariance/equivariance properties. This not only enables the training of a semantically consistent decomposition, but also allows us to learn a canonicalization operation that enables object-centric reasoning. To train our neural network we require neither classification labels nor manually-aligned training datasets. Yet, by learning an object-centric representation in a self-supervised manner, our method outperforms the state-of-the-art on 3D point cloud reconstruction, canonicalization, and unsupervised classification.
Submitted 24 November, 2021; v1 submitted 8 December, 2020;
originally announced December 2020.
-
Unsupervised part representation by Flow Capsules
Authors:
Sara Sabour,
Andrea Tagliasacchi,
Soroosh Yazdani,
Geoffrey E. Hinton,
David J. Fleet
Abstract:
Capsule networks aim to parse images into a hierarchy of objects, parts and relations. While promising, they remain limited by an inability to learn effective low level part descriptions. To address this issue we propose a way to learn primary capsule encoders that detect atomic parts from a single image. During training we exploit motion as a powerful perceptual cue for part definition, with an expressive decoder for part generation within a layered image model with occlusion. Experiments demonstrate robust part discovery in the presence of multiple objects, cluttered backgrounds, and occlusion. The part decoder infers the underlying shape masks, effectively filling in occluded regions of the detected shapes. We evaluate FlowCapsules on unsupervised part segmentation and unsupervised image classification.
Submitted 19 February, 2021; v1 submitted 27 November, 2020;
originally announced November 2020.
-
Heterointerface effects on lithium-induced phase transitions in intercalated MoS2
Authors:
Sajad Yazdani,
Joshua V. Pondick,
Aakash Kumar,
Milad Yarali,
John M. Woods,
David J. Hynek,
Diana Y. Qiu,
Judy J. Cha
Abstract:
The intercalation-induced phase transition of MoS2 from the semiconducting 2H to the semimetallic 1T' phase has been studied in detail for nearly a decade; however, the effects of a heterointerface between MoS2 and other two-dimensional (2D) crystals on the phase transition have largely been overlooked. Here, ab initio calculations show that intercalating Li at a MoS2-hexagonal boron nitride (hBN) interface stabilizes the 1T phase over the 2H phase of MoS2 by ~ 100 mJ m-2, suggesting that encapsulating MoS2 with hBN may lower the electrochemical energy needed for the intercalation-induced phase transition. However, in situ Raman spectroscopy of hBN-MoS2-hBN heterostructures during electrochemical intercalation of Li+ shows that the phase transition occurs at the same applied voltage for the heterostructure as for bare MoS2. We hypothesize that the predicted thermodynamic stabilization of the 1T'-MoS2-hBN interface is counteracted by an energy barrier to the phase transition imposed by the steric hindrance of the heterointerface. The phase transition occurs at lower applied voltages upon heating the heterostructure, which supports our hypothesis. Our study highlights that interfacial effects of 2D heterostructures can go beyond modulating electrical properties and can modify electrochemical and phase transition behaviors.
Submitted 27 November, 2020;
originally announced November 2020.
-
DeRF: Decomposed Radiance Fields
Authors:
Daniel Rebain,
Wei Jiang,
Soroosh Yazdani,
Ke Li,
Kwang Moo Yi,
Andrea Tagliasacchi
Abstract:
With the advent of Neural Radiance Fields (NeRF), neural networks can now render novel views of a 3D scene with quality that fools the human eye. Yet, generating these images is very computationally intensive, limiting their applicability in practical scenarios. In this paper, we propose a technique based on spatial decomposition capable of mitigating this issue. Our key observation is that there are diminishing returns in employing larger (deeper and/or wider) networks. Hence, we propose to spatially decompose a scene and dedicate smaller networks for each decomposed part. When working together, these networks can render the whole scene. This allows us near-constant inference time regardless of the number of decomposed parts. Moreover, we show that a Voronoi spatial decomposition is preferable for this purpose, as it is provably compatible with the Painter's Algorithm for efficient and GPU-friendly rendering. Our experiments show that for real-world scenes, our method provides up to 3x more efficient inference than NeRF (with the same rendering quality), or an improvement of up to 1.0 dB in PSNR (for the same inference cost).
Submitted 24 November, 2020;
originally announced November 2020.
-
Voronoi Convolutional Neural Networks
Authors:
Soroosh Yazdani,
Andrea Tagliasacchi
Abstract:
In this technical report, we investigate extending convolutional neural networks to the setting where functions are not sampled in a grid pattern. We show that by treating the samples as the average of a function within a cell, we can find a natural equivalent of most layers used in CNN. We also present an algorithm for running inference for these models exactly using standard convex geometry algorithms.
Submitted 21 October, 2020;
originally announced October 2020.
-
COVID CT-Net: Predicting Covid-19 From Chest CT Images Using Attentional Convolutional Network
Authors:
Shakib Yazdani,
Shervin Minaee,
Rahele Kafieh,
Narges Saeedizadeh,
Milan Sonka
Abstract:
The novel corona-virus disease (COVID-19) pandemic has caused a major outbreak in more than 200 countries around the world, leading to a severe impact on the health and life of many people globally. As of Aug 25th of 2020, more than 20 million people are infected, and more than 800,000 deaths are reported. Computed Tomography (CT) images can be used as an alternative to the time-consuming "reverse transcription polymerase chain reaction (RT-PCR)" test, to detect COVID-19. In this work we developed a deep learning framework to predict COVID-19 from CT images. We propose to use an attentional convolution network, which can focus on the infected areas of the chest, enabling it to perform a more accurate prediction. We trained our model on a dataset of more than 2000 CT images, and report its performance in terms of various popular metrics, such as sensitivity, specificity, area under the curve, and also precision-recall curve, and achieve very promising results. We also provide a visualization of the attention maps of the model for several test images, and show that our model is attending to the infected regions as intended. In addition to developing a machine learning modeling framework, we also provide the manual annotation of the potentially infected regions of the chest, with the help of a board-certified radiologist, and make that publicly available for other researchers.
Submitted 10 September, 2020;
originally announced September 2020.
-
The Effect of Mechanical Strain on Lithium Staging in Graphene
Authors:
Joshua V. Pondick,
Sajad Yazdani,
Milad Yarali,
Serrae N. Reed,
David J. Hynek,
Judy J. Cha
Abstract:
Lithium intercalation into graphite is the foundation for the lithium-ion battery, and the thermodynamics of the lithiation of graphitic electrodes have been heavily investigated. Intercalated lithium in bulk graphite undergoes structural ordering known as staging to minimize electrostatic repulsions within the crystal lattice. While this process is well-understood for bulk graphite, confinement effects become important at the nanoscale, which can significantly impact the electrochemistry of nanostructured electrodes. Therefore, graphene offers a model platform to study intercalation dynamics at the nanoscale by combining on-chip device fabrication and electrochemical intercalation with in situ characterization. We show that microscale mechanical strain significantly affects the formation of ordered lithium phases in graphene. In situ Raman spectroscopy of graphene microflakes mechanically constrained at the edge during lithium intercalation reveals a thickness-dependent increase of up to 1.26 V in the electrochemical potential that induces lithium staging. While the induced mechanical strain energy increases with graphene thickness to the fourth power, its magnitude is small compared to the observed increase in electrochemical energy. We hypothesize that the mechanical strain energy increases a nucleation barrier for lithium staging, greatly delaying the formation of ordered lithium phases. Our results indicate that electrode assembly can critically impact lithium staging dynamics important for cycling rates and power generation for batteries. We demonstrate strain engineering in two-dimensional nanomaterials as an approach to manipulate phase transitions and chemical reactivity.
Submitted 6 August, 2020;
originally announced August 2020.
-
COVID TV-UNet: Segmenting COVID-19 Chest CT Images Using Connectivity Imposed U-Net
Authors:
Narges Saeedizadeh,
Shervin Minaee,
Rahele Kafieh,
Shakib Yazdani,
Milan Sonka
Abstract:
The novel corona-virus disease (COVID-19) pandemic has caused a major outbreak in more than 200 countries around the world, leading to a severe impact on the health and life of many people globally. As of mid-July 2020, more than 12 million people were infected, and more than 570,000 deaths were reported. Computed Tomography (CT) images can be used as an alternative to the time-consuming RT-PCR test, to detect COVID-19. In this work we propose a segmentation framework to detect chest regions in CT images, which are infected by COVID-19. We use an architecture similar to the U-Net model, and train it to detect ground glass regions, on pixel level. As the infected regions tend to form a connected component (rather than randomly distributed pixels), we add a suitable regularization term to the loss function, to promote connectivity of the segmentation map for COVID-19 pixels. 2D-anisotropic total-variation is used for this purpose, and therefore the proposed model is called "TV-UNet". Through experimental results on a relatively large-scale CT segmentation dataset of around 900 images, we show that adding this new regularization term leads to 2% gain on overall segmentation performance compared to the U-Net model. Our experimental analysis, ranging from visual evaluation of the predicted segmentation results to quantitative assessment of segmentation performance (precision, recall, Dice score, and mIoU) demonstrated great ability to identify COVID-19 associated regions of the lungs, achieving a mIoU rate of over 99%, and a Dice score of around 86%.
Submitted 6 August, 2020; v1 submitted 23 July, 2020;
originally announced July 2020.
-
Near Unity Molecular Doping Efficiency in Monolayer MoS2
Authors:
Milad Yarali,
Yiren Zhong,
Serrae N. Reed,
Juefan Wang,
Kanchan A. Ulman,
David J. Charboneau,
Julia B. Curley,
David J. Hynek,
Joshua V. Pondick,
Sajad Yazdani,
Nilay Hazari,
Su Ying Quek,
Hailiang Wang,
Judy J. Cha
Abstract:
Surface functionalization with organic electron donors (OEDs) is an effective doping strategy for two-dimensional (2D) materials, which can achieve doping levels beyond those possible with conventional electric field gating. While the effectiveness of surface functionalization has been demonstrated in many 2D systems, the doping efficiencies of OEDs have largely been unmeasured, which is in stark contrast to their precision syntheses and tailored redox potentials. Here, using monolayer MoS2 as a model system and an organic reductant based on 4,4-bipyridine (DMAP-OED) as a strong organic dopant, we establish that the doping efficiency of DMAP-OED to MoS2 is in the range of 0.63 to 1.26 electrons per molecule. We also achieve the highest doping level to date in monolayer MoS2 by surface functionalization and demonstrate that DMAP-OED is a stronger dopant than benzyl viologen, which was the previous best OED dopant. The measured range of the doping efficiency is in good agreement with the values predicted from first-principles calculations. Our work provides a basis for the rational design of OEDs for high-level doping of 2D materials.
Submitted 7 July, 2020;
originally announced July 2020.
-
Deep-COVID: Predicting COVID-19 From Chest X-Ray Images Using Deep Transfer Learning
Authors:
Shervin Minaee,
Rahele Kafieh,
Milan Sonka,
Shakib Yazdani,
Ghazaleh Jamalipour Soufi
Abstract:
The COVID-19 pandemic is causing a major outbreak in more than 150 countries around the world, having a severe impact on the health and life of many people globally. One of the crucial steps in fighting COVID-19 is the ability to detect the infected patients early enough, and put them under special care. Detecting this disease from radiography and radiology images is perhaps one of the fastest ways to diagnose the patients. Some of the early studies showed specific abnormalities in the chest radiograms of patients infected with COVID-19. Inspired by earlier works, we study the application of deep learning models to detect COVID-19 patients from their chest radiography images. We first prepare a dataset of 5,000 Chest X-rays from the publicly available datasets. Images exhibiting COVID-19 disease presence were identified by a board-certified radiologist. Transfer learning on a subset of 2,000 radiograms was used to train four popular convolutional neural networks, including ResNet18, ResNet50, SqueezeNet, and DenseNet-121, to identify COVID-19 disease in the analyzed chest X-ray images. We evaluated these models on the remaining 3,000 images, and most of these networks achieved a sensitivity rate of 98% ($\pm$ 3%), while having a specificity rate of around 90%. Besides sensitivity and specificity rates, we also present the receiver operating characteristic (ROC) curve, precision-recall curve, average prediction, and confusion matrix of each model. We also used a technique to generate heatmaps of lung regions potentially infected by COVID-19 and show that the generated heatmaps contain most of the infected areas annotated by our board-certified radiologist. While the achieved performance is very encouraging, further analysis is required on a larger set of COVID-19 images, to have a more reliable estimation of accuracy rates.
The dataset, model implementations (in PyTorch), and evaluations, are all made publicly available for research community at https://github.com/shervinmin/DeepCovid.git
Submitted 21 July, 2020; v1 submitted 20 April, 2020;
originally announced April 2020.
-
CvxNet: Learnable Convex Decomposition
Authors:
Boyang Deng,
Kyle Genova,
Soroosh Yazdani,
Sofien Bouaziz,
Geoffrey Hinton,
Andrea Tagliasacchi
Abstract:
Any solid object can be decomposed into a collection of convex polytopes (in short, convexes). When a small number of convexes are used, such a decomposition can be thought of as a piece-wise approximation of the geometry. This decomposition is fundamental in computer graphics, where it provides one of the most common ways to approximate geometry, for example, in real-time physics simulation. A convex object also has the property of being simultaneously an explicit and implicit representation: one can interpret it explicitly as a mesh derived by computing the vertices of a convex hull, or implicitly as the collection of half-space constraints or support functions. Their implicit representation makes them particularly well suited for neural network training, as they abstract away from the topology of the geometry they need to represent. However, at testing time, convexes can also generate explicit representations -- polygonal meshes -- which can then be used in any downstream application. We introduce a network architecture to represent a low dimensional family of convexes. This family is automatically derived via an auto-encoding process. We investigate the applications of this architecture including automatic convex decomposition, image to 3D reconstruction, and part-based shape retrieval.
Submitted 12 April, 2020; v1 submitted 12 September, 2019;
originally announced September 2019.
-
The Signs in Elliptic Nets
Authors:
Amir Akbary,
Manoj Kumar,
Soroosh Yazdani
Abstract:
We give a generalization of a theorem of Silverman and Stephens regarding the signs in an elliptic divisibility sequence to the case of an elliptic net. We also describe applications of this theorem in the study of the distribution of the signs in elliptic nets and generating elliptic nets using the denominators of the linear combination of points on elliptic curves.
Submitted 26 February, 2017;
originally announced February 2017.
-
On the greatest prime factor of some divisibility sequences
Authors:
Amir Akbary,
Soroosh Yazdani
Abstract:
Let $P(m)$ denote the greatest prime factor of $m$. For integer $a>1$, M. Ram Murty and S. Wong proved that, under the assumption of the ABC conjecture, $$P(a^n-1)\gg_{ε, a} n^{2-ε}$$ for any $ε>0$. We study analogous results for the corresponding divisibility sequence over the function field $\mathbb{F}_q(t)$ and for some divisibility sequences associated to elliptic curves over the rational field $\mathbb{Q}$.
Submitted 24 May, 2015;
originally announced May 2015.
-
On Symmetries of Elliptic Nets and Valuations of Net Polynomials
Authors:
Amir Akbary,
Jeff Bleaney,
Soroosh Yazdani
Abstract:
Under certain conditions, we prove that the set of zeros of an elliptic net forms an Abelian group. We present two applications of this fact. Firstly we give a generalization of a theorem of Ayad on valuations of division polynomials in the context of net polynomials. Secondly we generalize a theorem of Ward on symmetry of elliptic divisibility sequences to the case of elliptic nets.
Submitted 27 August, 2014;
originally announced August 2014.
-
A Generalized Write Channel Model for Bit-Patterned Media Recording
Authors:
Sima Naseri,
Somaie Yazdani,
Behrooz Razeghi,
Ghosheh Abed Hodtani
Abstract:
In this paper, we propose a generalized write channel model for bit-patterned media recording that accounts for all sources of error causing extra disturbances during the write process, in addition to data-dependent write synchronization errors. We investigate information-theoretic bounds for this new model under various input distributions and also compare it numerically to the most recently proposed model.
Submitted 15 July, 2014;
originally announced July 2014.
-
Level lowering modulo prime powers and twisted Fermat equations
Authors:
Sander R. Dahmen,
Soroosh Yazdani
Abstract:
We discuss a clean level lowering theorem modulo prime powers for weight $2$ cusp forms. Furthermore, we illustrate how this can be used to completely solve certain twisted Fermat equations $ax^n+by^n+cz^n=0$.
Submitted 1 September, 2010;
originally announced September 2010.
-
Modular Abelian Varieties of Odd Modular Degree
Authors:
Soroosh Yazdani
Abstract:
In this paper, we will study modular Abelian varieties with odd congruence numbers by examining the cuspidal subgroup of $J_0(N)$. We will show that the conductor of such Abelian varieties must be of a special type. For example, if $N$ is the conductor of an absolutely simple modular Abelian variety with an odd congruence number, then $N$ has at most two prime divisors, and if $N$ is odd, then $N=p^α$ or $N=pq$ for some primes $p$ and $q$. In the second half of this paper, we will focus on modular elliptic curves with odd modular degree. Our results, combined with the work of Agashe, Ribet, and Stein, give a necessary condition for elliptic curves to have odd modular degree. In the process we prove Watkins's conjecture for elliptic curves with odd modular degree and a nontrivial rational torsion point.
Submitted 3 October, 2009;
originally announced October 2009.
-
Modular Abelian Variety of Odd Modular Degree
Authors:
S. Yazdani
Abstract:
We will study modular Abelian varieties with odd congruence numbers by studying the cuspidal subgroup of $J_0(N)$. We show that the conductor of such Abelian varieties must be of a special type; for example, if $N$ is odd, then $N=p^α$ or $N=pq$ for some primes $p$ and $q$. We then turn our attention to modular elliptic curves and, using a result of Agashe, Ribet, and Stein, try to classify all elliptic curves of odd modular degree. Our studies prove many cases of the conjecture of Stein and Watkins on elliptic curves with odd modular degree.
Submitted 3 July, 2007;
originally announced July 2007.