-
SIFT-DBT: Self-supervised Initialization and Fine-Tuning for Imbalanced Digital Breast Tomosynthesis Image Classification
Authors:
Yuexi Du,
Regina J. Hooley,
John Lewin,
Nicha C. Dvornek
Abstract:
Digital Breast Tomosynthesis (DBT) is a widely used medical imaging modality for breast cancer screening and diagnosis, offering higher spatial resolution and greater detail through its 3D-like breast volume imaging capability. However, the increased data volume also introduces pronounced data imbalance challenges, where only a small fraction of the volume contains suspicious tissue. This compounds the case-level imbalance already present in real-world data and can lead to a trivial classification model that predicts only the majority class. To address this, we propose a novel method using view-level contrastive Self-supervised Initialization and Fine-Tuning for identifying abnormal DBT images, namely SIFT-DBT. We further introduce a patch-level multi-instance learning method to preserve spatial resolution. The proposed method achieves 92.69% volume-wise AUC on an evaluation of 970 unique studies.
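As a rough illustration of how patch-level scores can yield a volume-wise AUC, the sketch below max-pools per-patch abnormality scores into one score per volume and then computes AUC by pairwise ranking. The max-pooling rule, the function names (`volume_score`, `auc`), and the toy scores are assumptions made for illustration; the abstract does not specify how SIFT-DBT aggregates patches.

```python
import numpy as np

def volume_score(patch_scores):
    """Aggregate patch-level scores into one volume-level score (assumed rule: max)."""
    return float(np.max(patch_scores))

def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    labels = np.asarray(labels)
    scores = np.asarray(scores, dtype=float)
    pos = scores[labels == 1]
    neg = scores[labels == 0]
    diff = pos[:, None] - neg[None, :]  # every positive compared with every negative
    correct = (diff > 0).sum() + 0.5 * (diff == 0).sum()
    return float(correct / (len(pos) * len(neg)))

# Hypothetical toy data: each array holds patch scores for one volume.
volumes = [
    np.array([0.10, 0.20, 0.90]),  # abnormal: one suspicious patch dominates
    np.array([0.10, 0.30, 0.20]),  # normal
    np.array([0.05, 0.40, 0.10]),  # normal
    np.array([0.20, 0.35, 0.10]),  # abnormal, but weakly scored
]
labels = [1, 0, 0, 1]

scores = [volume_score(v) for v in volumes]
volume_auc = auc(labels, scores)
print(scores, volume_auc)
```

Max-pooling reflects the imbalance described in the abstract: a single suspicious patch is enough to flag a volume, so averaging over the many normal patches would dilute the signal.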
Submitted 19 March, 2024;
originally announced March 2024.
-
Exact coherent structures in two-dimensional turbulence identified with convolutional autoencoders
Authors:
Jacob Page,
Joe Holey,
Michael P. Brenner,
Rich R. Kerswell
Abstract:
Convolutional autoencoders are used to deconstruct the changing dynamics of two-dimensional Kolmogorov flow as $Re$ is increased from weakly chaotic flow at $Re=40$ to a chaotic state dominated by a domain-filling vortex pair at $Re=400$. The highly accurate embeddings allow us to visualise the evolving structure of state space and are interpretable using `latent Fourier analysis' (Page {\em et al.}, \emph{Phys. Rev. Fluids} \textbf{6}, 2021). Individual latent Fourier modes decode into vortical structures with a streamwise lengthscale controlled by the latent wavenumber, $l$, with only a small number $l \lesssim 8$ required to accurately represent the flow. Latent Fourier projections reveal a detached class of bursting events at $Re=40$ which merge with the low-dissipation dynamics as $Re$ is increased to $100$. We use doubly- ($l=2$) or triply- ($l=3$) periodic latent Fourier modes to generate guesses for UPOs (unstable periodic orbits) associated with high-dissipation events. While the doubly-periodic UPOs are representative of the high-dissipation dynamics at $Re=40$, the same class of UPOs moves away from the attractor at $Re=100$ -- where the associated bursting events typically involve larger-scale ($l=1$) structure too. At $Re=400$ an entirely different embedding structure is formed within the network in which no distinct representations of small-scale vortices are observed; instead the network embeds all snapshots based around a large-scale template for the condensate. We use latent Fourier projections to find an associated `large-scale' UPO which we believe to be a finite-$Re$ continuation of a solution to the Euler equations.
Submitted 22 September, 2023;
originally announced September 2023.
-
In search of lost introns
Authors:
Miklós Csűrös,
J. Andrew Holey,
Igor B. Rogozin
Abstract:
Many fundamental questions concerning the emergence and subsequent evolution of eukaryotic exon-intron organization are still unsettled. Genome-scale comparative studies, which can shed light on crucial aspects of eukaryotic evolution, require adequate computational tools.
We describe novel computational methods for studying spliceosomal intron evolution. Our goal is to give a reliable characterization of the dynamics of intron evolution. Our algorithmic innovations address the identification of orthologous introns, and the likelihood-based analysis of intron data. We discuss a compression method for the evaluation of the likelihood function, which is noteworthy for phylogenetic likelihood problems in general. We prove that after $O(nL)$ preprocessing time, subsequent evaluations take $O(nL/\log L)$ time almost surely in the Yule-Harding random model of $n$-taxon phylogenies, where $L$ is the input sequence length.
We illustrate the practicality of our methods by compiling and analyzing a data set involving 18 eukaryotes, more than in any other study to date. The study yields the surprising result that ancestral eukaryotes were fairly intron-rich. For example, the bilaterian ancestor is estimated to have had more than 90% as many introns as vertebrates do now.
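To give a feel for why compression helps likelihood evaluation, the sketch below shows the simplest form of the idea: identical columns of an intron presence/absence matrix are grouped, so the per-site likelihood is computed once per distinct pattern and weighted by its count. This is not the paper's algorithm, whose $O(nL/\log L)$ bound is not reproduced here. The names `compress_columns`, `log_likelihood`, and `toy_site_likelihood` are hypothetical, and the toy site model ignores the phylogeny entirely.

```python
import math
from collections import Counter

def compress_columns(rows):
    """Count identical columns of an n-taxon presence/absence matrix."""
    return Counter(zip(*rows))

def log_likelihood(rows, site_likelihood):
    """Sum log-likelihoods over distinct column patterns, weighted by multiplicity."""
    patterns = compress_columns(rows)
    return sum(count * math.log(site_likelihood(p)) for p, count in patterns.items())

def toy_site_likelihood(pattern, p_present=0.3):
    """Placeholder model (assumption): each taxon has the intron independently.

    A real analysis would evaluate the pattern on the phylogeny instead.
    """
    k = sum(pattern)
    return p_present ** k * (1.0 - p_present) ** (len(pattern) - k)

# Hypothetical data: 3 taxa (rows) by 7 candidate intron sites (columns).
rows = [
    [1, 1, 0, 0, 1, 1, 0],
    [1, 0, 0, 0, 1, 0, 0],
    [1, 1, 0, 1, 1, 1, 0],
]

patterns = compress_columns(rows)
compressed = log_likelihood(rows, toy_site_likelihood)
# Uncompressed reference: evaluate every column separately.
naive = sum(math.log(toy_site_likelihood(col)) for col in zip(*rows))
print(len(patterns), compressed, naive)
```

Here 7 columns collapse to 4 distinct patterns, so the site model is evaluated 4 times instead of 7 while the total log-likelihood is unchanged.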
Submitted 3 February, 2007;
originally announced February 2007.