-
Derivative of the Riemann-Hilbert map
Authors:
Vladimir Marković,
Ognjen Tošić
Abstract:
Given a pair $(X,\nabla)$, consisting of a closed Riemann surface $X$ and a holomorphic connection $\nabla$ on the trivial principal bundle $X\times\mathrm{SL}_2(\mathbb{C})\to X$, the Riemann-Hilbert map sends $(X,\nabla)$ to its monodromy representation. We compute the derivative of this map, and provide a simple description of the locus where it is injective, recovering in the process several p…
▽ More
Given a pair $(X,\nabla)$, consisting of a closed Riemann surface $X$ and a holomorphic connection $\nabla$ on the trivial principal bundle $X\times\mathrm{SL}_2(\mathbb{C})\to X$, the Riemann-Hilbert map sends $(X,\nabla)$ to its monodromy representation. We compute the derivative of this map, and provide a simple description of the locus where it is injective, recovering in the process several previously obtained results.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Large-scale quantum reservoir learning with an analog quantum computer
Authors:
Milan Kornjača,
Hong-Ye Hu,
Chen Zhao,
Jonathan Wurtz,
Phillip Weinberg,
Majd Hamdan,
Andrii Zhdanov,
Sergio H. Cantu,
Hengyun Zhou,
Rodrigo Araiza Bravo,
Kevin Bagnall,
James I. Basham,
Joseph Campo,
Adam Choukri,
Robert DeAngelo,
Paige Frederick,
David Haines,
Julian Hammett,
Ning Hsu,
Ming-Guang Hu,
Florian Huber,
Paul Niklas Jepsen,
Ningyuan Jia,
Thomas Karolyshyn,
Minho Kwon
, et al. (28 additional authors not shown)
Abstract:
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lac…
▽ More
Quantum machine learning has gained considerable attention as quantum technology advances, presenting a promising approach for efficiently learning complex data patterns. Despite this promise, most contemporary quantum methods require significant resources for variational parameter optimization and face issues with vanishing gradients, leading to experiments that are either limited in scale or lack potential for quantum advantage. To address this, we develop a general-purpose, gradient-free, and scalable quantum reservoir learning algorithm that harnesses the quantum dynamics of neutral-atom analog quantum computers to process data. We experimentally implement the algorithm, achieving competitive performance across various categories of machine learning tasks, including binary and multi-class classification, as well as timeseries prediction. Effective and improving learning is observed with increasing system sizes of up to 108 qubits, demonstrating the largest quantum machine learning experiment to date. We further observe comparative quantum kernel advantage in learning tasks by constructing synthetic datasets based on the geometric differences between generated quantum and classical data kernels. Our findings demonstrate the potential of utilizing classically intractable quantum correlations for effective machine learning. We expect these results to stimulate further extensions to different quantum hardware and machine learning paradigms, including early fault-tolerant hardware and generative machine learning tasks.
△ Less
Submitted 2 July, 2024;
originally announced July 2024.
-
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness
Authors:
Satyam Kumar,
Sai Srujana Buddi,
Utkarsh Oggy Sarawgi,
Vineet Garg,
Shivesh Ranjan,
Ognjen,
Rudovic,
Ahmed Hussen Abdelaziz,
Saurabh Adya
Abstract:
Voice activity detection (VAD) is a critical component in various applications such as speech recognition, speech enhancement, and hands-free communication systems. With the increasing demand for personalized and context-aware technologies, the need for effective personalized VAD systems has become paramount. In this paper, we present a comparative analysis of Personalized Voice Activity Detection…
▽ More
Voice activity detection (VAD) is a critical component in various applications such as speech recognition, speech enhancement, and hands-free communication systems. With the increasing demand for personalized and context-aware technologies, the need for effective personalized VAD systems has become paramount. In this paper, we present a comparative analysis of Personalized Voice Activity Detection (PVAD) systems to assess their real-world effectiveness. We introduce a comprehensive approach to assess PVAD systems, incorporating various performance metrics such as frame-level and utterance-level error rates, detection latency and accuracy, alongside user-level analysis. Through extensive experimentation and evaluation, we provide a thorough understanding of the strengths and limitations of various PVAD variants. This paper advances the understanding of PVAD technology by offering insights into its efficacy and viability in practical applications using a comprehensive set of metrics.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition
Authors:
Ognjen Kundacina,
Vladimir Vincan,
Dragisa Miskovic
Abstract:
Emphasizing a data-centric AI approach, this paper introduces a novel two-stage active learning (AL) pipeline for automatic speech recognition (ASR), combining unsupervised and supervised AL methods. The first stage utilizes unsupervised AL by using x-vectors clustering for diverse sample selection from unlabeled speech data, thus establishing a robust initial dataset for the subsequent supervised…
▽ More
Emphasizing a data-centric AI approach, this paper introduces a novel two-stage active learning (AL) pipeline for automatic speech recognition (ASR), combining unsupervised and supervised AL methods. The first stage utilizes unsupervised AL by using x-vectors clustering for diverse sample selection from unlabeled speech data, thus establishing a robust initial dataset for the subsequent supervised AL. The second stage incorporates a supervised AL strategy, with a batch AL method specifically developed for ASR, aimed at selecting diverse and informative batches of samples. Here, sample diversity is also achieved using x-vectors clustering, while the most informative samples are identified using a Bayesian AL method tailored for ASR with an adaptation of Monte Carlo dropout to approximate Bayesian inference. This approach enables precise uncertainty estimation, thereby enhancing ASR model training with significantly reduced data requirements. Our method has shown superior performance compared to competing methods on homogeneous, heterogeneous, and OOD test sets, demonstrating that strategic sample selection and innovative Bayesian modeling can substantially optimize both labeling effort and data utilization in deep learning-based ASR applications.
△ Less
Submitted 3 May, 2024;
originally announced June 2024.
-
Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads
Authors:
Avelina Asada Hadji-Kyriacou,
Ognjen Arandjelovic
Abstract:
Pre-trained Language Models (LMs) exhibit strong zero-shot and in-context learning capabilities; however, their behaviors are often difficult to control. By utilizing Reinforcement Learning from Human Feedback (RLHF), it is possible to fine-tune unsupervised LMs to follow instructions and produce outputs that reflect human preferences. Despite its benefits, RLHF has been shown to potentially harm…
▽ More
Pre-trained Language Models (LMs) exhibit strong zero-shot and in-context learning capabilities; however, their behaviors are often difficult to control. By utilizing Reinforcement Learning from Human Feedback (RLHF), it is possible to fine-tune unsupervised LMs to follow instructions and produce outputs that reflect human preferences. Despite its benefits, RLHF has been shown to potentially harm a language model's reasoning capabilities and introduce artifacts such as hallucinations where the model may fabricate facts. To address this issue we introduce Direct Preference Heads (DPH), a fine-tuning framework that enables LMs to learn human preference signals through an auxiliary reward head without directly affecting the output distribution of the language modeling head. We perform a theoretical analysis of our objective function and find strong ties to Conservative Direct Preference Optimization (cDPO). Finally we evaluate our models on GLUE, RACE, and the GPT4All evaluation suite and demonstrate that our method produces models which achieve higher scores than those fine-tuned with Supervised Fine-Tuning (SFT) or Direct Preference Optimization (DPO) alone.
△ Less
Submitted 30 May, 2024;
originally announced May 2024.
-
Covariant Schrödinger Operator and $L^2$-Vanishing Property on Riemannian Manifolds
Authors:
Ognjen Milatovic
Abstract:
Let $M$ be a complete Riemannian manifold satisfying a weighted Poincaré inequality, and let $\mathcal{E}$ be a Hermitian vector bundle over $M$ equipped with a metric covariant derivative $\nabla$. We consider the operator $H_{X,V}=\nabla^{\dagger}\nabla+\nabla_{X}+ V$, where $\nabla^{\dagger}$ is the formal adjoint of $\nabla$ with respect to the inner product in the space of square-integrable s…
▽ More
Let $M$ be a complete Riemannian manifold satisfying a weighted Poincaré inequality, and let $\mathcal{E}$ be a Hermitian vector bundle over $M$ equipped with a metric covariant derivative $\nabla$. We consider the operator $H_{X,V}=\nabla^{\dagger}\nabla+\nabla_{X}+ V$, where $\nabla^{\dagger}$ is the formal adjoint of $\nabla$ with respect to the inner product in the space of square-integrable sections of $\mathcal{E}$, $X$ is a smooth (real) vector field on $M$, and $V$ is a fiberwise self-adjoint, smooth section of the endomorphism bundle $\textrm{End }\mathcal{E}$. We give a sufficient condition for the triviality of the $L^2$-kernel of $H_{X,V}$. As a corollary, putting $X\equiv 0$ and working in the setting of a Clifford bundle equipped with a Clifford connection $\nabla$, we obtain the triviality of the $L^2$-kernel of $D^2$, where $D$ is the Dirac operator corresponding to $\nabla$. In particular, when $\mathcal{E}=Λ^{k}T^*M$ and $D^2$ is the Hodge--deRham Laplacian on $k$-forms, we recover some recent vanishing results for $L^2$-harmonic $k$-forms.
△ Less
Submitted 6 May, 2024; v1 submitted 1 May, 2024;
originally announced May 2024.
-
Constant energy families of harmonic maps
Authors:
Ognjen Tošić
Abstract:
For a negatively curved manifold $M$ and a continuous map $ψ:Σ\to M$ from a closed surface $Σ$, we study complex submanifolds of Teichmüller space $\mathcal{S}\subset\mathcal{T}(Σ)$ such that the harmonic maps $\{h_X:X\to M\text{ for }X\in\mathcal{S}\}$ in the homotopy class of $ψ$ all have equal energy. When $M$ is real analytic with negative Hermitian sectional curvature, we show that for any su…
▽ More
For a negatively curved manifold $M$ and a continuous map $ψ:Σ\to M$ from a closed surface $Σ$, we study complex submanifolds of Teichmüller space $\mathcal{S}\subset\mathcal{T}(Σ)$ such that the harmonic maps $\{h_X:X\to M\text{ for }X\in\mathcal{S}\}$ in the homotopy class of $ψ$ all have equal energy. When $M$ is real analytic with negative Hermitian sectional curvature, we show that for any such $\mathcal{S}$, there exists a closed Riemann surface $Y$, such that any $h_X$ for $X\in\mathcal{S}$ factors as a holomorphic map $φ_X:X\to Y$ followed by a fixed harmonic map $h:Y\to M$. This answers a question posed by both Toledo and Gromov. As a first application, we show a factorization result for harmonic maps from normal projective varieties to $M$. As a second application, we study homomorphisms from finite index subgroups of map** class groups to $π_1(M)$.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Fast single atom imaging in optical lattice arrays
Authors:
Lin Su,
Alexander Douglas,
Michal Szurek,
Anne H. Hebert,
Aaron Krahn,
Robin Groth,
Gregory A. Phelps,
Ognjen Markovic,
Markus Greiner
Abstract:
High-resolution fluorescence imaging of ultracold atoms and molecules is paramount to performing quantum simulation and computation in optical lattices and optical tweezers. Imaging durations in these experiments typically range from a millisecond to a second, which can significantly limit the cycle time. In this work, we present fast, 2.4 us single-atom imaging in lattices, with 99.4% fidelity. A…
▽ More
High-resolution fluorescence imaging of ultracold atoms and molecules is paramount to performing quantum simulation and computation in optical lattices and optical tweezers. Imaging durations in these experiments typically range from a millisecond to a second, which can significantly limit the cycle time. In this work, we present fast, 2.4 us single-atom imaging in lattices, with 99.4% fidelity. Additionally, we resolve lattice sites spaced within the diffraction limit by using accordion lattices to increase the atom spacing before imaging. This overcomes the challenge of imaging small-spacing lattices and enables the study of extended Hubbard models using magnetic atoms. We also demonstrate number-resolved imaging without parity projection, which will facilitate experiments such as the exploration of high-filling phases in the extended Bose-Hubbard models, multi-band or SU(N) Fermi-Hubbard models, and quantum link models.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
On a generalization of compact and connected spaces
Authors:
Nebojsa Elez,
Ognjen Papaz
Abstract:
In this paper we will give two different natural generalizations of compact spaces and connected spaces simultaneously. We will show that these generalizations coincide for the subspaces of the real line and that they differ for subspaces of plane.
In this paper we will give two different natural generalizations of compact spaces and connected spaces simultaneously. We will show that these generalizations coincide for the subspaces of the real line and that they differ for subspaces of plane.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
Fully Distributed Cooperative Multi-agent Underwater Obstacle Avoidance Under Dog Walking Paradigm
Authors:
Kanzhong Yao,
Ognjen Marjanovic,
Simon Watson
Abstract:
Navigation in cluttered underwater environments is challenging, especially when there are constraints on communication and self-localisation. Part of the fully distributed underwater navigation problem has been resolved by introducing multi-agent robot teams, however when the environment becomes cluttered, the problem remains unresolved. In this paper, we first studied the connection between every…
▽ More
Navigation in cluttered underwater environments is challenging, especially when there are constraints on communication and self-localisation. Part of the fully distributed underwater navigation problem has been resolved by introducing multi-agent robot teams, however when the environment becomes cluttered, the problem remains unresolved. In this paper, we first studied the connection between everyday activity of dog walking and the cooperative underwater obstacle avoidance problem. Inspired by this analogy, we propose a novel dog walking paradigm and implement it in a multi-agent underwater system. Simulations were conducted across various scenarios, with performance benchmarked against traditional methods utilising Image-Based Visual Servoing in a multi-agent setup. Results indicate that our dog walking-inspired paradigm significantly enhances cooperative behavior among agents and outperforms the existing approach in navigating through obstacles.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Virtual Elastic Tether: a New Approach for Multi-agent Navigation in Confined Aquatic Environments
Authors:
Kanzhong Yao,
Xueliang Cheng,
Keir Groves,
Barry Lennox,
Ognjen Marjanovic,
Simon Watson
Abstract:
Underwater navigation is a challenging area in the field of mobile robotics due to inherent constraints in self-localisation and communication in underwater environments. Some of these challenges can be mitigated by using collaborative multi-agent teams. However, when applied underwater, the robustness of traditional multi-agent collaborative control approaches is highly limited due to the unavail…
▽ More
Underwater navigation is a challenging area in the field of mobile robotics due to inherent constraints in self-localisation and communication in underwater environments. Some of these challenges can be mitigated by using collaborative multi-agent teams. However, when applied underwater, the robustness of traditional multi-agent collaborative control approaches is highly limited due to the unavailability of reliable measurements. In this paper, the concept of a Virtual Elastic Tether (VET) is introduced in the context of incomplete state measurements, which represents an innovative approach to underwater navigation in confined spaces. The concept of VET is formulated and validated using the Cooperative Aquatic Vehicle Exploration System (CAVES), which is a sim-to-real multi-agent aquatic robotic platform. Within this framework, a vision-based Autonomous Underwater Vehicle-Autonomous Surface Vehicle leader-follower formulation is developed. Experiments were conducted in both simulation and on a physical platform, benchmarked against a traditional Image-Based Visual Servoing approach. Results indicate that the formation of the baseline approach fails under discrete disturbances, when induced distances between the robots exceeds 0.6 m in simulation and 0.3 m in the real world. In contrast, the VET-enhanced system recovers to pre-perturbation distances within 5 seconds. Furthermore, results illustrate the successful navigation of VET-enhanced CAVES in a confined water pond where the baseline approach fails to perform adequately.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
Adaptive multi-spectral mimicking with 2D-material nanoresonator networks
Authors:
Yujie Luo,
Thomas Christensen,
Ognjen Ilic
Abstract:
Active nanophotonic materials that can emulate and adapt between many different spectral profiles -- with high fidelity and over a broad bandwidth -- could have a far-reaching impact, but are challenging to design due to a high-dimensional and complex design space. Here, we show that a metamaterial network of coupled 2D-material nanoresonators in graphene can adaptively match multiple complex abso…
▽ More
Active nanophotonic materials that can emulate and adapt between many different spectral profiles -- with high fidelity and over a broad bandwidth -- could have a far-reaching impact, but are challenging to design due to a high-dimensional and complex design space. Here, we show that a metamaterial network of coupled 2D-material nanoresonators in graphene can adaptively match multiple complex absorption spectra via a set of input voltages. To design such networks, we develop a semi-analytical auto-differentiable dipole-coupled model that allows scalable optimization of high-dimensional networks with many elements and voltage signals. As a demonstration of multi-spectral capability, we design a single network capable of mimicking four spectral targets resembling select gases (nitric oxide, nitrogen dioxide, methane, nitrous oxide) with very high fidelity (${>}\,90\%$). Our results are relevant for the design of highly reconfigurable optical materials and platforms for applications in sensing, communication and display technology, and signature and thermal management.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Artwork Protection Against Neural Style Transfer Using Locally Adaptive Adversarial Color Attack
Authors:
Zhongliang Guo,
Junhao Dong,
Yifei Qian,
Kaixuan Wang,
Weiye Li,
Ziheng Guo,
Yuheng Wang,
Yanli Li,
Ognjen Arandjelović,
Lei Fang
Abstract:
Neural style transfer (NST) generates new images by combining the style of one image with the content of another. However, unauthorized NST can exploit artwork, raising concerns about artists' rights and motivating the development of proactive protection methods. We propose Locally Adaptive Adversarial Color Attack (LAACA), empowering artists to protect their artwork from unauthorized style transf…
▽ More
Neural style transfer (NST) generates new images by combining the style of one image with the content of another. However, unauthorized NST can exploit artwork, raising concerns about artists' rights and motivating the development of proactive protection methods. We propose Locally Adaptive Adversarial Color Attack (LAACA), empowering artists to protect their artwork from unauthorized style transfer by processing before public release. By delving into the intricacies of human visual perception and the role of different frequency components, our method strategically introduces frequency-adaptive perturbations in the image. These perturbations significantly degrade the generation quality of NST while maintaining an acceptable level of visual change in the original image, ensuring that potential infringers are discouraged from using the protected artworks, because of its bad NST generation quality. Additionally, existing metrics often overlook the importance of color fidelity in evaluating color-mattered tasks, such as the quality of NST-generated images, which is crucial in the context of artistic works. To comprehensively assess the color-mattered tasks, we propose the Adversarial Color Distance Metric (ACDM), designed to quantify the color difference of images pre- and post-manipulations. Experimental results confirm that attacking NST using LAACA results in visually inferior style transfer, and the ACDM can efficiently measure color-mattered tasks. By providing artists with a tool to safeguard their intellectual property, our work relieves the socio-technical challenges posed by the misuse of NST in the art community.
△ Less
Submitted 5 July, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Near-ideal Microwave Photon to Electron Conversion in a High Impedance Quantum Circuit
Authors:
Ognjen Stanisavljević,
Jean-Côme Philippe,
Julien Gabelli,
Marco Aprili,
Jérôme Estève,
Julien Basset
Abstract:
Photoelectric detectors cover a wide frequency spectrum spanning from the far ultraviolet to the infrared light with high sensitivity, large quantum efficiency and low dark current. The equivalent photoelectric detection of microwave frequency photons has remained elusive due to inherent differences between microwave photon energy and the interband transition energies exploited in standard photoel…
▽ More
Photoelectric detectors cover a wide frequency spectrum spanning from the far ultraviolet to the infrared light with high sensitivity, large quantum efficiency and low dark current. The equivalent photoelectric detection of microwave frequency photons has remained elusive due to inherent differences between microwave photon energy and the interband transition energies exploited in standard photoelectric detectors. Here we present the realization of a near-ideal microwave photon to electron converter at a frequency typical of circuit quantum electrodynamics. These unique properties are enabled by the use of a high kinetic inductance disordered superconductor, granular aluminium, to enhance the light-matter interaction. This experiment constitutes an important proof of concept regarding low energy microwave photon to electron conversion unveiling new possibilities such as the detection of single microwave photons using charge detection. It finds significance in quantum research openning doors to a wide array of applications, from quantum-enhanced sensing to exploring the fundamental properties of quantum states.
△ Less
Submitted 21 December, 2023;
originally announced December 2023.
-
BICM-compatible Rate Adaptive Geometric Constellation Sha** Using Optimized Many-to-one Labeling
Authors:
Metodi Plamenov Yankov,
Smaranika Swain,
Ognjen Jovanovic,
Darko Zibar,
Francesco Da Ros
Abstract:
In this paper, a rate adaptive geometric constellation sha** (GCS) scheme which is fully backward-compatible with existing state of the art bit-interleaved coded modulation (BICM) systems is proposed and experimentally demonstrated. The system relies on optimization of the positions of the quadrature amplitude modulation (QAM) points on the I/Q plane for maximized achievable information rate, wh…
▽ More
In this paper, a rate adaptive geometric constellation sha** (GCS) scheme which is fully backward-compatible with existing state of the art bit-interleaved coded modulation (BICM) systems is proposed and experimentally demonstrated. The system relies on optimization of the positions of the quadrature amplitude modulation (QAM) points on the I/Q plane for maximized achievable information rate, while maintaining quantization and fiber nonlinear noise robustness. Furthermore, `dummy' bits are multiplexed with coded bits before map** to symbols. Rate adaptivity is achieved by tuning the ratio of coded and `dummy' bits, while maintaining a fixed forward error-correction block and a fixed modulation format size. The points' positions and their labeling are optimized using automatic differentiation. The proposed GCS scheme is compared to a time-sharing hybrid (TH) QAM modulation and the now mainstream probabilistic amplitude sha** (PAS) scheme. The TH without sha** is outperformed for all studied data rates in a simulated linear channel by up to 0.7 dB. In a linear channel, PAS is shown to outperform the proposed GCS scheme, while similar performances are reported for PAS and the proposed GCS in a simulated nonlinear fiber channel. The GCS scheme is experimentally demonstrated in a multi-span recirculating loop coherent optical fiber transmission system with a total distance of up to 3000 km. Near-continuous zero-error flexible throughput is reported as a function of the transmission distance. Up to 1-2 spans of increased reach gains are achieved at the same net data rate w.r.t. conventional QAM. At a given distance, up to 0.79 bits/2D symbol of gain w.r.t. conventional QAM is achieved. In the experiment, similar performance to PAS is demonstrated.
△ Less
Submitted 13 March, 2024; v1 submitted 10 November, 2023;
originally announced December 2023.
-
Context-PEFT: Efficient Multi-Modal, Multi-Task Fine-Tuning
Authors:
Avelina Asada Hadji-Kyriacou,
Ognjen Arandjelovic
Abstract:
This paper introduces a novel Parameter-Efficient Fine-Tuning (PEFT) framework for multi-modal, multi-task transfer learning with pre-trained language models. PEFT techniques such as LoRA, BitFit and IA3 have demonstrated comparable performance to full fine-tuning of pre-trained models for specific downstream tasks, all while demanding significantly fewer trainable parameters and reduced GPU memor…
▽ More
This paper introduces a novel Parameter-Efficient Fine-Tuning (PEFT) framework for multi-modal, multi-task transfer learning with pre-trained language models. PEFT techniques such as LoRA, BitFit and IA3 have demonstrated comparable performance to full fine-tuning of pre-trained models for specific downstream tasks, all while demanding significantly fewer trainable parameters and reduced GPU memory consumption. However, in the context of multi-modal fine-tuning, the need for architectural modifications or full fine-tuning often becomes apparent. To address this we propose Context-PEFT, which learns different groups of adaptor parameters based on the token's domain. This approach enables LoRA-like weight injection without requiring additional architectural changes. Our method is evaluated on the COCO captioning task, where it outperforms full fine-tuning under similar data constraints while simultaneously offering a substantially more parameter-efficient and computationally economical solution.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
Benchmarking Pathology Feature Extractors for Whole Slide Image Classification
Authors:
Georg Wölflein,
Dyke Ferber,
Asier R. Meneghetti,
Omar S. M. El Nahhas,
Daniel Truhn,
Zunamys I. Carrero,
David J. Harrison,
Ognjen Arandjelović,
Jakob Nikolas Kather
Abstract:
Weakly supervised whole slide image classification is a key task in computational pathology, which involves predicting a slide-level label from a set of image patches constituting the slide. Constructing models to solve this task involves multiple design choices, often made without robust empirical or conclusive theoretical justification. To address this, we conduct a comprehensive benchmarking of…
▽ More
Weakly supervised whole slide image classification is a key task in computational pathology, which involves predicting a slide-level label from a set of image patches constituting the slide. Constructing models to solve this task involves multiple design choices, often made without robust empirical or conclusive theoretical justification. To address this, we conduct a comprehensive benchmarking of feature extractors to answer three critical questions: 1) Is stain normalisation still a necessary preprocessing step? 2) Which feature extractors are best for downstream slide-level classification? 3) How does magnification affect downstream performance? Our study constitutes the most comprehensive evaluation of publicly available pathology feature extractors to date, involving more than 10,000 training runs across 14 feature extractors, 9 tasks, 5 datasets, 3 downstream architectures, 2 levels of magnification, and various preprocessing setups. Our findings challenge existing assumptions: 1) We observe empirically, and by analysing the latent space, that skip** stain normalisation and image augmentations does not degrade performance, while significantly reducing memory and computational demands. 2) We develop a novel evaluation metric to compare relative downstream performance, and show that the choice of feature extractor is the most consequential factor for downstream performance. 3) We find that lower-magnification slides are sufficient for accurate slide-level classification. Contrary to previous patch-level benchmarking studies, our approach emphasises clinical relevance by focusing on slide-level biomarker prediction tasks in a weakly supervised setting with external validation cohorts. Our findings stand to streamline digital pathology workflows by minimising preprocessing needs and informing the selection of feature extractors.
△ Less
Submitted 21 June, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
Extended Sobolev Scale on $\mathbb{Z}^n$
Authors:
Ognjen Milatovic
Abstract:
In analogy with the definition of ``extended Sobolev scale" on $\mathbb{R}^n$ by Mikhailets and Murach, working in the setting of the lattice $\mathbb{Z}^n$, we define the ``extended Sobolev scale" $H^{\varphi}(\mathbb{Z}^n)$, where $\varphi$ is a function which is $RO$-varying at infinity. Using the scale $H^{\varphi}(\mathbb{Z}^n)$, we describe all Hilbert function-spaces that serve as interpola…
▽ More
In analogy with the definition of ``extended Sobolev scale" on $\mathbb{R}^n$ by Mikhailets and Murach, working in the setting of the lattice $\mathbb{Z}^n$, we define the ``extended Sobolev scale" $H^{\varphi}(\mathbb{Z}^n)$, where $\varphi$ is a function which is $RO$-varying at infinity. Using the scale $H^{\varphi}(\mathbb{Z}^n)$, we describe all Hilbert function-spaces that serve as interpolation spaces with respect to a pair of discrete Sobolev spaces $[H^{(s_0)}(\mathbb{Z}^n), H^{(s_1)}(\mathbb{Z}^n)]$, with $s_0<s_1$. We use this interpolation result to obtain the map** property and the Fredholmness property of (discrete) pseudo-differential operators (PDOs) in the context of the scale $H^{\varphi}(\mathbb{Z}^n)$. Furthermore, starting from a first-order positive-definite (discrete) PDO $A$ of elliptic type, we define the ``extended discrete $A$-scale" $H^{\varphi}_{A}(\mathbb{Z}^n)$ and show that it coincides, up to norm equivalence, with the scale $H^{\varphi}(\mathbb{Z}^n)$. Additionally, we establish the $\mathbb{Z}^n$-analogues of several other properties of the scale $H^{\varphi}(\mathbb{R}^n)$.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Semi-Supervised Crowd Counting with Contextual Modeling: Facilitating Holistic Understanding of Crowd Scenes
Authors:
Yifei Qian,
Xiaopeng Hong,
Zhongliang Guo,
Ognjen Arandjelović,
Carl R. Donovan
Abstract:
To alleviate the heavy annotation burden for training a reliable crowd counting model and thus make the model more practicable and accurate by being able to benefit from more data, this paper presents a new semi-supervised method based on the mean teacher framework. When there is a scarcity of labeled data available, the model is prone to overfit local patches. Within such contexts, the convention…
▽ More
To alleviate the heavy annotation burden for training a reliable crowd counting model and thus make the model more practicable and accurate by being able to benefit from more data, this paper presents a new semi-supervised method based on the mean teacher framework. When there is a scarcity of labeled data available, the model is prone to overfit local patches. Within such contexts, the conventional approach of solely improving the accuracy of local patch predictions through unlabeled data proves inadequate. Consequently, we propose a more nuanced approach: fostering the model's intrinsic 'subitizing' capability. This ability allows the model to accurately estimate the count in regions by leveraging its understanding of the crowd scenes, mirroring the human cognitive process. To achieve this goal, we apply masking on unlabeled data, guiding the model to make predictions for these masked patches based on the holistic cues. Furthermore, to help with feature learning, herein we incorporate a fine-grained density classification task. Our method is general and applicable to most existing crowd counting methods as it doesn't have strict structural or loss constraints. In addition, we observe that the model trained with our framework exhibits a 'subitizing'-like behavior. It accurately predicts low-density regions with only a 'glance', while incorporating local details to predict high-density regions. Our method achieves the state-of-the-art performance, surpassing previous approaches by a large margin on challenging benchmarks such as ShanghaiTech A and UCF-QNRF. The code is available at: https://github.com/cha15yq/MRC-Crowd.
△ Less
Submitted 20 April, 2024; v1 submitted 16 October, 2023;
originally announced October 2023.
-
A Stiffness-Oriented Model Order Reduction Method for Low-Inertia Power Systems
Authors:
Simon Muntwiler,
Ognjen Stanojev,
Andrea Zanelli,
Gabriela Hug,
Melanie N. Zeilinger
Abstract:
This paper presents a novel model order reduction technique tailored for power systems with a large share of inverter-based energy resources. Such systems exhibit an increased level of dynamic stiffness compared to traditional power systems, posing challenges for time-domain simulations and control design. Our approach involves rotation of the coordinate system of a linearized system using a trans…
▽ More
This paper presents a novel model order reduction technique tailored for power systems with a large share of inverter-based energy resources. Such systems exhibit an increased level of dynamic stiffness compared to traditional power systems, posing challenges for time-domain simulations and control design. Our approach involves rotation of the coordinate system of a linearized system using a transformation matrix derived from the real Jordan canonical form, leading to mode decoupling. The fast modes are then truncated in the rotated coordinate system to obtain a lower-order model with reduced stiffness. Applying the same transformation to the original nonlinear system results in an approximate separation of slow and fast states, which can be truncated to reduce the stiffness. The resulting reduced-order model demonstrates an accurate time-domain performance, the slow eigenvalues of the linearized system are correctly preserved, and a reduction in the model stiffness is achieved, allowing for accurate integration with increased step size. Our methodology is assessed in detail for a 3-bus system with generation units involving grid-forming/following converters and synchronous machines, where it allows for a computational speed-up of up to 100x compared to the original system. Several standard larger test systems are also considered.
△ Less
Submitted 5 July, 2024; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Qualitative Analysis for Validating IEC 62443-4-2 Requirements in DevSecOps
Authors:
Christian Göttel,
Maëlle Kabir-Querrec,
David Kozhaya,
Thanikesavan Sivanthi,
Ognjen Vuković
Abstract:
Validation of conformance to cybersecurity standards for industrial automation and control systems is an expensive and time consuming process which can delay the time to market. It is therefore crucial to introduce conformance validation stages into the continuous integration/continuous delivery pipeline of products. However, designing such conformance validation in an automated fashion is a highl…
▽ More
Validation of conformance to cybersecurity standards for industrial automation and control systems is an expensive and time consuming process which can delay the time to market. It is therefore crucial to introduce conformance validation stages into the continuous integration/continuous delivery pipeline of products. However, designing such conformance validation in an automated fashion is a highly non-trivial task that requires expert knowledge and depends upon the available security tools, ease of integration into the DevOps pipeline, as well as support for IT and OT interfaces and protocols.
This paper addresses the aforementioned problem focusing on the automated validation of ISA/IEC 62443-4-2 standard component requirements. We present an extensive qualitative analysis of the standard requirements and the current tooling landscape to perform validation. Our analysis demonstrates the coverage established by the currently available tools and sheds light on current gaps to achieve full automation and coverage. Furthermore, we showcase for every component requirement where in the CI/CD pipeline stage it is recommended to test it and the tools to do so.
△ Less
Submitted 23 October, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Harmonic projections in negative curvature II: large convex sets
Authors:
Ognjen Tošić
Abstract:
An important result in the theory of harmonic maps is due to Benoist--Hulin: given a quasi-isometry $f:X\to Y$ between pinched Hadamard manifolds, there exists a unique harmonic map at a finite distance from $f$. Here we show existence of harmonic maps under a weaker condition on $f$, that we call non-collapsing -- we require that the following two conditions hold uniformly in $x\in X$: (1) averag…
▽ More
An important result in the theory of harmonic maps is due to Benoist--Hulin: given a quasi-isometry $f:X\to Y$ between pinched Hadamard manifolds, there exists a unique harmonic map at a finite distance from $f$. Here we show existence of harmonic maps under a weaker condition on $f$, that we call non-collapsing -- we require that the following two conditions hold uniformly in $x\in X$: (1) average distance from $f(x)$ to $f(y)$ for $y$ on the sphere of radius $R$ centered at $x$ grows linearly with $R$ (2) the pre-image under $f$ of small cones with apex $f(x)$ have low harmonic measures on spheres centered at $x$. Using these ideas, we also continue the previous work of the author on existence of harmonic maps that are at a finite distance from projections to certain convex sets. We show this existence in a pinched negative curvature setting, when the convex set is large enough. For hyperbolic spaces, this includes the convex hulls of open sets in the sphere at infinity with sufficiently regular boundary.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Privacy-Preserving Distributed Market Mechanism for Active Distribution Networks
Authors:
Matthias Franke,
Ognjen Stanojev,
Lesia Mitridati,
Gabriela Hug
Abstract:
Amidst the worldwide efforts to decarbonize power networks, Local Electricity Markets (LEMs) in distribution networks are gaining importance due to the increased adoption of renewable energy sources and prosumers. Considering that LEMs involve data exchange among independent entities, privacy and cybersecurity are some of the main practical challenges in LEM design. This paper proposes a secure ma…
▽ More
Amidst the worldwide efforts to decarbonize power networks, Local Electricity Markets (LEMs) in distribution networks are gaining importance due to the increased adoption of renewable energy sources and prosumers. Considering that LEMs involve data exchange among independent entities, privacy and cybersecurity are some of the main practical challenges in LEM design. This paper proposes a secure market protocol using innovations from distributed optimization and Secure MultiParty Computation (SMPC). The considered LEM is formulated as an uncertainty-aware joint market for energy and reserves with affine balancing policies. To achieve scalability and enable the use of SMPC, market clearing is solved using the Consensus ADMM algorithm. Subsequently, the data exchange among participants via ADMM iterations is protected using the Shamir secret-sharing scheme to ensure privacy. The market protocol is further reinforced by a secure and verifiable settlement process that uses SMPC and ElGamal commitments to verify market quantities and by a secure recovery scheme for missing network measurements. Finally, the feasibility and performance of the proposed LEM are evaluated on a 15-bus test network.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
Differentiable Machine Learning-Based Modeling for Directly-Modulated Lasers
Authors:
Sergio Hernandez,
Ognjen Jovanovic,
Christophe Peucheret,
Francesco Da Ros,
Darko Zibar
Abstract:
End-to-end learning has become a popular method for joint transmitter and receiver optimization in optical communication systems. Such approach may require a differentiable channel model, thus hindering the optimization of links based on directly modulated lasers (DMLs). This is due to the DML behavior in the large-signal regime, for which no analytical solution is available. In this paper, this p…
▽ More
End-to-end learning has become a popular method for joint transmitter and receiver optimization in optical communication systems. Such approach may require a differentiable channel model, thus hindering the optimization of links based on directly modulated lasers (DMLs). This is due to the DML behavior in the large-signal regime, for which no analytical solution is available. In this paper, this problem is addressed by develo** and comparing differentiable machine learning-based surrogate models. The models are quantitatively assessed in terms of root mean square error and training/testing time. Once the models are trained, the surrogates are then tested in a numerical equalization setup, resembling a practical end-to-end scenario. Based on the numerical investigation conducted, the convolutional attention transformer is shown to outperform the other models considered.
△ Less
Submitted 4 January, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
Application of Deep Learning Methods in Monitoring and Optimization of Electric Power Systems
Authors:
Ognjen Kundacina
Abstract:
This PhD thesis thoroughly examines the utilization of deep learning techniques as a means to advance the algorithms employed in the monitoring and optimization of electric power systems. The first major contribution of this thesis involves the application of graph neural networks to enhance power system state estimation. The second key aspect of this thesis focuses on utilizing reinforcement lear…
▽ More
This PhD thesis thoroughly examines the utilization of deep learning techniques as a means to advance the algorithms employed in the monitoring and optimization of electric power systems. The first major contribution of this thesis involves the application of graph neural networks to enhance power system state estimation. The second key aspect of this thesis focuses on utilizing reinforcement learning for dynamic distribution network reconfiguration. The effectiveness of the proposed methods is affirmed through extensive experimentation and simulations.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Low-complexity Samples versus Symbols-based Neural Network Receiver for Channel Equalization
Authors:
Yevhenii Osadchuk,
Ognjen Jovanovic,
Stenio M. Ranzini,
Roman Dischler,
Vahid Aref,
Darko Zibar,
Francesco Da Ros
Abstract:
Low-complexity neural networks (NNs) have successfully been applied for digital signal processing (DSP) in short-reach intensity-modulated directly detected optical links, where chromatic dispersion-induced impairments significantly limit the transmission distance. The NN-based equalizers are usually optimized independently from other DSP components, such as matched filtering. This approach may re…
▽ More
Low-complexity neural networks (NNs) have successfully been applied for digital signal processing (DSP) in short-reach intensity-modulated directly detected optical links, where chromatic dispersion-induced impairments significantly limit the transmission distance. The NN-based equalizers are usually optimized independently from other DSP components, such as matched filtering. This approach may result in lower equalization performance. Alternatively, optimizing a NN equalizer to perform functionalities of multiple DSP blocks may increase transmission reach while kee** the complexity low. In this work, we propose a low-complexity NN that performs samples-to-symbol equalization, meaning that the NN-based equalizer includes match filtering and downsampling. We compare it to a samples-to-sample equalization approach followed by match filtering and downsampling in terms of performance and computational complexity. Both approaches are evaluated using three different types of NNs combined with optical preprocessing. We numerically and experimentally show that the proposed samples-to-symbol equalization approach applied for 32 GBd on-off keying (OOK) signals outperforms the samples-domain alternative kee** the computational complexity low. Additionally, the different types of NN-based equalizers are compared in terms of performance with respect to computational complexity.
△ Less
Submitted 28 August, 2023;
originally announced August 2023.
-
Multimodal Latent Emotion Recognition from Micro-expression and Physiological Signals
Authors:
Liangfei Zhang,
Yifei Qian,
Ognjen Arandjelovic,
Anthony Zhu
Abstract:
This paper discusses the benefits of incorporating multimodal data for improving latent emotion recognition accuracy, focusing on micro-expression (ME) and physiological signals (PS). The proposed approach presents a novel multimodal learning framework that combines ME and PS, including a 1D separable and mixable depthwise inception network, a standardised normal distribution weighted feature fusi…
▽ More
This paper discusses the benefits of incorporating multimodal data for improving latent emotion recognition accuracy, focusing on micro-expression (ME) and physiological signals (PS). The proposed approach presents a novel multimodal learning framework that combines ME and PS, including a 1D separable and mixable depthwise inception network, a standardised normal distribution weighted feature fusion method, and depth/physiology guided attention modules for multimodal learning. Experimental results show that the proposed approach outperforms the benchmark method, with the weighted fusion method and guided attention modules both contributing to enhanced performance.
△ Less
Submitted 23 August, 2023;
originally announced August 2023.
-
Addressing Data Scarcity in Optical Matrix Multiplier Modeling Using Transfer Learning
Authors:
Ali Cem,
Ognjen Jovanovic,
Siqi Yan,
Yunhong Ding,
Darko Zibar,
Francesco Da Ros
Abstract:
We present and experimentally evaluate using transfer learning to address experimental data scarcity when training neural network (NN) models for Mach-Zehnder interferometer mesh-based optical matrix multipliers. Our approach involves pre-training the model using synthetic data generated from a less accurate analytical model and fine-tuning with experimental data. Our investigation demonstrates th…
▽ More
We present and experimentally evaluate using transfer learning to address experimental data scarcity when training neural network (NN) models for Mach-Zehnder interferometer mesh-based optical matrix multipliers. Our approach involves pre-training the model using synthetic data generated from a less accurate analytical model and fine-tuning with experimental data. Our investigation demonstrates that this method yields significant reductions in modeling errors compared to using an analytical model, or a standalone NN model when training data is limited. Utilizing regularization techniques and ensemble averaging, we achieve < 1 dB root-mean-square error on the matrix weights implemented by a 3x3 photonic chip while using only 25% of the available data.
△ Less
Submitted 13 November, 2023; v1 submitted 10 August, 2023;
originally announced August 2023.
-
A White-Box False Positive Adversarial Attack Method on Contrastive Loss Based Offline Handwritten Signature Verification Models
Authors:
Zhongliang Guo,
Weiye Li,
Yifei Qian,
Ognjen Arandjelović,
Lei Fang
Abstract:
In this paper, we tackle the challenge of white-box false positive adversarial attacks on contrastive loss based offline handwritten signature verification models. We propose a novel attack method that treats the attack as a style transfer between closely related but distinct writing styles. To guide the generation of deceptive images, we introduce two new loss functions that enhance the attack su…
▽ More
In this paper, we tackle the challenge of white-box false positive adversarial attacks on contrastive loss based offline handwritten signature verification models. We propose a novel attack method that treats the attack as a style transfer between closely related but distinct writing styles. To guide the generation of deceptive images, we introduce two new loss functions that enhance the attack success rate by perturbing the Euclidean distance between the embedding vectors of the original and synthesized samples, while ensuring minimal perturbations by reducing the difference between the generated image and the original image. Our method demonstrates state-of-the-art performance in white-box attacks on contrastive loss based offline handwritten signature verification models, as evidenced by our experiments. The key contributions of this paper include a novel false positive attack method, two new loss functions, effective style transfer in handwriting styles, and superior performance in white-box false positive attacks compared to other white-box attack methods.
△ Less
Submitted 9 February, 2024; v1 submitted 17 August, 2023;
originally announced August 2023.
-
Literal-Aware Knowledge Graph Embedding for Welding Quality Monitoring: A Bosch Case
Authors:
Zhipeng Tan,
Baifan Zhou,
Zhuoxun Zheng,
Ognjen Savkovic,
Ziqi Huang,
Irlan-Grangel Gonzalez,
Ahmet Soylu,
Evgeny Kharlamov
Abstract:
Recently there has been a series of studies in knowledge graph embedding (KGE), which attempts to learn the embeddings of the entities and relations as numerical vectors and mathematical map**s via machine learning (ML). However, there has been limited research that applies KGE for industrial problems in manufacturing. This paper investigates whether and to what extent KGE can be used for an imp…
▽ More
Recently there has been a series of studies in knowledge graph embedding (KGE), which attempts to learn the embeddings of the entities and relations as numerical vectors and mathematical map**s via machine learning (ML). However, there has been limited research that applies KGE for industrial problems in manufacturing. This paper investigates whether and to what extent KGE can be used for an important problem: quality monitoring for welding in manufacturing industry, which is an impactful process accounting for production of millions of cars annually. The work is in line with Bosch research of data-driven solutions that intends to replace the traditional way of destroying cars, which is extremely costly and produces waste. The paper tackles two very challenging questions simultaneously: how large the welding spot diameter is; and to which car body the welded spot belongs to. The problem setting is difficult for traditional ML because there exist a high number of car bodies that should be assigned as class labels. We formulate the problem as link prediction, and experimented popular KGE methods on real industry data, with consideration of literals. Our results reveal both limitations and promising aspects of adapted KGE methods.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Scaling Data Science Solutions with Semantics and Machine Learning: Bosch Case
Authors:
Baifan Zhou,
Nikolay Nikolov,
Zhuoxun Zheng,
Xianghui Luo,
Ognjen Savkovic,
Dumitru Roman,
Ahmet Soylu,
Evgeny Kharlamov
Abstract:
Industry 4.0 and Internet of Things (IoT) technologies unlock unprecedented amount of data from factory production, posing big data challenges in volume and variety. In that context, distributed computing solutions such as cloud systems are leveraged to parallelise the data processing and reduce computation time. As the cloud systems become increasingly popular, there is increased demand that more…
▽ More
Industry 4.0 and Internet of Things (IoT) technologies unlock unprecedented amount of data from factory production, posing big data challenges in volume and variety. In that context, distributed computing solutions such as cloud systems are leveraged to parallelise the data processing and reduce computation time. As the cloud systems become increasingly popular, there is increased demand that more users that were originally not cloud experts (such as data scientists, domain experts) deploy their solutions on the cloud systems. However, it is non-trivial to address both the high demand for cloud system users and the excessive time required to train them. To this end, we propose SemCloud, a semantics-enhanced cloud system, that couples cloud system with semantic technologies and machine learning. SemCloud relies on domain ontologies and map**s for data integration, and parallelises the semantic data integration and data analysis on distributed computing nodes. Furthermore, SemCloud adopts adaptive Datalog rules and machine learning for automated resource configuration, allowing non-cloud experts to use the cloud system. The system has been evaluated in industrial use case with millions of data, thousands of repeated runs, and domain users, showing promising results.
△ Less
Submitted 2 August, 2023;
originally announced August 2023.
-
Rate Adaptive Geometric Constellation Sha** Using Autoencoders and Many-To-One Map**
Authors:
Metodi P. Yankov,
Ognjen Jovanovic,
Darko Zibar,
Francesco Da Ros
Abstract:
A many-to-one map** geometric constellation sha** scheme is proposed with a fixed modulation format, fixed FEC engine and rate adaptation with an arbitrarily small step. An autoencoder is used to optimize the labelings and constellation points' positions.
A many-to-one map** geometric constellation sha** scheme is proposed with a fixed modulation format, fixed FEC engine and rate adaptation with an arbitrarily small step. An autoencoder is used to optimize the labelings and constellation points' positions.
△ Less
Submitted 19 July, 2023;
originally announced July 2023.
-
Safe Reinforcement Learning for Strategic Bidding of Virtual Power Plants in Day-Ahead Markets
Authors:
Ognjen Stanojev,
Lesia Mitridati,
Riccardo de Nardis di Prata,
Gabriela Hug
Abstract:
This paper presents a novel safe reinforcement learning algorithm for strategic bidding of Virtual Power Plants (VPPs) in day-ahead electricity markets. The proposed algorithm utilizes the Deep Deterministic Policy Gradient (DDPG) method to learn competitive bidding policies without requiring an accurate market model. Furthermore, to account for the complex internal physical constraints of VPPs we…
▽ More
This paper presents a novel safe reinforcement learning algorithm for strategic bidding of Virtual Power Plants (VPPs) in day-ahead electricity markets. The proposed algorithm utilizes the Deep Deterministic Policy Gradient (DDPG) method to learn competitive bidding policies without requiring an accurate market model. Furthermore, to account for the complex internal physical constraints of VPPs we introduce two enhancements to the DDPG method. Firstly, a projection-based safety shield that restricts the agent's actions to the feasible space defined by the non-linear power flow equations and operating constraints of distributed energy resources is derived. Secondly, a penalty for the shield activation in the reward function that incentivizes the agent to learn a safer policy is introduced. A case study based on the IEEE 13-bus network demonstrates the effectiveness of the proposed approach in enabling the agent to learn a highly competitive, safe strategic policy.
△ Less
Submitted 12 September, 2023; v1 submitted 11 July, 2023;
originally announced July 2023.
-
Overview of Deep Learning Methods for Retinal Vessel Segmentation
Authors:
Gorana Gojić,
Ognjen Kundačina,
Dragiša Mišković,
Dinu Dragan
Abstract:
Methods for automated retinal vessel segmentation play an important role in the treatment and diagnosis of many eye and systemic diseases. With the fast development of deep learning methods, more and more retinal vessel segmentation methods are implemented as deep neural networks. In this paper, we provide a brief review of recent deep learning methods from highly influential journals and conferen…
▽ More
Methods for automated retinal vessel segmentation play an important role in the treatment and diagnosis of many eye and systemic diseases. With the fast development of deep learning methods, more and more retinal vessel segmentation methods are implemented as deep neural networks. In this paper, we provide a brief review of recent deep learning methods from highly influential journals and conferences. The review objectives are: (1) to assess the design characteristics of the latest methods, (2) to report and analyze quantitative values of performance evaluation metrics, and (3) to analyze the advantages and disadvantages of the recent solutions.
△ Less
Submitted 1 June, 2023;
originally announced June 2023.
-
Dipolar quantum solids emerging in a Hubbard quantum simulator
Authors:
Lin Su,
Alexander Douglas,
Michal Szurek,
Robin Groth,
S. Furkan Ozturk,
Aaron Krahn,
Anne H. Hébert,
Gregory A. Phelps,
Sepehr Ebadi,
Susannah Dickerson,
Francesca Ferlaino,
Ognjen Marković,
Markus Greiner
Abstract:
In quantum mechanical many-body systems, long-range and anisotropic interactions promote rich spatial structure and can lead to quantum frustration, giving rise to a wealth of complex, strongly correlated quantum phases. Long-range interactions play an important role in nature; however, quantum simulations of lattice systems have largely not been able to realize such interactions. A wide range of…
▽ More
In quantum mechanical many-body systems, long-range and anisotropic interactions promote rich spatial structure and can lead to quantum frustration, giving rise to a wealth of complex, strongly correlated quantum phases. Long-range interactions play an important role in nature; however, quantum simulations of lattice systems have largely not been able to realize such interactions. A wide range of efforts are underway to explore long-range interacting lattice systems using polar molecules, Rydberg atoms, optical cavities, and magnetic atoms. Here, we realize novel quantum phases in a strongly correlated lattice system with long-range dipolar interactions using ultracold magnetic erbium atoms. As we tune the dipolar interaction to be the dominant energy scale in our system, we observe quantum phase transitions from a superfluid into dipolar quantum solids, which we directly detect using quantum gas microscopy with accordion lattices. Controlling the interaction anisotropy by orienting the dipoles enables us to realize a variety of stripe ordered states. Furthermore, by transitioning non-adiabatically through the strongly correlated regime, we observe the emergence of a range of metastable stripe-ordered states. This work demonstrates that novel strongly correlated quantum phases can be realized using long-range dipolar interaction in optical lattices, opening the door to quantum simulations of a wide range of lattice models with long-range and anisotropic interactions.
△ Less
Submitted 3 August, 2023; v1 submitted 1 June, 2023;
originally announced June 2023.
-
Non-adversarial Robustness of Deep Learning Methods for Computer Vision
Authors:
Gorana Gojić,
Vladimir Vincan,
Ognjen Kundačina,
Dragiša Mišković,
Dinu Dragan
Abstract:
Non-adversarial robustness, also known as natural robustness, is a property of deep learning models that enables them to maintain performance even when faced with distribution shifts caused by natural variations in data. However, achieving this property is challenging because it is difficult to predict in advance the types of distribution shifts that may occur. To address this challenge, researche…
▽ More
Non-adversarial robustness, also known as natural robustness, is a property of deep learning models that enables them to maintain performance even when faced with distribution shifts caused by natural variations in data. However, achieving this property is challenging because it is difficult to predict in advance the types of distribution shifts that may occur. To address this challenge, researchers have proposed various approaches, some of which anticipate potential distribution shifts, while others utilize knowledge about the shifts that have already occurred to enhance model generalizability. In this paper, we present a brief overview of the most recent techniques for improving the robustness of computer vision methods, as well as a summary of commonly used robustness benchmark datasets for evaluating the model's performance under data distribution shifts. Finally, we examine the strengths and limitations of the approaches reviewed and identify general trends in deep learning robustness improvement for computer vision.
△ Less
Submitted 24 May, 2023;
originally announced May 2023.
-
Deep Multiple Instance Learning with Distance-Aware Self-Attention
Authors:
Georg Wölflein,
Lucie Charlotte Magister,
Pietro Liò,
David J. Harrison,
Ognjen Arandjelović
Abstract:
Traditional supervised learning tasks require a label for every instance in the training set, but in many real-world applications, labels are only available for collections (bags) of instances. This problem setting, known as multiple instance learning (MIL), is particularly relevant in the medical domain, where high-resolution images are split into smaller patches, but labels apply to the image as…
▽ More
Traditional supervised learning tasks require a label for every instance in the training set, but in many real-world applications, labels are only available for collections (bags) of instances. This problem setting, known as multiple instance learning (MIL), is particularly relevant in the medical domain, where high-resolution images are split into smaller patches, but labels apply to the image as a whole. Recent MIL models are able to capture correspondences between patches by employing self-attention, allowing them to weigh each patch differently based on all other patches in the bag. However, these approaches still do not consider the relative spatial relationships between patches within the larger image, which is especially important in computational pathology. To this end, we introduce a novel MIL model with distance-aware self-attention (DAS-MIL), which explicitly takes into account relative spatial information when modelling the interactions between patches. Unlike existing relative position representations for self-attention which are discrete, our approach introduces continuous distance-dependent terms into the computation of the attention weights, and is the first to apply relative position representations in the context of MIL. We evaluate our model on a custom MNIST-based MIL dataset that requires the consideration of relative spatial information, as well as on CAMELYON16, a publicly available cancer metastasis detection dataset, where we achieve a test AUROC score of 0.91. On both datasets, our model outperforms existing MIL approaches that employ absolute positional encodings, as well as existing relative position representation schemes applied to MIL. Our code is available at https://anonymous.4open.science/r/das-mil.
△ Less
Submitted 20 May, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Data-Driven Modeling of Directly-Modulated Lasers
Authors:
Sergio Hernandez Fernandez,
Christophe Peucheret,
Ognjen Jovanovic,
Francesco Da Ros,
Darko Zibar
Abstract:
The end-to-end optimization of links based on directly-modulated lasers may require an analytically differentiable channel. We overcome this problem by develo** and comparing differentiable laser models based on machine learning techniques.
The end-to-end optimization of links based on directly-modulated lasers may require an analytically differentiable channel. We overcome this problem by develo** and comparing differentiable laser models based on machine learning techniques.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Multi-Wavelength Transponders for High-capacity Optical Networks: A Physical-layer-aware Network Planning Study
Authors:
Jasper Müller,
Ognjen Jovanovic,
Tobias Fehenberger,
Gabriele Di Rosa,
Jörg-Peter Elbers,
Carmen Mas-Machuca
Abstract:
Continued cost- and power-efficient capacity scaling in optical networks is imperative to keep pace with ever-increasing traffic demands. In this paper, we investigate multi-wavelength transponders as a potential way forward. Suitable system architectures and realistic specifications of multi-wavelength transponders are identified and analyzed in terms of transmit OSNR penalties and spectral const…
▽ More
Continued cost- and power-efficient capacity scaling in optical networks is imperative to keep pace with ever-increasing traffic demands. In this paper, we investigate multi-wavelength transponders as a potential way forward. Suitable system architectures and realistic specifications of multi-wavelength transponders are identified and analyzed in terms of transmit OSNR penalties and spectral constraints. We investigate the performance for different specifications as compared to single-wavelength transponders in a network planning study on two network topologies, develo** guidelines for multi-wavelength transponders specifications and their potential benefits. The studies show a reduction in the number of required lasers of up to 83% at the expense of a slight increase in number of lightpaths, demonstrating the potential for significant cost savings and efficiency improvements.
△ Less
Submitted 12 May, 2023;
originally announced May 2023.
-
Non-strict plurisubharmonicity of energy on Teichmüller space
Authors:
Ognjen Tošić
Abstract:
For an irreducible representation $ρ:π_1(Σ_g)\to\mathrm{GL}(n,\mathbb{C})$ there is an energy functional $\mathrm{E}_ρ:\mathcal{T}_g\to\mathbb{R}$, defined on Teichmüller space by taking the energy of the associated equivariant harmonic map into the symmetric space $\mathrm{GL}(n,\mathbb{C})/\mathrm{U}(n)$. It follows from a result of Toledo that $\mathrm{E}_ρ$ is plurisubharmonic, i.e. its Levi f…
▽ More
For an irreducible representation $ρ:π_1(Σ_g)\to\mathrm{GL}(n,\mathbb{C})$ there is an energy functional $\mathrm{E}_ρ:\mathcal{T}_g\to\mathbb{R}$, defined on Teichmüller space by taking the energy of the associated equivariant harmonic map into the symmetric space $\mathrm{GL}(n,\mathbb{C})/\mathrm{U}(n)$. It follows from a result of Toledo that $\mathrm{E}_ρ$ is plurisubharmonic, i.e. its Levi form is positive semi-definite. We study the kernel of this Levi form, and relate it to the $\mathbb{C}^*$ action on the moduli space of Higgs bundles. We also show that the points in $\mathcal{T}_g$ where strict plurisubharmonicity fails (i.e. this kernel is non-zero) are critical points of the Hitchin fibration. When $n\geq 2$ and $g\geq 3$, we show that for a generic choice $(S,ρ)$, the map $\mathrm{E}_ρ$ is strictly plurisubharmonic. We also describe the kernel of the Levi form for $n=1$.
△ Less
Submitted 1 February, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Graph Neural Networks on Factor Graphs for Robust, Fast, and Scalable Linear State Estimation with PMUs
Authors:
Ognjen Kundacina,
Mirsad Cosovic,
Dragisa Miskovic,
Dejan Vukobratovic
Abstract:
As phasor measurement units (PMUs) become more widely used in transmission power systems, a fast state estimation (SE) algorithm that can take advantage of their high sample rates is needed. To accomplish this, we present a method that uses graph neural networks (GNNs) to learn complex bus voltage estimates from PMU voltage and current measurements. We propose an original implementation of GNNs ov…
▽ More
As phasor measurement units (PMUs) become more widely used in transmission power systems, a fast state estimation (SE) algorithm that can take advantage of their high sample rates is needed. To accomplish this, we present a method that uses graph neural networks (GNNs) to learn complex bus voltage estimates from PMU voltage and current measurements. We propose an original implementation of GNNs over the power system's factor graph to simplify the integration of various types and quantities of measurements on power system buses and branches. Furthermore, we augment the factor graph to improve the robustness of GNN predictions. This model is highly efficient and scalable, as its computational complexity is linear with respect to the number of nodes in the power system. Training and test examples were generated by randomly sampling sets of power system measurements and annotated with the exact solutions of linear SE with PMUs. The numerical results demonstrate that the GNN model provides an accurate approximation of the SE solutions. Furthermore, errors caused by PMU malfunctions or communication failures that would normally make the SE problem unobservable have a local effect and do not deteriorate the results in the rest of the power system.
△ Less
Submitted 28 April, 2023;
originally announced April 2023.
-
Tractable Identification of Electric Distribution Networks
Authors:
Ognjen Stanojev,
Lucien Werner,
Steven Low,
Gabriela Hug
Abstract:
The identification of distribution network topology and parameters is a critical problem that lays the foundation for improving network efficiency, enhancing reliability, and increasing its capacity to host distributed energy resources. Network identification problems often involve estimating a large number of parameters based on highly correlated measurements, resulting in an ill-conditioned and…
▽ More
The identification of distribution network topology and parameters is a critical problem that lays the foundation for improving network efficiency, enhancing reliability, and increasing its capacity to host distributed energy resources. Network identification problems often involve estimating a large number of parameters based on highly correlated measurements, resulting in an ill-conditioned and computationally demanding estimation process. We address these challenges by proposing two admittance matrix estimation methods. In the first method, we use the eigendecomposition of the admittance matrix to generalize the notion of stationarity to electrical signals and demonstrate how the stationarity property can be used to facilitate a maximum a posteriori estimation procedure. We relax the stationarity assumption in the second proposed method by employing Linear Minimum Mean Square Error (LMMSE) estimation. Since LMMSE estimation is often ill-conditioned, we introduce an approximate well-conditioned solution based on eigenvalue truncation. Our quantitative results demonstrate the improvement in computational efficiency compared to the state-of-the-art methods while preserving the estimation accuracy.
△ Less
Submitted 22 August, 2023; v1 submitted 4 April, 2023;
originally announced April 2023.
-
Spin Squeezing by Rydberg Dressing in an Array of Atomic Ensembles
Authors:
Jacob A. Hines,
Shankari V. Rajagopal,
Gabriel L. Moreau,
Michael D. Wahrman,
Neomi A. Lewis,
Ognjen Marković,
Monika Schleier-Smith
Abstract:
We report on the creation of an array of spin-squeezed ensembles of cesium atoms via Rydberg dressing, a technique that offers optical control over local interactions between neutral atoms. We optimize the coherence of the interactions by a stroboscopic dressing sequence that suppresses super-Poissonian loss. We thereby prepare squeezed states of $N=200$ atoms with a metrological squeezing paramet…
▽ More
We report on the creation of an array of spin-squeezed ensembles of cesium atoms via Rydberg dressing, a technique that offers optical control over local interactions between neutral atoms. We optimize the coherence of the interactions by a stroboscopic dressing sequence that suppresses super-Poissonian loss. We thereby prepare squeezed states of $N=200$ atoms with a metrological squeezing parameter $ξ^2 = 0.77(9)$ quantifying the reduction in phase variance below the standard quantum limit. We realize metrological gain across three spatially separated ensembles in parallel, with the strength of squeezing controlled by the local intensity of the dressing light. Our method can be applied to enhance the precision of tests of fundamental physics based on arrays of atomic clocks and to enable quantum-enhanced imaging of electromagnetic fields.
△ Less
Submitted 23 August, 2023; v1 submitted 15 March, 2023;
originally announced March 2023.
-
Supporting Future Electrical Utilities: Using Deep Learning Methods in EMS and DMS Algorithms
Authors:
Ognjen Kundacina,
Gorana Gojic,
Mile Mitrovic,
Dragisa Miskovic,
Dejan Vukobratovic
Abstract:
Electrical power systems are increasing in size, complexity, as well as dynamics due to the growing integration of renewable energy resources, which have sporadic power generation. This necessitates the development of near real-time power system algorithms, demanding lower computational complexity regarding the power system size. Considering the growing trend in the collection of historical measur…
▽ More
Electrical power systems are increasing in size, complexity, as well as dynamics due to the growing integration of renewable energy resources, which have sporadic power generation. This necessitates the development of near real-time power system algorithms, demanding lower computational complexity regarding the power system size. Considering the growing trend in the collection of historical measurement data and recent advances in the rapidly develo** deep learning field, the main goal of this paper is to provide a review of recent deep learning-based power system monitoring and optimization algorithms. Electrical utilities can benefit from this review by re-implementing or enhancing the algorithms traditionally used in energy management systems (EMS) and distribution management systems (DMS).
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
Scalability and Sample Efficiency Analysis of Graph Neural Networks for Power System State Estimation
Authors:
Ognjen Kundacina,
Gorana Gojic,
Mirsad Cosovic,
Dragisa Miskovic,
Dejan Vukobratovic
Abstract:
Data-driven state estimation (SE) is becoming increasingly important in modern power systems, as it allows for more efficient analysis of system behaviour using real-time measurement data. This paper thoroughly evaluates a phasor measurement unit-only state estimator based on graph neural networks (GNNs) applied over factor graphs. To assess the sample efficiency of the GNN model, we perform multi…
▽ More
Data-driven state estimation (SE) is becoming increasingly important in modern power systems, as it allows for more efficient analysis of system behaviour using real-time measurement data. This paper thoroughly evaluates a phasor measurement unit-only state estimator based on graph neural networks (GNNs) applied over factor graphs. To assess the sample efficiency of the GNN model, we perform multiple training experiments on various training set sizes. Additionally, to evaluate the scalability of the GNN model, we conduct experiments on power systems of various sizes. Our results show that the GNN-based state estimator exhibits high accuracy and efficient use of data. Additionally, it demonstrated scalability in terms of both memory usage and inference time, making it a promising solution for data-driven SE in modern power systems.
△ Less
Submitted 2 March, 2023; v1 submitted 28 February, 2023;
originally announced March 2023.
-
GP CC-OPF: Gaussian Process based optimization tool for Chance-Constrained Optimal Power Flow
Authors:
Mile Mitrovic,
Ognjen Kundacina,
Aleksandr Lukashevich,
Petr Vorobev,
Vladimir Terzija,
Yury Maximov,
Deepjyoti Deka
Abstract:
The Gaussian Process (GP) based Chance-Constrained Optimal Power Flow (CC-OPF) is an open-source Python code developed for solving economic dispatch (ED) problem in modern power grids. In recent years, integrating a significant amount of renewables into a power grid causes high fluctuations and thus brings a lot of uncertainty to power grid operations. This fact makes the conventional model-based…
▽ More
The Gaussian Process (GP) based Chance-Constrained Optimal Power Flow (CC-OPF) is an open-source Python code developed for solving economic dispatch (ED) problem in modern power grids. In recent years, integrating a significant amount of renewables into a power grid causes high fluctuations and thus brings a lot of uncertainty to power grid operations. This fact makes the conventional model-based CC-OPF problem non-convex and computationally complex to solve. The developed tool presents a novel data-driven approach based on the GP regression model for solving the CC-OPF problem with a trade-off between complexity and accuracy. The proposed approach and developed software can help system operators to effectively perform ED optimization in the presence of large uncertainties in the power grid.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Comparison of Single-Wavelength and Multi-Wavelength Transponders in a Physical-layer-aware Network Planning Study
Authors:
Jasper Müller,
Ognjen Jovanovic,
Carmen Mas-Machuca,
Helmut Griesser,
Tobias Fehenberger,
Jörg-Peter Elbers
Abstract:
Based on suitable system architectures and realistic specifications, transmit OSNR penalties and spectral constraints of multi-wavelength transponders are identified and analyzed in a network study. We report up to 70% less required lasers at the expense of a slight increase in number of lightpaths.
Based on suitable system architectures and realistic specifications, transmit OSNR penalties and spectral constraints of multi-wavelength transponders are identified and analyzed in a network study. We report up to 70% less required lasers at the expense of a slight increase in number of lightpaths.
△ Less
Submitted 16 February, 2023;
originally announced February 2023.
-
Rate Adaptive Autoencoder-based Geometric Constellation Sha**
Authors:
Ognjen Jovanovic,
Metodi P. Yankov,
Francesco Da Ros,
Darko Zibar
Abstract:
An autoencoder is used to optimize bit-to-symbol map**s for geometric constellation sha**. The map**s allow for net rate adaptivity without additional hardware complexity, while achieving up to 300km of transmission distance compared to uniform QAM.
An autoencoder is used to optimize bit-to-symbol map**s for geometric constellation sha**. The map**s allow for net rate adaptivity without additional hardware complexity, while achieving up to 300km of transmission distance compared to uniform QAM.
△ Less
Submitted 28 November, 2022;
originally announced January 2023.
-
Reservoir Computing-based Multi-Symbol Equalization for PAM 4 Short-reach Transmission
Authors:
Yevhenii Osadchuk,
Ognjen Jovanovic,
Darko Zibar,
Francesco Da Ros
Abstract:
We propose spectrum-sliced reservoir computer-based (RC) multi-symbol equalization for 32-GBd PAM4 transmission. RC with 17 symbols at the output achieves an order of magnitude reduction in multiplications/symbol versus single output case while maintaining simple training.
We propose spectrum-sliced reservoir computer-based (RC) multi-symbol equalization for 32-GBd PAM4 transmission. RC with 17 symbols at the output achieves an order of magnitude reduction in multiplications/symbol versus single output case while maintaining simple training.
△ Less
Submitted 29 November, 2022;
originally announced December 2022.
-
Data-efficient Modeling of Optical Matrix Multipliers Using Transfer Learning
Authors:
Ali Cem,
Ognjen Jovanovic,
Siqi Yan,
Yunhong Ding,
Darko Zibar,
Francesco Da Ros
Abstract:
We demonstrate transfer learning-assisted neural network models for optical matrix multipliers with scarce measurement data. Our approach uses <10\% of experimental data needed for best performance and outperforms analytical models for a Mach-Zehnder interferometer mesh.
We demonstrate transfer learning-assisted neural network models for optical matrix multipliers with scarce measurement data. Our approach uses <10\% of experimental data needed for best performance and outperforms analytical models for a Mach-Zehnder interferometer mesh.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.