-
Knowledge Graph Enhanced Retrieval-Augmented Generation for Failure Mode and Effects Analysis
Authors:
Lukas Bahr,
Christoph Wehner,
Judith Wewerka,
José Bittencourt,
Ute Schmid,
Rüdiger Daub
Abstract:
Failure mode and effects analysis (FMEA) is a critical tool for mitigating potential failures, particularly during ramp-up phases of new products. However, its effectiveness is often limited by the missing reasoning capabilities of the FMEA tools, which are usually tabular in structure. Meanwhile, large language models (LLMs) offer novel prospects for fine-tuning on custom datasets for reasoning within FMEA contexts. However, LLMs face challenges in tasks that require factual knowledge, a gap that retrieval-augmented generation (RAG) approaches aim to fill. RAG retrieves information from a non-parametric data store and uses a language model to generate responses. Building on this idea, we propose to advance the non-parametric data store with a knowledge graph (KG). By enhancing the RAG framework with a KG, our objective is to leverage analytical and semantic question-answering capabilities on FMEA data. This paper contributes by presenting a new ontology for FMEA observations, an algorithm for creating vector embeddings from the FMEA KG, and a KG-enhanced RAG framework. Our approach is validated through a human study, and we measure the performance of the context retrieval in terms of recall and precision.
Submitted 8 July, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
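The abstract above reports context retrieval recall and precision. As a minimal sketch of how these two measures are commonly computed for a single query, assuming retrieved and relevant contexts are identified by hashable IDs (the function name and the IDs are hypothetical and not taken from the paper):

```python
def retrieval_precision_recall(retrieved, relevant):
    """Return (precision, recall) for one query.

    retrieved: IDs of the contexts returned by the retriever
    relevant:  IDs of the contexts judged relevant (ground truth)
    Both arguments are illustrative assumptions, not the paper's data format.
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = retrieved_set & relevant_set
    # Precision: share of retrieved contexts that are relevant.
    precision = len(hits) / len(retrieved_set) if retrieved_set else 0.0
    # Recall: share of relevant contexts that were retrieved.
    recall = len(hits) / len(relevant_set) if relevant_set else 0.0
    return precision, recall


precision, recall = retrieval_precision_recall(
    retrieved=["n1", "n2", "n3", "n4"],
    relevant=["n1", "n3", "n5"],
)
```

Here two of the four retrieved contexts are relevant, and two of the three relevant contexts were retrieved.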
-
CoLa-DCE -- Concept-guided Latent Diffusion Counterfactual Explanations
Authors:
Franz Motzkus,
Christian Hellert,
Ute Schmid
Abstract:
Recent advancements in generative AI have introduced novel prospects and practical implementations. Diffusion models in particular show their strength in generating diverse and, at the same time, realistic features, positioning them well for generating counterfactual explanations for computer vision models. Answering "what if" questions of what needs to change to make an image classifier change its prediction, counterfactual explanations align well with human understanding and consequently help in making model behavior more comprehensible. Current methods succeed in generating authentic counterfactuals, but lack transparency as feature changes are not directly perceivable. To address this limitation, we introduce Concept-guided Latent Diffusion Counterfactual Explanations (CoLa-DCE). CoLa-DCE generates concept-guided counterfactuals for any classifier with a high degree of control regarding concept selection and spatial conditioning. The counterfactuals offer increased granularity through minimal feature changes. The reference feature visualization ensures better comprehensibility, while the feature localization provides increased transparency of "where" changed "what". We demonstrate the advantages of our approach in minimality and comprehensibility across multiple image classification models and datasets and provide insights into how our CoLa-DCE explanations help comprehend model errors like misclassification cases.
Submitted 3 June, 2024;
originally announced June 2024.
-
Locally Testing Model Detections for Semantic Global Concepts
Authors:
Franz Motzkus,
Georgii Mikriukov,
Christian Hellert,
Ute Schmid
Abstract:
Ensuring the quality of black-box Deep Neural Networks (DNNs) has become ever more significant, especially in safety-critical domains such as automated driving. While global concept encodings generally enable a user to test a model for a specific concept, linking global concept encodings to the local processing of single network inputs reveals their strengths and limitations. Our proposed framework global-to-local Concept Attribution (glCA) uses approaches from local (why a specific prediction originates) and global (how a model works generally) eXplainable Artificial Intelligence (xAI) to test DNNs for a predefined semantical concept locally. The approach allows for conditioning local, post-hoc explanations on predefined semantic concepts encoded as linear directions in the model's latent space. Pixel-exact scoring concerning the global concept usage assists the tester in further understanding the model processing of single data points for the selected concept. Our approach has the advantage of fully covering the model-internal encoding of the semantic concept and allowing the localization of relevant concept-related information. The results show major differences in the local perception and usage of individual global concept encodings and demand for further investigations regarding obtaining thorough semantic concept encodings.
Submitted 29 May, 2024; v1 submitted 27 May, 2024;
originally announced May 2024.
-
When a Relation Tells More Than a Concept: Exploring and Evaluating Classifier Decisions with CoReX
Authors:
Bettina Finzel,
Patrick Hilme,
Johannes Rabold,
Ute Schmid
Abstract:
Explanations for Convolutional Neural Networks (CNNs) based on relevance of input pixels might be too unspecific to evaluate which and how input features impact model decisions. Especially in complex real-world domains like biomedicine, the presence of specific concepts (e.g., a certain type of cell) and of relations between concepts (e.g., one cell type is next to another) might be discriminative between classes (e.g., different types of tissue). Pixel relevance is not expressive enough to convey this type of information. In consequence, model evaluation is limited and relevant aspects present in the data and influencing the model decisions might be overlooked. This work presents a novel method to explain and evaluate CNN models, which uses a concept- and relation-based explainer (CoReX). It explains the predictive behavior of a model on a set of images by masking (ir-)relevant concepts from the decision-making process and by constraining relations in a learned interpretable surrogate model. We test our approach with several image data sets and CNN architectures. Results show that CoReX explanations are faithful to the CNN model in terms of predictive outcomes. We further demonstrate that CoReX is a suitable tool for evaluating CNNs supporting identification and re-classification of incorrect or ambiguous classifications.
Submitted 2 May, 2024;
originally announced May 2024.
-
Can humans teach machines to code?
Authors:
Céline Hocquette,
Johannes Langer,
Andrew Cropper,
Ute Schmid
Abstract:
The goal of inductive program synthesis is for a machine to automatically generate a program from user-supplied examples of the desired behaviour of the program. A key underlying assumption is that humans can provide examples of sufficient quality to teach a concept to a machine. However, as far as we are aware, this assumption lacks both empirical and theoretical support. To address this limitation, we explore the question 'Can humans teach machines to code?'. To answer this question, we conduct a study where we ask humans to generate examples for six programming tasks, such as finding the maximum element of a list. We compare the performance of a program synthesis system trained on (i) human-provided examples, (ii) randomly sampled examples, and (iii) expert-provided examples. Our results show that, on most of the tasks, non-expert participants did not provide sufficient examples for a program synthesis system to learn an accurate program. Our results also show that non-experts need to provide more examples than both randomly sampled and expert-provided examples.
Submitted 30 April, 2024;
originally announced April 2024.
-
Comprehensible Artificial Intelligence on Knowledge Graphs: A survey
Authors:
Simon Schramm,
Christoph Wehner,
Ute Schmid
Abstract:
Artificial Intelligence applications gradually move outside the safe walls of research labs and invade our daily lives. This is also true for Machine Learning methods on Knowledge Graphs, which has led to a steady increase in their application since the beginning of the 21st century. However, in many applications, users require an explanation of the Artificial Intelligence's decision. This led to increased demand for Comprehensible Artificial Intelligence. Knowledge Graphs epitomize fertile soil for Comprehensible Artificial Intelligence, due to their ability to display connected data, i.e. knowledge, in a human- as well as machine-readable way. This survey gives a short history of Comprehensible Artificial Intelligence on Knowledge Graphs. Furthermore, we contribute by arguing that the concept Explainable Artificial Intelligence is overloaded and overlapping with Interpretable Machine Learning. By introducing the parent concept Comprehensible Artificial Intelligence, we provide a clear-cut distinction of both concepts while accounting for their similarities. Thus, we provide in this survey a case for Comprehensible Artificial Intelligence on Knowledge Graphs consisting of Interpretable Machine Learning on Knowledge Graphs and Explainable Artificial Intelligence on Knowledge Graphs. This leads to the introduction of a novel taxonomy for Comprehensible Artificial Intelligence on Knowledge Graphs. In addition, a comprehensive overview of the research on Comprehensible Artificial Intelligence on Knowledge Graphs is presented and put into the context of the taxonomy. Finally, research gaps in the field of Comprehensible Artificial Intelligence on Knowledge Graphs are identified for future research.
Submitted 4 April, 2024;
originally announced April 2024.
-
A Hybrid Delay Model for Interconnected Multi-Input Gates
Authors:
Arman Ferdowsi,
Matthias Függer,
Josef Salzmann,
Ulrich Schmid
Abstract:
Dynamic digital timing analysis is a less accurate but fast alternative to highly accurate but slow analog simulations of digital circuits. It relies on gate delay models, which allow the determination of input-to-output delays of a gate on a per-transition basis. Accurate delay models not only consider the effect of preceding output transitions here but also delay variations induced by multi-input switching (MIS) effects in the case of multi-input gates. Starting out from a first-order hybrid delay model for CMOS two-input NOR gates, we develop a hybrid delay model for Muller C gates and show how to augment these models and their analytic delay formulas by a first-order interconnect. Moreover, we conduct a systematic evaluation of the resulting modeling accuracy: Using SPICE simulations, we quantify the MIS effects on the gate delays under various wire lengths, load capacitances, and input strengths for two different CMOS technologies, comparing these results to the predictions of appropriately parameterized versions of our new gate delay models. Overall, our experimental results reveal that they capture all MIS effects with a surprisingly good accuracy despite being first-order only.
Submitted 1 July, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Faithful Dynamic Timing Analysis of Digital Circuits Using Continuous Thresholded Mode-Switched ODEs
Authors:
Arman Ferdowsi,
Matthias Függer,
Thomas Nowak,
Michael Drmota,
Ulrich Schmid
Abstract:
Thresholded hybrid systems are restricted dynamical systems, where the current mode, and hence the ODE system describing its behavior, is solely determined by externally supplied digital input signals and where the only output signals are digital ones generated by comparing an internal state variable to a threshold value. An attractive feature of such systems is easy composition, which is facilitated by their purely digital interface. A particularly promising application domain of thresholded hybrid systems is digital integrated circuits: Modern digital circuit design considers them as a composition of millions and even billions of elementary logic gates, like inverters, NOR and NAND gates. Since every such logic gate is eventually implemented as an electronic circuit, however, which exhibits a behavior that is governed by some ODE system, thresholded hybrid systems are ideally suited for making the transition from the analog to the digital world rigorous.
In this paper, we prove that the mapping from digital input signals to digital output signals is continuous for a large class of thresholded hybrid systems. Moreover, we show that, under some mild conditions regarding causality, this continuity also continues to hold for arbitrary compositions, which in turn guarantees that the composition faithfully captures the analog reality. By applying our generic results to some recently developed thresholded hybrid gate models, both for single-input single-output gates like inverters and for a two-input CMOS NOR gate, we show that they are continuous. Moreover, we provide a novel thresholded hybrid model for the two-input NOR gate, which is not only continuous but also, unlike the existing one, faithfully models all multi-input switching effects.
Submitted 7 March, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
Microelectronic readout of a diamond quantum sensor
Authors:
Daniel Wirtitsch,
Georg Wachter,
Sarah Reisenbauer,
Johannes Schalko,
Ulrich Schmid,
Andrea Fant,
Luca Sant,
Michael Trupke
Abstract:
Quantum sensors based on the nitrogen-vacancy (NV) centre in diamond are rapidly advancing from scientific exploration towards the first generation of commercial applications. While significant progress has been made in developing suitable methods for the manipulation of the NV centre spin state, the detection of the defect luminescence has so far limited the performance of miniaturized sensor architectures. The recent development of photoelectric detection of the NV centre's spin state offers a path to circumvent these limitations, but has to date required research-grade low current amplifiers to detect the picoampere-scale currents obtained from these systems. Here we report on the photoelectric detection of magnetic resonance (PDMR) with NV ensembles using a complementary metal-oxide semiconductor (CMOS) device. The integrated circuit delivers a digitized output of the diamond sensor with low noise and 50 femtoampere resolution. This integration provides the last missing component on the path to a compact, diamond-based quantum sensor. The device is suited for continuous wave (CW) as well as pulsed operation. We demonstrate its functionality with DC and AC magnetometry up to several megahertz, coherent spin rotation and multi-axial decoupling sequences for quantum sensing.
Submitted 6 March, 2024; v1 submitted 5 March, 2024;
originally announced March 2024.
-
A Logic for Repair and State Recovery in Byzantine Fault-tolerant Multi-agent Systems
Authors:
Hans van Ditmarsch,
Krisztina Fruzsa,
Roman Kuznets,
Ulrich Schmid
Abstract:
We provide an epistemic logical language and semantics for the modeling and analysis of byzantine fault-tolerant multi-agent systems. This not only facilitates reasoning about the agents' fault status but also supports model updates for implementing repair and state recovery. For each agent, besides the standard knowledge modality our logic provides an additional modality called hope, which is capable of expressing that the agent is correct (not faulty), and also dynamic modalities enabling change of the agents' correctness status. These dynamic modalities are interpreted as model updates that come in three flavours: fully public, more private, or involving factual change. We provide complete axiomatizations for all these variants in the form of reduction systems: formulas with dynamic modalities are equivalent to formulas without. Therefore, they have the same expressivity as the logic of knowledge and hope. Multiple examples are provided to demonstrate the utility and flexibility of our logic for modeling a wide range of repair and state recovery techniques that have been implemented in the context of fault-detection, isolation, and recovery (FDIR) approaches in fault-tolerant distributed computing with byzantine agents.
Submitted 27 June, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
Metal Nanoparticle-Functionalized Three-Dimensional Graphene: a versatile platform towards sensors and energy-related applications
Authors:
Emanuele Pompei,
Ylea Vlamidis,
Letizia Ferbel,
Valentina Zannier,
Silvia Rubini,
Daniel Arenas Esteban,
Sara Bals,
Carmela Marinelli,
Georg Pfusterschmied,
Markus Leitgeb,
Ulrich Schmid,
Stefan Heun,
Stefano Veronesi
Abstract:
We demonstrate the first successful functionalization of epitaxial three-dimensional graphene with metal nanoparticles. The functionalization is obtained by immersing the 3D graphene in a nanoparticle colloidal solution. This method is versatile: it is demonstrated here for gold and palladium, but can be extended to other types and shapes of nanoparticles. We have measured the nanoparticle density on the top surface and in the porous layer volume by Scanning Electron Microscopy and Scanning Transmission Electron Microscopy. Samples exhibit a high coverage of nanoparticles with minimal clustering. High-quality graphene has been demonstrated to promote the functionalization, leading to higher nanoparticle density, both on the surface and in the pores. X-ray Photoelectron Spectroscopy allowed us to verify the absence of contamination after the functionalization process. Moreover, it confirmed the thermal stability of the Au- and Pd-functionalized three-dimensional graphene up to 530°C. Our approach opens up new avenues for utilizing three-dimensional graphene as a versatile platform for catalytic applications, sensors, and energy storage and conversion.
Submitted 25 October, 2023;
originally announced October 2023.
-
Network Abstractions for Characterizing Communication Requirements in Asynchronous Distributed Systems
Authors:
Hugo Rincon Galeana,
Ulrich Schmid
Abstract:
Whereas distributed computing research has been very successful in exploring the solvability/impossibility border of distributed computing problems like consensus in representative classes of computing models with respect to model parameters like failure bounds, this is not the case for characterizing necessary and sufficient communication requirements. In this paper, we introduce network abstractions as a novel approach for modeling communication requirements in asynchronous distributed systems. A network abstraction of a run is a sequence of directed graphs on the set of processes, where the $i$-th graph specifies some "potential" message chains that can be guaranteed to arise in the $i$-th portion of the run. Formally, they are defined via associating message sending times with the end-to-end delays that would arise if the message was indeed sent by the sender's protocol. Network abstractions also allow reasoning about future causal cones that might arise in a run, hence also facilitate reasoning about liveness properties, and are inherently compatible with temporal epistemic reasoning frameworks. We demonstrate the utility of our approach by providing necessary and sufficient network abstractions for solving the canonical firing rebels with relay (FRR) problem, and variants thereof, in asynchronous message-passing systems with up to $f$ byzantine processes connected via point-to-point links. FRR is not only a basic primitive in clock synchronization and consensus algorithms, but also integrates several distributed computing problems, namely triggering events, agreement and even stabilizing agreement, in a single problem instance.
Submitted 23 May, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Three-dimensional graphene on a nano-porous 4H-SiC backbone: a novel material for food sensing applications
Authors:
Stefano Veronesi,
Ylea Vlamidis,
Letizia Ferbel,
Carmela Marinelli,
Chiara Sanmartin,
Isabella Taglieri,
Georg Pfusterschmied,
Markus Leitgeb,
Ulrich Schmid,
Fabio Mencarelli,
Stefan Heun
Abstract:
Sensors which are sensitive to volatile organic compounds, and thus able to monitor the conservation state of food, are precious because they work non-destructively and avoid direct contact with the food, ensuring hygienic conditions. In particular, the monitoring of rancidity would solve a widespread issue in food storage. The sensor discussed here is produced utilizing a novel three-dimensional arrangement of graphene, which is grown on a crystalline silicon carbide (SiC) wafer previously porousified by chemical etching. This approach allows a very high surface-to-volume ratio. Furthermore, the structure of the sensor surface features a large number of edges, dangling bonds, and active sites, which make the sensor, on a chemically robust skeleton, chemically active, particularly to hydrogenated molecules. The interaction of the sensor with such compounds is read out by measuring the sensor resistance in a four-wire configuration. The sensor performance has been assessed on three hazelnut samples: sound hazelnuts, spoiled hazelnuts, and stink bug hazelnuts. A resistance variation of about ΔR = 0.13 (0.02) Ohm between sound and damaged hazelnuts has been detected. Our measurements confirm the ability of the sensor to discriminate between sound and damaged hazelnuts. The sensor signal is stable for days, providing the possibility to use this sensor for the monitoring of the storage state of fats and foods in general.
Submitted 23 September, 2023;
originally announced September 2023.
-
Explaining with Attribute-based and Relational Near Misses: An Interpretable Approach to Distinguishing Facial Expressions of Pain and Disgust
Authors:
Bettina Finzel,
Simon P. Kuhn,
David E. Tafler,
Ute Schmid
Abstract:
Explaining concepts by contrasting examples is an efficient and convenient way of giving insights into the reasons behind a classification decision. This is of particular interest in decision-critical domains, such as medical diagnostics. One particularly challenging use case is to distinguish facial expressions of pain and other states, such as disgust, due to high similarity of manifestation. In this paper, we present an approach for generating contrastive explanations to explain facial expressions of pain and disgust shown in video sequences. We implement and compare two approaches for contrastive explanation generation. The first approach explains a specific pain instance in contrast to the most similar disgust instance(s) based on the occurrence of facial expressions (attributes). The second approach takes into account which temporal relations hold between intervals of facial expressions within a sequence (relations). The input to our explanation generation approach is the output of an interpretable rule-based classifier for pain and disgust. We utilize two different similarity metrics to determine near misses and far misses as contrasting instances. Our results show that near miss explanations are shorter than far miss explanations, independent of the applied similarity metric. The outcome of our evaluation indicates that pain and disgust can be distinguished with the help of temporal relations. We currently plan experiments to evaluate how the explanations help in teaching concepts and how they could be enhanced by further modalities and interaction.
Submitted 27 August, 2023;
originally announced August 2023.
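The attribute-based selection of near and far misses described in the abstract above can be illustrated with a small sketch. It assumes Jaccard similarity over sets of observed facial-expression attributes; the abstract does not name the two similarity metrics used, so this choice, the function names, and the attribute labels are illustrative assumptions only.

```python
def jaccard(a, b):
    """Jaccard similarity of two attribute sets (assumed metric, not the paper's)."""
    a, b = set(a), set(b)
    union = a | b
    # Two empty sets are treated as identical.
    return len(a & b) / len(union) if union else 1.0


def near_and_far_miss(target, contrast_instances):
    """Return (near_miss, far_miss) from instances of the contrasting class.

    The near miss is the contrasting instance most similar to the target,
    the far miss the least similar one.
    """
    ranked = sorted(
        contrast_instances,
        key=lambda inst: jaccard(target, inst),
        reverse=True,
    )
    return ranked[0], ranked[-1]


# Hypothetical attribute sets: one pain instance, two disgust instances.
pain = {"AU4", "AU6", "AU7"}
disgust = [{"AU4", "AU6", "AU9"}, {"AU9", "AU10"}]
near, far = near_and_far_miss(pain, disgust)
# Attributes present in the target but absent from the near miss form
# a simple contrastive explanation.
explanation = pain - near
```

In this toy data the first disgust instance shares two of four distinct attributes with the pain instance and is therefore the near miss, and the explanation reduces to the single differing attribute.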
-
Task Planning Support for Arborists and Foresters: Comparing Deep Learning Approaches for Tree Inventory and Tree Vitality Assessment Based on UAV-Data
Authors:
Jonas-Dario Troles,
Richard Nieding,
Sonia Simons,
Ute Schmid
Abstract:
Climate crisis and correlating prolonged, more intense periods of drought threaten tree health in cities and forests. In consequence, arborists and foresters suffer from increasing workloads and, in the best case, a consistent but often declining workforce. To optimise workflows and increase productivity, we propose a novel open-source end-to-end approach that generates helpful information and improves task planning of those who care for trees in and around cities. Our approach is based on RGB and multispectral UAV data, which is used to create tree inventories of city parks and forests and to deduce tree vitality assessments through statistical indices and Deep Learning. Due to EU restrictions regarding flying drones in urban areas, we will also use multispectral satellite data and fifteen soil moisture sensors to extend our tree vitality-related basis of data. Furthermore, Bamberg already has a georeferenced tree cadastre of around 15,000 solitary trees in the city area, which is also used to generate helpful information. All mentioned data is then joined and visualised in an interactive web application allowing arborists and foresters to generate individual and flexible evaluations, thereby improving daily task planning.
Submitted 4 July, 2023;
originally announced July 2023.
-
Explaining Hate Speech Classification with Model Agnostic Methods
Authors:
Durgesh Nandini,
Ute Schmid
Abstract:
There have been remarkable breakthroughs in Machine Learning and Artificial Intelligence, notably in the areas of Natural Language Processing and Deep Learning. Additionally, hate speech detection in dialogues has been gaining popularity among Natural Language Processing researchers with the increased use of social media. However, as evidenced by the recent trends, the need for the dimensions of explainability and interpretability in AI models has been deeply realised. Taking note of the factors above, the research goal of this paper is to bridge the gap between hate speech prediction and the explanations generated by the system to support its decision. This has been achieved by first predicting the classification of a text and then providing a posthoc, model agnostic and surrogate interpretability approach for explainability and to prevent model bias. The bidirectional transformer model BERT has been used for prediction because of its state of the art efficiency over other Machine Learning models. The model agnostic algorithm LIME generates explanations for the output of a trained classifier and predicts the features that influence the model decision. The predictions generated from the model were evaluated manually, and after thorough evaluation, we observed that the model performs efficiently in predicting and explaining its prediction. Lastly, we suggest further directions for the expansion of the provided research work.
Submitted 30 May, 2023;
originally announced June 2023.
-
Advanced Mid-Infrared Plasmonic Waveguides For On-Chip Integrated Photonics
Authors:
Mauro David,
Davide Disnan,
Elena Arigliani,
Anna Lardschneider,
Georg Marschick,
Hanh T. Hoang,
Hermann Detz,
Bernhard Lendl,
Ulrich Schmid,
Gottfried Strasser,
Borislav Hinkov
Abstract:
Long-wave infrared (LWIR, 8-14 um) photonics is a rapidly growing research field within the mid-IR with applications in molecular spectroscopy and optical free-space communication. LWIR-applications are often addressed using rather bulky tabletop-sized free-space optical systems, preventing advanced photonic applications such as rapid-time-scale experiments. Here, device miniaturization into photonic integrated circuits (PICs) with maintained optical capabilities is key to revolutionize mid-IR photonics. Sub-wavelength mode confinement in plasmonic structures enabled such miniaturization approaches in the visible-to-near-IR spectral range. However, adopting plasmonics for the LWIR needs suitable low-loss and -dispersion materials with compatible integration strategies to existing mid-IR technology. In this work we further unlock the field of LWIR/mid-IR PICs, by combining photolithographic patterning of organic polymers with dielectric-loaded surface plasmon polariton (DLSPP) waveguides. In particular, polyethylene shows favorable optical properties, including low refractive index and broad transparency between ~2-200 um. We investigate the whole value chain, including design, fabrication, and characterization of polyethylene-based DLSPP waveguides and demonstrate their first-time plasmonic operation and mode guiding capabilities along s-bend structures. Low bending losses of ~1.3 dB and straight-section propagation lengths of ~1 mm, pave the way for unprecedented, complex on-chip mid-IR photonic devices. Moreover, DLSPPs allow full control of the mode parameters (propagation length and guiding capabilities) for precisely addressing advanced sensing and telecommunication applications with chip-scale devices.
Submitted 5 May, 2023;
originally announced May 2023.
-
Topological Characterization of Consensus Solvability in Directed Dynamic Networks
Authors:
Hugo Rincon Galeana,
Ulrich Schmid,
Kyrill Winkler,
Ami Paz,
Stefan Schmid
Abstract:
Consensus is one of the most fundamental problems in distributed computing. This paper studies the consensus problem in a synchronous dynamic directed network, in which communication is controlled by an oblivious message adversary. The question when consensus is possible in this model has already been studied thoroughly in the literature from a combinatorial perspective, and is known to be challenging. This paper presents a topological perspective on consensus solvability under oblivious message adversaries, which provides interesting new insights. Our main contribution is a topological characterization of consensus solvability, which also leads to explicit decision procedures. Our approach is based on the novel notion of a communication pseudosphere, which can be seen as the message-passing analog of the well-known standard chromatic subdivision for wait-free shared memory systems. We further push the elegance and expressiveness of the "geometric" reasoning enabled by the topological approach by dealing with uninterpreted complexes, which considerably reduce the size of the protocol complex, and by labeling facets with information flow arrows, which give an intuitive meaning to the implicit epistemic status of the faces in a protocol complex.
Submitted 5 April, 2023;
originally announced April 2023.
-
A Sufficient Condition for Gaining Belief in Byzantine Fault-Tolerant Distributed Systems
Authors:
Thomas Schlögl,
Ulrich Schmid
Abstract:
Existing protocols for Byzantine fault-tolerant distributed systems usually rely on the correct agents' ability to detect faulty agents and/or to detect the occurrence of some event or action on some correct agent. In this paper, we provide sufficient conditions that allow an agent to infer the appropriate beliefs from its history, and a procedure that allows these conditions to be checked in finite time. Our results thus provide essential stepping stones for developing efficient protocols and proving them correct.
Submitted 11 July, 2023; v1 submitted 1 April, 2023;
originally announced April 2023.
-
Continuity of Thresholded Mode-Switched ODEs and Digital Circuit Delay Models
Authors:
Arman Ferdowsi,
Matthias Függer,
Thomas Nowak,
Ulrich Schmid
Abstract:
Thresholded mode-switched ODEs are restricted dynamical systems that switch ODEs depending on digital input signals only, and produce a digital output signal by thresholding some internal signal. Such systems arise in recent digital circuit delay models, where the analog signals within a gate are governed by ODEs that change depending on the digital inputs.
We prove the continuity of the mapping from digital input signals to digital output signals for a large class of thresholded mode-switched ODEs. This continuity property is known to be instrumental for ensuring the faithfulness of the model w.r.t. propagating short pulses. We apply our result to several instances of such digital delay models, thereby proving them to be faithful.
Submitted 24 March, 2023;
originally announced March 2023.
-
Study of hydrogen absorption in a novel three-dimensional graphene structure: Towards hydrogen storage applications
Authors:
Aureliano Macili,
Ylea Vlamidis,
Georg Pfusterschmied,
Markus Leitgeb,
Ulrich Schmid,
Stefan Heun,
Stefano Veronesi
Abstract:
The use of a novel three-dimensional graphene structure allows circumventing the limitations of the two-dimensional nature of graphene and its application in hydrogen absorption. Here we investigate hydrogen-bonding on monolayer graphene conformally grown via the epitaxial growth method on the (0001) face of a porousified 4H-SiC wafer. Hydrogen absorption is studied via Thermal Desorption Spectroscopy (TDS), exposing the samples to either atomic (D) or molecular (D2) deuterium. The graphene growth temperature, hydrogen exposure temperature, and the morphology of the structure are investigated and related to their effect on hydrogen absorption. The three-dimensional graphene structures chemically bind atomic deuterium when exposed to D2. This is the first report of such an event in unfunctionalized graphene-based materials and implies the presence of a catalytic splitting mechanism. It is further shown that the three-dimensional dendritic structure of the porous material temporarily retains the desorbed molecules and causes delayed emission. The capability of chemisorbing atoms after a catalytic splitting of hydrogen, coupled to its large surface-to-volume ratio, make these structures a promising substrate for hydrogen storage devices.
Submitted 17 February, 2023;
originally announced February 2023.
-
A Digital Delay Model Supporting Large Adversarial Delay Variations
Authors:
Daniel Öhlinger,
Ulrich Schmid
Abstract:
Dynamic digital timing analysis is a promising alternative to analog simulations for verifying particularly timing-critical parts of a circuit. A necessary prerequisite is a digital delay model, which allows to accurately predict the input-to-output delay of a given transition in the input signal(s) of a gate. Since all existing digital delay models for dynamic digital timing analysis are deterministic, however, they cannot cover delay fluctuations caused by PVT variations, aging and analog signal noise. The only exception known to us is the $η$-IDM introduced by Függer et al. at DATE'18, which allows to add (very) small adversarially chosen delay variations to the deterministic involution delay model, without endangering its faithfulness. In this paper, we show that it is possible to extend the range of allowed delay variations so significantly that realistic PVT variations and aging are covered by the resulting extended $η$-IDM.
Submitted 6 April, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
An Accurate Hybrid Delay Model for Multi-Input Gates
Authors:
Arman Ferdowsi,
Ulrich Schmid,
Josef Salzmann
Abstract:
In order to facilitate the analysis of timing relations between individual transitions in a signal trace, dynamic digital timing analysis offers a less accurate but much faster alternative to analog simulations of digital circuits. This primarily requires gate delay models that also account for the fact that the input-to-output delay of a particular input transition also depends on the temporal distance to the previous output transitions. In the case of multi-input gates, the delay also experiences variations caused by multi-input switching (MIS) effects, i.e., transitions at different inputs that occur in close temporal proximity. In this paper, we advocate the development of hybrid delay models for CMOS gates obtained by replacing transistors with time-variant resistors. We exemplify our approach by applying it to a NOR gate (and, hence, to the dual NAND gate) and a Muller C gate. We analytically solve the resulting first-order differential equations with non-constant-coefficients, and derive analytic expressions for the resulting MIS gate delays. The resulting formulas not only pave the way to a sound model parametrization procedure, but are also instrumental for implementing fast and efficient digital timing simulation. By comparison with analog simulation data, we show that our models faithfully represent all relevant MIS effects. Using an implementation in the Involution Tool, we demonstrate that our model surpasses the alternative digital delay models for NOR gates known to us in terms of accuracy, with comparably short running times.
Submitted 15 May, 2023; v1 submitted 19 November, 2022;
originally announced November 2022.
-
CorrLoss: Integrating Co-Occurrence Domain Knowledge for Affect Recognition
Authors:
Ines Rieger,
Jaspar Pahl,
Bettina Finzel,
Ute Schmid
Abstract:
Neural networks are widely adopted, yet the integration of domain knowledge is still underutilized. We propose to integrate domain knowledge about co-occurring facial movements as a constraint in the loss function to enhance the training of neural networks for affect recognition. As the co-occurrence patterns tend to be similar across datasets, applying our method can lead to a higher generalizability of models and a lower risk of overfitting. We demonstrate this by showing performance increases in cross-dataset testing for various datasets. We also show the applicability of our method for calibrating neural networks to different facial expressions.
Submitted 31 October, 2022;
originally announced October 2022.
-
On Specifications and Proofs of Timed Circuits
Authors:
Matthias Fuegger,
Christoph Lenzen,
Ulrich Schmid
Abstract:
Given a discrete-state continuous-time reactive system, like a digital circuit, the classical approach is to first model it as a state transition system and then prove its properties. Our contribution advocates a different approach: to directly operate on the input-output behavior of such systems, without identifying states and their transitions in the first place. We discuss the benefits of this approach by means of some examples, which demonstrate that it integrates nicely with concepts of self-stabilization and fault-tolerance. We also elaborate on some unexpected artefacts of module composition in our framework, and conclude with some open research questions.
Submitted 17 August, 2022;
originally announced August 2022.
-
Explanatory machine learning for sequential human teaching
Authors:
Lun Ai,
Johannes Langer,
Stephen H. Muggleton,
Ute Schmid
Abstract:
The topic of comprehensibility of machine-learned theories has recently drawn increasing attention. Inductive Logic Programming (ILP) uses logic programming to derive logic theories from small data based on abduction and induction techniques. Learned theories are represented in the form of rules as declarative descriptions of obtained knowledge. In earlier work, the authors provided the first evidence of a measurable increase in human comprehension based on machine-learned logic rules for simple classification tasks. In a later study, it was found that the presentation of machine-learned explanations to humans can produce both beneficial and harmful effects in the context of game learning. We continue our investigation of comprehensibility by examining the effects of the ordering of concept presentations on human comprehension. In this work, we examine the explanatory effects of curriculum order and the presence of machine-learned explanations for sequential problem-solving. We show that 1) there exist tasks A and B such that learning A before B has a better human comprehension with respect to learning B before A and 2) there exist tasks A and B such that the presence of explanations when learning A contributes to improved human comprehension when subsequently learning B. We propose a framework for the effects of sequential teaching on comprehension based on an existing definition of comprehensibility and provide evidence for support from data collected in human trials. Empirical results show that sequential teaching of concepts with increasing complexity a) has a beneficial effect on human comprehension and b) leads to human re-discovery of divide-and-conquer problem-solving strategies, and c) studying machine-learned explanations allows adaptations of human problem-solving strategy with better performance.
Submitted 25 March, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
-
CAIPI in Practice: Towards Explainable Interactive Medical Image Classification
Authors:
Emanuel Slany,
Yannik Ott,
Stephan Scheele,
Jan Paulus,
Ute Schmid
Abstract:
Would you trust physicians if they cannot explain their decisions to you? Medical diagnostics using machine learning gained enormously in importance within the last decade. However, without further enhancements many state-of-the-art machine learning methods are not suitable for medical application. The most important reasons are insufficient data set quality and the black-box behavior of machine learning algorithms such as Deep Learning models. Consequently, end-users cannot correct the model's decisions and the corresponding explanations. The latter is crucial for the trustworthiness of machine learning in the medical domain. The research field explainable interactive machine learning searches for methods that address both shortcomings. This paper extends the explainable and interactive CAIPI algorithm and provides an interface to simplify human-in-the-loop approaches for image classification. The interface enables the end-user (1) to investigate and (2) to correct the model's prediction and explanation, and (3) to influence the data set quality. After CAIPI optimization with only a single counterexample per iteration, the model achieves an accuracy of $97.48\%$ on the Medical MNIST and $95.02\%$ on the Fashion MNIST. This accuracy is approximately equal to state-of-the-art Deep Learning optimization procedures. Besides, CAIPI reduces the labeling effort by approximately $80\%$.
Submitted 31 May, 2022; v1 submitted 6 April, 2022;
originally announced April 2022.
-
Explainable Online Lane Change Predictions on a Digital Twin with a Layer Normalized LSTM and Layer-wise Relevance Propagation
Authors:
Christoph Wehner,
Francis Powlesland,
Bashar Altakrouri,
Ute Schmid
Abstract:
Artificial Intelligence and Digital Twins play an integral role in driving innovation in the domain of intelligent driving. Long short-term memory (LSTM) is a leading driver in the field of lane change prediction for manoeuvre anticipation. However, the decision-making process of such models is complex and non-transparent, hence reducing the trustworthiness of the smart solution. This work presents an innovative approach and a technical implementation for explaining lane change predictions of layer normalized LSTMs using Layer-wise Relevance Propagation (LRP). The core implementation includes consuming live data from a digital twin on a German highway, live predictions and explanations of lane changes by extending LRP to layer normalized LSTMs, and an interface for communicating and explaining the predictions to a human user. We aim to demonstrate faithful, understandable, and adaptable explanations of lane change prediction to increase the adoption and trustworthiness of AI systems that involve humans. Our research also emphasises that explainability and state-of-the-art performance of ML models for manoeuvre anticipation go hand in hand without negatively affecting predictive effectiveness.
Submitted 4 April, 2022;
originally announced April 2022.
-
An Interactive Explanatory AI System for Industrial Quality Control
Authors:
Dennis Müller,
Michael März,
Stephan Scheele,
Ute Schmid
Abstract:
Machine learning based image classification algorithms, such as deep neural network approaches, will be increasingly employed in critical settings such as quality control in industry, where transparency and comprehensibility of decisions are crucial. Therefore, we aim to extend the defect detection task towards an interactive human-in-the-loop approach that allows us to integrate rich background knowledge and the inference of complex relationships going beyond traditional purely data-driven approaches. We propose an approach for an interactive support system for classifications in an industrial quality control setting that combines the advantages of both (explainable) knowledge-driven and data-driven machine learning methods, in particular inductive logic programming and convolutional neural networks, with human expertise and control. The resulting system can assist domain experts with decisions, provide transparent explanations for results, and integrate feedback from users; thus reducing workload for humans while both respecting their expertise and without removing their agency or accountability.
Submitted 17 March, 2022;
originally announced March 2022.
-
Time Complexity of Consensus in Dynamic Networks Under Oblivious Message Adversaries
Authors:
Ami Paz,
Hugo Rincon Galeana,
Stefan Schmid,
Ulrich Schmid,
Kyrill Winkler
Abstract:
Consensus is a most fundamental task in distributed computing. This paper studies the consensus problem for a set of processes connected by a dynamic directed network, in which computation and communication is lock-step synchronous but controlled by an oblivious message adversary. In this basic model, determining consensus solvability and designing consensus algorithms in the case where it is possible, has been shown to be surprisingly difficult. We present an explicit decision procedure to determine if consensus is possible under a given adversary. This in turn enables us, for the first time, to study the time complexity of consensus in this model. In particular, we derive time complexity upper bounds for consensus solvability both for a centralized decision procedure as well as for solving distributed consensus. We complement these results with time complexity lower bounds. Intriguingly, we find that reaching consensus under an oblivious message adversary can take exponentially longer than broadcasting the input value of some process to all other processes.
Submitted 24 February, 2022;
originally announced February 2022.
-
Enabling Verification of Deep Neural Networks in Perception Tasks Using Fuzzy Logic and Concept Embeddings
Authors:
Gesina Schwalbe,
Christian Wirth,
Ute Schmid
Abstract:
One major drawback of deep convolutional neural networks (CNNs) for use in safety critical applications is their black-box nature. This makes it hard to verify or monitor complex, symbolic requirements on already trained computer vision CNNs. In this work, we present a simple, yet effective, approach to verify that a CNN complies with symbolic predicate logic rules which relate visual concepts. It is the first that (1) does not modify the CNN, (2) may use visual concepts that are no CNN in- or output feature, and (3) can leverage continuous CNN confidence outputs. To achieve this, we newly combine methods from explainable artificial intelligence and logic: First, using supervised concept embedding analysis, the output of a CNN is post-hoc enriched by concept outputs. Second, rules from prior knowledge are modelled as truth functions that accept the CNN outputs, and can be evaluated with little computational overhead. We here investigate the use of fuzzy logic, i.e., continuous truth values, and of proper output calibration, which both theoretically and practically show slight benefits. Applicability is demonstrated on state-of-the-art object detectors for three verification use-cases, where monitoring of rule breaches can reveal detection errors.
Submitted 13 March, 2022; v1 submitted 3 January, 2022;
originally announced January 2022.
-
3D arrangement of epitaxial graphene conformally grown on porousified crystalline SiC
Authors:
Stefano Veronesi,
Georg Pfusterschmied,
Filippo Fabbri,
Markus Leitgeb,
Omer Arif,
Daniel Esteban Arenas,
Sara Bals,
Ulrich Schmid,
Stefan Heun
Abstract:
Nanoporous materials represent a versatile solution for a number of applications ranging from sensing, energy applications, catalysis, drug delivery, and many others. The synergy between the outstanding properties of graphene with a three-dimensional porous structure, circumventing the limits of its 2D nature, constitutes therefore a breakthrough for many fields. We report the first three-dimensional growth of epitaxial graphene on a porousified crystalline 4H-SiC(0001). The wafer porosification is performed via a sequence of metal-assisted photochemical and photoelectrochemical etching in hydrofluoric acid based electrolytes. Pore dimensions of the matrix have been evaluated by electron tomography resulting in an average diameter of 180 nm. Graphene growth is performed in an ultra high vacuum environment at a base pressure of $10^{-11}$ mbar. The graphene growth inside the pores is uniform as confirmed by Transmission Electron Microscopy (TEM) analysis. Raman spectroscopy confirms the high quality of the graphene with a 2D/G ratio $>1$ and an average graphene crystal size of $\approx$ 100 nm. Furthermore, it demonstrates a uniform coverage of graphene across the whole sample area. The surface-to-volume ratio of this novel material, its properties, the tunability of the pore size and the scalability of the surface porosification process offer a game changing perspective for a large number of applications.
Submitted 21 December, 2021;
originally announced December 2021.
-
A Simple Hybrid Model for Accurate Delay Modeling of a Multi-Input Gate
Authors:
Arman Ferdowsi,
Jürgen Maier,
Daniel Öhlinger,
Ulrich Schmid
Abstract:
Faithfully representing small gate delay variations caused by input switchings on different inputs in close temporal proximity is a very challenging task for digital delay models. In this paper, we use the example of a 2-input NOR gate to show that a simple hybrid model leads to a surprisingly accurate digital delay model. Our model utilizes simple first-order ordinary differential equations (ODEs) in all modes, resulting from considering transistors as ideal switches in a simple RC model of the gate. By analytically solving the resulting ODEs, we derive expressions for the gate delays, as well as formulas that facilitate model parametrization. It turns out that our model almost faithfully captures the Charlie effect, except in just one specific situation. In addition, we experimentally compare our model's predictions both to SPICE simulations, using some 15 nm technology, and to some existing delay models. Our results show a significant improvement of the achievable modeling accuracy.
Submitted 16 November, 2021;
originally announced November 2021.
-
Explanation as a process: user-centric construction of multi-level and multi-modal explanations
Authors:
Bettina Finzel,
David E. Tafler,
Stephan Scheele,
Ute Schmid
Abstract:
In the last years, XAI research has mainly been concerned with develo** new technical approaches to explain deep learning models. Just recent research has started to acknowledge the need to tailor explanations to different contexts and requirements of stakeholders. Explanations must not only suit developers of models, but also domain experts as well as end users. Thus, in order to satisfy differ…
▽ More
In the last years, XAI research has mainly been concerned with develo** new technical approaches to explain deep learning models. Just recent research has started to acknowledge the need to tailor explanations to different contexts and requirements of stakeholders. Explanations must not only suit developers of models, but also domain experts as well as end users. Thus, in order to satisfy different stakeholders, explanation methods need to be combined. While multi-modal explanations have been used to make model predictions more transparent, less research has focused on treating explanation as a process, where users can ask for information according to the level of understanding gained at a certain point in time. Consequently, an opportunity to explore explanations on different levels of abstraction should be provided besides multi-modal explanations. We present a process-based approach that combines multi-level and multi-modal explanations. The user can ask for textual explanations or visualizations through conversational interaction in a drill-down manner. We use Inductive Logic Programming, an interpretable machine learning approach, to learn a comprehensible model. Further, we present an algorithm that creates an explanatory tree for each example for which a classifier decision is to be explained. The explanatory tree can be navigated by the user to get answers of different levels of detail. We provide a proof-of-concept implementation for concepts induced from a semantic net about living beings.
Submitted 7 October, 2021;
originally announced October 2021.
-
Continuous Tasks and the Chromatic Simplicial Approximation Theorem
Authors:
Hugo Rincon Galeana,
Sergio Rajsbaum,
Ulrich Schmid
Abstract:
The celebrated 1999 Asynchronous Computability Theorem (ACT) of Herlihy and Shavit characterized the distributed tasks that are wait-free solvable, and thus uncovered a deep connection with algebraic topology. We present a novel interpretation of this theorem, through the notion of continuous task, defined by an input/output specification that is a continuous function. To do so, we introduce a chromatic version of a foundational result for algebraic topology: the simplicial approximation theorem. In addition to providing a different proof of the ACT, the notion of continuous task seems interesting in itself. Indeed, besides the fact that certain distributed problems are naturally specified by continuous functions, continuous tasks have an expressive power that also allows specifying the density of desired outputs for each combination of possible inputs, for example.
Submitted 3 September, 2021;
originally announced September 2021.
-
Extending Challenge Sets to Uncover Gender Bias in Machine Translation: Impact of Stereotypical Verbs and Adjectives
Authors:
Jonas-Dario Troles,
Ute Schmid
Abstract:
Human gender bias is reflected in language and text production. Because state-of-the-art machine translation (MT) systems are trained on large corpora of text, mostly generated by humans, gender bias can also be found in MT. For instance, when occupations are translated from a language like English, which mostly uses gender-neutral words, to a language like German, which mostly uses a feminine and a masculine version for an occupation, a decision must be made by the MT system. Recent research showed that MT systems are biased towards stereotypical translation of occupations. In 2019 the first, and so far only, challenge set explicitly designed to measure the extent of gender bias in MT systems was published. In this set, measurement of gender bias is solely based on the translation of occupations. In this paper we present an extension of this challenge set, called WiBeMT, which adds sentences with gender-biased adjectives and sentences with gender-biased verbs. The resulting challenge set consists of over 70,000 sentences and has been translated with three commercial MT systems: DeepL Translator, Microsoft Translator, and Google Translate. Results show a gender bias for all three MT systems. This gender bias is to a great extent significantly influenced by adjectives and to a lesser extent by verbs.
Submitted 24 July, 2021;
originally announced July 2021.
-
Fire!
Authors:
Krisztina Fruzsa,
Roman Kuznets,
Ulrich Schmid
Abstract:
In this paper, we provide an epistemic analysis of a simple variant of the fundamental consistent broadcasting primitive for byzantine fault-tolerant asynchronous distributed systems. Our Firing Rebels with Relay (FRR) primitive enables agents with a local preference for acting/not acting to trigger an action (FIRE) at all correct agents, in an all-or-nothing fashion. By using the epistemic reasoning framework for byzantine multi-agent systems introduced in our TARK'19 paper, we develop the necessary and sufficient state of knowledge that needs to be acquired by the agents in order to FIRE. It involves eventual common hope (a modality related to belief), which we show to be attained already by achieving eventual mutual hope in the case of FRR. We also identify subtle variations of the necessary and sufficient state of knowledge for FRR for different assumptions on the local preferences.
Submitted 21 June, 2021;
originally announced June 2021.
-
Generating Contrastive Explanations for Inductive Logic Programming Based on a Near Miss Approach
Authors:
Johannes Rabold,
Michael Siebers,
Ute Schmid
Abstract:
In recent research, human-understandable explanations of machine learning models have received a lot of attention. Often explanations are given in the form of model simplifications or visualizations. However, as shown in cognitive science as well as in early AI research, concept understanding can also be improved by the alignment of a given instance for a concept with a similar counterexample. Contrasting a given instance with a structurally similar example which does not belong to the concept highlights what characteristics are necessary for concept membership. Such near misses have been proposed by Winston (1970) as efficient guidance for learning in relational domains. We introduce an explanation generation algorithm for relational concepts learned with Inductive Logic Programming (GeNME). The algorithm identifies near miss examples from a given set of instances and ranks these examples by their degree of closeness to a specific positive instance. A modified rule which covers the near miss but not the original instance is given as an explanation. We illustrate GeNME with the well-known family domain consisting of kinship relations, the visual relational Winston arches domain and a real-world domain dealing with file management. We also present a psychological experiment comparing human preferences of rule-based, example-based, and near miss explanations in the family and the arches domains.
Submitted 15 June, 2021;
originally announced June 2021.
-
Expressive Explanations of DNNs by Combining Concept Analysis with ILP
Authors:
Johannes Rabold,
Gesina Schwalbe,
Ute Schmid
Abstract:
Explainable AI has emerged as a key component for black-box machine learning approaches in domains with a high demand for reliability or transparency. Examples are medical assistant systems, and applications concerned with the General Data Protection Regulation of the European Union, which features transparency as a cornerstone. Such demands require the ability to audit the rationale behind a classifier's decision. While visualizations are the de facto standard of explanations, they fall short in terms of expressiveness in many ways: They cannot distinguish between different attribute manifestations of visual features (e.g. eye open vs. closed), and they cannot accurately describe the influence of the absence of, and relations between, features. An alternative would be more expressive symbolic surrogate models. However, these require symbolic inputs, which are not readily available in most computer vision tasks. In this paper we investigate how to overcome this: We use inherent features learned by the network to build a global, expressive, verbal explanation of the rationale of a feed-forward convolutional deep neural network (DNN). The semantics of the features are mined by a concept analysis approach trained on a set of human understandable visual concepts. The explanation is found by an Inductive Logic Programming (ILP) method and presented as first-order rules. We show that our explanation is faithful to the original black-box model.
The code for our experiments is available at https://github.com/mc-lovin-mlem/concept-embeddings-and-ilp/tree/ki2020.
Submitted 16 May, 2021;
originally announced May 2021.
-
A Composable Glitch-Aware Delay Model
Authors:
Jürgen Maier,
Daniel Öhlinger,
Ulrich Schmid,
Matthias Függer,
Thomas Nowak
Abstract:
We introduce the Composable Involution Delay Model (CIDM) for fast and accurate digital simulation. It is based on the Involution Delay Model (IDM) [Függer et al., IEEE TCAD 2020], which has been shown to be the only existing candidate for faithful glitch propagation known so far. In its present form, however, it has shortcomings that limit its practical applicability and utility. First, IDM delay predictions are conceptually based on discretizing the analog signal waveforms using specific matching input and output discretization threshold voltages. Unfortunately, they are difficult to determine and typically different for interconnected gates. Second, metastability and high-frequency oscillations in a real circuit could be invisible in the IDM signal predictions. Our CIDM reduces the characterization effort by allowing independent discretization thresholds, improves composability and increases the modeling power by exposing canceled pulse trains at the gate interconnect. We formally show that, despite these improvements, the CIDM still retains the IDM's faithfulness, which is a consequence of the mathematical properties of involution delay functions.
Submitted 22 April, 2021;
originally announced April 2021.
-
High finesse microcavities in the optical telecom O-band
Authors:
Jan Fait,
Stefan Putz,
Georg Wachter,
Johannes Schalko,
Ulrich Schmid,
Markus Arndt,
Michael Trupke
Abstract:
Optical microcavities allow light to be strongly confined in small mode volumes and with long photon lifetimes. This confinement significantly enhances the interaction between light and matter inside the cavity, with applications such as optical trapping and cooling of nanoparticles, single-photon emission enhancement, quantum information processing, and sensing. For many applications, open resonators with direct access to the mode volume are necessary. Here we report on a scalable, open-access optical microcavity platform with mode volumes < 30 $λ^3$ and finesse approaching $5 \times 10^5$. This result significantly exceeds the highest optical enhancement factors achieved to date for Fabry-Pérot cavities. The platform provides a building block for high-performance quantum devices relying on strong light-matter interaction.
Submitted 6 April, 2021;
originally announced April 2021.
-
The Persistence of False Memory: Brain in a Vat Despite Perfect Clocks
Authors:
Thomas Schlögl,
Ulrich Schmid,
Roman Kuznets
Abstract:
Recently, a detailed epistemic reasoning framework for multi-agent systems with byzantine faulty asynchronous agents and possibly unreliable communication was introduced. We have developed a modular extension framework implemented on top of it, which allows encoding and safely combining additional system assumptions commonly used in the modeling and analysis of fault-tolerant distributed systems, like reliable communication, time-bounded communication, multicasting, synchronous and lock-step synchronous agents and even agents with coordinated actions. We use this extension framework for analyzing basic properties of synchronous and lock-step synchronous agents, such as the agents' local and global fault detection abilities. Moreover, we show that even the perfectly synchronized clocks available in lock-step synchronous systems cannot be used to avoid "brain-in-a-vat" scenarios.
Submitted 2 November, 2020;
originally announced November 2020.
-
Beneficial and Harmful Explanatory Machine Learning
Authors:
Lun Ai,
Stephen H. Muggleton,
Céline Hocquette,
Mark Gromowski,
Ute Schmid
Abstract:
Given the recent successes of Deep Learning in AI, there has been increased interest in the role and need for explanations in machine learned theories. A distinct notion in this context is that of Michie's definition of Ultra-Strong Machine Learning (USML). USML is demonstrated by a measurable increase in human performance of a task following provision to the human of a symbolic machine learned theory for task performance. A recent paper demonstrates the beneficial effect of a machine learned logic theory for a classification task, yet no existing work to our knowledge has examined the potential harmfulness of the machine's involvement for human comprehension during learning. This paper investigates the explanatory effects of a machine learned theory in the context of simple two-person games and proposes a framework for identifying the harmfulness of machine explanations based on the Cognitive Science literature. The approach involves a cognitive window consisting of two quantifiable bounds and it is supported by empirical evidence collected from human trials. Our quantitative and qualitative results indicate that human learning aided by a symbolic machine learned theory which satisfies a cognitive window has achieved significantly higher performance than human self learning. Results also demonstrate that human learning aided by a symbolic machine learned theory that fails to satisfy this window leads to significantly worse performance than unaided human learning.
Submitted 25 February, 2021; v1 submitted 9 September, 2020;
originally announced September 2020.
-
A Faithful Binary Circuit Model with Adversarial Noise
Authors:
Matthias Függer,
Jürgen Maier,
Robert Najvirt,
Thomas Nowak,
Ulrich Schmid
Abstract:
Accurate delay models are important for static and dynamic timing analysis of digital circuits, and mandatory for formal verification. However, Függer et al. [IEEE TC 2016] proved that pure and inertial delays, which are employed for dynamic timing analysis in state-of-the-art tools like ModelSim, NC-Sim and VCS, do not yield faithful digital circuit models. Involution delays, which are based on delay functions that are mathematical involutions depending on the previous-output-to-input time offset, were introduced by Függer et al. [DATE'15] as a faithful alternative (that can easily be used with existing tools). Although involution delays were shown to predict real signal traces reasonably accurately, any model with a deterministic delay function is naturally limited in its modeling power. In this paper, we thus extend the involution model by adding non-deterministic delay variations (random or even adversarial), and prove analytically that faithfulness is not impaired by this generalization. Although the amount of non-determinism must be considerably restricted to ensure this property, the result is surprising: the involution model differs from non-faithful models mainly in handling fast glitch trains, where small delay shifts have large effects. This originally suggested that adding even small variations should break the faithfulness of the model, which turned out not to be the case. Moreover, the results of our simulations also confirm that this generalized involution model has larger modeling power and, hence, applicability.
Submitted 7 December, 2021; v1 submitted 15 June, 2020;
originally announced June 2020.
-
Verifying Deep Learning-based Decisions for Facial Expression Recognition
Authors:
Ines Rieger,
Rene Kollmann,
Bettina Finzel,
Dominik Seuss,
Ute Schmid
Abstract:
Neural networks with high performance can still be biased towards non-relevant features. However, reliability and robustness are especially important for high-risk fields such as clinical pain treatment. We therefore propose a verification pipeline, which consists of three steps. First, we classify facial expressions with a neural network. Next, we apply layer-wise relevance propagation to create pixel-based explanations. Finally, we quantify these visual explanations based on a bounding-box method with respect to facial regions. Although our results show that the neural network achieves state-of-the-art results, the evaluation of the visual explanations reveals that relevant facial regions may not be considered.
Submitted 14 February, 2020;
originally announced March 2020.
-
Effect of Superpixel Aggregation on Explanations in LIME -- A Case Study with Biological Data
Authors:
Ludwig Schallner,
Johannes Rabold,
Oliver Scholz,
Ute Schmid
Abstract:
End-to-end learning with deep neural networks, such as convolutional neural networks (CNNs), has been demonstrated to be very successful for different tasks of image classification. To make decisions of black-box approaches transparent, different solutions have been proposed. LIME is an approach to explainable AI relying on segmenting images into superpixels based on the Quick-Shift algorithm. In this paper, we present an explorative study of how different superpixel methods, namely Felzenszwalb, SLIC and Compact-Watershed, impact the generated visual explanations. We compare the resulting relevance areas with the image parts marked by a human reference. Results show that image parts selected as relevant strongly vary depending on the applied method. Quick-Shift resulted in the lowest and Compact-Watershed in the highest correspondence with the reference relevance areas.
Submitted 17 October, 2019;
originally announced October 2019.
-
Enriching Visual with Verbal Explanations for Relational Concepts -- Combining LIME with Aleph
Authors:
Johannes Rabold,
Hannah Deininger,
Michael Siebers,
Ute Schmid
Abstract:
With the increasing number of deep learning applications, there is a growing demand for explanations. Visual explanations provide information about which parts of an image are relevant for a classifier's decision. However, highlighting of image parts (e.g., an eye) cannot capture the relevance of a specific feature value for a class (e.g., that the eye is wide open). Furthermore, highlighting cannot convey whether the classification depends on the mere presence of parts or on a specific spatial relation between them. Consequently, we present an approach that is capable of explaining a classifier's decision in terms of logic rules obtained by the Inductive Logic Programming system Aleph. The examples and the background knowledge needed for Aleph are based on the explanation generation method LIME. We demonstrate our approach with images of a blocksworld domain. First, we show that our approach is capable of identifying a single relation as an important explanatory construct. Afterwards, we present the more complex relational concept of towers. Finally, we show how the generated relational rules can be explicitly related with the input image, resulting in richer explanations.
Submitted 4 October, 2019;
originally announced October 2019.
-
Causality and Epistemic Reasoning in Byzantine Multi-Agent Systems
Authors:
Roman Kuznets,
Laurent Prosperi,
Ulrich Schmid,
Krisztina Fruzsa
Abstract:
Causality is an important concept both for proving impossibility results and for synthesizing efficient protocols in distributed computing. For asynchronous agents communicating over unreliable channels, causality is well studied and understood. This understanding, however, relies heavily on the assumption that agents themselves are correct and reliable. We provide the first epistemic analysis of causality in the presence of byzantine agents, i.e., agents that can deviate from their protocol and, thus, cannot be relied upon. Using our new framework for epistemic reasoning in fault-tolerant multi-agent systems, we determine the byzantine analog of the causal cone and describe a communication structure, which we call a multipede, necessary for verifying preconditions for actions in this setting.
Submitted 21 July, 2019;
originally announced July 2019.
-
Topological Characterization of Consensus in Distributed Systems
Authors:
Thomas Nowak,
Ulrich Schmid,
Kyrill Winkler
Abstract:
We provide a complete characterization of both uniform and non-uniform deterministic consensus solvability in distributed systems with benign process and communication faults using point-set topology. More specifically, we non-trivially extend the approach introduced by Alpern and Schneider in 1985, by introducing novel fault-aware pseudo-(semi-)metric topologies on the space of infinite executions: the process-view topology, induced by a distance function that relies on the local view of a given process in an execution, and the minimum topology, which is induced by a distance function that focuses on the local view of the process that is the last to distinguish two executions. Consensus is solvable in a given model if and only if the sets of admissible executions leading to different decision values are disconnected in these topologies. We also provide two alternative characterizations, based on the broadcastability of connected components and on the exclusion of certain "fair" and "unfair" limit sequences (which coincide with forever bivalent runs). By applying our approach to a wide range of different applications, we provide a topological explanation of a number of existing algorithms and impossibility results and develop several new ones.
Submitted 8 August, 2022; v1 submitted 23 May, 2019;
originally announced May 2019.
-
Silicon microcavity arrays with open access and a finesse of half a million
Authors:
G. Wachter,
S. Kuhn,
S. Minniberger,
C. Salter,
P. Asenbaum,
J. Millen,
M. Schneider,
J. Schalko,
U. Schmid,
A. Felgner,
D. Hüser,
M. Arndt,
M. Trupke
Abstract:
Optical resonators are increasingly important tools in science and technology. Their applications range from laser physics, atomic clocks, molecular spectroscopy, and single-photon generation to the detection, trapping and cooling of atoms or nano-scale objects. Many of these applications benefit from strong mode confinement and high optical quality factors, making small mirrors of high surface-quality desirable. Building such devices in silicon yields ultra-low absorption at telecom wavelengths and enables integration of micro-structures with mechanical, electrical and other functionalities. Here, we push optical resonator technology to new limits by fabricating lithographically aligned silicon mirrors with ultra-smooth surfaces, small and well-controlled radii of curvature, ultra-low loss and high reflectivity. We build large arrays of microcavities with finesse greater than F = 500,000 and a mode volume of 330 femtoliters at wavelengths near 1550 nm. Such high-quality micro-mirrors open up a new regime of optics and enable unprecedented explorations of strong coupling between light and matter.
Submitted 16 January, 2019;
originally announced April 2019.