-
Semi-device independent characterization of multiphoton indistinguishability
Authors:
Giovanni Rodari,
Leonardo Novo,
Riccardo Albiero,
Alessia Suprano,
Carlos T. Tavares,
Eugenio Caruccio,
Francesco Hoch,
Taira Giordani,
Gonzalo Carvacho,
Marco Gardina,
Niki Di Giano,
Serena Di Giorgio,
Giacomo Corrielli,
Francesco Ceccarelli,
Roberto Osellame,
Nicolò Spagnolo,
Ernesto F. Galvão,
Fabio Sciarrino
Abstract:
Multiphoton indistinguishability is a central resource for quantum enhancement in sensing and computation. Developing and certifying large-scale photonic devices requires reliable and accurate characterization of this resource, preferably using methods that are robust against experimental errors. Here, we propose a set of methods for the characterization of multiphoton indistinguishability, based on measurements of bunching and photon number variance. Our methods are robust in a semi-device independent way, in the sense of being effective even when the interferometers are incorrectly dialled. We demonstrate the effectiveness of this approach using an advanced photonic platform comprising a quantum-dot single-photon source and a universal fully-programmable integrated photonic processor. Our results show the practical usefulness of our methods, providing robust certification tools that can be scaled up to larger systems.
Submitted 29 April, 2024;
originally announced April 2024.
-
Assessing ML Classification Algorithms and NLP Techniques for Depression Detection: An Experimental Case Study
Authors:
Giuliano Lorenzoni,
Cristina Tavares,
Nathalia Nascimento,
Paulo Alencar,
Donald Cowan
Abstract:
Depression has affected millions of people worldwide and has become one of the most common mental disorders. Early mental disorder detection can reduce costs for public health agencies and prevent other major comorbidities. Additionally, the shortage of specialized personnel is very concerning since depression diagnosis is highly dependent on expert professionals and is time-consuming. Recent research has shown that machine learning (ML) and Natural Language Processing (NLP) tools and techniques have significantly benefited the diagnosis of depression. However, there are still several challenges in the assessment of depression detection approaches in which other conditions such as post-traumatic stress disorder (PTSD) are present. These challenges include assessing alternatives in terms of data cleaning and pre-processing techniques, feature selection, and appropriate ML classification algorithms. This paper tackles such an assessment based on a case study that compares different ML classifiers, specifically in terms of data cleaning and pre-processing, feature selection, parameter setting, and model choices. The case study is based on the Distress Analysis Interview Corpus - Wizard-of-Oz (DAIC-WOZ) dataset, which is designed to support the diagnosis of mental disorders such as depression, anxiety, and PTSD. Besides the assessment of alternative techniques, we were able to build models with accuracy levels around 84% with Random Forest and XGBoost models, which is significantly higher than the 72% accuracy reported for the SVM model in the comparable literature.
Submitted 3 April, 2024;
originally announced April 2024.
-
Extending Variability-Aware Model Selection with Bias Detection in Machine Learning Projects
Authors:
Cristina Tavares,
Nathalia Nascimento,
Paulo Alencar,
Donald Cowan
Abstract:
Data science projects often involve various machine learning (ML) methods that depend on data, code, and models. One of the key activities in these projects is the selection of a model or algorithm that is appropriate for the data analysis at hand. ML model selection depends on several factors, which include data-related attributes such as sample size, functional requirements such as the prediction algorithm type, and non-functional requirements such as performance and bias. However, the factors that influence such selection are often not well understood or explicitly represented. This paper describes ongoing work on extending an adaptive variability-aware model selection method with bias detection in ML projects. The method involves: (i) modeling the variability of the factors that affect model selection using feature models based on heuristics proposed in the literature; (ii) instantiating our variability model with added features related to bias (e.g., bias-related metrics); and (iii) conducting experiments that illustrate the method in a case study based on a heart failure prediction project. The proposed approach aims to advance the state of the art by making explicit the factors that influence model selection, particularly those related to bias, as well as their interactions. The provided representations can transform model selection in ML projects into a non-ad-hoc, adaptive, and explainable process.
Submitted 23 November, 2023;
originally announced November 2023.
-
GPT in Data Science: A Practical Exploration of Model Selection
Authors:
Nathalia Nascimento,
Cristina Tavares,
Paulo Alencar,
Donald Cowan
Abstract:
There is an increasing interest in leveraging Large Language Models (LLMs) for managing structured data and enhancing data science processes. Despite the potential benefits, this integration poses significant questions regarding their reliability and decision-making methodologies. It highlights the importance of various factors in the model selection process, including the nature of the data, problem type, performance metrics, computational resources, interpretability vs accuracy, assumptions about data, and ethical considerations. Our objective is to elucidate and express the factors and assumptions guiding GPT-4's model selection recommendations. We employ a variability model to depict these factors and use toy datasets to evaluate both the model and the implementation of the identified heuristics. By contrasting these outcomes with heuristics from other platforms, our aim is to determine the effectiveness and distinctiveness of GPT-4's methodology. This research is committed to advancing our comprehension of AI decision-making processes, especially in the realm of model selection within data science. Our efforts are directed towards creating AI systems that are more transparent and comprehensible, contributing to a more responsible and efficient practice in data science.
Submitted 19 November, 2023;
originally announced November 2023.
-
DESiRED -- Dynamic, Enhanced, and Smart iRED: A P4-AQM with Deep Reinforcement Learning and In-band Network Telemetry
Authors:
Leandro C. de Almeida,
Washington Rodrigo Dias da Silva,
Thiago C. Tavares,
Rafael Pasquini,
Chrysa Papagianni,
Fábio L. Verdi
Abstract:
Active Queue Management (AQM) is a mechanism employed to alleviate transient congestion in network device buffers, such as routers and switches. Traditional AQM algorithms use fixed thresholds, like target delay or queue occupancy, to compute random packet drop probabilities. A very small target delay can increase packet losses and reduce link utilization, while a large target delay may increase queueing delays while lowering drop probability. Due to dynamic network traffic characteristics, where traffic fluctuations can lead to significant queue variations, maintaining a fixed threshold AQM may not suit all applications. Consequently, we explore the question: what is the ideal threshold (target delay) for AQMs? In this work, we introduce DESiRED (Dynamic, Enhanced, and Smart iRED), a P4-based AQM that leverages precise network feedback from In-band Network Telemetry (INT) to feed a Deep Reinforcement Learning (DRL) model. This model dynamically adjusts the target delay based on rewards that maximize application Quality of Service (QoS). We evaluate DESiRED in a realistic P4-based test environment running an MPEG-DASH service. Our findings demonstrate up to a 90x reduction in video stall and a 42x increase in high-resolution video playback quality when the target delay is adjusted dynamically by DESiRED.
Submitted 27 October, 2023;
originally announced October 2023.
-
Effect of gallium doping on structural and transport properties of the topological insulator Bi2Se3 grown by molecular beam epitaxy
Authors:
Daniel Brito,
Ana Pérez-Rodriguez,
Ishwor Khatri,
Carlos José Tavares,
Mario Amado,
Eduardo Castro,
Enrique Diez,
Sascha Sadewasser,
Marcel S Claro
Abstract:
Topological insulators possess a non-conductive bulk and conductive surface states; hence, they are electrically conductive along their boundaries. Bismuth selenide ($Bi_2Se_3$) is one of the most promising topological insulators. However, a major drawback is its n-type nature arising from its natural doping, which makes the transport in the bulk dominant. This effect can be overcome by shifting the chemical potential into the bandgap, making the transport of the surface states more pronounced than that of the bulk. In this work, $Bi_2Se_3$ was grown by molecular beam epitaxy and doped with 0.8, 2, 7, and 14 at. % of Ga, with the aim of shifting the chemical potential into the bandgap. The structural, morphological, and electronic properties of the Ga-doped $Bi_2Se_3$ are studied. Raman and X-ray diffraction measurements confirmed the incorporation of the dopants into the crystal structure. Transport and magnetoresistance measurements in the temperature range of 1.5 to 300 K show that Ga-doped $Bi_2Se_3$ is n-type with a bulk charge carrier concentration of $10^{19} cm^{-3}$. Remarkably, magnetotransport measurements of the weak antilocalization (WAL) effect confirm the existence of surface states up to a doping percentage of 2 at. % of Ga and coherence length values between 50 and 800 nm, which opens the possibility of topological superconductivity in this material.
Submitted 30 June, 2022;
originally announced June 2022.
-
Towards Understanding Barriers and Mitigation Strategies of Software Engineers with Non-traditional Educational and Occupational Backgrounds
Authors:
Tavian Barnes,
Ken Jen Lee,
Cristina Tavares,
Gema Rodríguez-Pérez,
Meiyappan Nagappan
Abstract:
The traditional path to a software engineering career involves a post-secondary diploma in Software Engineering, Computer Science, or a related field. However, many software engineers take a non-traditional path to their career, starting from other industries or fields of study. This paper proposes a study on barriers faced by software engineers with non-traditional educational and occupational backgrounds, and possible mitigation strategies for those barriers. We propose a two-stage methodology, consisting of an exploratory study, followed by a validation study. The exploratory study will involve a grounded-theory-based qualitative analysis of relevant Reddit data to yield a framework around the barriers and possible mitigation strategies. These findings will then be validated using a survey in the validation study. Making software engineering more accessible to those with non-traditional backgrounds will not only bring about the benefits of functional diversity, but also help fill the labour shortages of the software engineering industry.
Submitted 8 April, 2022;
originally announced April 2022.
-
Smoothing and differentiation of data by Tikhonov and fractional derivative tools, applied to surface-enhanced Raman scattering (SERS) spectra of crystal violet dye
Authors:
Nelson H. T. Lemes,
Taináh M. R. Santos,
Camila A. Tavares,
Luciano S. Virtuoso,
Kelly A. S. Souza,
Teodorico C. Ramalho
Abstract:
All signals obtained as the instrumental response of an analytical apparatus are affected by noise, as in Raman spectroscopy. Because Raman scattering is an inherently weak process, the noise background can lead to misinterpretations. Although surface amplification of the Raman signal using metallic nanoparticles has been a strategy employed to partially solve the signal-to-noise problem, the pre-processing of Raman spectral data through the use of mathematical filters has become an integral part of Raman spectroscopy analysis. In this paper, a modified Tikhonov method to remove random noise in experimental data is presented. In order to refine and improve the Tikhonov method as a filter, the proposed method includes the Euclidean norm of the fractional-order derivative of the solution as an additional criterion in the Tikhonov function. In the strategy used here, the solution depends on the regularization parameter, $λ$, and on the fractional derivative order, $α$. As will be demonstrated, with the algorithm presented here, it is possible to obtain a noise-free spectrum without affecting the fidelity of the molecular signal. In this alternative, the fractional derivative works as a fine control parameter for the usual Tikhonov method. The proposed method was applied to simulated data and to surface-enhanced Raman scattering (SERS) spectra of crystal violet dye in Ag nanoparticles colloidal dispersion.
Submitted 1 April, 2022;
originally announced April 2022.
-
Calculation of the ground-state Stark effect in small molecules using the variational quantum eigensolver
Authors:
Carlos Tavares,
Sofia Oliveira,
Vitor Fernandes,
Andrei Postnikov,
Mikhail I. Vasilevskiy
Abstract:
As quantum computing approaches its first commercial implementations, quantum simulation emerges as a potentially ground-breaking technology for several domains, including Biology and Chemistry. However, taking advantage of quantum algorithms in Quantum Chemistry raises a number of theoretical and practical challenges at different levels, from the conception of an algorithm to its actual execution. We go through such challenges in a case study of a quantum simulation for the hydrogen (H2) and lithium hydride (LiH) molecules, on an actual commercially available quantum computer, the IBM Q. The former molecule has always been a playground for testing approximate calculation methods in Quantum Chemistry, while the latter is just a little bit more complex, lacking the mirror symmetry of the former. Using the Variational Quantum Eigensolver (VQE) method, we study the molecules' ground state energy versus interatomic distance, under the action of stationary electric fields (Stark effect). Additionally, we review the necessary calculations of the matrix elements of the second quantization Hamiltonian encompassing the extra terms concerning the action of electric fields, using STO-LG type atomic orbitals to build the minimal basis sets.
Submitted 19 April, 2021; v1 submitted 22 March, 2021;
originally announced March 2021.
-
Varying Alpha Generalized Dirac-Born-Infeld Models
Authors:
V. C. Tavares,
C. J. A. P. Martins
Abstract:
We study the cosmological consequences of a class of Dirac-Born-Infeld models, and assess their viability as a candidate for the recent acceleration of the Universe. The model includes both the rolling tachyon field and the generalized Chaplygin gas models as particular limits, and phenomenologically each of these provides a possible mechanism for a deviation of the value of the dark energy equation of state from its canonical (cosmological constant) value. The field-dependent potential that is characteristic of the rolling tachyon also leads to variations of the fine-structure constant $α$, implying that the model can be constrained both by standard cosmological probes and by astrophysical measurements of $α$. Our analysis, using the latest available low-redshift data and local constraints from atomic clock and weak equivalence principle experiments, shows that the two possible deviations of the dark energy equation of state are constrained to be $\log_{10}{(1+w_0)_V}<-7.85$ and $\log_{10}{(1+w_0)_C}<-0.85$, respectively for the rolling tachyon and Chaplygin components, both being at the $95.4\%$ confidence level (although the latter depends on the choice of priors, in a way that we quantify). Alternatively, the $95.4\%$ confidence level bound on the dimensionless slope of the potential is $\log_{10}λ<-5.36$. This confirms previous analyses indicating that in these models the potential needs to be extremely flat.
Submitted 21 January, 2021;
originally announced January 2021.
-
Simulation of non-radiative energy transfer in photosynthetic systems using a quantum computer
Authors:
José Diogo Guimarães,
Carlos Tavares,
Luís Soares Barbosa,
Mikhail I. Vasilevskiy
Abstract:
Photosynthesis is an important and complex physical process in nature, whose comprehensive understanding would have many relevant industrial applications, for instance in the field of energy production. In this paper we propose a quantum algorithm for the simulation of the excitonic transport of energy, occurring in the first stage of the process of photosynthesis. The algorithm takes into account the quantum and environmental effects (pure dephasing) that influence the quantum transport. We performed quantum simulations of such phenomena, for a proof-of-concept scenario, on an actual 5-qubit quantum computer, the IBM Q. We validate the results with the Haken-Ströbl model and discuss the influence of environmental parameters on the efficiency of the energy transport.
Submitted 2 September, 2020;
originally announced September 2020.
-
A Variability-Aware Design Approach to the Data Analysis Modeling Process
Authors:
Maria Cristina Vale Tavares,
Paulo Alencar,
Donald Cowan
Abstract:
The massive amount of current data has led to many different forms of data analysis processes that aim to explore this data to uncover valuable insights. Methodologies to guide the development of big data science projects, including CRISP-DM and SEMMA, have been widely used in industry and academia. The data analysis modeling phase, which involves decisions on the most appropriate models to adopt, is at the core of these projects. However, from a software engineering perspective, the design and automation of activities performed in this phase are challenging. In this paper, we propose an approach to the data analysis modeling process which involves (i) the assessment of the variability inherent in the CRISP-DM data analysis modeling phase and the provision of feature models that represent this variability; (ii) the definition of a framework structural design that captures the identified variability; and (iii) evaluation of the developed framework design in terms of the possibilities for process automation. The proposed approach advances the state of the art by offering a variability-aware design solution that can enhance system flexibility, potentially leading to novel software frameworks which can significantly improve the level of automation in the data analysis modeling process.
Submitted 25 December, 2018;
originally announced December 2018.
-
A Type System for Tom
Authors:
Claude Kirchner,
Pierre-Etienne Moreau,
Cláudia Tavares
Abstract:
Extending a given language with new dedicated features is a general and widely used approach to make the programming language better adapted to the problems at hand. Being closer to the application, this leads to fewer programming flaws and easier maintenance. But of course one would still like to perform program analysis on these kinds of extended languages, in particular type checking and inference. In this case one has to make the typing of the extended features compatible with the ones in the starting language.
The Tom programming language is a typical example of such a situation as it consists of an extension of Java that adds pattern matching, more particularly associative pattern matching, and reduction strategies.
This paper presents a type system with subtyping for Tom that is compatible with Java's type system, and that performs both type checking and type inference. We propose an algorithm that checks if all patterns of a Tom program are well-typed. In addition, we propose an algorithm based on equality and subtyping constraints that infers types of variables occurring in a pattern. Both algorithms are exemplified and the proposed type system is shown to be sound and complete.
Submitted 25 March, 2010;
originally announced March 2010.