-
Injection Optimization at Particle Accelerators via Reinforcement Learning: From Simulation to Real-World Application
Authors:
Awal Awal,
Jan Hetzel,
Ralf Gebel,
Jörg Pretz
Abstract:
Optimizing the injection process in particle accelerators is crucial for enhancing beam quality and operational efficiency. This paper presents a framework for utilizing Reinforcement Learning (RL) to optimize the injection process at accelerator facilities. By framing the optimization challenge as an RL problem, we developed an agent capable of dynamically aligning the beam's transverse space with desired targets. Our methodology leverages the Soft Actor-Critic algorithm, enhanced with domain randomization and dense neural networks, to train the agent in simulated environments with varying dynamics, prompting it to learn a generalized, robust policy. The agent was evaluated in live runs at the Cooler Synchrotron COSY, where it successfully optimized the beam cross-section, reaching human-operator level in notably less time. An empirical study further validated the importance of each architecture component in achieving a robust and generalized optimization strategy. The results demonstrate the potential of RL in automating and improving optimization tasks at particle acceleration facilities.
Submitted 18 June, 2024;
originally announced June 2024.
-
Weakly Supervised Training for Hologram Verification in Identity Documents
Authors:
Glen Pouliquen,
Guillaume Chiron,
Joseph Chazalon,
Thierry Géraud,
Ahmad Montaser Awal
Abstract:
We propose a method to remotely verify the authenticity of Optically Variable Devices (OVDs), often referred to as "holograms", in identity documents. Our method processes video clips captured with smartphones under common lighting conditions, and is evaluated on two public datasets: MIDV-HOLO and MIDV-2020. Thanks to weakly supervised training, we optimize a feature extraction and decision pipeline which achieves a new leading performance on MIDV-HOLO, while maintaining a high recall on documents from MIDV-2020 used as attack samples. It is also the first method, to date, to effectively address the photo replacement attack task, and can be trained on either genuine samples, attack samples, or both for increased performance. By enabling verification of OVD shapes and dynamics with very little supervision, this work opens the way towards the use of massive amounts of unlabeled data to build robust remote identity document verification systems on commodity smartphones. Code is available at https://github.com/EPITAResearchLab/pouliquen.24.icdar
Submitted 26 April, 2024;
originally announced April 2024.
-
Synthetic dataset of ID and Travel Document
Authors:
Carlos Boned,
Maxime Talarmain,
Nabil Ghanmi,
Guillaume Chiron,
Sanket Biswas,
Ahmad Montaser Awal,
Oriol Ramos Terrades
Abstract:
This paper presents a new synthetic dataset of ID and travel documents, called SIDTD. The SIDTD dataset is created to help train and evaluate forged ID document detection systems. Such a dataset has become a necessity, as ID documents contain personal information and a public dataset of real documents cannot be released. Moreover, forged documents are scarce compared to legitimate ones, and the way they are generated varies from one fraudster to another, resulting in a class of high intra-variability. In this paper we train state-of-the-art models on this dataset and compare them to the performance achieved on larger, but private, datasets. The creation of this dataset will help the document image analysis community progress in the task of ID document verification.
Submitted 3 January, 2024;
originally announced January 2024.
-
Detecting Natural Language Biases with Prompt-based Learning
Authors:
Md Abdul Aowal,
Maliha T Islam,
Priyanka Mary Mammen,
Sandesh Shetty
Abstract:
In this project, we want to explore the newly emerging field of prompt engineering and apply it to the downstream task of detecting LM biases. More concretely, we explore how to design prompts that can indicate 4 different types of biases: (1) gender, (2) race, (3) sexual orientation, and (4) religion-based. Within our project, we experiment with different manually crafted prompts that can draw out the subtle biases that may be present in the language model. We apply these prompts to multiple variations of popular and well-recognized models: BERT, RoBERTa, and T5 to evaluate their biases. We provide a comparative analysis of these models and assess them using a two-fold method: use human judgment to decide whether model predictions are biased and utilize model-level judgment (through further prompts) to understand if a model can self-diagnose the biases of its own prediction.
Submitted 11 September, 2023;
originally announced September 2023.
-
Optimization of the injection beam line at the Cooler Synchrotron COSY using Bayesian Optimization
Authors:
A. Awal,
J. Hetzel,
R. Gebel,
V. Kamerdzhiev,
J. Pretz
Abstract:
The complex non-linear processes in multi-dimensional parameter spaces, that are typical for an accelerator, are a natural application for machine learning algorithms. This paper reports on the use of Bayesian optimization for the optimization of the Injection Beam Line (IBL) of the Cooler Synchrotron storage ring COSY at the Forschungszentrum Jülich, Germany. Bayesian optimization is a machine learning method that optimizes a continuous objective function using limited observations. The IBL is composed of 15 quadrupoles and 28 steerers. The goal is to increase the beam intensity inside the storage ring. The results showed the effectiveness of the Bayesian optimization in achieving better/faster results compared to manual optimization.
Submitted 16 March, 2023; v1 submitted 17 February, 2023;
originally announced February 2023.
-
Bangla Natural Language Processing: A Comprehensive Analysis of Classical, Machine Learning, and Deep Learning Based Methods
Authors:
Ovishake Sen,
Mohtasim Fuad,
MD. Nazrul Islam,
Jakaria Rabbi,
Mehedi Masud,
MD. Kamrul Hasan,
Md. Abdul Awal,
Awal Ahmed Fime,
Md. Tahmid Hasan Fuad,
Delowar Sikder,
MD. Akil Raihan Iftee
Abstract:
The Bangla language is the seventh most spoken language, with 265 million native and non-native speakers worldwide. However, English is the predominant language for online resources and technical knowledge, journals, and documentation. Consequently, many Bangla-speaking people, who have limited command of English, face hurdles in using English resources. To bridge the gap between limited support and increasing demand, researchers conducted many experiments and developed valuable tools and techniques to create and process Bangla language materials. Many efforts are also ongoing to make it easy to use the Bangla language in the online and technical domains. There are some review papers to understand the past, previous, and future Bangla Natural Language Processing (BNLP) trends. The studies are mainly concentrated on the specific domains of BNLP, such as sentiment analysis, speech recognition, optical character recognition, and text summarization. There is an apparent scarcity of resources that contain a comprehensive review of the recent BNLP tools and methods. Therefore, in this paper, we present a thorough analysis of 75 BNLP research papers and categorize them into 11 categories, namely Information Extraction, Machine Translation, Named Entity Recognition, Parsing, Parts of Speech Tagging, Question Answering System, Sentiment Analysis, Spam and Fake Detection, Text Summarization, Word Sense Disambiguation, and Speech Processing and Recognition. We study articles published between 1999 and 2021, and 50% of the papers were published after 2015. Furthermore, we discuss Classical, Machine Learning and Deep Learning approaches with different datasets while addressing the limitations and current and future trends of the BNLP.
Submitted 9 April, 2022; v1 submitted 31 May, 2021;
originally announced May 2021.
-
Acute Lymphoblastic Leukemia Detection from Microscopic Images Using Weighted Ensemble of Convolutional Neural Networks
Authors:
Chayan Mondal,
Md. Kamrul Hasan,
Md. Tasnim Jawad,
Aishwariya Dutta,
Md. Rabiul Islam,
Md. Abdul Awal,
Mohiuddin Ahmad
Abstract:
Acute Lymphoblastic Leukemia (ALL) is a blood cell cancer characterized by numerous immature lymphocytes. Even though automation in ALL prognosis is an essential aspect of cancer diagnosis, it is challenging due to the morphological correlation between malignant and normal cells. The traditional ALL classification strategy demands experienced pathologists to carefully read the cell images, which is arduous, time-consuming, and often suffers from inter-observer variation. This article automates the ALL detection task from microscopic cell images, employing deep Convolutional Neural Networks (CNNs). We explore the weighted ensemble of different deep CNNs to recommend a better ALL cell classifier. The weights for the ensemble candidate models are estimated from their corresponding metrics, such as accuracy, F1-score, AUC, and kappa values. Various data augmentations and pre-processing are incorporated for achieving a better generalization of the network. We utilize the publicly available C-NMC-2019 ALL dataset to conduct all the comprehensive experiments. Our proposed weighted ensemble model, using the kappa values of the ensemble candidates as their weights, achieved a weighted F1-score of 88.6%, a balanced accuracy of 86.2%, and an AUC of 0.941 on the preliminary test set. The qualitative results displaying the gradient class activation maps confirm that the introduced model has a concentrated learned region. In contrast, the ensemble candidate models, such as Xception, VGG-16, DenseNet-121, MobileNet, and InceptionResNet-V2, separately produce coarse and scattered learned areas for most example cases. Since the proposed kappa value-based weighted ensemble yields a better result for the aimed task in this article, it can be tried in other domains of medical diagnostic applications.
Submitted 9 May, 2021;
originally announced May 2021.
-
Human Activity Analysis and Recognition from Smartphones using Machine Learning Techniques
Authors:
Jakaria Rabbi,
Md. Tahmid Hasan Fuad,
Md. Abdul Awal
Abstract:
Human Activity Recognition (HAR) is considered a valuable research topic in the last few decades. Different types of machine learning models are used for this purpose, and this is a part of analyzing human behavior through machines. It is not a trivial task to analyze the data from wearable sensors for complex and high dimensions. Nowadays, researchers mostly use smartphones or smart home sensors to capture these data. In our paper, we analyze these data using machine learning models to recognize human activities, which are now widely used for many purposes such as physical and mental health monitoring. We apply different machine learning models and compare performances. We use Logistic Regression (LR) as the benchmark model for its simplicity and excellent performance on a dataset, and to compare, we take Decision Tree (DT), Support Vector Machine (SVM), Random Forest (RF), and Artificial Neural Network (ANN). Additionally, we select the best set of parameters for each model by grid search. We use the HAR dataset from the UCI Machine Learning Repository as a standard dataset to train and test the models. Throughout the analysis, we can see that the Support Vector Machine performed (average accuracy 96.33%) far better than the other methods. We also prove that the results are statistically significant by employing statistical significance test methods.
Submitted 30 March, 2021;
originally announced March 2021.
-
Transient Stability Assessment for Current Constrained and Unconstrained Fault Ride-Through in Virtual Oscillator Controlled Converters
Authors:
M A Awal,
Iqbal Husain
Abstract:
Unified virtual oscillator controller (uVOC) inherits the rigorous analytical foundation offered by oscillator based grid-forming (GFM) controllers and enables fast over-current limiting and fault ride-through (FRT). Control design for effective FRT requires transient stability analysis. Existing transient stability analysis methods and studies are limited in either considering only current unconstrained scenarios or neglecting the simultaneous power angle and voltage dynamics. Under current-constrained faults, the voltage and power angle dynamics are strongly coupled and both play critical roles in determining transient stability. Therefore, decoupled analysis of the two, typically used in transient stability studies, does not offer comprehensive insight into the system dynamics. In this work, the overall FRT method for uVOC is presented and a comprehensive modeling and analysis method for transient stability is developed under both current-saturated and unsaturated symmetrical AC faults. We utilize phase-plane analysis of the overall system in a single graphical representation to obtain holistic insights into the coupled voltage and power angle dynamics. The FRT controller and the analysis method have been validated through simulations and hardware experiments. The results demonstrate that uVOC is not constrained by a critical clearing angle unlike droop and virtual synchronous machine (VSM) type second order controllers.
Submitted 18 November, 2020;
originally announced November 2020.
-
Unified Virtual Oscillator Control for Grid-Forming and Grid-Following Converters
Authors:
M A Awal,
Iqbal Husain
Abstract:
A unified virtual oscillator controller (uVOC) is proposed, which enables a unified analysis, design, and implementation framework for both grid-forming (GFM) and grid-following (GFL) voltage source converters (VSCs). Oscillator based GFM controllers, such as dispatchable virtual oscillator control (dVOC), offer a rigorous analytical framework with enhanced synchronization, but lack effective fault handling capability, which severely limits practical application. The proposed uVOC facilitates synchronization with an arbitrarily low grid voltage and fast over-current limiting; this enables effective fault ride-through, unlike existing GFM controllers, which typically switch to a back-up controller during a fault. GFM operation with uVOC is achieved in both grid connected and islanded modes with seamless transition between the two. In GFL converters, bidirectional power flow control and DC bus voltage regulation are achieved with uVOC. No phase-locked-loop (PLL) is required for either GFL or GFM operation, circumventing the synchronization issues associated with PLLs in weak grid applications. Detailed small-signal models for GFM and GFL operation have been developed and systematic design guidelines for controller parameters are provided. The proposed controller is validated through hardware experiments in a hybrid AC-DC microgrid.
Submitted 12 July, 2020;
originally announced July 2020.
-
Modular Medium Voltage AC to Low Voltage DC Converter for Extreme Fast Charging Applications
Authors:
M A Awal,
Iqbal Husain,
Md Rashed Hassan Bipu,
Oscar Andreas Montes,
Fei Teng,
Hao Feng,
Mehnaz Khan,
Srdjan Lukic
Abstract:
A modular and scalable converter for medium voltage (MV) AC to low voltage (LV) DC power conversion is proposed; single-phase-modules (SPMs), each consisting of an active-front-end (AFE) stage and an isolated DC-DC stage, are connected in input-series-output-parallel (ISOP) configuration to reach desired voltage and power capacity. In prior art, high-speed bidirectional communication among modules and a centralized controller is required to ensure module-level voltage and power balancing, which severely limits the scalability and practical realization of higher voltage and higher power systems. Moreover, large capacitors are used to suppress double-line-frequency voltage variations on the common MV DC bus shared by the AFE and the DC-DC stage originating from AC power pulsations through the SPMs. We propose a comprehensive controller which achieves voltage and power balancing using complete decentralized control of the DC-DC stages based on only local sensor feedback and the AFE stages are controlled using feedback of only the LV DC output. Furthermore, reduced capacitor requirement on the MV DC bus is achieved through design and control. The proposed method is validated through simulation and experimental results.
Submitted 8 July, 2020;
originally announced July 2020.
-
Abnormality Detection and Localization in Chest X-Rays using Deep Convolutional Neural Networks
Authors:
Mohammad Tariqul Islam,
Md Abdul Aowal,
Ahmed Tahseen Minhaz,
Khalid Ashraf
Abstract:
Chest X-Rays (CXRs) are widely used for diagnosing abnormalities in the heart and lung area. Automatically detecting these abnormalities with high accuracy could greatly enhance real world diagnosis processes. The lack of standard publicly available datasets and benchmark studies, however, makes it difficult to compare various detection methods. In order to overcome these difficulties, we have used the publicly available Indiana CXR, JSRT and Shenzhen datasets and studied the performance of known deep convolutional network (DCN) architectures on different abnormalities. We find that the same DCN architecture doesn't perform well across all abnormalities. Shallow features or earlier layers consistently provide higher detection accuracy compared to deep features. We have also found ensemble models to improve classification significantly compared to a single model. Combining these insights, we report the highest accuracy on chest X-Ray abnormality detection on these datasets. We find that for cardiomegaly detection, the deep learning method improves the accuracy by a staggering 17 percentage points compared to rule based methods. We applied the techniques to the problem of tuberculosis detection on a different dataset and achieved the highest accuracy. Our localization experiments using these trained classifiers show that for spatially spread out abnormalities like cardiomegaly and pulmonary edema, the network can localize the abnormalities successfully most of the time. One remarkable result of the cardiomegaly localization is that the heart and its surrounding region is most responsible for cardiomegaly detection, in contrast to the rule based models where the ratio of heart and lung area is used as the measure. We believe that through deep learning based classification and localization, we will discover many more interesting features in medical image diagnosis that are not considered traditionally.
Submitted 27 September, 2017; v1 submitted 27 May, 2017;
originally announced May 2017.
-
A Nearby Old Halo White Dwarf Candidate from the Sloan Digital Sky Survey
Authors:
Patrick B. Hall,
Piotr M. Kowalski,
Hugh C. Harris,
Akshay Awal,
S. K. Leggett,
Mukremin Kilic,
Scott F. Anderson,
Evalyn Gates
Abstract:
We report the discovery of a nearby, old, halo white dwarf candidate from the Sloan Digital Sky Survey. SDSS J110217.48+411315.4 has a proper motion of 1.75 arcsec/year and redder optical colors than all other known featureless (type DC) white dwarfs. We present SDSS imaging and spectroscopy of this object, along with near-infrared photometry obtained at the United Kingdom Infra-Red Telescope. Fitting its photometry with up-to-date model atmospheres, we find that its overall spectral energy distribution is fit reasonably well with a pure hydrogen composition and T_eff~3800 K (assuming log g=8). That temperature and gravity would place this white dwarf at 35 pc from the Sun with a tangential velocity of 290 km/s and space velocities consistent with halo membership; furthermore, its combined main sequence and white dwarf cooling age would be ~11 Gyr. However, if this object is a massive white dwarf, it could be a younger object with a thick disk origin. Whatever its origin, the optical colors of this object are redder than predicted by any current pure hydrogen, pure helium or mixed hydrogen-helium atmospheric model, indicating that there remain problems in our understanding of the complicated physics of the dense atmospheres of cool white dwarfs.
Submitted 9 April, 2008;
originally announced April 2008.