-
General one-loop contributions to the decay $H\rightarrow ν_l\barν_lγ$
Authors:
Khiem Hong Phan,
Dzung Tri Tran,
Le Tho Hue
Abstract:
General one-loop contributions to the decay amplitudes $H\rightarrow ν_l\barν_lγ$ are presented, considering all possible contributions of additional heavy vector gauge bosons, fermions, and charged (and also neutral) scalar particles appearing in the loop diagrams. Moreover, the results can be applied directly when extra neutrinos (apart from three ones in standard model) are taken into account i…
▽ More
General one-loop contributions to the decay amplitudes $H\rightarrow ν_l\barν_lγ$ are presented, considering all possible contributions of additional heavy vector gauge bosons, fermions, and charged (and also neutral) scalar particles appearing in the loop diagrams. Moreover, the results can be applied directly when extra neutrinos (apart from three ones in standard model) are taken into account in final states. Analytic results are presented in terms of Passarino-Veltman scalar functions which can be evaluated numerically using {\tt LoopTools}. In the standard model framework, these analytical results are generated and cross-checked with previous computations. We find that our results are well consistent with these computations. Within standard model limit, phenomenological results for the decay channels are also studied using the updated input parameters at the Large Hadron Collider.
△ Less
Submitted 28 June, 2021;
originally announced June 2021.
-
Long-Short Temporal Contrastive Learning of Video Transformers
Authors:
Jue Wang,
Gedas Bertasius,
Du Tran,
Lorenzo Torresani
Abstract:
Video transformers have recently emerged as a competitive alternative to 3D CNNs for video understanding. However, due to their large number of parameters and reduced inductive biases, these models require supervised pretraining on large-scale image datasets to achieve top performance. In this paper, we empirically demonstrate that self-supervised pretraining of video transformers on video-only da…
▽ More
Video transformers have recently emerged as a competitive alternative to 3D CNNs for video understanding. However, due to their large number of parameters and reduced inductive biases, these models require supervised pretraining on large-scale image datasets to achieve top performance. In this paper, we empirically demonstrate that self-supervised pretraining of video transformers on video-only datasets can lead to action recognition results that are on par or better than those obtained with supervised pretraining on large-scale image datasets, even massive ones such as ImageNet-21K. Since transformer-based models are effective at capturing dependencies over extended temporal spans, we propose a simple learning procedure that forces the model to match a long-term view to a short-term view of the same video. Our approach, named Long-Short Temporal Contrastive Learning (LSTCL), enables video transformers to learn an effective clip-level representation by predicting temporal context captured from a longer temporal extent. To demonstrate the generality of our findings, we implement and validate our approach under three different self-supervised contrastive learning frameworks (MoCo v3, BYOL, SimSiam) using two distinct video-transformer architectures, including an improved variant of the Swin Transformer augmented with space-time attention. We conduct a thorough ablation study and show that LSTCL achieves competitive performance on multiple video benchmarks and represents a convincing alternative to supervised image-based pretraining.
△ Less
Submitted 31 March, 2022; v1 submitted 16 June, 2021;
originally announced June 2021.
-
Revisiting the Calibration of Modern Neural Networks
Authors:
Matthias Minderer,
Josip Djolonga,
Rob Romijnders,
Frances Hubis,
Xiaohua Zhai,
Neil Houlsby,
Dustin Tran,
Mario Lucic
Abstract:
Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically…
▽ More
Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically relate model calibration and accuracy, and find that the most recent models, notably those not using convolutions, are among the best calibrated. Trends observed in prior model generations, such as decay of calibration with distribution shift or model size, are less pronounced in recent architectures. We also show that model size and amount of pretraining do not fully explain these differences, suggesting that architecture is a major determinant of calibration properties.
△ Less
Submitted 26 October, 2021; v1 submitted 15 June, 2021;
originally announced June 2021.
-
Satellite- and Cache-assisted UAV: A Joint Cache Placement, Resource Allocation, and Trajectory Optimization for 6G Aerial Networks
Authors:
Dinh-Hieu Tran,
Symeon Chatzinotas,
Björn Ottersten
Abstract:
This paper considers LEO satellite- and cache-assisted UAV communications for content delivery in terrestrial networks, which shows great potential for next-generation systems to provide ubiquitous connectivity and high capacity. Specifically, caching is provided by the UAV to reduce backhaul congestion, and the LEO satellite supports the UAV's backhaul link. In this context, we aim to maximize th…
▽ More
This paper considers LEO satellite- and cache-assisted UAV communications for content delivery in terrestrial networks, which shows great potential for next-generation systems to provide ubiquitous connectivity and high capacity. Specifically, caching is provided by the UAV to reduce backhaul congestion, and the LEO satellite supports the UAV's backhaul link. In this context, we aim to maximize the minimum achievable throughput per ground user (GU) by jointly optimizing cache placement, the UAV's resource allocation, and trajectory while cache capacity and flight time are limited. The formulated problem is challenging to solve directly due to its non-convexity and combinatorial nature. To find a solution, the problem is decomposed into three sub-problems: (1) cache placement optimization with fixed UAV resources and trajectory, followed by (2) the UAV resources optimization with fixed cache placement vector and trajectory, and finally, (3) we optimize the UAV trajectory with fixed cache placement and UAV resources. Based on the solutions of sub-problems, an efficient alternating algorithm is proposed utilizing the block coordinate descent (BCD) and successive convex approximation (SCA) methods. Simulation results show that the max-min throughput and total achievable throughput enhancement can be achieved by applying our proposed algorithm instead of other benchmark schemes.
△ Less
Submitted 9 June, 2021;
originally announced June 2021.
-
Uncertainty Baselines: Benchmarks for Uncertainty & Robustness in Deep Learning
Authors:
Zachary Nado,
Neil Band,
Mark Collier,
Josip Djolonga,
Michael W. Dusenberry,
Sebastian Farquhar,
Qixuan Feng,
Angelos Filos,
Marton Havasi,
Rodolphe Jenatton,
Ghassen Jerfel,
Jeremiah Liu,
Zelda Mariet,
Jeremy Nixon,
Shreyas Padhy,
Jie Ren,
Tim G. J. Rudner,
Faris Sbahi,
Yeming Wen,
Florian Wenzel,
Kevin Murphy,
D. Sculley,
Balaji Lakshminarayanan,
Jasper Snoek,
Yarin Gal
, et al. (1 additional authors not shown)
Abstract:
High-quality estimates of uncertainty and robustness are crucial for numerous real-world applications, especially for deep learning which underlies many deployed ML systems. The ability to compare techniques for improving these estimates is therefore very important for research and practice alike. Yet, competitive comparisons of methods are often lacking due to a range of reasons, including: compu…
▽ More
High-quality estimates of uncertainty and robustness are crucial for numerous real-world applications, especially for deep learning which underlies many deployed ML systems. The ability to compare techniques for improving these estimates is therefore very important for research and practice alike. Yet, competitive comparisons of methods are often lacking due to a range of reasons, including: compute availability for extensive tuning, incorporation of sufficiently many baselines, and concrete documentation for reproducibility. In this paper we introduce Uncertainty Baselines: high-quality implementations of standard and state-of-the-art deep learning methods on a variety of tasks. As of this writing, the collection spans 19 methods across 9 tasks, each with at least 5 metrics. Each baseline is a self-contained experiment pipeline with easily reusable and extendable components. Our goal is to provide immediate starting points for experimentation with new methods or applications. Additionally we provide model checkpoints, experiment outputs as Python notebooks, and leaderboards for comparing results. Code available at https://github.com/google/uncertainty-baselines.
△ Less
Submitted 5 January, 2022; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Fully Symmetric Relativistic Quantum Mechanics and Its Physical Implications
Authors:
Bao D. Tran,
Zdzislaw E. Musielak
Abstract:
A new formulation of relativistic quantum mechanics is presented and applied to a free, massive, and spin zero elementary particle in the Minkowski spacetime. The reformulation requires that time and space, as well as the timelike and spacelike intervals, are treated equally, which makes the new theory fully symmetric and consistent with the Special Theory of Relativity. The theory correctly repro…
▽ More
A new formulation of relativistic quantum mechanics is presented and applied to a free, massive, and spin zero elementary particle in the Minkowski spacetime. The reformulation requires that time and space, as well as the timelike and spacelike intervals, are treated equally, which makes the new theory fully symmetric and consistent with the Special Theory of Relativity. The theory correctly reproduces the classical action of a relativistic particle in the path integral formalism, and allows for the introduction of a new quantity called vector-mass, whose physical implications for nonlocality, the uncertainty principle, and quantum vacuum are described and discussed.
△ Less
Submitted 31 May, 2021;
originally announced June 2021.
-
Synthesis and Tailored Properties Towards Designer Covalent Organic Framework Thin Films and Heterostructures
Authors:
Lucas K. Beagle,
Qiyi Fang,
Ly D. Tran,
Luke A. Baldwin,
Christopher Muratore,
Jun Lou,
Nicholas R. Glavin
Abstract:
Porous polymeric covalent organic frameworks (COFs) have been under intense synthetic investigation with over 100 unique structural motifs known. In order to realize the true potential of these materials, converting the powders into thin films with strict control of thickness and morphology is necessary and accomplished through techniques including interfacial synthesis, chemical exfoliation and m…
▽ More
Porous polymeric covalent organic frameworks (COFs) have been under intense synthetic investigation with over 100 unique structural motifs known. In order to realize the true potential of these materials, converting the powders into thin films with strict control of thickness and morphology is necessary and accomplished through techniques including interfacial synthesis, chemical exfoliation and mechanical delamination. Recent progress in the construction and tailored properties of thin film COFs are highlighted in this review, addressing mechanical properties as well as application-focused properties in filtration, electronics, sensors, electrochemical, magnetics, optoelectronics and beyond. Additionally, heterogeneous integration of these thin films with other inorganic and organic materials is discussed, revealing exciting opportunities to integrate COF thin films with other state of the art material and device systems.
△ Less
Submitted 26 May, 2021;
originally announced May 2021.
-
Multiple Meta-model Quantifying for Medical Visual Question Answering
Authors:
Tuong Do,
Binh X. Nguyen,
Erman Tjiputra,
Minh Tran,
Quang D. Tran,
Anh Nguyen
Abstract:
Transfer learning is an important step to extract meaningful features and overcome the data limitation in the medical Visual Question Answering (VQA) task. However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized. In this paper, we present a new multiple meta-model quantifying method that effectively…
▽ More
Transfer learning is an important step to extract meaningful features and overcome the data limitation in the medical Visual Question Answering (VQA) task. However, most of the existing medical VQA methods rely on external data for transfer learning, while the meta-data within the dataset is not fully utilized. In this paper, we present a new multiple meta-model quantifying method that effectively learns meta-annotation and leverages meaningful features to the medical VQA task. Our proposed method is designed to increase meta-data by auto-annotation, deal with noisy labels, and output meta-models which provide robust features for medical VQA tasks. Extensively experimental results on two public medical VQA datasets show that our approach achieves superior accuracy in comparison with other state-of-the-art methods, while does not require external data to train meta-models.
△ Less
Submitted 26 June, 2021; v1 submitted 19 May, 2021;
originally announced May 2021.
-
Elementary Methods for Infinite Resistive Networks with Complex Topologies
Authors:
Tung X. Tran,
Linh K. Nguyen,
Quan M. Nguyen,
Chinh D. Tran,
Truong H. Cai,
Trung Phan
Abstract:
Finding the equivalent resistance of an infinite ladder circuit is a classical problem in physics. We expand this well-known challenge to new classes of network topologies, in which the unit cells are much more entangled together. The exact analytical results there can still be obtained with elementary methods. These topology classes will add layers of complexity and much more diversity to a very…
▽ More
Finding the equivalent resistance of an infinite ladder circuit is a classical problem in physics. We expand this well-known challenge to new classes of network topologies, in which the unit cells are much more entangled together. The exact analytical results there can still be obtained with elementary methods. These topology classes will add layers of complexity and much more diversity to a very popular kind of physics puzzles for teachers and students.
△ Less
Submitted 10 May, 2021; v1 submitted 8 May, 2021;
originally announced May 2021.
-
BERT-CoQAC: BERT-based Conversational Question Answering in Context
Authors:
Munazza Zaib,
Dai Hoang Tran,
Subhash Sagar,
Adnan Mahmood,
Wei E. Zhang,
Quan Z. Sheng
Abstract:
As one promising way to inquire about any particular information through a dialog with the bot, question answering dialog systems have gained increasing research interests recently. Designing interactive QA systems has always been a challenging task in natural language processing and used as a benchmark to evaluate a machine's ability of natural language understanding. However, such systems often…
▽ More
As one promising way to inquire about any particular information through a dialog with the bot, question answering dialog systems have gained increasing research interests recently. Designing interactive QA systems has always been a challenging task in natural language processing and used as a benchmark to evaluate a machine's ability of natural language understanding. However, such systems often struggle when the question answering is carried out in multiple turns by the users to seek more information based on what they have already learned, thus, giving rise to another complicated form called Conversational Question Answering (CQA). CQA systems are often criticized for not understanding or utilizing the previous context of the conversation when answering the questions. To address the research gap, in this paper, we explore how to integrate conversational history into the neural machine comprehension system. On one hand, we introduce a framework based on a publically available pre-trained language model called BERT for incorporating history turns into the system. On the other hand, we propose a history selection mechanism that selects the turns that are relevant and contributes the most to answer the current question. Experimentation results revealed that our framework is comparable in performance with the state-of-the-art models on the QuAC leader board. We also conduct a number of experiments to show the side effects of using entire context information which brings unnecessary information and noise signals resulting in a decline in the model's performance.
△ Less
Submitted 22 April, 2021;
originally announced April 2021.
-
Graph-based Person Signature for Person Re-Identifications
Authors:
Binh X. Nguyen,
Binh D. Nguyen,
Tuong Do,
Erman Tjiputra,
Quang D. Tran,
Anh Nguyen
Abstract:
The task of person re-identification (ReID) is to match images of the same person over multiple non-overlap** camera views. Due to the variations in visual factors, previous works have investigated how the person identity, body parts, and attributes benefit the person ReID problem. However, the correlations between attributes, body parts, and within each attribute are not fully utilized. In this…
▽ More
The task of person re-identification (ReID) is to match images of the same person over multiple non-overlap** camera views. Due to the variations in visual factors, previous works have investigated how the person identity, body parts, and attributes benefit the person ReID problem. However, the correlations between attributes, body parts, and within each attribute are not fully utilized. In this paper, we propose a new method to effectively aggregate detailed person descriptions (attributes labels) and visual features (body parts and global features) into a graph, namely Graph-based Person Signature, and utilize Graph Convolutional Networks to learn the topological structure of the visual signature of a person. The graph is integrated into a multi-branch multi-task framework for person re-identification. The extensive experiments are conducted to demonstrate the effectiveness of our proposed approach on two large-scale datasets, including Market-1501 and DukeMTMC-ReID. Our approach achieves competitive results among the state of the art and outperforms other attribute-based or mask-guided methods.
△ Less
Submitted 17 April, 2021; v1 submitted 14 April, 2021;
originally announced April 2021.
-
Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation
Authors:
Weiyao Wang,
Matt Feiszli,
Heng Wang,
Du Tran
Abstract:
Current state-of-the-art object detection and segmentation methods work well under the closed-world assumption. This closed-world setting assumes that the list of object categories is available during training and deployment. However, many real-world applications require detecting or segmenting novel objects, i.e., object categories never seen during training. In this paper, we present, UVO (Unide…
▽ More
Current state-of-the-art object detection and segmentation methods work well under the closed-world assumption. This closed-world setting assumes that the list of object categories is available during training and deployment. However, many real-world applications require detecting or segmenting novel objects, i.e., object categories never seen during training. In this paper, we present, UVO (Unidentified Video Objects), a new benchmark for open-world class-agnostic object segmentation in videos. Besides shifting the problem focus to the open-world setup, UVO is significantly larger, providing approximately 8 times more videos compared with DAVIS, and 7 times more mask (instance) annotations per video compared with YouTube-VOS and YouTube-VIS. UVO is also more challenging as it includes many videos with crowded scenes and complex background motions. We demonstrated that UVO can be used for other applications, such as object tracking and super-voxel segmentation, besides open-world object segmentation. We believe that UVo is a versatile testbed for researchers to develop novel approaches for open-world class-agnostic object segmentation, and inspires new research directions towards a more comprehensive video understanding beyond classification and detection.
△ Less
Submitted 10 April, 2021;
originally announced April 2021.
-
Knowledge Distillation By Sparse Representation Matching
Authors:
Dat Thanh Tran,
Moncef Gabbouj,
Alexandros Iosifidis
Abstract:
Knowledge Distillation refers to a class of methods that transfers the knowledge from a teacher network to a student network. In this paper, we propose Sparse Representation Matching (SRM), a method to transfer intermediate knowledge obtained from one Convolutional Neural Network (CNN) to another by utilizing sparse representation learning. SRM first extracts sparse representations of the hidden f…
▽ More
Knowledge Distillation refers to a class of methods that transfers the knowledge from a teacher network to a student network. In this paper, we propose Sparse Representation Matching (SRM), a method to transfer intermediate knowledge obtained from one Convolutional Neural Network (CNN) to another by utilizing sparse representation learning. SRM first extracts sparse representations of the hidden features of the teacher CNN, which are then used to generate both pixel-level and image-level labels for training intermediate feature maps of the student network. We formulate SRM as a neural processing block, which can be efficiently optimized using stochastic gradient descent and integrated into any CNN in a plug-and-play manner. Our experiments demonstrate that SRM is robust to architectural differences between the teacher and student networks, and outperforms other KD techniques across several datasets.
△ Less
Submitted 31 March, 2021;
originally announced March 2021.
-
Proton-$\rm ^3He$ elastic scattering at intermediate energies
Authors:
A. Watanabe,
S. Nakai,
Y. Wada,
K. Sekiguchi,
A. Deltuva,
T. Akieda,
D. Etoh,
M. Inoue,
Y. Inoue,
K. Kawahara,
H. Kon,
K. Miki,
T. Mukai,
D. Sakai,
S. Shibuya,
Y. Shiokawa,
T. Taguchi,
H. Umetsu,
Y. Utsuki,
M. Watanabe,
S. Goto,
K. Hatanaka,
Y. Hirai,
T. Ino,
D. Inomoto
, et al. (20 additional authors not shown)
Abstract:
We present a precise measurement of the cross section, proton and $\rm ^3He$ analyzing powers, and spin correlation coefficient $C_{y,y}$ for $p$-$\rm ^3He$ elastic scattering near 65 MeV, and a comparison with rigorous four-nucleon scattering calculations based on realistic nuclear potentials and a model with $Δ$-isobar excitation. Clear discrepancies are seen in some of the measured observables…
▽ More
We present a precise measurement of the cross section, proton and $\rm ^3He$ analyzing powers, and spin correlation coefficient $C_{y,y}$ for $p$-$\rm ^3He$ elastic scattering near 65 MeV, and a comparison with rigorous four-nucleon scattering calculations based on realistic nuclear potentials and a model with $Δ$-isobar excitation. Clear discrepancies are seen in some of the measured observables in the regime around the cross section minimum. Theoretical predictions using scaling relations between the calculated cross section and the $\rm ^3 He$ binding energy are not successful in reproducing the data. Large sensitivity to the $NN$ potentials and rather small $Δ$-isobar effects in the calculated cross section are noticed as different features from those in the deuteron-proton elastic scattering. The results obtained above indicate that $p$-$\rm ^3He$ scattering at intermediate energies is an excellent tool to explore nuclear interactions not accessible by three-nucleon scattering.
△ Less
Submitted 26 March, 2021;
originally announced March 2021.
-
One-loop $W$ boson contributions to the decay $H\rightarrow Zγ$ in the general $R_ξ$ gauge
Authors:
Dzung Tri Tran,
Le Tho Hue,
Khiem Hong Phan
Abstract:
One-loop $W$ boson contributions to the decay $H\rightarrow Zγ$ in the general $R_ξ$ gauge are presented. The analytical results are expressed in terms of well-known Passarino-Veltman functions which their numerical evaluations can be generated using {\tt LoopTools}. In the limit $d\rightarrow 4$, we have shown that these analytical results are independent of the unphysical parameter $ξ$ and consi…
▽ More
One-loop $W$ boson contributions to the decay $H\rightarrow Zγ$ in the general $R_ξ$ gauge are presented. The analytical results are expressed in terms of well-known Passarino-Veltman functions which their numerical evaluations can be generated using {\tt LoopTools}. In the limit $d\rightarrow 4$, we have shown that these analytical results are independent of the unphysical parameter $ξ$ and consistent with previous results. The gauge parameter independence are also checked numerically for consistence. Our results are also well stable with different values of $ξ=0, 1, 100,$ and $ξ\rightarrow \infty$.
△ Less
Submitted 30 June, 2021; v1 submitted 25 March, 2021;
originally announced March 2021.
-
One-loop form factors for $H\rightarrow γ^*γ^*$ in $R_ξ$ gauge
Authors:
Khiem Hong Phan,
Dzung Tri Tran
Abstract:
In this paper, we present general one-loop form factors for $H\rightarrow γ^* γ^*$ in $R_ξ$ gauge, considering all cases of two on-shell, one on-shell and two off-shell for final photons. The calculations are performed in standard model and in arbitrary beyond the standard models which charged scalar particles may be exchanged in one-loop diagrams. Analytic results for the form factors are shown i…
▽ More
In this paper, we present general one-loop form factors for $H\rightarrow γ^* γ^*$ in $R_ξ$ gauge, considering all cases of two on-shell, one on-shell and two off-shell for final photons. The calculations are performed in standard model and in arbitrary beyond the standard models which charged scalar particles may be exchanged in one-loop diagrams. Analytic results for the form factors are shown in general forms which are expressed in terms of the Passarino-Veltman functions. We also confirm the results in previous computations which are available for the case of two on-shell photons. The $ξ$-independent of the result is also discussed. We find that numerical results are good stability with varying $ξ=0,1$ and $ξ\rightarrow \infty$.
△ Less
Submitted 18 March, 2021;
originally announced March 2021.
-
RecSim NG: Toward Principled Uncertainty Modeling for Recommender Ecosystems
Authors:
Martin Mladenov,
Chih-Wei Hsu,
Vihan Jain,
Eugene Ie,
Christopher Colby,
Nicolas Mayoraz,
Hubert Pham,
Dustin Tran,
Ivan Vendrov,
Craig Boutilier
Abstract:
The development of recommender systems that optimize multi-turn interaction with users, and model the interactions of different agents (e.g., users, content providers, vendors) in the recommender ecosystem have drawn increasing attention in recent years. Develo** and training models and algorithms for such recommenders can be especially difficult using static datasets, which often fail to offer…
▽ More
The development of recommender systems that optimize multi-turn interaction with users, and model the interactions of different agents (e.g., users, content providers, vendors) in the recommender ecosystem have drawn increasing attention in recent years. Develo** and training models and algorithms for such recommenders can be especially difficult using static datasets, which often fail to offer the types of counterfactual predictions needed to evaluate policies over extended horizons. To address this, we develop RecSim NG, a probabilistic platform for the simulation of multi-agent recommender systems. RecSim NG is a scalable, modular, differentiable simulator implemented in Edward2 and TensorFlow. It offers: a powerful, general probabilistic programming language for agent-behavior specification; tools for probabilistic inference and latent-variable model learning, backed by automatic differentiation and tracing; and a TensorFlow-based runtime for running simulations on accelerated hardware. We describe RecSim NG and illustrate how it can be used to create transparent, configurable, end-to-end models of a recommender ecosystem, complemented by a small set of simple use cases that demonstrate how RecSim NG can help both researchers and practitioners easily develop and train novel algorithms for recommender systems.
△ Less
Submitted 14 March, 2021;
originally announced March 2021.
-
Robust and generalizable embryo selection based on artificial intelligence and time-lapse image sequences
Authors:
Jørgen Berntsen,
Jens Rimestad,
Jacob Theilgaard Lassen,
Dang Tran,
Mikkel Fly Kragh
Abstract:
Assessing and selecting the most viable embryos for transfer is an essential part of in vitro fertilization (IVF). In recent years, several approaches have been made to improve and automate the procedure using artificial intelligence (AI) and deep learning. Based on images of embryos with known implantation data (KID), AI models have been trained to automatically score embryos related to their cha…
▽ More
Assessing and selecting the most viable embryos for transfer is an essential part of in vitro fertilization (IVF). In recent years, several approaches have been made to improve and automate the procedure using artificial intelligence (AI) and deep learning. Based on images of embryos with known implantation data (KID), AI models have been trained to automatically score embryos related to their chance of achieving a successful implantation. However, as of now, only limited research has been conducted to evaluate how embryo selection models generalize to new clinics and how they perform in subgroup analyses across various conditions. In this paper, we investigate how a deep learning-based embryo selection model using only time-lapse image sequences performs across different patient ages and clinical conditions, and how it correlates with traditional morphokinetic parameters. The model was trained and evaluated based on a large dataset from 18 IVF centers consisting of 115,832 embryos, of which 14,644 embryos were transferred KID embryos. In an independent test set, the AI model sorted KID embryos with an area under the curve (AUC) of a receiver operating characteristic curve of 0.67 and all embryos with an AUC of 0.95. A clinic hold-out test showed that the model generalized to new clinics with an AUC range of 0.60-0.75 for KID embryos. Across different subgroups of age, insemination method, incubation time, and transfer protocol, the AUC ranged between 0.63 and 0.69. Furthermore, model predictions correlated positively with blastocyst grading and negatively with direct cleavages. The fully automated iDAScore v1.0 model was shown to perform at least as good as a state-of-the-art manual embryo selection model. Moreover, full automatization of embryo scoring implies fewer manual evaluations and eliminates biases due to inter- and intraobserver variation.
△ Less
Submitted 12 March, 2021;
originally announced March 2021.
-
Observation of the near-threshold intruder $0^-$ resonance in $^{12}$Be
Authors:
J. Chen,
S. M. Wang,
H. T. Fortune,
J. L. Lou,
Y. L. Ye,
Z. H. Li,
N. Michel,
J. G. Li,
C. X. Yuan,
Y. C. Ge,
Q. T. Li,
H. Hua,
D. X. Jiang,
X. F. Yang,
D. Y. Pang,
F. R. Xu,
W. Zuo,
J. C. Pei,
J. Li,
W. Jiang,
Y. L. Sun,
H. L. Zang,
N. Aoi,
H. J. Ong,
E. Ideguchi
, et al. (12 additional authors not shown)
Abstract:
A resonant state at $3.21^{+0.12}_{-0.04}$\,MeV, located just above the one-neutron separation threshold, was observed for the first time in $^{12}$Be from the $^{11}$Be\,$(d,p)^{12}$Be one-neutron transfer reaction in inverse kinematics. This state is assigned a spin-parity of $0^-$, according to the distorted-wave Born approximation (DWBA) and decay-width analysis. Gamow coupled-channel (GCC) an…
▽ More
A resonant state at $3.21^{+0.12}_{-0.04}$\,MeV, located just above the one-neutron separation threshold, was observed for the first time in $^{12}$Be from the $^{11}$Be\,$(d,p)^{12}$Be one-neutron transfer reaction in inverse kinematics. This state is assigned a spin-parity of $0^-$, according to the distorted-wave Born approximation (DWBA) and decay-width analysis. Gamow coupled-channel (GCC) and Gamow shell-model (GSM) calculations show the importance of the continuum-coupling, which dramatically influences the excitation energy and ordering of low-lying states. Various exotic structures associated with cross-shell intruding configurations in $^{12}$Be and in its isotonic nucleus $^{11}$Li are comparably discussed.
△ Less
Submitted 3 March, 2021;
originally announced March 2021.
-
Study of $s$- and $d$-wave intruder strengths in $^{13}{\rm B}_{\rm g.s.}$ via a $p(^{13}{\rm B},d)^{12}{\rm B}$ reaction
Authors:
W. Liu,
J. L. Lou,
Y. L. Ye,
Z. H. Li,
Q. T. Li,
H. Hua,
X. F. Yang,
J. Y. Xu,
H. J. Ong,
D. T. Tran,
N. Aoi,
E. Ideguchi,
D. Y. Pang,
C. X. Yuan,
S. M. Wang,
Y. Jiang,
B. Yang,
Y. Liu,
J. G. Li,
Z. Q. Chen,
J. X. Han,
S. W. Bai,
G. Li,
K. Ma,
Z. W. Tan
, et al. (2 additional authors not shown)
Abstract:
Experimental results of the $p(^{13}{\rm B},d)^{12}{\rm B}$ transfer reaction to the low-lying states in $^{12}$B are reported. The optical potential parameters for the entrance channel are extracted from the elastic scattering $p$($^{13}{\rm B}$, $p$) measured in the same experiment, while those for the exit channel are global ones. Spectroscopic factors associated with the $p$-, $s$-, and $d$-wa…
▽ More
Experimental results of the $p(^{13}{\rm B},d)^{12}{\rm B}$ transfer reaction to the low-lying states in $^{12}$B are reported. The optical potential parameters for the entrance channel are extracted from the elastic scattering $p$($^{13}{\rm B}$, $p$) measured in the same experiment, while those for the exit channel are global ones. Spectroscopic factors associated with the $p$-, $s$-, and $d$-wave neutron transfer to the known $^{12}$B states, are extracted by comparing the deuteron angular distributions with the calculation results. The separated $s$- and $d$-wave intruder strengths in $^{13}{\rm B}_{\rm g.s.}$ were determined to be $10(2)\%$ and $6(1)\%$, respectively, which follow roughly the systematics for the $N$ = 8 neutron-rich isotones. The measured total intruder strength is in good agreement with the shell model calculation, while the individual ones evolve quite differently. Particularly, the sudden change of the $d$-wave intensity between $^{13}$B and $^{12}$Be needs further theoretical interpretation.
△ Less
Submitted 2 March, 2021;
originally announced March 2021.
-
Smeariness Begets Finite Sample Smeariness
Authors:
Do Tran,
Benjamin Eltzner,
Stephan Huckemann
Abstract:
Fréchet means are indispensable for nonparametric statistics on non-Euclidean spaces. For suitable random variables, in some sense, they "sense" topological and geometric structure. In particular, smeariness seems to indicate the presence of positive curvature. While smeariness may be considered more as an academical curiosity, occurring rarely, it has been recently demonstrated that finite sample…
▽ More
Fréchet means are indispensable for nonparametric statistics on non-Euclidean spaces. For suitable random variables, in some sense, they "sense" topological and geometric structure. In particular, smeariness seems to indicate the presence of positive curvature. While smeariness may be considered more as an academical curiosity, occurring rarely, it has been recently demonstrated that finite sample smeariness (FSS) occurs regularly on circles, tori and spheres and affects a large class of typical probability distributions. FSS can be well described by the modulation measuring the quotient of rescaled expected sample mean variance and population variance. Under FSS it is larger than one - that is its value on Euclidean spaces - and this makes quantile based tests using tangent space approximations inapplicable. We show here that near smeary probability distributions there are always FSS probability distributions and as a first step towards the conjecture that all compact spaces feature smeary distributions, we establish directional smeariness under curvature bounds.
△ Less
Submitted 28 February, 2021;
originally announced March 2021.
-
Dynamic curriculum learning via data parameters for noise robust keyword spotting
Authors:
Takuya Higuchi,
Shreyas Saxena,
Mehrez Souden,
Tien Dung Tran,
Masood Delfarah,
Chandra Dhir
Abstract:
We propose dynamic curriculum learning via data parameters for noise robust keyword spotting. Data parameter learning has recently been introduced for image processing, where weight parameters, so-called data parameters, for target classes and instances are introduced and optimized along with model parameters. The data parameters scale logits and control importance over classes and instances durin…
▽ More
We propose dynamic curriculum learning via data parameters for noise robust keyword spotting. Data parameter learning has recently been introduced for image processing, where weight parameters, so-called data parameters, for target classes and instances are introduced and optimized along with model parameters. The data parameters scale logits and control importance over classes and instances during training, which enables automatic curriculum learning without additional annotations for training data. Similarly, in this paper, we propose using this curriculum learning approach for acoustic modeling, and train an acoustic model on clean and noisy utterances with the data parameters. The proposed approach automatically learns the difficulty of the classes and instances, e.g. due to low speech to noise ratio (SNR), in the gradient descent optimization and performs curriculum learning. This curriculum learning leads to overall improvement of the accuracy of the acoustic model. We evaluate the effectiveness of the proposed approach on a keyword spotting task. Experimental results show 7.7% relative reduction in false reject ratio with the data parameters compared to a baseline model which is simply trained on the multiconditioned dataset.
△ Less
Submitted 18 February, 2021;
originally announced February 2021.
-
Riiid! Answer Correctness Prediction Kaggle Challenge: 4th Place Solution Summary
Authors:
Duc Kinh Le Tran
Abstract:
This paper presents my solution to the challenge "Riiid! Answer Correctness Prediction" on Kaggle hosted by Riiid Labs (2020), which scores 0.817 (AUC) and ranks 4th on the final private leaderboard. It is a single transformer-based model heavily inspired from previous works such as SAKT, SAINT and SAINT+. Novel ingredients that I believed to have made a difference are the time-aware attention mec…
▽ More
This paper presents my solution to the challenge "Riiid! Answer Correctness Prediction" on Kaggle hosted by Riiid Labs (2020), which scores 0.817 (AUC) and ranks 4th on the final private leaderboard. It is a single transformer-based model heavily inspired from previous works such as SAKT, SAINT and SAINT+. Novel ingredients that I believed to have made a difference are the time-aware attention mechanism, the concatenation of the embeddings of the input sequences and the embedding of continuous features.
△ Less
Submitted 3 February, 2021;
originally announced February 2021.
-
Generalized Damped Newton Algorithms in Nonsmooth Optimization via Second-Order Subdifferentials
Authors:
Pham Duy Khanh,
Boris Mordukhovich,
Vo Thanh Phat,
Dat Ba Tran
Abstract:
The paper proposes and develops new globally convergent algorithms of the generalized damped Newton type for solving important classes of nonsmooth optimization problems. These algorithms are based on the theory and calculations of second-order subdifferentials of nonsmooth functions with employing the machinery of second-order variational analysis and generalized differentiation. First we develop…
▽ More
The paper proposes and develops new globally convergent algorithms of the generalized damped Newton type for solving important classes of nonsmooth optimization problems. These algorithms are based on the theory and calculations of second-order subdifferentials of nonsmooth functions with employing the machinery of second-order variational analysis and generalized differentiation. First we develop a globally superlinearly convergent damped Newton-type algorithm for the class of continuously differentiable functions with Lipschitzian gradients, which are nonsmooth of second order. Then we design such a globally convergent algorithm to solve a structured class of nonsmooth quadratic composite problems with extended-real-valued cost functions, which typically arise in machine learning and statistics. Finally, we present the results of numerical experiments and compare the performance of our main algorithm applied to an important class of Lasso problems with those achieved by other first-order and second-order optimization algorithms.
△ Less
Submitted 18 January, 2022; v1 submitted 25 January, 2021;
originally announced January 2021.
-
Optical Flow Estimation via Motion Feature Recovery
Authors:
Yang Jiao,
Guangming Shi,
Trac D. Tran
Abstract:
Optical flow estimation with occlusion or large displacement is a problematic challenge due to the lost of corresponding pixels between consecutive frames. In this paper, we discover that the lost information is related to a large quantity of motion features (more than 40%) computed from the popular discriminative cost-volume feature would completely vanish due to invalid sampling, leading to the…
▽ More
Optical flow estimation with occlusion or large displacement is a problematic challenge due to the lost of corresponding pixels between consecutive frames. In this paper, we discover that the lost information is related to a large quantity of motion features (more than 40%) computed from the popular discriminative cost-volume feature would completely vanish due to invalid sampling, leading to the low efficiency of optical flow learning. We call this phenomenon the Vanishing Cost Volume Problem. Inspired by the fact that local motion tends to be highly consistent within a short temporal window, we propose a novel iterative Motion Feature Recovery (MFR) method to address the vanishing cost volume via modeling motion consistency across multiple frames. In each MFR iteration, invalid entries from original motion features are first determined based on the current flow. Then, an efficient network is designed to adaptively learn the motion correlation to recover invalid features for lost-information restoration. The final optical flow is then decoded from the recovered motion features. Experimental results on Sintel and KITTI show that our method achieves state-of-the-art performances. In fact, MFR currently ranks second on Sintel public website.
△ Less
Submitted 15 January, 2021;
originally announced January 2021.
-
VinDr-CXR: An open dataset of chest X-rays with radiologist's annotations
Authors:
Ha Q. Nguyen,
Khanh Lam,
Linh T. Le,
Hieu H. Pham,
Dat Q. Tran,
Dung B. Nguyen,
Dung D. Le,
Chi M. Pham,
Hang T. T. Tong,
Diep H. Dinh,
Cuong D. Do,
Luu T. Doan,
Cuong N. Nguyen,
Binh T. Nguyen,
Que V. Nguyen,
Au D. Hoang,
Hien N. Phan,
Anh T. Nguyen,
Phuong H. Ho,
Dat T. Ngo,
Nghia T. Nguyen,
Nhan T. Nguyen,
Minh Dao,
Van Vu
Abstract:
Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam…
▽ More
Most of the existing chest X-ray datasets include labels from a list of findings without specifying their locations on the radiographs. This limits the development of machine learning algorithms for the detection and localization of chest abnormalities. In this work, we describe a dataset of more than 100,000 chest X-ray scans that were retrospectively collected from two major hospitals in Vietnam. Out of this raw data, we release 18,000 images that were manually annotated by a total of 17 experienced radiologists with 22 local labels of rectangles surrounding abnormalities and 6 global labels of suspected diseases. The released dataset is divided into a training set of 15,000 and a test set of 3,000. Each scan in the training set was independently labeled by 3 radiologists, while each scan in the test set was labeled by the consensus of 5 radiologists. We designed and built a labeling platform for DICOM images to facilitate these annotation procedures. All images are made publicly available (https://www.physionet.org/content/vindr-cxr/1.0.0/) in DICOM format along with the labels of both the training set and the test set.
△ Less
Submitted 20 March, 2022; v1 submitted 29 December, 2020;
originally announced December 2020.
-
FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation
Authors:
Tarun Kalluri,
Deepak Pathak,
Manmohan Chandraker,
Du Tran
Abstract:
A majority of methods for video frame interpolation compute bidirectional optical flow between adjacent frames of a video, followed by a suitable war** algorithm to generate the output frames. However, approaches relying on optical flow often fail to model occlusions and complex non-linear motions directly from the video and introduce additional bottlenecks unsuitable for widespread deployment.…
▽ More
A majority of methods for video frame interpolation compute bidirectional optical flow between adjacent frames of a video, followed by a suitable war** algorithm to generate the output frames. However, approaches relying on optical flow often fail to model occlusions and complex non-linear motions directly from the video and introduce additional bottlenecks unsuitable for widespread deployment. We address these limitations with FLAVR, a flexible and efficient architecture that uses 3D space-time convolutions to enable end-to-end learning and inference for video frame interpolation. Our method efficiently learns to reason about non-linear motions, complex occlusions and temporal abstractions, resulting in improved performance on video interpolation, while requiring no additional inputs in the form of optical flow or depth maps. Due to its simplicity, FLAVR can deliver 3x faster inference speed compared to the current most accurate method on multi-frame interpolation without losing interpolation accuracy. In addition, we evaluate FLAVR on a wide range of challenging settings and consistently demonstrate superior qualitative and quantitative results compared with prior methods on various popular benchmarks including Vimeo-90K, UCF101, DAVIS, Adobe, and GoPro. Finally, we demonstrate that FLAVR for video frame interpolation can serve as a useful self-supervised pretext task for action recognition, optical flow estimation, and motion magnification.
△ Less
Submitted 23 February, 2022; v1 submitted 15 December, 2020;
originally announced December 2020.
-
Levy noise-induced self-induced stochastic resonance in a memristive neuron
Authors:
Marius E. Yamakou,
Tat Dat Tran
Abstract:
Self-induced stochastic resonance (SISR) is a subtle resonance mechanism requiring a nontrivial scaling limit between the stochastic and the deterministic timescales of an excitable system, leading to the emergence of a limit cycle behavior which is absent without noise. All previous studies on SISR in neural systems have only considered the idealized Gaussian white noise. Moreover, these studies…
▽ More
Self-induced stochastic resonance (SISR) is a subtle resonance mechanism requiring a nontrivial scaling limit between the stochastic and the deterministic timescales of an excitable system, leading to the emergence of a limit cycle behavior which is absent without noise. All previous studies on SISR in neural systems have only considered the idealized Gaussian white noise. Moreover, these studies have ignored one electrophysiological aspect of the nerve cell: its memristive properties. In this paper, first, we show that in the excitable regime, the asymptotic matching of the Levy timescale (that follows a power law, unlike Gaussian noise that follows Kramers law) and the deterministic timescale (controlled by the singular parameter) can also induce a strong SISR. In addition, it is shown that the degree of SISR induced by Levy noise is not always higher than that of Gaussian noise. Second, we show that, for both types of noises, the two memristive properties of the neuron have opposite effects on the degree of SISR: the stronger the feedback gain parameter that controls the modulation of the membrane potential with the magnetic flux and the weaker the feedback gain parameter that controls the saturation of the magnetic flux, the higher the degree of SISR. Finally, we show that, for both types of noises, the degree of SISR in the memristive neuron is always higher than in the non-memristive neuron. Our results could find applications in designing neuromorphic circuits operating in noisy regimes.
△ Less
Submitted 23 April, 2021; v1 submitted 5 December, 2020;
originally announced December 2020.
-
The 10 Research Topics in the Internet of Things
Authors:
Wei Emma Zhang,
Quan Z. Sheng,
Adnan Mahmood,
Dai Hoang Tran,
Munazza Zaib,
Salma Abdalla Hamad,
Abdulwahab Aljubairy,
Ahoud Abdulrahmn F. Alhazmi,
Subhash Sagar,
Congbo Ma
Abstract:
Since the term first coined in 1999 by Kevin Ashton, the Internet of Things (IoT) has gained significant momentum as a technology to connect physical objects to the Internet and to facilitate machine-to-human and machine-to-machine communications. Over the past two decades, IoT has been an active area of research and development endeavours by many technical and commercial communities. Yet, IoT tec…
▽ More
Since the term first coined in 1999 by Kevin Ashton, the Internet of Things (IoT) has gained significant momentum as a technology to connect physical objects to the Internet and to facilitate machine-to-human and machine-to-machine communications. Over the past two decades, IoT has been an active area of research and development endeavours by many technical and commercial communities. Yet, IoT technology is still not mature and many issues need to be addressed. In this paper, we identify 10 key research topics and discuss the research problems and opportunities within these topics.
△ Less
Submitted 2 December, 2020;
originally announced December 2020.
-
Analyzing the Machine Learning Conference Review Process
Authors:
David Tran,
Alex Valtchanov,
Keshav Ganapathy,
Raymond Feng,
Eric Slud,
Micah Goldblum,
Tom Goldstein
Abstract:
Mainstream machine learning conferences have seen a dramatic increase in the number of participants, along with a growing range of perspectives, in recent years. Members of the machine learning community are likely to overhear allegations ranging from randomness of acceptance decisions to institutional bias. In this work, we critically analyze the review process through a comprehensive study of pa…
▽ More
Mainstream machine learning conferences have seen a dramatic increase in the number of participants, along with a growing range of perspectives, in recent years. Members of the machine learning community are likely to overhear allegations ranging from randomness of acceptance decisions to institutional bias. In this work, we critically analyze the review process through a comprehensive study of papers submitted to ICLR between 2017 and 2020. We quantify reproducibility/randomness in review scores and acceptance decisions, and examine whether scores correlate with paper impact. Our findings suggest strong institutional bias in accept/reject decisions, even after controlling for paper quality. Furthermore, we find evidence for a gender gap, with female authors receiving lower scores, lower acceptance rates, and fewer citations per paper than their male counterparts. We conclude our work with recommendations for future conference organizers.
△ Less
Submitted 25 November, 2020; v1 submitted 24 November, 2020;
originally announced November 2020.
-
2D+3D Facial Expression Recognition via Discriminative Dynamic Range Enhancement and Multi-Scale Learning
Authors:
Yang Jiao,
Yi Niu,
Trac D. Tran,
Guangming Shi
Abstract:
In 2D+3D facial expression recognition (FER), existing methods generate multi-view geometry maps to enhance the depth feature representation. However, this may introduce false estimations due to local plane fitting from incomplete point clouds. In this paper, we propose a novel Map Generation technique from the viewpoint of information theory, to boost the slight 3D expression differences from str…
▽ More
In 2D+3D facial expression recognition (FER), existing methods generate multi-view geometry maps to enhance the depth feature representation. However, this may introduce false estimations due to local plane fitting from incomplete point clouds. In this paper, we propose a novel Map Generation technique from the viewpoint of information theory, to boost the slight 3D expression differences from strong personality variations. First, we examine the HDR depth data to extract the discriminative dynamic range $r_{dis}$, and maximize the entropy of $r_{dis}$ to a global optimum. Then, to prevent the large deformation caused by over-enhancement, we introduce a depth distortion constraint and reduce the complexity from $O(KN^2)$ to $O(KNτ)$. Furthermore, the constrained optimization is modeled as a $K$-edges maximum weight path problem in a directed acyclic graph, and we solve it efficiently via dynamic programming. Finally, we also design an efficient Facial Attention structure to automatically locate subtle discriminative facial parts for multi-scale learning, and train it with a proposed loss function $\mathcal{L}_{FA}$ without any facial landmarks. Experimental results on different datasets show that the proposed method is effective and outperforms the state-of-the-art 2D+3D FER methods in both FER accuracy and the output entropy of the generated maps.
△ Less
Submitted 16 November, 2020;
originally announced November 2020.
-
EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation
Authors:
Yang Jiao,
Trac D. Tran,
Guangming Shi
Abstract:
This paper addresses the challenging unsupervised scene flow estimation problem by jointly learning four low-level vision sub-tasks: optical flow $\textbf{F}$, stereo-depth $\textbf{D}$, camera pose $\textbf{P}$ and motion segmentation $\textbf{S}$. Our key insight is that the rigidity of the scene shares the same inherent geometrical structure with object movements and scene depth. Hence, rigidit…
▽ More
This paper addresses the challenging unsupervised scene flow estimation problem by jointly learning four low-level vision sub-tasks: optical flow $\textbf{F}$, stereo-depth $\textbf{D}$, camera pose $\textbf{P}$ and motion segmentation $\textbf{S}$. Our key insight is that the rigidity of the scene shares the same inherent geometrical structure with object movements and scene depth. Hence, rigidity from $\textbf{S}$ can be inferred by jointly coupling $\textbf{F}$, $\textbf{D}$ and $\textbf{P}$ to achieve more robust estimation. To this end, we propose a novel scene flow framework named EffiScene with efficient joint rigidity learning, going beyond the existing pipeline with independent auxiliary structures. In EffiScene, we first estimate optical flow and depth at the coarse level and then compute camera pose by Perspective-$n$-Points method. To jointly learn local rigidity, we design a novel Rigidity From Motion (RfM) layer with three principal components: \emph{}{(i)} correlation extraction; \emph{}{(ii)} boundary learning; and \emph{}{(iii)} outlier exclusion. Final outputs are fused based on the rigid map $M_R$ from RfM at finer levels. To efficiently train EffiScene, two new losses $\mathcal{L}_{bnd}$ and $\mathcal{L}_{unc}$ are designed to prevent trivial solutions and to regularize the flow boundary discontinuity. Extensive experiments on scene flow benchmark KITTI show that our method is effective and significantly improves the state-of-the-art approaches for all sub-tasks, i.e. optical flow ($5.19 \rightarrow 4.20$), depth estimation ($3.78 \rightarrow 3.46$), visual odometry ($0.012 \rightarrow 0.011$) and motion segmentation ($0.57 \rightarrow 0.62$).
△ Less
Submitted 14 May, 2021; v1 submitted 16 November, 2020;
originally announced November 2020.
-
Computing Crisp Bisimulations for Fuzzy Structures
Authors:
Linh Anh Nguyen,
Dat Xuan Tran
Abstract:
Fuzzy structures such as fuzzy automata, fuzzy transition systems, weighted social networks and fuzzy interpretations in fuzzy description logics have been widely studied. For such structures, bisimulation is a natural notion for characterizing indiscernibility between states or individuals. There are two kinds of bisimulations for fuzzy structures: crisp bisimulations and fuzzy bisimulations. Whi…
▽ More
Fuzzy structures such as fuzzy automata, fuzzy transition systems, weighted social networks and fuzzy interpretations in fuzzy description logics have been widely studied. For such structures, bisimulation is a natural notion for characterizing indiscernibility between states or individuals. There are two kinds of bisimulations for fuzzy structures: crisp bisimulations and fuzzy bisimulations. While the latter fits to the fuzzy paradigm, the former has also attracted attention due to the application of crisp equivalence relations, for example, in minimizing structures. Bisimulations can be formulated for fuzzy labeled graphs and then adapted to other fuzzy structures. In this article, we present an efficient algorithm for computing the partition corresponding to the largest crisp bisimulation of a given finite fuzzy labeled graph. Its complexity is of order $O((m\log{l} + n)\log{n})$, where $n$, $m$ and $l$ are the number of vertices, the number of nonzero edges and the number of different fuzzy degrees of edges of the input graph, respectively. We also study a similar problem for the setting with counting successors, which corresponds to the case with qualified number restrictions in description logics and graded modalities in modal logics. In particular, we provide an efficient algorithm with the complexity $O((m\log{m} + n)\log{n})$ for the considered problem in that setting.
△ Less
Submitted 1 June, 2023; v1 submitted 27 October, 2020;
originally announced October 2020.
-
Carrier Multiplication via Photocurrent Measurements in Dual-Gated MoTe_2
Authors:
Jun Suk Kim,
Minh Dao Tran,
Sung-Tae Kim,
Daehan Yoo,
Sang-Hyun Oh,
Ji-Hee Kim,
Young Hee Lee
Abstract:
Although van der Waals layered transition metal dichalcogenides from transient absorption spectroscopy have successfully demonstrated an ideal carrier multiplication (CM) performance with an onset of nearly 2Eg,interpretation of the CM effect from the optical approach remains unresolved owing to the complexity of many-body electron-hole pairs. We demonstrate the CM effect through simple photocurre…
▽ More
Although van der Waals layered transition metal dichalcogenides from transient absorption spectroscopy have successfully demonstrated an ideal carrier multiplication (CM) performance with an onset of nearly 2Eg,interpretation of the CM effect from the optical approach remains unresolved owing to the complexity of many-body electron-hole pairs. We demonstrate the CM effect through simple photocurrent measurements by fabricating the dual-gate P-N junction of a MoTe2 film on a transparent substrate. Electrons and holes were efficiently extracted by eliminating the Schottky barriers in the metal contact and minimizing multiple reflections. The photocurrent was elevated proportionately to the excitation energy. The boosted quantum efficiency confirms the multiple electron-hole pair generation of >2Eg, consistent with CM results from an optical approach, pushing the solar cell efficiency beyond the Shockley-Queisser limit.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
Bandgap renormalization in monolayer MoS_2 on CsPbBr_3 quantum dot via charge transfer at room temperature
Authors:
Subash Adhikari,
Ji-Hee Kim,
Bumsub Song,
Manh-Ha Doan,
Minh Dao Tran,
Leyre Gomez,
Hyun Kim,
Hamza Zad Gul,
Ganesh Ghimire,
Seok Joon Yun,
Tom Gregorkiewicz,
Young Hee Lee
Abstract:
Many-body effect and strong Coulomb interaction in monolayer transition metal dichalcogenides lead to shrink the intrinsic bandgap, originating from the renormalization of electrical/optical bandgap, exciton binding energy, and spin-orbit splitting. This renormalization phenomenon has been commonly observed at low temperature and requires high photon excitation density. Here, we present the augmen…
▽ More
Many-body effect and strong Coulomb interaction in monolayer transition metal dichalcogenides lead to shrink the intrinsic bandgap, originating from the renormalization of electrical/optical bandgap, exciton binding energy, and spin-orbit splitting. This renormalization phenomenon has been commonly observed at low temperature and requires high photon excitation density. Here, we present the augmented bandgap renormalization in monolayer MoS_2 anchored on CsPbBr_3 perovskite quantum dots at room temperature via charge transfer. The amount of electrons significantly transferred from perovskite gives rise to the large plasma screening in MoS_2. The bandgap in heterostructure is red-shifted by 84 meV with minimal pump fluence, the highest bandgap renormalization in monolayer MoS_2 at room temperature, which saturates with further increase of pump fluence. We further find that the magnitude of bandgap renormalization inversely relates to Thomas-Fermi screening length. This provides plenty of room to explore the bandgap renormalization within existing vast libraries of large bandgap van der Waals heterostructure towards practical devices such as solar cells, photodetectors and light-emitting-diodes.
△ Less
Submitted 26 October, 2020;
originally announced October 2020.
-
Combining Ensembles and Data Augmentation can Harm your Calibration
Authors:
Yeming Wen,
Ghassen Jerfel,
Rafael Muller,
Michael W. Dusenberry,
Jasper Snoek,
Balaji Lakshminarayanan,
Dustin Tran
Abstract:
Ensemble methods which average over multiple neural network predictions are a simple approach to improve a model's calibration and robustness. Similarly, data augmentation techniques, which encode prior information in the form of invariant feature transformations, are effective for improving calibration and robustness. In this paper, we show a surprising pathology: combining ensembles and data aug…
▽ More
Ensemble methods which average over multiple neural network predictions are a simple approach to improve a model's calibration and robustness. Similarly, data augmentation techniques, which encode prior information in the form of invariant feature transformations, are effective for improving calibration and robustness. In this paper, we show a surprising pathology: combining ensembles and data augmentation can harm model calibration. This leads to a trade-off in practice, whereby improved accuracy by combining the two techniques comes at the expense of calibration. On the other hand, selecting only one of the techniques ensures good uncertainty estimates at the expense of accuracy. We investigate this pathology and identify a compounding under-confidence among methods which marginalize over sets of weights and data augmentation techniques which soften labels. Finally, we propose a simple correction, achieving the best of both worlds with significant accuracy and calibration gains over using only ensembles or data augmentation individually. Applying the correction produces new state-of-the art in uncertainty calibration across CIFAR-10, CIFAR-100, and ImageNet.
△ Less
Submitted 22 March, 2021; v1 submitted 19 October, 2020;
originally announced October 2020.
-
Training independent subnetworks for robust prediction
Authors:
Marton Havasi,
Rodolphe Jenatton,
Stanislav Fort,
Jeremiah Zhe Liu,
Jasper Snoek,
Balaji Lakshminarayanan,
Andrew M. Dai,
Dustin Tran
Abstract:
Recent approaches to efficiently ensemble neural networks have shown that strong robustness and uncertainty performance can be achieved with a negligible gain in parameters over the original network. However, these methods still require multiple forward passes for prediction, leading to a significant computational cost. In this work, we show a surprising result: the benefits of using multiple pred…
▽ More
Recent approaches to efficiently ensemble neural networks have shown that strong robustness and uncertainty performance can be achieved with a negligible gain in parameters over the original network. However, these methods still require multiple forward passes for prediction, leading to a significant computational cost. In this work, we show a surprising result: the benefits of using multiple predictions can be achieved `for free' under a single model's forward pass. In particular, we show that, using a multi-input multi-output (MIMO) configuration, one can utilize a single model's capacity to train multiple subnetworks that independently learn the task at hand. By ensembling the predictions made by the subnetworks, we improve model robustness without increasing compute. We observe a significant improvement in negative log-likelihood, accuracy, and calibration error on CIFAR10, CIFAR100, ImageNet, and their out-of-distribution variants compared to previous methods.
△ Less
Submitted 4 August, 2021; v1 submitted 13 October, 2020;
originally announced October 2020.
-
Deep learning for detection and segmentation of artefact and disease instances in gastrointestinal endoscopy
Authors:
Sharib Ali,
Mariia Dmitrieva,
Noha Ghatwary,
Sophia Bano,
Gorkem Polat,
Alptekin Temizel,
Adrian Krenzer,
Amar Hekalo,
Yun Bo Guo,
Bogdan Matuszewski,
Mourad Gridach,
Irina Voiculescu,
Vishnusai Yoganand,
Arnav Chavan,
Aryan Raj,
Nhan T. Nguyen,
Dat Q. Tran,
Le Duy Huynh,
Nicolas Boutry,
Shahadate Rezvy,
Haijian Chen,
Yoon Ho Choi,
Anand Subramanian,
Velmurugan Balasubramanian,
Xiaohong W. Gao
, et al. (12 additional authors not shown)
Abstract:
The Endoscopy Computer Vision Challenge (EndoCV) is a crowd-sourcing initiative to address eminent problems in develo** reliable computer aided detection and diagnosis endoscopy systems and suggest a pathway for clinical translation of technologies. Whilst endoscopy is a widely used diagnostic and treatment tool for hollow-organs, there are several core challenges often faced by endoscopists, ma…
▽ More
The Endoscopy Computer Vision Challenge (EndoCV) is a crowd-sourcing initiative to address eminent problems in develo** reliable computer aided detection and diagnosis endoscopy systems and suggest a pathway for clinical translation of technologies. Whilst endoscopy is a widely used diagnostic and treatment tool for hollow-organs, there are several core challenges often faced by endoscopists, mainly: 1) presence of multi-class artefacts that hinder their visual interpretation, and 2) difficulty in identifying subtle precancerous precursors and cancer abnormalities. Artefacts often affect the robustness of deep learning methods applied to the gastrointestinal tract organs as they can be confused with tissue of interest. EndoCV2020 challenges are designed to address research questions in these remits. In this paper, we present a summary of methods developed by the top 17 teams and provide an objective comparison of state-of-the-art methods and methods designed by the participants for two sub-challenges: i) artefact detection and segmentation (EAD2020), and ii) disease detection and segmentation (EDD2020). Multi-center, multi-organ, multi-class, and multi-modal clinical endoscopy datasets were compiled for both EAD2020 and EDD2020 sub-challenges. The out-of-sample generalization ability of detection algorithms was also evaluated. Whilst most teams focused on accuracy improvements, only a few methods hold credibility for clinical usability. The best performing teams provided solutions to tackle class imbalance, and variabilities in size, origin, modality and occurrences by exploring data augmentation, data fusion, and optimal class thresholding techniques.
△ Less
Submitted 17 February, 2021; v1 submitted 12 October, 2020;
originally announced October 2020.
-
Infinite AC Ladder with a "Twist"
Authors:
Quan M. Nguyen,
Linh K. Nguyen,
Tung X. Tran,
Chinh D. Tran,
Truong H. Cai,
Trung Phan
Abstract:
The infinite AC ladder network can exhibit unexpected behavior. Entangling the topology brings even more surprises, found by direct numerical investigation. We consider a simple modification of the ladder topology and explain the numerical result for the complex impedance, using linear algebra. The infinity limit of the network's size corresponds to kee** only the eigenvectors of the transmissio…
▽ More
The infinite AC ladder network can exhibit unexpected behavior. Entangling the topology brings even more surprises, found by direct numerical investigation. We consider a simple modification of the ladder topology and explain the numerical result for the complex impedance, using linear algebra. The infinity limit of the network's size corresponds to kee** only the eigenvectors of the transmission matrix with the largest eigenvalues, which can be viewed as the most dominant modes of electrical information that propagate through the network.
△ Less
Submitted 13 October, 2020; v1 submitted 12 October, 2020;
originally announced October 2020.
-
An Open Review of OpenReview: A Critical Analysis of the Machine Learning Conference Review Process
Authors:
David Tran,
Alex Valtchanov,
Keshav Ganapathy,
Raymond Feng,
Eric Slud,
Micah Goldblum,
Tom Goldstein
Abstract:
Mainstream machine learning conferences have seen a dramatic increase in the number of participants, along with a growing range of perspectives, in recent years. Members of the machine learning community are likely to overhear allegations ranging from randomness of acceptance decisions to institutional bias. In this work, we critically analyze the review process through a comprehensive study of pa…
▽ More
Mainstream machine learning conferences have seen a dramatic increase in the number of participants, along with a growing range of perspectives, in recent years. Members of the machine learning community are likely to overhear allegations ranging from randomness of acceptance decisions to institutional bias. In this work, we critically analyze the review process through a comprehensive study of papers submitted to ICLR between 2017 and 2020. We quantify reproducibility/randomness in review scores and acceptance decisions, and examine whether scores correlate with paper impact. Our findings suggest strong institutional bias in accept/reject decisions, even after controlling for paper quality. Furthermore, we find evidence for a gender gap, with female authors receiving lower scores, lower acceptance rates, and fewer citations per paper than their male counterparts. We conclude our work with recommendations for future conference organizers.
△ Less
Submitted 26 October, 2020; v1 submitted 10 October, 2020;
originally announced October 2020.
-
One-sided Shewhart control charts for monitoring the ratio of two normal variables in Short Production Runs
Authors:
K. D. Tran,
Q. U. A Khaliq,
A. A. Nadi,
H Tran,
K. P. Tran
Abstract:
Monitoring the ratio of two normal random variables plays an important role in several manufacturing environments. For short production runs, however, the control charts assumed infinite processes cannot function effectively to detect anomalies. In this paper, we tackle this problem by proposing two one-sided Shewhart-type charts to monitor the ratio of two normal random variables for an infinite…
▽ More
Monitoring the ratio of two normal random variables plays an important role in several manufacturing environments. For short production runs, however, the control charts assumed infinite processes cannot function effectively to detect anomalies. In this paper, we tackle this problem by proposing two one-sided Shewhart-type charts to monitor the ratio of two normal random variables for an infinite horizon production. The statistical performance of the proposed charts is investigated using the truncated average run length as a performance measure in short production runs. In order to help the quality practitioner to implement these control charts, we have provided ready-to-use tables of the control limit parameters. An illustrative example from the food industry is given for illustration.
△ Less
Submitted 3 October, 2020;
originally announced October 2020.
-
Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering
Authors:
Tuong Do,
Binh X. Nguyen,
Huy Tran,
Erman Tjiputra,
Quang D. Tran,
Thanh-Toan Do
Abstract:
Different approaches have been proposed to Visual Question Answering (VQA). However, few works are aware of the behaviors of varying joint modality methods over question type prior knowledge extracted from data in constraining answer search space, of which information gives a reliable cue to reason about answers for questions asked in input images. In this paper, we propose a novel VQA model that…
▽ More
Different approaches have been proposed to Visual Question Answering (VQA). However, few works are aware of the behaviors of varying joint modality methods over question type prior knowledge extracted from data in constraining answer search space, of which information gives a reliable cue to reason about answers for questions asked in input images. In this paper, we propose a novel VQA model that utilizes the question-type prior information to improve VQA by leveraging the multiple interactions between different joint modality methods based on their behaviors in answering questions from different types. The solid experiments on two benchmark datasets, i.e., VQA 2.0 and TDIUC, indicate that the proposed method yields the best performance with the most competitive approaches.
△ Less
Submitted 23 September, 2020;
originally announced September 2020.
-
A Differential Game Approach for Beyond Visual Range Tactics
Authors:
Eloy Garcia,
David W. Casbeer,
Dzung Tran,
Meir Pachter
Abstract:
An operational relevant conflict between teams of autonomous vehicles in the Beyond Visual Range domain is addressed in this paper. Optimal strategies are designed in order for a team of air interceptors to protect a high value asset and block the attacking team at a safe distance from such asset. The attacking agents take specific roles of leader and wingman and also devise their own optimal stra…
▽ More
An operational relevant conflict between teams of autonomous vehicles in the Beyond Visual Range domain is addressed in this paper. Optimal strategies are designed in order for a team of air interceptors to protect a high value asset and block the attacking team at a safe distance from such asset. The attacking agents take specific roles of leader and wingman and also devise their own optimal strategies in order to launch an attack as close as possible from the asset. The problem is formulated as a zero-sum differential game between players with different speed over two stages: the attack and the retreat stages. For each stage the state-feedback optimal strategies of each player are derived in analytical form.
△ Less
Submitted 22 September, 2020;
originally announced September 2020.
-
Performance Indicator in Multilinear Compressive Learning
Authors:
Dat Thanh Tran,
Moncef Gabbouj,
Alexandros Iosifidis
Abstract:
Recently, the Multilinear Compressive Learning (MCL) framework was proposed to efficiently optimize the sensing and learning steps when working with multidimensional signals, i.e. tensors. In Compressive Learning in general, and in MCL in particular, the number of compressed measurements captured by a compressive sensing device characterizes the storage requirement or the bandwidth requirement for…
▽ More
Recently, the Multilinear Compressive Learning (MCL) framework was proposed to efficiently optimize the sensing and learning steps when working with multidimensional signals, i.e. tensors. In Compressive Learning in general, and in MCL in particular, the number of compressed measurements captured by a compressive sensing device characterizes the storage requirement or the bandwidth requirement for transmission. This number, however, does not completely characterize the learning performance of a MCL system. In this paper, we analyze the relationship between the input signal resolution, the number of compressed measurements and the learning performance of MCL. Our empirical analysis shows that the reconstruction error obtained at the initialization step of MCL strongly correlates with the learning performance, thus can act as a good indicator to efficiently characterize learning performances obtained from different sensor configurations without optimizing the entire system.
△ Less
Submitted 22 September, 2020;
originally announced September 2020.
-
Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding
Authors:
Binh X. Nguyen,
Binh D. Nguyen,
Gustavo Carneiro,
Erman Tjiputra,
Quang D. Tran,
Thanh-Toan Do
Abstract:
Unsupervised Deep Distance Metric Learning (UDML) aims to learn sample similarities in the embedding space from an unlabeled dataset. Traditional UDML methods usually use the triplet loss or pairwise loss which requires the mining of positive and negative samples w.r.t. anchor data points. This is, however, challenging in an unsupervised setting as the label information is not available. In this p…
▽ More
Unsupervised Deep Distance Metric Learning (UDML) aims to learn sample similarities in the embedding space from an unlabeled dataset. Traditional UDML methods usually use the triplet loss or pairwise loss which requires the mining of positive and negative samples w.r.t. anchor data points. This is, however, challenging in an unsupervised setting as the label information is not available. In this paper, we propose a new UDML method that overcomes that challenge. In particular, we propose to use a deep clustering loss to learn centroids, i.e., pseudo labels, that represent semantic classes. During learning, these centroids are also used to reconstruct the input samples. It hence ensures the representativeness of centroids - each centroid represents visually similar samples. Therefore, the centroids give information about positive (visually similar) and negative (visually dissimilar) samples. Based on pseudo labels, we propose a novel unsupervised metric loss which enforces the positive concentration and negative separation of samples in the embedding space. Experimental results on benchmarking datasets show that the proposed approach outperforms other UDML methods.
△ Less
Submitted 9 September, 2020;
originally announced September 2020.
-
Differentiation of measures on complete Riemannian manifolds
Authors:
Jürgen Jost,
Hông Vân Lê,
Tat Dat Tran
Abstract:
In this note we give a new proof of a version of the Besicovitch covering theorem, given in \cite{EG1992}, \cite{Bogachev2007} and extended in \cite{Federer1969}, for locally finite Borel measures on finite dimensional complete Riemannian manifolds $(M,g)$. As a consequence, we prove a differentiation theorem for Borel measures on $(M,g)$, which gives a formula for the Radon-Nikodym density of two…
▽ More
In this note we give a new proof of a version of the Besicovitch covering theorem, given in \cite{EG1992}, \cite{Bogachev2007} and extended in \cite{Federer1969}, for locally finite Borel measures on finite dimensional complete Riemannian manifolds $(M,g)$. As a consequence, we prove a differentiation theorem for Borel measures on $(M,g)$, which gives a formula for the Radon-Nikodym density of two nonnegative locally finite Borel measures $ν_1, ν_2$ on $(M, g)$ such that $ν_1 \ll ν_2$, extending the known case when $(M, g)$ is a standard Euclidean space.
△ Less
Submitted 30 August, 2020;
originally announced August 2020.
-
Short-Packet Communications for MIMO NOMA Systems over Nakagami-m Fading: BLER and Minimum Blocklength Analysis
Authors:
Duc-Dung Tran,
Shree Krishna Sharma,
Symeon Chatzinotas,
Isaac Woungang,
Björn Ottersten
Abstract:
Recently, ultra-reliable and low-latency communications (URLLC) using short-packets has been proposed to fulfill the stringent requirements regarding reliability and latency of emerging applications in 5G and beyond networks. In addition, multiple-input multiple-output non-orthogonal multiple access (MIMO NOMA) is a potential candidate to improve the spectral efficiency, reliability, latency, and…
▽ More
Recently, ultra-reliable and low-latency communications (URLLC) using short-packets has been proposed to fulfill the stringent requirements regarding reliability and latency of emerging applications in 5G and beyond networks. In addition, multiple-input multiple-output non-orthogonal multiple access (MIMO NOMA) is a potential candidate to improve the spectral efficiency, reliability, latency, and connectivity of wireless systems. In this paper, we investigate short-packet communications (SPC) in a multiuser downlink MIMO NOMA system over Nakagami-m fading, and propose two antenna-user selection methods considering two clusters of users having different priority levels. In contrast to the widely-used long data-packet assumption, the SPC analysis requires the redesign of the communication protocols and novel performance metrics. Given this context, we analyze the SPC performance of MIMO NOMA systems using the average block error rate (BLER) and minimum blocklength, instead of the conventional metrics such as ergodic capacity and outage capacity. More specifically, to characterize the system performance regarding SPC, asymptotic (in the high signal-to-noise ratio regime) and approximate closed-form expressions of the average BLER at the users are derived. Based on the asymptotic behavior of the average BLER, an analysis of the diversity order, minimum blocklength, and optimal power allocation is carried out. The achieved results show that MIMO NOMA can serve multiple users simultaneously using a smaller blocklength compared with MIMO OMA, thus demonstrating the benefits of MIMO NOMA for SPC in minimizing the transmission latency. Furthermore, our results indicate that the proposed methods not only improve the BLER performance but also guarantee full diversity gains for the respective users.
△ Less
Submitted 24 August, 2020;
originally announced August 2020.
-
Gauge Functions in Classical Mechanics: From Undriven to Driven Dynamical Systems
Authors:
Z. E. Musielak,
L. C. Vestal,
B. D. Tran,
T. B. Watson
Abstract:
Novel gauge functions are introduced to non-relativistic classical mechanics and used to define forces. The obtained results show that the gauge functions directly affect the energy function and that they allow converting an undriven physical system into a driven one. This is a novel phenomenon in dynamics that resembles the role of gauges in quantum field theories.
Novel gauge functions are introduced to non-relativistic classical mechanics and used to define forces. The obtained results show that the gauge functions directly affect the energy function and that they allow converting an undriven physical system into a driven one. This is a novel phenomenon in dynamics that resembles the role of gauges in quantum field theories.
△ Less
Submitted 9 September, 2020; v1 submitted 12 August, 2020;
originally announced August 2020.
-
Optimal Path Homotopy For Univariate Polynomials
Authors:
Bao Duy Tran
Abstract:
The goal of this paper is to study the path-following method for univariate polynomials. We propose to study the complexity and condition properties when the Newton method is applied as a correction operator. Then we study the geodesics and properties of the condition metric along those curves. Last, we compute approximations of geodesics and study how the condition number varies with the quality…
▽ More
The goal of this paper is to study the path-following method for univariate polynomials. We propose to study the complexity and condition properties when the Newton method is applied as a correction operator. Then we study the geodesics and properties of the condition metric along those curves. Last, we compute approximations of geodesics and study how the condition number varies with the quality of the approximation.
△ Less
Submitted 9 August, 2022; v1 submitted 4 August, 2020;
originally announced August 2020.
-
UAV Relay-Assisted Emergency Communications in IoT Networks: Resource Allocation and Trajectory Optimization
Authors:
Dinh-Hieu Tran,
Van-Dinh Nguyen,
Sumit Gautam,
Symeon Chatzinotas,
Thang X. Vu,
Bjorn Ottersten
Abstract:
Unmanned aerial vehicle (UAV) communication has emerged as a prominent technology for emergency communications (e.g., natural disaster) in the Internet of Things (IoT) networks to enhance the ability of disaster prediction, damage assessment, and rescue operations promptly. A UAV can be deployed as a flying base station (BS) to collect data from time-constrained IoT devices and then transfer it to…
▽ More
Unmanned aerial vehicle (UAV) communication has emerged as a prominent technology for emergency communications (e.g., natural disaster) in the Internet of Things (IoT) networks to enhance the ability of disaster prediction, damage assessment, and rescue operations promptly. A UAV can be deployed as a flying base station (BS) to collect data from time-constrained IoT devices and then transfer it to a ground gateway (GW). In general, the latency constraint at IoT devices and UAV's limited storage capacity highly hinder practical applications of UAV-assisted IoT networks. In this paper, {full-duplex (FD) radio} is adopted at the UAV to overcome these challenges. In addition, half-duplex (HD) scheme for UAV-based relaying is also considered to provide a comparative study between two modes (viz., FD and HD). {Herein, a device is considered to be successfully served iff its data is collected by the UAV and conveyed to GW timely during flight time}. In this context, we aim to maximize the number of served IoT devices by jointly optimizing bandwidth, power allocation, and the UAV trajectory while satisfying each device's requirement and the UAV's limited storage capacity. The formulated optimization problem is troublesome to solve due to its non-convexity and combinatorial nature. {Towards appealing applications, we first relax binary variables into continuous ones and transform the original problem into a more computationally tractable form.} By leveraging inner approximation framework, we derive newly approximated functions for non-convex parts and then develop a simple yet efficient iterative algorithm for its solutions. Next, we attempt to maximize the total throughput subject to the number of served IoT devices. Finally, numerical results show that the proposed algorithms significantly outperform benchmark approaches in terms of the number of served IoT devices and system throughput.
△ Less
Submitted 16 August, 2021; v1 submitted 1 August, 2020;
originally announced August 2020.