Explanatory causal effects for model agnostic explanations

Authors: Jiuyong Li, Ha Xuan Tran, Thuc Duy Le, Lin Liu, Kui Yu, Jixue Liu

Abstract: This paper studies the problem of estimating the contributions of features to the prediction of a specific instance by a machine learning model and the overall contribution of a feature to the model. The causal effect of a feature (variable) on the predicted outcome reflects the contribution of the feature to a prediction very well. A challenge is that most existing causal effects cannot be estimated from data without a known causal graph. In this paper, we define an explanatory causal effect based on a hypothetical ideal experiment. The definition brings several benefits to model agnostic explanations. First, explanations are transparent and have causal meanings. Second, the explanatory causal effect estimation can be data driven. Third, the causal effects provide both a local explanation for a specific prediction and a global explanation showing the overall importance of a feature in a predictive model. We further propose a method using individual and combined variables based on explanatory causal effects for explanations. We show the definition and the method work with experiments on some real-world data sets.

Submitted 23 June, 2022; originally announced June 2022.

Comments: 17

arXiv:2206.10073 [pdf, other]

Finding Optimal Policy for Queueing Models: New Parameterization

Authors: Trang H. Tran, Lam M. Nguyen, Katya Scheinberg

Abstract: Queueing systems appear in many important real-life applications including communication networks, transportation and manufacturing systems. Reinforcement learning (RL) framework is a suitable model for the queueing control problem where the underlying dynamics are usually unknown and the agent receives little information from the environment to navigate. In this work, we investigate the optimization aspects of the queueing model as a RL environment and provide insight to learn the optimal policy efficiently. We propose a new parameterization of the policy by using the intrinsic properties of queueing network systems. Experiments show good performance of our methods with various load conditions from light to heavy traffic.

Submitted 20 June, 2022; originally announced June 2022.

arXiv:2206.09398 [pdf, other]

Aligning individual brains with Fused Unbalanced Gromov-Wasserstein

Authors: Alexis Thual, Huy Tran, Tatiana Zemskova, Nicolas Courty, Rémi Flamary, Stanislas Dehaene, Bertrand Thirion

Abstract: Individual brains vary in both anatomy and functional organization, even within a given species. Inter-individual variability is a major impediment when trying to draw generalizable conclusions from neuroimaging data collected on groups of subjects. Current co-registration procedures rely on limited data, and thus lead to very coarse inter-subject alignments. In this work, we present a novel method for inter-subject alignment based on Optimal Transport, denoted as Fused Unbalanced Gromov Wasserstein (FUGW). The method aligns cortical surfaces based on the similarity of their functional signatures in response to a variety of stimulation settings, while penalizing large deformations of individual topographic organization. We demonstrate that FUGW is well-suited for whole-brain landmark-free alignment. The unbalanced feature allows to deal with the fact that functional areas vary in size across subjects. Our results show that FUGW alignment significantly increases between-subject correlation of activity for independent functional data, and leads to more precise mapping at the group level.

Submitted 22 August, 2023; v1 submitted 19 June, 2022; originally announced June 2022.

Journal ref: Advances in Neural Information Processing Systems, 35 (2022) 21792-21804

arXiv:2206.08513 [pdf]

TLETA: Deep Transfer Learning and Integrated Cellular Knowledge for Estimated Time of Arrival Prediction

Authors: Hieu Tran, Son Nguyen, I-Ling Yen, Farokh Bastani

Abstract: Vehicle arrival time prediction has been studied widely. With the emergence of IoT devices and deep learning techniques, estimated time of arrival (ETA) has become a critical component in intelligent transportation systems. Though many tools exist for ETA, ETA for special vehicles, such as ambulances, fire engines, etc., is still challenging due to the limited amount of traffic data for special vehicles. Existing works use one model for all types of vehicles, which can lead to low accuracy. To tackle this, as the first in the field, we propose a deep transfer learning framework TLETA for the driving time prediction. TLETA constructs cellular spatial-temporal knowledge grids for extracting driving patterns, combined with the road network structure embedding to build a deep neural network for ETA. TLETA contains transferable layers to support knowledge transfer between different categories of vehicles. Importantly, our transfer models only train the last layers to map the transferred knowledge, that reduces the training time significantly. The experimental studies show that our model predicts travel time with high accuracy and outperforms many state-of-the-art approaches.

Submitted 16 June, 2022; originally announced June 2022.

Comments: 8 pages, 3 figures, 3 tables. The 25th IEEE International Conference on Intelligent Transportation Systems (IEEE ITSC 2022)

arXiv:2206.08398 [pdf, other]

Learning Generic Lung Ultrasound Biomarkers for Decoupling Feature Extraction from Downstream Tasks

Authors: Gautam Rajendrakumar Gare, Tom Fox, Pete Lowery, Kevin Zamora, Hai V. Tran, Laura Hutchins, David Montgomery, Amita Krishnan, Deva Kannan Ramanan, Ricardo Luis Rodriguez, Bennett P deBoisblanc, John Michael Galeotti

Abstract: Contemporary artificial neural networks (ANN) are trained end-to-end, jointly learning both features and classifiers for the task of interest. Though enormously effective, this paradigm imposes significant costs in assembling annotated task-specific datasets and training large-scale networks. We propose to decouple feature learning from downstream lung ultrasound tasks by introducing an auxiliary pre-task of visual biomarker classification. We demonstrate that one can learn an informative, concise, and interpretable feature space from ultrasound videos by training models for predicting biomarker labels. Notably, biomarker feature extractors can be trained from data annotated with weak video-scale supervision. These features can be used by a variety of downstream Expert models targeted for diverse clinical tasks (Diagnosis, lung severity, S/F ratio). Crucially, task-specific expert models are comparable in accuracy to end-to-end models directly trained for such target tasks, while being significantly lower cost to train.

Submitted 16 June, 2022; originally announced June 2022.

arXiv:2206.05869 [pdf, other]

On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms

Authors: Lam M. Nguyen, Trang H. Tran

Abstract: Stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency in dealing with large-scale problems. In this paper, we focus on the shuffling version of SGD which matches the mainstream practical heuristics. We show the convergence to a global solution of shuffling SGD for a class of non-convex functions under over-parameterized settings. Our analysis employs more relaxed non-convex assumptions than previous literature. Nevertheless, we maintain the desired computational complexity as shuffling SGD has achieved in the general convex setting.

Submitted 25 October, 2023; v1 submitted 12 June, 2022; originally announced June 2022.

Comments: The 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

arXiv:2206.01432 [pdf, other]

On the Generalization of Wasserstein Robust Federated Learning

Authors: Tung-Anh Nguyen, Tuan Dung Nguyen, Long Tan Le, Canh T. Dinh, Nguyen H. Tran

Abstract: In federated learning, participating clients typically possess non-i.i.d. data, posing a significant challenge to generalization to unseen distributions. To address this, we propose a Wasserstein distributionally robust optimization scheme called WAFL. Leveraging its duality, we frame WAFL as an empirical surrogate risk minimization problem, and solve it using a local SGD-based algorithm with convergence guarantees. We show that the robustness of WAFL is more general than related approaches, and the generalization bound is robust to all adversarial distributions inside the Wasserstein ball (ambiguity set). Since the center location and radius of the Wasserstein ball can be suitably modified, WAFL shows its applicability not only in robustness but also in domain adaptation. Through empirical evaluation, we demonstrate that WAFL generalizes better than the vanilla FedAvg in non-i.i.d. settings, and is more robust than other related methods in distribution shift settings. Further, using benchmark datasets we show that WAFL is capable of generalizing to unseen target domains.

Submitted 3 June, 2022; originally announced June 2022.

arXiv:2205.15904 [pdf, other]

doi 10.1007/978-3-031-07481-3_5

Synthesizing Configuration Tactics for Exercising Hidden Options in Serverless Systems

Authors: Jörn Kuhlenkamp, Sebastian Werner, Chin Hong Tran, Stefan Tai

Abstract: A proper configuration of an information system can ensure accuracy and efficiency, among other system objectives. Conversely, a poor configuration can have a significant negative impact on the system's performance, reliability, and cost. Serverless systems, which are comprised of many functions and managed services, especially risk exposure to misconfigurations, with many provider- and platform-specific, often intransparent and 'hidden' settings. In this paper, we argue to pay close attention to the configuration of serverless systems to exercise options with known accuracy, cost and time. Based on a literature study and long-term serverless systems development experience, we present nine tactics to unlock potentially neglected and unknown options in serverless systems.

Submitted 3 June, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

Comments: updated typo in abstract

Journal ref: Intelligent Information Systems - CAiSE 2022

arXiv:2205.14923 [pdf, other]

Unbalanced CO-Optimal Transport

Authors: Quang Huy Tran, Hicham Janati, Nicolas Courty, Rémi Flamary, Ievgen Redko, Pinar Demetci, Ritambhara Singh

Abstract: Optimal transport (OT) compares probability distributions by computing a meaningful alignment between their samples. CO-optimal transport (COOT) takes this comparison further by inferring an alignment between features as well. While this approach leads to better alignments and generalizes both OT and Gromov-Wasserstein distances, we provide a theoretical result showing that it is sensitive to outliers that are omnipresent in real-world data. This prompts us to propose unbalanced COOT for which we provably show its robustness to noise in the compared datasets. To the best of our knowledge, this is the first such result for OT methods in incomparable spaces. With this result in hand, we provide empirical evidence of this robustness for the challenging tasks of heterogeneous domain adaptation with and without varying proportions of classes and simultaneous alignment of samples and features across single-cell measurements.

Submitted 20 February, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

Comments: Edit format and fix typos

arXiv:2205.10529 [pdf, other]

Fine-Grained Visual Classification using Self Assessment Classifier

Authors: Tuong Do, Huy Tran, Erman Tjiputra, Quang D. Tran, Anh Nguyen

Abstract: Extracting discriminative features plays a crucial role in the fine-grained visual classification task. Most of the existing methods focus on developing attention or augmentation mechanisms to achieve this goal. However, addressing the ambiguity in the top-k prediction classes is not fully investigated. In this paper, we introduce a Self Assessment Classifier, which simultaneously leverages the representation of the image and top-k prediction classes to reassess the classification results. Our method is inspired by continual learning with coarse-grained and fine-grained classifiers to increase the discrimination of features in the backbone and produce attention maps of informative areas on the image. In practice, our method works as an auxiliary branch and can be easily integrated into different architectures. We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets. Furthermore, our method also consistently improves the accuracy of different existing fine-grained classifiers with a unified setup.

Submitted 21 May, 2022; originally announced May 2022.

arXiv:2205.09507 [pdf, ps, other]

Single Crystalline 2D Material Nanoribbon Networks for Nanoelectronics

Authors: Muhammad Awais Aslam, Tuan Hoang Tran, Antonio Supina, Olivier Siri, Vincent Meunier, Kenji Watanabe, Takashi Taniguchi, Marko Kralj, Christian Teichert, Evgeniya Sheremet, Raul D. Rodriguez, Aleksandar Matković

Abstract: The last decade has seen a flurry of studies related to graphene nanoribbons owing to their potential applications in the quantum realm. However, little experimental work has been reported towards nanoribbons of other 2D materials due to the absence of synthesis routes. Here, we propose a universal approach to synthesize high-quality networks of nanoribbons from arbitrary 2D materials while maintaining high crystallinity, sufficient yield, narrow size distribution, and straight-forward device integrability. The wide applicability of this technique is demonstrated by fabricating MoS2, WS2, WSe2, and graphene nanoribbon field effect transistors that inherently do not suffer from interconnection resistances. By relying on self-assembled and self-aligned organic nanostructures as masks, we demonstrate the possibility of controlling the predominant crystallographic direction of the nanoribbon's edges. Electrical characterization shows record mobilities and very high ON currents for various TMDCs despite extreme width scaling. Lastly, we explore decoration of nanoribbon edges with plasmonic particles paving the way towards the development of nanoribbon-based plasmonic sensing and opto-electronic devices.

Submitted 16 May, 2022; originally announced May 2022.

arXiv:2205.08812 [pdf, other]

Anomaly detection using prediction error with Spatio-Temporal Convolutional LSTM

Authors: Hanh Thi Minh Tran, David Hogg

Abstract: In this paper, we propose a novel method for video anomaly detection motivated by an existing architecture for sequence-to-sequence prediction and reconstruction using a spatio-temporal convolutional Long Short-Term Memory (convLSTM). As in previous work on anomaly detection, anomalies arise as spatially localised failures in reconstruction or prediction. In experiments with five benchmark datasets, we show that using prediction gives superior performance to using reconstruction. We also compare performance with different length input/output sequences. Overall, our results using prediction are comparable with the state of the art on the benchmark datasets.

Submitted 18 May, 2022; originally announced May 2022.

arXiv:2205.08548 [pdf, other]

doi 10.3847/1538-4357/ac6fe0

Distances to Local Group Galaxies via Population II, Stellar Distance Indicators I: The Sculptor Dwarf Spheroidal

Authors: Quang H. Tran, Taylor J. Hoyt, Wendy L. Freedman, Barry F. Madore, Elias K. Oakes, William Cerny, Dylan Hatt, Rachael L. Beaton

Abstract: We determine the distance to the Sculptor Dwarf Spheroidal via three Population II stellar distance indicators: (a) the Tip of the Red Giant Branch (TRGB), (b) RR Lyrae variables (RRLs), and (c) the ridgeline of the blue horizontal branch (HB). High signal-to-noise, wide-field $VI$ imaging that covers an area $48' \times 48'$ and reaches a photometric depth approximately 2 mag fainter than the HB was acquired with the Magellan-Baade 6.5m telescope. The true modulus derived from Sculptor's TRGB is found to be $\mu^\mathrm{TRGB}_o = 19.59 \pm 0.07_\mathrm{stat} \pm 0.05_\mathrm{sys}$ mag. Along with periods adopted from the literature, newly acquired RRL phase points are fit with template light curves to determine $\mu_{W_{I,V-I}}^\mathrm{RRL} = 19.60 \pm 0.01_\mathrm{stat} \pm 0.05_\mathrm{sys}$ mag. Finally, the HB distance is found to be $\mu^\mathrm{HB}_o = 19.54 \pm 0.03_\mathrm{stat} \pm 0.09_\mathrm{sys}$ mag. Absolute calibrations of each method are anchored by independent geometric zero-points, utilize a different class of stars, and are determined from the same photometric calibration.

Submitted 17 May, 2022; originally announced May 2022.

Comments: 16 pages, 14 figures

arXiv:2205.06457 [pdf, ps, other]

ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation

Authors: Long Phan, Hieu Tran, Hieu Nguyen, Trieu H. Trinh

Abstract: We present ViT5, a pretrained Transformer-based encoder-decoder model for the Vietnamese language. With T5-style self-supervised pretraining, ViT5 is trained on a large corpus of high-quality and diverse Vietnamese texts. We benchmark ViT5 on two downstream text generation tasks, Abstractive Text Summarization and Named Entity Recognition. Although Abstractive Text Summarization has been widely studied for the English language thanks to its rich and large source of data, there has been minimal research into the same task in Vietnamese, a much lower resource language. In this work, we perform exhaustive experiments on both Vietnamese Abstractive Summarization and Named Entity Recognition, validating the performance of ViT5 against many other pretrained Transformer-based encoder-decoder models. Our experiments show that ViT5 significantly outperforms existing models and achieves state-of-the-art results on Vietnamese Text Summarization. On the task of Named Entity Recognition, ViT5 is competitive against previous best results from pretrained encoder-based Transformer models. Further analysis shows the importance of context length during the self-supervised pretraining on downstream performance across different settings.

Submitted 26 May, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

Comments: NAACL SRW 2022. arXiv admin note: text overlap with arXiv:2110.04257

arXiv:2205.05307 [pdf]

doi 10.3847/1538-3881/ac7da2

The AGEL Survey: Spectroscopic Confirmation of Strong Gravitational Lenses in the DES and DECaLS Fields Selected Using Convolutional Neural Networks

Authors: Kim-Vy H. Tran, Anishya Harshan, Karl Glazebrook, G. C. Keerthi Vasan, Tucker Jones, Colin Jacobs, Glenn G. Kacprzak, Tania M. Barone, Thomas E. Collett, Anshu Gupta, Astrid Henderson, Lisa J. Kewley, Sebastian Lopez, Themiya Nanayakkara, Ryan L. Sanders, Sarah M. Sweet

Abstract: We present spectroscopic confirmation of candidate strong gravitational lenses using the Keck Observatory and Very Large Telescope as part of our ASTRO 3D Galaxy Evolution with Lenses (AGEL) survey. We confirm that 1) search methods using Convolutional Neural Networks (CNN) with visual inspection successfully identify strong gravitational lenses and 2) the lenses are at higher redshifts relative to existing surveys due to the combination of deeper and higher resolution imaging from DECam and spectroscopy spanning optical to near-infrared wavelengths. We measure 104 redshifts in 77 systems selected from a catalog in the DES and DECaLS imaging fields (r<22 mag). Combining our results with published redshifts, we present redshifts for 68 lenses and establish that CNN-based searches are highly effective for use in future imaging surveys with a success rate of 88% (defined as 68/77). We report 53 strong lenses with spectroscopic redshifts for both the deflector and source (z_src>z_defl), and 15 lenses with a spectroscopic redshift for either the deflector (z_defl>0.21) or source (z_src>1.34). For the 68 lenses, the deflectors and sources have average redshifts and standard deviations of 0.58+/-0.14 and 1.92+/-0.59 respectively, and corresponding redshift ranges of (0.21<z_defl<0.89) and (0.88<z_src<3.55). The AGEL systems include 41 deflectors at z_defl>0.5 that are ideal for follow-up studies to track how mass density profiles evolve with redshift.
Our goal with AGEL is to spectroscopically confirm ~100 strong gravitational lenses that can be observed from both hemispheres throughout the year. The AGEL survey is a resource for refining automated all-sky searches and addressing a range of questions in astrophysics and cosmology.

Submitted 26 September, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

Comments: Updated final version of manuscript published by the Astronomical Journal

arXiv:2205.03976 [pdf, other]

doi 10.1007/s44007-023-00053-2

Orientations and cycles in supersingular isogeny graphs

Authors: Sarah Arpin, Mingjie Chen, Kristin E. Lauter, Renate Scheidler, Katherine E. Stange, Ha T. N. Tran

Abstract: The paper concerns several theoretical aspects of oriented supersingular $\ell$-isogeny volcanoes and their relationship to closed walks in the supersingular $\ell$-isogeny graph. Our main result is a bijection between the rims of the union of all oriented supersingular $\ell$-isogeny volcanoes over $\overline{\mathbb{F}}_p$ (up to conjugation of the orientations), and isogeny cycles (non-backtracking closed walks which are not powers of smaller walks) of the supersingular $\ell$-isogeny graph over $\overline{\mathbb{F}}_p$. The exact proof and statement of this bijection are made more intricate by special behaviours arising from extra automorphisms and the ramification of $p$ in certain quadratic orders. We use the bijection to count isogeny cycles of given length in the supersingular $\ell$-isogeny graph exactly as a sum of class numbers of these orders, and also give an explicit upper bound by estimating the class numbers.

Submitted 4 December, 2022; v1 submitted 8 May, 2022; originally announced May 2022.

Comments: 41 pages, 7 figures

MSC Class: 14G50; 94A60; 11G05; 14K04

arXiv:2204.13068 [pdf, other]

Measuring vesicle loading with holographic microscopy and bulk light scattering

Authors: Lan Hai Anh Tran, Lauren A. Lowe, Matthew Turner, James Luong, Omar Abdullah A. Khamis, Yaam Deckel, Megan L. Amos, Anna Wang

Abstract: We report efforts to quantify the loading of cell-sized lipid vesicles using in-line digital holographic microscopy. This method does not require fluorescent reporters, fluorescent tracers, or radioactive tracers. A single-color LED light source takes the place of conventional illumination to generate holograms rather than bright field images. By modelling the vesicle's scattering in a microscope with a Lorenz-Mie light scattering model, and comparing the results to data holograms, we are able to measure the vesicle's refractive index and thus loading. Performing the same comparison for bulk light scattering measurements enables retrieval of vesicle loading for nanoscale vesicles.

Submitted 26 April, 2024; v1 submitted 12 April, 2022; originally announced April 2022.

Comments: 7 figures

arXiv:2204.09875 [pdf, other]

Persistent-Transient Duality in Human Behavior Modeling

Authors: Hung Tran, Vuong Le, Svetha Venkatesh, Truyen Tran

Abstract: We propose to model the persistent-transient duality in human behavior using a parent-child multi-channel neural network, which features a parent persistent channel that manages the global dynamics and children transient channels that are initiated and terminated on-demand to handle detailed interactive actions. The short-lived transient sessions are managed by a proposed Transient Switch. The neural framework is trained to discover the structure of the duality automatically. Our model shows superior performances in human-object interaction motion prediction.

Submitted 21 April, 2022; originally announced April 2022.

Comments: Accepted at CVPR Precognition Workshop 2022

arXiv:2204.09699 [pdf]

doi 10.3847/1538-4357/ac5b07

Distances to Local Group Galaxies via Population II, Stellar Distance Indicators. II. The Fornax Dwarf Spheroidal

Authors: Elias K. Oakes, Taylor J. Hoyt, Wendy L. Freedman, Barry F. Madore, Quang H. Tran, William Cerny, Rachael L. Beaton, Mark Seibert

Abstract: We determine three independent Population II distance moduli to the Fornax dwarf spheroidal (dSph) galaxy, using wide-field, ground-based $VI$ imaging acquired with the Magellan-Baade telescope at Las Campanas Observatory. After subtracting foreground stars using Gaia EDR3 proper motions, we measure an $I$-band tip of the red giant branch (TRGB) magnitude of $I_0^\mathrm{TRGB} = 16.753 \pm 0.03_\mathrm{stat} \pm 0.037_\mathrm{sys}$ mag, with a calibration based in the LMC giving a distance modulus of $μ_0^\mathrm{TRGB} = 20.80 \pm 0.037_\mathrm{stat} \pm 0.057_\mathrm{sys}$ mag. We determine an RR Lyrae (RRL) distance from template mean magnitudes, with periods adopted from the literature. Adopting a Gaia DR2 calibration of first overtone RRL period-luminosity and period-Wesenheit relations, we find $μ_0^\mathrm{PLZ} = 20.74 \pm 0.01_\mathrm{stat} \pm 0.12_\mathrm{sys}$ mag and $μ_0^\mathrm{PWZ} = 20.68 \pm 0.02_\mathrm{stat} \pm 0.07_\mathrm{sys}$ mag. Finally, we determine a distance from Fornax's horizontal branch (HB) and two galactic globular cluster calibrators, giving $μ_0^\mathrm{HB} = 20.83 \pm 0.03_\mathrm{stat} \pm 0.09_\mathrm{sys}$ mag. These distances are each derived from homogeneous IMACS photometry, are anchored to independent geometric zero-points, and utilize different classes of stars. We therefore average over independent uncertainties and report the combined distance modulus $\langle μ_0\rangle = 20.770 \pm 0.042_\mathrm{stat} \pm 0.024_\mathrm{sys}$ mag (corresponding to a distance of $143\pm3$ kpc).

Submitted 10 May, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

Comments: 17 pages, 11 figures, 4 tables; published in the Astrophysical Journal. Updated to correct typos

Journal ref: ApJ, 929, 116 (2022)
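
As a quick check on the Fornax entry above, the combined distance modulus can be converted to a physical distance with the standard relation $d = 10^{μ_0/5 + 1}$ pc. The sketch below is only an illustration of that textbook formula; the function name `distance_kpc` is an assumption made here and does not come from the paper.

```python
def distance_kpc(mu):
    """Convert a true (extinction-corrected) distance modulus in mag
    to a distance in kiloparsecs, using d = 10**(mu/5 + 1) parsecs."""
    distance_pc = 10 ** (mu / 5.0 + 1.0)
    return distance_pc / 1000.0


if __name__ == "__main__":
    # Combined modulus quoted in the abstract: <mu_0> = 20.770 mag
    print(round(distance_kpc(20.770)))  # -> 143
```

The result rounds to the 143 kpc quoted in the abstract; propagating the stated uncertainties is left out of this sketch.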

arXiv:2204.09242 [pdf, ps, other]

Strongly Quasiconvex subgroups in graphs of groups

Authors: Hoang Thanh Nguyen, Hung Cong Tran

Abstract: Given a graph of groups $\mathcal{G} = (Γ, \{G_v\}, \{G_e\})$ with certain conditions on vertex groups and $G$ acts acylindrically on its Bass-Serre tree $T$. Let $H$ be a finitely generated subgroup of $G$. We prove that the following statements are equivalent: $H$ has finite height, $(G, T, H)$ is an $A/QI$-triple, $H$ is strongly quasiconvex and virtually free in $G$. We also give a condition to determine whether strong quasiconvexity in a group is preserved under amalgams.

Submitted 20 April, 2022; originally announced April 2022.

arXiv:2204.05350 [pdf, ps, other]

Leveraging Deep Neural Networks for Massive MIMO Data Detection

Authors: Ly V. Nguyen, Nhan T. Nguyen, Nghi H. Tran, Markku Juntti, A. Lee Swindlehurst, Duy H. N. Nguyen

Abstract: Massive multiple-input multiple-output (MIMO) is a key technology for emerging next-generation wireless systems. Utilizing large antenna arrays at base-stations, massive MIMO enables substantial spatial multiplexing gains by simultaneously serving a large number of users. However, the complexity in massive MIMO signal processing (e.g., data detection) increases rapidly with the number of users, making conventional hand-engineered algorithms less computationally efficient. Low-complexity massive MIMO detection algorithms, especially those inspired or aided by deep learning, have emerged as a promising solution. While there exist many MIMO detection algorithms, the aim of this magazine paper is to provide insight into how to leverage deep neural networks (DNN) for massive MIMO detection. We review recent developments in DNN-based MIMO detection that incorporate the domain knowledge of established MIMO detection algorithms with the learning capability of DNNs. We then present a comparison of the key numerical performance metrics of these works. We conclude by describing future research areas and applications of DNNs in massive MIMO receivers.

Submitted 11 April, 2022; originally announced April 2022.

Comments: 7 pages, 5 figures, Accepted to IEEE Wireless Communications Magazine

arXiv:2203.13807 [pdf, other]

Differentiability of effective fronts in the continuous setting in two dimensions

Authors: Hung V. Tran, Yifeng Yu

Abstract: We study the effective front associated with first-order front propagations in two dimensions ($n=2$) in the periodic setting with continuous coefficients. Our main result states that the boundary of the effective front is differentiable at every irrational point. Equivalently, the stable norm associated with a continuous $\mathbb{Z}^2$-periodic Riemannian metric is differentiable at irrational points. This conclusion was obtained decades ago for smooth metrics ([3,5]). To the best of our knowledge, our result provides the first nontrivial property of the effective fronts in the continuous setting, which is the standard assumption in the PDE theory. Combining with the sufficiency result in [12], our result implies that for continuous coefficients, a polygon could be an effective front if and only if it is centrally symmetric with rational vertices and nonempty interior.

Submitted 7 June, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

Comments: 27 pages, 9 figures. Updated version with additional references and expository parts

arXiv:2203.12071 [pdf, other]

doi 10.1109/LRA.2022.3193464

WayFAST: Navigation with Predictive Traversability in the Field

Authors: Mateus Valverde Gasparino, Arun Narenthiran Sivakumar, Yixiao Liu, Andres Eduardo Baquero Velasquez, Vitor Akihiro Hisano Higuti, John Rogers, Huy Tran, Girish Chowdhary

Abstract: We present a self-supervised approach for learning to predict traversable paths for wheeled mobile robots that require good traction to navigate. Our algorithm, termed WayFAST (Waypoint Free Autonomous Systems for Traversability), uses RGB and depth data, along with navigation experience, to autonomously generate traversable paths in outdoor unstructured environments. Our key inspiration is that traction can be estimated for rolling robots using kinodynamic models. Using traction estimates provided by an online receding horizon estimator, we are able to train a traversability prediction neural network in a self-supervised manner, without requiring heuristics utilized by previous methods. We demonstrate the effectiveness of WayFAST through extensive field testing in varying environments, ranging from sandy dry beaches to forest canopies and snow covered grass fields. Our results clearly demonstrate that WayFAST can learn to avoid geometric obstacles as well as untraversable terrain, such as snow, which would be difficult to avoid with sensors that provide only geometric data, such as LiDAR. Furthermore, we show that our training pipeline based on online traction estimates is more data-efficient than other heuristic-based methods.

Submitted 1 August, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

Comments: Project website with code and videos: https://mateusgasparino.com/wayfast-traversability-navigation/ Published in the IEEE Robotics and Automation Letters (RA-L, 2022). Accepted for presentation at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022).

ACM Class: I.2.9; I.2.6; I.2.10

arXiv:2203.10791 [pdf]

IoT Data Discovery: Routing Table and Summarization Techniques

Authors: Hieu Tran, Son Nguyen, I-Ling Yen, Farokh Bastani

Abstract: In this paper, we consider the IoT data discovery problem in very large and growing scale networks. Through analysis, examples, and experimental studies, we show the importance of peer-to-peer, unstructured routing for IoT data discovery and point out the space efficiency issue that has been overlooked in keyword-based routing algorithms in unstructured networks. Specifically, as the first in the field, this paper investigates routing table designs and various compression techniques to support effective and space-efficient IoT data discovery routing. Novel summarization algorithms, including alphabetical, hash, and meaning-based summarization and their corresponding coding schemes, are proposed. We also consider routing table design to support summarization without degrading lookup efficiency for discovery query routing. The issue of potentially misleading routing due to summarization is also investigated. Subsequently, we analyze the strategy of when to summarize to balance the tradeoff between the routing table compression rate and the chance of causing misleading routing. For the experimental study, we have collected 100K IoT data streams from various IoT databases as the input dataset. Experimental results show that our summarization solution can reduce the routing table size by 20 to 30 folds with a 2-5% increase in latency compared with similar peer-to-peer discovery routing algorithms without summarization. Also, our approach outperforms DHT-based approaches by 2 to 6 folds in terms of latency and traffic.

Submitted 6 May, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

Comments: 18 pages, 23 figures, 1 table, 3 algorithms. arXiv admin note: substantial text overlap with arXiv:2107.09558

arXiv:2203.05170 [pdf, other]

LensingETC: a tool to optimize multi-filter imaging campaigns of galaxy-scale strong lensing systems

Authors: Anowar J. Shajib, Karl Glazebrook, Tania Barone, Geraint F. Lewis, Tucker Jones, Kim-Vy H. Tran, Elizabeth Buckley-Geer, Thomas E. Collett, Joshua Frieman, Colin Jacobs

Abstract: Imaging data is the principal observable required to use galaxy-scale strong lensing in a multitude of applications in extragalactic astrophysics and cosmology. In this paper, we develop Lensing Exposure Time Calculator (LensingETC) to optimize the efficiency of telescope time usage when planning multi-filter imaging campaigns for galaxy-scale strong lenses. This tool simulates realistic data tailored to specified instrument characteristics and then automatically models them to assess the power of the data in constraining lens model parameters. We demonstrate a use case of this tool by optimizing a two-filter observing strategy (in IR and UVIS) within the limited exposure time per system allowed by a Hubble Space Telescope (HST) Snapshot program. We find that higher resolution is more advantageous to gain constraining power on the lensing observables, when there is a trade-off between signal-to-noise ratio and resolution; e.g., between the UVIS and IR filters of the HST. We also find that, whereas a point spread function (PSF) with sub-Nyquist sampling allows the sample mean for a model parameter to be robustly recovered for both galaxy-galaxy and point-source lensing systems, a sub-Nyquist sampled PSF introduces a larger scatter than a Nyquist sampled one in the deviation from the ground truth for point-source lens systems.

Submitted 10 March, 2022; originally announced March 2022.

Comments: 10 pages, 7 figures, 3 tables

arXiv:2203.04334 [pdf, other]

doi 10.3847/1538-3881/ac5c4f

TOI-1670 b and c: An Inner Sub-Neptune with an Outer Warm Jupiter Unlikely to have Originated from High-Eccentricity Migration

Authors: Quang H. Tran, Brendan P. Bowler, Michael Endl, William D. Cochran, Phillip J. MacQueen, Davide Gandolfi, Carina M. Persson, Malcolm Fridlund, Enric Palle, Grzegorz Nowak, Hans J. Deeg, Rafael Luque, John H. Livingston, Petr Kabáth, Marek Skarka, Ján Šubjak, Steve B. Howell, Simon H. Albrecht, Karen A. Collins, Massimiliano Esposito, Vincent Van Eylen, Sascha Grziwa, Elisa Goffo, Chelsea X. Huang, Jon M. Jenkins, et al. (16 additional authors not shown)

Abstract: We report the discovery of two transiting planets around the bright ($V=9.9$ mag) main sequence F7 star TOI-1670 by the Transiting Exoplanet Survey Satellite. TOI-1670 b is a sub-Neptune ($R_\mathrm{b} = 2.06_{-0.15}^{+0.19}$ $R_\oplus$) on a 10.9-day orbit and TOI-1670 c is a warm Jupiter ($R_\mathrm{c} = 0.987_{-0.025}^{+0.025}$ $R_\mathrm{Jup}$) on a 40.7-day orbit. Using radial velocity observations gathered with the Tull coudé Spectrograph on the Harlan J. Smith telescope and HARPS-N on the Telescopio Nazionale Galileo, we find a planet mass of $M_\mathrm{c} = 0.63_{-0.08}^{+0.09}$ $M_\mathrm{Jup}$ for the outer warm Jupiter, implying a mean density of $ρ_c = 0.81_{-0.11}^{+0.13}$ g cm$^{-3}$. The inner sub-Neptune is undetected in our radial velocity data ($M_\mathrm{b} < 0.13$ $M_\mathrm{Jup}$ at the 99% confidence level). Multi-planet systems like TOI-1670 hosting an outer warm Jupiter on a nearly circular orbit ($e_\mathrm{c} = 0.09_{-0.04}^{+0.05}$) and one or more inner coplanar planets are more consistent with "gentle" formation mechanisms such as disk migration or $in$ $situ$ formation rather than high-eccentricity migration. Of the 11 known systems with a warm Jupiter and a smaller inner companion, 8 (73%) are near a low-order mean-motion resonance, which can be a signature of migration. TOI-1670 joins two other systems (27% of this subsample) with period commensurabilities greater than 3, a common feature of $in$ $situ$ formation or halted inward migration. TOI-1670 and the handful of similar systems support a diversity of formation pathways for warm Jupiters.

Submitted 8 March, 2022; originally announced March 2022.

Comments: 23 pages, 9 figures, accepted for publication in AJ

arXiv:2202.09353 [pdf, other]

doi 10.1016/j.nimb.2022.06.001

Model Calibration of the Liquid Mercury Spallation Target using Evolutionary Neural Networks and Sparse Polynomial Expansions

Authors: Majdi I. Radaideh, Hoang Tran, Lianshan Lin, Hao Jiang, Drew Winder, Sarma Gorti, Guannan Zhang, Justin Mach, Sarah Cousineau

Abstract: The mercury constitutive model predicting the strain and stress in the target vessel plays a central role in improving the lifetime prediction and future target designs of the mercury targets at the Spallation Neutron Source (SNS). We leverage the experiment strain data collected over multiple years to improve the mercury constitutive model through a combination of large-scale simulations of the target behavior and the use of machine learning tools for parameter estimation. We present two interdisciplinary approaches for surrogate-based model calibration of expensive simulations using evolutionary neural networks and sparse polynomial expansions. The experiments and results of the two methods show a very good agreement for the solid mechanics simulation of the mercury spallation target. The proposed methods are used to calibrate the tensile cutoff threshold, mercury density, and mercury speed of sound during intense proton pulse experiments. Using strain experimental data from the mercury target sensors, the newly calibrated simulations achieve 7% average improvement on the signal prediction accuracy and 8% reduction in mean absolute error compared to previously reported reference parameters, with some sensors experiencing up to 30% improvement. The proposed calibrated simulations can significantly aid in fatigue analysis to estimate the mercury target lifetime and integrity, which reduces abrupt target failure and saves a tremendous amount of costs. However, an important conclusion from this work points to a deficiency in the current constitutive model based on the equation of state in capturing the full physics of the spallation reaction. Given that some of the calibrated parameters that show a good agreement with the experimental data can be nonphysical mercury properties, we need a more advanced two-phase flow model to capture bubble dynamics and mercury cavitation.

Submitted 18 February, 2022; originally announced February 2022.

Comments: 26 pages, 10 figures, 6 tables

Journal ref: Nucl. Instrum. Methods Phys. Res. B 525 (2022) 41-54

arXiv:2202.07741 [pdf, other]

Disentangling Successor Features for Coordination in Multi-agent Reinforcement Learning

Authors: Seung Hyun Kim, Neale Van Stralen, Girish Chowdhary, Huy T. Tran

Abstract: Multi-agent reinforcement learning (MARL) is a promising framework for solving complex tasks with many agents. However, a key challenge in MARL is defining private utility functions that ensure coordination when training decentralized agents. This challenge is especially prevalent in unstructured tasks with sparse rewards and many agents. We show that successor features can help address this challenge by disentangling an individual agent's impact on the global value function from that of all other agents. We use this disentanglement to compactly represent private utilities that support stable training of decentralized agents in unstructured tasks. We implement our approach using a centralized training, decentralized execution architecture and test it in a variety of multi-agent environments. Our results show improved performance and training time relative to existing methods and suggest that disentanglement of successor features offers a promising approach to coordination in MARL.

Submitted 15 February, 2022; originally announced February 2022.

Comments: The paper is accepted in AAMAS 2022 (International Conference on Autonomous Agents and Multiagent Systems)

arXiv:2202.06350 [pdf, other]

doi 10.1088/1475-7516/2022/07/014

Numerical prescriptions of early-time divergences of the in-in formalism

Authors: Duc Huy Tran, Yi Wang, Juanyi Yang, Yuhang Zhu

Abstract: In quantum field theory, the in and out states can be related to the full Hamiltonian by the $iε$ prescription. A Wick rotation can further bring the correlation functions to Euclidean spacetime where the integrals are better defined. This setup is convenient for analytical calculations. However, for numerical calculations, an infinitesimal $ε$ or a Wick rotation of numerical functions are difficult to implement. We propose two new numerical methods to solve this problem, namely an Integral Basis method based on linear regression and a Beta Regulator method based on Cesàro/Riesz summation. Another class of partition-extrapolation methods previously used in electromagnetic engineering is also introduced. We benchmark these methods with existing methods using in-in formalism integrals, indicating advantages of these new methods over the existing methods in computation time and accuracy.

Submitted 13 February, 2022; originally announced February 2022.

arXiv:2202.05892 [pdf, other]

doi 10.3847/1538-4365/ac90c1

POSYDON: A General-Purpose Population Synthesis Code with Detailed Binary-Evolution Simulations

Authors: Tassos Fragos, Jeff J. Andrews, Simone S. Bavera, Christopher P. L. Berry, Scott Coughlin, Aaron Dotter, Prabin Giri, Vicky Kalogera, Aggelos Katsaggelos, Konstantinos Kovlakas, Shamal Lalvani, Devina Misra, Philipp M. Srivastava, Ying Qin, Kyle A. Rocha, Jaime Roman-Garza, Juan Gabriel Serra, Petter Stahle, Meng Sun, Xu Teng, Goce Trajcevski, Nam Hai Tran, Zepei Xing, Emmanouil Zapartas, Michael Zevin

Abstract: Most massive stars are members of a binary or higher-order stellar system, where the presence of a binary companion can decisively alter their evolution via binary interactions. Interacting binaries are also important astrophysical laboratories for the study of compact objects. Binary population synthesis studies have been used extensively over the last two decades to interpret observations of compact-object binaries and to decipher the physical processes that lead to their formation. Here, we present POSYDON, a novel, binary population synthesis code that incorporates full stellar-structure and binary-evolution modeling, using the MESA code, throughout the whole evolution of the binaries. The use of POSYDON enables the self-consistent treatment of physical processes in stellar and binary evolution, including: realistic mass-transfer calculations and assessment of stability, internal angular-momentum transport and tides, stellar core sizes, mass-transfer rates and orbital periods. This paper describes the detailed methodology and implementation of POSYDON, including the assumed physics of stellar- and binary-evolution, the extensive grids of detailed single- and binary-star models, the post-processing, classification and interpolation methods we developed for use with the grids, and the treatment of evolutionary phases that are not based on pre-calculated grids. The first version of POSYDON targets binaries with massive primary stars (potential progenitors of neutron stars or black holes) at solar metallicity.

Submitted 7 August, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

Comments: 60 pages, 33 figures, 8 tables, referee's comments addressed. The code and the accompanying documentation and data products are available at https://posydon.org

arXiv:2202.03525 [pdf, other]

Nesterov Accelerated Shuffling Gradient Method for Convex Optimization

Authors: Trang H. Tran, Katya Scheinberg, Lam M. Nguyen

Abstract: In this paper, we propose Nesterov Accelerated Shuffling Gradient (NASG), a new algorithm for convex finite-sum minimization problems. Our method integrates the traditional Nesterov's acceleration momentum with different shuffling sampling schemes. We show that our algorithm has an improved rate of $\mathcal{O}(1/T)$ using unified shuffling schemes, where $T$ is the number of epochs. This rate is better than that of any other shuffling gradient method in the convex regime. Our convergence analysis does not require an assumption on bounded domain or a bounded gradient condition. For randomized shuffling schemes, we improve the convergence bound further. When employing some initial condition, we show that our method converges faster near the small neighborhood of the solution. Numerical simulations demonstrate the efficiency of our algorithm.

Submitted 12 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

arXiv:2202.03524 [pdf, ps, other]

Finite-Sum Optimization: A New Perspective for Convergence to a Global Solution

Authors: Lam M. Nguyen, Trang H. Tran, Marten van Dijk

Abstract: Deep neural networks (DNNs) have shown great success in many machine learning tasks. Their training is challenging since the loss surface of the network architecture is generally non-convex, or even non-smooth. How and under what assumptions is guaranteed convergence to a \textit{global} minimum possible? We propose a reformulation of the minimization problem allowing for a new recursive algorithmic framework. By using bounded style assumptions, we prove convergence to an $\varepsilon$-(global) minimum using $\mathcal{\tilde{O}}(1/\varepsilon^3)$ gradient computations. Our theoretical foundation motivates further study, implementation, and optimization of the new algorithmic framework and further investigation of its non-standard bounded style assumptions. This new direction broadens our understanding of why and under what circumstances training of a DNN converges to a global minimum.

Submitted 7 February, 2022; originally announced February 2022.

arXiv:2202.03394 [pdf, ps, other]

Local mass-conserving solution for a critical Coagulation-Fragmentation equation

Authors: Hung V. Tran, Truong-Son Van

Abstract: The critical coagulation-fragmentation equation with multiplicative coagulation and constant fragmentation kernels is known to not have global mass-conserving solutions when the initial mass is greater than $1$. We show that for any given positive initial mass with finite second moment, there is a time $T^*>0$ such that the equation possesses a unique mass-conserving solution up to $T^*$. The novel idea is to singularly perturb the constant fragmentation kernel by small additive terms and study the limiting behavior of the solutions of the perturbed system via the Bernstein transform.

Submitted 11 December, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

Comments: Minor clarifications

MSC Class: 35D40; 35F21; 44A10; 45J05; 49L20; 49L25

arXiv:2202.02442 [pdf, other]

doi 10.48550/arXiv.2202.02442

Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations

Authors: Nathan Beck, Abhiramon Rajasekharan, Hieu Tran

Abstract: Transfer learning approaches in reinforcement learning aim to assist agents in learning their target domains by leveraging the knowledge learned from other agents that have been trained on similar source domains. For example, recent research focus within this space has been placed on knowledge transfer between tasks that have different transition dynamics and reward functions; however, little focus has been placed on knowledge transfer between tasks that have different action spaces. In this paper, we approach the task of transfer learning between domains that differ in action spaces. We present a reward shaping method based on source embedding similarity that is applicable to domains with both discrete and continuous action spaces. The efficacy of our approach is evaluated on transfer to restricted action spaces in the Acrobot-v1 and Pendulum-v0 domains. A comparison with two baselines shows that our method does not outperform these baselines in these continuous action spaces but does show an improvement in these discrete action spaces. We conclude our analysis with future directions for this work.

Submitted 21 April, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

Comments: 5 pages, 2 figures, 1 table

arXiv:2201.11488 [pdf, ps, other]

doi 10.1029/2021JE007062

Thermal structure and aerosols in Mars' atmosphere from TIRVIM/ACS onboard the ExoMars Trace Gas Orbiter: validation of the retrieval algorithm

Authors: Sandrine Guerlet, N. Ignatiev, F. Forget, T. Fouchet, P. Vlasov, G. Bergeron, R. M. B. Young, E. Millour, S. Fan, H. Tran, A. Shakun, A. Grigoriev, A. Trokhimovskiy, F. Montmessin, O. Korablev

Abstract: The Atmospheric Chemistry Suite (ACS) onboard the ExoMars Trace Gas Orbiter (TGO) monitors the Martian atmosphere through different spectral intervals in the infrared light. We present a retrieval algorithm tailored to the analysis of spectra acquired in nadir geometry by TIRVIM, the thermal infrared channel of ACS. Our algorithm simultaneously retrieves vertical profile of atmospheric temperature up to 50 km, surface temperature, and integrated optical depth of dust and water ice clouds. The specificity of the TIRVIM dataset lies in its capacity to resolve the diurnal cycle over a 54 sol period. However, it is uncertain to what extent the desired atmospheric quantities can be accurately estimated at different times of day. Here we first present an Observing System Simulation Experiment (OSSE). We produce synthetic observations at various latitudes, seasons and local times and run our retrieval algorithm on these synthetic data, to evaluate its robustness. Different sources of biases are documented, in particular regarding aerosol retrievals. Atmospheric temperature retrievals are found robust even when dust and/or water ice cloud opacities are not well estimated in our OSSE. We then apply our algorithm to TIRVIM observations in April-May, 2018 and perform a cross-validation of retrieved atmospheric temperature and dust integrated opacity by comparisons with thousands of co-located Mars Climate Sounder (MCS) retrievals. Most differences between TIRVIM and MCS atmospheric temperatures can be attributed to differences in vertical sensitivity. Daytime dust opacities agree well with each other, while biases are found in nighttime dust opacity retrieved from TIRVIM at this season.

Submitted 27 January, 2022; originally announced January 2022.

Comments: 51 pages, 29 figures. Accepted in January 2022 for publication in Journal of Geophysical Research (Planets)

arXiv:2201.11079 [pdf, other]

Orienteering with one endomorphism

Authors: Sarah Arpin, Mingjie Chen, Kristin E. Lauter, Renate Scheidler, Katherine E. Stange, Ha T. N. Tran

Abstract: In supersingular isogeny-based cryptography, the path-finding problem reduces to the endomorphism ring problem. Can path-finding be reduced to knowing just one endomorphism? It is known that a small endomorphism enables polynomial-time path-finding and endomorphism ring computation (Love-Boneh [36]). An endomorphism gives an explicit orientation of a supersingular elliptic curve. In this paper, we use the volcano structure of the oriented supersingular isogeny graph to take ascending/descending/horizontal steps on the graph and deduce path-finding algorithms to an initial curve. Each altitude of the volcano corresponds to a unique quadratic order, called the primitive order. We introduce a new hard problem of computing the primitive order given an arbitrary endomorphism on the curve, and we also provide a sub-exponential quantum algorithm for solving it. In concurrent work (Wesolowski [54]), it was shown that the endomorphism ring problem in the presence of one endomorphism with known primitive order reduces to a vectorization problem, implying path-finding algorithms. Our path-finding algorithms are more general in the sense that we don't assume the knowledge of the primitive order associated with the endomorphism.

Submitted 19 October, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

Comments: 40 pages, 1 figure; 3rd revision implements small corrections and expositional improvements

MSC Class: Primary: 14G50; 94A60; 11G05; 14K04; 11-04; Secondary: 11R52

arXiv:2201.10285 [pdf, other]

Efficient Approximations of the Fisher Matrix in Neural Networks using Kronecker Product Singular Value Decomposition

Authors: Abdoulaye Koroko, Ani Anciaux-Sedrakian, Ibtihel Ben Gharbia, Valérie Garès, Mounir Haddou, Quang Huy Tran

Abstract: Several studies have shown the ability of natural gradient descent to minimize the objective function more efficiently than ordinary gradient descent based methods. However, the bottleneck of this approach for training deep neural networks lies in the prohibitive cost of solving a large dense linear system corresponding to the Fisher Information Matrix (FIM) at each iteration. This has motivated various approximations of either the exact FIM or the empirical one. The most sophisticated of these is KFAC, which involves a Kronecker-factored block diagonal approximation of the FIM. With only a slight additional cost, a few improvements of KFAC from the standpoint of accuracy are proposed. The common feature of the four novel methods is that they rely on a direct minimization problem, the solution of which can be computed via the Kronecker product singular value decomposition technique. Experimental results on the three standard deep auto-encoder benchmarks showed that they provide more accurate approximations to the FIM. Furthermore, they outperform KFAC and state-of-the-art first-order methods in terms of optimization speed.

Submitted 14 October, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

arXiv:2201.10166 [pdf, other]

doi 10.1109/ISBI48211.2021.9433826

Dense Pixel-Labeling for Reverse-Transfer and Diagnostic Learning on Lung Ultrasound for COVID-19 and Pneumonia Detection

Authors: Gautam Rajendrakumar Gare, Andrew Schoenling, Vipin Philip, Hai V Tran, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael Galeotti

Abstract: We propose using a pre-trained segmentation model to perform diagnostic classification in order to achieve better generalization and interpretability, terming the technique reverse-transfer learning. We present an architecture to convert segmentation models to classification models. We compare and contrast dense vs sparse segmentation labeling and study its impact on diagnostic classification. We compare the performance of U-Net trained with dense and sparse labels to segment A-lines, B-lines, and Pleural lines on a custom dataset of lung ultrasound scans from 4 patients. Our experiments show that dense labels help reduce false positive detection. We study the classification capability of the dense and sparse trained U-Net and contrast it with a non-pretrained U-Net, to detect and differentiate COVID-19 and Pneumonia on a large ultrasound dataset of about 40k curvilinear and linear probe images. Our segmentation-based models perform better classification when using pretrained segmentation weights, with the dense-label pretrained U-Net performing the best.

Submitted 25 January, 2022; originally announced January 2022.

Journal ref: 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI), 2021, pp. 1406-1410

arXiv:2201.08605 [pdf, other]

Seamless and Energy Efficient Maritime Coverage in Coordinated 6G Space-Air-Sea Non-Terrestrial Networks

Authors: Sheikh Salman Hassan, Do Hyeon Kim, Yan Kyaw Tun, Nguyen H. Tran, Walid Saad, Choong Seon Hong

Abstract: Non-terrestrial networks (NTNs), which integrate space and aerial networks with terrestrial systems, are a key area in the emerging sixth-generation (6G) wireless networks. As part of 6G, NTNs must provide pervasive connectivity to a wide range of devices, including smartphones, vehicles, sensors, robots, and maritime users. However, due to the high mobility and deployment of NTNs, managing the space-air-sea (SAS) NTN resources, i.e., energy, power, and channel allocation, is a major challenge. The design of a SAS-NTN for energy-efficient resource allocation is investigated in this study. The goal is to maximize system energy efficiency (EE) by collaboratively optimizing user equipment (UE) association, power control, and unmanned aerial vehicle (UAV) deployment. Given the limited payloads of UAVs, this work focuses on minimizing the total energy cost of UAVs (trajectory and transmission) while meeting EE requirements. A mixed-integer nonlinear programming problem is proposed, followed by the development of an algorithm to decompose, and solve each problem distributedly. The binary (UE association) and continuous (power, deployment) variables are separated using the Bender decomposition (BD), and then the Dinkelbach algorithm (DA) is used to convert fractional programming into an equivalent solvable form in the subproblem. A standard optimization solver is utilized to deal with the complexity of the master problem for binary variables. The alternating direction method of multipliers (ADMM) algorithm is used to solve the subproblem for the continuous variables. Our proposed algorithm provides a suboptimal solution, and simulation results demonstrate that the proposed algorithm achieves better EE than baselines.

Submitted 21 January, 2022; originally announced January 2022.

arXiv:2201.07368 [pdf, other]

doi 10.1007/978-3-030-90874-4_14

The Role of Pleura and Adipose in Lung Ultrasound AI

Authors: Gautam Rajendrakumar Gare, Wanwen Chen, Alex Ling Yu Hung, Edward Chen, Hai V. Tran, Tom Fox, Pete Lowery, Kevin Zamora, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael Galeotti

Abstract: In this paper, we study the significance of the pleura and adipose tissue in lung ultrasound AI analysis. We highlight their more prominent appearance when using high-frequency linear (HFL) instead of curvilinear ultrasound probes, showing HFL reveals better pleura detail. We compare the diagnostic utility of the pleura and adipose tissue using an HFL ultrasound probe. Masking the adipose tissue during training and inference (while retaining the pleural line and Merlin's space artifacts such as A-lines and B-lines) improved the AI model's diagnostic accuracy.

Submitted 18 January, 2022; originally announced January 2022.

Comments: Published in MICCAI 2021 workshop on Lessons Learned from the development and application of medical imaging-based AI technologies for combating COVID-19 (LL-COVID19). The first two authors contributed equally to this work

Journal ref: LL-COVID19 2021. Lecture Notes in Computer Science, vol 12969. Springer, Cham

arXiv:2201.07357 [pdf, other]

Weakly Supervised Contrastive Learning for Better Severity Scoring of Lung Ultrasound

Authors: Gautam Rajendrakumar Gare, Hai V. Tran, Bennett P deBoisblanc, Ricardo Luis Rodriguez, John Michael Galeotti

Abstract: With the onset of the COVID-19 pandemic, ultrasound has emerged as an effective tool for bedside monitoring of patients. Due to this, a large amount of lung ultrasound scans have been made available which can be used for AI based diagnosis and analysis. Several AI-based patient severity scoring models have been proposed that rely on scoring the appearance of the ultrasound scans. AI models are trained using ultrasound-appearance severity scores that are manually labeled based on standardized visual features. We address the challenge of labeling every ultrasound frame in the video clips. Our contrastive learning method treats the video clip severity labels as noisy weak severity labels for individual frames, thus requiring only video-level labels. We show that it performs better than the conventional cross-entropy loss based training. We combine frame severity predictions to come up with video severity predictions and show that the frame based model achieves comparable performance to a video based TSM model, on a large dataset combining public and private sources.

Submitted 18 January, 2022; originally announced January 2022.

Comments: Under Review for MIDL 2022 conference

arXiv:2201.06626 [pdf, other]

doi 10.1007/978-3-031-06773-0_15

Neural Network Compression of ACAS Xu Early Prototype is Unsafe: Closed-Loop Verification through Quantized State Backreachability

Authors: Stanley Bak, Hoang-Dung Tran

Abstract: ACAS Xu is an air-to-air collision avoidance system designed for unmanned aircraft that issues horizontal turn advisories to avoid an intruder aircraft. Due to the use of a large lookup table in the design, a neural network compression of the policy was proposed. Analysis of this system has spurred a significant body of research in the formal methods community on neural network verification. While many powerful methods have been developed, most work focuses on open-loop properties of the networks, rather than the main point of the system -- collision avoidance -- which requires closed-loop analysis. In this work, we develop a technique to verify a closed-loop approximation of the system using state quantization and backreachability. We use favorable assumptions for the analysis -- perfect sensor information, instant following of advisories, ideal aircraft maneuvers and an intruder that only flies straight. When the method fails to prove the system is safe, we refine the quantization parameters until generating counterexamples where the original (non-quantized) system also has collisions.

Submitted 27 March, 2022; v1 submitted 17 January, 2022; originally announced January 2022.

arXiv:2201.01974 [pdf, ps, other]

Characterizations of diffusion matrices in homogenization of elliptic equations in nondivergence-form

Authors: Xiaoqin Guo, Timo Sprekeler, Hung V. Tran

Abstract: We characterize diffusion matrices that yield a $L^{\infty}$ convergence rate of $\mathcal{O}(\varepsilon^2)$ in the theory of periodic homogenization of linear elliptic equations in nondivergence-form. Such type-$\varepsilon^2$ diffusion matrices are of particular interest as the optimal rate of convergence in the generic case is only $\mathcal{O}(\varepsilon)$. First, we provide a new class of type-$\varepsilon^2$ diffusion matrices, confirming a conjecture posed in [15]. Then, we give a complete characterization of diagonal diffusion matrices in two dimensions and a systematic study in higher dimensions.

Submitted 14 January, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

Comments: 31 pages; added Section 3.3

MSC Class: 35B27; 35B40; 35J25

arXiv:2201.00132 [pdf, other]

doi 10.1109/ICMLA51294.2020.00223

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

Authors: Bao Hieu Tran, Thanh Le-Cong, Huu Manh Nguyen, Duc Anh Le, Thanh Hung Nguyen, Phi Le Nguyen

Abstract: In the last decades, scene text recognition has gained worldwide attention from both the academic community and actual users due to its importance in a wide range of applications. Despite achievements in optical character recognition, scene text recognition remains challenging due to inherent problems such as distortions or irregular layout. Most of the existing approaches mainly leverage recurrence or convolution-based neural networks. However, while recurrent neural networks (RNNs) usually suffer from slow training speed due to sequential computation and encounter problems such as vanishing gradient or bottleneck, CNN endures a trade-off between complexity and performance. In this paper, we introduce SAFL, a self-attention-based neural network model with the focal loss for scene text recognition, to overcome the limitation of the existing approaches. The use of focal loss instead of negative log-likelihood helps the model focus more on low-frequency samples training. Moreover, to deal with the distortions and irregular texts, we exploit Spatial Transformer Network (STN) to rectify text before passing to the recognition network. We perform experiments to compare the performance of the proposed model with seven benchmarks. The numerical results show that our model achieves the best performance.

Submitted 1 January, 2022; originally announced January 2022.

Comments: Accepted to ICMLA 2020

Journal ref: 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)

arXiv:2112.11052 [pdf, other]

doi 10.1109/NICS54270.2021.9701541

Predicting Job Titles from Job Descriptions with Multi-label Text Classification

Authors: Hieu Trung Tran, Hanh Hong Phuc Vo, Son T. Luu

Abstract: Finding a suitable job and hunting for eligible candidates are important to job seeking and human resource agencies. With the vast information about job descriptions, employees and employers need assistance to automatically detect job titles based on job description texts. In this paper, we propose the multi-label classification approach for predicting relevant job titles from job description texts, and implement the Bi-GRU-LSTM-CNN with different pre-trained language models to apply for the job titles prediction problem. The BERT with multilingual pre-trained model obtains the highest result by F1-scores on both development and test sets, which are 62.20% on the development set, and 47.44% on the test set.

Submitted 9 February, 2022; v1 submitted 21 December, 2021; originally announced December 2021.

Comments: Published in the 2021 NAFOSTED Conference on Information and Computer Science (NICS 2021)

arXiv:2112.10957 [pdf, other]

RSSI prediction using Machine Learning models

Authors: Tung Giang Le, Huy Tung Quach, Thu Thao Dao Le, Manh Hoang Tran

Abstract: In this study, we present a method to predict the Received signal strength indication (RSSI) in an area of the base station. Traditional attenuated wave propagation models are often time consuming as well as computationally complex, depending on the unique factors of the medium. This study focuses on providing a solution to predict signal quality using coordinate values of many points in the considering area. We apply machine learning models such as linear regression, Support Vector Machine (SVM) or Decision tree model, to directly predict the RSSI of many points in the range of a base station without computing the complex parameters of the attenuated propagation model. The effectiveness of RSSI prediction was evaluated by mean square error (MSE) and mean absolute error (MAE). The stage of training and testing machine learning models in the research uses data that are the actual measurement results during the research process.

Submitted 20 December, 2021; originally announced December 2021.

Comments: 6 pages, in Vietnamese

arXiv:2112.10747 [pdf, other]

Effective fronts of polygon shapes in two dimensions

Authors: Wenjia Jing, Hung V. Tran, Yifeng Yu

Abstract: We study the effective fronts of first order front propagations in two dimensions ($n=2$) in the periodic setting. Using PDE-based approaches, we show that for every $α\in (0,1)$, the class of centrally symmetric polygons with rational vertices and nonempty interior is admissible as effective fronts for given front speeds in $C^{1,α}(\mathbb T^2,(0,\infty))$. This result can also be formulated in the language of stable norms corresponding to periodic metrics in $\mathbb T^2$. Similar results were known long time ago when $n\geq 3$ for front speeds in $C^{\infty}(\mathbb T^n,(0,\infty))$. Due to topological restrictions, the two dimensional case is much more subtle. In fact, the effective front is $C^1$, which cannot be a polygon, for given $C^{1,1}(\mathbb T^2,(0,\infty))$ front speeds. Our regularity requirements on front speeds are hence optimal. To the best of our knowledge, this is the first time that polygonal effective fronts have been constructed in two dimensions.

Submitted 20 December, 2021; originally announced December 2021.

Comments: 14 pages; 5 figures

arXiv:2112.08154 [pdf, other]

doi 10.1103/PhysRevB.105.134402

Effect of magnetocrystalline anisotropy on magnetocaloric properties of AlFe$_{2}$B$_{2}$ compound

Authors: Hung Ba Tran, Hiroyoshi Momida, Yu-ichiro Matsushita, Kazunori Sato, Yukihiro Makino, Koun Shirai, Tamio Oguchi

Abstract: It is well known that the temperature dependence of the effective magnetocrystalline anisotropy energy obeys the $l(l+1)/2$ power law of magnetization in the Callen-Callen theory. Therefore, according to the Callen-Callen theory, the magnetocrystalline anisotropy energy is assumed to be zero at the critical temperature where the magnetization is approximately zero. This study estimates the temperature dependence of the magnetocrystalline anisotropy energy by integrating the magnetization versus magnetic field ($M$--$H$) curves, and found that the magnetocrystalline anisotropy is still finite even above the Curie temperature in the uniaxial anisotropy, whereas this does not appear in the cubic anisotropy case. The origin is the fast reduction of the anisotropy field, which is the magnetic field required to saturate the magnetization along the hard axis, in the case of cubic anisotropy. Therefore, the magnetization anisotropy and anisotropic magnetic susceptibility, those are the key factors of magnetic anisotropy, could not be established in the case of cubic anisotropy. In addition, the effect of magnetocrystalline anisotropy on magnetocaloric properties, as the difference between the entropy change curves of AlFe$_{2}$B$_{2}$ appears above the Curie temperature, which is in good agreement with a previous experimental study. This is proof of magnetic anisotropy at slightly above Curie temperature.

Submitted 15 December, 2021; originally announced December 2021.

Journal ref: Physical Review B 105, 134402 (2022)

arXiv:2112.06896 [pdf, other]

Optimal convergence rate for periodic homogenization of convex Hamilton-Jacobi equations

Authors: Hung V. Tran, Yifeng Yu

Abstract: In this paper, we show that the rate of convergence in periodic homogenization of convex Hamilton-Jacobi equations is always $O(\varepsilon)$, which is optimal. This is a natural extension of a result concerning stable norms in metric geometry [4] that is essentially equivalent to the homogenization of convex static Hamilton-Jacobi equations. Another extremely interesting question in this direction is whether the $O(\varepsilon)$ rate holds in the nonconvex setting. We present a special nonconvex example with $O(\varepsilon)$ convergence rate, which relies on identifying the shape of the effective Hamiltonian and game theory interpretation formulas.

Submitted 30 June, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

Comments: second version with 14 pages; updated expositions and references; a nonconvex result (Theorem 1.2) was added

arXiv:2112.01212 [pdf, ps, other]

Curvature of the Second kind and a conjecture of Nishikawa

Authors: Matthew Gursky, Xiaodong Cao, Hung Tran

Abstract: In this paper, we investigate manifolds for which the curvature of the second kind (following the terminology of Nishikawa) satisfies certain positivity conditions. Our main result settles Nishikawa's conjecture that manifolds for which the curvature (operator) of the second kind are positive are diffeomorphic to a sphere, by showing that such manifolds satisfy Brendle's PIC1 condition. In dimension four we show that curvature of the second kind has a canonical normal form, and use this to classify Einstein four-manifolds for which the curvature (operator) of the second kind is five-non-negative. We also calculate the normal form for some explicit examples in order to show that this assumption is sharp.

Submitted 2 December, 2021; originally announced December 2021.

Comments: 18 pages

MSC Class: 53C21; 53C24; 53C25

Showing 201–250 of 788 results for author: Tran, H