-
Explanatory causal effects for model agnostic explanations
Authors:
Jiuyong Li,
Ha Xuan Tran,
Thuc Duy Le,
Lin Liu,
Kui Yu,
Jixue Liu
Abstract:
This paper studies the problem of estimating the contributions of features to the prediction of a specific instance by a machine learning model and the overall contribution of a feature to the model. The causal effect of a feature (variable) on the predicted outcome reflects the contribution of the feature to a prediction very well. A challenge is that most existing causal effects cannot be estima…
▽ More
This paper studies the problem of estimating the contributions of features to the prediction of a specific instance by a machine learning model and the overall contribution of a feature to the model. The causal effect of a feature (variable) on the predicted outcome reflects the contribution of the feature to a prediction very well. A challenge is that most existing causal effects cannot be estimated from data without a known causal graph. In this paper, we define an explanatory causal effect based on a hypothetical ideal experiment. The definition brings several benefits to model agnostic explanations. First, explanations are transparent and have causal meanings. Second, the explanatory causal effect estimation can be data driven. Third, the causal effects provide both a local explanation for a specific prediction and a global explanation showing the overall importance of a feature in a predictive model. We further propose a method using individual and combined variables based on explanatory causal effects for explanations. We show the definition and the method work with experiments on some real-world data sets.
△ Less
Submitted 23 June, 2022;
originally announced June 2022.
-
Finding Optimal Policy for Queueing Models: New Parameterization
Authors:
Trang H. Tran,
Lam M. Nguyen,
Katya Scheinberg
Abstract:
Queueing systems appear in many important real-life applications including communication networks, transportation and manufacturing systems. Reinforcement learning (RL) framework is a suitable model for the queueing control problem where the underlying dynamics are usually unknown and the agent receives little information from the environment to navigate. In this work, we investigate the optimizat…
▽ More
Queueing systems appear in many important real-life applications including communication networks, transportation and manufacturing systems. Reinforcement learning (RL) framework is a suitable model for the queueing control problem where the underlying dynamics are usually unknown and the agent receives little information from the environment to navigate. In this work, we investigate the optimization aspects of the queueing model as a RL environment and provide insight to learn the optimal policy efficiently. We propose a new parameterization of the policy by using the intrinsic properties of queueing network systems. Experiments show good performance of our methods with various load conditions from light to heavy traffic.
△ Less
Submitted 20 June, 2022;
originally announced June 2022.
-
Aligning individual brains with Fused Unbalanced Gromov-Wasserstein
Authors:
Alexis Thual,
Huy Tran,
Tatiana Zemskova,
Nicolas Courty,
Rémi Flamary,
Stanislas Dehaene,
Bertrand Thirion
Abstract:
Individual brains vary in both anatomy and functional organization, even within a given species. Inter-individual variability is a major impediment when trying to draw generalizable conclusions from neuroimaging data collected on groups of subjects. Current co-registration procedures rely on limited data, and thus lead to very coarse inter-subject alignments. In this work, we present a novel metho…
▽ More
Individual brains vary in both anatomy and functional organization, even within a given species. Inter-individual variability is a major impediment when trying to draw generalizable conclusions from neuroimaging data collected on groups of subjects. Current co-registration procedures rely on limited data, and thus lead to very coarse inter-subject alignments. In this work, we present a novel method for inter-subject alignment based on Optimal Transport, denoted as Fused Unbalanced Gromov Wasserstein (FUGW). The method aligns cortical surfaces based on the similarity of their functional signatures in response to a variety of stimulation settings, while penalizing large deformations of individual topographic organization. We demonstrate that FUGW is well-suited for whole-brain landmark-free alignment. The unbalanced feature allows to deal with the fact that functional areas vary in size across subjects. Our results show that FUGW alignment significantly increases between-subject correlation of activity for independent functional data, and leads to more precise map** at the group level.
△ Less
Submitted 22 August, 2023; v1 submitted 19 June, 2022;
originally announced June 2022.
-
TLETA: Deep Transfer Learning and Integrated Cellular Knowledge for Estimated Time of Arrival Prediction
Authors:
Hieu Tran,
Son Nguyen,
I-Ling Yen,
Farokh Bastani
Abstract:
Vehicle arrival time prediction has been studied widely. With the emergence of IoT devices and deep learning techniques, estimated time of arrival (ETA) has become a critical component in intelligent transportation systems. Though many tools exist for ETA, ETA for special vehicles, such as ambulances, fire engines, etc., is still challenging due to the limited amount of traffic data for special ve…
▽ More
Vehicle arrival time prediction has been studied widely. With the emergence of IoT devices and deep learning techniques, estimated time of arrival (ETA) has become a critical component in intelligent transportation systems. Though many tools exist for ETA, ETA for special vehicles, such as ambulances, fire engines, etc., is still challenging due to the limited amount of traffic data for special vehicles. Existing works use one model for all types of vehicles, which can lead to low accuracy. To tackle this, as the first in the field, we propose a deep transfer learning framework TLETA for the driving time prediction. TLETA constructs cellular spatial-temporal knowledge grids for extracting driving patterns, combined with the road network structure embedding to build a deep neural network for ETA. TLETA contains transferable layers to support knowledge transfer between different categories of vehicles. Importantly, our transfer models only train the last layers to map the transferred knowledge, that reduces the training time significantly. The experimental studies show that our model predicts travel time with high accuracy and outperforms many state-of-the-art approaches.
△ Less
Submitted 16 June, 2022;
originally announced June 2022.
-
Learning Generic Lung Ultrasound Biomarkers for Decoupling Feature Extraction from Downstream Tasks
Authors:
Gautam Rajendrakumar Gare,
Tom Fox,
Pete Lowery,
Kevin Zamora,
Hai V. Tran,
Laura Hutchins,
David Montgomery,
Amita Krishnan,
Deva Kannan Ramanan,
Ricardo Luis Rodriguez,
Bennett P deBoisblanc,
John Michael Galeotti
Abstract:
Contemporary artificial neural networks (ANN) are trained end-to-end, jointly learning both features and classifiers for the task of interest. Though enormously effective, this paradigm imposes significant costs in assembling annotated task-specific datasets and training large-scale networks. We propose to decouple feature learning from downstream lung ultrasound tasks by introducing an auxiliary…
▽ More
Contemporary artificial neural networks (ANN) are trained end-to-end, jointly learning both features and classifiers for the task of interest. Though enormously effective, this paradigm imposes significant costs in assembling annotated task-specific datasets and training large-scale networks. We propose to decouple feature learning from downstream lung ultrasound tasks by introducing an auxiliary pre-task of visual biomarker classification. We demonstrate that one can learn an informative, concise, and interpretable feature space from ultrasound videos by training models for predicting biomarker labels. Notably, biomarker feature extractors can be trained from data annotated with weak video-scale supervision. These features can be used by a variety of downstream Expert models targeted for diverse clinical tasks (Diagnosis, lung severity, S/F ratio). Crucially, task-specific expert models are comparable in accuracy to end-to-end models directly trained for such target tasks, while being significantly lower cost to train.
△ Less
Submitted 16 June, 2022;
originally announced June 2022.
-
On the Convergence to a Global Solution of Shuffling-Type Gradient Algorithms
Authors:
Lam M. Nguyen,
Trang H. Tran
Abstract:
Stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency in dealing with large-scale problems. In this paper, we focus on the shuffling version of SGD which matches the mainstream practical heuristics. We show the convergence to a global solution of shuffling SGD for a class of non-convex functions under over-parame…
▽ More
Stochastic gradient descent (SGD) algorithm is the method of choice in many machine learning tasks thanks to its scalability and efficiency in dealing with large-scale problems. In this paper, we focus on the shuffling version of SGD which matches the mainstream practical heuristics. We show the convergence to a global solution of shuffling SGD for a class of non-convex functions under over-parameterized settings. Our analysis employs more relaxed non-convex assumptions than previous literature. Nevertheless, we maintain the desired computational complexity as shuffling SGD has achieved in the general convex setting.
△ Less
Submitted 25 October, 2023; v1 submitted 12 June, 2022;
originally announced June 2022.
-
On the Generalization of Wasserstein Robust Federated Learning
Authors:
Tung-Anh Nguyen,
Tuan Dung Nguyen,
Long Tan Le,
Canh T. Dinh,
Nguyen H. Tran
Abstract:
In federated learning, participating clients typically possess non-i.i.d. data, posing a significant challenge to generalization to unseen distributions. To address this, we propose a Wasserstein distributionally robust optimization scheme called WAFL. Leveraging its duality, we frame WAFL as an empirical surrogate risk minimization problem, and solve it using a local SGD-based algorithm with conv…
▽ More
In federated learning, participating clients typically possess non-i.i.d. data, posing a significant challenge to generalization to unseen distributions. To address this, we propose a Wasserstein distributionally robust optimization scheme called WAFL. Leveraging its duality, we frame WAFL as an empirical surrogate risk minimization problem, and solve it using a local SGD-based algorithm with convergence guarantees. We show that the robustness of WAFL is more general than related approaches, and the generalization bound is robust to all adversarial distributions inside the Wasserstein ball (ambiguity set). Since the center location and radius of the Wasserstein ball can be suitably modified, WAFL shows its applicability not only in robustness but also in domain adaptation. Through empirical evaluation, we demonstrate that WAFL generalizes better than the vanilla FedAvg in non-i.i.d. settings, and is more robust than other related methods in distribution shift settings. Further, using benchmark datasets we show that WAFL is capable of generalizing to unseen target domains.
△ Less
Submitted 3 June, 2022;
originally announced June 2022.
-
Synthesizing Configuration Tactics for Exercising Hidden Options in Serverless Systems
Authors:
Jörn Kuhlenkamp,
Sebastian Werner,
Chin Hong Tran,
Stefan Tai
Abstract:
A proper configuration of an information system can ensure accuracy and efficiency, among other system objectives. Conversely, a poor configuration can have a significant negative impact on the system's performance, reliability, and cost. Serverless systems, which are comprised of many functions and managed services, especially risk exposure to misconfigurations, with many provider- and platform-s…
▽ More
A proper configuration of an information system can ensure accuracy and efficiency, among other system objectives. Conversely, a poor configuration can have a significant negative impact on the system's performance, reliability, and cost. Serverless systems, which are comprised of many functions and managed services, especially risk exposure to misconfigurations, with many provider- and platform-specific, often intransparent and 'hidden' settings. In this paper, we argue to pay close attention to the configuration of serverless systems to exercise options with known accuracy, cost and time. Based on a literature study and long-term serverless systems development experience, we present nine tactics to unlock potentially neglected and unknown options in serverless systems.
△ Less
Submitted 3 June, 2022; v1 submitted 31 May, 2022;
originally announced May 2022.
-
Unbalanced CO-Optimal Transport
Authors:
Quang Huy Tran,
Hicham Janati,
Nicolas Courty,
Rémi Flamary,
Ievgen Redko,
Pinar Demetci,
Ritambhara Singh
Abstract:
Optimal transport (OT) compares probability distributions by computing a meaningful alignment between their samples. CO-optimal transport (COOT) takes this comparison further by inferring an alignment between features as well. While this approach leads to better alignments and generalizes both OT and Gromov-Wasserstein distances, we provide a theoretical result showing that it is sensitive to outl…
▽ More
Optimal transport (OT) compares probability distributions by computing a meaningful alignment between their samples. CO-optimal transport (COOT) takes this comparison further by inferring an alignment between features as well. While this approach leads to better alignments and generalizes both OT and Gromov-Wasserstein distances, we provide a theoretical result showing that it is sensitive to outliers that are omnipresent in real-world data. This prompts us to propose unbalanced COOT for which we provably show its robustness to noise in the compared datasets. To the best of our knowledge, this is the first such result for OT methods in incomparable spaces. With this result in hand, we provide empirical evidence of this robustness for the challenging tasks of heterogeneous domain adaptation with and without varying proportions of classes and simultaneous alignment of samples and features across single-cell measurements.
△ Less
Submitted 20 February, 2023; v1 submitted 30 May, 2022;
originally announced May 2022.
-
Fine-Grained Visual Classification using Self Assessment Classifier
Authors:
Tuong Do,
Huy Tran,
Erman Tjiputra,
Quang D. Tran,
Anh Nguyen
Abstract:
Extracting discriminative features plays a crucial role in the fine-grained visual classification task. Most of the existing methods focus on develo** attention or augmentation mechanisms to achieve this goal. However, addressing the ambiguity in the top-k prediction classes is not fully investigated. In this paper, we introduce a Self Assessment Classifier, which simultaneously leverages the re…
▽ More
Extracting discriminative features plays a crucial role in the fine-grained visual classification task. Most of the existing methods focus on develo** attention or augmentation mechanisms to achieve this goal. However, addressing the ambiguity in the top-k prediction classes is not fully investigated. In this paper, we introduce a Self Assessment Classifier, which simultaneously leverages the representation of the image and top-k prediction classes to reassess the classification results. Our method is inspired by continual learning with coarse-grained and fine-grained classifiers to increase the discrimination of features in the backbone and produce attention maps of informative areas on the image. In practice, our method works as an auxiliary branch and can be easily integrated into different architectures. We show that by effectively addressing the ambiguity in the top-k prediction classes, our method achieves new state-of-the-art results on CUB200-2011, Stanford Dog, and FGVC Aircraft datasets. Furthermore, our method also consistently improves the accuracy of different existing fine-grained classifiers with a unified setup.
△ Less
Submitted 21 May, 2022;
originally announced May 2022.
-
Single Crystalline 2D Material Nanoribbon Networks for Nanoelectronics
Authors:
Muhammad Awais Aslam,
Tuan Hoang Tran,
Antonio Supina,
Olivier Siri,
Vincent Meunier,
Kenji Watanabe,
Takashi Taniguchi,
Marko Kralj,
Christian Teichert,
Evgeniya Sheremet,
Raul D. Rodriguez,
Aleksandar Matković
Abstract:
The last decade has seen a flurry of studies related to graphene nanoribbons owing to their potential applications in the quantum realm. However, little experimental work has been reported towards nanoribbons of other 2D materials due to the absence of synthesis routes. Here, we propose a universal approach to synthesize high-quality networks of nanoribbons from arbitrary 2D materials while mainta…
▽ More
The last decade has seen a flurry of studies related to graphene nanoribbons owing to their potential applications in the quantum realm. However, little experimental work has been reported towards nanoribbons of other 2D materials due to the absence of synthesis routes. Here, we propose a universal approach to synthesize high-quality networks of nanoribbons from arbitrary 2D materials while maintaining high crystallinity, sufficient yield, narrow size distribution, and straight-forward device integrability. The wide applicability of this technique is demonstrated by fabricating MoS2, WS2, WSe2, and graphene nanoribbon field effect transistors that inherently do not suffer from interconnection resistances. By relying on self-assembled and self-aligned organic nanostructures as masks, we demonstrate the possibility of controlling the predominant crystallographic direction of the nanoribbon's edges. Electrical characterization shows record mobilities and very high ON currents for various TMDCs despite extreme width scaling. Lastly, we explore decoration of nanoribbon edges with plasmonic particles paving the way towards the development of nanoribbon-based plasmonic sensing and opto-electronic devices.
△ Less
Submitted 16 May, 2022;
originally announced May 2022.
-
Anomaly detection using prediction error with Spatio-Temporal Convolutional LSTM
Authors:
Hanh Thi Minh Tran,
David Hogg
Abstract:
In this paper, we propose a novel method for video anomaly detection motivated by an existing architecture for sequence-to-sequence prediction and reconstruction using a spatio-temporal convolutional Long Short-Term Memory (convLSTM). As in previous work on anomaly detection, anomalies arise as spatially localised failures in reconstruction or prediction. In experiments with five benchmark dataset…
▽ More
In this paper, we propose a novel method for video anomaly detection motivated by an existing architecture for sequence-to-sequence prediction and reconstruction using a spatio-temporal convolutional Long Short-Term Memory (convLSTM). As in previous work on anomaly detection, anomalies arise as spatially localised failures in reconstruction or prediction. In experiments with five benchmark datasets, we show that using prediction gives superior performance to using reconstruction. We also compare performance with different length input/output sequences. Overall, our results using prediction are comparable with the state of the art on the benchmark datasets.
△ Less
Submitted 18 May, 2022;
originally announced May 2022.
-
Distances to Local Group Galaxies via Population II, Stellar Distance Indicators I: The Sculptor Dwarf Spheroidal
Authors:
Quang H. Tran,
Taylor J. Hoyt,
Wendy L. Freedman,
Barry F. Madore,
Elias K. Oakes,
William Cerny,
Dylan Hatt,
Rachael L. Beaton
Abstract:
We determine the distance to the Sculptor Dwarf Spheroidal via three Population II stellar distance indicators: (a) the Tip of the Red Giant Branch (TRGB), (b) RR Lyrae variables (RRLs), and (c) the ridgeline of the blue horizontal branch (HB). High signal-to-noise, wide-field $VI$ imaging that covers an area $48' \times 48'$ and reaches a photometric depth approximately 2 mag fainter than the HB…
▽ More
We determine the distance to the Sculptor Dwarf Spheroidal via three Population II stellar distance indicators: (a) the Tip of the Red Giant Branch (TRGB), (b) RR Lyrae variables (RRLs), and (c) the ridgeline of the blue horizontal branch (HB). High signal-to-noise, wide-field $VI$ imaging that covers an area $48' \times 48'$ and reaches a photometric depth approximately 2 mag fainter than the HB was acquired with the Magellan-Baade 6.5m telescope. The true modulus derived from Sculptor's TRGB is found to be $μ^\mathrm{TRGB}_o = 19.59 \pm 0.07_\mathrm{stat} \pm 0.05_\mathrm{sys}$ mag. Along with periods adopted from the literature, newly acquired RRL phase points are fit with template light curves to determine $μ_{W_{I,V-I}}^\mathrm{RRL} = 19.60 \pm 0.01_\mathrm{stat} \pm 0.05_\mathrm{sys}$ mag. Finally, the HB distance is found to be $μ^\mathrm{HB}_o = 19.54 \pm 0.03_\mathrm{stat} \pm 0.09_\mathrm{sys}$ mag. Absolute calibrations of each method are anchored by independent geometric zero-points, utilizes a different class of stars, and are determined from the same photometric calibration.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation
Authors:
Long Phan,
Hieu Tran,
Hieu Nguyen,
Trieu H. Trinh
Abstract:
We present ViT5, a pretrained Transformer-based encoder-decoder model for the Vietnamese language. With T5-style self-supervised pretraining, ViT5 is trained on a large corpus of high-quality and diverse Vietnamese texts. We benchmark ViT5 on two downstream text generation tasks, Abstractive Text Summarization and Named Entity Recognition. Although Abstractive Text Summarization has been widely st…
▽ More
We present ViT5, a pretrained Transformer-based encoder-decoder model for the Vietnamese language. With T5-style self-supervised pretraining, ViT5 is trained on a large corpus of high-quality and diverse Vietnamese texts. We benchmark ViT5 on two downstream text generation tasks, Abstractive Text Summarization and Named Entity Recognition. Although Abstractive Text Summarization has been widely studied for the English language thanks to its rich and large source of data, there has been minimal research into the same task in Vietnamese, a much lower resource language. In this work, we perform exhaustive experiments on both Vietnamese Abstractive Summarization and Named Entity Recognition, validating the performance of ViT5 against many other pretrained Transformer-based encoder-decoder models. Our experiments show that ViT5 significantly outperforms existing models and achieves state-of-the-art results on Vietnamese Text Summarization. On the task of Named Entity Recognition, ViT5 is competitive against previous best results from pretrained encoder-based Transformer models. Further analysis shows the importance of context length during the self-supervised pretraining on downstream performance across different settings.
△ Less
Submitted 26 May, 2022; v1 submitted 13 May, 2022;
originally announced May 2022.
-
The AGEL Survey: Spectroscopic Confirmation of Strong Gravitational Lenses in the DES and DECaLS Fields Selected Using Convolutional Neural Networks
Authors:
Kim-Vy H. Tran,
Anishya Harshan,
Karl Glazebrook,
G. C. Keerthi Vasan,
Tucker Jones,
Colin Jacobs,
Glenn G. Kacprzak,
Tania M. Barone,
Thomas E. Collett,
Anshu Gupta,
Astrid Henderson,
Lisa J. Kewley,
Sebastian Lopez,
Themiya Nanayakkara,
Ryan L. Sanders,
Sarah M. Sweet
Abstract:
We present spectroscopic confirmation of candidate strong gravitational lenses using the Keck Observatory and Very Large Telescope as part of our ASTRO 3D Galaxy Evolution with Lenses (AGEL) survey. We confirm that 1) search methods using Convolutional Neural Networks (CNN) with visual inspection successfully identify strong gravitational lenses and 2) the lenses are at higher redshifts relative t…
▽ More
We present spectroscopic confirmation of candidate strong gravitational lenses using the Keck Observatory and Very Large Telescope as part of our ASTRO 3D Galaxy Evolution with Lenses (AGEL) survey. We confirm that 1) search methods using Convolutional Neural Networks (CNN) with visual inspection successfully identify strong gravitational lenses and 2) the lenses are at higher redshifts relative to existing surveys due to the combination of deeper and higher resolution imaging from DECam and spectroscopy spanning optical to near-infrared wavelengths. We measure 104 redshifts in 77 systems selected from a catalog in the DES and DECaLS imaging fields (r<22 mag). Combining our results with published redshifts, we present redshifts for 68 lenses and establish that CNN-based searches are highly effective for use in future imaging surveys with a success rate of 88% (defined as 68/77). We report 53 strong lenses with spectroscopic redshifts for both the deflector and source (z_src>z_defl), and 15 lenses with a spectroscopic redshift for either the deflector (z_defl>0.21) or source (z_src>1.34). For the 68 lenses, the deflectors and sources have average redshifts and standard deviations of 0.58+/-0.14 and 1.92+/-0.59 respectively, and corresponding redshift ranges of (0.21<z_defl<0.89) and (0.88<z_src<3.55). The AGEL systems include 41 deflectors at zdefl>0.5 that are ideal for follow-up studies to track how mass density profiles evolve with redshift. Our goal with AGEL is to spectroscopically confirm ~100 strong gravitational lenses that can be observed from both hemispheres throughout the year. The AGEL survey is a resource for refining automated all-sky searches and addressing a range of questions in astrophysics and cosmology.
△ Less
Submitted 26 September, 2022; v1 submitted 11 May, 2022;
originally announced May 2022.
-
Orientations and cycles in supersingular isogeny graphs
Authors:
Sarah Arpin,
Mingjie Chen,
Kristin E. Lauter,
Renate Scheidler,
Katherine E. Stange,
Ha T. N. Tran
Abstract:
The paper concerns several theoretical aspects of oriented supersingular $\ell$-isogeny volcanoes and their relationship to closed walks in the supersingular $\ell$-isogeny graph. Our main result is a bijection between the rims of the union of all oriented supersingular $\ell$-isogeny volcanoes over $\overline{\mathbb{F}}_p$ (up to conjugation of the orientations), and isogeny cycles (non-backtrac…
▽ More
The paper concerns several theoretical aspects of oriented supersingular $\ell$-isogeny volcanoes and their relationship to closed walks in the supersingular $\ell$-isogeny graph. Our main result is a bijection between the rims of the union of all oriented supersingular $\ell$-isogeny volcanoes over $\overline{\mathbb{F}}_p$ (up to conjugation of the orientations), and isogeny cycles (non-backtracking closed walks which are not powers of smaller walks) of the supersingular $\ell$-isogeny graph over $\overline{\mathbb{F}}_p$. The exact proof and statement of this bijection are made more intricate by special behaviours arising from extra automorphisms and the ramification of $p$ in certain quadratic orders. We use the bijection to count isogeny cycles of given length in the supersingular $\ell$-isogeny graph exactly as a sum of class numbers of these orders, and also give an explicit upper bound by estimating the class numbers.
△ Less
Submitted 4 December, 2022; v1 submitted 8 May, 2022;
originally announced May 2022.
-
Measuring vesicle loading with holographic microscopy and bulk light scattering
Authors:
Lan Hai Anh Tran,
Lauren A. Lowe,
Matthew Turner,
James Luong,
Omar Abdullah A. Khamis,
Yaam Deckel,
Megan L. Amos,
Anna Wang
Abstract:
We report efforts to quantify the loading of cell-sized lipid vesicles using in-line digital holographic microscopy. This method does not require fluorescent reporters, fluorescent tracers, or radioactive tracers. A single-color LED light source takes the place of conventional illumination to generate holograms rather than bright field images. By modelling the vesicle's scattering in a microscope…
▽ More
We report efforts to quantify the loading of cell-sized lipid vesicles using in-line digital holographic microscopy. This method does not require fluorescent reporters, fluorescent tracers, or radioactive tracers. A single-color LED light source takes the place of conventional illumination to generate holograms rather than bright field images. By modelling the vesicle's scattering in a microscope with a Lorenz-Mie light scattering model, and comparing the results to data holograms, we are able to measure the vesicle's refractive index and thus loading. Performing the same comparison for bulk light scattering measurements enables retrieval of vesicle loading for nanoscale vesicles.
△ Less
Submitted 26 April, 2024; v1 submitted 12 April, 2022;
originally announced April 2022.
-
Persistent-Transient Duality in Human Behavior Modeling
Authors:
Hung Tran,
Vuong Le,
Svetha Venkatesh,
Truyen Tran
Abstract:
We propose to model the persistent-transient duality in human behavior using a parent-child multi-channel neural network, which features a parent persistent channel that manages the global dynamics and children transient channels that are initiated and terminated on-demand to handle detailed interactive actions. The short-lived transient sessions are managed by a proposed Transient Switch. The neu…
▽ More
We propose to model the persistent-transient duality in human behavior using a parent-child multi-channel neural network, which features a parent persistent channel that manages the global dynamics and children transient channels that are initiated and terminated on-demand to handle detailed interactive actions. The short-lived transient sessions are managed by a proposed Transient Switch. The neural framework is trained to discover the structure of the duality automatically. Our model shows superior performances in human-object interaction motion prediction.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
Distances to Local Group Galaxies via Population II, Stellar Distance Indicators. II. The Fornax Dwarf Spheroidal
Authors:
Elias K. Oakes,
Taylor J. Hoyt,
Wendy L. Freedman,
Barry F. Madore,
Quang H. Tran,
William Cerny,
Rachael L. Beaton,
Mark Seibert
Abstract:
We determine three independent Population II distance moduli to the Fornax dwarf spheroidal (dSph) galaxy, using wide-field, ground-based $VI$ imaging acquired with the Magellan-Baade telescope at Las Campanas Observatory. After subtracting foreground stars using Gaia EDR3 proper motions, we measure an $I$-band tip of the red giant branch (TRGB) magnitude of…
▽ More
We determine three independent Population II distance moduli to the Fornax dwarf spheroidal (dSph) galaxy, using wide-field, ground-based $VI$ imaging acquired with the Magellan-Baade telescope at Las Campanas Observatory. After subtracting foreground stars using Gaia EDR3 proper motions, we measure an $I$-band tip of the red giant branch (TRGB) magnitude of $I_0^\mathrm{TRGB} = 16.753 \pm 0.03_\mathrm{stat} \pm 0.037_\mathrm{sys}$ mag, with a calibration based in the LMC giving a distance modulus of $μ_0^\mathrm{TRGB} = 20.80 \pm 0.037_\mathrm{stat} \pm 0.057_\mathrm{sys}$ mag. We determine an RR Lyrae (RRL) distance from template mean magnitudes, with periods adopted from the literature. Adopting a Gaia DR2 calibration of first overtone RRL period-luminosity and period-Wesenheit relations, we find $μ_0^\mathrm{PLZ} = 20.74 \pm 0.01_\mathrm{stat} \pm 0.12_\mathrm{sys}$ mag and $μ_0^\mathrm{PWZ} = 20.68 \pm 0.02_\mathrm{stat} \pm 0.07_\mathrm{sys}$ mag. Finally, we determine a distance from Fornax's horizontal branch (HB) and two galactic globular cluster calibrators, giving $μ_0^\mathrm{HB} = 20.83 \pm 0.03_\mathrm{stat} \pm 0.09_\mathrm{sys}$ mag. These distances are each derived from homogeneous IMACS photometry, are anchored to independent geometric zero-points, and utilize different classes of stars. We therefore average over independent uncertainties and report the combined distance modulus $\langle μ_0\rangle = 20.770 \pm 0.042_\mathrm{stat} \pm 0.024_\mathrm{sys}$ mag (corresponding to a distance of $143\pm3$ kpc).
△ Less
Submitted 10 May, 2022; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Strongly Quasiconvex subgroups in graphs of groups
Authors:
Hoang Thanh Nguyen,
Hung Cong Tran
Abstract:
Given a graph of groups $\mathcal{G} = (Γ, \{G_v\}, \{G_e\})$ with certain conditions on vertex groups and $G$ acts acylindrically on its Bass-Serre tree $T$. Let $H$ be a finitely generated subgroup of $G$. We prove the following statements equivalence: $H$ has finite height, $(G, T, H)$ is a $A/QI$--triple, $H$ is strongly quasiconvex and virtually free in $G$. We also give a condition to determ…
▽ More
Given a graph of groups $\mathcal{G} = (Γ, \{G_v\}, \{G_e\})$ with certain conditions on vertex groups and $G$ acts acylindrically on its Bass-Serre tree $T$. Let $H$ be a finitely generated subgroup of $G$. We prove the following statements equivalence: $H$ has finite height, $(G, T, H)$ is a $A/QI$--triple, $H$ is strongly quasiconvex and virtually free in $G$. We also give a condition to determine whether strong quasiconvexity in a group is preserved under amalgams.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Leveraging Deep Neural Networks for Massive MIMO Data Detection
Authors:
Ly V. Nguyen,
Nhan T. Nguyen,
Nghi H. Tran,
Markku Juntti,
A. Lee Swindlehurst,
Duy H. N. Nguyen
Abstract:
Massive multiple-input multiple-output (MIMO) is a key technology for emerging next-generation wireless systems. Utilizing large antenna arrays at base-stations, massive MIMO enables substantial spatial multiplexing gains by simultaneously serving a large number of users. However, the complexity in massive MIMO signal processing (e.g., data detection) increases rapidly with the number of users, ma…
▽ More
Massive multiple-input multiple-output (MIMO) is a key technology for emerging next-generation wireless systems. Utilizing large antenna arrays at base-stations, massive MIMO enables substantial spatial multiplexing gains by simultaneously serving a large number of users. However, the complexity in massive MIMO signal processing (e.g., data detection) increases rapidly with the number of users, making conventional hand-engineered algorithms less computationally efficient. Low-complexity massive MIMO detection algorithms, especially those inspired or aided by deep learning, have emerged as a promising solution. While there exist many MIMO detection algorithms, the aim of this magazine paper is to provide insight into how to leverage deep neural networks (DNN) for massive MIMO detection. We review recent developments in DNN-based MIMO detection that incorporate the domain knowledge of established MIMO detection algorithms with the learning capability of DNNs. We then present a comparison of the key numerical performance metrics of these works. We conclude by describing future research areas and applications of DNNs in massive MIMO receivers.
△ Less
Submitted 11 April, 2022;
originally announced April 2022.
-
Differentiability of effective fronts in the continuous setting in two dimensions
Authors:
Hung V. Tran,
Yifeng Yu
Abstract:
We study the effective front associated with first-order front propagations in two dimensions ($n=2$) in the periodic setting with continuous coefficients. Our main result says that that the boundary of the effective front is differentiable at every irrational point. Equivalently, the stable norm associated with a continuous $\mathbb{Z}^2$-periodic Riemannian metric is differentiable at irrational…
▽ More
We study the effective front associated with first-order front propagations in two dimensions ($n=2$) in the periodic setting with continuous coefficients. Our main result says that that the boundary of the effective front is differentiable at every irrational point. Equivalently, the stable norm associated with a continuous $\mathbb{Z}^2$-periodic Riemannian metric is differentiable at irrational points. This conclusion was obtained decades ago for smooth metrics ([3,5]). To the best of our knowledge, our result provides the first nontrivial property of the effective fronts in the continuous setting, which is the standard assumption in the PDE theory. Combining with the sufficiency result in [12], our result implies that for continuous coefficients, a polygon could be an effective front if and only if it is centrally symmetric with rational vertices and nonempty interior.
△ Less
Submitted 7 June, 2022; v1 submitted 25 March, 2022;
originally announced March 2022.
-
WayFAST: Navigation with Predictive Traversability in the Field
Authors:
Mateus Valverde Gasparino,
Arun Narenthiran Sivakumar,
Yixiao Liu,
Andres Eduardo Baquero Velasquez,
Vitor Akihiro Hisano Higuti,
John Rogers,
Huy Tran,
Girish Chowdhary
Abstract:
We present a self-supervised approach for learning to predict traversable paths for wheeled mobile robots that require good traction to navigate. Our algorithm, termed WayFAST (Waypoint Free Autonomous Systems for Traversability), uses RGB and depth data, along with navigation experience, to autonomously generate traversable paths in outdoor unstructured environments. Our key inspiration is that t…
▽ More
We present a self-supervised approach for learning to predict traversable paths for wheeled mobile robots that require good traction to navigate. Our algorithm, termed WayFAST (Waypoint Free Autonomous Systems for Traversability), uses RGB and depth data, along with navigation experience, to autonomously generate traversable paths in outdoor unstructured environments. Our key inspiration is that traction can be estimated for rolling robots using kinodynamic models. Using traction estimates provided by an online receding horizon estimator, we are able to train a traversability prediction neural network in a self-supervised manner, without requiring heuristics utilized by previous methods. We demonstrate the effectiveness of WayFAST through extensive field testing in varying environments, ranging from sandy dry beaches to forest canopies and snow covered grass fields. Our results clearly demonstrate that WayFAST can learn to avoid geometric obstacles as well as untraversable terrain, such as snow, which would be difficult to avoid with sensors that provide only geometric data, such as LiDAR. Furthermore, we show that our training pipeline based on online traction estimates is more data-efficient than other heuristic-based methods.
△ Less
Submitted 1 August, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
IoT Data Discovery: Routing Table and Summarization Techniques
Authors:
Hieu Tran,
Son Nguyen,
I-Ling Yen,
Farokh Bastani
Abstract:
In this paper, we consider the IoT data discovery problem in very large and growing scale networks. Through analysis, examples, and experimental studies, we show the importance of peer-to-peer, unstructured routing for IoT data discovery and point out the space efficiency issue that has been overlooked in keyword-based routing algorithms in unstructured networks. Specifically, as the first in the…
▽ More
In this paper, we consider the IoT data discovery problem in very large and growing scale networks. Through analysis, examples, and experimental studies, we show the importance of peer-to-peer, unstructured routing for IoT data discovery and point out the space efficiency issue that has been overlooked in keyword-based routing algorithms in unstructured networks. Specifically, as the first in the field, this paper investigates routing table designs and various compression techniques to support effective and space-efficient IoT data discovery routing. Novel summarization algorithms, including alphabetical, hash, and meaning-based summarization and their corresponding coding schemes, are proposed. We also consider routing table design to support summarization without degrading lookup efficiency for discovery query routing. The issue of potentially misleading routing due to summarization is also investigated. Subsequently, we analyze the strategy of when to summarize to balance the tradeoff between the routing table compression rate and the chance of causing misleading routing. For the experimental study, we have collected 100K IoT data streams from various IoT databases as the input dataset. Experimental results show that our summarization solution can reduce the routing table size by 20 to 30 folds with a 2-5% increase in latency compared with similar peer-to-peer discovery routing algorithms without summarization. Also, our approach outperforms DHT-based approaches by 2 to 6 folds in terms of latency and traffic.
△ Less
Submitted 6 May, 2022; v1 submitted 21 March, 2022;
originally announced March 2022.
-
LensingETC: a tool to optimize multi-filter imaging campaigns of galaxy-scale strong lensing systems
Authors:
Anowar J. Shajib,
Karl Glazebrook,
Tania Barone,
Geraint F. Lewis,
Tucker Jones,
Kim-Vy H. Tran,
Elizabeth Buckley-Geer,
Thomas E. Collett,
Joshua Frieman,
Colin Jacobs
Abstract:
Imaging data is the principal observable required to use galaxy-scale strong lensing in a multitude of applications in extragalactic astrophysics and cosmology. In this paper, we develop Lensing Exposure Time Calculator (LensingETC) to optimize the efficiency of telescope time usage when planning multi-filter imaging campaigns for galaxy-scale strong lenses. This tool simulates realistic data tail…
▽ More
Imaging data is the principal observable required to use galaxy-scale strong lensing in a multitude of applications in extragalactic astrophysics and cosmology. In this paper, we develop Lensing Exposure Time Calculator (LensingETC) to optimize the efficiency of telescope time usage when planning multi-filter imaging campaigns for galaxy-scale strong lenses. This tool simulates realistic data tailored to specified instrument characteristics and then automatically models them to assess the power of the data in constraining lens model parameters. We demonstrate a use case of this tool by optimizing a two-filter observing strategy (in IR and UVIS) within the limited exposure time per system allowed by a Hubble Space Telescope (HST) Snapshot program. We find that higher resolution is more advantageous to gain constraining power on the lensing observables, when there is a trade-off between signal-to-noise ratio and resolution; e.g., between the UVIS and IR filters of the HST. We also find that, whereas a point spread function (PSF) with sub-Nyquist sampling allows the sample mean for a model parameter to be robustly recovered for both galaxy-galaxy and point-source lensing systems, a sub-Nyquist sampled PSF introduces a larger scatter than a Nyquist sampled one in the deviation from the ground truth for point-source lens systems.
△ Less
Submitted 10 March, 2022;
originally announced March 2022.
-
TOI-1670 b and c: An Inner Sub-Neptune with an Outer Warm Jupiter Unlikely to have Originated from High-Eccentricity Migration
Authors:
Quang H. Tran,
Brendan P. Bowler,
Michael Endl,
William D. Cochran,
Phillip J. MacQueen,
Davide Gandolfi,
Carina M. Persson,
Malcolm Fridlund,
Enric Palle,
Grzegorz Nowak,
Hans J. Deeg,
Rafael Luque,
John H. Livingston,
Petr Kabáth,
Marek Skarka,
Ján Šubjak,
Steve B. Howell,
Simon H. Albrecht,
Karen A. Collins,
Massimiliano Esposito,
Vincent Van Eylen,
Sascha Grziwa,
Elisa Goffo,
Chelsea X. Huang,
Jon M. Jenkins
, et al. (16 additional authors not shown)
Abstract:
We report the discovery of two transiting planets around the bright ($V=9.9$ mag) main sequence F7 star TOI-1670 by the Transiting Exoplanet Survey Satellite. TOI-1670 b is a sub-Neptune ($R_\mathrm{b} = 2.06_{-0.15}^{+0.19}$ $R_\oplus$) on a 10.9-day orbit and TOI-1670 c is a warm Jupiter ($R_\mathrm{c} = 0.987_{-0.025}^{+0.025}$ $R_\mathrm{Jup}$) on a 40.7-day orbit. Using radial velocity observ…
▽ More
We report the discovery of two transiting planets around the bright ($V=9.9$ mag) main sequence F7 star TOI-1670 by the Transiting Exoplanet Survey Satellite. TOI-1670 b is a sub-Neptune ($R_\mathrm{b} = 2.06_{-0.15}^{+0.19}$ $R_\oplus$) on a 10.9-day orbit and TOI-1670 c is a warm Jupiter ($R_\mathrm{c} = 0.987_{-0.025}^{+0.025}$ $R_\mathrm{Jup}$) on a 40.7-day orbit. Using radial velocity observations gathered with the Tull coudé Spectrograph on the Harlan J. Smith telescope and HARPS-N on the Telescopio Nazionale Galileo, we find a planet mass of $M_\mathrm{c} = 0.63_{-0.08}^{+0.09}$ $M_\mathrm{Jup}$ for the outer warm Jupiter, implying a mean density of $ρ_c = 0.81_{-0.11}^{+0.13}$ g cm$^{-3}$. The inner sub-Neptune is undetected in our radial velocity data ($M_\mathrm{b} < 0.13$ $M_\mathrm{Jup}$ at the 99% confidence level). Multi-planet systems like TOI-1670 hosting an outer warm Jupiter on a nearly circular orbit ($e_\mathrm{c} = 0.09_{-0.04}^{+0.05}$) and one or more inner coplanar planets are more consistent with "gentle" formation mechanisms such as disk migration or $in$ $situ$ formation rather than high-eccentricity migration. Of the 11 known systems with a warm Jupiter and a smaller inner companion, 8 (73%) are near a low-order mean-motion resonance, which can be a signature of migration. TOI-1670 joins two other systems (27% of this subsample) with period commensurabilities greater than 3, a common feature of $in$ $situ$ formation or halted inward migration. TOI-1670 and the handful of similar systems support a diversity of formation pathways for warm Jupiters.
△ Less
Submitted 8 March, 2022;
originally announced March 2022.
-
Model Calibration of the Liquid Mercury Spallation Target using Evolutionary Neural Networks and Sparse Polynomial Expansions
Authors:
Majdi I. Radaideh,
Hoang Tran,
Lianshan Lin,
Hao Jiang,
Drew Winder,
Sarma Gorti,
Guannan Zhang,
Justin Mach,
Sarah Cousineau
Abstract:
The mercury constitutive model predicting the strain and stress in the target vessel plays a central role in improving the lifetime prediction and future target designs of the mercury targets at the Spallation Neutron Source (SNS). We leverage the experiment strain data collected over multiple years to improve the mercury constitutive model through a combination of large-scale simulations of the t…
▽ More
The mercury constitutive model predicting the strain and stress in the target vessel plays a central role in improving the lifetime prediction and future target designs of the mercury targets at the Spallation Neutron Source (SNS). We leverage the experiment strain data collected over multiple years to improve the mercury constitutive model through a combination of large-scale simulations of the target behavior and the use of machine learning tools for parameter estimation. We present two interdisciplinary approaches for surrogate-based model calibration of expensive simulations using evolutionary neural networks and sparse polynomial expansions. The experiments and results of the two methods show a very good agreement for the solid mechanics simulation of the mercury spallation target. The proposed methods are used to calibrate the tensile cutoff threshold, mercury density, and mercury speed of sound during intense proton pulse experiments. Using strain experimental data from the mercury target sensors, the newly calibrated simulations achieve 7\% average improvement on the signal prediction accuracy and 8\% reduction in mean absolute error compared to previously reported reference parameters, with some sensors experiencing up to 30\% improvement. The proposed calibrated simulations can significantly aid in fatigue analysis to estimate the mercury target lifetime and integrity, which reduces abrupt target failure and saves a tremendous amount of costs. However, an important conclusion from this work points out to a deficiency in the current constitutive model based on the equation of state in capturing the full physics of the spallation reaction. Given that some of the calibrated parameters that show a good agreement with the experimental data can be nonphysical mercury properties, we need a more advanced two-phase flow model to capture bubble dynamics and mercury cavitation.
△ Less
Submitted 18 February, 2022;
originally announced February 2022.
-
Disentangling Successor Features for Coordination in Multi-agent Reinforcement Learning
Authors:
Seung Hyun Kim,
Neale Van Stralen,
Girish Chowdhary,
Huy T. Tran
Abstract:
Multi-agent reinforcement learning (MARL) is a promising framework for solving complex tasks with many agents. However, a key challenge in MARL is defining private utility functions that ensure coordination when training decentralized agents. This challenge is especially prevalent in unstructured tasks with sparse rewards and many agents. We show that successor features can help address this chall…
▽ More
Multi-agent reinforcement learning (MARL) is a promising framework for solving complex tasks with many agents. However, a key challenge in MARL is defining private utility functions that ensure coordination when training decentralized agents. This challenge is especially prevalent in unstructured tasks with sparse rewards and many agents. We show that successor features can help address this challenge by disentangling an individual agent's impact on the global value function from that of all other agents. We use this disentanglement to compactly represent private utilities that support stable training of decentralized agents in unstructured tasks. We implement our approach using a centralized training, decentralized execution architecture and test it in a variety of multi-agent environments. Our results show improved performance and training time relative to existing methods and suggest that disentanglement of successor features offers a promising approach to coordination in MARL.
△ Less
Submitted 15 February, 2022;
originally announced February 2022.
-
Numerical prescriptions of early-time divergences of the in-in formalism
Authors:
Duc Huy Tran,
Yi Wang,
Juanyi Yang,
Yuhang Zhu
Abstract:
In quantum field theory, the in and out states can be related to the full Hamiltonian by the $iε$ prescription. A Wick rotation can further bring the correlation functions to Euclidean spacetime where the integrals are better defined. This setup is convenient for analytical calculations. However, for numerical calculations, an infinitesimal $ε$ or a Wick rotation of numerical functions are difficu…
▽ More
In quantum field theory, the in and out states can be related to the full Hamiltonian by the $iε$ prescription. A Wick rotation can further bring the correlation functions to Euclidean spacetime where the integrals are better defined. This setup is convenient for analytical calculations. However, for numerical calculations, an infinitesimal $ε$ or a Wick rotation of numerical functions are difficult to implement. We propose two new numerical methods to solve this problem, namely an Integral Basis method based on linear regression and a Beta Regulator method based on Cesàro/Riesz summation. Another class of partition-extrapolation methods previously used in electromagnetic engineering is also introduced. We benchmark these methods with existing methods using in-in formalism integrals, indicating advantages of these new methods over the existing methods in computation time and accuracy.
△ Less
Submitted 13 February, 2022;
originally announced February 2022.
-
POSYDON: A General-Purpose Population Synthesis Code with Detailed Binary-Evolution Simulations
Authors:
Tassos Fragos,
Jeff J. Andrews,
Simone S. Bavera,
Christopher P. L. Berry,
Scott Coughlin,
Aaron Dotter,
Prabin Giri,
Vicky Kalogera,
Aggelos Katsaggelos,
Konstantinos Kovlakas,
Shamal Lalvani,
Devina Misra,
Philipp M. Srivastava,
Ying Qin,
Kyle A. Rocha,
Jaime Roman-Garza,
Juan Gabriel Serra,
Petter Stahle,
Meng Sun,
Xu Teng,
Goce Trajcevski,
Nam Hai Tran,
Zepei Xing,
Emmanouil Zapartas,
Michael Zevin
Abstract:
Most massive stars are members of a binary or a higher-order stellar systems, where the presence of a binary companion can decisively alter their evolution via binary interactions. Interacting binaries are also important astrophysical laboratories for the study of compact objects. Binary population synthesis studies have been used extensively over the last two decades to interpret observations of…
▽ More
Most massive stars are members of a binary or a higher-order stellar systems, where the presence of a binary companion can decisively alter their evolution via binary interactions. Interacting binaries are also important astrophysical laboratories for the study of compact objects. Binary population synthesis studies have been used extensively over the last two decades to interpret observations of compact-object binaries and to decipher the physical processes that lead to their formation. Here, we present POSYDON, a novel, binary population synthesis code that incorporates full stellar-structure and binary-evolution modeling, using the MESA code, throughout the whole evolution of the binaries. The use of POSYDON enables the self-consistent treatment of physical processes in stellar and binary evolution, including: realistic mass-transfer calculations and assessment of stability, internal angular-momentum transport and tides, stellar core sizes, mass-transfer rates and orbital periods. This paper describes the detailed methodology and implementation of POSYDON, including the assumed physics of stellar- and binary-evolution, the extensive grids of detailed single- and binary-star models, the post-processing, classification and interpolation methods we developed for use with the grids, and the treatment of evolutionary phases that are not based on pre-calculated grids. The first version of POSYDON targets binaries with massive primary stars (potential progenitors of neutron stars or black holes) at solar metallicity.
△ Less
Submitted 7 August, 2022; v1 submitted 11 February, 2022;
originally announced February 2022.
-
Nesterov Accelerated Shuffling Gradient Method for Convex Optimization
Authors:
Trang H. Tran,
Katya Scheinberg,
Lam M. Nguyen
Abstract:
In this paper, we propose Nesterov Accelerated Shuffling Gradient (NASG), a new algorithm for the convex finite-sum minimization problems. Our method integrates the traditional Nesterov's acceleration momentum with different shuffling sampling schemes. We show that our algorithm has an improved rate of $\mathcal{O}(1/T)$ using unified shuffling schemes, where $T$ is the number of epochs. This rate…
▽ More
In this paper, we propose Nesterov Accelerated Shuffling Gradient (NASG), a new algorithm for the convex finite-sum minimization problems. Our method integrates the traditional Nesterov's acceleration momentum with different shuffling sampling schemes. We show that our algorithm has an improved rate of $\mathcal{O}(1/T)$ using unified shuffling schemes, where $T$ is the number of epochs. This rate is better than that of any other shuffling gradient methods in convex regime. Our convergence analysis does not require an assumption on bounded domain or a bounded gradient condition. For randomized shuffling schemes, we improve the convergence bound further. When employing some initial condition, we show that our method converges faster near the small neighborhood of the solution. Numerical simulations demonstrate the efficiency of our algorithm.
△ Less
Submitted 12 June, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Finite-Sum Optimization: A New Perspective for Convergence to a Global Solution
Authors:
Lam M. Nguyen,
Trang H. Tran,
Marten van Dijk
Abstract:
Deep neural networks (DNNs) have shown great success in many machine learning tasks. Their training is challenging since the loss surface of the network architecture is generally non-convex, or even non-smooth. How and under what assumptions is guaranteed convergence to a \textit{global} minimum possible? We propose a reformulation of the minimization problem allowing for a new recursive algorithm…
▽ More
Deep neural networks (DNNs) have shown great success in many machine learning tasks. Their training is challenging since the loss surface of the network architecture is generally non-convex, or even non-smooth. How and under what assumptions is guaranteed convergence to a \textit{global} minimum possible? We propose a reformulation of the minimization problem allowing for a new recursive algorithmic framework. By using bounded style assumptions, we prove convergence to an $\varepsilon$-(global) minimum using $\mathcal{\tilde{O}}(1/\varepsilon^3)$ gradient computations. Our theoretical foundation motivates further study, implementation, and optimization of the new algorithmic framework and further investigation of its non-standard bounded style assumptions. This new direction broadens our understanding of why and under what circumstances training of a DNN converges to a global minimum.
△ Less
Submitted 7 February, 2022;
originally announced February 2022.
-
Local mass-conserving solution for a critical Coagulation-Fragmentation equation
Authors:
Hung V. Tran,
Truong-Son Van
Abstract:
The critical coagulation-fragmentation equation with multiplicative coagulation and constant fragmentation kernels is known to not have global mass-conserving solutions when the initial mass is greater than $1$. We show that for any given positive initial mass with finite second moment, there is a time $T^*>0$ such that the equation possesses a unique mass-conserving solution up to $T^*$. The nove…
▽ More
The critical coagulation-fragmentation equation with multiplicative coagulation and constant fragmentation kernels is known to not have global mass-conserving solutions when the initial mass is greater than $1$. We show that for any given positive initial mass with finite second moment, there is a time $T^*>0$ such that the equation possesses a unique mass-conserving solution up to $T^*$. The novel idea is to singularly perturb the constant fragmentation kernel by small additive terms and study the limiting behavior of the solutions of the perturbed system via the Bernstein transform.
△ Less
Submitted 11 December, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
Transfer Reinforcement Learning for Differing Action Spaces via Q-Network Representations
Authors:
Nathan Beck,
Abhiramon Rajasekharan,
Hieu Tran
Abstract:
Transfer learning approaches in reinforcement learning aim to assist agents in learning their target domains by leveraging the knowledge learned from other agents that have been trained on similar source domains. For example, recent research focus within this space has been placed on knowledge transfer between tasks that have different transition dynamics and reward functions; however, little focu…
▽ More
Transfer learning approaches in reinforcement learning aim to assist agents in learning their target domains by leveraging the knowledge learned from other agents that have been trained on similar source domains. For example, recent research focus within this space has been placed on knowledge transfer between tasks that have different transition dynamics and reward functions; however, little focus has been placed on knowledge transfer between tasks that have different action spaces. In this paper, we approach the task of transfer learning between domains that differ in action spaces. We present a reward sha** method based on source embedding similarity that is applicable to domains with both discrete and continuous action spaces. The efficacy of our approach is evaluated on transfer to restricted action spaces in the Acrobot-v1 and Pendulum-v0 domains. A comparison with two baselines shows that our method does not outperform these baselines in these continuous action spaces but does show an improvement in these discrete action spaces. We conclude our analysis with future directions for this work.
△ Less
Submitted 21 April, 2022; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Thermal structure and aerosols in Mars' atmosphere from TIRVIM/ACS onboard the ExoMars Trace Gas Orbiter : validation of the retrieval algorithm
Authors:
Sandrine Guerlet,
N. Ignatiev,
F. Forget,
T. Fouchet,
P. Vlasov,
G. Bergeron,
R. M. B. Young,
E. Millour,
S. Fan,
H. Tran,
A. Shakun,
A. Grigoriev,
A. Trokhimovskiy,
F. Montmessin,
O. Korablev
Abstract:
The Atmospheric Chemistry Suite (ACS) onboard the ExoMars Trace Gas Orbiter (TGO) monitors the Martian atmosphere through different spectral intervals in the infrared light. We present a retrieval algorithm tailored to the analysis of spectra acquired in nadir geometry by TIRVIM, the thermal infrared channel of ACS. Our algorithm simultaneously retrieves vertical profile of atmospheric temperature…
▽ More
The Atmospheric Chemistry Suite (ACS) onboard the ExoMars Trace Gas Orbiter (TGO) monitors the Martian atmosphere through different spectral intervals in the infrared light. We present a retrieval algorithm tailored to the analysis of spectra acquired in nadir geometry by TIRVIM, the thermal infrared channel of ACS. Our algorithm simultaneously retrieves vertical profile of atmospheric temperature up to 50 km, surface temperature, and integrated optical depth of dust and water ice clouds. The specificity of the TIRVIM dataset lies in its capacity to resolve the diurnal cycle over a 54 sol period. However, it is uncertain to what extent can the desired atmospheric quantities be accurately estimated at different times of day. Here we first present an Observing System Simulation Experiment (OSSE). We produce synthetic observations at various latitudes, seasons and local times and run our retrieval algorithm on these synthetic data, to evaluate its robustness. Different sources of biases are documented, in particular regarding aerosol retrievals. Atmospheric temperature retrievals are found robust even when dust and/or water ice cloud opacities are not well estimated in our OSSE. We then apply our algorithm to TIRVIM observations in April-May, 2018 and perform a cross-validation of retrieved atmospheric temperature and dust integrated opacity by comparisons with thousands of co-located Mars Climate Sounder (MCS) retrievals. Most differences between TIRVIM and MCS atmospheric temperatures can be attributed to differences in vertical sensitivity. Daytime dust opacities agree well with each other, while biases are found in nighttime dust opacity retrieved from TIRVIM at this season.
△ Less
Submitted 27 January, 2022;
originally announced January 2022.
-
Orienteering with one endomorphism
Authors:
Sarah Arpin,
Mingjie Chen,
Kristin E. Lauter,
Renate Scheidler,
Katherine E. Stange,
Ha T. N. Tran
Abstract:
In supersingular isogeny-based cryptography, the path-finding problem reduces to the endomorphism ring problem. Can path-finding be reduced to knowing just one endomorphism? It is known that a small endomorphism enables polynomial-time path-finding and endomorphism ring computation (Love-Boneh [36]). An endomorphism gives an explicit orientation of a supersingular elliptic curve. In this paper, we…
▽ More
In supersingular isogeny-based cryptography, the path-finding problem reduces to the endomorphism ring problem. Can path-finding be reduced to knowing just one endomorphism? It is known that a small endomorphism enables polynomial-time path-finding and endomorphism ring computation (Love-Boneh [36]). An endomorphism gives an explicit orientation of a supersingular elliptic curve. In this paper, we use the volcano structure of the oriented supersingular isogeny graph to take ascending/descending/horizontal steps on the graph and deduce path-finding algorithms to an initial curve. Each altitude of the volcano corresponds to a unique quadratic order, called the primitive order. We introduce a new hard problem of computing the primitive order given an arbitrary endomorphism on the curve, and we also provide a sub-exponential quantum algorithm for solving it. In concurrent work (Wesolowski [54]), it was shown that the endomorphism ring problem in the presence of one endomorphism with known primitive order reduces to a vectorization problem, implying path-finding algorithms. Our path-finding algorithms are more general in the sense that we don't assume the knowledge of the primitive order associated with the endomorphism.
△ Less
Submitted 19 October, 2022; v1 submitted 26 January, 2022;
originally announced January 2022.
-
Efficient Approximations of the Fisher Matrix in Neural Networks using Kronecker Product Singular Value Decomposition
Authors:
Abdoulaye Koroko,
Ani Anciaux-Sedrakian,
Ibtihel Ben Gharbia,
Valérie Garès,
Mounir Haddou,
Quang Huy Tran
Abstract:
Several studies have shown the ability of natural gradient descent to minimize the objective function more efficiently than ordinary gradient descent based methods. However, the bottleneck of this approach for training deep neural networks lies in the prohibitive cost of solving a large dense linear system corresponding to the Fisher Information Matrix (FIM) at each iteration. This has motivated v…
▽ More
Several studies have shown the ability of natural gradient descent to minimize the objective function more efficiently than ordinary gradient descent based methods. However, the bottleneck of this approach for training deep neural networks lies in the prohibitive cost of solving a large dense linear system corresponding to the Fisher Information Matrix (FIM) at each iteration. This has motivated various approximations of either the exact FIM or the empirical one. The most sophisticated of these is KFAC, which involves a Kronecker-factored block diagonal approximation of the FIM. With only a slight additional cost, a few improvements of KFAC from the standpoint of accuracy are proposed. The common feature of the four novel methods is that they rely on a direct minimization problem, the solution of which can be computed via the Kronecker product singular value decomposition technique. Experimental results on the three standard deep auto-encoder benchmarks showed that they provide more accurate approximations to the FIM. Furthermore, they outperform KFAC and state-of-the-art first-order methods in terms of optimization speed.
△ Less
Submitted 14 October, 2022; v1 submitted 25 January, 2022;
originally announced January 2022.
-
Dense Pixel-Labeling for Reverse-Transfer and Diagnostic Learning on Lung Ultrasound for COVID-19 and Pneumonia Detection
Authors:
Gautam Rajendrakumar Gare,
Andrew Schoenling,
Vipin Philip,
Hai V Tran,
Bennett P deBoisblanc,
Ricardo Luis Rodriguez,
John Michael Galeotti
Abstract:
We propose using a pre-trained segmentation model to perform diagnostic classification in order to achieve better generalization and interpretability, terming the technique reverse-transfer learning. We present an architecture to convert segmentation models to classification models. We compare and contrast dense vs sparse segmentation labeling and study its impact on diagnostic classification. We…
▽ More
We propose using a pre-trained segmentation model to perform diagnostic classification in order to achieve better generalization and interpretability, terming the technique reverse-transfer learning. We present an architecture to convert segmentation models to classification models. We compare and contrast dense vs sparse segmentation labeling and study its impact on diagnostic classification. We compare the performance of U-Net trained with dense and sparse labels to segment A-lines, B-lines, and Pleural lines on a custom dataset of lung ultrasound scans from 4 patients. Our experiments show that dense labels help reduce false positive detection. We study the classification capability of the dense and sparse trained U-Net and contrast it with a non-pretrained U-Net, to detect and differentiate COVID-19 and Pneumonia on a large ultrasound dataset of about 40k curvilinear and linear probe images. Our segmentation-based models perform better classification when using pretrained segmentation weights, with the dense-label pretrained U-Net performing the best.
△ Less
Submitted 25 January, 2022;
originally announced January 2022.
-
Seamless and Energy Efficient Maritime Coverage in Coordinated 6G Space-Air-Sea Non-Terrestrial Networks
Authors:
Sheikh Salman Hassan,
Do Hyeon Kim,
Yan Kyaw Tun,
Nguyen H. Tran,
Walid Saad,
Choong Seon Hong
Abstract:
Non-terrestrial networks (NTNs), which integrate space and aerial networks with terrestrial systems, are a key area in the emerging sixth-generation (6G) wireless networks. As part of 6G, NTNs must provide pervasive connectivity to a wide range of devices, including smartphones, vehicles, sensors, robots, and maritime users. However, due to the high mobility and deployment of NTNs, managing the sp…
▽ More
Non-terrestrial networks (NTNs), which integrate space and aerial networks with terrestrial systems, are a key area in the emerging sixth-generation (6G) wireless networks. As part of 6G, NTNs must provide pervasive connectivity to a wide range of devices, including smartphones, vehicles, sensors, robots, and maritime users. However, due to the high mobility and deployment of NTNs, managing the space-air-sea (SAS) NTN resources, i.e., energy, power, and channel allocation, is a major challenge. The design of a SAS-NTN for energy-efficient resource allocation is investigated in this study. The goal is to maximize system energy efficiency (EE) by collaboratively optimizing user equipment (UE) association, power control, and unmanned aerial vehicle (UAV) deployment. Given the limited payloads of UAVs, this work focuses on minimizing the total energy cost of UAVs (trajectory and transmission) while meeting EE requirements. A mixed-integer nonlinear programming problem is proposed, followed by the development of an algorithm to decompose, and solve each problem distributedly. The binary (UE association) and continuous (power, deployment) variables are separated using the Bender decomposition (BD), and then the Dinkelbach algorithm (DA) is used to convert fractional programming into an equivalent solvable form in the subproblem. A standard optimization solver is utilized to deal with the complexity of the master problem for binary variables. The alternating direction method of multipliers (ADMM) algorithm is used to solve the subproblem for the continuous variables. Our proposed algorithm provides a suboptimal solution, and simulation results demonstrate that the proposed algorithm achieves better EE than baselines.
△ Less
Submitted 21 January, 2022;
originally announced January 2022.
-
The Role of Pleura and Adipose in Lung Ultrasound AI
Authors:
Gautam Rajendrakumar Gare,
Wanwen Chen,
Alex Ling Yu Hung,
Edward Chen,
Hai V. Tran,
Tom Fox,
Pete Lowery,
Kevin Zamora,
Bennett P deBoisblanc,
Ricardo Luis Rodriguez,
John Michael Galeotti
Abstract:
In this paper, we study the significance of the pleura and adipose tissue in lung ultrasound AI analysis. We highlight their more prominent appearance when using high-frequency linear (HFL) instead of curvilinear ultrasound probes, showing HFL reveals better pleura detail. We compare the diagnostic utility of the pleura and adipose tissue using an HFL ultrasound probe. Masking the adipose tissue d…
▽ More
In this paper, we study the significance of the pleura and adipose tissue in lung ultrasound AI analysis. We highlight their more prominent appearance when using high-frequency linear (HFL) instead of curvilinear ultrasound probes, showing HFL reveals better pleura detail. We compare the diagnostic utility of the pleura and adipose tissue using an HFL ultrasound probe. Masking the adipose tissue during training and inference (while retaining the pleural line and Merlin's space artifacts such as A-lines and B-lines) improved the AI model's diagnostic accuracy.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
Weakly Supervised Contrastive Learning for Better Severity Scoring of Lung Ultrasound
Authors:
Gautam Rajendrakumar Gare,
Hai V. Tran,
Bennett P deBoisblanc,
Ricardo Luis Rodriguez,
John Michael Galeotti
Abstract:
With the onset of the COVID-19 pandemic, ultrasound has emerged as an effective tool for bedside monitoring of patients. Due to this, a large amount of lung ultrasound scans have been made available which can be used for AI based diagnosis and analysis. Several AI-based patient severity scoring models have been proposed that rely on scoring the appearance of the ultrasound scans. AI models are tra…
▽ More
With the onset of the COVID-19 pandemic, ultrasound has emerged as an effective tool for bedside monitoring of patients. Due to this, a large amount of lung ultrasound scans have been made available which can be used for AI based diagnosis and analysis. Several AI-based patient severity scoring models have been proposed that rely on scoring the appearance of the ultrasound scans. AI models are trained using ultrasound-appearance severity scores that are manually labeled based on standardized visual features. We address the challenge of labeling every ultrasound frame in the video clips. Our contrastive learning method treats the video clip severity labels as noisy weak severity labels for individual frames, thus requiring only video-level labels. We show that it performs better than the conventional cross-entropy loss based training. We combine frame severity predictions to come up with video severity predictions and show that the frame based model achieves comparable performance to a video based TSM model, on a large dataset combining public and private sources.
△ Less
Submitted 18 January, 2022;
originally announced January 2022.
-
Neural Network Compression of ACAS Xu Early Prototype is Unsafe: Closed-Loop Verification through Quantized State Backreachability
Authors:
Stanley Bak,
Hoang-Dung Tran
Abstract:
ACAS Xu is an air-to-air collision avoidance system designed for unmanned aircraft that issues horizontal turn advisories to avoid an intruder aircraft. Due the use of a large lookup table in the design, a neural network compression of the policy was proposed. Analysis of this system has spurred a significant body of research in the formal methods community on neural network verification. While ma…
▽ More
ACAS Xu is an air-to-air collision avoidance system designed for unmanned aircraft that issues horizontal turn advisories to avoid an intruder aircraft. Due the use of a large lookup table in the design, a neural network compression of the policy was proposed. Analysis of this system has spurred a significant body of research in the formal methods community on neural network verification. While many powerful methods have been developed, most work focuses on open-loop properties of the networks, rather than the main point of the system -- collision avoidance -- which requires closed-loop analysis.
In this work, we develop a technique to verify a closed-loop approximation of the system using state quantization and backreachability. We use favorable assumptions for the analysis -- perfect sensor information, instant following of advisories, ideal aircraft maneuvers and an intruder that only flies straight. When the method fails to prove the system is safe, we refine the quantization parameters until generating counterexamples where the original (non-quantized) system also has collisions.
△ Less
Submitted 27 March, 2022; v1 submitted 17 January, 2022;
originally announced January 2022.
-
Characterizations of diffusion matrices in homogenization of elliptic equations in nondivergence-form
Authors:
Xiaoqin Guo,
Timo Sprekeler,
Hung V. Tran
Abstract:
We characterize diffusion matrices that yield a $L^{\infty}$ convergence rate of $\mathcal{O}(\varepsilon^2)$ in the theory of periodic homogenization of linear elliptic equations in nondivergence-form. Such type-$\varepsilon^2$ diffusion matrices are of particular interest as the optimal rate of convergence in the generic case is only $\mathcal{O}(\varepsilon)$. First, we provide a new class of t…
▽ More
We characterize diffusion matrices that yield a $L^{\infty}$ convergence rate of $\mathcal{O}(\varepsilon^2)$ in the theory of periodic homogenization of linear elliptic equations in nondivergence-form. Such type-$\varepsilon^2$ diffusion matrices are of particular interest as the optimal rate of convergence in the generic case is only $\mathcal{O}(\varepsilon)$. First, we provide a new class of type-$\varepsilon^2$ diffusion matrices, confirming a conjecture posed in [15]. Then, we give a complete characterization of diagonal diffusion matrices in two dimensions and a systematic study in higher dimensions.
△ Less
Submitted 14 January, 2022; v1 submitted 6 January, 2022;
originally announced January 2022.
-
SAFL: A Self-Attention Scene Text Recognizer with Focal Loss
Authors:
Bao Hieu Tran,
Thanh Le-Cong,
Huu Manh Nguyen,
Duc Anh Le,
Thanh Hung Nguyen,
Phi Le Nguyen
Abstract:
In the last decades, scene text recognition has gained worldwide attention from both the academic community and actual users due to its importance in a wide range of applications. Despite achievements in optical character recognition, scene text recognition remains challenging due to inherent problems such as distortions or irregular layout. Most of the existing approaches mainly leverage recurren…
▽ More
In the last decades, scene text recognition has gained worldwide attention from both the academic community and actual users due to its importance in a wide range of applications. Despite achievements in optical character recognition, scene text recognition remains challenging due to inherent problems such as distortions or irregular layout. Most of the existing approaches mainly leverage recurrence or convolution-based neural networks. However, while recurrent neural networks (RNNs) usually suffer from slow training speed due to sequential computation and encounter problems as vanishing gradient or bottleneck, CNN endures a trade-off between complexity and performance. In this paper, we introduce SAFL, a self-attention-based neural network model with the focal loss for scene text recognition, to overcome the limitation of the existing approaches. The use of focal loss instead of negative log-likelihood helps the model focus more on low-frequency samples training. Moreover, to deal with the distortions and irregular texts, we exploit Spatial TransformerNetwork (STN) to rectify text before passing to the recognition network. We perform experiments to compare the performance of the proposed model with seven benchmarks. The numerical results show that our model achieves the best performance.
△ Less
Submitted 1 January, 2022;
originally announced January 2022.
-
Predicting Job Titles from Job Descriptions with Multi-label Text Classification
Authors:
Hieu Trung Tran,
Hanh Hong Phuc Vo,
Son T. Luu
Abstract:
Finding a suitable job and hunting for eligible candidates are important to job seeking and human resource agencies. With the vast information about job descriptions, employees and employers need assistance to automatically detect job titles based on job description texts. In this paper, we propose the multi-label classification approach for predicting relevant job titles from job description text…
▽ More
Finding a suitable job and hunting for eligible candidates are important to job seeking and human resource agencies. With the vast information about job descriptions, employees and employers need assistance to automatically detect job titles based on job description texts. In this paper, we propose the multi-label classification approach for predicting relevant job titles from job description texts, and implement the Bi-GRU-LSTM-CNN with different pre-trained language models to apply for the job titles prediction problem. The BERT with multilingual pre-trained model obtains the highest result by F1-scores on both development and test sets, which are 62.20% on the development set, and 47.44% on the test set.
△ Less
Submitted 9 February, 2022; v1 submitted 21 December, 2021;
originally announced December 2021.
-
RSSI prediction using Machine Learning models
Authors:
Tung Giang Le,
Huy Tung Quach,
Thu Thao Dao Le,
Manh Hoang Tran
Abstract:
In this study, we present a method to predict the Received signal strength indication (RSSI) in an area of the base station. Traditional attenuated wave propagation models are often time consuming as well as computationally complex, depending on the unique factors of the medium. This study focuses on providing a solution to predict signal quality using coordinate values of many points in the consi…
▽ More
In this study, we present a method to predict the Received signal strength indication (RSSI) in an area of the base station. Traditional attenuated wave propagation models are often time consuming as well as computationally complex, depending on the unique factors of the medium. This study focuses on providing a solution to predict signal quality using coordinate values of many points in the considering area. We apply machine learning models such as linear regression, Support Vector Machine (SVM) or Decision tree model, to directly predict the RSSI of many points in the range of a base station without computing the complex parameters of the attenuated propagation model. The effectiveness of RSSI prediction was evaluated by mean square error (MSE) and mean absolute error (MAE). The stage of training and testing machine learning models in the research uses data that are the actual measurement results during the research process.
△ Less
Submitted 20 December, 2021;
originally announced December 2021.
-
Effective fronts of polygon shapes in two dimensions
Authors:
Wenjia **g,
Hung V. Tran,
Yifeng Yu
Abstract:
We study the effective fronts of first order front propagations in two dimensions ($n=2$) in the periodic setting. Using PDE-based approaches, we show that for every $α\in (0,1)$, the class of centrally symmetric polygons with rational vertices and nonempty interior is admissible as effective fronts for given front speeds in $C^{1,α}(\mathbb T^2,(0,\infty))$. This result can also be formulated in…
▽ More
We study the effective fronts of first order front propagations in two dimensions ($n=2$) in the periodic setting. Using PDE-based approaches, we show that for every $α\in (0,1)$, the class of centrally symmetric polygons with rational vertices and nonempty interior is admissible as effective fronts for given front speeds in $C^{1,α}(\mathbb T^2,(0,\infty))$. This result can also be formulated in the language of stable norms corresponding to periodic metrics in $\mathbb T^2$. Similar results were known long time ago when $n\geq 3$ for front speeds in $C^{\infty}(\mathbb T^n,(0,\infty))$. Due to topological restrictions, the two dimensional case is much more subtle. In fact, the effective front is $C^1$, which cannot be a polygon, for given $C^{1,1}(\mathbb T^2,(0,\infty))$ front speeds. Our regularity requirements on front speeds are hence optimal. To the best of our knowledge, this is the first time that polygonal effective fronts have been constructed in two dimensions.
△ Less
Submitted 20 December, 2021;
originally announced December 2021.
-
Effect of magnetocrystalline anisotropy on magnetocaloric properties of AlFe$_{2}$B$_{2}$ compound
Authors:
Hung Ba Tran,
Hiroyoshi Momida,
Yu-ichiro Matsushita,
Kazunori Sato,
Yukihiro Makino,
Koun Shirai,
Tamio Oguchi
Abstract:
It is well known that the temperature dependence of the effective magnetocrystalline anisotropy energy obeys the $l(l+1)/2$ power law of magnetization in the Callen-Callen theory. Therefore, according to the Callen-Callen theory, the magnetocrystalline anisotropy energy is assumed to be zero at the critical temperature where the magnetization is approximately zero. This study estimates the tempera…
▽ More
It is well known that the temperature dependence of the effective magnetocrystalline anisotropy energy obeys the $l(l+1)/2$ power law of magnetization in the Callen-Callen theory. Therefore, according to the Callen-Callen theory, the magnetocrystalline anisotropy energy is assumed to be zero at the critical temperature where the magnetization is approximately zero. This study estimates the temperature dependence of the magnetocrystalline anisotropy energy by integrating the magnetization versus magnetic field ($M$--$H$) curves, and found that the magnetocrystalline anisotropy is still finite even above the Curie temperature in the uniaxial anisotropy, whereas this does not appear in the cubic anisotropy case. The origin is the fast reduction of the anisotropy field, which is the magnetic field required to saturate the magnetization along the hard axis, in the case of cubic anisotropy. Therefore, the magnetization anisotropy and anisotropic magnetic susceptibility, those are the key factors of magnetic anisotropy, could not be established in the case of cubic anisotropy. In addition, the effect of magnetocrystalline anisotropy on magnetocaloric properties, as the difference between the entropy change curves of AlFe$_{2}$B$_{2}$ appears above the Curie temperature, which is in good agreement with a previous experimental study. This is proof of magnetic anisotropy at slightly above Curie temperature.
△ Less
Submitted 15 December, 2021;
originally announced December 2021.
-
Optimal convergence rate for periodic homogenization of convex Hamilton-Jacobi equations
Authors:
Hung V. Tran,
Yifeng Yu
Abstract:
In this paper, we show that the rate of convergence in periodic homogenization of convex Hamilton-Jacobi equations is always $O(\varepsilon)$, which is optimal. This is a natural extension of a result concerning stable norms in metric geometry [4] that is essentially equivalent to the homogenization of convex static Hamilton-Jacobi equations. Another extremely interesting question in this directio…
▽ More
In this paper, we show that the rate of convergence in periodic homogenization of convex Hamilton-Jacobi equations is always $O(\varepsilon)$, which is optimal. This is a natural extension of a result concerning stable norms in metric geometry [4] that is essentially equivalent to the homogenization of convex static Hamilton-Jacobi equations. Another extremely interesting question in this direction is whether the $O(\varepsilon)$ rate holds in the nonconvex setting. We present a special nonconvex example with $O(\varepsilon)$ convergence rate, which relies on identifying the shape of the effective Hamiltonian and game theory interpretation formulas.
△ Less
Submitted 30 June, 2022; v1 submitted 13 December, 2021;
originally announced December 2021.
-
Curvature of the Second kind and a conjecture of Nishikawa
Authors:
Matthew Gursky,
Xiaodong Cao,
Hung Tran
Abstract:
In this paper, we investigate manifolds for which the curvature of the second kind (following the terminology of Nishikawa) satisfies certain positivity conditions. Our main result settles Nishikawa's conjecture that manifolds for which the curvature (operator) of the second kind are positive are diffeomorphic to a sphere, by showing that such manifolds satisfy Brendle's PIC1 condition. In dimensi…
▽ More
In this paper, we investigate manifolds for which the curvature of the second kind (following the terminology of Nishikawa) satisfies certain positivity conditions. Our main result settles Nishikawa's conjecture that manifolds for which the curvature (operator) of the second kind are positive are diffeomorphic to a sphere, by showing that such manifolds satisfy Brendle's PIC1 condition. In dimension four we show that curvature of the second kind has a canonical normal form, and use this to classify Einstein four-manifolds for which the curvature (operator) of the second kind is five-non-negative. We also calculate the normal form for some explicit examples in order to show that this assumption is sharp.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.