Search | arXiv e-print repository

Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Comparative Study

Authors: Zooey Nguyen, Anthony Annunziata, Vinh Luong, Sang Dinh, Quynh Le, Anh Hai Ha, Chanh Le, Hong An Phan, Shruti Raghavan, Christopher Nguyen

Abstract: This paper investigates the impact of domain-specific model fine-tuning and of reasoning mechanisms on the performance of question-answering (Q&A) systems powered by large language models (LLMs) and Retrieval-Augmented Generation (RAG). Using the FinanceBench SEC financial filings dataset, we observe that, for RAG, combining a fine-tuned embedding model with a fine-tuned LLM achieves better accuracy than generic models, with relatively greater gains attributable to fine-tuned embedding models. Additionally, employing reasoning iterations on top of RAG delivers an even bigger jump in performance, enabling the Q&A systems to get closer to human-expert quality. We discuss the implications of such findings, propose a structured technical design space capturing major technical components of Q&A AI, and provide recommendations for making high-impact technical choices for such components. We plan to follow up on this work with actionable guides for AI teams and further investigations into the impact of domain-specific augmentation in RAG and into agentic AI capabilities such as advanced planning and reasoning.

Submitted 19 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

Comments: Fixed typo of OODA's score on harder-question set in Table 2

arXiv:2402.04276 [pdf, other]

Accelerated boundary integral analysis of energy eigenvalues for confined electron states in quantum semiconductor heterostructures

Authors: J. D. Phan, A. -V. Phan

Abstract: This paper presents a novel and efficient approach for the computation of energy eigenvalues in quantum semiconductor heterostructures. Accurate determination of the electronic states in these heterostructures is crucial for understanding their optical and electronic properties, making it a key challenge in semiconductor physics. The proposed method is based on utilizing series expansions of zero-order Bessel functions to numerically solve the Schrödinger equation using the boundary integral method for bound electron states in a computationally efficient manner. To validate the proposed technique, we applied it to problems previously explored by other research groups. The results clearly demonstrate the computational efficiency and high precision of our approach. Notably, the proposed technique significantly reduces the computational time compared to the conventional method for searching the energy eigenvalues in quantum structures.

Submitted 15 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

MSC Class: 65N38 (Primary) 93B60; 82D77 (Secondary)

arXiv:2401.16951 [pdf, other]

Empirical tight-binding method for large-supercell simulations of disordered semiconductor alloys

Authors: Anh-Luan Phan, Alessandro Pecchia, Alessia Di Vito, Matthias Auf der Maur

Abstract: We analyze and present applications of a recently proposed empirical tight-binding scheme for investigating the effects of alloy disorder on various electronic and optical properties of semiconductor alloys, such as the band gap variation, the localization of charge carriers, and the optical transitions. The results for a typical antimony-containing III-V alloy, GaAsSb, show that the new scheme greatly improves the accuracy in reproducing the experimental alloy band gaps compared to other widely used schemes. The atomistic nature of the empirical tight-binding approach paired with a reliable parameterization enables more detailed physical insights into the effects of disorder in alloyed materials.

Submitted 30 January, 2024; originally announced January 2024.

Comments: 9 pages, 6 figures

arXiv:2401.06569 [pdf, other]

Effect of the Nature of the Solid Substrate on Spatially Heterogeneous Activated Dynamics in Glass Forming Supported Films

Authors: Anh D. Phan, Kenneth S. Schweizer

Abstract: We extend the force-level ECNLE theory to treat the spatial gradients of the alpha relaxation time and glass transition temperature, and the corresponding film-averaged quantities, to the geometrically asymmetric case of finite thickness supported films with variable fluid-substrate coupling. The latter typically nonuniversally slows down motion near the solid-liquid interface as modeled via modification of the surface dynamic free energy caging constraints which are spatially transferred into the film, and which compete with the accelerated relaxation gradient induced by the vapor interface. Quantitative applications to the foundational hard sphere fluid and a polymer melt are presented. The strength of the effective fluid-substrate coupling has very large consequences on the dynamical gradients and film-averaged quantities in a film thickness and thermodynamic state dependent manner. The interference of the dynamical gradients of opposite nature emanating from the vapor and solid interfaces is determined, including the conditions for the disappearance of a bulk-like region in the film center. The relative importance of surface-induced modification of local caging versus the generic truncation of the long range collective elastic component of the activation barrier is studied. The conditions for the accuracy and failure of a simple superposition approximation for dynamical gradients in thin films are also determined.
The emergence of near substrate dead layers, large gradient effects on film-averaged response functions, and a weak non-monotonic evolution of dynamic gradients in thick and cold films, are briefly discussed. The connection of our theoretical results to simulations and experiments is briefly discussed, as is extension to treat more complex glass-forming systems under nanoconfinement.

Submitted 12 January, 2024; originally announced January 2024.

Comments: 22 pages, 5 figures, accepted for publication in Journal of Chemical Physics

arXiv:2312.12522 [pdf, other]

Searching Dark Photons using displaced vertices at Belle II -- with backgrounds

Authors: Joerg Jaeckel, Anh Vu Phan

Abstract: Dark photons in the MeV to GeV range with kinetic mixing of the order of $\lesssim 10^{-4}-10^{-3}$ can be produced in significant numbers at low energy colliders such as Belle II. Their decay length can be macroscopic raising the hope for a fairly clean search via displaced vertices as proposed in Ref. [1]. However, even this is not background free. Here, we calculate and discuss problematic backgrounds from displaced photon conversion and discuss their potential impact on the sensitivity. In addition we also briefly consider the dangers of prompt backgrounds.

Submitted 19 December, 2023; originally announced December 2023.

Comments: 21 pages, 8 figures

arXiv:2312.08075 [pdf, other]

TERM Model: Tensor Ring Mixture Model for Density Estimation

Authors: Ruituo Wu, Jiani Liu, Ce Zhu, Anh-Huy Phan, Ivan V. Oseledets, Yipeng Liu

Abstract: Efficient probability density estimation is a core challenge in statistical machine learning. Tensor-based probabilistic graph methods address interpretability and stability concerns encountered in neural network approaches. However, a substantial number of potential tensor permutations can lead to a tensor network with the same structure but varying expressive capabilities. In this paper, we adopt tensor ring decomposition for the density estimator, which significantly reduces the number of permutation candidates while enhancing expressive capability compared with existing decompositions. Additionally, a mixture model that incorporates multiple permutation candidates with adaptive weights is further designed, resulting in increased expressive flexibility and comprehensiveness. Different from the prevailing directions of tensor network structure/permutation search, our approach provides a new viewpoint inspired by ensemble learning. This approach acknowledges that suboptimal permutations can offer distinctive information besides that of optimal permutations. Experiments show the superiority of the proposed approach in estimating probability density for moderately dimensional datasets and sampling to capture intricate details.

Submitted 13 December, 2023; originally announced December 2023.

arXiv:2312.00872 [pdf, other]

doi 10.1007/JHEP05(2024)075

Precise tests of the axion coupling to tops

Authors: Anh Vu Phan, Susanne Westhoff

Abstract: We present an in-depth analysis of axions and axion-like particles in top-pair production at the LHC. Our main goal is to probe the axion coupling to top quarks at high energies. To this end, we calculate the top-antitop cross section and differential distributions including ALP effects up to one-loop level. By comparing these predictions with LHC precision measurements, we constrain the top coupling of axion-like particles with masses below the top-antitop threshold. Our results apply to all UV completions of the ALP effective theory with dominant couplings to top quarks, in particular to DFSZ-like axion models.

Submitted 7 June, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

Comments: 27 pages, 7 figures

Journal ref: JHEP 05 (2024) 075

arXiv:2310.10796 [pdf, ps, other]

Mixed Mode Oscillations in a Three-Timescale Coupled Morris-Lecar System

Authors: Ngoc Anh Phan, Yangyang Wang

Abstract: Mixed mode oscillations (MMOs) are complex oscillatory behaviors of multiple-timescale dynamical systems in which there is an alternation of large-amplitude and small-amplitude oscillations. It is well known that MMOs in two-timescale systems can arise either from a canard mechanism associated with folded node singularities or a delayed Andronov-Hopf bifurcation (DHB) of the fast subsystem. While MMOs in two-timescale systems have been extensively studied, less is known regarding MMOs emerging in three-timescale systems. In this work, we examine the mechanisms of MMOs in coupled Morris-Lecar neurons with three distinct timescales. We investigate two kinds of MMOs occurring in the presence of a singularity known as canard-delayed-Hopf (CDH) and in cases where CDH is absent. In both cases, we examine how features and mechanisms of MMOs vary with respect to variations in timescales. Our analysis reveals that MMOs supported by CDH demonstrate significantly stronger robustness than those in its absence. Moreover, we show that the mere presence of CDH does not guarantee the occurrence of MMOs. This work yields important insights into conditions under which the two separate mechanisms in two-timescale context, canard and DHB, can interact in a three-timescale setting and produce more robust MMOs, particularly against timescale variations.

Submitted 28 February, 2024; v1 submitted 16 October, 2023; originally announced October 2023.

arXiv:2308.04595 [pdf, other]

Quantization Aware Factorization for Deep Neural Network Compression

Authors: Daria Cherniuk, Stanislav Abukhovich, Anh-Huy Phan, Ivan Oseledets, Andrzej Cichocki, Julia Gusak

Abstract: Tensor decomposition of convolutional and fully-connected layers is an effective way to reduce parameters and FLOPs in neural networks. Due to memory and power consumption limitations of mobile or embedded devices, the quantization step is usually necessary when pre-trained models are deployed. A conventional post-training quantization approach applied to networks with decomposed weights yields a drop in accuracy. This motivated us to develop an algorithm that finds a tensor approximation directly with quantized factors and thus benefits from both compression techniques while keeping the prediction quality of the model. Namely, we propose to use the Alternating Direction Method of Multipliers (ADMM) for Canonical Polyadic (CP) decomposition with factors whose elements lie on a specified quantization grid. We compress neural network weights with the devised algorithm and evaluate its prediction quality and performance. We compare our approach to state-of-the-art post-training quantization methods and demonstrate competitive results and high flexibility in achieving a desirable quality-performance tradeoff.

Submitted 8 August, 2023; originally announced August 2023.

arXiv:2306.09822 [pdf, other]

Lightweight Attribute Localizing Models for Pedestrian Attribute Recognition

Authors: Ashish Jha, Dimitrii Ermilov, Konstantin Sobolev, Anh Huy Phan, Salman Ahmadi-Asl, Naveed Ahmed, Imran Junejo, Zaher AL Aghbari, Thar Baker, Ahmed Mohamed Khedr, Andrzej Cichocki

Abstract: Pedestrian Attribute Recognition (PAR) deals with the problem of identifying features in a pedestrian image. It has found interesting applications in person retrieval, suspect re-identification and soft biometrics. In the past few years, several Deep Neural Networks (DNNs) have been designed to solve the task; however, the developed DNNs predominantly suffer from over-parameterization and high computational complexity. These problems hinder them from being exploited in resource-constrained embedded devices with limited memory and computational capacity. By reducing a network's layers using effective compression techniques, such as tensor decomposition, neural network compression is an effective method to tackle these problems. We propose novel Lightweight Attribute Localizing Models (LWALM) for Pedestrian Attribute Recognition (PAR). LWALM is a compressed neural network obtained after effective layer-wise compression of the Attribute Localization Model (ALM) using the Canonical Polyadic Decomposition with Error Preserving Correction (CPD-EPC) algorithm.

Submitted 16 June, 2023; originally announced June 2023.

arXiv:2305.09564 [pdf, other]

Image Reconstruction using Superpixel Clustering and Tensor Completion

Authors: Maame G. Asante-Mensah, Anh Huy Phan, Salman Ahmadi-Asl, Zaher Al Aghbari, Andrzej Cichocki

Abstract: This paper presents a pixel selection method for compact image representation based on superpixel segmentation and tensor completion. Our method divides the image into several regions that capture important textures or semantics and selects a representative pixel from each region to store. We experiment with different criteria for choosing the representative pixel and find that the centroid pixel performs the best. We also propose two smooth tensor completion algorithms that can effectively reconstruct different types of images from the selected pixels. Our experiments show that our superpixel-based method achieves better results than uniform sampling for various missing ratios.

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2305.05030 [pdf, ps, other]

Adaptive Cross Tubal Tensor Approximation

Authors: Salman Ahmadi-Asl, Anh Huy Phan, Andrzej Cichocki, Anastasia Sozykina, Zaher Al Aghbari, Jun Wang, Ivan Oseledets

Abstract: In this paper, we propose a new adaptive cross algorithm for computing a low tubal rank approximation of third-order tensors, with less memory and lower computational complexity than the truncated tensor SVD (t-SVD). This makes it applicable for decomposing large-scale tensors. We conduct numerical experiments on synthetic and real-world datasets to confirm the efficiency and feasibility of the proposed algorithm. The simulation results show more than one order of magnitude acceleration in the computation of low tubal rank (t-SVD) for large-scale tensors. An application to pedestrian attribute recognition is also presented.

Submitted 11 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

arXiv:2305.00749 [pdf, ps, other]

Robust Low-Tubal-rank tensor recovery Using Discrete Empirical Interpolation Method with Optimized Slice/Feature Selection

Authors: Salman Ahmadi-Asl, Anh-Huy Phan, Cesar F. Caiafa, Andrzej Cichocki

Abstract: In this paper, we extend the Discrete Empirical Interpolation Method (DEIM) to the third-order tensor case based on the t-product and use it to select important/significant lateral and horizontal slices/features. The proposed Tubal DEIM (TDEIM) is investigated both theoretically and numerically. The experimental results show that the TDEIM can provide more accurate approximations than the existing methods. An application of the proposed method to the supervised classification task is also presented.

Submitted 7 May, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

arXiv:2303.07894 [pdf, other]

Photo-to-heat conversion of broadband metamaterial absorbers based on TiN nanoparticles under laser and solar illumination

Authors: Do T. Nga, Anh D. Phan, Thudsaphungthong Julie, Nam B. Le, Chu Viet Ha

Abstract: We theoretically investigate photothermal heating of ultra-flexible metamaterials, which are obtained by randomly mixing TiN nanoparticles in polydimethylsiloxane (PDMS). Due to the plasmonic properties of TiN nanoparticles, incident light is perfectly absorbed in a broadband range (300-3000 nm) to generate heat within these metamaterials. Under irradiation of an 808 nm near-infrared laser with different intensities, our predicted temperature rises as a function of time agree well with recent experimental data. For a given laser intensity, the temperature rise varies non-monotonically with the concentration of TiN nanoparticles because the enhancements of thermal conductivity and absorbed energy upon adding plasmonic nanostructures have opposite effects on the heating process. When the model is extended to solar heating, photothermal behaviors are qualitatively similar but the temperature increase is less than 13 K. Our studies would provide good guidance for future experimental studies on the photo-to-heat conversion of broadband perfect absorbers.

Submitted 14 March, 2023; originally announced March 2023.

Comments: 8 pages, 7 figures, accepted for publication in Materials Today Communications

arXiv:2209.08220 [pdf, other]

doi 10.1103/PhysRevB.106.094103

Theoretical predictions of melting behaviors of hcp iron up to 4000 GPa

Authors: Tran Dinh Cuong, Nguyen Quang Hoc, Nguyen Duc Trung, Nguyen Thi Thao, Anh D. Phan

Abstract: The high-pressure melting diagram of iron is a vital ingredient for the geodynamic modeling of planetary interiors. Nonetheless, available data for molten iron show an alarming discrepancy. Herein, we propose an efficient one-phase approach to capture the solid-liquid transition of iron under extreme conditions. Our basic idea is to extend the statistical moment method to determine the density of iron in the TPa region. On that basis, we adapt the work-heat equivalence principle to appropriately link equation-of-state parameters with melting properties. This strategy allows explaining cutting-edge experimental and ab initio results without massive computational workloads. Our theoretical calculations would be helpful to constrain the chemical composition, internal dynamics, and thermal evolution of the Earth and super-Earths.

Submitted 16 September, 2022; originally announced September 2022.

Journal ref: Phys. Rev. B 106, 094103 (2022)

arXiv:2208.08032 [pdf, other]

doi 10.1088/1361-648X/ac8b51

Screening and collective effects in randomly pinned fluids: A new theoretical framework

Authors: Anh D. Phan

Abstract: We propose a theoretical framework for the dynamics of bulk isotropic hard-sphere systems in the presence of randomly pinned particles and apply this theory to supercooled water to validate it. Structural relaxation is mainly governed by local and non-local activated processes. As the pinned fraction grows, a local caging constraint becomes stronger and the long range collective aspect of relaxation is screened by immobile obstacles. Different responses of the local and cooperative motions result in subtle predictions for how the alpha relaxation time varies with pinning and density. Our theoretical analysis for the relaxation time of water with pinned molecules describes previous simulations quantitatively well. In addition, the thermal dependence of relaxation for unpinned bulk water is also consistent with prior computational and experimental data.

Submitted 16 August, 2022; originally announced August 2022.

Comments: 7 pages, 4 figures, accepted for publication in Journal of Physics: Condensed Matter

arXiv:2207.12542 [pdf, other]

A Randomized Algorithm for Tensor Singular Value Decomposition using an Arbitrary Number of Passes

Authors: Salman Ahmadi-Asl, Anh-Huy Phan, Andrzej Cichocki

Abstract: Efficient and fast computation of a tensor singular value decomposition (t-SVD) with a few passes over the underlying data tensor is crucial because of its many potential applications. The current/existing subspace randomized algorithms need (2q+2) passes over the data tensor to compute a t-SVD, where q is a non-negative integer number (power iteration parameter). In this paper, we propose an efficient and flexible randomized algorithm that can handle any number of passes q, which need not be even. The flexibility of the proposed algorithm in using fewer passes naturally leads to lower computational and communication costs. This advantage makes it particularly appropriate when our task calls for several tensor decompositions or when the data tensors are huge. The proposed algorithm is a generalization of the methods developed for matrices to tensors. The expected/average error bound of the proposed algorithm is derived. Extensive numerical experiments on random and real-world data sets are conducted, and the proposed algorithm is compared with some baseline algorithms. The extensive computer simulation experiments demonstrate that the proposed algorithm is practical, efficient, and in general outperforms the state-of-the-art algorithms. We also demonstrate how to use the proposed method to develop a fast algorithm for the tensor completion problem.

Submitted 20 January, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

arXiv:2207.06072 [pdf, other]

Cross Tensor Approximation for Image and Video Completion

Authors: Salman Ahmadi-Asl, Maame Gyamfua Asante-Mensah, Andrzej Cichocki, Anh-Huy Phan, Ivan Oseledets, Jun Wang

Abstract: This paper proposes a general framework to use the cross tensor approximation or tensor ColUmn-Row (CUR) approximation for reconstructing incomplete images and videos. The key importance of the new algorithms is their simplicity and ease of implementation with low computational complexity. For the case of data tensors with 1) structural missing components or 2) a high missing rate, we propose efficient smooth tensor CUR algorithms which first make the sampled fibers smooth and then apply the proposed CUR algorithms. The numerical experiments show the significant benefit of this smoothing procedure. The main contribution of this paper is to develop/investigate improved multistage CUR algorithms with filtering (smoothing) preprocessing for tensor completion. The second contribution is a detailed comparison of the performance of image recovery for four different CUR strategies via extensive computer simulations. Our simulations clearly indicated that the proposed algorithms are much faster than most of the existing state-of-the-art algorithms developed for tensor completion, while performance is comparable and often even better. Furthermore, we will provide in GitHub the MATLAB codes which can be used for various applications. Moreover, to the best of our knowledge, the CUR (cross approximation) algorithms have not been investigated nor compared till now for image and video completion.

Submitted 9 January, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

arXiv:2205.06086 [pdf, other]

doi 10.1093/mnras/stac2832

The Tianlai dish array low-z surveys forecasts

Authors: Olivier Perdereau, Réza Ansari, Albert Stebbins, Peter T. Timbie, Xuelei Chen, Fengquan Wu, Jixia Li, John P. Marriner, Gregory S. Tucker, Yanping Cong, Santanu Das, Yichao Li, Yingfeng Liu, Christophe Magneville, Jeffrey B. Peterson, Anh Phan, Lily Robinthal, Shijie Sun, Yougang Wang, Yanlin Wu, Yidong Xu, Kaifeng Yu, Zijie Yu, Jiao Zhang, Juyong Zhang, et al. (1 additional author not shown)

Abstract: We present the science case for surveys with the Tianlai dish array interferometer tuned to the $\left[ 1300, 1400 \right] \mathrm{MHz}$ frequency range. Starting from a realistic generation of mock visibility data according to the survey strategy, we reconstruct a map of the sky and perform a foreground subtraction. We show that a survey of the North Celestial Polar cap during a year of observing time and covering an area of $150 \, \mathrm{deg^2}$ would reach a sensitivity of $1.5-2 \, \mathrm{mK}$ per $1 \, \mathrm{MHz} \times 0.25^2 \, \mathrm{deg^2}$ voxel and be marginally impacted by mode-mixing. Tianlai would be able to detect a handful $(\sim 10)$ of nearby massive HI clumps as well as a very strong cross-correlation signal of 21 cm intensity maps with the North Celestial Cap Survey optical galaxies. We have also studied the performance of a mid-latitude survey, covering $\sim 1500 \, \mathrm{deg^2}$ centered on a declination of $\delta=55^\circ$, which overlaps the Sloan Digital Sky Survey footprint. Despite a higher noise level for the mid-latitude survey, as well as significant distortions due to mode mixing, Tianlai would be able to detect a highly significant cross-correlation between the 21 cm signal and the Sloan spectroscopic galaxy sample. Using the extragalactic signals from either or both of these surveys, it will be possible to assess the impact of calibration uncertainties, antenna pattern uncertainties, sources of noise, and mode mixing for future surveys requiring higher sensitivity.

Submitted 12 May, 2022; originally announced May 2022.

Comments: 20 pages, 22 figures. Submitted to MNRAS

arXiv:2205.00293 [pdf, other]

TTOpt: A Maximum Volume Quantized Tensor Train-based Optimization and its Application to Reinforcement Learning

Authors: Konstantin Sozykin, Andrei Chertkov, Roman Schutski, Anh-Huy Phan, Andrzej Cichocki, Ivan Oseledets

Abstract: We present a novel procedure for optimization based on the combination of efficient quantized tensor train representation and a generalized maximum matrix volume principle. We demonstrate the applicability of the new Tensor Train Optimizer (TTOpt) method for various tasks, ranging from minimization of multidimensional functions to reinforcement learning. Our algorithm compares favorably to popular evolutionary-based methods and outperforms them by the number of function evaluations or execution time, often by a significant margin.

Submitted 28 September, 2022; v1 submitted 30 April, 2022; originally announced May 2022.

Comments: 26 pages, 8 figures, accepted to Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022). Pre camera-ready version

arXiv:2203.02617 [pdf, other]

How to Train Unstable Looped Tensor Network

Authors: Anh-Huy Phan, Konstantin Sobolev, Dmitry Ermilov, Igor Vorona, Nikolay Kozyrskiy, Petr Tichavsky, Andrzej Cichocki

Abstract: A rising problem in the compression of Deep Neural Networks is how to reduce the number of parameters in convolutional kernels and the complexity of these layers by low-rank tensor approximation. Canonical polyadic tensor decomposition (CPD) and Tucker tensor decomposition (TKD) are two solutions to this problem and provide promising results. However, CPD often fails due to degeneracy, making the networks unstable and hard to fine-tune. TKD does not provide much compression if the core tensor is big. This motivates using a hybrid model of CPD and TKD, a decomposition with multiple Tucker models with small core tensors, known as block term decomposition (BTD). This paper proposes a more compact model that further compresses the BTD by enforcing the core tensors in BTD to be identical. We establish a link between the BTD with shared parameters and a looped chain tensor network (TC). Unfortunately, such strongly constrained tensor networks (with loops) encounter severe numerical instability, as proved by (Landsberg, 2012) and (Handschuh, 2015a). We study perturbation of chain tensor networks, provide an interpretation of instability in TC, and demonstrate the problem. We propose novel methods to gain stability of the decomposition results, keep the network robust, and attain better approximation. Experimental results confirm the superiority of the proposed methods in compression of well-known CNNs and in TC decomposition under challenging scenarios.

Submitted 4 March, 2022; originally announced March 2022.

MSC Class: 65K05; 49M27

arXiv:2202.02158 [pdf, other]

doi 10.1039/D2CP00116K

Theoretical Insights into Non-Arrhenius Behaviors of Thermal Vacancies in Anharmonic Crystals

Authors: Tran Dinh Cuong, Anh D. Phan

Abstract: Vacancies are prevalent point defects in crystals, but their thermal responses are elusive. Herein, we formulate a simple theoretical model to shed light on the vacancy evolution during heating. Vibrational excitations are thoroughly investigated via moment recurrence techniques in quantum statistical mechanics. On that basis, we carry out numerical analyses for Ag, Cu, and Ni with the Sutton-Chen many-body potential. Our results reveal that the well-known Arrhenius law is insufficient to describe the proliferation of vacancies. Specifically, anharmonic effects lead to a strong nonlinearity in the Gibbs energy of vacancy formation. Our physical picture is well supported by previous simulations and experiments.

Submitted 4 February, 2022; originally announced February 2022.

Comments: Accepted for publication in Physical Chemistry Chemical Physics 2022

arXiv:2202.00218 [pdf, other]

Confinement effects on the spatially inhomogeneous dynamics in metallic glass films

Authors: Anh D. Phan

Abstract: We develop the Elastically Collective Nonlinear Langevin Equation theory to investigate, for the first time, glassy dynamics in capped metallic glass thin films. Finite-size effects on the spatial gradient of structural relaxation time and glass transition temperature (Tg) are calculated at different temperatures and vitrification criteria. Molecular dynamics is significantly slowed down near rough solid surfaces, and the dynamics at locations far from the interfaces is sped up. In thick films, the mobility gradient normalized by the bulk value obeys the double-exponential form well, since interference effects between the two surfaces are weak. Reducing the film thickness induces a strong dynamic coupling between the two surfaces and flattens the relaxation gradient. The normalized gradient of the glass transition temperature is independent of the vitrification timescale criterion and can be fitted by a superposition function as the films are not ultra-thin. The local fragility is found to remain unchanged with location. This finding suggests that one can use Angell plots of bulk relaxation time and the Tg spatial gradient to characterize glassy dynamics in metallic glass films. Our computational results agree well with experimental data and simulation.

Submitted 31 January, 2022; originally announced February 2022.

Comments: 8 pages, 6 figures, accepted for publication in Journal of Physical Chemistry B

arXiv:2112.14390 [pdf, other]

doi 10.1002/pssr.202100496

Tailoring Drug Mobility by Photothermal Heating of Graphene Plasmons

Authors: Anh D. Phan, Nguyen K. Ngan, Do T. Nga, Nam B. Le, Chu Viet Ha

Abstract: We propose a theoretical approach to quantitatively determine the photothermally driven enhancement of molecular mobility of graphene-indomethacin mixtures under infrared laser irradiation. Graphene plasmons absorb incident electromagnetic energy and dissipate it as heat. The absorbed energy depends on the optical properties of graphene plasmons, which are sensitive to structural parameters, and on the concentration of plasmonic nanostructures. Using a theoretical model, we calculate temperature gradients of the bulk drug with different concentrations of graphene plasmons. From these, we determine the temperature dependence of structural molecular relaxation and diffusion of indomethacin and find how the heating process significantly enhances the drug mobility.

Submitted 28 December, 2021; originally announced December 2021.

Comments: 6 pages, 4 figures, accepted for publication in Rapid Research Letters

arXiv:2112.08449 [pdf, other]

Kernel Matrix Completion for Offline Quantum-Enhanced Machine Learning

Authors: Annie Naveh, Imogen Fitzgerald, Anna Phan, Andrew Lockwood, Travis L. Scholten

Abstract: Enhancing classical machine learning (ML) algorithms through quantum kernels is a rapidly growing research topic in quantum machine learning (QML). A key challenge in using kernels -- both classical and quantum -- is that ML workflows involve acquiring new observations, for which new kernel values need to be calculated. Transferring data back and forth between where the new observations are generated and a quantum computer incurs a time delay; this delay may exceed the timescales relevant for using the QML algorithm in the first place. In this work, we show quantum kernel matrices can be extended to incorporate new data using a classical (chordal-graph-based) matrix completion algorithm. The minimal sample complexity needed for perfect completion is dependent on matrix rank. We empirically show that (a) quantum kernel matrices can be completed using this algorithm when the minimal sample complexity is met, (b) the error of the completion degrades gracefully in the presence of finite-sampling noise, and (c) the rank of quantum kernel matrices depends weakly on the expressibility of the quantum feature map generating the kernel. Further, on a real-world, industrially-relevant data set, the completion error behaves gracefully even when the minimal sample complexity is not reached.

Submitted 15 December, 2021; originally announced December 2021.

arXiv:2110.01037 [pdf, other]

Effects of surface charge and environmental factors on the electrostatic interaction of fiber with virus-like particle: A case of coronavirus

Authors: D. N. Dung, A. D. Phan, T. T. Nguyen, V. D. Lam

Abstract: We propose a theoretical model to elucidate intermolecular electrostatic interactions between a virus and a substrate. Our model treats the virus as a homogeneous particle having surface charge and the polymer fiber of the respirator as a charged plane. Electric potentials surrounding the virus and fiber are influenced by the surface charge distribution of the virus. We use Poisson-Boltzmann equations to calculate electric potentials. Then, Derjaguin's approximation and a linear superposition of the potential function are extended to determine the electrostatic force. We apply this model to the case of the coronavirus SARS-CoV-2, and the numerical results agree quantitatively with prior simulation. We find that the influence of the fiber's potential on the surface charge of the virus is important, and we include it in the interaction calculations to obtain better accuracy. The electrostatic interaction decays significantly with increasing separation distance, and this curve becomes steeper when more salt is added. Although the interaction force increases with heating, one can observe the repulsive-attractive transition when the environment is acidic.

Submitted 3 October, 2021; originally announced October 2021.

Comments: accepted for publication in AIP Advances

arXiv:2109.12623 [pdf]

Phase equilibria -- thermal conductivity relationship within multicomponent Phase Change Materials from 273 K up to above the melting temperature

Authors: Anh Thu Phan, Aïmen E. Gheribi, Patrice Chartrand

Abstract: Among all the properties required for the design of the next generation of PCM (density, heat capacity, thermal expansion, latent energy, volume change upon melting, corrosion rate, etc.) the thermal transport properties are by far the least known, especially for molten salt mixtures and solid solutions. We present in this paper a theoretical framework for accurate predictions of thermal conductivity of multicomponent salt-based PCM, from 273.15 K up to above melting temperature. The solid phase is considered as a microstructure with its proper temperature dependent parameters: phase volume fraction, grain size distribution, porosity, etc. As case studies, five new potential PCMs for CSP applications are considered. Their thermal conductivity is estimated as a function of temperature, from room temperature to 200 K above their melting point. The predictive capability of the proposed framework is discussed based on a comparison with available experimental data. The effect of equilibrium and non-equilibrium microstructural parameters (i.e. phase fraction, phase composition, average grain size, inter-grain, and intra-grain porosity) on the effective thermal conductivity of the solid states of the promising chloride PCMs is discussed. Lastly, recommendations for the design of next generations of PCM materials are suggested in order to improve their thermal transport properties.

Submitted 26 September, 2021; originally announced September 2021.

arXiv:2108.10088 [pdf, other]

Loop-corrected Higgs Masses in the NMSSM with Inverse Seesaw Mechanism

Authors: Thi Nhung Dao, Margarete Mühlleitner, Anh Vu Phan

Abstract: In this study, we work in the framework of the Next-to-Minimal Supersymmetric extension of the Standard Model (NMSSM) extended by six singlet leptonic superfields. Through the mixing with the three doublet leptonic superfields, the non-zero tiny neutrino masses can be generated through the inverse seesaw mechanism. While $R$-parity is conserved in this model, lepton number is explicitly violated. We quantify the impact of the extended neutrino sector on the NMSSM Higgs sector by computing the complete one-loop corrections with full momentum dependence to the Higgs boson masses in a mixed on-shell-$\overline{\mbox{DR}}$ renormalization scheme, with and without the inclusion of CP violation. The results are consistently combined with the dominant two-loop corrections at ${\cal O}(α_t(α_s+α_t))$ to improve the predictions for the Higgs mixing and the loop-corrected masses. In our numerical study we include the constraints from the Higgs data, the neutrino oscillation data, the charged lepton flavor-violating decays $l_i \to l_j + γ$, and the new physics constraints from the oblique parameters $S,T,U$. We present in this context the one-loop decay width for $l_i \to l_j + γ$. The loop-corrected Higgs boson masses are included in the Fortran code NMSSMCALC-nuSS.

Submitted 23 August, 2021; originally announced August 2021.

Comments: 41 pages, 9 figures

Report number: IFIRSE-TH-2021-3, KA-TP-15-2021

arXiv:2107.13410 [pdf, other]

Toward a better understanding of activation volume and dynamic decoupling of glass-forming liquids under compression

Authors: Anh D. Phan, Nguyen K. Ngan, Nam B. Le, Le T. M. Thanh

Abstract: We theoretically investigate physical properties of the pressure-induced activation volume and dynamic decoupling of ternidazole, glycerol, and probucol by the Elastically Collective Nonlinear Langevin Equation theory. Based on the predicted temperature dependence of activated relaxation under various compression, the activation volume is determined to characterize effects of pressure on molecular dynamics of materials. We find that the decoupling of the structural relaxation time of compressed systems from their bulk uncompressed value is governed by the power-law rule. The decoupling exponent exponentially grows with pressure below 2 GPa. The decoupling exponent and activation volume are intercorrelated and have a connection with the differential activation free energy. We numerically and mathematically analyze relationships among these quantities to explain many results in previous experiments and simulations.

Submitted 28 July, 2021; originally announced July 2021.

Comments: accepted for publication in Macromolecular Theory and Simulations

arXiv:2106.10902 [pdf, ps, other]

doi 10.1140/epjb/s10051-021-00176-x

Electronic transport in two-dimensional strained Dirac materials under multi-step Fermi velocity barrier: transfer matrix method for supersymmetric systems

Authors: Anh-Luan Phan, Dai-Nam Le

Abstract: In recent years, graphene and other two-dimensional Dirac materials like silicene, germanene, etc. have been studied from different points of view: from mathematical physics, condensed matter physics to high energy physics. In this study, we utilize both supersymmetric quantum mechanics (SUSY-QM) and transfer matrix method (TTM) to examine electronic transport in two-dimensional Dirac materials under the influences of multi-step deformation as well as multi-step Fermi velocity barrier. The effects of multi-step effective mass and multi-step applied fields are also taken into account in our investigation. Results show the possibility of modulating the Klein tunneling of Dirac electron by using strain or electric field.

Submitted 15 October, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

Comments: 22 pages, 7 figures, published in European Physical Journal B

Journal ref: Eur. Phys. J. B 94 (2021) 165

arXiv:2106.01782 [pdf, other]

Machine learning models for DOTA 2 outcomes prediction

Authors: Kodirjon Akhmedov, Anh Huy Phan

Abstract: Prediction of match outcomes in real-time multiplayer online battle arena (MOBA) games is one of the most important and exciting tasks in Esports analytical research. This research paper predominantly focuses on building predictive machine and deep learning models to identify the outcome of the Dota 2 MOBA game using the new method of multi-forward steps predictions. Three models were investigated and compared: Linear Regression (LR), Neural Networks (NN), and a type of recurrent neural network, Long Short-Term Memory (LSTM). To achieve these goals, we developed a data-collecting Python server using Game State Integration (GSI) to track the real-time data of the players. Once the exploratory feature analysis and hyper-parameter tuning were done, our models were tested on different players with dissimilar levels of playing experience. The achieved accuracy scores depend on the multi-forward prediction parameters: linear regression reaches 69% in the worst case and 82% on average, while the deep learning models reach the highest average prediction accuracy, 88% for NN and 93% for LSTM.

Submitted 3 June, 2021; originally announced June 2021.

Comments: 11 pages, 12 figures, the paper will be published in IEEE Transactions on Games Journal

arXiv:2105.13007 [pdf, other]

doi 10.1002/pssr.202100235

Impact of high pressure on reversible structural relaxation of metallic glass

Authors: Nguyen K. Ngan, Anh D. Phan, Alessio Zaccone

Abstract: We theoretically investigate the temperature dependence of the reversible structural relaxation time and diffusion constant of metallic glasses under pressure. The compression not only changes the glassy dynamics, but also generates a metastable state along with a higher-energy state where the system can rejuvenate. The relaxation times for forward and backward transitions in this two-state system are nearly identical and much faster than the relaxation time without accounting for barrier-recrossing. At ambient pressure, the expected irreversible relaxation process is recovered, and our numerical results agree well with prior experimental results. An increase of pressure has a minor effect on the relaxation time and diffusion constant that one computes without considering the influence of the metastable state, but it leads to a large reduction of the reversible relaxation time computed upon taking the metastable state into account. The presence of external compression is also shown to trigger a fragile-to-strong crossover in metallic glasses.

Submitted 27 May, 2021; originally announced May 2021.

Comments: This paper has been accepted for publication in Physica Status Solidi (RRL) - Rapid Research Letters

arXiv:2105.08299 [pdf, other]

doi 10.3847/1538-4365/ac01e7

Infrared Surface Brightness Fluctuation Distances for MASSIVE and Type Ia Supernova Host Galaxies

Authors: Joseph B. Jensen, John P. Blakeslee, Chung-Pei Ma, Peter A. Milne, Peter J. Brown, Michele Cantiello, Peter M. Garnavich, Jenny E. Greene, John R. Lucey, Anh Phan, R. Brent Tully, Charlotte M. Wood

Abstract: We measured high-quality surface brightness fluctuation (SBF) distances for a sample of 63 massive early-type galaxies using the WFC3/IR camera on the Hubble Space Telescope. The median uncertainty on the SBF distance measurements is 0.085 mag, or 3.9% in distance. Achieving this precision at distances of 50 to 100 Mpc required significant improvements to the SBF calibration and data analysis procedures for WFC3/IR data. Forty-two of the galaxies are from the MASSIVE Galaxy Survey, a complete sample of massive galaxies within ~100 Mpc; the SBF distances for these will be used to improve the estimates of the stellar and central supermassive black hole masses in these galaxies. Twenty-four of the galaxies are Type Ia supernova hosts, useful for calibrating SN Ia distances for early-type galaxies and exploring possible systematic trends in the peak luminosities. Our results demonstrate that the SBF method is a powerful and versatile technique for measuring distances to galaxies with evolved stellar populations out to 100 Mpc and constraining the local value of the Hubble constant.

Submitted 18 May, 2021; originally announced May 2021.

Comments: Accepted for publication in Astrophysical Journal Supplement Series; 22 pages, 7 figures, with 61 additional figures to be published as an online figure set

arXiv:2105.07126 [pdf, other]

doi 10.1093/mnras/stac618

AlgoSCR: An algorithm for Solar Contamination Removal from radio interferometric data

Authors: Anh Phan, Santanu Das, Albert Stebbins, Peter Timbie, Reza Ansari, Shifan Zuo, Jixia Li, Trevor Oxholm, Fengquan Wu, Xuelei Chen, Shijie Sun, Yougang Wang, Jiao Zhang

Abstract: Hydrogen intensity mapping is a new field in astronomy that promises to make three-dimensional maps of the matter distribution of the Universe using the redshifted $21\,\textrm{cm}$ line of neutral hydrogen gas (HI). Several ongoing and upcoming radio interferometers, such as Tianlai, CHIME, HERA, HIRAX, etc. are using this technique. These instruments are designed to map large swaths of the sky by drift scanning over periods of many months. One of the challenges of the observations is that the daytime data is contaminated by strong radio signals from the Sun. In the case of Tianlai, this results in almost half of the measured data being unusable. We try to address this issue by developing an algorithm for solar contamination removal (AlgoSCR) from the radio data. The algorithm is based on an eigenvalue analysis of the visibility matrix, and hence is applicable only to interferometers. We apply AlgoSCR to simulated visibilities, as well as real daytime data from the Tianlai dish array. The algorithm can remove most of the solar contamination without seriously affecting other sky signals and thus makes the data usable for certain applications.

Submitted 14 May, 2021; originally announced May 2021.

arXiv:2104.01347 [pdf, other]

Theoretical Model for the High-Pressure Melting Process of MgO with the B1 Structure

Authors: Tran Dinh Cuong, Anh D. Phan

Abstract: MgO is an abundant mineral in the rocky mantle of terrestrial planets, but its melting behaviors remain enigmatic. Here we introduce a simple theoretical model to investigate the B1-liquid transition of MgO up to 370 GPa. Vibrational free energies of B1-MgO are fully computed by the moment recurrence technique in quantum statistical physics. On that basis, we associate the melting temperature with the isothermal bulk modulus via the work-heat equivalence principle. This strategy allows us to quantitatively explain recent experimental data. Our numerical analyses would yield insights into planetary dynamics and evolution.

Submitted 5 April, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

Comments: accepted for publication in Vacuum 2021

arXiv:2103.04246 [pdf, other]

RNA Alternative Splicing Prediction with Discrete Compositional Energy Network

Authors: Alvin Chan, Anna Korsakova, Yew-Soon Ong, Fernaldo Richtia Winnerdy, Kah Wai Lim, Anh Tuan Phan

Abstract: A single gene can encode for different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene's primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose discrete compositional energy network (DCEN) which leverages the hierarchical relationships between splice sites, junctions and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through its constituent splice junctions' energy values. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD, we show that DCEN outperforms baselines and ablation variants.

Submitted 6 March, 2021; originally announced March 2021.

Comments: ACM CHIL 2021 Camera-Ready

arXiv:2012.10023 [pdf, other]

Efficient Analytical Approach for High-Pressure Melting Properties of Iron

Authors: Tran Dinh Cuong, Anh D. Phan

Abstract: Iron represents the principal constituent of the Earth's core, but its high-pressure melting diagram remains ambiguous. Here we present a simple analytical approach to predict the melting properties of iron under deep-Earth conditions. In our model, anharmonic free energies of the solid phase are directly determined by the moment expansion technique in quantum statistical mechanics. Combined with the Lindemann criterion for a vibrational instability, this basis allows us to deduce the melting temperature. Moreover, we correlate the thermal expansion process with the shear response to explain a discontinuity of atomic volume, enthalpy, and entropy upon melting. Our numerical calculations are quantitatively consistent with recent experiments and simulations. The obtained results would improve understanding of the Earth's structure, dynamics, and evolution.

Submitted 17 December, 2020; originally announced December 2020.

Comments: 8 pages, 6 figures, accepted for publication on Vacuum 2020

arXiv:2012.09629 [pdf, other]

doi 10.1109/QCE52317.2021.00058

Teaching quantum computing with an interactive textbook

Authors: James R. Wootton, Francis Harkins, Nicholas T. Bronn, Almudena Carrera Vazquez, Anna Phan, Abraham T. Asfaw

Abstract: Quantum computing is a technology that promises to offer significant advantages during the coming decades. Though the technology is still in a prototype stage, the last few years have seen many of these prototype devices become accessible to the public. This has been accompanied by the open-source development of the software required to use and test quantum hardware in increasingly sophisticated ways. Such tools provide new education opportunities, not just for quantum computing specifically, but also more broadly for quantum information science and even quantum physics as a whole. In this paper we present a case study of one education resource which aims to take advantage of the opportunities: the open-source online textbook `Learn Quantum Computation using Qiskit'. An overview of the topics covered is given, as well as an explanation of the approach taken for each.

Submitted 16 December, 2020; originally announced December 2020.

Journal ref: 2021 IEEE International Conference on Quantum Computing and Engineering

arXiv:2011.05946 [pdf, other]

doi 10.1093/mnras/stab1802

The Tianlai Dish Pathfinder Array: design, operation and performance of a prototype transit radio interferometer

Authors: Fengquan Wu, Jixia Li, Shifan Zuo, Xuelei Chen, Santanu Das, John P. Marriner, Trevor M. Oxholm, Anh Phan, Albert Stebbins, Peter T. Timbie, Reza Ansari, Jean-Eric Campagne, Zhiping Chen, Yanping Cong, Qizhi Huang, Yichao Li, Tao Liu, Yingfeng Liu, Chenhui Niu, Calvin Osinga, Olivier Perdereau, Jeffrey B. Peterson, Huli Shi, Gage Siebert, Shijie Sun, et al. (12 additional authors not shown)

Abstract: The Tianlai Dish Pathfinder Array is a radio interferometer designed to test techniques for 21~cm intensity mapping in the post-reionization universe as a means for measuring large-scale cosmic structure. It performs drift scans of the sky at constant declination. We describe the design, calibration, noise level, and stability of this instrument based on the analysis of about $\sim 5 \%$ of 6,200 hours of on-sky observations through October, 2019. Beam pattern determinations using drones and the transit of bright sources are in good agreement, and compatible with electromagnetic simulations. Combining all the baselines, we make maps around bright sources and show that the array behaves as expected. A few hundred hours of observations at different declinations have been used to study the array geometry and pointing imperfections, as well as the instrument noise behaviour. We show that the system temperature is below 80~K for most feed antennas, and that noise fluctuations decrease as expected with integration time, at least up to a few hundred seconds. Analysis of long integrations, from 10 nights of observations of the North Celestial Pole, yielded visibilities with amplitudes of 20-30~mK, consistent with the expected signal from the NCP radio sky with $<10\,$mK precision for $1 ~\mathrm{MHz} \times 1~ \mathrm{min}$ binning. Hi-pass filtering the spectra to remove smooth spectrum signal yields a residual consistent with zero signal at the $0.5\,$mK level.

Submitted 27 June, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: 30 pages, 38 figures

arXiv:2011.05695 [pdf, other]

Cooperative nanoparticle self-assembly and photothermal heating in a flexible plasmonic metamaterial

Authors: Anh D. Phan, Vu D. Lam, Katsunori Wakabayashi

Abstract: We theoretically investigate equilibrium behaviors and photothermal effects of a flexible plasmonic metamaterial composed of aramid nanofibers and gold nanoparticles. The fiber matrix is considered as an external field to reconfigure a nanoparticle assembly. We find that the heating process tunes particle-particle and fiber-particle interactions, which alter adsorption of nanoparticles on fiber surfaces or clustering in pore spaces. Thus, it is possible to control the nanoparticle self-assembly by laser illumination. Gold nanoparticles strongly absorb radiations and efficiently dissipate absorbed energy into heat. By solving the heat transfer equation associated with an effective medium approximation, we calculate the spatial temperature rise. Remarkably, our theoretical results quantitatively agree with prior experiments. This indicates that we can ignore plasmonic coupling effects induced by particle clustering. Effects of the laser spot size and intensity on the photothermal heating are also discussed.

Submitted 11 November, 2020; originally announced November 2020.

Comments: 7 pages, 4 figures, accepted for publication in RSC Advances

arXiv:2010.15408 [pdf, other]

doi 10.1021/acs.jpcb.0c05523

Determination of Young's modulus of active pharmaceutical ingredients by relaxation dynamics at elevated pressures

Authors: Anh D. Phan

Abstract: A new approach is theoretically proposed to study the glass transition of active pharmaceutical ingredients and a glass-forming anisotropic molecular liquid at high pressures. We describe amorphous materials as a fluid of hard spheres. Effects of nearest-neighbor interactions and cooperative motions of particles on glassy dynamics are quantified through a local and collective elastic barrier calculated using the Elastically Collective Nonlinear Langevin Equation theory. Inserting two barriers into Kramer's theory gives structural relaxation time. Then, we formulate a new mapping based on the thermal expansion process under pressure to intercorrelate particle density, temperature, and pressure. This analysis allows us to determine the pressure and temperature dependence of alpha relaxation. From this, we estimate an effective elastic modulus of amorphous materials and capture effects of conformation on the relaxation process. Remarkably, our theoretical results agree well with experiments.

Submitted 29 October, 2020; originally announced October 2020.

Comments: 7 pages, 4 figures, accepted for publication in Journal of Physical Chemistry B

arXiv:2009.14039 [pdf, other]

A tidal lung simulation to quantify lung heterogeneity with the Inspired Sinewave Test

Authors: Minh C. Tran, Douglas C. Crockett, Phi A. Phan, Stephen J. Payne, Andrew D. Farmery

Abstract: We have created a lung simulation to quantify lung heterogeneity from the results of the inspired sinewave test (IST). The IST is a lung function test that is non-invasive, non-ionising and does not require patients' cooperation. A tidal lung simulation is developed to assess this test and also a method is proposed to calculate lung heterogeneity from IST results. A sensitivity analysis based on the Morris method and linear regression were applied to verify and to validate the simulation. Additionally, simulated emphysema and pulmonary embolism conditions were created using the simulation to assess the ability of the IST to identify these conditions. Experimental data from five pigs (pre-injured vs injured) were used for validation. This paper contributes to the development of the IST. Firstly, our sensitivity analysis reveals that the IST is highly accurate with an underestimation of about 5% of the simulated values. Sensitivity analysis suggested that both instability in tidal volume and extreme expiratory flow coefficients during the test cause random errors in the IST results. Secondly, the ratios of IST results obtained at two tracer gas oscillation frequencies can identify lung heterogeneity (ELV60/ELV180 and Qp60/Qp180). There was dissimilarity between simulated emphysema and pulmonary embolism (p < 0.0001). In the animal model, the control group had ELV60/ELV180 = 0.58 compared with 0.39 in injured animals (p < 0.0001).

Submitted 29 September, 2020; originally announced September 2020.

Journal ref: IEEE, EMBC2020

arXiv:2009.10201 [pdf, other]

doi 10.1039/D0CP02761H

Coupling between structural relaxation and diffusion in glass-forming liquids under pressure variation

Authors: Anh D. Phan, Kajetan Koperwas, Marian Paluch, Katsunori Wakabayashi

Abstract: We theoretically investigate structural relaxation and activated diffusion of glass-forming liquids at different pressures using both the Elastically Collective Nonlinear Langevin Equation (ECNLE) theory and molecular dynamics (MD) simulation. An external pressure restricts local motions of a single molecule within its cage and triggers the slowing down of cooperative mobility. While the ECNLE theory and simulation generally predict a monotonic increase of the glass transition temperature and dynamic fragility with pressure, the simulation indicates a decrease of fragility at pressures above 1000 bar. The structural relaxation time is found to be linearly coupled with the inverse diffusion constant. Remarkably, this coupling is independent of compression. Theoretical calculations agree quantitatively well with simulations and are also consistent with prior works.

Submitted 21 September, 2020; originally announced September 2020.

Comments: 7 pages, 5 figures

arXiv:2008.05441 [pdf, other]

Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network

Authors: Anh-Huy Phan, Konstantin Sobolev, Konstantin Sozykin, Dmitry Ermilov, Julia Gusak, Petr Tichavsky, Valeriy Glukhov, Ivan Oseledets, Andrzej Cichocki

Abstract: Most state of the art deep neural networks are overparameterized and exhibit a high computational cost. A straightforward approach to this problem is to replace convolutional kernels with their low-rank tensor approximations, whereas the Canonical Polyadic tensor Decomposition is one of the most suited models. However, fitting the convolutional tensors by numerical optimization algorithms often encounters diverging components, i.e., extremely large rank-one tensors but canceling each other. Such degeneracy often causes non-interpretable results and numerical instability for the neural network fine-tuning. This paper is the first study on degeneracy in the tensor decomposition of convolutional kernels. We present a novel method, which can stabilize the low-rank approximation of convolutional kernels and ensure efficient compression while preserving the high-quality performance of the neural networks. We evaluate our approach on popular CNN architectures for image classification and show that our method results in much lower accuracy degradation and provides consistent performance.

Submitted 12 August, 2020; originally announced August 2020.

Comments: This paper is accepted to ECCV2020

arXiv:2007.15524 [pdf, other]

doi 10.1103/PhysRevLett.126.025502

Theory of pressure-induced rejuvenation and strain-hardening in metallic glasses

Authors: Anh D. Phan, Alessio Zaccone, Vu D. Lam, Katsunori Wakabayashi

Abstract: We theoretically investigate high-pressure effects on the atomic dynamics of metallic glasses. The theory predicts compression-induced rejuvenation and the resulting strain hardening that have been recently observed in metallic glasses. Structural relaxation under pressure is mainly governed by local cage dynamics. The external pressure restricts the dynamical constraints and slows down the atomic mobility. In addition, the compression induces a rejuvenated metastable state (local minimum) at a higher energy in the free energy landscape. Thus, compressed metallic glasses can rejuvenate and the corresponding relaxation is reversible. This behavior leads to strain hardening in mechanical deformation experiments. Theoretical predictions agree well with experiments.

Submitted 24 October, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

Journal ref: Phys. Rev. Lett. 126, 025502 (2021)

arXiv:2007.11760 [pdf, other]

Applications of mesoporous silica encapsulated gold nanorods loaded doxorubicin in chemo-photothermal therapy

Authors: Nghiem Thi Ha Lien, Anh D. Phan, Bui Thi Van Khanh, Nguyen Thi Thuy, Nguyen Trong Nghia, Hoang Thi My Nhung, Tran Hong Nhung, Do Quang Hoa, Vu Duong, Nguyen Minh Hue

Abstract: We investigate chemo-photothermal effects of gold nanorods (GNRs) coated using mesoporous silica (mSiO2) loading doxorubicin (DOX). When the mesoporous silica layer is embedded by doxorubicin drugs, a significant change in absorption spectra enables quantification of the drug loading. We carry out photothermal experiments on saline and livers of mice having GNRs@mSiO2 and GNRs@mSiO2-DOX. We also inject the gold nanostructures into many tumor-implanted mice and use laser illumination on some of them. By measuring weight and size of tumors, the distinct efficiency of photothermal therapy and chemotherapy on treatment is determined. We experimentally confirm the accumulation of gold nanostructures in liver.

Submitted 22 July, 2020; originally announced July 2020.

Comments: 8 pages, 6 figures, accepted for publication

arXiv:2007.11356 [pdf, other]

Enhanced solar photothermal effect of PANi fabrics with plasmonic nanostructures

Authors: Do T. Nga, Anh D. Phan, Vu D. Lam, Lilia M. Woods, Katsunori Wakabayashi

Abstract: The photothermal energy conversion in hanging and floating polyaniline (PANi)-cotton fabrics is investigated using a model based on the heat diffusion equation. Perfect absorption and anti-reflection of wet hanging PANi-cotton fabrics cause quick transfer of total incident light into water confining nearly 100 $\%$ of the sunlight. As a result, a hanging membrane is found to have more attractive properties than a floating above water fabric. We find, however, that the photothermal properties of a floating PANi-cotton membrane can greatly be enhanced by dispersing TiN nanoparticles in the water below the fabric. The calculated temperature gradients for TiN nanoparticle solutions show that the absorbed energy grows with increasing the nanoparticle density and that the photothermal process occurs mostly near the surface. The collective heating effects depend on the size and density of nanoparticles, which can further be used to modulate the photothermal process.

Submitted 22 July, 2020; originally announced July 2020.

Comments: 7 pages, 5 figures, accepted for publication in RSC Advances

arXiv:2006.14540 [pdf, other]

Graph Convolutional Neural Networks for analysis of EEG signals, BCI application

Authors: Mirfarid Musavian Ghazani, Anh Huy Phan

Abstract: Decoding brain signals has gained much attention and has found many applications in recent years, such as Brain Computer Interfaces. Communicating with and controlling external devices using the user's intentions is an emerging field with the potential of changing the world, with diverse applications from rehabilitation to human augmentation. That being said, brain signal analysis, EEG brain signal analysis in particular, is a challenging task. With the advances and achievements of deep learning in solving problems using only raw data, a few attempts have been made in recent years to apply deep learning to EEG, among other types of brain signals. In this study, we propose a novel loss function, called DeepCSP, which extends the classical Common Spatial Patterns to a non-linear, differentiable module serving as the loss function. It enforces linearly separable latent representations of EEG signals belonging to different classes in an end-to-end manner on raw signals, without the need to perform extensive feature engineering. With recent generalizations of deep learning methods to work on arbitrarily structured graphs and the introduced loss, we have proposed two lightweight models to decode EEG signals and carried out experiments to show their performance.

Submitted 16 June, 2020; originally announced June 2020.

Comments: 11 pages, 5 figures

arXiv:2006.08878 [pdf, other]

CNN Acceleration by Low-rank Approximation with Quantized Factors

Authors: Nikolay Kozyrskiy, Anh-Huy Phan

Abstract: Although modern convolutional neural networks achieve great results in solving complex computer vision tasks, they still cannot be used effectively in mobile and embedded devices due to the strict requirements for computational complexity, memory and power consumption. The CNNs have to be compressed and accelerated before deployment. To solve this problem, a novel approach combining two known methods, low-rank tensor approximation in Tucker format and quantization of weights and feature maps (activations), is proposed. Greedy one-step and multi-step algorithms for the task of multilinear rank selection are proposed. An approach for quality restoration after applying Tucker decomposition and quantization is developed. The efficiency of our method is demonstrated for ResNet18 and ResNet34 on CIFAR-10, CIFAR-100 and ImageNet classification tasks. In a comparative analysis against other methods for compression and acceleration, our approach showed promising features.

Submitted 15 June, 2020; originally announced June 2020.

arXiv:2005.14506 [pdf, other]

Deep convolutional tensor network

Authors: Philip Blagoveschensky, Anh Huy Phan

Abstract: Neural networks have achieved state of the art results in many areas, supposedly due to parameter sharing, locality, and depth. Tensor networks (TNs) are linear algebraic representations of quantum many-body states based on their entanglement structure. TNs have found use in machine learning. We devise a novel TN based model called Deep convolutional tensor network (DCTN) for image classification, which has parameter sharing, locality, and depth. It is based on the Entangled plaquette states (EPS) TN. We show how EPS can be implemented as a backpropagatable layer. We test DCTN on MNIST, FashionMNIST, and CIFAR10 datasets. A shallow DCTN performs well on MNIST and FashionMNIST and has a small parameter count. Unfortunately, depth increases overfitting and thus decreases test accuracy. Also, DCTN of any depth performs badly on CIFAR10 due to overfitting. It is to be determined why. We discuss how the hyperparameters of DCTN affect its training and overfitting.

Submitted 14 November, 2020; v1 submitted 29 May, 2020; originally announced May 2020.

Comments: 14 pages, 18 figures, to be published in the proceedings of NeurIPS 2020 Quantum tensor networks in machine learning workshop

ACM Class: I.5.1

Showing 1–50 of 247 results for author: Phan, A