-
Realization of Conditional Operations through Transition Pathway Engineering
Authors:
Sheng Zhang,
Peng Duan,
Yun-Jie Wang,
Tian-Le Wang,
Peng Wang,
Ren-Ze Zhao,
Xiao-Yan Yang,
Ze-An Zhao,
Liang-Liang Guo,
Yong Chen,
Hai-Feng Zhang,
Lei Du,
Hao-Ran Tao,
Zhi-Fei Li,
Yuan Wu,
Zhi-Long Jia,
Wei-Cheng Kong,
Zhao-Yun Chen,
Yu-Chun Wu,
Guo-** Guo
Abstract:
In the NISQ era, achieving large-scale quantum computing demands compact circuits to mitigate decoherence and gate error accumulation. Quantum operations with diverse degrees of freedom hold promise for circuit compression, but conventional approaches encounter challenges in simultaneously adjusting multiple parameters. Here, we propose a transition composite gate (TCG) scheme grounded on state-se…
▽ More
In the NISQ era, achieving large-scale quantum computing demands compact circuits to mitigate decoherence and gate error accumulation. Quantum operations with diverse degrees of freedom hold promise for circuit compression, but conventional approaches encounter challenges in simultaneously adjusting multiple parameters. Here, we propose a transition composite gate (TCG) scheme grounded on state-selective transition path engineering, enabling more expressive conditional operations. We experimentally validate a controlled unitary (CU) gate as an example, with independent and continuous parameters. By adjusting the parameters of $\rm X^{12}$ gate, we obtain the CU family with a fidelity range of 95.2% to 99.0% leveraging quantum process tomography (QPT). To demonstrate the capability of circuit compression, we use TCG scheme to prepare 3-qubit Greenberger-Horne-Zeilinger (GHZ) and W states, with the fidelity of 96.77% and 95.72%. TCG can achieve the reduction in circuit depth of about 40% and 44% compared with the use of CZ gates only. Moreover, we show that short-path TCG (SPTCG) can further reduce the state-preparation circuit time cost. The TCG scheme exhibits advantages in certain quantum circuits and shows significant potential for large-scale quantum algorithms.
△ Less
Submitted 10 July, 2024; v1 submitted 9 July, 2024;
originally announced July 2024.
-
Subharmonic oscillations in the Floquet circuit with the frequency-synthesis dimension
Authors:
Bo Lv,
Shiyun Xia,
Ye Tian,
Ting Liu,
Hongyang Mu,
Zhichao Shen,
Sijie Wang,
Zheng Zhu,
Huibin Tao,
Fanyi Meng,
**hui Shi
Abstract:
The period-doubling oscillation emerges with the coexistence between zero and π modes in Floquet topological insulator. Here, utilized the flexibility of the circuit, we construct the Floquet circuit with frequency-synthetic dimension and find the topological-protected deeply-subharmonic oscillations with the period extensively exceeding the doubling-driven period. In the construction framework, t…
▽ More
The period-doubling oscillation emerges with the coexistence between zero and π modes in Floquet topological insulator. Here, utilized the flexibility of the circuit, we construct the Floquet circuit with frequency-synthetic dimension and find the topological-protected deeply-subharmonic oscillations with the period extensively exceeding the doubling-driven period. In the construction framework, the periodically-driven mechanism is attained by implementing the circuit-oscillator hierarchy with the step**-variation resonances in frequency domain. The zero and π modes that arise at the Floquet band in the circuit indicate the anomalous boundary-bulk correspondence. The coexistence of zero and π modes, results in a subharmonic oscillation with the extremely-low frequency on the edge of the Floquet circuit. Furthermore, we explore the Floquet band with the enhanced periodically-driven strength tailored by the component flexibility of the circuit. Our method provides a flexible scheme to study Floquet topological phases, and open a new path for realizing the deeply subwavelength system.
△ Less
Submitted 26 June, 2024;
originally announced June 2024.
-
Improving density matrix electronic structure method by deep learning
Authors:
Zechen Tang,
Nianlong Zou,
He Li,
Yuxiang Wang,
Zilong Yuan,
Honggeng Tao,
Yang Li,
Zezhou Chen,
Boheng Zhao,
Minghui Sun,
Hong Jiang,
Wenhui Duan,
Yong Xu
Abstract:
The combination of deep learning and ab initio materials calculations is emerging as a trending frontier of materials science research, with deep-learning density functional theory (DFT) electronic structure being particularly promising. In this work, we introduce a neural-network method for modeling the DFT density matrix, a fundamental yet previously unexplored quantity in deep-learning electron…
▽ More
The combination of deep learning and ab initio materials calculations is emerging as a trending frontier of materials science research, with deep-learning density functional theory (DFT) electronic structure being particularly promising. In this work, we introduce a neural-network method for modeling the DFT density matrix, a fundamental yet previously unexplored quantity in deep-learning electronic structure. Utilizing an advanced neural network framework that leverages the nearsightedness and equivariance properties of the density matrix, the method demonstrates high accuracy and excellent generalizability in multiple example studies, as well as capability to precisely predict charge density and reproduce other electronic structure properties. Given the pivotal role of the density matrix in DFT as well as other computational methods, the current research introduces a novel approach to the deep-learning study of electronic structure properties, opening up new opportunities for deep-learning enhanced computational materials study.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
Ultrasensitive acoustic graphene plasmons in a graphene-transition metal dichalcogenide heterostructure: strong plasmon-phonon coupling and wavelength sensitivity enhanced by a metal screen
Authors:
Ícaro R. Lavora,
Z. H. Tao,
H. M. Dongd,
Andrey Chaves,
F. M. Peeters,
Milorad V. Milosevic
Abstract:
Acoustic plasmons in graphene exhibit strong confinement induced by a proximate metal surface and hybridize with phonons of transition metal dichalcogenides (TMDs) when these materials are combined in a van der Waals heterostructure, thus forming screened graphene plasmon-phonon polaritons (SGPPPs), a type of acoustic mode. While SGPPPs are shown to be very sensitive to the dielectric properties o…
▽ More
Acoustic plasmons in graphene exhibit strong confinement induced by a proximate metal surface and hybridize with phonons of transition metal dichalcogenides (TMDs) when these materials are combined in a van der Waals heterostructure, thus forming screened graphene plasmon-phonon polaritons (SGPPPs), a type of acoustic mode. While SGPPPs are shown to be very sensitive to the dielectric properties of the environment, enhancing the SGPPPs coupling strength in realistic heterostructures is still challenging. Here we employ the quantum electrostatic heterostructure model, which builds upon the density functional theory calculations for monolayers, to show that the use of a metal as a substrate for graphene-TMD heterostructures (i) vigorously enhances the coupling strength between acoustic plasmons and the TMD phonons, and (ii) markedly improves the sensitivity of the plasmon wavelength on the structural details of the host platform in real space, thus allowing one to use the effect of environmental screening on acoustic plasmons to probe the structure and composition of a van der Waals heterostructure down to the monolayer resolution.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Single-photon triggered quantum entanglement between two qubits or at least 2000 identical qubits
Authors:
Wangjun Lu,
Cuilu Zhai,
Hong Tao,
Yaju Song,
Shiqing Tang,
Lan Xu
Abstract:
This paper studies the effect of single-photon light fields on quantum entanglement between two qubits and multiple identical qubits initially in a direct state. For two qubits, we first analyze the impact of the excited state's weight on single-photon-triggered entanglement, finding that excessive weight disrupts this process. We then explore how initial coherence affects entanglement, discoverin…
▽ More
This paper studies the effect of single-photon light fields on quantum entanglement between two qubits and multiple identical qubits initially in a direct state. For two qubits, we first analyze the impact of the excited state's weight on single-photon-triggered entanglement, finding that excessive weight disrupts this process. We then explore how initial coherence affects entanglement, discovering that maximum initial coherence enables the single photon to achieve maximal entanglement. For multiple qubits, we similarly investigate the effects of the excited state's weight and initial coherence on entanglement control. In large qubit systems, we find that single photons cannot trigger entanglement when excited-state weights exceed ground-state weights or when all qubits are initially in the ground state. Interestingly, single photons can still trigger entanglement between any two qubits in systems with at least 2000 qubits, with the entanglement depending on initial state parameters rather than the number of qubits.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
BSRBF-KAN: A combination of B-splines and Radial Basic Functions in Kolmogorov-Arnold Networks
Authors:
Hoang-Thang Ta
Abstract:
In this paper, we introduce BSRBF-KAN, a Kolmogorov Arnold Network (KAN) that combines Bsplines and radial basis functions (RBFs) to fit input vectors in data training. We perform experiments with BSRBF-KAN, MLP, and other popular KANs, including EfficientKAN, FastKAN, FasterKAN, and GottliebKAN over the MNIST and Fashion-MNIST datasets. BSRBF-KAN shows stability in 5 training sessions with a comp…
▽ More
In this paper, we introduce BSRBF-KAN, a Kolmogorov Arnold Network (KAN) that combines Bsplines and radial basis functions (RBFs) to fit input vectors in data training. We perform experiments with BSRBF-KAN, MLP, and other popular KANs, including EfficientKAN, FastKAN, FasterKAN, and GottliebKAN over the MNIST and Fashion-MNIST datasets. BSRBF-KAN shows stability in 5 training sessions with a competitive average accuracy of 97.55% on MNIST and 89.33% on FashionMNIST and obtains convergence better than other networks. We expect BSRBF-KAN to open many combinations of mathematical functions to design KANs. Our repo is publicly available at: https://github.com/hoangthangta/BSRBF-KAN.
△ Less
Submitted 19 June, 2024; v1 submitted 16 June, 2024;
originally announced June 2024.
-
Universal materials model of deep-learning density functional theory Hamiltonian
Authors:
Yuxiang Wang,
Yang Li,
Zechen Tang,
He Li,
Zilong Yuan,
Honggeng Tao,
Nianlong Zou,
Ting Bao,
Xinghao Liang,
Zezhou Chen,
Shanghua Xu,
Ce Bian,
Zhiming Xu,
Chong Wang,
Chen Si,
Wenhui Duan,
Yong Xu
Abstract:
Realizing large materials models has emerged as a critical endeavor for materials research in the new era of artificial intelligence, but how to achieve this fantastic and challenging objective remains elusive. Here, we propose a feasible pathway to address this paramount pursuit by develo** universal materials models of deep-learning density functional theory Hamiltonian (DeepH), enabling compu…
▽ More
Realizing large materials models has emerged as a critical endeavor for materials research in the new era of artificial intelligence, but how to achieve this fantastic and challenging objective remains elusive. Here, we propose a feasible pathway to address this paramount pursuit by develo** universal materials models of deep-learning density functional theory Hamiltonian (DeepH), enabling computational modeling of the complicated structure-property relationship of materials in general. By constructing a large materials database and substantially improving the DeepH method, we obtain a universal materials model of DeepH capable of handling diverse elemental compositions and material structures, achieving remarkable accuracy in predicting material properties. We further showcase a promising application of fine-tuning universal materials models for enhancing specific materials models. This work not only demonstrates the concept of DeepH's universal materials model but also lays the groundwork for develo** large materials models, opening up significant opportunities for advancing artificial intelligence-driven materials discovery.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
The Prompt Report: A Systematic Survey of Prompting Techniques
Authors:
Sander Schulhoff,
Michael Ilie,
Nishant Balepur,
Konstantine Kahadze,
Amanda Liu,
Chenglei Si,
Yinheng Li,
Aayush Gupta,
HyoJung Han,
Sevien Schulhoff,
Pranav Sandeep Dulepet,
Saurav Vidyadhara,
Dayeon Ki,
Sweta Agrawal,
Chau Pham,
Gerson Kroiz,
Feileen Li,
Hudson Tao,
Ashay Srivastava,
Hevander Da Costa,
Saloni Gupta,
Megan L. Rogers,
Inna Goncearenco,
Giuseppe Sarli,
Igor Galynker
, et al. (6 additional authors not shown)
Abstract:
Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a p…
▽ More
Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prompting is a widespread and highly researched concept, there exists conflicting terminology and a poor ontological understanding of what constitutes a prompt due to the area's nascency. This paper establishes a structured understanding of prompts, by assembling a taxonomy of prompting techniques and analyzing their use. We present a comprehensive vocabulary of 33 vocabulary terms, a taxonomy of 58 text-only prompting techniques, and 40 techniques for other modalities. We further present a meta-analysis of the entire literature on natural language prefix-prompting.
△ Less
Submitted 16 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Enabling Large-Scale and High-Precision Fluid Simulations on Near-Term Quantum Computers
Authors:
Zhao-Yun Chen,
Teng-Yang Ma,
Chuang-Chao Ye,
Liang Xu,
Ming-Yang Tan,
Xi-Ning Zhuang,
Xiao-Fan Xu,
Yun-Jie Wang,
Tai-** Sun,
Yong Chen,
Lei Du,
Liang-Liang Guo,
Hai-Feng Zhang,
Hao-Ran Tao,
Tian-Le Wang,
Xiao-Yan Yang,
Ze-An Zhao,
Peng Wang,
Sheng Zhang,
Chi Zhang,
Ren-Ze Zhao,
Zhi-Long Jia,
Wei-Cheng Kong,
Meng-Han Dou,
Jun-Chao Wang
, et al. (7 additional authors not shown)
Abstract:
Quantum computational fluid dynamics (QCFD) offers a promising alternative to classical computational fluid dynamics (CFD) by leveraging quantum algorithms for higher efficiency. This paper introduces a comprehensive QCFD method, including an iterative method "Iterative-QLS" that suppresses error in quantum linear solver, and a subspace method to scale the solution to a larger size. We implement o…
▽ More
Quantum computational fluid dynamics (QCFD) offers a promising alternative to classical computational fluid dynamics (CFD) by leveraging quantum algorithms for higher efficiency. This paper introduces a comprehensive QCFD method, including an iterative method "Iterative-QLS" that suppresses error in quantum linear solver, and a subspace method to scale the solution to a larger size. We implement our method on a superconducting quantum computer, demonstrating successful simulations of steady Poiseuille flow and unsteady acoustic wave propagation. The Poiseuille flow simulation achieved a relative error of less than $0.2\%$, and the unsteady acoustic wave simulation solved a 5043-dimensional matrix. We emphasize the utilization of the quantum-classical hybrid approach in applications of near-term quantum computers. By adapting to quantum hardware constraints and offering scalable solutions for large-scale CFD problems, our method paves the way for practical applications of near-term quantum computers in computational science.
△ Less
Submitted 19 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
On the Role of Controllability in Pulse-based Quantum Machine Learning Models
Authors:
Han-Xiao Tao,
Re-Bing Wu
Abstract:
Pulse-based quantum machine learning (QML) models possess full expressivity when they are ensemble controllable. However, it has also been shown that barren plateaus emerge in such models, rendering training intractable for systems with large dimension. In this paper, we show that the trade-off is closely related to the controllability of the underlying pulse-based models. We first apply the Flies…
▽ More
Pulse-based quantum machine learning (QML) models possess full expressivity when they are ensemble controllable. However, it has also been shown that barren plateaus emerge in such models, rendering training intractable for systems with large dimension. In this paper, we show that the trade-off is closely related to the controllability of the underlying pulse-based models. We first apply the Fliess-series expansion to pulse-based QML models to investigate the effect of control system structure on model expressivity, which leads to a universal criterion for assessing the expressivity of generic QML models. Guided by this criterion, we then demonstrate how designing pulse-based models on low-dimensional manifolds can balance expressivity and trainability. Finally, numerical experiments are carried out to verify the proposed criterion and our analysis, which futher demonstrate that increasing dimensionality enhances expressivity but avoids barren plateaus if the model is designed with limited controllability on a submanifold. Our approach provides a promising path for designing pulse-based QML models that are both highly expressive and trainable.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
ThangDLU at #SMM4H 2024: Encoder-decoder models for classifying text data on social disorders in children and adolescents
Authors:
Hoang-Thang Ta,
Abu Bakar Siddiqur Rahman,
Lotfollah Najjar,
Alexander Gelbukh
Abstract:
This paper describes our participation in Task 3 and Task 5 of the #SMM4H (Social Media Mining for Health) 2024 Workshop, explicitly targeting the classification challenges within tweet data. Task 3 is a multi-class classification task centered on tweets discussing the impact of outdoor environments on symptoms of social anxiety. Task 5 involves a binary classification task focusing on tweets repo…
▽ More
This paper describes our participation in Task 3 and Task 5 of the #SMM4H (Social Media Mining for Health) 2024 Workshop, explicitly targeting the classification challenges within tweet data. Task 3 is a multi-class classification task centered on tweets discussing the impact of outdoor environments on symptoms of social anxiety. Task 5 involves a binary classification task focusing on tweets reporting medical disorders in children. We applied transfer learning from pre-trained encoder-decoder models such as BART-base and T5-small to identify the labels of a set of given tweets. We also presented some data augmentation methods to see their impact on the model performance. Finally, the systems obtained the best F1 score of 0.627 in Task 3 and the best F1 score of 0.841 in Task 5.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
On de Bruijn Covering Sequences and Arrays
Authors:
Yeow Meng Chee,
Tuvi Etzion,
Hoang Ta,
Van Khu Vu
Abstract:
An $(m,n,R)$-de Bruijn covering array (dBCA) is a doubly periodic $M \times N$ array over an alphabet of size $q$ such that the set of all its $m \times n$ windows form a covering code with radius $R$. An upper bound of the smallest array area of an $(m,n,R)$-dBCA is provided using a probabilistic technique which is similar to the one that was used for an upper bound on the length of a de Bruijn c…
▽ More
An $(m,n,R)$-de Bruijn covering array (dBCA) is a doubly periodic $M \times N$ array over an alphabet of size $q$ such that the set of all its $m \times n$ windows form a covering code with radius $R$. An upper bound of the smallest array area of an $(m,n,R)$-dBCA is provided using a probabilistic technique which is similar to the one that was used for an upper bound on the length of a de Bruijn covering sequence. A folding technique to construct a dBCA from a de Bruijn covering sequence or de Bruijn covering sequences code is presented. Several new constructions that yield shorter de Bruijn covering sequences and $(m,n,R)$-dBCAs with smaller areas are also provided. These constructions are mainly based on sequences derived from cyclic codes, self-dual sequences, primitive polynomials, an interleaving technique, folding, and mutual shifts of sequences with the same covering radius. Finally, constructions of de Bruijn covering sequences codes are also discussed.
△ Less
Submitted 9 May, 2024; v1 submitted 21 April, 2024;
originally announced April 2024.
-
Learning Human Motion from Monocular Videos via Cross-Modal Manifold Alignment
Authors:
Shuaiying Hou,
Hongyu Tao,
Junheng Fang,
Changqing Zou,
Hujun Bao,
Weiwei Xu
Abstract:
Learning 3D human motion from 2D inputs is a fundamental task in the realms of computer vision and computer graphics. Many previous methods grapple with this inherently ambiguous task by introducing motion priors into the learning process. However, these approaches face difficulties in defining the complete configurations of such priors or training a robust model. In this paper, we present the Vid…
▽ More
Learning 3D human motion from 2D inputs is a fundamental task in the realms of computer vision and computer graphics. Many previous methods grapple with this inherently ambiguous task by introducing motion priors into the learning process. However, these approaches face difficulties in defining the complete configurations of such priors or training a robust model. In this paper, we present the Video-to-Motion Generator (VTM), which leverages motion priors through cross-modal latent feature space alignment between 3D human motion and 2D inputs, namely videos and 2D keypoints. To reduce the complexity of modeling motion priors, we model the motion data separately for the upper and lower body parts. Additionally, we align the motion data with a scale-invariant virtual skeleton to mitigate the interference of human skeleton variations to the motion priors. Evaluated on AIST++, the VTM showcases state-of-the-art performance in reconstructing 3D human motion from monocular videos. Notably, our VTM exhibits the capabilities for generalization to unseen view angles and in-the-wild videos.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Semantic SQL -- Combining and optimizing semantic predicates in SQL
Authors:
Akash Mittal,
Anshul Bheemreddy,
Huili Tao
Abstract:
In recent years, the surge in unstructured data analysis, facilitated by advancements in Machine Learning (ML), has prompted diverse approaches for handling images, text documents, and videos. Analysts, leveraging ML models, can extract meaningful information from unstructured data and store it in relational databases, allowing the execution of SQL queries for further analysis. Simultaneously, vec…
▽ More
In recent years, the surge in unstructured data analysis, facilitated by advancements in Machine Learning (ML), has prompted diverse approaches for handling images, text documents, and videos. Analysts, leveraging ML models, can extract meaningful information from unstructured data and store it in relational databases, allowing the execution of SQL queries for further analysis. Simultaneously, vector databases have emerged, embedding unstructured data for efficient top-k queries based on textual queries. This paper introduces a novel framework SSQL - Semantic SQL that utilizes these two approaches, enabling the incorporation of semantic queries within SQL statements. Our approach extends SQL queries with dedicated keywords for specifying semantic queries alongside predicates related to ML model results and metadata. Our experimental results show that using just semantic queries fails catastrophically to answer count and spatial queries in more than 60% of the cases. Our proposed method jointly optimizes the queries containing both semantic predicates and predicates on structured tables, such as those generated by ML models or other metadata. Further, to improve the query results, we incorporated human-in-the-loop feedback to determine the optimal similarity score threshold for returning results.
△ Less
Submitted 5 April, 2024;
originally announced April 2024.
-
A Two-Phase Recall-and-Select Framework for Fast Model Selection
Authors:
Jianwei Cui,
Wenhang Shi,
Honglin Tao,
Wei Lu,
Xiaoyong Du
Abstract:
As the ubiquity of deep learning in various machine learning applications has amplified, a proliferation of neural network models has been trained and shared on public model repositories. In the context of a targeted machine learning assignment, utilizing an apt source model as a starting point typically outperforms the strategy of training from scratch, particularly with limited training data. De…
▽ More
As the ubiquity of deep learning in various machine learning applications has amplified, a proliferation of neural network models has been trained and shared on public model repositories. In the context of a targeted machine learning assignment, utilizing an apt source model as a starting point typically outperforms the strategy of training from scratch, particularly with limited training data. Despite the investigation and development of numerous model selection strategies in prior work, the process remains time-consuming, especially given the ever-increasing scale of model repositories. In this paper, we propose a two-phase (coarse-recall and fine-selection) model selection framework, aiming to enhance the efficiency of selecting a robust model by leveraging the models' training performances on benchmark datasets. Specifically, the coarse-recall phase clusters models showcasing similar training performances on benchmark datasets in an offline manner. A light-weight proxy score is subsequently computed between this model cluster and the target dataset, which serves to recall a significantly smaller subset of potential candidate models in a swift manner. In the following fine-selection phase, the final model is chosen by fine-tuning the recalled models on the target dataset with successive halving. To accelerate the process, the final fine-tuning performance of each potential model is predicted by mining the model's convergence trend on the benchmark datasets, which aids in filtering lower performance models more earlier during fine-tuning. Through extensive experimentation on tasks covering natural language processing and computer vision, it has been demonstrated that the proposed methodology facilitates the selection of a high-performing model at a rate about 3x times faster than conventional baseline methods. Our code is available at https://github.com/plasware/two-phase-selection.
△ Less
Submitted 28 March, 2024;
originally announced April 2024.
-
Finding Decision Tree Splits in Streaming and Massively Parallel Models
Authors:
Huy Pham,
Hoang Ta,
Hoa T. Vu
Abstract:
In this work, we provide data stream algorithms that compute optimal splits in decision tree learning. In particular, given a data stream of observations $x_i$ and their labels $y_i$, the goal is to find the optimal split point $j$ that divides the data into two sets such that the mean squared error (for regression) or misclassification rate (for classification) is minimized. We provide various fa…
▽ More
In this work, we provide data stream algorithms that compute optimal splits in decision tree learning. In particular, given a data stream of observations $x_i$ and their labels $y_i$, the goal is to find the optimal split point $j$ that divides the data into two sets such that the mean squared error (for regression) or misclassification rate (for classification) is minimized. We provide various fast streaming algorithms that use sublinear space and a small number of passes for these problems. These algorithms can also be extended to the massively parallel computation model. Our work, while not directly comparable, complements the seminal work of Domingos and Hulten (KDD 2000).
△ Less
Submitted 17 April, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Neural-network density functional theory
Authors:
Yang Li,
Zechen Tang,
Zezhou Chen,
Minghui Sun,
Boheng Zhao,
He Li,
Honggeng Tao,
Zilong Yuan,
Wenhui Duan,
Yong Xu
Abstract:
Deep-learning density functional theory (DFT) shows great promise to significantly accelerate material discovery and potentially revolutionize materials research, which demands a close combination between neural networks and DFT computation. However, current research in this field primarily relies on supervised learning, making the developments of neural networks and DFT isolated from each other.…
▽ More
Deep-learning density functional theory (DFT) shows great promise to significantly accelerate material discovery and potentially revolutionize materials research, which demands a close combination between neural networks and DFT computation. However, current research in this field primarily relies on supervised learning, making the developments of neural networks and DFT isolated from each other. In this work, we present a theoretical framework of neural-network DFT, which unifies the optimization of neural networks with the variational computation of DFT, enabling physics-informed unsupervised learning. Moreover, we develop a differential DFT code incorporated with deep-learning DFT Hamiltonian, and introduce algorithms of automatic differentiation and backpropagation to DFT, demonstrating the concept of neural-network DFT. The advanced neural-network architecture not only surpasses conventional approaches in accuracy and efficiency, but offers a new paradigm for develo** deep-learning DFT methods.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Optimizing Polynomial Graph Filters: A Novel Adaptive Krylov Subspace Approach
Authors:
Keke Huang,
Wencai Cao,
Hoang Ta,
Xiaokui Xiao,
Pietro Liò
Abstract:
Graph Neural Networks (GNNs), known as spectral graph filters, find a wide range of applications in web networks. To bypass eigendecomposition, polynomial graph filters are proposed to approximate graph filters by leveraging various polynomial bases for filter training. However, no existing studies have explored the diverse polynomial graph filters from a unified perspective for optimization.
In…
▽ More
Graph Neural Networks (GNNs), known as spectral graph filters, find a wide range of applications in web networks. To bypass eigendecomposition, polynomial graph filters are proposed to approximate graph filters by leveraging various polynomial bases for filter training. However, no existing studies have explored the diverse polynomial graph filters from a unified perspective for optimization.
In this paper, we first unify polynomial graph filters, as well as the optimal filters of identical degrees into the Krylov subspace of the same order, thus providing equivalent expressive power theoretically. Next, we investigate the asymptotic convergence property of polynomials from the unified Krylov subspace perspective, revealing their limited adaptability in graphs with varying heterophily degrees. Inspired by those facts, we design a novel adaptive Krylov subspace approach to optimize polynomial bases with provable controllability over the graph spectrum so as to adapt various heterophily graphs. Subsequently, we propose AdaptKry, an optimized polynomial graph filter utilizing bases from the adaptive Krylov subspaces. Meanwhile, in light of the diverse spectral properties of complex graphs, we extend AdaptKry by leveraging multiple adaptive Krylov bases without incurring extra training costs. As a consequence, extended AdaptKry is able to capture the intricate characteristics of graphs and provide insights into their inherent complexity. We conduct extensive experiments across a series of real-world datasets. The experimental results demonstrate the superior filtering capability of AdaptKry, as well as the optimized efficacy of the adaptive Krylov basis.
△ Less
Submitted 20 May, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
Maximum Length RLL Sequences in de Bruijn Graph
Authors:
Yeow Meng Chee,
Tuvi Etzion,
Tien Long Nguyen,
Duy Hoang Ta,
Vinh Duc Tran,
Van Khu Vu
Abstract:
A timing and synchronization system based on a de Bruijn sequence has been proposed and studied recently for a channel associated with quantum communication that requires reliable synchronization. To avoid a long period of no-pulse in such a system on-off pulses are used to simulate a zero and on-on pulses are used to simulate a one. However, these sequences have high redundancy. To reduce the red…
▽ More
A timing and synchronization system based on a de Bruijn sequence has been proposed and studied recently for a channel associated with quantum communication that requires reliable synchronization. To avoid a long period of no-pulse in such a system on-off pulses are used to simulate a zero and on-on pulses are used to simulate a one. However, these sequences have high redundancy. To reduce the redundancy, run-length limited sequences in the de Bruijn graph are proposed for the same purpose. The maximum length of such sequences in the de Bruijn graph is studied and an efficient algorithm to construct a large set of these sequences is presented. A maximum length sequence for which the position of each window can be computed efficiently is constructed. Finally, an enumeration of the number of such sequences is given and some generalizations are discussed.
△ Less
Submitted 3 March, 2024;
originally announced March 2024.
-
NocPlace: Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer
Authors:
Bingxi Liu,
Yiqun Wang,
Huaqi Tao,
Tingjun Huang,
Fulin Tang,
Yihong Wu,
**qiang Cui,
Hong Zhang
Abstract:
Visual Place Recognition (VPR) is crucial in computer vision, aiming to retrieve database images similar to a query image from an extensive collection of known images. However, like many vision tasks, VPR always degrades at night due to the scarcity of nighttime images. Moreover, VPR needs to address the cross-domain problem of night-to-day rather than just the issue of a single nighttime domain.…
▽ More
Visual Place Recognition (VPR) is crucial in computer vision, aiming to retrieve database images similar to a query image from an extensive collection of known images. However, like many vision tasks, VPR always degrades at night due to the scarcity of nighttime images. Moreover, VPR needs to address the cross-domain problem of night-to-day rather than just the issue of a single nighttime domain. In response to these issues, we present NocPlace, which leverages generative and inherited knowledge transfer to embed resilience against dazzling lights and extreme darkness in the global descriptor. First, we establish a day-night urban scene dataset called NightCities, capturing diverse lighting variations and dark scenarios across 60 cities globally. Then, an image generation network is trained on this dataset and processes a large-scale VPR dataset, obtaining its nighttime version. Finally, VPR models are fine-tuned using descriptors inherited from themselves and night-style images, which builds explicit cross-domain contrastive relationships. Comprehensive experiments on various datasets demonstrate our contributions and the superiority of NocPlace. Without adding any real-time computing resources, NocPlace improves the performance of Eigenplaces by 7.6% on Tokyo 24/7 Night and 16.8% on SVOX Night.
△ Less
Submitted 21 March, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
A Solution for Commercializing, Decentralizing and Storing Electronic Medical Records by Integrating Proxy Re-Encryption, IPFS, and Blockchain
Authors:
Phong Tran,
Thong Nguyen,
Long Chu,
Nhi Tran,
Hang Ta
Abstract:
The rapid expansion of user medical records across global systems presents not only opportunities but also new challenges in maintaining effective application models that ensure user privacy, controllability, and the ability to commercialize patient medical records. Moreover, the proliferation of data analysis models in healthcare institutions necessitates the decentralization and restorability of…
▽ More
The rapid expansion of user medical records across global systems presents not only opportunities but also new challenges in maintaining effective application models that ensure user privacy, controllability, and the ability to commercialize patient medical records. Moreover, the proliferation of data analysis models in healthcare institutions necessitates the decentralization and restorability of medical record data. It is imperative that user medical data collected from these systems can be easily analyzed and utilized even years after collection, without the risk of data loss due to numerous factors. Additionally, medical information must be authorized by the data owner, granting patients the right to accept or decline data usage requests from medical research agencies. In response, we propose an innovative solution for implementing a decentralized system utilizing an EVM-compatible blockchain and IPFS for decentralized storage. To ensure privacy and control, we employ Proxy Re-Encryption (PRE), a cryptographic authorized method, within the medical data marketplace. Our proposed architecture significantly reduces costs associated with granting read access to healthcare research agencies by minimizing the encryption and decryption time of stored records. Furthermore, it empowers users with enhanced control over their health data through tamperproof blockchain smart contracts and IPFS, safeguarding the integrity and privacy of their medical records.
△ Less
Submitted 4 June, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Equivariant Neural Network Force Fields for Magnetic Materials
Authors:
Zilong Yuan,
Zhiming Xu,
He Li,
Xinle Cheng,
Honggeng Tao,
Zechen Tang,
Zhiyuan Zhou,
Wenhui Duan,
Yong Xu
Abstract:
Neural network force fields have significantly advanced ab initio atomistic simulations across diverse fields. However, their application in the realm of magnetic materials is still in its early stage due to challenges posed by the subtle magnetic energy landscape and the difficulty of obtaining training data. Here we introduce a data-efficient neural network architecture to represent density func…
▽ More
Neural network force fields have significantly advanced ab initio atomistic simulations across diverse fields. However, their application in the realm of magnetic materials is still in its early stage due to challenges posed by the subtle magnetic energy landscape and the difficulty of obtaining training data. Here we introduce a data-efficient neural network architecture to represent density functional theory total energy, atomic forces, and magnetic forces as functions of atomic and magnetic structures. Our approach incorporates the principle of equivariance under the three-dimensional Euclidean group into the neural network model. Through systematic experiments on various systems, including monolayer magnets, curved nanotube magnets, and moiré-twisted bilayer magnets of $\text{CrI}_{3}$, we showcase the method's high efficiency and accuracy, as well as exceptional generalization ability. The work creates opportunities for exploring magnetic phenomena in large-scale materials systems.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Unleashing the Expressive Power of Pulse-Based Quantum Neural Networks
Authors:
Han-Xiao Tao,
Jiaqi Hu,
Re-Bing Wu
Abstract:
Quantum machine learning (QML) based on Noisy Intermediate-Scale Quantum (NISQ) devices hinges on the optimal utilization of limited quantum resources. While gate-based QML models are user-friendly for software engineers, their expressivity is restricted by the permissible circuit depth within a finite coherence time. In contrast, pulse-based models enable the construction of "infinitely" deep qua…
▽ More
Quantum machine learning (QML) based on Noisy Intermediate-Scale Quantum (NISQ) devices hinges on the optimal utilization of limited quantum resources. While gate-based QML models are user-friendly for software engineers, their expressivity is restricted by the permissible circuit depth within a finite coherence time. In contrast, pulse-based models enable the construction of "infinitely" deep quantum neural networks within the same time, which may unleash greater expressive power for complex learning tasks. In this paper, this potential is investigated from the perspective of quantum control theory. We first indicate that the nonlinearity of pulse-based models comes from the encoding process that can be viewed as the continuous limit of data-reuploading in gate-based models. Subsequently, we prove that the pulse-based model can approximate arbitrary nonlinear functions when the underlying physical system is ensemble controllable. Under this condition, numerical simulations demonstrate the enhanced expressivity by either increasing the pulse length or the number of qubits. As anticipated, we show through numerical examples that the pulse-based model can unleash more expressive power compared to the gate-based model. These findings lay a theoretical foundation for understanding and designing expressive QML models using NISQ devices.
△ Less
Submitted 25 June, 2024; v1 submitted 5 February, 2024;
originally announced February 2024.
-
Region-Based Representations Revisited
Authors:
Michal Shlapentokh-Rothman,
Ansel Blume,
Yao Xiao,
Yuqun Wu,
Sethuraman T V,
Heyi Tao,
Jae Yong Lee,
Wilfredo Torres,
Yu-Xiong Wang,
Derek Hoiem
Abstract:
We investigate whether region-based representations are effective for recognition. Regions were once a mainstay in recognition approaches, but pixel and patch-based features are now used almost exclusively. We show that recent class-agnostic segmenters like SAM can be effectively combined with strong unsupervised representations like DINOv2 and used for a wide variety of tasks, including semantic…
▽ More
We investigate whether region-based representations are effective for recognition. Regions were once a mainstay in recognition approaches, but pixel and patch-based features are now used almost exclusively. We show that recent class-agnostic segmenters like SAM can be effectively combined with strong unsupervised representations like DINOv2 and used for a wide variety of tasks, including semantic segmentation, object-based image retrieval, and multi-image analysis. Once the masks and features are extracted, these representations, even with linear decoders, enable competitive performance, making them well suited to applications that require custom queries. The compactness of the representation also makes it well-suited to video analysis and other problems requiring inference across many images.
△ Less
Submitted 9 June, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
DeepH-2: Enhancing deep-learning electronic structure via an equivariant local-coordinate transformer
Authors:
Yuxiang Wang,
He Li,
Zechen Tang,
Honggeng Tao,
Yanzhen Wang,
Zilong Yuan,
Zezhou Chen,
Wenhui Duan,
Yong Xu
Abstract:
Deep-learning electronic structure calculations show great potential for revolutionizing the landscape of computational materials research. However, current neural-network architectures are not deemed suitable for widespread general-purpose application. Here we introduce a framework of equivariant local-coordinate transformer, designed to enhance the deep-learning density functional theory Hamilto…
▽ More
Deep-learning electronic structure calculations show great potential for revolutionizing the landscape of computational materials research. However, current neural-network architectures are not deemed suitable for widespread general-purpose application. Here we introduce a framework of equivariant local-coordinate transformer, designed to enhance the deep-learning density functional theory Hamiltonian referred to as DeepH-2. Unlike previous models such as DeepH and DeepH-E3, DeepH-2 seamlessly integrates the simplicity of local-coordinate transformations and the mathematical elegance of equivariant neural networks, effectively overcoming their respective disadvantages. Based on our comprehensive experiments, DeepH-2 demonstrates superiority over its predecessors in both efficiency and accuracy, showcasing state-of-the-art performance. This advancement opens up opportunities for exploring universal neural network models or even large materials models.
△ Less
Submitted 30 January, 2024;
originally announced January 2024.
-
Falcon: Fair Active Learning using Multi-armed Bandits
Authors:
Ki Hyun Tae,
Hantian Zhang,
Jaeyoung Park,
Kexin Rong,
Steven Euijong Whang
Abstract:
Biased data can lead to unfair machine learning models, highlighting the importance of embedding fairness at the beginning of data analysis, particularly during dataset curation and labeling. In response, we propose Falcon, a scalable fair active learning framework. Falcon adopts a data-centric approach that improves machine learning model fairness via strategic sample selection. Given a user-spec…
▽ More
Biased data can lead to unfair machine learning models, highlighting the importance of embedding fairness at the beginning of data analysis, particularly during dataset curation and labeling. In response, we propose Falcon, a scalable fair active learning framework. Falcon adopts a data-centric approach that improves machine learning model fairness via strategic sample selection. Given a user-specified group fairness measure, Falcon identifies samples from "target groups" (e.g., (attribute=female, label=positive)) that are the most informative for improving fairness. However, a challenge arises since these target groups are defined using ground truth labels that are not available during sample selection. To handle this, we propose a novel trial-and-error method, where we postpone using a sample if the predicted label is different from the expected one and falls outside the target group. We also observe the trade-off that selecting more informative samples results in higher likelihood of postponing due to undesired label prediction, and the optimal balance varies per dataset. We capture the trade-off between informativeness and postpone rate as policies and propose to automatically select the best policy using adversarial multi-armed bandit methods, given their computational efficiency and theoretical guarantees. Experiments show that Falcon significantly outperforms existing fair active learning approaches in terms of fairness and accuracy and is more efficient. In particular, only Falcon supports a proper trade-off between accuracy and fairness where its maximum fairness score is 1.8-4.5x higher than the second-best results.
△ Less
Submitted 23 January, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
Self-training from Self-memory in Data-to-text Generation
Authors:
Hoang-Thang Ta
Abstract:
This paper introduces a novel training model, self-training from self-memory (STSM) in data-to-text generation (DTG), allowing the model to self-train on subsets, including self-memory as outputs inferred directly from the trained models and/or the new data. The quality of self-memory is validated by two models, data-to-text (D2T) and text-to-data (T2D), by two pre-defined conditions: (1) the appe…
▽ More
This paper introduces a novel training model, self-training from self-memory (STSM) in data-to-text generation (DTG), allowing the model to self-train on subsets, including self-memory as outputs inferred directly from the trained models and/or the new data. The quality of self-memory is validated by two models, data-to-text (D2T) and text-to-data (T2D), by two pre-defined conditions: (1) the appearance of all source values in the outputs of the D2T model and (2) the ability to convert back to source data in the outputs in the T2D model. We utilize a greedy algorithm to generate shorter D2T outputs if they contain all source values. Subsequently, we use the T2D model to confirm that these outputs can capture input relationships by demonstrating their capacity to convert text back into data. With 30% of the dataset, we can train the D2T model with a competitive performance compared to full training in the same setup. We experiment with our model on two datasets, E2E NLG and DART. STSM offers the D2T model a generalization capability from its subset memory while reducing training data volume. Ultimately, we anticipate that this paper will contribute to continual learning solutions that adapt to new training data, incorporating it as a form of self-memory in DTG tasks. The curated dataset is publicly available at: https://github.com/hoangthangta/STSM.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
DepressionEmo: A novel dataset for multilabel classification of depression emotions
Authors:
Abu Bakar Siddiqur Rahman,
Hoang-Thang Ta,
Lotfollah Najjar,
Azad Azadmanesh,
Ali Saffet Gönül
Abstract:
Emotions are integral to human social interactions, with diverse responses elicited by various situational contexts. Particularly, the prevalence of negative emotional states has been correlated with negative outcomes for mental health, necessitating a comprehensive analysis of their occurrence and impact on individuals. In this paper, we introduce a novel dataset named DepressionEmo designed to d…
▽ More
Emotions are integral to human social interactions, with diverse responses elicited by various situational contexts. Particularly, the prevalence of negative emotional states has been correlated with negative outcomes for mental health, necessitating a comprehensive analysis of their occurrence and impact on individuals. In this paper, we introduce a novel dataset named DepressionEmo designed to detect 8 emotions associated with depression by 6037 examples of long Reddit user posts. This dataset was created through a majority vote over inputs by zero-shot classifications from pre-trained models and validating the quality by annotators and ChatGPT, exhibiting an acceptable level of interrater reliability between annotators. The correlation between emotions, their distribution over time, and linguistic analysis are conducted on DepressionEmo. Besides, we provide several text classification methods classified into two groups: machine learning methods such as SVM, XGBoost, and Light GBM; and deep learning methods such as BERT, GAN-BERT, and BART. The pretrained BART model, bart-base allows us to obtain the highest F1- Macro of 0.76, showing its outperformance compared to other methods evaluated in our analysis. Across all emotions, the highest F1-Macro value is achieved by suicide intent, indicating a certain value of our dataset in identifying emotions in individuals with depression symptoms through text analysis. The curated dataset is publicly available at: https://github.com/abuBakarSiddiqurRahman/DepressionEmo.
△ Less
Submitted 9 January, 2024;
originally announced January 2024.
-
Coordinated Planning of Offshore Charging Stations and Electrified Ships: A Case Study on Shanghai-Busan Maritime Route
Authors:
Hao Li,
Hanqi Tao,
Wentao Huang,
Hongcai Zhang,
Ran Li
Abstract:
Despite the success of electric vehicles on land, electrification of maritime ships is challenged by the dilemma of range anxiety and cargo-carrying capacity. The longer range requires larger batteries, which inevitably eat up the precious cargo space and weight. This paper breaks new ground by proposing a coordinated planning model for offshore charging stations (OCSs) and electric ships (ESs), m…
▽ More
Despite the success of electric vehicles on land, electrification of maritime ships is challenged by the dilemma of range anxiety and cargo-carrying capacity. The longer range requires larger batteries, which inevitably eat up the precious cargo space and weight. This paper breaks new ground by proposing a coordinated planning model for offshore charging stations (OCSs) and electric ships (ESs), marking a first in this field. Strategically situated OCS can partition a long maritime route into several shorter segments, which in turn lead to smaller batteries and thus larger cargo capacities. The research analyzed the impact of maritime geographical conditions on the placement and sizing process and provided insights into the trade-offs between battery size, cargo-carrying capacity, and the cruising range of different types of electrified ships. Using real Automatic Identification System (AIS) data, we estimated the economic feasibility of the Shanghai-Busan high-traffic maritime route and conducted a sensitivity analysis on factors affecting its economic viability. The results show that installing OCS can significantly reduce the propulsion cost compared with ESs without OCS and traditional internal combustion engine (ICE) ships.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Efficient Title Reranker for Fast and Improved Knowledge-Intense NLP
Authors:
Ziyi Chen,
Jize Jiang,
Daqian Zuo,
Heyi Tao,
Jun Yang,
Yuxiang Wei
Abstract:
In recent RAG approaches, rerankers play a pivotal role in refining retrieval accuracy with the ability of revealing logical relations for each pair of query and text. However, existing rerankers are required to repeatedly encode the query and a large number of long retrieved text. This results in high computational costs and limits the number of retrieved text, hindering accuracy. As a remedy of…
▽ More
In recent RAG approaches, rerankers play a pivotal role in refining retrieval accuracy with the ability of revealing logical relations for each pair of query and text. However, existing rerankers are required to repeatedly encode the query and a large number of long retrieved text. This results in high computational costs and limits the number of retrieved text, hindering accuracy. As a remedy of the problem, we introduce the Efficient Title Reranker via Broadcasting Query Encoder, a novel technique for title reranking that achieves a 20x-40x speedup over the vanilla passage reranker. Furthermore, we introduce Sigmoid Trick, a novel loss function customized for title reranking. Combining both techniques, we empirically validated their effectiveness, achieving state-of-the-art results on all four datasets we experimented with from the KILT knowledge benchmark.
△ Less
Submitted 25 February, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
WebWISE: Web Interface Control and Sequential Exploration with Large Language Models
Authors:
Heyi Tao,
Sethuraman T V,
Michal Shlapentokh-Rothman,
Derek Hoiem
Abstract:
The paper investigates using a Large Language Model (LLM) to automatically perform web software tasks using click, scroll, and text input operations. Previous approaches, such as reinforcement learning (RL) or imitation learning, are inefficient to train and task-specific. Our method uses filtered Document Object Model (DOM) elements as observations and performs tasks step-by-step, sequentially ge…
▽ More
The paper investigates using a Large Language Model (LLM) to automatically perform web software tasks using click, scroll, and text input operations. Previous approaches, such as reinforcement learning (RL) or imitation learning, are inefficient to train and task-specific. Our method uses filtered Document Object Model (DOM) elements as observations and performs tasks step-by-step, sequentially generating small programs based on the current observations. We use in-context learning, either benefiting from a single manually provided example, or an automatically generated example based on a successful zero-shot trial. We evaluate the proposed method on the MiniWob++ benchmark. With only one in-context example, our WebWISE method achieves similar or better performance than other methods that require many demonstrations or trials.
△ Less
Submitted 24 October, 2023; v1 submitted 24 October, 2023;
originally announced October 2023.
-
Efficient Approximation of Quantum Channel Fidelity Exploiting Symmetry
Authors:
Yeow Meng Chee,
Hoang Ta,
Van Khu Vu
Abstract:
Determining the optimal fidelity for the transmission of quantum information over noisy quantum channels is one of the central problems in quantum information theory. Recently, [Berta-Borderi-Fawzi-Scholz, Mathematical Programming, 2021] introduced an asymptotically converging semidefinite programming hierarchy of outer bounds for this quantity. However, the size of the semidefinite programs (SDPs…
▽ More
Determining the optimal fidelity for the transmission of quantum information over noisy quantum channels is one of the central problems in quantum information theory. Recently, [Berta-Borderi-Fawzi-Scholz, Mathematical Programming, 2021] introduced an asymptotically converging semidefinite programming hierarchy of outer bounds for this quantity. However, the size of the semidefinite programs (SDPs) grows exponentially with respect to the level of the hierarchy, thus making their computation unscalable. In this work, by exploiting the symmetries in the SDP, we show that, for a fixed output dimension of the quantum channel, we can compute the SDP in time polynomial with respect to the level of the hierarchy and input dimension. As a direct consequence of our result, the optimal fidelity can be approximated with an accuracy of $ε$ in $\mathrm{poly}(1/ε, \text{input dimension})$ time.
△ Less
Submitted 21 March, 2024; v1 submitted 30 August, 2023;
originally announced August 2023.
-
On the Asymptotic Nonnegative Rank of Matrices and its Applications in Information Theory
Authors:
Yeow Meng Chee,
Quoc Tung Le,
Hoang Ta
Abstract:
In this paper, we study the asymptotic nonnegative rank of matrices, which characterizes the asymptotic growth of the nonnegative rank of fixed nonnegative matrices under the Kronecker product. This quantity is important since it governs several notions in information theory such as the so-called exact Rényi common information and the amortized communication complexity. By using the theory of asym…
▽ More
In this paper, we study the asymptotic nonnegative rank of matrices, which characterizes the asymptotic growth of the nonnegative rank of fixed nonnegative matrices under the Kronecker product. This quantity is important since it governs several notions in information theory such as the so-called exact Rényi common information and the amortized communication complexity. By using the theory of asymptotic spectra of V. Strassen (J. Reine Angew. Math. 1988), we define formally the asymptotic spectrum of nonnegative matrices and give a dual characterization of the asymptotic nonnegative rank. As a complementary of the nonnegative rank, we introduce the notion of the subrank of a nonnegative matrix and show that it is exactly equal to the size of the maximum induced matching of the bipartite graph defined on the support of the matrix (therefore, independent of the value of entries). Finally, we show that two matrix parameters, namely rank and fractional cover number, belong to the asymptotic spectrum of nonnegative matrices.
△ Less
Submitted 29 January, 2024; v1 submitted 14 August, 2023;
originally announced August 2023.
-
Quantum multiparameter estimation with graph states
Authors:
Hong Tao,
Xiaoqing Tan
Abstract:
In the SU(2) dynamics, it is especially significant to achieve a simultaneous optimal multiparameter estimation but it is very difficult. Evolution on SU(N) dynamics is a research method to explore simultaneous multiparameter estimation with the quantum network. As the highly entangled states, graph state, is an intrinsical quantum resource for quantum metrology. For n-qubit graph state, we propos…
▽ More
In the SU(2) dynamics, it is especially significant to achieve a simultaneous optimal multiparameter estimation but it is very difficult. Evolution on SU(N) dynamics is a research method to explore simultaneous multiparameter estimation with the quantum network. As the highly entangled states, graph state, is an intrinsical quantum resource for quantum metrology. For n-qubit graph state, we propose a simultaneous multiparameter estimation scheme that investigates evolution in SU(N) dynamics. For single-parameter estimation, the precision limit beyond the Heisenberg limit in the higher dimension spin of SU(2). We consider two scenarios where the Hamiltonian operator is commutation and non-commutation respectively and verify that the global estimation precision is higher than the local estimation precision. In the parameter limit condition, the precision of parameter estimation for the simultaneous estimation of each parameter is equal to the precision of the singleparameter estimation. In addition, we find a precision-enhancement scheme that depends on the dynamics SU(N). The smaller the N for the dynamics evolution, the higher the precision of the parameter estimation. Finally, we prove that the graph state is the optimal state of quantum metrology, a set of optimal measurement basic can be found, and the precision limit of multiparameter estimation can attain the quantum Cramér-Rao bound.
△ Less
Submitted 6 June, 2023; v1 submitted 4 June, 2023;
originally announced June 2023.
-
A Two-part Transformer Network for Controllable Motion Synthesis
Authors:
Shuaiying Hou,
Hongyu Tao,
Hujun Bao,
Weiwei Xu
Abstract:
Although part-based motion synthesis networks have been investigated to reduce the complexity of modeling heterogeneous human motions, their computational cost remains prohibitive in interactive applications. To this end, we propose a novel two-part transformer network that aims to achieve high-quality, controllable motion synthesis results in real-time. Our network separates the skeleton into the…
▽ More
Although part-based motion synthesis networks have been investigated to reduce the complexity of modeling heterogeneous human motions, their computational cost remains prohibitive in interactive applications. To this end, we propose a novel two-part transformer network that aims to achieve high-quality, controllable motion synthesis results in real-time. Our network separates the skeleton into the upper and lower body parts, reducing the expensive cross-part fusion operations, and models the motions of each part separately through two streams of auto-regressive modules formed by multi-head attention layers. However, such a design might not sufficiently capture the correlations between the parts. We thus intentionally let the two parts share the features of the root joint and design a consistency loss to penalize the difference in the estimated root features and motions by these two auto-regressive modules, significantly improving the quality of synthesized motions. After training on our motion dataset, our network can synthesize a wide range of heterogeneous motions, like cartwheels and twists. Experimental and user study results demonstrate that our network is superior to state-of-the-art human motion synthesis networks in the quality of generated motions.
△ Less
Submitted 19 June, 2023; v1 submitted 25 April, 2023;
originally announced April 2023.
-
Effects of Oxidation on the Tribological Properties of Diamond Sliding Against Silica. Insights from Ab initio Molecular Dynamics
Authors:
Huong T. T. Ta,
Nam V. Tran,
M. C. Righi
Abstract:
Tribological phenomena such as adhesion, friction, and wear can undermine the functionality of devices and applications based on the diamond-silica interface. Controlling these phenomena is highly desirable, but difficult since extrinsic factors, such as the surface termination by adsorbed species, can deeply affect the reactivity of diamond and its resistance to wear. In this work, we investigate…
▽ More
Tribological phenomena such as adhesion, friction, and wear can undermine the functionality of devices and applications based on the diamond-silica interface. Controlling these phenomena is highly desirable, but difficult since extrinsic factors, such as the surface termination by adsorbed species, can deeply affect the reactivity of diamond and its resistance to wear. In this work, we investigate the effects of diamond oxidation by massive ab initio molecular dynamics simulations of silica sliding against diamond surfaces considering different surface orientations, O-coverages, and tribological conditions. Our findings reveal a dual role of oxygen that depends on coverage. At full coverage, the adsorbed oxygen is very effective in friction and wear reduction because the repulsion with the silica counter-surface prevents the formation of chemical bonds across the interface. At reduced coverage and high pressure, Si-O-C bonds are anyway established. In this situation the presence of oxygen results detrimental as it weakens the surface C-C bonds making the surface more vulnerable to wear. Indeed we observed atomic wear on the C(110) surface at 50% O-coverage under harsh tribological conditions. The mechanisms of friction reduction and atomistic wear are explained through the analysis of the electronic properties and surface-surface interactions. Overall, our accurate in silico experiments shed light into the effects of adsorbed oxygen on the tribological behavior of diamond and show how oxidized diamond can be worn by silica.
△ Less
Submitted 24 April, 2023;
originally announced April 2023.
-
Atomistic Wear Mechanisms in Diamond: Effects of Surface Orientation, Stress, and Interaction with Adsorbed Molecules
Authors:
Huong T. T. Ta,
Nam V. Tran,
M. C. Righi
Abstract:
Despite its unrivaled hardness, diamond can be severely worn during the interaction with others, even softer materials. In this work, we calculate from first-principles the energy and forces necessary to induce the atomistic wear of diamond, and compare them for different surface orientations and passivation by oxygen, hydrogen, and water fragments. The primary mechanism of wear is identified as t…
▽ More
Despite its unrivaled hardness, diamond can be severely worn during the interaction with others, even softer materials. In this work, we calculate from first-principles the energy and forces necessary to induce the atomistic wear of diamond, and compare them for different surface orientations and passivation by oxygen, hydrogen, and water fragments. The primary mechanism of wear is identified as the detachment of carbon chains. This is particularly true for oxidized diamond and diamond interacting with silica. A very interesting result concerns the role of stress, which reveals that compressive stresses can highly favor wear, making it even energetically favorable.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Experimental Implementation of Short-Path Non-adiabatic Geometric Gates in a Superconducting Circuit
Authors:
Xin-Xin Yang,
Liang-Liang Guo,
Hai-Feng Zhang,
Lei Du,
Chi Zhang,
Hao-Ran Tao,
Yong Chen,
Peng Duan,
Zhi-Long Jia,
Wei-Cheng Kong,
Guo-** Guo
Abstract:
The non-adiabatic geometric quantum computation (NGQC) has attracted a lot of attention for noise-resilient quantum control. However, previous implementations of NGQC require long evolution paths that make them more vulnerable to incoherent errors than their dynamical counterparts.In this work, we experimentally realize a universal short-path non-adiabatic geometric gate set (SPNGQC) with a 2-time…
▽ More
The non-adiabatic geometric quantum computation (NGQC) has attracted a lot of attention for noise-resilient quantum control. However, previous implementations of NGQC require long evolution paths that make them more vulnerable to incoherent errors than their dynamical counterparts.In this work, we experimentally realize a universal short-path non-adiabatic geometric gate set (SPNGQC) with a 2-times shorter evolution path on a superconducting quantum processor. Characterizing with both quantum process tomography and randomized benchmarking methods, we report an average single-qubit gate fidelity of 99.86% and a two-qubit gate fidelity of 97.9%. Additionally, we demonstrate superior robustness of single-qubit SP-NGQC gate to Rabi frequency error in some certain parameter space by comparing their performance to those of the dynamical gates and the former NGQC gates.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Transformer-based approaches to Sentiment Detection
Authors:
Olumide Ebenezer Ojo,
Hoang Thang Ta,
Alexander Gelbukh,
Hiram Calvo,
Olaronke Oluwayemisi Adebanji,
Grigori Sidorov
Abstract:
The use of transfer learning methods is largely responsible for the present breakthrough in Natural Learning Processing (NLP) tasks across multiple domains. In order to solve the problem of sentiment detection, we examined the performance of four different types of well-known state-of-the-art transformer models for text classification. Models such as Bidirectional Encoder Representations from Tran…
▽ More
The use of transfer learning methods is largely responsible for the present breakthrough in Natural Learning Processing (NLP) tasks across multiple domains. In order to solve the problem of sentiment detection, we examined the performance of four different types of well-known state-of-the-art transformer models for text classification. Models such as Bidirectional Encoder Representations from Transformers (BERT), Robustly Optimized BERT Pre-training Approach (RoBERTa), a distilled version of BERT (DistilBERT), and a large bidirectional neural network architecture (XLNet) were proposed. The performance of the four models that were used to detect disaster in the text was compared. All the models performed well enough, indicating that transformer-based models are suitable for the detection of disaster in text. The RoBERTa transformer model performs best on the test dataset with a score of 82.6% and is highly recommended for quality predictions. Furthermore, we discovered that the learning algorithms' performance was influenced by the pre-processing techniques, the nature of words in the vocabulary, unbalanced labeling, and the model parameters.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
Lee-Yang zeros and quantum Fisher information matrix in a nonlinear system
Authors:
Hong Tao,
Yuguo Su,
Xingyu Zhang,
**g Liu,
Xiaoguang Wang
Abstract:
The distribution of Lee-Yang zeros not only matters in thermodynamics and quantum mechanics, but also in mathematics. Hereby we propose a nonlinear quantum toy model and discuss the distribution of corresponding Lee-Yang zeros. Utilizing the coupling between a probe qubit and the nonlinear system, all Lee-Yang zeros can be detected in the dynamics of the probe qubit by tuning the coupling strength…
▽ More
The distribution of Lee-Yang zeros not only matters in thermodynamics and quantum mechanics, but also in mathematics. Hereby we propose a nonlinear quantum toy model and discuss the distribution of corresponding Lee-Yang zeros. Utilizing the coupling between a probe qubit and the nonlinear system, all Lee-Yang zeros can be detected in the dynamics of the probe qubit by tuning the coupling strength and linear coefficient of the nonlinear system. Moreover, the analytical expression of the quantum Fisher information matrix at the Lee-Yang zeros is provided, and an interesting phenomenon is discovered. Both the coupling strength and temperature can simultaneously attain their precision limits at the Lee-Yang zeros. However, the probe qubit cannot work as a thermometer at a Lee-Yang zero if it sits on the unit circle.
△ Less
Submitted 3 August, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness
Authors:
Huaqi Tao,
Bingxi Liu,
**qiang Cui,
Hong Zhang
Abstract:
Cracks play a crucial role in assessing the safety and durability of manufactured buildings. However, the long and sharp topological features and complex background of cracks make the task of crack segmentation extremely challenging. In this paper, we propose a novel convolutional-transformer network based on encoder-decoder architecture to solve this challenge. Particularly, we designed a Dilated…
▽ More
Cracks play a crucial role in assessing the safety and durability of manufactured buildings. However, the long and sharp topological features and complex background of cracks make the task of crack segmentation extremely challenging. In this paper, we propose a novel convolutional-transformer network based on encoder-decoder architecture to solve this challenge. Particularly, we designed a Dilated Residual Block (DRB) and a Boundary Awareness Module (BAM). The DRB pays attention to the local detail of cracks and adjusts the feature dimension for other blocks as needed. And the BAM learns the boundary features from the dilated crack label. Furthermore, the DRB is combined with a lightweight transformer that captures global information to serve as an effective encoder. Experimental results show that the proposed network performs better than state-of-the-art algorithms on two typical datasets. Datasets, code, and trained models are available for research at https://github.com/HqiTao/CT-crackseg.
△ Less
Submitted 11 November, 2023; v1 submitted 22 February, 2023;
originally announced February 2023.
-
Artificial Intelligence System for Detection and Screening of Cardiac Abnormalities using Electrocardiogram Images
Authors:
Deyun Zhang,
Shijia Geng,
Yang Zhou,
Weilun Xu,
Guodong Wei,
Kai Wang,
Jie Yu,
Qiang Zhu,
Yongkui Li,
Yonghong Zhao,
Xingyue Chen,
Rui Zhang,
Zhaoji Fu,
Rongbo Zhou,
Yanqi E,
Sumei Fan,
Qinghao Zhao,
Chuandong Cheng,
Nan Peng,
Liang Zhang,
Linlin Zheng,
Jianjun Chu,
Hongbin Xu,
Chen Tan,
Jian Liu
, et al. (6 additional authors not shown)
Abstract:
The artificial intelligence (AI) system has achieved expert-level performance in electrocardiogram (ECG) signal analysis. However, in underdeveloped countries or regions where the healthcare information system is imperfect, only paper ECGs can be provided. Analysis of real-world ECG images (photos or scans of paper ECGs) remains challenging due to complex environments or interference. In this stud…
▽ More
The artificial intelligence (AI) system has achieved expert-level performance in electrocardiogram (ECG) signal analysis. However, in underdeveloped countries or regions where the healthcare information system is imperfect, only paper ECGs can be provided. Analysis of real-world ECG images (photos or scans of paper ECGs) remains challenging due to complex environments or interference. In this study, we present an AI system developed to detect and screen cardiac abnormalities (CAs) from real-world ECG images. The system was evaluated on a large dataset of 52,357 patients from multiple regions and populations across the world. On the detection task, the AI system obtained area under the receiver operating curve (AUC) of 0.996 (hold-out test), 0.994 (external test 1), 0.984 (external test 2), and 0.979 (external test 3), respectively. Meanwhile, the detection results of AI system showed a strong correlation with the diagnosis of cardiologists (cardiologist 1 (R=0.794, p<1e-3), cardiologist 2 (R=0.812, p<1e-3)). On the screening task, the AI system achieved AUCs of 0.894 (hold-out test) and 0.850 (external test). The screening performance of the AI system was better than that of the cardiologists (AI system (0.846) vs. cardiologist 1 (0.520) vs. cardiologist 2 (0.480)). Our study demonstrates the feasibility of an accurate, objective, easy-to-use, fast, and low-cost AI system for CA detection and screening. The system has the potential to be used by healthcare professionals, caregivers, and general users to assess CAs based on real-world ECG images.
△ Less
Submitted 10 February, 2023;
originally announced February 2023.
-
More is Better: A Database for Spontaneous Micro-Expression with High Frame Rates
Authors:
Sirui Zhao,
Huaying Tang,
Xinglong Mao,
Shifeng Liu,
Hanqing Tao,
Hao Wang,
Tong Xu,
Enhong Chen
Abstract:
As one of the most important psychic stress reactions, micro-expressions (MEs), are spontaneous and transient facial expressions that can reveal the genuine emotions of human beings. Thus, recognizing MEs (MER) automatically is becoming increasingly crucial in the field of affective computing, and provides essential technical support in lie detection, psychological analysis and other areas. Howeve…
▽ More
As one of the most important psychic stress reactions, micro-expressions (MEs), are spontaneous and transient facial expressions that can reveal the genuine emotions of human beings. Thus, recognizing MEs (MER) automatically is becoming increasingly crucial in the field of affective computing, and provides essential technical support in lie detection, psychological analysis and other areas. However, the lack of abundant ME data seriously restricts the development of cutting-edge data-driven MER models. Despite the recent efforts of several spontaneous ME datasets to alleviate this problem, it is still a tiny amount of work. To solve the problem of ME data hunger, we construct a dynamic spontaneous ME dataset with the largest current ME data scale, called DFME (Dynamic Facial Micro-expressions), which includes 7,526 well-labeled ME videos induced by 671 participants and annotated by more than 20 annotators throughout three years. Afterwards, we adopt four classical spatiotemporal feature learning models on DFME to perform MER experiments to objectively verify the validity of DFME dataset. In addition, we explore different solutions to the class imbalance and key-frame sequence sampling problems in dynamic MER respectively on DFME, so as to provide a valuable reference for future research. The comprehensive experimental results show that our DFME dataset can facilitate the research of automatic MER, and provide a new benchmark for MER. DFME will be published via https://mea-lab-421.github.io.
△ Less
Submitted 3 January, 2023;
originally announced January 2023.
-
Low-rank Tensor Assisted K-space Generative Model for Parallel Imaging Reconstruction
Authors:
Wei Zhang,
Zengwei Xiao,
Hui Tao,
Minghui Zhang,
Xiaoling Xu,
Qiegen Liu
Abstract:
Although recent deep learning methods, especially generative models, have shown good performance in fast magnetic resonance imaging, there is still much room for improvement in high-dimensional generation. Considering that internal dimensions in score-based generative models have a critical impact on estimating the gradient of the data distribution, we present a new idea, low-rank tensor assisted…
▽ More
Although recent deep learning methods, especially generative models, have shown good performance in fast magnetic resonance imaging, there is still much room for improvement in high-dimensional generation. Considering that internal dimensions in score-based generative models have a critical impact on estimating the gradient of the data distribution, we present a new idea, low-rank tensor assisted k-space generative model (LR-KGM), for parallel imaging reconstruction. This means that we transform original prior information into high-dimensional prior information for learning. More specifically, the multi-channel data is constructed into a large Hankel matrix and the matrix is subsequently folded into tensor for prior learning. In the testing phase, the low-rank rotation strategy is utilized to impose low-rank constraints on tensor output of the generative network. Furthermore, we alternately use traditional generative iterations and low-rank high-dimensional tensor iterations for reconstruction. Experimental comparisons with the state-of-the-arts demonstrated that the proposed LR-KGM method achieved better performance.
△ Less
Submitted 11 December, 2022;
originally announced December 2022.
-
Map** Process for the Task: Wikidata Statements to Text as Wikipedia Sentences
Authors:
Hoang Thang Ta,
Alexander Gelbukha,
Grigori Sidorov
Abstract:
Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has obtained rapid growth in recent years and desires continuously to expand content and disseminate knowledge values for everyone globally. The shortage of volunteers brings to Wikipedia many issues, including develo** content for over 300 languages at the present. Therefore, the benefit that mach…
▽ More
Acknowledged as one of the most successful online cooperative projects in human society, Wikipedia has obtained rapid growth in recent years and desires continuously to expand content and disseminate knowledge values for everyone globally. The shortage of volunteers brings to Wikipedia many issues, including develo** content for over 300 languages at the present. Therefore, the benefit that machines can automatically generate content to reduce human efforts on Wikipedia language projects could be considerable. In this paper, we propose our map** process for the task of converting Wikidata statements to natural language text (WS2T) for Wikipedia projects at the sentence level. The main step is to organize statements, represented as a group of quadruples and triples, and then to map them to corresponding sentences in English Wikipedia. We evaluate the output corpus in various aspects: sentence structure analysis, noise filtering, and relationships between sentence components based on word embedding models. The results are helpful not only for the data-to-text generation task but also for other relevant works in the field.
△ Less
Submitted 23 October, 2022;
originally announced October 2022.
-
WikiDes: A Wikipedia-Based Dataset for Generating Short Descriptions from Paragraphs
Authors:
Hoang Thang Ta,
Abu Bakar Siddiqur Rahman,
Navonil Majumder,
Amir Hussain,
Lotfollah Najjar,
Newton Howard,
Soujanya Poria,
Alexander Gelbukh
Abstract:
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many Natural Language Processing (NLP) tasks, such as information retrieval, knowledge base building, machine translation, text classification, and text summarization. In this paper, we introduce WikiDes, a novel dataset to generate short descriptions of Wikipedia articles for the problem of text summar…
▽ More
As free online encyclopedias with massive volumes of content, Wikipedia and Wikidata are key to many Natural Language Processing (NLP) tasks, such as information retrieval, knowledge base building, machine translation, text classification, and text summarization. In this paper, we introduce WikiDes, a novel dataset to generate short descriptions of Wikipedia articles for the problem of text summarization. The dataset consists of over 80k English samples on 6987 topics. We set up a two-phase summarization method - description generation (Phase I) and candidate ranking (Phase II) - as a strong approach that relies on transfer and contrastive learning. For description generation, T5 and BART show their superiority compared to other small-scale pre-trained models. By applying contrastive learning with the diverse input from beam search, the metric fusion-based ranking models outperform the direct description generation models significantly up to 22 ROUGE in topic-exclusive split and topic-independent split. Furthermore, the outcome descriptions in Phase II are supported by human evaluation in over 45.33% chosen compared to 23.66% in Phase I against the gold descriptions. In the aspect of sentiment analysis, the generated descriptions cannot effectively capture all sentiment polarities from paragraphs while doing this task better from the gold descriptions. The automatic generation of new descriptions reduces the human efforts in creating them and enriches Wikidata-based knowledge graphs. Our paper shows a practical impact on Wikipedia and Wikidata since there are thousands of missing descriptions. Finally, we expect WikiDes to be a useful dataset for related works in capturing salient information from short paragraphs. The curated dataset is publicly available at: https://github.com/declare-lab/WikiDes.
△ Less
Submitted 26 September, 2022;
originally announced September 2022.
-
iFlipper: Label Flip** for Individual Fairness
Authors:
Hantian Zhang,
Ki Hyun Tae,
Jaeyoung Park,
Xu Chu,
Steven Euijong Whang
Abstract:
As machine learning becomes prevalent, mitigating any unfairness present in the training data becomes critical. Among the various notions of fairness, this paper focuses on the well-known individual fairness, which states that similar individuals should be treated similarly. While individual fairness can be improved when training a model (in-processing), we contend that fixing the data before mode…
▽ More
As machine learning becomes prevalent, mitigating any unfairness present in the training data becomes critical. Among the various notions of fairness, this paper focuses on the well-known individual fairness, which states that similar individuals should be treated similarly. While individual fairness can be improved when training a model (in-processing), we contend that fixing the data before model training (pre-processing) is a more fundamental solution. In particular, we show that label flip** is an effective pre-processing technique for improving individual fairness. Our system iFlipper solves the optimization problem of minimally flip** labels given a limit to the individual fairness violations, where a violation occurs when two similar examples in the training data have different labels. We first prove that the problem is NP-hard. We then propose an approximate linear programming algorithm and provide theoretical guarantees on how close its result is to the optimal solution in terms of the number of label flips. We also propose techniques for making the linear programming solution more optimal without exceeding the violations limit. Experiments on real datasets show that iFlipper significantly outperforms other pre-processing baselines in terms of individual fairness and accuracy on unseen test sets. In addition, iFlipper can be combined with in-processing techniques for even better results.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Towards quantitative super-resolution microscopy: Molecular maps with statistical guarantees
Authors:
Katharina Proksch,
Frank Werner,
Jan Keller-Findeisen,
Haisen Ta,
Axel Munk
Abstract:
Quantifying the number of molecules from fluorescence microscopy measurements is an important topic in cell biology and medical research. In this work, we present a consecutive algorithm for super-resolution (STED) scanning microscopy that provides molecule counts in automatically generated image segments and offers statistical guarantees in form of asymptotic confidence intervals. To this end, we…
▽ More
Quantifying the number of molecules from fluorescence microscopy measurements is an important topic in cell biology and medical research. In this work, we present a consecutive algorithm for super-resolution (STED) scanning microscopy that provides molecule counts in automatically generated image segments and offers statistical guarantees in form of asymptotic confidence intervals. To this end, we first apply a multiscale scanning procedure on STED microscopy measurements of the sample to obtain a system of significant regions, each of which contains at least one molecule with prescribed uniform probability. This system of regions will typically be highly redundant and consists of rectangular building blocks. To choose an informative but non-redundant subset of more naturally shaped regions, we hybridize our system with the result of a generic segmentation algorithm. The diameter of the segments can be of the order of the resolution of the microscope. Using multiple photon coincidence measurements of the same sample in confocal mode, we are then able to estimate the brightness and number of the molecules and give uniform confidence intervals on the molecule counts for each previously constructed segment. In other words, we establish a so-called molecular map with uniform error control. The performance of the algorithm is investigated on simulated and real data.
△ Less
Submitted 2 October, 2023; v1 submitted 27 July, 2022;
originally announced July 2022.
-
A multi-cubic-kilometre neutrino telescope in the western Pacific Ocean
Authors:
Z. P. Ye,
F. Hu,
W. Tian,
Q. C. Chang,
Y. L. Chang,
Z. S. Cheng,
J. Gao,
T. Ge,
G. H. Gong,
J. Guo,
X. X. Guo,
X. G. He,
J. T. Huang,
K. Jiang,
P. K. Jiang,
Y. P. **g,
H. L. Li,
J. L. Li,
L. Li,
W. L. Li,
Z. Li,
N. Y. Liao,
Q. Lin,
F. Liu,
J. L. Liu
, et al. (33 additional authors not shown)
Abstract:
Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here…
▽ More
Next-generation neutrino telescopes with significantly improved sensitivity are required to pinpoint the sources of the diffuse astrophysical neutrino flux detected by IceCube and uncover the century-old puzzle of cosmic ray origins. A detector near the equator will provide a unique viewpoint of the neutrino sky, complementing IceCube and other neutrino telescopes in the Northern Hemisphere. Here we present results from an expedition to the north-eastern region of the South China Sea, in the western Pacific Ocean. A favorable neutrino telescope site was found on an abyssal plain at a depth of $\sim$ 3.5km. At depths below 3km, the sea current speed, water absorption and scattering lengths for Cherenkov light, were measured to be $v_{\mathrm{c}}<$10cm/s, $λ_{\mathrm{abs} }\simeq$ 27m and $λ_{\mathrm{sca} }\simeq$ 63m, respectively. Accounting for these measurements, we present the design and expected performance of a next-generation neutrino telescope, TRopIcal DEep-sea Neutrino Telescope (TRIDENT). With its advanced photon-detection technology and large dimensions, TRIDENT expects to observe the IceCube steady source candidate NGC 1068 with 5$σ$ significance within 1 year of operation. This level of sensitivity will open a new arena for diagnosing the origin of cosmic rays and probing fundamental physics over astronomical baselines.
△ Less
Submitted 13 May, 2024; v1 submitted 10 July, 2022;
originally announced July 2022.
-
A Self-Guided Framework for Radiology Report Generation
Authors:
Jun Li,
Shibo Li,
Ying Hu,
Huiren Tao
Abstract:
Automatic radiology report generation is essential to computer-aided diagnosis. Through the success of image captioning, medical report generation has been achievable. However, the lack of annotated disease labels is still the bottleneck of this area. In addition, the image-text data bias problem and complex sentences make it more difficult to generate accurate reports. To address these gaps, we p…
▽ More
Automatic radiology report generation is essential to computer-aided diagnosis. Through the success of image captioning, medical report generation has been achievable. However, the lack of annotated disease labels is still the bottleneck of this area. In addition, the image-text data bias problem and complex sentences make it more difficult to generate accurate reports. To address these gaps, we pre-sent a self-guided framework (SGF), a suite of unsupervised and supervised deep learning methods to mimic the process of human learning and writing. In detail, our framework obtains the domain knowledge from medical reports with-out extra disease labels and guides itself to extract fined-grain visual features as-sociated with the text. Moreover, SGF successfully improves the accuracy and length of medical report generation by incorporating a similarity comparison mechanism that imitates the process of human self-improvement through compar-ative practice. Extensive experiments demonstrate the utility of our SGF in the majority of cases, showing its superior performance over state-of-the-art meth-ods. Our results highlight the capacity of the proposed framework to distinguish fined-grained visual details between words and verify its advantage in generating medical reports.
△ Less
Submitted 19 June, 2022;
originally announced June 2022.