-
Topology-Informed Graph Transformer
Authors:
Yun Young Choi,
Sun Woo Park,
Minho Lee,
Youngho Woo
Abstract:
Transformers have revolutionized performance in Natural Language Processing and Vision, paving the way for their integration with Graph Neural Networks (GNNs). One key challenge in enhancing graph transformers is strengthening their discriminative power for distinguishing isomorphism classes of graphs, which plays a crucial role in boosting their predictive performance. To address this challenge, we introduce the 'Topology-Informed Graph Transformer (TIGT)', a novel transformer that enhances both the discriminative power in detecting graph isomorphisms and the overall performance of Graph Transformers. TIGT consists of four components: a topological positional embedding layer using non-isomorphic universal covers based on cyclic subgraphs of graphs to ensure unique graph representation; a dual-path message-passing layer to explicitly encode topological characteristics throughout the encoder layers; a global attention mechanism; and a graph information layer to recalibrate channel-wise graph features for better feature representation. TIGT outperforms previous Graph Transformers in classifying a synthetic dataset designed to distinguish isomorphism classes of graphs. Additionally, mathematical analysis and empirical evaluations highlight our model's competitive edge over state-of-the-art Graph Transformers across various benchmark datasets.
Submitted 2 February, 2024;
originally announced February 2024.
-
A Gated MLP Architecture for Learning Topological Dependencies in Spatio-Temporal Graphs
Authors:
Yun Young Choi,
Minho Lee,
Sun Woo Park,
Seunghwan Lee,
Joohwan Ko
Abstract:
Graph Neural Networks (GNNs) and Transformers have been increasingly adopted to learn the complex vector representations of spatio-temporal graphs, capturing intricate spatio-temporal dependencies crucial for applications such as traffic datasets. Although many existing methods utilize multi-head attention mechanisms and message-passing neural networks (MPNNs) to capture both spatial and temporal relations, these approaches encode temporal and spatial relations independently, and reflect the graph's topological characteristics in a limited manner. In this work, we introduce the Cycle to Mixer (Cy2Mixer), a novel spatio-temporal GNN based on topologically non-trivial invariants of spatio-temporal graphs with gated multi-layer perceptrons (gMLP). The Cy2Mixer is composed of three blocks based on MLPs: a message-passing block for encapsulating spatial information, a cycle message-passing block for enriching topological information through cyclic subgraphs, and a temporal block for capturing temporal properties. We bolster the effectiveness of Cy2Mixer with mathematical evidence emphasizing that our cycle message-passing block is capable of offering differentiated information to the deep learning model compared to the message-passing block. Furthermore, empirical evaluations substantiate the efficacy of the Cy2Mixer, demonstrating state-of-the-art performance across various traffic benchmark datasets.
Submitted 29 January, 2024;
originally announced January 2024.
-
Model-Free Reconstruction of Capacity Degradation Trajectory of Lithium-Ion Batteries Using Early Cycle Data
Authors:
Seongyoon Kim,
Hangsoon Jung,
Minho Lee,
Yun Young Choi,
Jung-Il Choi
Abstract:
Early degradation prediction of lithium-ion batteries is crucial for ensuring safety and preventing unexpected failure in manufacturing and diagnostic processes. Long-term capacity trajectory predictions can fail due to cumulative errors and noise. To address this issue, this study proposes a data-centric method that uses early single-cycle data to predict the capacity degradation trajectory of lithium-ion cells. The method involves predicting a few knots at specific retention levels using a deep learning-based model and interpolating them to reconstruct the trajectory. Two approaches are used to identify the retention levels of two to four knots: uniformly dividing the retention up to the end of life and finding optimal locations using Bayesian optimization. The proposed model is validated with experimental data from 169 cells using five-fold cross-validation. The results show that mean absolute percentage errors in trajectory prediction are less than 1.60% for all cases of knots. By predicting only the cycle numbers of at least two knots based on early single-cycle charge and discharge data, the model can directly estimate the overall capacity degradation trajectory. Further experiments suggest using three-cycle input data to achieve robust and efficient predictions, even in the presence of noise. The method is then applied to predict various shapes of capacity degradation patterns using additional experimental data from 82 cells. The study demonstrates that collecting only the cycle information of a few knots during model training and a few early cycle data points for predictions is sufficient for predicting capacity degradation. This can help establish appropriate warranties or replacement cycles in battery manufacturing and diagnosis processes.
Submitted 31 March, 2023;
originally announced March 2023.
-
The PWLR Graph Representation: A Persistent Weisfeiler-Lehman scheme with Random Walks for Graph Classification
Authors:
Sun Woo Park,
Yun Young Choi,
Dosang Joe,
U Jin Choi,
Youngho Woo
Abstract:
This paper presents the Persistent Weisfeiler-Lehman Random walk scheme (abbreviated as PWLR) for graph representations, a novel mathematical framework which produces a collection of explainable low-dimensional representations of graphs with discrete and continuous node features. The proposed scheme effectively incorporates the normalized Weisfeiler-Lehman procedure, random walks on graphs, and persistent homology. We thereby integrate three distinct properties of graphs, which are local topological features, node degrees, and global topological invariants, while preserving stability under graph perturbations. This generalizes many variants of Weisfeiler-Lehman procedures, which are primarily used to embed graphs with discrete node labels. Empirical results suggest that these representations can be efficiently utilized to produce results comparable to state-of-the-art techniques in classifying graphs with discrete node labels, and enhanced performance in classifying those with continuous node features.
Submitted 29 August, 2022;
originally announced August 2022.
-
Impedance-based Capacity Estimation for Lithium-Ion Batteries Using Generative Adversarial Network
Authors:
Seongyoon Kim,
Yun Young Choi,
Jung-Il Choi
Abstract:
This paper proposes a fully unsupervised methodology for the reliable extraction of latent variables representing the characteristics of lithium-ion batteries (LIBs) from electrochemical impedance spectroscopy (EIS) data using information maximizing generative adversarial networks. Meaningful representations can be obtained from EIS data even when measured with direct current and without relaxation, which are difficult to express when using circuit models. The extracted latent variables were investigated as capacity degradation progressed and were used to estimate the discharge capacity of the batteries by employing Gaussian process regression. The proposed method was validated under various conditions of EIS data during charging and discharging. The results indicate that the proposed model provides more robust capacity estimations than the direct capacity estimations obtained from EIS. We demonstrate that the latent variables extracted from the EIS data measured with direct current and without relaxation reliably represent the degradation characteristics of LIBs.
Submitted 2 July, 2021;
originally announced July 2021.
-
Observations of Abell 4059 with Chandra, HST and VLA: unraveling a complex cluster/radio-galaxy interaction
Authors:
Yun Young Choi,
Christopher S. Reynolds,
Sebastian Heinz,
Jessica L. Rosenberg,
Eric S. Perlman,
Jongmann Yang
Abstract:
(abridged) We present a detailed reanalysis of the Chandra data for the galaxy cluster Abell 4059 and its central radio galaxy, PKS 2354-35. We also present new 1.4 GHz and 4.7 GHz CnB-array radio data from the Very Large Array (VLA), as well as a short archival WFPC2 image from the Hubble Space Telescope. The presence of a strong interaction between this radio galaxy and the intracluster medium (ICM) was suggested by Huang & Sarazin (1998) on the basis of a short observation by the High Resolution Imager on ROSAT, and confirmed in our preliminary analysis of the Chandra/ACIS-S data. In particular, X-ray imaging clearly shows two cavities within the ICM that are approximately aligned with the radio-galaxy axis. However, using our new radio maps we fail to find a detailed correspondence between the 1 arcmin scale radio lobes and the ICM cavities. This suggests that the cavities are "ghosts" of a previous burst of powerful activity by PKS 2354-35. We also examine the nature of the central asymmetric ridge (or bar) of X-ray emission extending for 30 kpc south-west of the cluster center that has been noted in these previous analyses. We find the ridge to be denser and cooler than, but probably in pressure balance with, its surroundings. The thermal evolution of this structure seems to be dominated by radiative cooling, possibly enhanced by the radio-galaxy/ICM interaction. We discuss several possible models for the formation of this SW ridge and find none of them to be entirely satisfactory.
Submitted 5 February, 2004;
originally announced February 2004.
-
Advances in domain independent linear text segmentation
Authors:
Freddy Y. Y. Choi
Abstract:
This paper describes a method for linear text segmentation which is twice as accurate and over seven times as fast as the state-of-the-art (Reynar, 1998). Inter-sentence similarity is replaced by rank in the local context. Boundary locations are discovered by divisive clustering.
Submitted 30 March, 2000;
originally announced March 2000.