-
CrysMMNet: Multimodal Representation for Crystal Property Prediction
Authors:
Kishalay Das,
Pawan Goyal,
Seung-Cheol Lee,
Satadeep Bhattacharjee,
Niloy Ganguly
Abstract:
Machine learning models have emerged as a powerful tool for fast and accurate prediction of different crystalline properties. Existing state-of-the-art models rely on a single modality of crystal data, i.e., the crystal graph structure: they construct a multi-graph by establishing edges between nearby atoms in 3D space and apply a GNN to learn the material representation. They thereby encode local chemical semantics around the atoms successfully but fail to capture important global periodic structural information such as space group number, crystal symmetry, and rotational information, which influence different crystal properties. In this work, we leverage textual descriptions of materials to model global structural information into the graph structure and learn a more robust and enriched representation of crystalline materials. To this end, we first curate a textual dataset for crystalline material databases containing descriptions of each material. Further, we propose CrysMMNet, a simple multimodal framework that fuses structural and textual representations to generate a joint multimodal representation of crystalline materials. We conduct extensive experiments on two benchmark datasets across ten different properties to show that CrysMMNet outperforms existing state-of-the-art baseline methods by a good margin. We also observe that fusing the textual representation with the crystal graph structure provides consistent improvement for all the SOTA GNN models compared to their own vanilla versions. We have shared the textual dataset that we curated for both benchmark material databases with the community for future use.
Submitted 9 June, 2023;
originally announced July 2023.
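The fusion step named in the CrysMMNet abstract can be illustrated with a minimal sketch. The abstract does not state the fusion operator, so the concatenation followed by one dense layer used here is an assumption, and the names `fuse`, `linear`, `graph_emb`, and `text_emb` as well as the embedding sizes are hypothetical.

```python
import random

def linear(vec, weight, bias):
    """Dense layer; `weight` holds one row of coefficients per output unit."""
    return [sum(w * x for w, x in zip(row, vec)) + b
            for row, b in zip(weight, bias)]

def fuse(graph_emb, text_emb, weight, bias):
    """Concatenate the two modality embeddings, then apply dense layer + ReLU.

    Concatenation is an assumed fusion operator, not taken from the paper.
    """
    joint = list(graph_emb) + list(text_emb)
    return [max(0.0, h) for h in linear(joint, weight, bias)]

random.seed(0)
graph_emb = [random.gauss(0, 1) for _ in range(8)]    # stand-in for a GNN output
text_emb = [random.gauss(0, 1) for _ in range(16)]    # stand-in for a text-encoder output
weight = [[random.gauss(0, 0.1) for _ in range(24)] for _ in range(4)]
bias = [0.0] * 4

joint_repr = fuse(graph_emb, text_emb, weight, bias)
print(len(joint_repr))
```

In a trained model the dense layer would be learned jointly with the property-prediction head; here the weights are random and only the data flow is shown.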
-
CrysGNN: Distilling pre-trained knowledge to enhance property prediction for crystalline materials
Authors:
Kishalay Das,
Bidisha Samanta,
Pawan Goyal,
Seung-Cheol Lee,
Satadeep Bhattacharjee,
Niloy Ganguly
Abstract:
In recent years, graph neural network (GNN) based approaches have emerged as a powerful technique to encode the complex topological structure of crystal materials in an enriched representation space. These models are often supervised in nature and, using property-specific training data, learn the relationship between crystal structure and different properties such as formation energy, bandgap, and bulk modulus. Most of these methods require a huge amount of property-tagged data to train the system, which may not be available for different properties. However, a huge amount of crystal data with chemical composition and structural bonds is available. To leverage these untapped data, this paper presents CrysGNN, a new pre-trained GNN framework for crystalline materials, which captures both node- and graph-level structural information of crystal graphs using a huge amount of unlabelled material data. Further, we extract distilled knowledge from CrysGNN and inject it into different state-of-the-art property predictors to enhance their property prediction accuracy. We conduct extensive experiments to show that, with distilled knowledge from the pre-trained model, all the SOTA algorithms are able to outperform their own vanilla versions by good margins. We also observe that the distillation process provides a significant improvement over the conventional approach of fine-tuning the pre-trained model. We have released the pre-trained model along with the large, carefully curated dataset of 800K crystal graphs, so that the pre-trained model can be plugged into any existing or upcoming model to enhance its prediction accuracy.
Submitted 14 January, 2023;
originally announced January 2023.
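The idea of injecting distilled knowledge from a frozen pre-trained model into a property predictor can be sketched as a combined loss. This is a generic feature-distillation objective, assumed here for illustration; the abstract does not give the exact loss, and the function names, the squared-error terms, and the weighting parameter `alpha` are all hypothetical.

```python
def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def distillation_loss(pred, target, student_emb, teacher_emb, alpha=0.5):
    """Weighted sum of a property loss and a feature-matching loss.

    `teacher_emb` stands for the embedding of the frozen pre-trained model
    and `student_emb` for the embedding of the property predictor being
    trained. The convex weighting by `alpha` is an assumption.
    """
    property_loss = (pred - target) ** 2
    feature_loss = mse(student_emb, teacher_emb)
    return (1.0 - alpha) * property_loss + alpha * feature_loss

loss = distillation_loss(pred=1.0, target=0.0,
                         student_emb=[1.0, 2.0], teacher_emb=[1.0, 0.0],
                         alpha=0.5)
print(loss)
```

Only the student's parameters would receive gradients from this loss; the teacher stays fixed, which is what distinguishes this setup from fine-tuning the pre-trained model itself.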
-
CrysXPP: An Explainable Property Predictor for Crystalline Materials
Authors:
Kishalay Das,
Bidisha Samanta,
Pawan Goyal,
Seung-Cheol Lee,
Satadeep Bhattacharjee,
Niloy Ganguly
Abstract:
We present a deep-learning framework, CrysXPP, to allow rapid prediction of electronic, magnetic and elastic properties of a wide range of materials with reasonable precision. Although our work is consistent with several recent attempts to build deep learning-based property predictors, it overcomes some of their limitations. CrysXPP lowers the need for a large volume of tagged data to train a deep learning model by intelligently designing an autoencoder, CrysAE, and passing the structural information to the property prediction process. The autoencoder in turn is trained on a huge volume of untagged crystal graphs; the designed loss function helps in capturing all their important structural and chemical information. Moreover, CrysXPP uses only a small amount of tagged data for property prediction, and also trains a feature selector that provides interpretability to the results obtained. We demonstrate that CrysXPP convincingly performs better than all the competing and recent baseline algorithms across a diverse set of seven properties. Most notably, when given a small amount of experimental data, CrysXPP is consistently able to outperform conventional DFT. We release the large pretrained model CrysAE so that the research community can fine-tune it using a small amount of tagged data for various applications with restricted data sources.
Submitted 2 February, 2022; v1 submitted 22 April, 2021;
originally announced April 2021.
-
Positron Annihilation Study of ZnO Nanoparticles Grown under Folic Acid Template
Authors:
Sreetama Dutta,
Sourav Sarkar,
Bichitra Nandi Ganguly
Abstract:
Positron lifetime spectroscopy (PAL), Doppler broadening (DB) as well as coincidence Doppler broadening (CDB) spectroscopy of a new variety of folic acid (FA) capped zinc oxide nanoparticle samples have been performed at room temperature. The results show interesting patterns, hitherto unobserved in ZnO wurtzite crystalline samples, such as a predominance of positronium formation, as reflected in the PAL analysis, and a phase transition in the nanocrystalline samples at ~0.8-1.0 % FA concentration, as seen in the DB results. Also, the chemical environment of the samples has been analysed from the ratio curves of the CDB studies. Besides these, other independent results from X-ray diffraction (XRD) data and the Debye-Scherrer method, transmission electron microscopy (TEM) observations, as well as Fourier transform infrared spectroscopic (FT-IR) analysis are reported for comparison.
Submitted 8 January, 2014;
originally announced January 2014.
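The abstract above mentions crystallite sizing from XRD data. A common way to do this is the Scherrer relation D = Kλ / (β cos θ), sketched below. The function name, the shape factor K = 0.9, the Cu Kα wavelength of 0.15406 nm, and the example peak values are assumptions for illustration and are not taken from the paper.

```python
import math

def scherrer_size_nm(fwhm_deg, two_theta_deg,
                     wavelength_nm=0.15406, shape_factor=0.9):
    """Estimate crystallite size (nm) from one diffraction peak.

    fwhm_deg      : peak full width at half maximum, in degrees of 2-theta
    two_theta_deg : peak position, in degrees of 2-theta
    Defaults assume Cu K-alpha radiation and K = 0.9 (both assumptions).
    """
    beta = math.radians(fwhm_deg)               # width must be in radians
    theta = math.radians(two_theta_deg / 2.0)   # Bragg angle is half of 2-theta
    return shape_factor * wavelength_nm / (beta * math.cos(theta))

# Hypothetical peak: position 36.25 degrees, width 0.5 degrees
size = scherrer_size_nm(fwhm_deg=0.5, two_theta_deg=36.25)
print(round(size, 1), "nm")
```

The estimate ignores instrumental broadening and lattice strain, so it should be read as an order-of-magnitude figure unless those contributions are subtracted first.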
-
Positron Annihilation Study of Biopolymer Inulin for Understanding its Structural Organization
Authors:
Bichitra Nandi Ganguly,
Madhusudan Roy,
S. P. Moulik
Abstract:
Inulins are nanometer-size semi-crystalline particles composed of oligomeric fructose units. They have been subjected to fine micro-structural analysis under temperature variations using mainly positron annihilation spectroscopy. The results show a non-monotonic, temperature-sensitive behaviour of the positron parameters, with considerable variation of the free volume size. The ortho-positronium pick-off component shows a major thermotropic transition at ~320 K and a structure loss due to glass transition. Differential scanning calorimetry confirms the onset of the major molecular transition around the same temperature with an enthalpy change of ΔH ~ 379 J/g, and thermo-gravimetric analysis shows mass loss in the said transition. Keywords: inulin, fructose units, positron annihilation spectroscopy, microstructure, free volume analysis, thermotropic transition, thermal analysis.
Submitted 8 January, 2014;
originally announced January 2014.
-
Characteristics of Dispersed ZnO-Folic acid Conjugate in Aqueous Medium
Authors:
Sreetama Dutta,
Bichitra Nandi Ganguly
Abstract:
This article focuses on the aqueous dispersed state properties of inorganic ZnO nanoparticles (average size less than or equal to 4 nm), their surface modification and bio-functionalization with folic acid at physiological pH ~ 7.5, suitable for bio-imaging and targeted therapeutic application. While TEM studies of the ZnO nano-crystallites have been performed to estimate their size and morphology in the dry state, the band gap properties of the freshly prepared samples, the hydrodynamic size in the aqueous solution phase and the wide fluorescence range in the visible region have been investigated to establish that the sol is particularly suitable for biomedical purposes in the aqueous dispersed state.
Key words: ZnO nanoparticle; folic acid; band gap; hydrodynamic size; fluorescence.
Submitted 4 December, 2013;
originally announced December 2013.
-
Gauging Structural Aspects of ZnO nano-Crystal Growth Through X-ray Diffraction Studies and PAC
Authors:
Bichitra Nandi Ganguly,
Sreetama Dutta,
Soma Roy,
Jens Röder,
Karl Johnston,
ISOLDE-Collaboration
Abstract:
The structural characterization of sol-gel based nanocrystalline ZnO material is reported, as we observe several previously unreported structural aspects following a sequence of annealing stages. As-grown samples were characterised by Fourier transform infrared spectroscopy (FTIR). The chemical purity of the nano-grains and their crystallinity have been monitored by energy dispersive X-ray (EDAX) analysis and transmission electron microscopy (TEM), while the unusual changes in nano-crystal growth structure have been studied by the X-ray diffraction method. In addition, such samples have been studied using the perturbed angular correlation (PAC) technique with the short-lived radioactive probe 111mCd. Changes in the local electronic environment following sintering of the nanocrystalline grains have been observed by this method.
Submitted 23 November, 2013;
originally announced November 2013.
-
Understanding how both the partitions of a bipartite network affect its one-mode projection
Authors:
Animesh Mukherjee,
Monojit Choudhury,
Niloy Ganguly
Abstract:
It is well known that the degree distribution (DD) of the nodes in a partition of a bipartite network influences the DD of its one-mode projection on that partition. However, there are no studies exploring the effect of the DD of the other partition on the one-mode projection. In this article, we show that the DD of the other partition, in fact, has a very strong influence on the DD of the one-mode projection. We establish this fact by deriving the exact or approximate closed forms of the DD of the one-mode projection through the application of generating function formalism followed by the method of iterative convolution. The results are cross-validated through appropriate simulations.
Submitted 19 May, 2011;
originally announced May 2011.
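The claim in the abstract above, that the degree distribution of the other partition shapes the one-mode projection, can be checked on a toy example. The sketch below builds the projection directly from an edge list; the function name and the node labels are hypothetical, and the code illustrates the definition of the projection rather than the paper's generating-function derivation.

```python
from collections import defaultdict
from itertools import combinations

def projection_degrees(edges):
    """Degrees in the one-mode projection onto partition U.

    `edges` is a list of (u, v) pairs with u in U and v in V. Two U-nodes
    are linked in the projection if they share at least one V-neighbour.
    """
    members = defaultdict(set)            # V-node -> set of attached U-nodes
    for u, v in edges:
        members[v].add(u)
    neighbours = {u: set() for u, _ in edges}
    for group in members.values():
        for a, b in combinations(sorted(group), 2):
            neighbours[a].add(b)
            neighbours[b].add(a)
    return {u: len(nbrs) for u, nbrs in neighbours.items()}

# In both networks every U-node has bipartite degree 1; only V's degrees differ.
one_hub = [("u1", "v1"), ("u2", "v1"), ("u3", "v1"), ("u4", "v1")]
two_pairs = [("u1", "v1"), ("u2", "v1"), ("u3", "v2"), ("u4", "v2")]

print(projection_degrees(one_hub))
print(projection_degrees(two_pairs))
```

With a single V-node of degree 4 the projection is a complete graph on four nodes, while two V-nodes of degree 2 give two disjoint edges, even though the U-side degree distribution is identical in both cases.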
-
Analyzing the Degree Distribution of the One-mode Projection of an Alphabetic Bipartite Network (α-BIN)
Authors:
Animesh Mukherjee,
Monojit Choudhury,
Niloy Ganguly
Abstract:
The paper is being withdrawn since the authors felt that the submission is a little premature after a careful reading by some of the experts in this field.
Submitted 20 December, 2010; v1 submitted 4 February, 2009;
originally announced February 2009.
-
Generalized theory for node disruption in finite size complex networks
Authors:
Bivas Mitra,
Niloy Ganguly,
Sujoy Ghose,
Fernando Peruani
Abstract:
After a failure or attack, the structure of a complex network changes due to node removal. Here, we show that the degree distribution of the distorted network, under any node disturbances, can be easily computed through a simple formula. Based on this expression, we derive a general condition for the stability of non-correlated finite complex networks under any arbitrary attack. We apply this formalism to derive an expression for the percolation threshold $f_c$ under a general attack of the form $f_k \sim k^{\gamma}$, where $f_k$ stands for the probability that a node of degree $k$ is removed during the attack. We show that $f_c$ of a finite network of size $N$ exhibits an additive correction which scales as $N^{-1}$ with respect to the classical result for infinite networks.
Submitted 4 November, 2008;
originally announced November 2008.
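For orientation, the classical infinite-network baseline that the abstract above compares against can be computed in a few lines for the special case of uniform random removal. The sketch uses the Molloy-Reed ratio κ = ⟨k²⟩/⟨k⟩ and the threshold f_c = 1 − 1/(κ − 1) for uncorrelated networks. It does not reproduce the paper's finite-size correction or its degree-dependent attack, and the function name and input format are hypothetical.

```python
def classical_random_failure_threshold(degree_counts):
    """Critical removed fraction f_c for uniform random node removal.

    `degree_counts` maps a degree k to the number of nodes with that degree.
    Uses kappa = <k^2>/<k> and f_c = 1 - 1/(kappa - 1), valid for infinite
    uncorrelated networks. Returns 0.0 if the network has no giant
    component to begin with (kappa <= 2).
    """
    n = sum(degree_counts.values())
    mean_k = sum(k * c for k, c in degree_counts.items()) / n
    mean_k2 = sum(k * k * c for k, c in degree_counts.items()) / n
    kappa = mean_k2 / mean_k
    if kappa <= 2.0:
        return 0.0
    return 1.0 - 1.0 / (kappa - 1.0)

regular = {3: 10}          # ten nodes, all of degree 3
mixed = {2: 5, 4: 5}       # same node count, broader degree distribution

print(classical_random_failure_threshold(regular))
print(classical_random_failure_threshold(mixed))
```

A broader degree distribution raises κ and therefore the threshold, which is why heterogeneous networks tolerate more random failures than regular ones with a comparable mean degree.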
-
Emergence of a non-scaling degree distribution in bipartite networks: a numerical and analytical study
Authors:
Fernando Peruani,
Monojit Choudhury,
Animesh Mukherjee,
Niloy Ganguly
Abstract:
We study the growth of bipartite networks in which the number of nodes in one of the partitions is kept fixed while the other partition is allowed to grow. We study random and preferential attachment as well as a combination of both. We derive the exact analytical expression for the degree distribution of all these different types of attachment while assuming that edges are incorporated sequentially, i.e., a single edge is added to the growing network in a time step. We also provide an approximate expression for the case when more than one edge is added in a time step. We show that, depending on the relative weight between random and preferential attachment, the degree distribution of this type of network falls into one of four possible regimes, which range from a binomial distribution for pure random attachment to a U-shaped distribution for dominant preferential attachment.
Submitted 23 March, 2007;
originally announced March 2007.
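The growth process in the abstract above, with one partition fixed and one edge added per time step, can be simulated with a short sketch. The mixing rule used here (preferential attachment with probability `pref_weight`, otherwise uniform random) is an assumed stand-in for the paper's attachment kernel, and all names and parameter values are hypothetical.

```python
import random

def grow_bipartite(n_fixed, steps, pref_weight, seed=0):
    """Return the degrees of the fixed partition after `steps` edge additions.

    Each step models a new node in the growing partition attaching with a
    single edge. With probability `pref_weight` the target is chosen
    proportionally to its current degree; otherwise it is chosen uniformly.
    While all degrees are zero, the uniform choice is used as a fallback.
    """
    rng = random.Random(seed)
    degree = [0] * n_fixed
    for _ in range(steps):
        if sum(degree) > 0 and rng.random() < pref_weight:
            target = rng.choices(range(n_fixed), weights=degree)[0]
        else:
            target = rng.randrange(n_fixed)
        degree[target] += 1
    return degree

random_only = grow_bipartite(n_fixed=10, steps=200, pref_weight=0.0)
pref_only = grow_bipartite(n_fixed=10, steps=200, pref_weight=1.0)

print(sorted(random_only))
print(sorted(pref_only))
```

With purely random attachment the degrees spread across all fixed nodes, whereas with purely preferential attachment under this rule every edge after the first goes to the same node, the extreme end of the skewed regime.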