Search | arXiv e-print repository

Exploring the Potential of Large Language Models in Graph Generation

Authors: Yang Yao, Xin Wang, Zeyang Zhang, Yijian Qin, Ziwei Zhang, Xu Chu, Yuekui Yang, Wenwu Zhu, Hong Mei

Abstract: Large language models (LLMs) have achieved great success in many fields, and recent works have studied exploring LLMs for graph discriminative tasks such as node classification. However, the abilities of LLMs for graph generation remain unexplored in the literature. Graph generation requires the LLM to generate graphs with given properties, which has valuable real-world applications such as drug discovery, while tends to be more challenging. In this paper, we propose LLM4GraphGen to explore the ability of LLMs for graph generation with systematical task designs and extensive experiments. Specifically, we propose several tasks tailored with comprehensive experiments to address key questions regarding LLMs' understanding of different graph structure rules, their ability to capture structural type distributions, and their utilization of domain knowledge for property-based graph generation. Our evaluations demonstrate that LLMs, particularly GPT-4, exhibit preliminary abilities in graph generation tasks, including rule-based and distribution-based generation. We also observe that popular prompting methods, such as few-shot and chain-of-thought prompting, do not consistently enhance performance. Besides, LLMs show potential in generating molecules with specific properties. These findings may serve as foundations for designing good LLMs based models for graph generation and provide valuable insights and further research.

Submitted 21 March, 2024; originally announced March 2024.

arXiv:2312.08906 [pdf, other]

Using eye tracking to investigate what native Chinese speakers notice about linguistic landscape images

Authors: Zichao Wei, Yewei Qin

Abstract: Linguistic landscape is an important field in sociolinguistic research. Eye tracking technology is a common technology in psychological research. There are few cases of using eye movement to study linguistic landscape. This paper uses eye tracking technology to study the actual fixation of the linguistic landscape and finds that in the two dimensions of fixation time and fixation times, the fixation of native Chinese speakers to the linguistic landscape is higher than that of the general landscape. This paper argues that this phenomenon is due to the higher information density of linguistic landscapes. At the same time, the article also discusses other possible reasons for this phenomenon.

Submitted 2 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

ACM Class: J.4

arXiv:2302.12619 [pdf, other]

T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in Disease Progression

Authors: Yuchao Qin, Mihaela van der Schaar, Changhee Lee

Abstract: Clustering time-series data in healthcare is crucial for clinical phenotyping to understand patients' disease progression patterns and to design treatment guidelines tailored to homogeneous patient subgroups. While rich temporal dynamics enable the discovery of potential clusters beyond static correlations, two major challenges remain outstanding: i) discovery of predictive patterns from many potential temporal correlations in the multi-variate time-series data and ii) association of individual temporal patterns to the target label distribution that best characterizes the underlying clinical progression. To address such challenges, we develop a novel temporal clustering method, T-Phenotype, to discover phenotypes of predictive temporal patterns from labeled time-series data. We introduce an efficient representation learning approach in frequency domain that can encode variable-length, irregularly-sampled time-series into a unified representation space, which is then applied to identify various temporal patterns that potentially contribute to the target label using a new notion of path-based similarity. Throughout the experiments on synthetic and real-world datasets, we show that T-Phenotype achieves the best phenotype discovery performance over all the evaluated baselines. We further demonstrate the utility of T-Phenotype by uncovering clinically meaningful patient subgroups characterized by unique temporal patterns.

Submitted 24 February, 2023; originally announced February 2023.

arXiv:2207.02833 [pdf]

Mesoscopic Collective Activity in Excitatory Neural Fields: Cross-frequency Coupling

Authors: Yu Qin, Alex Sheremet

Abstract: In the brain, cross-frequency coupling has been hypothesized to result from the activity of specialized microcircuits. For example, theta-gamma coupling is assumed to be generated by specialized cell pairs (PING and ING mechanisms), or special cells (e.g., fast bursting neurons). However, this implies that the generating mechanisms is uniquely specific to the brain. In fact, cross-scale coupling is a phenomenon encountered in the physics of all large, multi-scale systems: phase and amplitude correlations between components of different scales arise as a result of nonlinear interaction. Because the brain is a multi-scale system too, a similar mechanism must be active in the brain. Here, we represent brain activity as a superposition of nonlinearly interacting patterns of spatio-temporal activity (collective activity), supported by populations of neurons. Cross-frequency coupling is a direct consequence of the nonlinear interactions, and does not require specialized cells or cell pairs. It is therefore universal, and must be active in neural fields of any composition. To emphasize this, we demonstrate the phenomenon in excitatory fields. While there is no doubt that specialized cells play a role in theta-gamma coupling, our results suggest that the coupling mechanism is at the same time simpler and richer: simpler because it involves the universal principle of nonlinearity; richer, because nonlinearity of collective activity is likely modulated by specialized-cell populations in ways to be yet understood.

Submitted 6 July, 2022; originally announced July 2022.

Comments: 24 pages, 10 figures

MSC Class: 35Q92; 92B20; 92C20

arXiv:2206.07966 [pdf]

Mesoscopic Collective Activity in Excitatory Neural Fields: Governing Equations

Authors: Yu Qin, Alex Sheremet

Abstract: In this study we derive the governing equations for mesoscopic collective activity in the cortex, starting from the generic Hodgkin-Huxley equations for microscopic cell dynamics. For simplicity, and to maintain focus on the essential elements of the derivation, the discussion is confined to excitatory neural fields. The fundamental assumption of the procedure is that mesoscale processes are macroscopic with respect to cell-scale activity, and emerge as the average behavior of a large population of cells. Because of their duration, action-potential details are assumed not observable at mesoscale; the essential mesoscopic function of action potentials is to redistribute energy in the neural field. The Hodgkin-Huxley dynamical model is first reduced to a set of equations that describe subthreshold dynamics. An ensemble average over a cell population then produces a closed system of equations involving two mesoscopic state variables: the density of kinetic energy J, carried by sodium ionic currents, and the excitability H of the neural field, which could be described as the average state of gating variable h. The resulting model is represented as essentially a subthreshold process; and the dynamical role of the firing rate is naturally reassessed as describing energy transfers. The linear properties of the equations are consistent with expectations for the dynamics of excitatory neural fields: the system supports oscillations of progressive waves, with shorter waves typically having higher frequencies, propagating slower, and decaying faster. Extending the derivation to include more complex cell dynamics (e.g., including other ionic channels, e.g., calcium channels) and multiple-type, excitatory-inhibitory, neural fields is straightforward, and will be presented elsewhere.

Submitted 6 July, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

Comments: 27 pages, 7 figures

MSC Class: 92-10 (Primary) 92B25 (Secondary)

arXiv:2110.01493 [pdf, other]

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

Authors: Ying Qin, Wei Liu, Zhiyuan Peng, Si-Ioi Ng, Jingyu Li, Haibo Hu, Tan Lee

Abstract: Alzheimer's disease (AD) is a progressive neurodegenerative disease and recently attracts extensive attention worldwide. Speech technology is considered a promising solution for the early diagnosis of AD and has been enthusiastically studied. Most recent works concentrate on the use of advanced BERT-like classifiers for AD detection. Input to these classifiers are speech transcripts produced by automatic speech recognition (ASR) models. The major challenge is that the quality of transcription could degrade significantly under complex acoustic conditions in the real world. The detection performance, in consequence, is largely limited. This paper tackles the problem via tailoring and adapting pre-trained neural-network based ASR model for the downstream AD recognition task. Only bottom layers of the ASR model are retained. A simple fully-connected neural network is added on top of the tailored ASR model for classification. The heavy BERT classifier is discarded. The resulting model is light-weight and can be fine-tuned in an end-to-end manner for AD recognition. Our proposed approach takes only raw speech as input, and no extra transcription process is required. The linguistic information of speech is implicitly encoded in the tailored ASR model and contributes to boosting the performance. Experiments show that our proposed approach outperforms the best manual transcript-based RoBERTa by an absolute margin of 4.6% in terms of accuracy. Our best-performing models achieve the accuracy of 83.2% and 78.0% in the long-audio and short-audio competition tracks of the 2021 NCMMSC Alzheimer's Disease Recognition Challenge, respectively.

Submitted 4 October, 2021; originally announced October 2021.

Comments: Accepted by NCMMSC2021

arXiv:2012.04217 [pdf, other]

doi 10.1103/PhysRevResearch.3.023218

Phase-Amplitude Coupling in Neuronal Oscillator Networks

Authors: Yuzhen Qin, Tommaso Menara, Danielle S. Bassett, Fabio Pasqualetti

Abstract: Phase-amplitude coupling (PAC) describes the phenomenon where the power of a high-frequency oscillation evolves with the phase of a low-frequency one. We propose a model that explains the emergence of PAC in two commonly-accepted architectures in the brain, namely, a high-frequency neural oscillation driven by an external low-frequency input and two interacting local oscillations with distinct, locally-generated frequencies. We further propose an interconnection structure for brain regions and demonstrate that low-frequency phase synchrony can integrate high-frequency activities regulated by local PAC and control the direction of information flow across distant regions.

Submitted 8 December, 2020; originally announced December 2020.

Comments: 6 pages, 5 figures

Journal ref: Phys. Rev. Research 3, 023218 (2021)

arXiv:2003.07405 [pdf, other]

doi 10.1109/LCSYS.2020.3005449

Mediated Remote Synchronization of Kuramoto-Sakaguchi Oscillators: the Number of Mediators Matters

Authors: Yuzhen Qin, Ming Cao, Brian D. O. Anderson, Danielle S. Bassett, Fabio Pasqualetti

Abstract: Cortical regions without direct neuronal connections have been observed to exhibit synchronized dynamics. A recent empirical study has further revealed that such regions that share more common neighbors are more likely to behave coherently. To analytically investigate the underlying mechanisms, we consider that a set of n oscillators, which have no direct connections, are linked through m intermediate oscillators (called mediators), forming a complete bipartite network structure. Modeling the oscillators by the Kuramoto-Sakaguchi model, we rigorously prove that mediated remote synchronization, i.e., synchronization between those n oscillators that are not directly connected, becomes more robust as the number of mediators increases. Simulations are also carried out to show that our theoretical findings can be applied to other general and complex networks.

Submitted 9 July, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

arXiv:1905.03611 [pdf, other]

Effect of E-cigarette Use and Social Network on Smoking Behavior Change: An agent-based model of E-cigarette and Cigarette Interaction

Authors: Yang Qin, Rojiemiahd Edjoc, Nathaniel D Osgood

Abstract: Despite a general reduction in smoking in many areas of the developed world, it remains one of the biggest public health threats. As an alternative to tobacco, the use of electronic cigarettes (ECig) has been increased dramatically over the last decade. ECig use is hypothesized to impact smoking behavior through several pathways, not only as a means of quitting cigarettes and lowering risk of relapse, but also as both an alternative nicotine delivery device to cigarettes, as a visible use of nicotine that can lead to imitative behavior in the form of smoking, and as a gateway nicotine delivery technology that can build high levels of nicotine tolerance and pave the way for initiation of smoking. Evidence regarding the effect of ECig use on smoking behavior change remains inconclusive. To address these challenges, we built an agent-based model (ABM) of smoking and ECig use to examine the effects of ECig use on smoking behavior change. The impact of social network (SN) on the initiation of smoking and ECig use were also explored. Findings from the simulation suggest that the use of ECig generates substantially lower prevalence of current smoker (PCS), which demonstrates the potential for reducing smoking and lowering the risk of relapse. The effects of proximity-based influences within SN increases the prevalence of current ECig user (PCEU). The model also suggests the importance of improved understanding of drivers in cessation and relapse in ECig use, in light of findings that such aspects of behavior change may notably influence smoking behavior change and burden.

Submitted 2 May, 2019; originally announced May 2019.

Comments: 10 pages, SBP-BRiMS 2019

arXiv:1905.02552 [pdf, other]

Multi-Scale Simulation Modeling for Prevention and Public Health Management of Diabetes in Pregnancy and Sequelae

Authors: Yang Qin, Louise Freebairn, Jo-An Atkinson, Weicheng Qian, Anahita Safarishahrbijari, Nathaniel D Osgood

Abstract: Diabetes in pregnancy (DIP) is an increasing public health priority in the Australian Capital Territory, particularly due to its impact on risk for developing Type 2 diabetes. While earlier diagnostic screening results in greater capacity for early detection and treatment, such benefits must be balanced with the greater demands this imposes on public health services. To address such planning challenges, a multi-scale hybrid simulation model of DIP was built to explore the interaction of risk factors and capture the dynamics underlying the development of DIP. The impact of interventions on health outcomes at the physiological, health service and population level is measured. Of particular central significance in the model is a compartmental model representing the underlying physiological regulation of glycemic status based on beta-cell dynamics and insulin resistance. The model also simulated the dynamics of continuous BMI evolution, glycemic status change during pregnancy and diabetes classification driven by the individual-level physiological model. We further modeled public health service pathways providing diagnosis and care for DIP to explore the optimization of resource use during service delivery. The model was extensively calibrated against empirical data.

Submitted 2 May, 2019; originally announced May 2019.

Comments: 10 pages, SBP-BRiMS 2019

arXiv:1406.4483 [pdf]

Effect of low-intensity pulsed ultrasound on biocompatibility and cellular uptake of chitosan-TPP nanoparticles

Authors: Junyi Wu, Gaojun Liu, Yi-Xian Qin, Yizhi Meng

Abstract: Using low molecular weight chitosan nanoparticles (CNPs) prepared by an ionic gelation method, we report the effect of low-intensity pulsed ultrasound (US) on cell viability and nanoparticle uptake in cultured murine pre-osteoblasts. Particle size and zeta potential are measured using Dynamic Light Scattering (DLS), and cell viability is evaluated using the MTS assay. Results show that 30 min delivery of CNPs at 0.5 mg/mL is able to prevent loss of cell viability due to either serum starvation or subsequent exposure to US (1 W/cm2 or 2 W/cm2, up to 1 min). Additionally, flow cytometry data suggest that there is a close association between cellular membrane integrity and the presence of CNPs when US at 2 W/cm2 is administered.

Submitted 17 June, 2014; originally announced June 2014.

Showing 1–11 of 11 results for author: Qin, Y