Search | arXiv e-print repository

Integrating Clinical Knowledge into Concept Bottleneck Models

Authors: Winnie Pang, Xueyi Ke, Satoshi Tsutsui, Bihan Wen

Abstract: Concept bottleneck models (CBMs), which predict human-interpretable concepts (e.g., nucleus shapes in cell images) before predicting the final output (e.g., cell type), provide insights into the decision-making processes of the model. However, training CBMs solely in a data-driven manner can introduce undesirable biases, which may compromise prediction performance, especially when the trained mode… ▽ More Concept bottleneck models (CBMs), which predict human-interpretable concepts (e.g., nucleus shapes in cell images) before predicting the final output (e.g., cell type), provide insights into the decision-making processes of the model. However, training CBMs solely in a data-driven manner can introduce undesirable biases, which may compromise prediction performance, especially when the trained models are evaluated on out-of-domain images (e.g., those acquired using different devices). To mitigate this challenge, we propose integrating clinical knowledge to refine CBMs, better aligning them with clinicians' decision-making processes. Specifically, we guide the model to prioritize the concepts that clinicians also prioritize. We validate our approach on two datasets of medical images: white blood cell and skin images. Empirical validation demonstrates that incorporating medical guidance enhances the model's classification performance on unseen datasets with varying preparation methods, thereby increasing its real-world applicability. △ Less

Submitted 9 July, 2024; originally announced July 2024.

Comments: Accepted to MICCAI2024

arXiv:2407.03133 [pdf, other]

Quantifying the Cross-sectoral Intersecting Discrepancies within Multiple Groups Using Latent Class Analysis Towards Fairness

Authors: Yingfang Yuan, Kefan Chen, Mehdi Rizvi, Lynne Baillie, Wei Pang

Abstract: The growing interest in fair AI development is evident. The ''Leave No One Behind'' initiative urges us to address multiple and intersecting forms of inequality in accessing services, resources, and opportunities, emphasising the significance of fairness in AI. This is particularly relevant as an increasing number of AI tools are applied to decision-making processes, such as resource allocation an… ▽ More The growing interest in fair AI development is evident. The ''Leave No One Behind'' initiative urges us to address multiple and intersecting forms of inequality in accessing services, resources, and opportunities, emphasising the significance of fairness in AI. This is particularly relevant as an increasing number of AI tools are applied to decision-making processes, such as resource allocation and service scheme development, across various sectors such as health, energy, and housing. Therefore, exploring joint inequalities in these sectors is significant and valuable for thoroughly understanding overall inequality and unfairness. This research introduces an innovative approach to quantify cross-sectoral intersecting discrepancies among user-defined groups using latent class analysis. These discrepancies can be used to approximate inequality and provide valuable insights to fairness issues. We validate our approach using both proprietary and public datasets, including EVENS and Census 2021 (England & Wales) datasets, to examine cross-sectoral intersecting discrepancies among different ethnic groups. We also verify the reliability of the quantified discrepancy by conducting a correlation analysis with a government public metric. Our findings reveal significant discrepancies between minority ethnic groups, highlighting the need for targeted interventions in real-world AI applications. Additionally, we demonstrate how the proposed approach can be used to provide insights into the fairness of machine learning. △ Less

Submitted 11 July, 2024; v1 submitted 24 May, 2024; originally announced July 2024.

arXiv:2406.04371 [pdf, other]

Phased Instruction Fine-Tuning for Large Language Models

Authors: Wei Pang, Chuan Zhou, Xiao-Hua Zhou, Xiaojie Wang

Abstract: Instruction Fine-Tuning enhances pre-trained language models from basic next-word prediction to complex instruction-following. However, existing One-off Instruction Fine-Tuning (One-off IFT) method, applied on a diverse instruction, may not effectively boost models' adherence to instructions due to the simultaneous handling of varying instruction complexities. To improve this, Phased Instruction F… ▽ More Instruction Fine-Tuning enhances pre-trained language models from basic next-word prediction to complex instruction-following. However, existing One-off Instruction Fine-Tuning (One-off IFT) method, applied on a diverse instruction, may not effectively boost models' adherence to instructions due to the simultaneous handling of varying instruction complexities. To improve this, Phased Instruction Fine-Tuning (Phased IFT) is proposed, based on the idea that learning to follow instructions is a gradual process. It assesses instruction difficulty using GPT-4, divides the instruction data into subsets of increasing difficulty, and uptrains the model sequentially on these subsets. Experiments with Llama-2 7B/13B/70B, Llama3 8/70B and Mistral-7B models using Alpaca data show that Phased IFT significantly outperforms One-off IFT, supporting the progressive alignment hypothesis and providing a simple and efficient way to enhance large language models. Codes and datasets from our experiments are freely available at https://github.com/xubuvd/PhasedSFT. △ Less

Submitted 16 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

Comments: The final version, to be appear at ACL 2024 Findings

arXiv:2405.19600 [pdf, ps, other]

Do spectral cues matter in contrast-based graph self-supervised learning?

Authors: Xiangru Jian, Xinjian Zhao, Wei Pang, Chaolong Ying, Yimu Wang, Yaoyao Xu, Tianshu Yu

Abstract: The recent surge in contrast-based graph self-supervised learning has prominently featured an intensified exploration of spectral cues. However, an intriguing paradox emerges, as methods grounded in seemingly conflicting assumptions or heuristic approaches regarding the spectral domain demonstrate notable enhancements in learning performance. This paradox prompts a critical inquiry into the genuin… ▽ More The recent surge in contrast-based graph self-supervised learning has prominently featured an intensified exploration of spectral cues. However, an intriguing paradox emerges, as methods grounded in seemingly conflicting assumptions or heuristic approaches regarding the spectral domain demonstrate notable enhancements in learning performance. This paradox prompts a critical inquiry into the genuine contribution of spectral information to contrast-based graph self-supervised learning. This study undertakes an extensive investigation into this inquiry, conducting a thorough study of the relationship between spectral characteristics and the learning outcomes of contemporary methodologies. Based on this analysis, we claim that the effectiveness and significance of spectral information need to be questioned. Instead, we revisit simple edge perturbation: random edge drop** designed for node-level self-supervised learning and random edge adding intended for graph-level self-supervised learning. Compelling evidence is presented that these simple yet effective strategies consistently yield superior performance while demanding significantly fewer computational resources compared to all prior spectral augmentation methods. The proposed insights represent a significant leap forward in the field, potentially resha** the understanding and implementation of graph self-supervised learning. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.19327 [pdf, other]

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series

Authors: Ge Zhang, Scott Qu, Jiaheng Liu, Chenchen Zhang, Chenghua Lin, Chou Leuang Yu, Danny Pan, Esther Cheng, Jie Liu, Qunshu Lin, Raven Yuan, Tuney Zheng, Wei Pang, Xinrun Du, Yiming Liang, Yinghao Ma, Yizhi Li, Ziyang Ma, Bill Lin, Emmanouil Benetos, Huan Yang, Junting Zhou, Kai**g Ma, Minghao Liu, Morry Niu , et al. (20 additional authors not shown)

Abstract: Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparabl… ▽ More Large Language Models (LLMs) have made great strides in recent years to achieve unprecedented performance across different tasks. However, due to commercial interest, the most competitive models like GPT, Gemini, and Claude have been gated behind proprietary interfaces without disclosing the training details. Recently, many institutions have open-sourced several strong LLMs like LLaMA-3, comparable to existing closed-source LLMs. However, only the model's weights are provided with most details (e.g., intermediate checkpoints, pre-training corpus, and training code, etc.) being undisclosed. To improve the transparency of LLMs, the research community has formed to open-source truly open LLMs (e.g., Pythia, Amber, OLMo), where more details (e.g., pre-training corpus and training code) are being provided. These models have greatly advanced the scientific study of these large models including their strengths, weaknesses, biases and risks. However, we observe that the existing truly open LLMs on reasoning, knowledge, and coding tasks are still inferior to existing state-of-the-art LLMs with similar model sizes. To this end, we open-source MAP-Neo, a highly capable and transparent bilingual language model with 7B parameters trained from scratch on 4.5T high-quality tokens. Our MAP-Neo is the first fully open-sourced bilingual LLM with comparable performance compared to existing state-of-the-art LLMs. Moreover, we open-source all details to reproduce our MAP-Neo, where the cleaned pre-training corpus, data cleaning pipeline, checkpoints, and well-optimized training/evaluation framework are provided. Finally, we hope our MAP-Neo will enhance and strengthen the open research community and inspire more innovations and creativities to facilitate the further improvements of LLMs. △ Less

Submitted 10 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

Comments: https://map-neo.github.io/

arXiv:2405.17724 [pdf, ps, other]

ClavaDDPM: Multi-relational Data Synthesis with Cluster-guided Diffusion Models

Authors: Wei Pang, Masoumeh Shafieinejad, Lucy Liu, Xi He

Abstract: Recent research in tabular data synthesis has focused on single tables, whereas real-world applications often involve complex data with tens or hundreds of interconnected tables. Previous approaches to synthesizing multi-relational (multi-table) data fall short in two key aspects: scalability for larger datasets and capturing long-range dependencies, such as correlations between attributes spread… ▽ More Recent research in tabular data synthesis has focused on single tables, whereas real-world applications often involve complex data with tens or hundreds of interconnected tables. Previous approaches to synthesizing multi-relational (multi-table) data fall short in two key aspects: scalability for larger datasets and capturing long-range dependencies, such as correlations between attributes spread across different tables. Inspired by the success of diffusion models in tabular data modeling, we introduce $\textbf{C}luster$ $\textbf{La}tent$ $\textbf{Va}riable$ $guided$ $\textbf{D}enoising$ $\textbf{D}iffusion$ $\textbf{P}robabilistic$ $\textbf{M}odels$ (ClavaDDPM). This novel approach leverages clustering labels as intermediaries to model relationships between tables, specifically focusing on foreign key constraints. ClavaDDPM leverages the robust generation capabilities of diffusion models while incorporating efficient algorithms to propagate the learned latent variables across tables. This enables ClavaDDPM to capture long-range dependencies effectively. Extensive evaluations on multi-table datasets of varying sizes show that ClavaDDPM significantly outperforms existing methods for these long-range dependencies while remaining competitive on utility metrics for single-table data. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.10452 [pdf, other]

Navigating Public Sentiment in the Circular Economy through Topic Modelling and Hyperparameter Optimisation

Authors: Junhao Song, Yingfang Yuan, Kaiwen Chang, Bing Xu, ** Xuan, Wei Pang

Abstract: To advance the circular economy (CE), it is crucial to gain insights into the evolution of public sentiments, cognitive pathways of the masses concerning circular products and digital technology, and recognise the primary concerns. To achieve this, we collected data related to the CE from diverse platforms including Twitter, Reddit, and The Guardian. This comprehensive data collection spanned acro… ▽ More To advance the circular economy (CE), it is crucial to gain insights into the evolution of public sentiments, cognitive pathways of the masses concerning circular products and digital technology, and recognise the primary concerns. To achieve this, we collected data related to the CE from diverse platforms including Twitter, Reddit, and The Guardian. This comprehensive data collection spanned across three distinct strata of the public: the general public, professionals, and official sources. Subsequently, we utilised three topic models on the collected data. Topic modelling represents a type of data-driven and machine learning approach for text mining, capable of automatically categorising a large number of documents into distinct semantic groups. Simultaneously, these groups are described by topics, and these topics can aid in understanding the semantic content of documents at a high level. However, the performance of topic modelling may vary depending on different hyperparameter values. Therefore, in this study, we proposed a framework for topic modelling with hyperparameter optimisation for CE and conducted a series of systematic experiments to ensure that topic models are set with appropriate hyperparameters and to gain insights into the correlations between the CE and public opinion based on well-established models. The results of this study indicate that concerns about sustainability and economic impact persist across all three datasets. Official sources demonstrate a higher level of engagement with the application and regulation of CE. To the best of our knowledge, this study is pioneering in investigating various levels of public opinions concerning CE through topic modelling with the exploration of hyperparameter optimisation. △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.04620 [pdf, ps, other]

Folded context condensation in Path Integral formalism for infinite context transformers

Authors: Won-Gi Paeng, Daesuk Kwon

Abstract: This short note is written for rapid communication of long context training and to share the idea of how to train it with low memory usage. In the note, we generalize the attention algorithm and neural network of Generative Pre-Trained Transformers and reinterpret it in Path integral formalism. First, the role of the transformer is understood as the time evolution of the token state and second, it… ▽ More This short note is written for rapid communication of long context training and to share the idea of how to train it with low memory usage. In the note, we generalize the attention algorithm and neural network of Generative Pre-Trained Transformers and reinterpret it in Path integral formalism. First, the role of the transformer is understood as the time evolution of the token state and second, it is suggested that the all key-token states in the same time as the query-token can attend to the attention with the query token states. As a result of the repetitive time evolution, it is discussed that the token states in the past sequence meats the token states in the present sequence so that the attention between separated sequences becomes possible for maintaining infinite contextual information just by using low memory for limited size of sequence. For the experiment, the $12$ input token window size was taken and one GPU with $24$GB memory was used for the pre-training. It was confirmed that more than $150$ length context is preserved. The sampling result of the training, the code and the other details will be included in the revised version of this note later. △ Less

Submitted 9 May, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

Comments: 7 pages, 2 figures

arXiv:2404.05083 [pdf, other]

HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models

Authors: Yimu Wang, Shuai Yuan, Xiangru Jian, Wei Pang, Mushi Wang, Ning Yu

Abstract: While recent progress in video-text retrieval has been driven by the exploration of powerful model architectures and training strategies, the representation learning ability of video-text retrieval models is still limited due to low-quality and scarce training data annotations. To address this issue, we present a novel video-text learning paradigm, HaVTR, which augments video and text data to lear… ▽ More While recent progress in video-text retrieval has been driven by the exploration of powerful model architectures and training strategies, the representation learning ability of video-text retrieval models is still limited due to low-quality and scarce training data annotations. To address this issue, we present a novel video-text learning paradigm, HaVTR, which augments video and text data to learn more generalized features. Specifically, we first adopt a simple augmentation method, which generates self-similar data by randomly duplicating or drop** subwords and frames. In addition, inspired by the recent advancement in visual and language generative models, we propose a more powerful augmentation method through textual paraphrasing and video stylization using large language models (LLMs) and visual generative models (VGMs). Further, to bring richer information into video and text, we propose a hallucination-based augmentation method, where we use LLMs and VGMs to generate and add new relevant information to the original data. Benefiting from the enriched data, extensive experiments on several video-text retrieval benchmarks demonstrate the superiority of HaVTR over existing methods. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2402.07271 [pdf, other]

Previously on the Stories: Recap Snippet Identification for Story Reading

Authors: Jiangnan Li, Qiu**g Wang, Liyan Xu, Wenjie Pang, Mo Yu, Zheng Lin, Wei** Wang, Jie Zhou

Abstract: Similar to the "previously-on" scenes in TV shows, recaps can help book reading by recalling the readers' memory about the important elements in previous texts to better understand the ongoing plot. Despite its usefulness, this application has not been well studied in the NLP community. We propose the first benchmark on this useful task called Recap Snippet Identification with a hand-crafted evalu… ▽ More Similar to the "previously-on" scenes in TV shows, recaps can help book reading by recalling the readers' memory about the important elements in previous texts to better understand the ongoing plot. Despite its usefulness, this application has not been well studied in the NLP community. We propose the first benchmark on this useful task called Recap Snippet Identification with a hand-crafted evaluation dataset. Our experiments show that the proposed task is challenging to PLMs, LLMs, and proposed methods as the task requires a deep understanding of the plot correlation between snippets. △ Less

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2402.07244 [pdf, other]

SAIS: A Novel Bio-Inspired Artificial Immune System Based on Symbiotic Paradigm

Authors: Junhao Song, Yingfang Yuan, Wei Pang

Abstract: We propose a novel type of Artificial Immune System (AIS): Symbiotic Artificial Immune Systems (SAIS), drawing inspiration from symbiotic relationships in biology. SAIS parallels the three key stages (i.e., mutualism, commensalism and parasitism) of population updating from the Symbiotic Organisms Search (SOS) algorithm. This parallel approach effectively addresses the challenges of large populati… ▽ More We propose a novel type of Artificial Immune System (AIS): Symbiotic Artificial Immune Systems (SAIS), drawing inspiration from symbiotic relationships in biology. SAIS parallels the three key stages (i.e., mutualism, commensalism and parasitism) of population updating from the Symbiotic Organisms Search (SOS) algorithm. This parallel approach effectively addresses the challenges of large population size and enhances population diversity in AIS, which traditional AIS and SOS struggle to resolve efficiently. We conducted a series of experiments, which demonstrated that our SAIS achieved comparable performance to the state-of-the-art approach SOS and outperformed other popular AIS approaches and evolutionary algorithms across 26 benchmark problems. Furthermore, we investigated the problem of parameter selection and found that SAIS performs better in handling larger population sizes while requiring fewer generations. Finally, we believe SAIS, as a novel bio-inspired and immune-inspired algorithm, paves the way for innovation in bio-inspired computing with the symbiotic paradigm. △ Less

Submitted 11 February, 2024; originally announced February 2024.

arXiv:2401.05264 [pdf]

Comparison of Markowitz Model and Single-Index Model on Portfolio Selection of Malaysian Stocks

Authors: Zhang Chern Lee, Wei Yun Tan, Hoong Khen Koo, Wilson Pang

Abstract: Our article is focused on the application of Markowitz Portfolio Theory and the Single Index Model on 10-year historical monthly return data for 10 stocks included in FTSE Bursa Malaysia KLCI, which is also our market index, as well as a risk-free asset which is the monthly fixed deposit rate. We will calculate the minimum variance portfolio and maximum Sharpe portfolio for both the Markowitz mode… ▽ More Our article is focused on the application of Markowitz Portfolio Theory and the Single Index Model on 10-year historical monthly return data for 10 stocks included in FTSE Bursa Malaysia KLCI, which is also our market index, as well as a risk-free asset which is the monthly fixed deposit rate. We will calculate the minimum variance portfolio and maximum Sharpe portfolio for both the Markowitz model and Single Index model subject to five different constraints, with the results presented in the form of tables and graphs such that comparisons between the different models and constraints can be made. We hope this article will help provide useful information for future investors who are interested in the Malaysian stock market and would like to construct an efficient investment portfolio. Keywords: Markowitz Portfolio Theory, Single Index Model, FTSE Bursa Malaysia KLCI, Efficient Portfolio △ Less

Submitted 10 January, 2024; originally announced January 2024.

Comments: 19 pages, 5 figures

arXiv:2310.04677 [pdf, other]

AG-CRC: Anatomy-Guided Colorectal Cancer Segmentation in CT with Imperfect Anatomical Knowledge

Authors: Rongzhao Zhang, Zhian Bai, Ruoying Yu, Wenrao Pang, Lingyun Wang, Lifeng Zhu, Xiaofan Zhang, Huan Zhang, Weiguo Hu

Abstract: When delineating lesions from medical images, a human expert can always keep in mind the anatomical structure behind the voxels. However, although high-quality (though not perfect) anatomical information can be retrieved from computed tomography (CT) scans with modern deep learning algorithms, it is still an open problem how these automatically generated organ masks can assist in addressing challe… ▽ More When delineating lesions from medical images, a human expert can always keep in mind the anatomical structure behind the voxels. However, although high-quality (though not perfect) anatomical information can be retrieved from computed tomography (CT) scans with modern deep learning algorithms, it is still an open problem how these automatically generated organ masks can assist in addressing challenging lesion segmentation tasks, such as the segmentation of colorectal cancer (CRC). In this paper, we develop a novel Anatomy-Guided segmentation framework to exploit the auto-generated organ masks to aid CRC segmentation from CT, namely AG-CRC. First, we obtain multi-organ segmentation (MOS) masks with existing MOS models (e.g., TotalSegmentor) and further derive a more robust organ of interest (OOI) mask that may cover most of the colon-rectum and CRC voxels. Then, we propose an anatomy-guided training patch sampling strategy by optimizing a heuristic gain function that considers both the proximity of important regions (e.g., the tumor or organs of interest) and sample diversity. Third, we design a novel self-supervised learning scheme inspired by the topology of tubular organs like the colon to boost the model performance further. Finally, we employ a masked loss scheme to guide the model to focus solely on the essential learning region. We extensively evaluate the proposed method on two CRC segmentation datasets, where substantial performance improvement (5% to 9% in Dice) is achieved over current state-of-the-art medical image segmentation models, and the ablation studies further evidence the efficacy of every proposed component. △ Less

Submitted 30 November, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

Comments: under review

arXiv:2308.00560 [pdf, other]

Reinforcement Learning-based Non-Autoregressive Solver for Traveling Salesman Problems

Authors: Yubin Xiao, Di Wang, Boyang Li, Huanhuan Chen, Wei Pang, Xuan Wu, Hao Li, Dong Xu, Yanchun Liang, You Zhou

Abstract: The Traveling Salesman Problem (TSP) is a well-known combinatorial optimization problem with broad real-world applications. Recently, neural networks have gained popularity in this research area because they provide strong heuristic solutions to TSPs. Compared to autoregressive neural approaches, non-autoregressive (NAR) networks exploit the inference parallelism to elevate inference speed but suf… ▽ More The Traveling Salesman Problem (TSP) is a well-known combinatorial optimization problem with broad real-world applications. Recently, neural networks have gained popularity in this research area because they provide strong heuristic solutions to TSPs. Compared to autoregressive neural approaches, non-autoregressive (NAR) networks exploit the inference parallelism to elevate inference speed but suffer from comparatively low solution quality. In this paper, we propose a novel NAR model named NAR4TSP, which incorporates a specially designed architecture and an enhanced reinforcement learning strategy. To the best of our knowledge, NAR4TSP is the first TSP solver that successfully combines RL and NAR networks. The key lies in the incorporation of NAR network output decoding into the training process. NAR4TSP efficiently represents TSP encoded information as rewards and seamlessly integrates it into reinforcement learning strategies, while maintaining consistent TSP sequence constraints during both training and testing phases. Experimental results on both synthetic and real-world TSP instances demonstrate that NAR4TSP outperforms four state-of-the-art models in terms of solution quality, inference speed, and generalization to unseen scenarios. △ Less

Submitted 17 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

Comments: 14 pages, 5 figures

arXiv:2306.13531 [pdf, other]

WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological Attributes

Authors: Satoshi Tsutsui, Winnie Pang, Bihan Wen

Abstract: The examination of blood samples at a microscopic level plays a fundamental role in clinical diagnostics, influencing a wide range of medical conditions. For instance, an in-depth study of White Blood Cells (WBCs), a crucial component of our blood, is essential for diagnosing blood-related diseases such as leukemia and anemia. While multiple datasets containing WBC images have been proposed, they… ▽ More The examination of blood samples at a microscopic level plays a fundamental role in clinical diagnostics, influencing a wide range of medical conditions. For instance, an in-depth study of White Blood Cells (WBCs), a crucial component of our blood, is essential for diagnosing blood-related diseases such as leukemia and anemia. While multiple datasets containing WBC images have been proposed, they mostly focus on cell categorization, often lacking the necessary morphological details to explain such categorizations, despite the importance of explainable artificial intelligence (XAI) in medical domains. This paper seeks to address this limitation by introducing comprehensive annotations for WBC images. Through collaboration with pathologists, a thorough literature review, and manual inspection of microscopic images, we have identified 11 morphological attributes associated with the cell and its components (nucleus, cytoplasm, and granules). We then annotated ten thousand WBC images with these attributes. Moreover, we conduct experiments to predict these attributes from images, providing insights beyond basic WBC classification. As the first public dataset to offer such extensive annotations, we also illustrate specific applications that can benefit from our attribute annotations. Overall, our dataset paves the way for interpreting WBC recognition models, further advancing XAI in the fields of pathology and hematology. △ Less

Submitted 25 December, 2023; v1 submitted 23 June, 2023; originally announced June 2023.

Comments: Neural Information Processing Systems 2023

arXiv:2306.02054 [pdf]

Low-Complexity Acoustic Scene Classification Using Data Augmentation and Lightweight ResNet

Authors: Yanxiong Li, Wenchang Cao, Wei Xie, Qisheng Huang, Wenfeng Pang, Qianhua He

Abstract: We present a work on low-complexity acoustic scene classification (ASC) with multiple devices, namely the subtask A of Task 1 of the DCASE2021 challenge. This subtask focuses on classifying audio samples of multiple devices with a low-complexity model, where two main difficulties need to be overcome. First, the audio samples are recorded by different devices, and there is mismatch of recording dev… ▽ More We present a work on low-complexity acoustic scene classification (ASC) with multiple devices, namely the subtask A of Task 1 of the DCASE2021 challenge. This subtask focuses on classifying audio samples of multiple devices with a low-complexity model, where two main difficulties need to be overcome. First, the audio samples are recorded by different devices, and there is mismatch of recording devices in audio samples. We reduce the negative impact of the mismatch of recording devices by using some effective strategies, including data augmentation (e.g., mix-up, spectrum correction, pitch shift), usages of multi-patch network structure and channel attention. Second, the model size should be smaller than a threshold (e.g., 128 KB required by the DCASE2021 challenge). To meet this condition, we adopt a ResNet with both depthwise separable convolution and channel attention as the backbone network, and perform model compression. In summary, we propose a low-complexity ASC method using data augmentation and a lightweight ResNet. Evaluated on the official development and evaluation datasets, our method obtains classification accuracy scores of 71.6% and 66.7%, respectively; and obtains Log-loss scores of 1.038 and 1.136, respectively. Our final model size is 110.3 KB which is smaller than the maximum of 128 KB. △ Less

Submitted 3 June, 2023; originally announced June 2023.

Comments: 5 pages, 5 figures, 4 tables. Accepted for publication in the 16th IEEE International Conference on Signal Processing (IEEE ICSP)

arXiv:2305.19724 [pdf, other]

A Surrogate Model Framework for Explainable Autonomous Behaviour

Authors: Konstantinos Gavriilidis, Andrea Munafo, Wei Pang, Helen Hastie

Abstract: Adoption and deployment of robotic and autonomous systems in industry are currently hindered by the lack of transparency, required for safety and accountability. Methods for providing explanations are needed that are agnostic to the underlying autonomous system and easily updated. Furthermore, different stakeholders with varying levels of expertise, will require different levels of information. In… ▽ More Adoption and deployment of robotic and autonomous systems in industry are currently hindered by the lack of transparency, required for safety and accountability. Methods for providing explanations are needed that are agnostic to the underlying autonomous system and easily updated. Furthermore, different stakeholders with varying levels of expertise, will require different levels of information. In this work, we use surrogate models to provide transparency as to the underlying policies for behaviour activation. We show that these surrogate models can effectively break down autonomous agents' behaviour into explainable components for use in natural language explanations. △ Less

Submitted 31 May, 2023; originally announced May 2023.

arXiv:2305.11024 [pdf, other]

CDIDN: A Registration Model with High Deformation Impedance Capability for Long-Term Tracking of Pulmonary Lesion Dynamics

Authors: Xinyu Zhao, Sa Huang, Wei Pang, You Zhou

Abstract: We study the problem of registration for medical CT images from a novel perspective -- the sensitivity to degree of deformations in CT images. Although some learning-based methods have shown success in terms of average accuracy, their ability to handle regions with local large deformation (LLD) may significantly decrease compared to dealing with regions with minor deformation. This motivates our r… ▽ More We study the problem of registration for medical CT images from a novel perspective -- the sensitivity to degree of deformations in CT images. Although some learning-based methods have shown success in terms of average accuracy, their ability to handle regions with local large deformation (LLD) may significantly decrease compared to dealing with regions with minor deformation. This motivates our research into this issue. Two main causes of LLDs are organ motion and changes in tissue structure, with the latter often being a long-term process. In this paper, we propose a novel registration model called Cascade-Dilation Inter-Layer Differential Network (CDIDN), which exhibits both high deformation impedance capability (DIC) and accuracy. CDIDN improves its resilience to LLDs in CT images by enhancing LLDs in the displacement field (DF). It uses a feature-based progressive decomposition of LLDs, blending feature flows of different levels into a main flow in a top-down manner. It leverages Inter-Layer Differential Module (IDM) at each level to locally refine the main flow and globally smooth the feature flow, and also integrates feature velocity fields that can effectively handle feature deformations of various degrees. We assess CDIDN using lungs as representative organs with large deformation. Our findings show that IDM significantly enhances LLDs of the DF, by which improves the DIC and accuracy of the model. Compared with other outstanding learning-based methods, CDIDN exhibits the best DIC and excellent accuracy. Based on vessel enhancement and enhanced LLDs of the DF, we propose a novel method to accurately track the appearance, disappearance, enlargement, and shrinkage of pulmonary lesions, which effectively addresses detection of early lesions and peripheral lung lesions, issues of false enlargement, false shrinkage, and mutilation of lesions. △ Less

Submitted 24 May, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

arXiv:2305.10156 [pdf, other]

Personality Understanding of Fictional Characters during Book Reading

Authors: Mo Yu, Jiangnan Li, Shunyu Yao, Wenjie Pang, Xiaochen Zhou, Zhou Xiao, Fandong Meng, Jie Zhou

Abstract: Comprehending characters' personalities is a crucial aspect of story reading. As readers engage with a story, their understanding of a character evolves based on new events and information; and multiple fine-grained aspects of personalities can be perceived. This leads to a natural problem of situated and fine-grained personality understanding. The problem has not been studied in the NLP field, pr… ▽ More Comprehending characters' personalities is a crucial aspect of story reading. As readers engage with a story, their understanding of a character evolves based on new events and information; and multiple fine-grained aspects of personalities can be perceived. This leads to a natural problem of situated and fine-grained personality understanding. The problem has not been studied in the NLP field, primarily due to the lack of appropriate datasets mimicking the process of book reading. We present the first labeled dataset PersoNet for this problem. Our novel annotation strategy involves annotating user notes from online reading apps as a proxy for the original books. Experiments and human studies indicate that our dataset construction is both efficient and accurate; and our task heavily relies on long-term context to achieve accurate predictions for both machines and humans. The dataset is available at https://github.com/Gorov/personet_acl23. △ Less

Submitted 29 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

Comments: Accepted at ACL 2023

arXiv:2305.07152 [pdf, other]

Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge

Authors: Aneeq Zia, Kiran Bhattacharyya, Xi Liu, Max Berniker, Ziheng Wang, Rogerio Nespolo, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Bo Liu, David Austin, Yiheng Wang, Michal Futrega, Jean-Francois Puget, Zhenqiang Li, Yoichi Sato, Ryo Fujii, Ryo Hachiuma, Mana Masuda, Hideo Saito, An Wang, Mengya Xu, Mobarakol Islam, Long Bai, Winnie Pang , et al. (46 additional authors not shown)

Abstract: The ability to automatically detect and track surgical instruments in endoscopic videos can enable transformational interventions. Assessing surgical performance and efficiency, identifying skilled tool use and choreography, and planning operational and logistical aspects of OR resources are just a few of the applications that could benefit. Unfortunately, obtaining the annotations needed to train… ▽ More The ability to automatically detect and track surgical instruments in endoscopic videos can enable transformational interventions. Assessing surgical performance and efficiency, identifying skilled tool use and choreography, and planning operational and logistical aspects of OR resources are just a few of the applications that could benefit. Unfortunately, obtaining the annotations needed to train machine learning models to identify and localize surgical tools is a difficult task. Annotating bounding boxes frame-by-frame is tedious and time-consuming, yet large amounts of data with a wide variety of surgical tools and surgeries must be captured for robust training. Moreover, ongoing annotator training is needed to stay up to date with surgical instrument innovation. In robotic-assisted surgery, however, potentially informative data like timestamps of instrument installation and removal can be programmatically harvested. The ability to rely on tool installation data alone would significantly reduce the workload to train robust tool-tracking models. With this motivation in mind we invited the surgical data science community to participate in the challenge, SurgToolLoc 2022. The goal was to leverage tool presence data as weak labels for machine learning models trained to detect tools and localize them in video frames with bounding boxes. We present the results of this challenge along with many of the team's efforts. We conclude by discussing these results in the broader context of machine learning and surgical data science. The training data used for this challenge consisting of 24,695 video clips with tool presence labels is also being released publicly and can be accessed at https://console.cloud.google.com/storage/browser/isi-surgtoolloc-2022. △ Less

Submitted 31 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

arXiv:2301.13082 [pdf, other]

PaCaNet: A Study on CycleGAN with Transfer Learning for Diversifying Fused Chinese Painting and Calligraphy

Authors: Zuhao Yang, Huajun Bai, Zhang Luo, Yang Xu, Wei Pang, Yue Wang, Yisheng Yuan, Yingfang Yuan

Abstract: AI-Generated Content (AIGC) has recently gained a surge in popularity, powered by its high efficiency and consistency in production, and its capability of being customized and diversified. The cross-modality nature of the representation learning mechanism in most AIGC technology allows for more freedom and flexibility in exploring new types of art that would be impossible in the past. Inspired by… ▽ More AI-Generated Content (AIGC) has recently gained a surge in popularity, powered by its high efficiency and consistency in production, and its capability of being customized and diversified. The cross-modality nature of the representation learning mechanism in most AIGC technology allows for more freedom and flexibility in exploring new types of art that would be impossible in the past. Inspired by the pictogram subset of Chinese characters, we proposed PaCaNet, a CycleGAN-based pipeline for producing novel artworks that fuse two different art types, traditional Chinese painting and calligraphy. In an effort to produce stable and diversified output, we adopted three main technical innovations: 1. Using one-shot learning to increase the creativity of pre-trained models and diversify the content of the fused images. 2. Controlling the preference over generated Chinese calligraphy by freezing randomly sampled parameters in pre-trained models. 3. Using a regularization method to encourage the models to produce images similar to Chinese paintings. Furthermore, we conducted a systematic study to explore the performance of PaCaNet in diversifying fused Chinese painting and calligraphy, which showed satisfying results. In conclusion, we provide a new direction of creating arts by fusing the visual information in paintings and the stroke features in Chinese calligraphy. Our approach creates a unique aesthetic experience rooted in the origination of Chinese hieroglyph characters. It is also a unique opportunity to delve deeper into traditional artwork and, in doing so, to create a meaningful impact on preserving and revitalizing traditional heritage. △ Less

Submitted 21 May, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

arXiv:2212.08568 [pdf, other]

Biomedical image analysis competitions: The state of current participation practice

Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano , et al. (331 additional authors not shown)

Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,… ▽ More The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps. △ Less

Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

arXiv:2211.14553 [pdf]

doi 10.14445/22315381/IJETT-V70I11P208

A Remote Baby Surveillance System with RFID and GPS Tracking

Authors: Ruven A/L Sundarajoo, Gwo Chin Chung, Wai Leong Pang, Soo Fun Tan

Abstract: In the 21st century, sending babies or children to daycare centres has become more and more common among young guardians. The balance between full-time work and child care is increasingly challenging nowadays. In Malaysia, thousands of child abuse cases have been reported from babysitting centres every year, which indeed triggers the anxiety and stress of the guardians. Hence, this paper proposes… ▽ More In the 21st century, sending babies or children to daycare centres has become more and more common among young guardians. The balance between full-time work and child care is increasingly challenging nowadays. In Malaysia, thousands of child abuse cases have been reported from babysitting centres every year, which indeed triggers the anxiety and stress of the guardians. Hence, this paper proposes to construct a remote baby surveillance system with radio-frequency identification (RFID) and global positioning system (GPS) tracking. With the incorporation of the Internet of Things (IoT), a sensor-based microcontroller is used to detect the conditions of the baby as well as the surrounding environment and then display the real-time data as well as notifications to alert the guardians via a mobile application. These conditions include the crying and waking of the baby, as well as temperature, the mattress's wetness, and moving objects around the baby. In addition, RFID and GPS location tracking are implemented to ensure the safety of the baby, while white noise is used to increase the comfort of the baby. In the end, a prototype has been successfully developed for functionality and reliability testing. Several experiments have been conducted to measure the efficiency of the mattress's wetness detection, the RFID transmission range, the frequency spectrum of white noise, and also the output power of the solar panel. The proposed system is expected to assist guardians in ensuring the safety and comfort of their babies remotely, as well as prevent any occurrence of child abuse. △ Less

Submitted 26 November, 2022; originally announced November 2022.

Comments: 12 pages, 13 figures Published with International Journal of Engineering Trends and Technology (IJETT)

Journal ref: International Journal of Engineering Trends and Technology, vol. 70, no. 11, pp. 81-92, 2022

arXiv:2207.08930 [pdf, other]

Cooperative Infrastructure Perception

Authors: Fawad Ahmad, Christina Suyong Shin, Weiwu Pang, Branden Leong, Pradipta Ghosh, Ramesh Govindan

Abstract: Recent works have considered two qualitatively different approaches to overcome line-of-sight limitations of 3D sensors used for perception: cooperative perception and infrastructure-augmented perception. In this paper, motivated by increasing deployments of infrastructure LiDARs, we explore a third approach, cooperative infrastructure perception. This approach generates perception outputs by fusi… ▽ More Recent works have considered two qualitatively different approaches to overcome line-of-sight limitations of 3D sensors used for perception: cooperative perception and infrastructure-augmented perception. In this paper, motivated by increasing deployments of infrastructure LiDARs, we explore a third approach, cooperative infrastructure perception. This approach generates perception outputs by fusing outputs of multiple infrastructure sensors, but, to be useful, must do so quickly and accurately. We describe the design, implementation and evaluation of Cooperative Infrastructure Perception (CIP), which uses a combination of novel algorithms and systems optimizations. It produces perception outputs within 100 ms using modest computing resources and with accuracy comparable to the state-of-the-art. CIP, when used to augment vehicle perception, can improve safety. When used in conjunction with offloaded planning, CIP can increase traffic throughput at intersections. △ Less

Submitted 26 June, 2024; v1 submitted 18 July, 2022; originally announced July 2022.

arXiv:2204.04746 [pdf, other]

doi 10.1016/j.media.2023.102803

CholecTriplet2021: A benchmark challenge for surgical action triplet recognition

Authors: Chinedu Innocent Nwoye, Deepak Alapatt, Tong Yu, Armine Vardazaryan, Fangfang Xia, Zixuan Zhao, Tong Xia, Fucang Jia, Yuxuan Yang, Hao Wang, Derong Yu, Guoyan Zheng, Xiaotian Duan, Neil Getty, Ricardo Sanchez-Matilla, Maria Robu, Li Zhang, Huabin Chen, Jiacheng Wang, Liansheng Wang, Bokai Zhang, Beerend Gerats, Sista Raviteja, Rachana Sathish, Rong Tao , et al. (37 additional authors not shown)

Abstract: Context-aware decision support in the operating room can foster surgical safety and efficiency by leveraging real-time feedback from surgical workflow analysis. Most existing works recognize surgical activities at a coarse-grained level, such as phases, steps or events, leaving out fine-grained interaction details about the surgical activity; yet those are needed for more helpful AI assistance in… ▽ More Context-aware decision support in the operating room can foster surgical safety and efficiency by leveraging real-time feedback from surgical workflow analysis. Most existing works recognize surgical activities at a coarse-grained level, such as phases, steps or events, leaving out fine-grained interaction details about the surgical activity; yet those are needed for more helpful AI assistance in the operating room. Recognizing surgical actions as triplets of <instrument, verb, target> combination delivers comprehensive details about the activities taking place in surgical videos. This paper presents CholecTriplet2021: an endoscopic vision challenge organized at MICCAI 2021 for the recognition of surgical action triplets in laparoscopic videos. The challenge granted private access to the large-scale CholecT50 dataset, which is annotated with action triplet information. In this paper, we present the challenge setup and assessment of the state-of-the-art deep learning methods proposed by the participants during the challenge. A total of 4 baseline methods from the challenge organizers and 19 new deep learning algorithms by competing teams are presented to recognize surgical action triplets directly from surgical videos, achieving mean average precision (mAP) ranging from 4.2% to 38.1%. This study also analyzes the significance of the results obtained by the presented approaches, performs a thorough methodological comparison between them, in-depth result analysis, and proposes a novel ensemble method for enhanced recognition. Our analysis shows that surgical workflow analysis is not yet solved, and also highlights interesting directions for future research on fine-grained surgical activity recognition which is of utmost importance for the development of AI in surgery. △ Less

Submitted 29 December, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

Comments: CholecTriplet2021 challenge report. Paper accepted at Elsevier journal of Medical Image Analysis. 22 pages, 8 figures, 11 tables. Challenge website: https://cholectriplet2021.grand-challenge.org

Journal ref: Medical Image Analysis 86 (2023) 102803

arXiv:2203.12371 [pdf]

High Phonon Scattering Rates Suppress Thermal Conductivity in Hyperstoichiometric Uranium Dioxide

Authors: Hao Ma, Matthew S. Bryan, Judy W. L. Pang, Douglas L. Abernathy, Daniel J. Antonio, Krzysztof Gofryk, Michael E. Manley

Abstract: Uranium dioxide (UO$_2$), one of the most important nuclear fuels, can accumulate excess oxygen atoms as interstitial defects, which significantly impacts thermal properties. In this study, thermal conductivities and inelastic neutron scattering measurements on UO$_2$ and UO$_{2+x}$ (x=0.3, 0.4, 0.8, 0.11) were performed at low temperatures (2-300 K). The thermal conductivity of UO$_{2+x}$ is sign… ▽ More Uranium dioxide (UO$_2$), one of the most important nuclear fuels, can accumulate excess oxygen atoms as interstitial defects, which significantly impacts thermal properties. In this study, thermal conductivities and inelastic neutron scattering measurements on UO$_2$ and UO$_{2+x}$ (x=0.3, 0.4, 0.8, 0.11) were performed at low temperatures (2-300 K). The thermal conductivity of UO$_{2+x}$ is significantly suppressed compared to UO$_2$ except near the Néel temperature TN= 30.8 K, where it is independent of x. Phonon measurements demonstrate that the heat capacities and phonon group velocities of UO$_2$ and UO$_{2+x}$ are similar and that the suppressed thermal conductivity in UO$_{2+x}$ results from high phonon scattering rates. These new insights advance our fundamental understanding of thermal transport properties in advanced nuclear fuels. △ Less

Submitted 23 March, 2022; originally announced March 2022.

arXiv:2107.01879 [pdf, other]

doi 10.1142/S0217732322300038

Cusp in the Symmetry Energy, Speed of Sound in Neutron Stars and Emergent Pseudo-Conformal Symmetry

Authors: Hyun Kyu Lee, Yong-Liang Ma, Won-Gi Paeng, Mannque Rho

Abstract: We review how the "cusp" predicted in the nuclear symmetry energy generated by a topology change at density $n_{1/2}\gsim 2 n_0$ can have a surprising consequence, so far unrecognized in nuclear physics and astrophysics communities, on the structure of dense compact-star matter. The topology change, when translated into nuclear EFT with "effective" QCD degrees of freedom in terms of hidden local a… ▽ More We review how the "cusp" predicted in the nuclear symmetry energy generated by a topology change at density $n_{1/2}\gsim 2 n_0$ can have a surprising consequence, so far unrecognized in nuclear physics and astrophysics communities, on the structure of dense compact-star matter. The topology change, when translated into nuclear EFT with "effective" QCD degrees of freedom in terms of hidden local and scale symmetries duly taken into account, predicts an EoS that is soft below and stiff above $n\gsim n_{1/2}$, involving no low-order phase transitions, and yields the macrophysical properties of neutron stars consistent -- so far with no tension -- with the astrophysical observations, including the maximum mass $ 2.0\lsim M/ M_\odot\lsim 2.2$ as well as the GW data. Furthermore it describes the interior core of the massive stars populated by baryon-charge-fractionalized quasi-fermions that are neither baryonic nor quarkonic. It is argued that the cusp "buried" in the symmetry energy resulting from strong correlations with hidden heavy degrees of freedom leads, at $n\gsim n_{1/2}$, to what we dubbed "pseudo-conformal" sound speed, $v^2_{pcs}/c^2\approx 1/3$, precociously converged from below at $n_{1/2}$. It is not strictly conformal since the trace of energy-momentum tensor is not zero even in the chiral limit. This observation with the topology change identified with the putative hadron-quark continuity, taking place at at density $\gsim 2 n_0$, implies that the quantities accurately measured at $\sim n_0$ cannot give a stringent constraint for what takes place at the core density of compact stars $\sim (3-7) n_0$. This is because the change of degrees of freedom in effective field theory is involved. We discuss the implication of this on the recent PREX-II "dilemma" in the measured skin thickness of $^{208}$Pb. △ Less

Submitted 29 January, 2022; v1 submitted 5 July, 2021; originally announced July 2021.

Comments: Invited review with title change in Modern Physics Letters A

arXiv:2104.06046 [pdf, other]

Which Hyperparameters to Optimise? An Investigation of Evolutionary Hyperparameter Optimisation in Graph Neural Network For Molecular Property Prediction

Authors: Yingfang Yuan, Wenjun Wang, Wei Pang

Abstract: Recently, the study of graph neural network (GNN) has attracted much attention and achieved promising performance in molecular property prediction. Most GNNs for molecular property prediction are proposed based on the idea of learning the representations for the nodes by aggregating the information of their neighbor nodes (e.g. atoms). Then, the representations can be passed to subsequent layers t… ▽ More Recently, the study of graph neural network (GNN) has attracted much attention and achieved promising performance in molecular property prediction. Most GNNs for molecular property prediction are proposed based on the idea of learning the representations for the nodes by aggregating the information of their neighbor nodes (e.g. atoms). Then, the representations can be passed to subsequent layers to deal with individual downstream tasks. Therefore, the architectures of GNNs can be considered as being composed of two core parts: graph-related layers and task-specific layers. Facing real-world molecular problems, the hyperparameter optimization for those layers are vital. Hyperparameter optimization (HPO) becomes expensive in this situation because evaluating candidate solutions requires massive computational resources to train and validate models. Furthermore, a larger search space often makes the HPO problems more challenging. In this research, we focus on the impact of selecting two types of GNN hyperparameters, those belonging to graph-related layers and those of task-specific layers, on the performance of GNN for molecular property prediction. In our experiments. we employed a state-of-the-art evolutionary algorithm (i.e., CMA-ES) for HPO. The results reveal that optimizing the two types of hyperparameters separately can gain the improvements on GNNs' performance, but optimising both types of hyperparameters simultaneously will lead to predominant improvements. Meanwhile, our study also further confirms the importance of HPO for GNNs in molecular property prediction problems. △ Less

Submitted 14 April, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

arXiv:2103.00172 [pdf, other]

A Survey on Physarum Polycephalum Intelligent Foraging Behaviour and Bio-Inspired Applications

Authors: Abubakr Awad, Wei Pang, David Lusseau, George M. Coghill

Abstract: In recent years, research on Physarum polycephalum has become more popular after Nakagaki et al. (2000) performed their famous experiment showing that Physarum was able to find the shortest route through a maze. Subsequent researches have confirmed the ability of Physarum-inspired algorithms to solve a wide range of NP-hard problems. In contrast to previous reviews that either focus on biological… ▽ More In recent years, research on Physarum polycephalum has become more popular after Nakagaki et al. (2000) performed their famous experiment showing that Physarum was able to find the shortest route through a maze. Subsequent researches have confirmed the ability of Physarum-inspired algorithms to solve a wide range of NP-hard problems. In contrast to previous reviews that either focus on biological aspects or bio-inspired applications, here we present a comprehensive review that highlights recent Physarum polycephalum biological aspects, mathematical models, and Physarum bio-inspired algorithms and their applications. The novelty of this review stems from our exploration of Physarum intelligent behaviour in competition settings. Further, we have presented our new model to simulate Physarum in competition, where multiple Physarum interact with each other and with their environments. The bio-inspired Physarum in competition algorithms proved to have great potentials for future research. △ Less

Submitted 8 May, 2021; v1 submitted 27 February, 2021; originally announced March 2021.

Comments: arXiv admin note: text overlap with arXiv:1712.02910 by other authors

ACM Class: I.2.8; I.6.5

arXiv:2102.11995 [pdf, other]

A Genetic Algorithm with Tree-structured Mutation for Hyperparameter Optimisation of Graph Neural Networks

Authors: Yingfang Yuan, Wenjun Wang, Wei Pang

Abstract: In recent years, graph neural networks (GNNs) have gained increasing attention, as they possess the excellent capability of processing graph-related problems. In practice, hyperparameter optimisation (HPO) is critical for GNNs to achieve satisfactory results, but this process is costly because the evaluations of different hyperparameter settings require excessively training many GNNs. Many approac… ▽ More In recent years, graph neural networks (GNNs) have gained increasing attention, as they possess the excellent capability of processing graph-related problems. In practice, hyperparameter optimisation (HPO) is critical for GNNs to achieve satisfactory results, but this process is costly because the evaluations of different hyperparameter settings require excessively training many GNNs. Many approaches have been proposed for HPO, which aims to identify promising hyperparameters efficiently. In particular, the genetic algorithm (GA) for HPO has been explored, which treats GNNs as a black-box model, of which only the outputs can be observed given a set of hyperparameters. However, because GNN models are sophisticated and the evaluations of hyperparameters on GNNs are expensive, GA requires advanced techniques to balance the exploration and exploitation of the search and make the optimisation more effective given limited computational resources. Therefore, we proposed a tree-structured mutation strategy for GA to alleviate this issue. Meanwhile, we reviewed the recent HPO works, which gives room for the idea of tree-structure to develop, and we hope our approach can further improve these HPO methods in the future. △ Less

Submitted 28 April, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

arXiv:2102.10042 [pdf, other]

Impact of asymptomatic COVID-19 carriers on pandemic policy outcomes

Authors: Weijie Pang, Hassan Chehaitli, T. R. Hurd

Abstract: This paper provides a mathematical model to show that the incorrect estimation of r, the fraction of asymptomatic COVID-19 carriers in the general population, can account for much of the world's failure to contain the pandemic in its early phases. The SE(A+O)R model with infectives separated into asymptomatic and ordinary carriers, supplemented by a model of the data generation process, is calibra… ▽ More This paper provides a mathematical model to show that the incorrect estimation of r, the fraction of asymptomatic COVID-19 carriers in the general population, can account for much of the world's failure to contain the pandemic in its early phases. The SE(A+O)R model with infectives separated into asymptomatic and ordinary carriers, supplemented by a model of the data generation process, is calibrated to standard datasets for several countries. It is shown that certain fundamental parameters, notably r, are unidentifiable with this data. A number of potential types of policy intervention are analyzed. It is found that the lack of parameter identifiability implies that only some, but not all, potential policy interventions can be correctly predicted. In an example representing Italy in March 2020, a hypothetical optimal policy of isolating confirmed cases that aims to reduce the basic reproduction number of the outbreak to R0 = 0.8 assuming r = 10%, only achieves R0 = 1.4 if it turns out that r = 40%. △ Less

Submitted 16 February, 2021; originally announced February 2021.

Comments: 15 pages, 10 figures

arXiv:2102.04283 [pdf, ps, other]

doi 10.1145/3449639.3459370

A Systematic Comparison Study on Hyperparameter Optimisation of Graph Neural Networks for Molecular Property Prediction

Authors: Yingfang Yuan, Wenjun Wang, Wei Pang

Abstract: Graph neural networks (GNNs) have been proposed for a wide range of graph-related learning tasks. In particular, in recent years, an increasing number of GNN systems were applied to predict molecular properties. However, a direct impediment is to select appropriate hyperparameters to achieve satisfactory performance with lower computational cost. Meanwhile, many molecular datasets are far smaller… ▽ More Graph neural networks (GNNs) have been proposed for a wide range of graph-related learning tasks. In particular, in recent years, an increasing number of GNN systems were applied to predict molecular properties. However, a direct impediment is to select appropriate hyperparameters to achieve satisfactory performance with lower computational cost. Meanwhile, many molecular datasets are far smaller than many other datasets in typical deep learning applications. Most hyperparameter optimization (HPO) methods have not been explored in terms of their efficiencies on such small datasets in the molecular domain. In this paper, we conducted a theoretical analysis of common and specific features for two state-of-the-art and popular algorithms for HPO: TPE and CMA-ES, and we compared them with random search (RS), which is used as a baseline. Experimental studies are carried out on several benchmarks in MoleculeNet, from different perspectives to investigate the impact of RS, TPE, and CMA-ES on HPO of GNNs for molecular property prediction. In our experiments, we concluded that RS, TPE, and CMA-ES have their individual advantages in tackling different specific molecular problems. Finally, we believe our work will motivate further research on GNN as applied to molecular machine learning problems in chemistry and materials sciences. △ Less

Submitted 21 April, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

arXiv:2101.09300 [pdf, other]

A Novel Genetic Algorithm with Hierarchical Evaluation Strategy for Hyperparameter Optimisation of Graph Neural Networks

Authors: Yingfang Yuan, Wenjun Wang, George M. Coghill, Wei Pang

Abstract: Graph representation of structured data can facilitate the extraction of stereoscopic features, and it has demonstrated excellent ability when working with deep learning systems, the so-called Graph Neural Networks (GNNs). Choosing a promising architecture for constructing GNNs can be transferred to a hyperparameter optimisation problem, a very challenging task due to the size of the underlying se… ▽ More Graph representation of structured data can facilitate the extraction of stereoscopic features, and it has demonstrated excellent ability when working with deep learning systems, the so-called Graph Neural Networks (GNNs). Choosing a promising architecture for constructing GNNs can be transferred to a hyperparameter optimisation problem, a very challenging task due to the size of the underlying search space and high computational cost for evaluating candidate GNNs. To address this issue, this research presents a novel genetic algorithm with a hierarchical evaluation strategy (HESGA), which combines the full evaluation of GNNs with a fast evaluation approach. By using full evaluation, a GNN is represented by a set of hyperparameter values and trained on a specified dataset, and root mean square error (RMSE) will be used to measure the quality of the GNN represented by the set of hyperparameter values (for regression problems). While in the proposed fast evaluation process, the training will be interrupted at an early stage, the difference of RMSE values between the starting and interrupted epochs will be used as a fast score, which implies the potential of the GNN being considered. To coordinate both types of evaluations, the proposed hierarchical strategy uses the fast evaluation in a lower level for recommending candidates to a higher level, where the full evaluation will act as a final assessor to maintain a group of elite individuals. To validate the effectiveness of HESGA, we apply it to optimise two types of deep graph neural networks. The experimental results on three benchmark datasets demonstrate its advantages compared to Bayesian hyperparameter optimization. △ Less

Submitted 26 January, 2021; v1 submitted 22 January, 2021; originally announced January 2021.

arXiv:2101.04680 [pdf, other]

Thin Lenses and Thin Cameras

Authors: Wubin Pang, David J. Brady

Abstract: Cassegrain designs can be used to build thin lenses. We analyze the relationships between system thickness and aperture sizes of the two mirrors as well as FoV size. Our analysis shows that decrease in lens thickness imposes tight constraint on the aperture and FoV size. To mitigate this limitation, we propose to fill the gaps between the primary and the secondary with high index material. The Gas… ▽ More Cassegrain designs can be used to build thin lenses. We analyze the relationships between system thickness and aperture sizes of the two mirrors as well as FoV size. Our analysis shows that decrease in lens thickness imposes tight constraint on the aperture and FoV size. To mitigate this limitation, we propose to fill the gaps between the primary and the secondary with high index material. The Gassegrain optics cuts the track length into half and high index material reduces ray angle and height, consequently the incident ray angle can be increased, i.e., the FoV angle is extended. Defining telephoto ratio as the ratio of lens thickness to focal length, we achieve telephoto ratios as small as 0.43 for a visible Cassegrain thin lens and 1.20 for an infrared Cassegrain thin lens. To achieve an arbitrary FoV coverage, we present an strategy by integrating multiple thin lenses on one plane with each unit covering a different FoV region. To avoid physically tilting each unit, we propose beam steering with metasurface. By image stitching, we obtain wide FoV images. △ Less

Submitted 12 January, 2021; originally announced January 2021.

arXiv:2011.03543 [pdf, other]

XVA Valuation under Market Illiquidity

Authors: Weijie Pang, Stephan Sturm

Abstract: Before the 2008 financial crisis, most research in financial mathematics focused on pricing options without considering the effects of counterparties' defaults, illiquidity problems, and the role of the sale and repurchase agreement (Repo) market. Recently, models were proposed to address this by computing a total valuation adjustment (XVA) of derivatives; however without considering a potential c… ▽ More Before the 2008 financial crisis, most research in financial mathematics focused on pricing options without considering the effects of counterparties' defaults, illiquidity problems, and the role of the sale and repurchase agreement (Repo) market. Recently, models were proposed to address this by computing a total valuation adjustment (XVA) of derivatives; however without considering a potential crisis in the market. In this article, we include a possible crisis by using an alternating renewal process to describe the switching between a normal financial regime and a financial crisis. We develop a framework to price the XVA of a European claim in this state-dependent situation. The price is characterized as a solution to a backward stochastic differential equation (BSDE), and we prove the existence and uniqueness of this solution. In a numerical study based on a deep learning algorithm for BSDEs, we compare the effect of different parameters on the valuation of the XVA. △ Less

Submitted 6 November, 2020; originally announced November 2020.

Comments: 49 pages

arXiv:2009.05226 [pdf, other]

Extending Label Smoothing Regularization with Self-Knowledge Distillation

Authors: Ji-Yue Wang, Pei Zhang, Wen-feng Pang, Jie Li

Abstract: Inspired by the strong correlation between the Label Smoothing Regularization(LSR) and Knowledge distillation(KD), we propose an algorithm LsrKD for training boost by extending the LSR method to the KD regime and applying a softer temperature. Then we improve the LsrKD by a Teacher Correction(TC) method, which manually sets a constant larger proportion for the right class in the uniform distributi… ▽ More Inspired by the strong correlation between the Label Smoothing Regularization(LSR) and Knowledge distillation(KD), we propose an algorithm LsrKD for training boost by extending the LSR method to the KD regime and applying a softer temperature. Then we improve the LsrKD by a Teacher Correction(TC) method, which manually sets a constant larger proportion for the right class in the uniform distribution teacher. To further improve the performance of LsrKD, we develop a self-distillation method named Memory-replay Knowledge Distillation (MrKD) that provides a knowledgeable teacher to replace the uniform distribution one in LsrKD. The MrKD method penalizes the KD loss between the current model's output distributions and its copies' on the training trajectory. By preventing the model learning so far from its historical output distribution space, MrKD can stabilize the learning and find a more robust minimum. Our experiments show that LsrKD can improve LSR performance consistently at no cost, especially on several deep neural networks where LSR is ineffectual. Also, MrKD can significantly improve single model training. The experiment results confirm that the TC can help LsrKD and MrKD to boost training, especially on the networks they are failed. Overall, LsrKD, MrKD, and their TC variants are comparable to or outperform the LSR method, suggesting the broad applicability of these KD methods. △ Less

Submitted 11 September, 2020; originally announced September 2020.

arXiv:2009.01061 [pdf, ps, other]

A new form of liquid matter: quantum droplets

Authors: Zhihuan Luo, Wei Pang, Bin Liu, Yongyao Li, Boris A. Malomed

Abstract: This brief review summarizes recent theoretical and experimental results which predict and establish the existence of quantum droplets (QDs), i.e., robust two- and three-dimensional (2D and 3D) self-trapped states in Bose-Einstein condensates (BECs), which are stabilized by effective selffirepulsion induced by quantum fluctuations around the mean-field (MF) states [alias the Lee-Huang--Yang (LHY)… ▽ More This brief review summarizes recent theoretical and experimental results which predict and establish the existence of quantum droplets (QDs), i.e., robust two- and three-dimensional (2D and 3D) self-trapped states in Bose-Einstein condensates (BECs), which are stabilized by effective selffirepulsion induced by quantum fluctuations around the mean-field (MF) states [alias the Lee-Huang--Yang (LHY) effect]. The basic models are presented, taking special care of the dimension crossover, 2D -> 3D. Recently reported experimental results, which exhibit stable 3D and quasi-2D QDs in binary BECs, with the inter-component attraction slightly exceeding the MF self-repulsion in each component, and in single-component condensates of atoms carrying permanent magnetic moments, are presented in some detail. The summary of theoretical results is focused, chiefly, on 3D and quasi-2D QDs with embedded vorticity, as the possibility to stabilize such states is a remarkable prediction. Stable vortex states are presented both for QDs in free space, and for singular but physically relevant 2D modes pulled to the center by the inverse-square potential, with the quantum collapse suppressed by the LHY effect. △ Less

Submitted 20 October, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

Comments: A brief review article to be published in Frontiers in Physics

arXiv:2004.06311 [pdf, other]

Public Health Policy: COVID-19 Epidemic and SEIR Model with Asymptomatic Viral Carriers

Authors: Weijie Pang

Abstract: We measure the effect of different public health regulations to the spread of COVID-19, based on a SEIRA model -- a SEIR model including asymptomatic transmissions. The cumulative confirmed cases and death show nonlinear positive relationship with the value of asymptomatic rate. Based on this model, we analyze the inhibit effects to COVID-19 of three types of public health policies, i.e. isolation… ▽ More We measure the effect of different public health regulations to the spread of COVID-19, based on a SEIRA model -- a SEIR model including asymptomatic transmissions. The cumulative confirmed cases and death show nonlinear positive relationship with the value of asymptomatic rate. Based on this model, we analyze the inhibit effects to COVID-19 of three types of public health policies, i.e. isolation of laboratory confirmed cases, general personal protection and quarantine (lock-down). The simulations conclude that the isolation display limited effects to the asymptomatic viral carriers. The general personal protection and quarantine perform similar effects when the their percentages of participants are same. When the total proportion of asymptomatic, mild symptomatic and neglected patients is 40%, only depends on isolation policy may lead to an additional 75% infections, compared with general personal protection or quarantine with an efficiency 80%. At end, we provide seven recommendations of public health intervention before and during an aerial transmitted epidemic (COVID-19). △ Less

Submitted 14 April, 2020; originally announced April 2020.

Comments: 17 pages, 10 figures

arXiv:2003.07466 [pdf]

doi 10.1103/PhysRevB.102.014504

Shapes of rotating normal fluid 3He versus superfluid 4He droplets in molecular beams

Authors: Deepak Verma, Sean M. O. O Connell, Alexandra J. Feinberg, Swetha Erukala, Rico M. Tanyag, Charles Bernando, Weiwu Pang, Catherine A. Saladrigas, Benjamin W. Toulson, Mario Borgwardt, Niranjan Shivaram, Ming-Fu Lin, Andre Al Haddad, Wolfgang Jäger, Christoph Bostedt, Peter Walter, Oliver Gessner, Andrey F. Vilesov

Abstract: Previous single-pulse extreme ultraviolet and X-ray coherent diffraction studies revealed that superfluid 4He droplets obtained in free jet expansion acquire sizable angular momentum, resulting in significant centrifugal distortion. Similar experiments with normal fluid 3He droplets may help elucidating the origin of the of the large degree of rotational excitation and highlight similarities and d… ▽ More Previous single-pulse extreme ultraviolet and X-ray coherent diffraction studies revealed that superfluid 4He droplets obtained in free jet expansion acquire sizable angular momentum, resulting in significant centrifugal distortion. Similar experiments with normal fluid 3He droplets may help elucidating the origin of the of the large degree of rotational excitation and highlight similarities and differences of dynamics in normal and superfluid droplets. Here, we present the first comparison of the shapes of isolated 3He and 4He droplets following expansion of the corresponding fluids in vacuum at temperatures as low as ~ 2 K. Large 3He and 4He droplets with average radii of ~160 nm and ~350 nm, respectively, were produced. We find that the majority of the 3He droplets in the beam correspond to rotating oblate spheroids with reduced average angular momentum ($Λ$) and reduced angular velocities ($Ω$) similar to that of 4He droplets. Given the different physical nature of 3He and 4He, this similarity in $Λ$ and $Ω$ may be surprising and suggest that similar mechanisms induce rotation regardless of the isotope. We hypothesized that the observed distribution of droplet sizes and angular momenta stem from processes in the dense region close to the nozzle. In this region, the significant velocity spread and collisions between the droplets induce excessive rotation followed by droplet fission. The process may repeat itself several times before the droplets enter the collision-fee high vacuum region further downstream. △ Less

Submitted 16 March, 2020; originally announced March 2020.

Comments: 29 pages, 6 figures

Journal ref: Phys. Rev. B 102, 014504 (2020)

arXiv:2002.12704 [pdf, other]

ImmuNetNAS: An Immune-network approach for searching Convolutional Neural Network Architectures

Authors: Kefan Chen, Wei Pang

Abstract: In this research, we propose ImmuNetNAS, a novel Neural Architecture Search (NAS) approach inspired by the immune network theory. The core of ImmuNetNAS is built on the original immune network algorithm, which iteratively updates the population through hypermutation and selection, and eliminates the self-generation individuals that do not meet the requirements through comparing antibody affinity a… ▽ More In this research, we propose ImmuNetNAS, a novel Neural Architecture Search (NAS) approach inspired by the immune network theory. The core of ImmuNetNAS is built on the original immune network algorithm, which iteratively updates the population through hypermutation and selection, and eliminates the self-generation individuals that do not meet the requirements through comparing antibody affinity and inter-specific similarity. In addition, in order to facilitate the mutation operation, we propose a novel two-component based neural structure coding strategy. Furthermore, an improved mutation strategy based on Standard Genetic Algorithm (SGA) was proposed according to this encoding method. Finally, based on the proposed two-component based coding method, a new antibody affinity calculation method was developed to screen suitable neural architectures. Systematic evaluations demonstrate that our system has achieved good performance on both the MNIST and CIFAR-10 datasets. We open-source our code on GitHub in order to share it with other deep learning researchers and practitioners. △ Less

Submitted 28 February, 2020; originally announced February 2020.

Comments: 7 pages, 7 figures, 5 tables. No conference right now

arXiv:2002.10340 [pdf, other]

Guessing State Tracking for Visual Dialogue

Authors: Wei Pang, Xiaojie Wang

Abstract: The Guesser is a task of visual grounding in GuessWhat?! like visual dialogue. It locates the target object in an image supposed by an Oracle oneself over a question-answer based dialogue between a Questioner and the Oracle. Most existing guessers make one and only one guess after receiving all question-answer pairs in a dialogue with the predefined number of rounds. This paper proposes a guessing… ▽ More The Guesser is a task of visual grounding in GuessWhat?! like visual dialogue. It locates the target object in an image supposed by an Oracle oneself over a question-answer based dialogue between a Questioner and the Oracle. Most existing guessers make one and only one guess after receiving all question-answer pairs in a dialogue with the predefined number of rounds. This paper proposes a guessing state for the Guesser, and regards guess as a process with change of guessing state through a dialogue. A guessing state tracking based guess model is therefore proposed. The guessing state is defined as a distribution on objects in the image. With that in hand, two loss functions are defined as supervisions for model training. Early supervision brings supervision to Guesser at early rounds, and incremental supervision brings monotonicity to the guessing state. Experimental results on GuessWhat?! dataset show that our model significantly outperforms previous models, achieves new state-of-the-art, especially the success rate of guessing 83.3% is approaching the human-level accuracy of 84.4%. △ Less

Submitted 18 July, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

Comments: Accepted at ECCV 2020. The paper is about how the Guesser in the GuessWhat?! game guess. More details can be found at https://github.com/xubuvd/guesswhat

arXiv:2001.10338 [pdf, ps, other]

Short Text Classification via Term Graph

Authors: Wei Pang

Abstract: Short text classi cation is a method for classifying short sentence with prede ned labels. However, short text is limited in shortness in text length that leads to a challenging problem of sparse features. Most of existing methods treat each short sentences as independently and identically distributed (IID), local context only in the sentence itself is focused and the relational information betwee… ▽ More Short text classi cation is a method for classifying short sentence with prede ned labels. However, short text is limited in shortness in text length that leads to a challenging problem of sparse features. Most of existing methods treat each short sentences as independently and identically distributed (IID), local context only in the sentence itself is focused and the relational information between sentences are lost. To overcome these limitations, we propose a PathWalk model that combine the strength of graph networks and short sentences to solve the sparseness of short text. Experimental results on four different available datasets show that our PathWalk method achieves the state-of-the-art results, demonstrating the efficiency and robustness of graph networks for short text classification. △ Less

Submitted 19 January, 2020; originally announced January 2020.

Comments: 9 pages, 15 figures, Short Text Classification, Term Graph

arXiv:1911.07928 [pdf, other]

Visual Dialogue State Tracking for Question Generation

Authors: Wei Pang, Xiaojie Wang

Abstract: GuessWhat?! is a visual dialogue task between a guesser and an oracle. The guesser aims to locate an object supposed by the oracle oneself in an image by asking a sequence of Yes/No questions. Asking proper questions with the progress of dialogue is vital for achieving successful final guess. As a result, the progress of dialogue should be properly represented and tracked. Previous models for ques… ▽ More GuessWhat?! is a visual dialogue task between a guesser and an oracle. The guesser aims to locate an object supposed by the oracle oneself in an image by asking a sequence of Yes/No questions. Asking proper questions with the progress of dialogue is vital for achieving successful final guess. As a result, the progress of dialogue should be properly represented and tracked. Previous models for question generation pay less attention on the representation and tracking of dialogue states, and therefore are prone to asking low quality questions such as repeated questions. This paper proposes visual dialogue state tracking (VDST) based method for question generation. A visual dialogue state is defined as the distribution on objects in the image as well as representations of objects. Representations of objects are updated with the change of the distribution on objects. An object-difference based attention is used to decode new question. The distribution on objects is updated by comparing the question-answer pair and objects. Experimental results on GuessWhat?! dataset show that our model significantly outperforms existing methods and achieves new state-of-the-art performance. It is also noticeable that our model reduces the rate of repeated questions from more than 50% to 21.9% compared with previous state-of-the-art methods. △ Less

Submitted 24 November, 2019; v1 submitted 12 November, 2019; originally announced November 2019.

Comments: 8 pages, 4 figures, Accept-Oral by AAAI-2020

arXiv:1911.07729 [pdf, other]

ImmuNeCS: Neural Committee Search by an Artificial Immune System

Authors: Luc Frachon, Wei Pang, George M. Coghill

Abstract: Current Neural Architecture Search techniques can suffer from a few shortcomings, including high computational cost, excessive bias from the search space, conceptual complexity or uncertain empirical benefits over random search. In this paper, we present ImmuNeCS, an attempt at addressing these issues with a method that offers a simple, flexible, and efficient way of building deep learning models… ▽ More Current Neural Architecture Search techniques can suffer from a few shortcomings, including high computational cost, excessive bias from the search space, conceptual complexity or uncertain empirical benefits over random search. In this paper, we present ImmuNeCS, an attempt at addressing these issues with a method that offers a simple, flexible, and efficient way of building deep learning models automatically, and we demonstrate its effectiveness in the context of convolutional neural networks. Instead of searching for the 1-best architecture for a given task, we focus on building a population of neural networks that are then ensembled into a neural network committee, an approach we dub 'Neural Committee Search'. To ensure sufficient performance from the committee, our search algorithm is based on an artificial immune system that balances individual performance with population diversity. This allows us to stop the search when accuracy starts to plateau, and to bridge the performance gap through ensembling. In order to justify our method, we first verify that the chosen search space exhibits the locality property. To further improve efficiency, we also combine partial evaluation, weight inheritance, and progressive search. First, experiments are run to verify the validity of these techniques. Then, preliminary experimental results on two popular computer vision benchmarks show that our method consistently outperforms random search and yields promising results within reasonable GPU budgets. An additional experiment also shows that ImmuNeCS's solutions transfer effectively to a more difficult task, where they achieve results comparable to a direct search on the new task. We believe these findings can open the way for new, accessible alternatives to traditional NAS. △ Less

Submitted 22 October, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

Comments: 16 pages including references, 6 figures, 3 tables, 2 algorithms

arXiv:1910.12926 [pdf]

doi 10.1103/PhysRevLett.124.215301

Angular momentum in rotating superfluid droplets

Authors: Sean M. O. OConnell, Rico Mayro P. Tanyag, Deepak Verma, Charles Bernando, Weiwu Pang, Camila Bacellar, Catherine A. Saladrigas, Johannes Mahl, Benjamin W. Toulson, Yoshiaki Kumagai, Peter Walter, Francesco Ancilotto, Manuel Barranco, Marti Pi, Christoph Bostedt, Oliver Gessner, Andrey F. Vilesov

Abstract: The angular momentum of rotating superfluid droplets originates from quantized vortices and capillary waves, the interplay between which remains to be uncovered. Here, the rotation of isolated sub-micrometer superfluid 4He droplets is studied by ultrafast x-ray diffraction using a free electron laser. The diffraction patterns provide simultaneous access to the morphology of the droplets and the vo… ▽ More The angular momentum of rotating superfluid droplets originates from quantized vortices and capillary waves, the interplay between which remains to be uncovered. Here, the rotation of isolated sub-micrometer superfluid 4He droplets is studied by ultrafast x-ray diffraction using a free electron laser. The diffraction patterns provide simultaneous access to the morphology of the droplets and the vortex arrays they host. In capsule-shaped droplets, vortices form a distorted triangular lattice, whereas they arrange along elliptical contours in ellipsoidal droplets. The combined action of vortices and capillary waves results in droplet shapes close to those of classical droplets rotating with the same angular velocity. The findings are corroborated by density functional theory calculations describing the velocity fields and shape deformations of a rotating superfluid cylinder. △ Less

Submitted 28 October, 2019; originally announced October 2019.

Comments: submitted to Physical Review Letters

arXiv:1909.06451 [pdf, other]

Distributed Focus and Digital Zoom

Authors: Wubin Pang, David J. Brady

Abstract: We explore integrated microcamera focus systems for array cameras. We propose a new model for system camera integration relying on fast action focus mechanisms with >10mm aperture. Rather than reducing resolution or expanding aperture size, such systems can be used in arrays to enable digital zoom. We show that a common mechanism supports camera modules with focal lengths ranging from 25 to 60 mm.… ▽ More We explore integrated microcamera focus systems for array cameras. We propose a new model for system camera integration relying on fast action focus mechanisms with >10mm aperture. Rather than reducing resolution or expanding aperture size, such systems can be used in arrays to enable digital zoom. We show that a common mechanism supports camera modules with focal lengths ranging from 25 to 60 mm. Designs for each focal length include a fixed objective lens group and an adjustable back focus group. Increasing the focal power of the front focal group enables the travel range of available microcamera modules to accommodate long focal length systems. We present design examples both discrete and multiscale array camera systems. △ Less

Submitted 13 September, 2019; originally announced September 2019.

arXiv:1905.07350 [pdf, other]

DeepSwarm: Optimising Convolutional Neural Networks using Swarm Intelligence

Authors: Edvinas Byla, Wei Pang

Abstract: In this paper we propose DeepSwarm, a novel neural architecture search (NAS) method based on Swarm Intelligence principles. At its core DeepSwarm uses Ant Colony Optimization (ACO) to generate ant population which uses the pheromone information to collectively search for the best neural architecture. Furthermore, by using local and global pheromone update rules our method ensures the balance betwe… ▽ More In this paper we propose DeepSwarm, a novel neural architecture search (NAS) method based on Swarm Intelligence principles. At its core DeepSwarm uses Ant Colony Optimization (ACO) to generate ant population which uses the pheromone information to collectively search for the best neural architecture. Furthermore, by using local and global pheromone update rules our method ensures the balance between exploitation and exploration. On top of this, to make our method more efficient we combine progressive neural architecture search with weight reusability. Furthermore, due to the nature of ACO our method can incorporate heuristic information which can further speed up the search process. After systematic and extensive evaluation, we discover that on three different datasets (MNIST, Fashion-MNIST, and CIFAR-10) when compared to existing systems our proposed method demonstrates competitive performance. Finally, we open source DeepSwarm as a NAS library and hope it can be used by more deep learning researchers and practitioners. △ Less

Submitted 17 May, 2019; originally announced May 2019.

Comments: 13 pages, 6 figures, to access DeepSwarm code go to https://github.com/Pattio/DeepSwarm

ACM Class: I.2.6

arXiv:1904.04483 [pdf, other]

doi 10.1016/j.nuclphysa.2019.06.010

The Inhomogeneous Phase of Dense Skyrmion Matter

Authors: Byung-Yoon Park, Won-Gi Paeng, Vicente Vento

Abstract: It was predicted qualitatively in ref.[1] that skyrmion matter at low density is stable in an inhomogeneous phase where skyrmions condensate into lumps while the remaining space is mostly empty. The aim of this paper is to proof quantitatively this prediction. In order to construct an inhomogeneous medium we distort the original FCC crystal to produce a phase of planar structures made of skyrmions… ▽ More It was predicted qualitatively in ref.[1] that skyrmion matter at low density is stable in an inhomogeneous phase where skyrmions condensate into lumps while the remaining space is mostly empty. The aim of this paper is to proof quantitatively this prediction. In order to construct an inhomogeneous medium we distort the original FCC crystal to produce a phase of planar structures made of skyrmions. We implement mathematically these planar structures by means of the 't Hooft instanton solution using the Atiyah-Manton ansatz. The results of our calculation of the average density and energy confirm the prediction suggesting that the phase diagram of the dense skyrmion matter is a lot more complex than a simple phase transition from the skyrmion FCC crystal lattice to the half-skyrmion CC one. Our results show that skyrmion matter shares common properties with standard nuclear matter develo** a skin and leading to a binding energy equation which resembles the Weiszaecker mass formula. △ Less

Submitted 9 April, 2019; originally announced April 2019.

Comments: 8 figures, 14 pages

arXiv:1810.07394 [pdf, ps, other]

doi 10.1016/j.cnsns.2019.04.008

Hybrid matter-wave - microwave solitons on the lattice

Authors: Zhihuan Luo, Weiwen Luo, Wei Pang, Zhijie Mai, Yongyao Li, Boris A. Malomed

Abstract: We introduce a two-component system which models a pseudospinor Bose-Einstein condensate (BEC), with a microwave field coupling its two components. The feedback of BEC of the field (the local-field effect) is taken into account by dint of the respective Poisson equation, which is solved using the Green's function. This gives rise to an effective long-range self-trap** interaction, which may act… ▽ More We introduce a two-component system which models a pseudospinor Bose-Einstein condensate (BEC), with a microwave field coupling its two components. The feedback of BEC of the field (the local-field effect) is taken into account by dint of the respective Poisson equation, which is solved using the Green's function. This gives rise to an effective long-range self-trap** interaction, which may act alone, or be combined with the contact cubic nonlinearity. The system is made discrete by loading the BEC into a deep optical-lattice potential. Numerical solutions demonstrate that onsite-centered fundamental solitons are stable in the cases of attractive or zero contact interactions, while offsite-centered solitons are unstable. In the case of the repulsive onsite nonlinearity, offsite solitons are stable, while their onsite-centered counterparts are stable only at sufficiently small values of the norm, where bistability between the off- and onsite-centered mode takes place. The shape of the onsite-centered solitons is very accurately predicted by a variational approximation (which includes essential technical novelties). Spatially-antisymmetric (\textquotedblleft twisted") solitons are stable at small values of the norm, being unstable at larger norms. In the strongly asymmetric version of the two-component system, which includes the Zeeman splitting, the system is reduced to a single discrete Gross-Pitaevskii equation, by eliminating the small higher-energy component. △ Less

Submitted 17 October, 2018; originally announced October 2018.

Comments: 12 Pages, 10 Figures, and 41 References

arXiv:1806.04868 [pdf, ps, other]

doi 10.1016/j.cnsns.2019.01.031

Two-dimensional composite solitons in Bose-Einstein condensates with spatially confined spin-orbit coupling

Authors: Yongyao Li, Xiliang Zhang, Rongxuan Zhong, Zhihuan Luo, Bin Liu, Chunqing Huang, Wei Pang, Boris A. Malomed

Abstract: It was recently found that the spin-orbit (SO) coupling can help to create stable matter-wave solitons in spinor Bose-Einstein condensates in the two-dimensional (2D) free space. Being induced by external laser illumination, the effective SO coupling can be applied too in a spatially confined area. Using numerical methods and the variational approximation (VA), we build families of 2D solitons of… ▽ More It was recently found that the spin-orbit (SO) coupling can help to create stable matter-wave solitons in spinor Bose-Einstein condensates in the two-dimensional (2D) free space. Being induced by external laser illumination, the effective SO coupling can be applied too in a spatially confined area. Using numerical methods and the variational approximation (VA), we build families of 2D solitons of the semi-vortex (SV) and mixed-mode (MM) types, and explore their stability, assuming that the SO-coupling strength is confined in the radial direction as a Gaussian. The most essential result is identification, by means of the VA and numerical methods, of the minimum size of the spatial confinement for which the 2D system maintains stable solitons of the SV and MM types. △ Less

Submitted 14 March, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

Comments: 9 pages, 6 figures, and 61 references, Commun Nonlinear Sci Numer Simulat 73 481 (2019)

Showing 1–50 of 94 results for author: Paeng, W