-
Zero-shot detection of buildings in mobile LiDAR using Language Vision Model
Authors:
June Moh Goo,
Zichao Zeng,
Jan Boehm
Abstract:
Recent advances have demonstrated that Language Vision Models (LVMs) surpass the existing State-of-the-Art (SOTA) in two-dimensional (2D) computer vision tasks, motivating attempts to apply LVMs to three-dimensional (3D) data. While LVMs are efficient and effective in addressing various downstream 2D vision tasks without training, they face significant challenges when it comes to point clouds, a r…
▽ More
Recent advances have demonstrated that Language Vision Models (LVMs) surpass the existing State-of-the-Art (SOTA) in two-dimensional (2D) computer vision tasks, motivating attempts to apply LVMs to three-dimensional (3D) data. While LVMs are efficient and effective in addressing various downstream 2D vision tasks without training, they face significant challenges when it comes to point clouds, a representative format for representing 3D data. It is more difficult to extract features from 3D data and there are challenges due to large data sizes and the cost of the collection and labelling, resulting in a notably limited availability of datasets. Moreover, constructing LVMs for point clouds is even more challenging due to the requirements for large amounts of data and training time. To address these issues, our research aims to 1) apply the Grounded SAM through Spherical Projection to transfer 3D to 2D, and 2) experiment with synthetic data to evaluate its effectiveness in bridging the gap between synthetic and real-world data domains. Our approach exhibited high performance with an accuracy of 0.96, an IoU of 0.85, precision of 0.92, recall of 0.91, and an F1 score of 0.92, confirming its potential. However, challenges such as occlusion problems and pixel-level overlaps of multi-label points during spherical image generation remain to be addressed in future studies.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Zero-shot Building Age Classification from Facade Image Using GPT-4
Authors:
Zichao Zeng,
June Moh Goo,
Xinglei Wang,
Bin Chi,
Meihui Wang,
Jan Boehm
Abstract:
A building's age of construction is crucial for supporting many geospatial applications. Much current research focuses on estimating building age from facade images using deep learning. However, building an accurate deep learning model requires a considerable amount of labelled training data, and the trained models often have geographical constraints. Recently, large pre-trained vision language mo…
▽ More
A building's age of construction is crucial for supporting many geospatial applications. Much current research focuses on estimating building age from facade images using deep learning. However, building an accurate deep learning model requires a considerable amount of labelled training data, and the trained models often have geographical constraints. Recently, large pre-trained vision language models (VLMs) such as GPT-4 Vision, which demonstrate significant generalisation capabilities, have emerged as potential training-free tools for dealing with specific vision tasks, but their applicability and reliability for building information remain unexplored. In this study, a zero-shot building age classifier for facade images is developed using prompts that include logical instructions. Taking London as a test case, we introduce a new dataset, FI-London, comprising facade images and building age epochs. Although the training-free classifier achieved a modest accuracy of 39.69%, the mean absolute error of 0.85 decades indicates that the model can predict building age epochs successfully albeit with a small bias. The ensuing discussion reveals that the classifier struggles to predict the age of very old buildings and is challenged by fine-grained predictions within 2 decades. Overall, the classifier utilising GPT-4 Vision is capable of predicting the rough age epoch of a building from a single facade image without any training.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Estimating the Effect of Crosstalk Error on Circuit Fidelity Using Noisy Intermediate-Scale Quantum Devices
Authors:
Sovanmonynuth Heng,
Myeongseong Go,
Youngsun Han
Abstract:
Current advancements in technology have focused the attention of the quantum computing community toward exploring the potential of near-term devices whose computing power surpasses that of classical computers in practical applications. An unresolved central question revolves around whether the inherent noise in these devices can be overcome or whether any potential quantum advantage would be limit…
▽ More
Current advancements in technology have focused the attention of the quantum computing community toward exploring the potential of near-term devices whose computing power surpasses that of classical computers in practical applications. An unresolved central question revolves around whether the inherent noise in these devices can be overcome or whether any potential quantum advantage would be limited. There is no doubt that crosstalk is one of the main sources of noise in noisy intermediate-scale quantum (NISQ) systems, and it poses a fundamental challenge to hardware designs. Crosstalk between parallel instructions can corrupt quantum states and cause incorrect program execution. In this study, we present a necessary analysis of the crosstalk error effect on NISQ devices. Our approach is extremely straightforward and practical to estimate the crosstalk error of various multi-qubit devices. In particular, we combine the randomized benchmarking (RB) and simultaneous randomized benchmarking (SRB) protocol to estimate the crosstalk error from the correlation controlled-NOT (CNOT) gate. We demonstrate this protocol experimentally on 5-, 7-, \& 16-qubit devices. Our results demonstrate the crosstalk error model of three different IBM quantum devices over the experimental week and compare the error variation against the machine, number of qubits, quantum volume, processor, and topology. We then confirm the improvement in the circuit fidelity on different benchmarks by up to 3.06x via inserting an instruction barrier, as compared with an IBM quantum noisy device which offers near-optimal crosstalk mitigation in practice. Finally, we discuss the current system limitation, its tradeoff on fidelity and depth, noise beyond the NISQ system, and mitigation opportunities to ensure that the quantum operation can perform its quantum magic undisturbed.
△ Less
Submitted 17 May, 2024; v1 submitted 10 February, 2024;
originally announced February 2024.
-
Improving Zero-noise Extrapolation for Quantum-gate Error Mitigation using a Noise-aware Folding Method
Authors:
Leanghok Hour,
Myeongseong Go,
Youngsun Han
Abstract:
Recent thousand-qubit processors represent a significant hardware advancement, but current limitations prevent effective quantum error correction (QEC), necessitating reliance on quantum error mitigation (QEM) to enhance result fidelity from quantum computers. Our paper introduces a noise-aware folding technique that enhances Zero-Noise Extrapolation (ZNE) by leveraging the noise characteristics o…
▽ More
Recent thousand-qubit processors represent a significant hardware advancement, but current limitations prevent effective quantum error correction (QEC), necessitating reliance on quantum error mitigation (QEM) to enhance result fidelity from quantum computers. Our paper introduces a noise-aware folding technique that enhances Zero-Noise Extrapolation (ZNE) by leveraging the noise characteristics of target quantum hardware to fold circuits more efficiently. Unlike traditional ZNE approaches assuming uniform error distribution, our method redistributes noise using calibration data based on hardware noise models. By employing a noise-adaptive compilation method combined with our proposed folding mechanism, we enhance the ZNE accuracy of quantum gate-based computing using superconducting quantum computers. This paper highlights the uniqueness of our method, summarizes noise accumulation, presents the scaling algorithm, and compares the reliability of our method with those of existing models using linear extrapolation model. Experimental results show that compared to existing folding methods, our approach achieved a 35% improvement on quantum computer simulators and a 31% improvement on real quantum computers, demonstrating the effectiveness of our proposed approach.
△ Less
Submitted 14 May, 2024; v1 submitted 23 January, 2024;
originally announced January 2024.
-
Context-Aware Coupler Reconfiguration for Tunable Coupler-Based Superconducting Quantum Computers
Authors:
Leanghok Hour,
Sovanmonynuth Heng,
Sengthai Heng,
Myeongseong Go,
Youngsun Han
Abstract:
We address interconnection challenges in limited-qubit superconducting quantum computers (SQC), which often face crosstalk errors due to expanded qubit interactions during operations. Existing mitigation methods carry trade-offs, like hardware couplers or software-based gate scheduling. Our innovation, the Context-Aware COupler REconfiguration (CA-CORE) compilation method, aligns with application-…
▽ More
We address interconnection challenges in limited-qubit superconducting quantum computers (SQC), which often face crosstalk errors due to expanded qubit interactions during operations. Existing mitigation methods carry trade-offs, like hardware couplers or software-based gate scheduling. Our innovation, the Context-Aware COupler REconfiguration (CA-CORE) compilation method, aligns with application-specific design principles. It optimizes the qubit connections for improved SQC performance, leveraging tunable couplers. Through contextual analysis of qubit correlations, we configure an efficient coupling map considering SQC constraints. Our method reduces depth and SWAP operations by up to 18.84% and 42.47%, respectively. It also enhances circuit fidelity by 40% compared to IBM and Google's topologies. Notably, our method compiles a 33-qubit circuit in less than 1 second.
△ Less
Submitted 31 March, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
Socio-Economic Deprivation Analysis: Diffusion Maps
Authors:
June Moh Goo
Abstract:
This report proposes a model to predict the location of the most deprived areas in a city using data from the census. A census data is very high dimensional and needs to be simplified. We use a novel algorithm to reduce dimensionality and find patterns: The diffusion map. Features are defined by eigenvectors of the Laplacian matrix that defines the diffusion map. Eigenvectors corresponding to the…
▽ More
This report proposes a model to predict the location of the most deprived areas in a city using data from the census. A census data is very high dimensional and needs to be simplified. We use a novel algorithm to reduce dimensionality and find patterns: The diffusion map. Features are defined by eigenvectors of the Laplacian matrix that defines the diffusion map. Eigenvectors corresponding to the smallest eigenvalues indicate specific population features. Previous work has found qualitatively that the second most important dimension for describing the census data in Bristol is linked to deprivation. In this report, we analyse how good this dimension is as a model for predicting deprivation by comparing with the recognised measures. The Pearson correlation coefficient was found to be over 0.7. The top 10 per cent of deprived areas in the UK which also locate in Bristol are extracted to test the accuracy of the model. There are 52 most deprived areas, and 38 areas are correctly identified by comparing to the model. The influence of scores of IMD domains that do not correlate with the models, Eigenvector 2 entries of non-deprived OAs and orthogonality of Eigenvectors cause the model to fail the prediction of 14 deprived areas.
However, overall, the model shows a high performance to predict the future deprivation of overall areas where the project considers. This project is expected to support the government to allocate resources and funding.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
The Nanoplasmonic Purcell Effect in Ultrafast and High-Light-Yield Perovskite Scintillators
Authors:
Wenzheng Ye,
Zhihua Yong,
Michael Go,
Dominik Kowal,
Francesco Maddalena,
Liliana Tjahjana,
Wang Hong,
Arramel Arramel,
Christophe Dujardin,
Muhammad Danang Birowosuto,
Liang Jie Wong
Abstract:
The development of X-ray scintillators with ultrahigh light yields and ultrafast response times is a long sought-after goal. In this work, we theoretically predict and experimentally demonstrate a fundamental mechanism that pushes the frontiers of ultrafast X-ray scintillator performance: the use of nanoscale-confined surface plasmon polariton modes to tailor the scintillator response time via the…
▽ More
The development of X-ray scintillators with ultrahigh light yields and ultrafast response times is a long sought-after goal. In this work, we theoretically predict and experimentally demonstrate a fundamental mechanism that pushes the frontiers of ultrafast X-ray scintillator performance: the use of nanoscale-confined surface plasmon polariton modes to tailor the scintillator response time via the Purcell effect. By incorporating nanoplasmonic materials in scintillator devices, this work predicts over 10-fold enhancement in decay rate and 38% reduction in time resolution even with only a simple planar design. We experimentally demonstrate the nanoplasmonic Purcell effect using perovskite scintillators, enhancing the light yield by over 120% to 88 $\pm$ 11 ph/keV, and the decay rate by over 60% to 2.0 $\pm$ 0.2 ns for the average decay time, and 0.7 $\pm$ 0.1 ns for the ultrafast decay component, in good agreement with the predictions of our theoretical framework. We perform proof-of-concept X-ray imaging experiments using nanoplasmonic scintillators, demonstrating 182% enhancement in the modulation transfer function at 4 line pairs per millimeter spatial frequency. This work highlights the enormous potential of nanoplasmonics in optimizing ultrafast scintillator devices for applications including time-of-flight X-ray imaging and photon-counting computed tomography.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Magnetic fields in the Horsehead Nebula
Authors:
Jihye Hwang,
Kate Pattle,
Harriet Parsons,
Mallory Go,
Jongsoo Kim
Abstract:
We present the first polarized dust emission measurements of the Horsehead Nebula, obtained using the POL-2 polarimeter on the Submillimetre Common-User Bolometer Array 2 (SCUBA-2) camera on the James Clerk Maxwell Telescope (JCMT). The Horsehead Nebula contains two sub-millimeter sources, a photodissociation region (PDR; SMM1) and a starless core (SMM2). We see well-ordered magnetic fields in bot…
▽ More
We present the first polarized dust emission measurements of the Horsehead Nebula, obtained using the POL-2 polarimeter on the Submillimetre Common-User Bolometer Array 2 (SCUBA-2) camera on the James Clerk Maxwell Telescope (JCMT). The Horsehead Nebula contains two sub-millimeter sources, a photodissociation region (PDR; SMM1) and a starless core (SMM2). We see well-ordered magnetic fields in both sources. We estimated plane-of-sky magnetic field strengths of 56$\pm$9 and 129$\pm$21 $μ$G in SMM1 and SMM2, respectively, and obtained mass-to-flux ratios and Alfvén Mach numbers of less than 0.6, suggesting that the magnetic field can resist gravitational collapse and that magnetic pressure exceeds internal turbulent pressure in these sources. In SMM2, the kinetic and gravitational energies are comparable to one another, but less than the magnetic energy. We suggest a schematic view of the overall magnetic field structure in the Horsehead Nebula. Magnetic field lines in SMM1 appear have been compressed and reordered during the formation of the PDR, while the likely more-embedded SMM2 may have inherited its field from that of the pre-shock molecular cloud. The magnetic fields appear to currently play an important role in supporting both sources.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Roadmap on spatiotemporal light fields
Authors:
Yijie Shen,
Qiwen Zhan,
Logan G. Wright,
Demetrios N. Christodoulides,
Frank W. Wise,
Alan E. Willner,
Zhe Zhao,
Kai-heng Zou,
Chen-Ting Liao,
Carlos Hernández-García,
Margaret Murnane,
Miguel A. Porras,
Andy Chong,
Chenhao Wan,
Konstantin Y. Bliokh,
Murat Yessenov,
Ayman F. Abouraddy,
Liang Jie Wong,
Michael Go,
Suraj Kumar,
Cheng Guo,
Shanhui Fan,
Nikitas Papasimakis,
Nikolay I. Zheludev,
Lu Chen
, et al. (20 additional authors not shown)
Abstract:
Spatiotemporal sculpturing of light pulse with ultimately sophisticated structures represents the holy grail of the human everlasting pursue of ultrafast information transmission and processing as well as ultra-intense energy concentration and extraction. It also holds the key to unlock new extraordinary fundamental physical effects. Traditionally, spatiotemporal light pulses are always treated as…
▽ More
Spatiotemporal sculpturing of light pulse with ultimately sophisticated structures represents the holy grail of the human everlasting pursue of ultrafast information transmission and processing as well as ultra-intense energy concentration and extraction. It also holds the key to unlock new extraordinary fundamental physical effects. Traditionally, spatiotemporal light pulses are always treated as spatiotemporally separable wave packet as solution of the Maxwell's equations. In the past decade, however, more generalized forms of spatiotemporally nonseparable solution started to emerge with growing importance for their striking physical effects. This roadmap intends to highlight the recent advances in the creation and control of increasingly complex spatiotemporally sculptured pulses, from spatiotemporally separable to complex nonseparable states, with diverse geometric and topological structures, presenting a bird's eye viewpoint on the zoology of spatiotemporal light fields and the outlook of future trends and open challenges.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
gSwin: Gated MLP Vision Model with Hierarchical Structure of Shifted Window
Authors:
Mocho Go,
Hideyuki Tachibana
Abstract:
Following the success in language domain, the self-attention mechanism (transformer) is adopted in the vision domain and achieving great success recently. Additionally, as another stream, multi-layer perceptron (MLP) is also explored in the vision domain. These architectures, other than traditional CNNs, have been attracting attention recently, and many methods have been proposed. As one that comb…
▽ More
Following the success in language domain, the self-attention mechanism (transformer) is adopted in the vision domain and achieving great success recently. Additionally, as another stream, multi-layer perceptron (MLP) is also explored in the vision domain. These architectures, other than traditional CNNs, have been attracting attention recently, and many methods have been proposed. As one that combines parameter efficiency and performance with locality and hierarchy in image recognition, we propose gSwin, which merges the two streams; Swin Transformer and (multi-head) gMLP. We showed that our gSwin can achieve better accuracy on three vision tasks, image classification, object detection and semantic segmentation, than Swin Transformer, with smaller model size.
△ Less
Submitted 2 September, 2023; v1 submitted 24 August, 2022;
originally announced August 2022.
-
Neural manifold analysis of brain circuit dynamics in health and disease
Authors:
Rufus Mitchell-Heggs,
Seigfred Prado,
Giuseppe P. Gava,
Mary Ann Go,
Simon R. Schultz
Abstract:
Recent developments in experimental neuroscience make it possible to simultaneously record the activity of thousands of neurons. However, the development of analysis approaches for such large-scale neural recordings have been slower than those applicable to single-cell experiments. One approach that has gained recent popularity is neural manifold learning. This approach takes advantage of the fact…
▽ More
Recent developments in experimental neuroscience make it possible to simultaneously record the activity of thousands of neurons. However, the development of analysis approaches for such large-scale neural recordings have been slower than those applicable to single-cell experiments. One approach that has gained recent popularity is neural manifold learning. This approach takes advantage of the fact that often, even though neural datasets may be very high dimensional, the dynamics of neural activity tends to traverse a much lower-dimensional space. The topological structures formed by these low-dimensional neural subspaces are referred to as neural manifolds, and may potentially provide insight linking neural circuit dynamics with cognitive function and behavioural performance. In this paper we review a number of linear and non-linear approaches to neural manifold learning, by setting them within a common mathematical framework, and comparing their advantages and disadvantages with respect to their use for neural data analysis. We apply them to a number of datasets from published literature, comparing the manifolds that result from their application to hippocampal place cells, motor cortical neurons during a reaching task, and prefrontal cortical neurons during a multi-behaviour task. We find that in many circumstances linear algorithms produce similar results to non-linear methods, although in particular in cases where the behavioural complexity is greater, nonlinear methods tend to find lower dimensional manifolds, at the possible expense of interpretability. We demonstrate that these methods are applicable to the study of neurological disorders through simulation of a mouse model of Alzheimers Disease, and speculate that neural manifold analysis may help us to understand the circuit-level consequences of molecular and cellular neuropathology.
△ Less
Submitted 17 October, 2022; v1 submitted 22 March, 2022;
originally announced March 2022.
-
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Authors:
Hideyuki Tachibana,
Mocho Go,
Muneyoshi Inahara,
Yotaro Katayama,
Yotaro Watanabe
Abstract:
Diffusion generative models have emerged as a new challenger to popular deep neural generative models such as GANs, but have the drawback that they often require a huge number of neural function evaluations (NFEs) during synthesis unless some sophisticated sampling strategies are employed. This paper proposes new efficient samplers based on the numerical schemes derived by the familiar Taylor expa…
▽ More
Diffusion generative models have emerged as a new challenger to popular deep neural generative models such as GANs, but have the drawback that they often require a huge number of neural function evaluations (NFEs) during synthesis unless some sophisticated sampling strategies are employed. This paper proposes new efficient samplers based on the numerical schemes derived by the familiar Taylor expansion, which directly solves the ODE/SDE of interest. In general, it is not easy to compute the derivatives that are required in higher-order Taylor schemes, but in the case of diffusion models, this difficulty is alleviated by the trick that the authors call ``ideal derivative substitution,'' in which the higher-order derivatives are replaced by tractable ones. To derive ideal derivatives, the authors argue the ``single point approximation,'' in which the true score function is approximated by a conditional one, holds in many cases, and considered the derivatives of this approximation. Applying thus obtained new quasi-Taylor samplers to image generation tasks, the authors experimentally confirmed that the proposed samplers could synthesize plausible images in small number of NFEs, and that the performance was better or at the same level as DDIM and Runge-Kutta methods. The paper also argues the relevance of the proposed samplers to the existing ones mentioned above.
△ Less
Submitted 11 October, 2022; v1 submitted 26 December, 2021;
originally announced December 2021.
-
An Automated Generation of Bootstrap Equations for Numerical Study of Critical Phenomena
Authors:
Mocho Go
Abstract:
In this thesis, we introduce new tools for the conformal bootstrap, autoboot and qboot. Each tool solves a different step in the whole computational stack, and combined with an existing efficient tool SDPB which solves semidefinite programming (SDP), our tools make it easier to study conformal field theories with more complicated global symmetries and with more general spectra. In the introduction…
▽ More
In this thesis, we introduce new tools for the conformal bootstrap, autoboot and qboot. Each tool solves a different step in the whole computational stack, and combined with an existing efficient tool SDPB which solves semidefinite programming (SDP), our tools make it easier to study conformal field theories with more complicated global symmetries and with more general spectra. In the introduction, we review how the conformal bootstrap method gives rich information about the theory at the fixed point of renormalization group, or in other words, the critical phenomena such as the Ising model at criticality. The following three sections focus on the theories behind the implementation of autoboot and qboot, and the explicit implementation, freely available at https://github.com/selpoG/autoboot/ and https://github.com/selpoG/qboot/, is discussed in section 5. We also have two applications in the last section, the Ising model and the O(2) vector model in three dimensions, each of them has close relationship with a physical system in the real world.
△ Less
Submitted 7 June, 2020;
originally announced June 2020.
-
autoboot: A generator of bootstrap equations with global symmetry
Authors:
Mocho Go,
Yuji Tachikawa
Abstract:
We introduce autoboot, a Mathematica program which automatically generates mixed-correlator bootstrap equations of an arbitrary number of scalar external operators, given the global symmetry group and the representations of the operators. The output is a Python program which uses Ohtsuki's cboot which in turn uses Simmons-Duffin's sdpb. In an appendix we also discuss a simple technique to signific…
▽ More
We introduce autoboot, a Mathematica program which automatically generates mixed-correlator bootstrap equations of an arbitrary number of scalar external operators, given the global symmetry group and the representations of the operators. The output is a Python program which uses Ohtsuki's cboot which in turn uses Simmons-Duffin's sdpb. In an appendix we also discuss a simple technique to significantly reduce the time to run sdpb, which we call hot-starting.
△ Less
Submitted 6 May, 2019; v1 submitted 25 March, 2019;
originally announced March 2019.
-
Luttinger-liquid-like behavior in bulk crystals of the quasi-one-dimensional conductor NbSe$_3$
Authors:
S. V. Zaitsev-Zotov,
M. S. H. Go,
E. Slot,
H. S. J. van der Zant
Abstract:
CDW/Normal metal/CDW junctions and nanoconstrictions in crystals of the quasi-one-dimensional conductor NbSe$_3$ are manufactured using a focused-ion-beam. It is found that the low-temperature conduction of these structures changes dramatically and loses the features of the charge-density-wave transition. Instead, a dielectric phase is developed. Up to 6-order power-law variations of the conduct…
▽ More
CDW/Normal metal/CDW junctions and nanoconstrictions in crystals of the quasi-one-dimensional conductor NbSe$_3$ are manufactured using a focused-ion-beam. It is found that the low-temperature conduction of these structures changes dramatically and loses the features of the charge-density-wave transition. Instead, a dielectric phase is developed. Up to 6-order power-law variations of the conduction as a function of both temperature and electric field can be observed for this new phase. The transition from quasi-one-dimensional behavior to one-dimensional behavior is associated with destruction of the three-dimensional order of the charge-density waves by fluctuations. It results in a recovery of the Luttinger-liquid properties of metallic chains, like it takes place in sliding Luttinger liquid phase.
△ Less
Submitted 30 October, 2001;
originally announced October 2001.