Search | arXiv e-print repository

Data-driven Model Reduction for Soft Robots via Lagrangian Operator Inference

Authors: Harsh Sharma, Iman Adibnazari, Jacobo Cervera-Torralba, Michael T. Tolley, Boris Kramer

Abstract: Data-driven model reduction methods provide a nonintrusive way of constructing computationally efficient surrogates of high-fidelity models for real-time control of soft robots. This work leverages the Lagrangian nature of the model equations to derive structure-preserving linear reduced-order models via Lagrangian Operator Inference and compares their performance with prominent linear model reduc… ▽ More Data-driven model reduction methods provide a nonintrusive way of constructing computationally efficient surrogates of high-fidelity models for real-time control of soft robots. This work leverages the Lagrangian nature of the model equations to derive structure-preserving linear reduced-order models via Lagrangian Operator Inference and compares their performance with prominent linear model reduction techniques through an anguilliform swimming soft robot model example with 231,336 degrees of freedom. The case studies demonstrate that preserving the underlying Lagrangian structure leads to learned models with higher predictive accuracy and robustness to unseen inputs. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08839 [pdf, other]

A Survey on the Application of Generative Adversarial Networks in Cybersecurity: Prospective, Direction and Open Research Scopes

Authors: Md Mashrur Arifin, Md Shoaib Ahmed, Tanmai Kumar Ghosh, Jun Zhuang, Jyh-haw Yeh

Abstract: With the proliferation of Artificial Intelligence, there has been a massive increase in the amount of data required to be accumulated and disseminated digitally. As the data are available online in digital landscapes with complex and sophisticated infrastructures, it is crucial to implement various defense mechanisms based on cybersecurity. Generative Adversarial Networks (GANs), which are deep le… ▽ More With the proliferation of Artificial Intelligence, there has been a massive increase in the amount of data required to be accumulated and disseminated digitally. As the data are available online in digital landscapes with complex and sophisticated infrastructures, it is crucial to implement various defense mechanisms based on cybersecurity. Generative Adversarial Networks (GANs), which are deep learning models, have emerged as powerful solutions for addressing the constantly changing security issues. This survey studies the significance of the deep learning model, precisely on GANs, in strengthening cybersecurity defenses. Our survey aims to explore the various works completed in GANs, such as Intrusion Detection Systems (IDS), Mobile and Network Trespass, BotNet Detection, and Malware Detection. The focus is to examine how GANs can be influential tools to strengthen cybersecurity defenses in these domains. Further, the paper discusses the challenges and constraints of using GANs in these areas and suggests future research directions. Overall, the paper highlights the potential of GANs in enhancing cybersecurity measures and addresses the need for further exploration in this field. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08838 [pdf, other]

Deep Learning for Network Anomaly Detection under Data Contamination: Evaluating Robustness and Mitigating Performance Degradation

Authors: D'Jeff K. Nkashama, Jordan Masakuna Félicien, Arian Soltani, Jean-Charles Verdier, Pierre-Martin Tardif, Marc Frappier, Froduald Kabanza

Abstract: Deep learning (DL) has emerged as a crucial tool in network anomaly detection (NAD) for cybersecurity. While DL models for anomaly detection excel at extracting features and learning patterns from data, they are vulnerable to data contamination -- the inadvertent inclusion of attack-related data in training sets presumed benign. This study evaluates the robustness of six unsupervised DL algorithms… ▽ More Deep learning (DL) has emerged as a crucial tool in network anomaly detection (NAD) for cybersecurity. While DL models for anomaly detection excel at extracting features and learning patterns from data, they are vulnerable to data contamination -- the inadvertent inclusion of attack-related data in training sets presumed benign. This study evaluates the robustness of six unsupervised DL algorithms against data contamination using our proposed evaluation protocol. Results demonstrate significant performance degradation in state-of-the-art anomaly detection algorithms when exposed to contaminated data, highlighting the critical need for self-protection mechanisms in DL-based NAD models. To mitigate this vulnerability, we propose an enhanced auto-encoder with a constrained latent representation, allowing normal data to cluster more densely around a learnable center in the latent space. Our evaluation reveals that this approach exhibits improved resistance to data contamination compared to existing methods, offering a promising direction for more robust NAD systems. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: arXiv admin note: text overlap with arXiv:2207.03576

arXiv:2407.08837 [pdf, other]

Constraining $f(Q, L_m)$ gravity with bulk viscosity

Authors: Yerlan Myrzakulov, M. Koussour, I. Y. Davletov, J. Rayimbaev

Abstract: We investigate the influence of bulk viscosity on late-time cosmic acceleration within an extended $f(Q, L_m)$ gravity framework, where the non-metricity $Q$ is non-minimally coupled with the matter Lagrangian $L_m$. Analyzing the function $f(Q, L_m) = αQ + βL_m$, we derive exact solutions under non-relativistic matter domination. Using observational datasets ($H(z)$, Pantheon supernovae, and thei… ▽ More We investigate the influence of bulk viscosity on late-time cosmic acceleration within an extended $f(Q, L_m)$ gravity framework, where the non-metricity $Q$ is non-minimally coupled with the matter Lagrangian $L_m$. Analyzing the function $f(Q, L_m) = αQ + βL_m$, we derive exact solutions under non-relativistic matter domination. Using observational datasets ($H(z)$, Pantheon supernovae, and their combination), we constrain the model parameters $H_0$, $α$, $β$, and $ζ$. The deceleration parameter transitions from positive to negative values around redshifts $z_t \approx 0.80$ to $0.99 $, indicating current accelerated expansion. Moreover, the effective equation of state parameter, $ω_{eff}$, resembles quintessence dark energy ($-1 < ω_{eff} < -\frac{1}{3}$), with corresponding values from respective datasets. Finally, we use the $Om(z)$ diagnostic, which confirms that our model demonstrates quintessence-like behavior. Our findings underscore the significant role of bulk viscosity in understanding accelerated expansion in the universe within alternative gravity theories. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 12 pages, 8 figures

arXiv:2407.08834 [pdf, other]

The Solar eruptioN Integral Field Spectrograph

Authors: Vicki L. Herde, Phillip C. Chamberlin, Don Schmit, Adrian Daw, Ryan O. Milligan, Vanessa Polito, Souvik Bose, Spencer Boyajian, Paris Buedel, Will Edgar, Alex Gebben, Qian Gong, Ross Jacobsen, Nicholas Nell, Bennet Schwab, Alan Sims, David Summers, Zachary Turner, Trace Valade, Joseph Wallace

Abstract: The Solar eruptioN Integral Field Spectrograph (SNIFS) is a solar-gazing spectrograph scheduled to fly in the summer of 2025 on a NASA sounding rocket. Its goal is to view the solar chromosphere and transition region at a high cadence (1s) both spatially (0.5") and spectrally (33 mÅ) viewing wavelengths around Lyman Alpha (1216 Å), Si iii (1206 Å) and O v (1218 Å) to observe spicules, nanoflares,… ▽ More The Solar eruptioN Integral Field Spectrograph (SNIFS) is a solar-gazing spectrograph scheduled to fly in the summer of 2025 on a NASA sounding rocket. Its goal is to view the solar chromosphere and transition region at a high cadence (1s) both spatially (0.5") and spectrally (33 mÅ) viewing wavelengths around Lyman Alpha (1216 Å), Si iii (1206 Å) and O v (1218 Å) to observe spicules, nanoflares, and possibly a solar flare. This time cadence will provide yet-unobserved detail about fast-changing features of the Sun. The instrument is comprised of a Gregorian-style reflecting telescope combined with a spectrograph via a specialized mirrorlet array that focuses the light from each spatial location in the image so that it may be spectrally dispersed without overlap from neighboring locations. This paper discusses the driving science, detailed instrument and subsystem design, and pre-integration testing of the SNIFS instrument. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 22 pages (not including references), 7 figures, submitting to Solar Physics

arXiv:2407.08832 [pdf]

doi 10.3847/1538-3881/ad528d

Searching for Protoplanets around MWC 758 and MWC 480 in Br-$γ$ using Kernel Phase and SCExAO/CHARIS

Authors: Alexander Chaushev, Steph Sallum, Julien Lozi, Jeffrey Chilcote, Tyler Groff, Olivier Guyon, N. Jeremy Kasdin, Barnaby Norris, Andy Skemer

Abstract: Discovering new actively-accreting protoplanets is crucial to answering open questions about planet formation. However, identifying such planets at orbital distances where they are expected to be abundant is extremely challenging, both due to the technical requirements and large distances to star-forming regions. Here we use the kernel phase interferometry (KPI) technique to search for companions… ▽ More Discovering new actively-accreting protoplanets is crucial to answering open questions about planet formation. However, identifying such planets at orbital distances where they are expected to be abundant is extremely challenging, both due to the technical requirements and large distances to star-forming regions. Here we use the kernel phase interferometry (KPI) technique to search for companions around the $\sim$6 and $\sim$8 Myr old Herbig Ae stars MWC 758 and MWC 480. KPI is a data analysis technique which is sensitive to moderate asymmetries, arising from eg. a circumstellar disk or companions with contrasts of up to 6-8 mags, at separations down to and even below the classical Rayleigh diffraction limit ($\sim 1.2λ/ D$). Using the high spectral resolution K-band mode of the SCExAO/CHARIS integral field spectrograph, we search for both excess Br-$γ$ line emission and continuum emission from companions around MWC 480 and MWC 758. We are able to set limits on the presence of rapidly accreting protoplanets and brown dwarfs between 4 and 16 au, well interior to those of previous studies. In Br-$γ$, we set limits on excess line emission equivalent to accretion rates ranging from $10^{-5} M_{j}^{2}.yr^{-1}$ to $10^{-6}M_{j}^{2}.yr^{-1}$. Our achievable contrasts demonstrate that KPI using SCExAO/CHARIS is a promising technique to search for giant accreting protoplanets at smaller separations compared to conventional imaging. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 8 pages, 4 figures

Journal ref: The Astronomical Journal (2024), 168, 70

arXiv:2407.08826 [pdf, ps, other]

The CDAWG Index and Pattern Matching on Grammar-Compressed Strings

Authors: Alan M. Cleary, Joseph Winjum, Jordan Dood, Shunsuke Inenaga

Abstract: The compact directed acyclic word graph (CDAWG) is the minimal compact automaton that recognizes all the suffixes of a string. Classically the CDAWG has been implemented as an index of the string it recognizes, requiring $o(n)$ space for a copy of the string $T$ being indexed, where $n=|T|$. In this work, we propose using the CDAWG as an index for grammar-compressed strings. While this enables all… ▽ More The compact directed acyclic word graph (CDAWG) is the minimal compact automaton that recognizes all the suffixes of a string. Classically the CDAWG has been implemented as an index of the string it recognizes, requiring $o(n)$ space for a copy of the string $T$ being indexed, where $n=|T|$. In this work, we propose using the CDAWG as an index for grammar-compressed strings. While this enables all analyses supported by the CDAWG on any grammar-compressed string, in this work we specifically consider pattern matching. Using the CDAWG index, pattern matching can be performed on any grammar-compressed string in $\mathcal{O}(\text{ra}(m)+\text{occ})$ time while requiring only $\mathcal{O}(\text{er}(T))$ additional space, where $m$ is the length of the pattern, $\text{ra}(m)$ is the grammar random access time, $\text{occ}$ is the number of occurrences of the pattern in $T$, and $\text{er}(T)$ is the number of right-extensions of the maximal repeats in $T$. Our experiments show that even when using a naïve random access algorithm, the CDAWG index achieves state of the art run-time performance for pattern matching on grammar-compressed strings. Additionally, we find that all of the grammars computed for our experiments are smaller than the number of right-extensions in the string they produce and, thus, their CDAWGs are within the best known $\mathcal{O}(\text{er}(T))$ space asymptotic bound. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08821 [pdf]

Drying of soft colloidal films

Authors: Keumkyung Kuk, Julian Ringling, Kevin Gräff, Sebastian Hänsch, Virginia Carrasco-Fadanelli, Andrey A. Rudov, Igor I. Potemkin, Regine von Klitzing, Ivo Buttinoni, Matthias Karg

Abstract: Thin films made of deformable micro- and nano-units, such as biological membranes, polymer interfaces, and particle-laden liquid surfaces, exhibit a complex behavior during drying, with consequences for various applications like wound healing, coating technologies, and additive manufacturing. Studying the drying dynamics and structural changes of soft colloidal films thus holds the potential to yi… ▽ More Thin films made of deformable micro- and nano-units, such as biological membranes, polymer interfaces, and particle-laden liquid surfaces, exhibit a complex behavior during drying, with consequences for various applications like wound healing, coating technologies, and additive manufacturing. Studying the drying dynamics and structural changes of soft colloidal films thus holds the potential to yield valuable insights to achieve improvements for applications. In this study, we employ interfacial monolayers of core-shell microgels with varying degrees of softness as model systems and investigate their drying behavior on differently modified solid substrates (hydrophobic vs. hydrophilic). By leveraging on video microscopy, particle tracking, and thin film interference, we shed light on the interplay between microgel adhesion to solid surfaces and the immersion capillary forces that arise in the thin liquid film. We discovered that a dried replica of the interfacial microstructure can be more accurately achieved on a hydrophobic substrate relative to a hydrophilic one, particularly when employing softer colloids as opposed to harder counterparts. These observations are qualitatively supported by experiments with a thin film pressure balance which allows mimicking and controlling the drying process and by computer simulations with coarse-grained models. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 29 pages of main manuscript with 8 figures and 1 table, 21 pages of Supporting Information with 21 figures and 4 tables

arXiv:2407.08816 [pdf, other]

Improved analysis of strong-interaction-stable doubly-bottom tetraquarks on the lattice

Authors: B. Colquhoun, A. Francis, R. J. Hudspith, R. Lewis, K. Maltman, W. G. Parrott

Abstract: We update earlier lattice results for the binding energies of the flavor antitriplet of strong-interaction-stable doubly bottom, $J^P=1^+$ tetraquarks, employing an extended sink construction which produces significantly improved ground-state effective-mass plateaus, as well as new, larger-volume ensembles which reduce possible finite-volume effects at lighter pion masses. The updated bindings are… ▽ More We update earlier lattice results for the binding energies of the flavor antitriplet of strong-interaction-stable doubly bottom, $J^P=1^+$ tetraquarks, employing an extended sink construction which produces significantly improved ground-state effective-mass plateaus, as well as new, larger-volume ensembles which reduce possible finite-volume effects at lighter pion masses. The updated bindings are $115(17)$ MeV for the $I=0$ member of the antitriplet and $47(8)$ MeV for its $I=1/2$ partner. We also provide an update of our earlier study of the variable heavy mass dependence of binding in the $1^+$ channel and new results on this dependence for binding in the $0^+$ channel, accessible when the two heavy quarks have unequal masses. Implications of these results of potential relevance to experimental searches for signals of the production of doubly bottom tetraquarks and/or a possible bottom-charm partner of the $T_{cc}$ are also discussed. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 27 pages, 7 figures

arXiv:2407.08814 [pdf, other]

Covariate Assisted Entity Ranking with Sparse Intrinsic Scores

Authors: Jianqing Fan, Jikai Hou, Mengxin Yu

Abstract: This paper addresses the item ranking problem with associate covariates, focusing on scenarios where the preference scores can not be fully explained by covariates, and the remaining intrinsic scores, are sparse. Specifically, we extend the pioneering Bradley-Terry-Luce (BTL) model by incorporating covariate information and considering sparse individual intrinsic scores. Our work introduces novel… ▽ More This paper addresses the item ranking problem with associate covariates, focusing on scenarios where the preference scores can not be fully explained by covariates, and the remaining intrinsic scores, are sparse. Specifically, we extend the pioneering Bradley-Terry-Luce (BTL) model by incorporating covariate information and considering sparse individual intrinsic scores. Our work introduces novel model identification conditions and examines the regularized penalized Maximum Likelihood Estimator (MLE) statistical rates. We then construct a debiased estimator for the penalized MLE and analyze its distributional properties. Additionally, we apply our method to the goodness-of-fit test for models with no latent intrinsic scores, namely, the covariates fully explaining the preference scores of individual items. We also offer confidence intervals for ranks. Our numerical studies lend further support to our theoretical findings, demonstrating validation for our proposed method △ Less

Submitted 9 July, 2024; originally announced July 2024.

Comments: 77 pages, 3 figures

arXiv:2407.08812 [pdf, ps, other]

Fence decompositions and cherry covers in non-binary phylogenetic networks

Authors: Joan Carles Pons, Pau Vives López, Yukihiro Murakami, Leo Van Iersel

Abstract: Reticulate evolution can be modelled using phylogenetic networks. Tree-based networks, which are one of the more general classes of phylogenetic networks, have recently gained eminence for its ability to represent evolutionary histories with an underlying tree structure. To better understand tree-based networks, numerous characterizations have been proposed, based on tree embeddings, matchings, an… ▽ More Reticulate evolution can be modelled using phylogenetic networks. Tree-based networks, which are one of the more general classes of phylogenetic networks, have recently gained eminence for its ability to represent evolutionary histories with an underlying tree structure. To better understand tree-based networks, numerous characterizations have been proposed, based on tree embeddings, matchings, and arc partitions. Here, we build a bridge between two arc partition characterizations, namely maximal fence decompositions and cherry covers. Results on cherry covers have been found for general phylogenetic networks. We first show that the number of cherry covers is the same as the number of support trees (underlying tree structure of tree-based networks) for a given semibinary network. Maximal fence decompositions have only been defined thus far for binary networks (constraints on vertex degrees). We remedy this by generalizing fence decompositions to non-binary networks, and using this, we characterize semi-binary tree-based networks in terms of forbidden structures. Furthermore, we give an explicit enumeration of cherry covers of semi-binary networks, by studying its fence decomposition. Finally, we prove that it is possible to characterize semi-binary tree-child networks, a subclass of tree-based networks, in terms of the number of their cherry covers. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 16 pages

arXiv:2407.08810 [pdf, other]

Quantum Plasma Creation near a Magnetar

Authors: Jonathan Zhang, Christopher Thompson

Abstract: Magnetars in quiescent states continue to emit hard X-rays with a power far exceeding the loss of rotational energy. It has recently been noted that this hard X-ray continuum may bear a direct signature of quantum electrodynamic (QED) effects in magnetic fields stronger than the Schwinger field ($B_{\rm Q} = 4.4\times 10^{13}$ G). When the current flowing into the magnetosphere is driven by narrow… ▽ More Magnetars in quiescent states continue to emit hard X-rays with a power far exceeding the loss of rotational energy. It has recently been noted that this hard X-ray continuum may bear a direct signature of quantum electrodynamic (QED) effects in magnetic fields stronger than the Schwinger field ($B_{\rm Q} = 4.4\times 10^{13}$ G). When the current flowing into the magnetosphere is driven by narrow structures in the solid crust, the $e^\pm$ pair plasma supporting the current relaxes to a collisional and trans-relativistic state. The decay of a pair into two photons produces a broad, bremsstrahlung-like spectrum of hard X-rays, similar to that observed and extending up to $0.5-1$ MeV. The conversion of two gamma rays to a pair is further enhanced by a factor $\sim B/B_{\rm Q}$. Monte Carlo calculations of pair creation in a dipole magnetic field are presented. Non-local particle injection is found to be strong enough to suppress the high voltage that otherwise would accompany polar magnetic twist; the hard X-rays are mostly emitted away from the magnetic poles. Some of the pairs annihilate in an optically thin surface layer. The prototypical anomalous X-ray pulsar 1E 2259$+$586, which shows a hard X-ray continuum but relatively weak torque noise, slow spindown, and no radio emission, is a Rosetta Stone for understanding the magnetar circuit, consistent with the picture advanced here. For a $15-60$ keV luminosity as low as $10^{34}$ erg s$^{-1}$, the polar flux of sub-relativistic pairs produces an optical depth $3-30$ to electron cyclotron scattering in the $1-10$ keV band, reducing the net X-ray polarization. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 23 pages, 16 figures, submitted to the Astrophysical Journal

arXiv:2407.08807 [pdf, other]

Whispers from the quantum core: the ringdown of semiclassical stars

Authors: Julio Arrechea, Stefano Liberati, Vania Vellucci

Abstract: This investigation delves into the ringdown signals produced by semiclassical stars, which are ultra-compact, regular solutions of the Einstein equations incorporating stress-energy contributions from quantum vacuum polarization. These stars exhibit an approximately Schwarzschild exterior and an interior composed of a constant-density classical fluid and a cloud of vacuum polarization. By adjustin… ▽ More This investigation delves into the ringdown signals produced by semiclassical stars, which are ultra-compact, regular solutions of the Einstein equations incorporating stress-energy contributions from quantum vacuum polarization. These stars exhibit an approximately Schwarzschild exterior and an interior composed of a constant-density classical fluid and a cloud of vacuum polarization. By adjusting their compactness and density, we can alter the internal structure of these stars without modifying the exterior. This adaptability enables us to examine the sensitivity of the ringdown signal to the innermost regions of the emitting object and to compare it with similar geometries that differ substantially only at the core. Our results indicate that echo signals are intrinsically linked to the presence of stable light rings and can be very sensitive to the internal structure of the emitting object. This point was previously overlooked, either due to the imposition of reflective boundary conditions at the stellar surface or due to the assumption of low curvature interior geometries. Specifically, for stellar-sized semiclassical stars, we find that the interior travel time is sufficiently prolonged to render the echoes effectively unobservable. These findings underscore the potential efficacy of ultra-compact objects as black hole mimickers and emphasize that any phenomenological constraints on such objects necessitate a detailed understanding of their specific properties and core structure. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08801 [pdf, other]

DG-PIC: Domain Generalized Point-In-Context Learning for Point Cloud Understanding

Authors: **cen Jiang, Qianyu Zhou, Yuhang Li, Xuequan Lu, Meili Wang, Lizhuang Ma, Jian Chang, Jian Jun Zhang

Abstract: Recent point cloud understanding research suffers from performance drops on unseen data, due to the distribution shifts across different domains. While recent studies use Domain Generalization (DG) techniques to mitigate this by learning domain-invariant features, most are designed for a single task and neglect the potential of testing data. Despite In-Context Learning (ICL) showcasing multi-task… ▽ More Recent point cloud understanding research suffers from performance drops on unseen data, due to the distribution shifts across different domains. While recent studies use Domain Generalization (DG) techniques to mitigate this by learning domain-invariant features, most are designed for a single task and neglect the potential of testing data. Despite In-Context Learning (ICL) showcasing multi-task learning capability, it usually relies on high-quality context-rich data and considers a single dataset, and has rarely been studied in point cloud understanding. In this paper, we introduce a novel, practical, multi-domain multi-task setting, handling multiple domains and multiple tasks within one unified model for domain generalized point cloud understanding. To this end, we propose Domain Generalized Point-In-Context Learning (DG-PIC) that boosts the generalizability across various tasks and domains at testing time. In particular, we develop dual-level source prototype estimation that considers both global-level shape contextual and local-level geometrical structures for representing source domains and a dual-level test-time feature shifting mechanism that leverages both macro-level domain semantic information and micro-level patch positional relationships to pull the target data closer to the source ones during the testing. Our DG-PIC does not require any model updates during the testing and can handle unseen domains and multiple tasks, \textit{i.e.,} point cloud reconstruction, denoising, and registration, within one unified model. We also introduce a benchmark for this new setting. Comprehensive experiments demonstrate that DG-PIC outperforms state-of-the-art techniques significantly. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Accepted to ECCV 2024

arXiv:2407.08800 [pdf, other]

Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique

Authors: Jackson Hamel, Ming-Jun Lai, Zhaiming Shen, Ye Tian

Abstract: In this work, we propose to use a local clustering approach based on the sparse solution technique to study the medical image, especially the lung cancer image classification task. We view images as the vertices in a weighted graph and the similarity between a pair of images as the edges in the graph. The vertices within the same cluster can be assumed to share similar features and properties, thu… ▽ More In this work, we propose to use a local clustering approach based on the sparse solution technique to study the medical image, especially the lung cancer image classification task. We view images as the vertices in a weighted graph and the similarity between a pair of images as the edges in the graph. The vertices within the same cluster can be assumed to share similar features and properties, thus making the applications of graph clustering techniques very useful for image classification. Recently, the approach based on the sparse solutions of linear systems for graph clustering has been found to identify clusters more efficiently than traditional clustering methods such as spectral clustering. We propose to use the two newly developed local clustering methods based on sparse solution of linear system for image classification. In addition, we employ a box spline-based tight-wavelet-framelet method to clean these images and help build a better adjacency matrix before clustering. The performance of our methods is shown to be very effective in classifying images. Our approach is significantly more efficient and either favorable or equally effective compared with other state-of-the-art approaches. Finally, we shall make a remark by pointing out two image deformation methods to build up more artificial image data to increase the number of labeled images. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08798 [pdf, other]

Electronically-driven switching of topology in LaSbTe

Authors: J. Bannies, M. Michiardi, H. -H. Kung, S. Godin, J. W. Simonson, M. Oudah, M. Zonno, S. Gorovikov, S. Zhdanovich, I. S. Elfimov, A. Damascelli, M. C. Aronson

Abstract: In the past two decades, various classes of topological materials have been discovered, spanning topological insulators, semimetals, and metals. While the observation and understanding of the topology of a material has been a primary focus so far, the precise and easy control of topology in a single material remains largely unexplored. Here, we demonstrate full experimental control over the topolo… ▽ More In the past two decades, various classes of topological materials have been discovered, spanning topological insulators, semimetals, and metals. While the observation and understanding of the topology of a material has been a primary focus so far, the precise and easy control of topology in a single material remains largely unexplored. Here, we demonstrate full experimental control over the topological Dirac nodal loop in the square-net material LaSb$_\mathrm{x}$Te$_\mathrm{2-x}$ by chemical substitution and electron do**. Using angle-resolved photoemission spectroscopy (ARPES), we show that changing the antimony concentration x from 0.9 to 1.0 in the bulk opens a gap as large as 400 meV in the nodal loop. Our symmetry analysis based on single-crystal X-ray diffraction and a minimal tight binding model establishes that the breaking of \textit{n} glide symmetry in the square-net layer is responsible for the opening of the gap. Remarkably, we can also realize this topological phase transition \textit{in situ} on the surface of LaSb$_\mathrm{x}$Te$_\mathrm{2-x}$ by chemical gating using potassium deposition, which enables the reversible switching of the topology from gapped to gapless nodal loop. The underlying control parameter for the structural and topological transition in the bulk and on the surface is the electron concentration. It opens a pathway towards applications in devices based on switching topology by electrostatic gating. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08797 [pdf, other]

Deep Inverse Design for High-Level Synthesis

Authors: ** Chang, Tosiron Adegbija, Yuchao Liao, Claudio Talarico, Ao Li, Janet Roveda

Abstract: High-level synthesis (HLS) has significantly advanced the automation of digital circuits design, yet the need for expertise and time in pragma tuning remains challenging. Existing solutions for the design space exploration (DSE) adopt either heuristic methods, lacking essential information for further optimization potential, or predictive models, missing sufficient generalization due to the time-c… ▽ More High-level synthesis (HLS) has significantly advanced the automation of digital circuits design, yet the need for expertise and time in pragma tuning remains challenging. Existing solutions for the design space exploration (DSE) adopt either heuristic methods, lacking essential information for further optimization potential, or predictive models, missing sufficient generalization due to the time-consuming nature of HLS and the exponential growth of the design space. To address these challenges, we propose Deep Inverse Design for HLS (DID4HLS), a novel approach that integrates graph neural networks and generative models. DID4HLS iteratively optimizes hardware designs aimed at compute-intensive algorithms by learning conditional distributions of design features from post-HLS data. Compared to four state-of-the-art DSE baselines, our method achieved an average improvement of 42.5% on average distance to reference set (ADRS) compared to the best-performing baselines across six benchmarks, while demonstrating high robustness and efficiency. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08792 [pdf, other]

ProxyGPT: Enabling Anonymous Queries in AI Chatbots with (Un)Trustworthy Browser Proxies

Authors: Dzung Pham, Jade Sheffey, Chau Minh Pham, Amir Houmansadr

Abstract: AI-powered chatbots (ChatGPT, Claude, etc.) require users to create an account using their email and phone number, thereby linking their personally identifiable information to their conversational data and usage patterns. As these chatbots are increasingly being used for tasks involving sensitive information, privacy concerns have been raised about how chatbot providers handle user data. To addres… ▽ More AI-powered chatbots (ChatGPT, Claude, etc.) require users to create an account using their email and phone number, thereby linking their personally identifiable information to their conversational data and usage patterns. As these chatbots are increasingly being used for tasks involving sensitive information, privacy concerns have been raised about how chatbot providers handle user data. To address these concerns, we present ProxyGPT, a privacy-enhancing system that enables anonymous queries in popular chatbot platforms. ProxyGPT leverages volunteer proxies to submit user queries on their behalf, thus providing network-level anonymity for chatbot users. The system is designed to support key security properties such as content integrity via TLS-backed data provenance, end-to-end encryption, and anonymous payment, while also ensuring usability and sustainability. We provide a thorough analysis of the privacy, security, and integrity of our system and identify various future research directions, particularly in the area of private chatbot query synthesis. Our human evaluation shows that ProxyGPT offers users a greater sense of privacy compared to traditional AI chatbots, especially in scenarios where users are hesitant to share their identity with chatbot providers. Although our proof-of-concept has higher latency than popular chatbots, our human interview participants consider this to be an acceptable trade-off for anonymity. To the best of our knowledge, ProxyGPT is the first comprehensive proxy-based solution for privacy-preserving AI chatbots. Our codebase is available at https://github.com/dzungvpham/proxygpt. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08788 [pdf, other]

Purifying quantum-dot light in a coherent frequency interface

Authors: Fabrizio Chiriano, Christopher L. Morrison, Joseph Ho, Thomas Jaeken, Alessandro Fedrizzi

Abstract: Quantum networks typically operate in the telecom wavelengths to take advantage of low-loss transmission in optical fibres. However, bright quantum dots (QDs) emitting highly indistinguishable quantum states of light, such as InGaAs QDs, often emit photons in the near infrared thus necessitating frequency conversion (FC) to the telecom band. Furthermore, the signal quality of quantum emissions is… ▽ More Quantum networks typically operate in the telecom wavelengths to take advantage of low-loss transmission in optical fibres. However, bright quantum dots (QDs) emitting highly indistinguishable quantum states of light, such as InGaAs QDs, often emit photons in the near infrared thus necessitating frequency conversion (FC) to the telecom band. Furthermore, the signal quality of quantum emissions is crucial for the effective performance of these networks. In this work we report a method for simultaneously implementing spectral purification and frequency shifting of single photons from QD sources to the C-band in a periodically poled Lithium Niobate waveguide. We consider difference frequency generation in the counter-propagating configuration to implement FC with the output emission bandwidth in units of GHz. Our approach establishes a clear path to integrating high-performance single-emitter sources in a hybrid quantum network. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08783 [pdf, other]

Characterizing Translation-Invariant Bell Inequalities using Tropical Algebra and Graph Polytopes

Authors: Mengyao Hu, Eloïc Vallée, Tim Seynnaeve, Patrick Emonts, Jordi Tura

Abstract: Nonlocality is one of the key features of quantum physics, which is revealed through the violation of a Bell inequality. In large multipartite systems, nonlocality characterization quickly becomes a challenging task. A common practice is to make use of symmetries, low-order correlators, or exploiting local geometries, to restrict the class of inequalities. In this paper, we characterize translatio… ▽ More Nonlocality is one of the key features of quantum physics, which is revealed through the violation of a Bell inequality. In large multipartite systems, nonlocality characterization quickly becomes a challenging task. A common practice is to make use of symmetries, low-order correlators, or exploiting local geometries, to restrict the class of inequalities. In this paper, we characterize translation-invariant (TI) Bell inequalities with finite-range correlators in one-dimensional geometries. We introduce a novel methodology based on tropical algebra tensor networks and highlight its connection to graph theory. Surprisingly, we find that the TI Bell polytope has a number of extremal points that can be uniformly upper-bounded with respect to the system size. We give an efficient method to list all vertices of the polytope for a particular system size, and characterize the tightness of a given TI Bell inequality. The connections highlighted in our work allow us to re-interpret concepts developed in the fields of tropical algebra and graph theory in the context of Bell nonlocality, and vice-versa. This work extends a parallel article [M. Hu \textit{et al.}, arXiv: 2208.02798 (2022)] on the same subject. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 27 pages, 10 figures

arXiv:2407.08781 [pdf, other]

JADES: Spectroscopic Confirmation and Proper Motion for a T-Dwarf at 2 Kiloparsecs

Authors: Kevin N. Hainline, Francesco D'Eugenio, Fengwu Sun, Jakob M. Helton, Brittany E. Miles, MArk S. Marley, Ben W. P. Lew, Jarron M. Leisenring, Andrew J. Bunker, Phillip A. Cargile, Stefano Carniani, Daniel J. Eisenstein, Ignas Juodzbalis, Benjamin D. Johnson, Brant Robertson, Sandro Tacchella, Christina C. Williams, Christopher N. A. Willmer

Abstract: Large area observations of extragalactic deep fields with the James Webb Space Telescope (JWST) have provided a wealth of candidate low-mass L- and T-class brown dwarfs. The existence of these sources, which are at derived distances of hundreds of parsecs to several kiloparsecs from the Sun, has strong implications for the low-mass end of the stellar initial mass function, and the link between sta… ▽ More Large area observations of extragalactic deep fields with the James Webb Space Telescope (JWST) have provided a wealth of candidate low-mass L- and T-class brown dwarfs. The existence of these sources, which are at derived distances of hundreds of parsecs to several kiloparsecs from the Sun, has strong implications for the low-mass end of the stellar initial mass function, and the link between stars and planets at low metallicities. In this letter, we present a JWST/NIRSpec PRISM spectrum of brown dwarf JADES-GS-BD-9, confirming its photometric selection from observations taken as part of the JWST Advanced Deep Extragalactic Survey (JADES) program. Fits to this spectrum indicate that the brown dwarf has an effective temperature of 800-900K (T5 - T6) at a distance of $1.8 - 2.3$kpc from the Sun, with evidence of the source being at low metallicity ([M/H] $\leq -0.5$). Finally, because of the cadence of JADES NIRCam observations of this source, we additionally uncover a proper motion between the 2022 and 2023 centroids, and we measure a proper motion of $20 \pm 4$ mas yr$^{-1}$ (a transverse velocity of 214 km s$^{-1}$ at 2.25 kpc). At this predicted metallicity, distance, and transverse velocity, it is likely that this source belongs either to the edge of the Milky Way thick disk or the galactic halo. This spectral confirmation demonstrates the efficacy of photometric selection of these important sources across deep extragalactic JWST imaging. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 10 pages, 4 figures, submitted to AAS Journals

arXiv:2407.08772 [pdf, ps, other]

A Spectral Atlas of Lyman Alpha Emitters at z = 5.7 and z = 6.6

Authors: A. Songaila, L. L. Cowie, A. J. Barger, E. M. Hu, A. J. Taylor

Abstract: We present two uniformly observed spectroscopic samples of Ly-alpha emitters (LAEs) (127 at z = 5.7 and 82 at z = 6.6), which we use to investigate the evolution of the LAE population at these redshifts. The observations cover a large field (44 sq. deg) in the North Ecliptic Pole (HEROES), as well as several smaller fields. We have a small number of exotic LAEs in the samples: double-peaked Ly-alp… ▽ More We present two uniformly observed spectroscopic samples of Ly-alpha emitters (LAEs) (127 at z = 5.7 and 82 at z = 6.6), which we use to investigate the evolution of the LAE population at these redshifts. The observations cover a large field (44 sq. deg) in the North Ecliptic Pole (HEROES), as well as several smaller fields. We have a small number of exotic LAEs in the samples: double-peaked Ly-alpha profiles; very extended red wings; and one impressive lensed LAE cross. We also find three broad-line AGNs. We compare the Ly-alpha line width measurements at the two redshifts, finding that the lower-luminosity LAEs show a strong evolution of decreasing line width with increasing redshift, while the high-luminosity LAEs do not, with a transition luminosity of log L(Ly-alpha) = 43.25 erg s-1 . Thus, at z = 6.6, the high-luminosity LAEs may be producing large ionized bubbles themselves, or they may be residing in overdense galaxy sites that are producing such bubbles. In order to avoid losses in the red wing, the radius of the ionized bubble must be larger than 1 pMpc. The double-peaked LAEs also require transmission on the blue side. For the four at z = 6.6, we use models to estimate the proximity radii, Ra , where the ionizing flux of the galaxy is sufficient to make the surroundings have a low enough neutral fraction to pass the blue light. Since the required Ra are large, multiple ionizing sources in the vicinity may be needed. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 22 pages, 16 figures, 3 tables. Accepted for publication in The Astrophysical Journal. Table 3 available at https://zenodo.org/records/12693647?token=eyJhbGciOiJIUzUxMiJ9.eyJpZCI6IjlhOThiNGFmLTY4NWUtNDQ3NS1hMTAyLTY5ZGI3NTYxN2ZjZiIsImRhdGEiOnt9LCJyYW5kb20iOiI0MTY4ZDA3MzA0ZDk5MmU5NzMxNDA0MTFkOTlhNzQ0YyJ9.85LDgYQl5ZOfjvAXzGZONlvzmuPz-Sb12fiuNq_Q6xtsg1WRHNKOE0JdFT3uo68LMz7EI1SOeukdkmvE3wN2dA

arXiv:2407.08770 [pdf, other]

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Authors: Huanqian Wang, Yang Yue, Rui Lu, **gxin Shi, Andrew Zhao, Shenzhi Wang, Shiji Song, Gao Huang

Abstract: Large Language Models (LLMs) have demonstrated great potential as generalist assistants, showcasing powerful task understanding and problem-solving capabilities. To deploy LLMs as AI assistants, it is crucial that these models exhibit desirable behavioral traits, such as non-toxicity and resilience against jailbreak attempts. Current methods for detoxification or preventing jailbreaking usually in… ▽ More Large Language Models (LLMs) have demonstrated great potential as generalist assistants, showcasing powerful task understanding and problem-solving capabilities. To deploy LLMs as AI assistants, it is crucial that these models exhibit desirable behavioral traits, such as non-toxicity and resilience against jailbreak attempts. Current methods for detoxification or preventing jailbreaking usually involve Supervised Fine-Tuning (SFT) or Reinforcement Learning from Human Feedback (RLHF), which requires finetuning billions of parameters through gradient descent with substantial computation cost. Furthermore, models modified through SFT and RLHF may deviate from the pretrained models, potentially leading to a degradation in foundational LLM capabilities. In this paper, we observe that surprisingly, directly editing a small subset of parameters can effectively modulate specific behaviors of LLMs, such as detoxification and resistance to jailbreaking. Specifically, for a behavior that we aim to avoid, we employ a linear classifier, which we term the behavior probe, to classify binary behavior labels within the hidden state space of the LLM. Using this probe, we introduce an algorithm to identify a critical subset of LLM parameters that significantly influence this targeted behavior. Then we directly edit these selected parameters by shifting them towards the behavior probe. Such a direct parameter editing method necessitates only inference-level computational resources. Experiments demonstrate that in the representative detoxification task, our approach achieves reductions of up to 90.0\% in toxicity on the RealToxicityPrompts dataset and 49.2\% on ToxiGen, while maintaining the LLM's general capabilities in areas such as common sense, question answering, and mathematics. Our code is available at https://github.com/lucywang720/model-surgery. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 23 pages, 14 figures

MSC Class: 68T50 (Primary) 68T07; 62M45 (Secondary) ACM Class: I.2.7

arXiv:2407.08769 [pdf, other]

AuNR-SMA: Automated Gold Nanorod Spectral Morphology Analysis Pipeline

Authors: Samuel P. Gleason, Jakob C. Dahl, Mahmoud Elzouka, Xingzhi Wang, Dana O. Byrne, Mumtaz Gababa, Hannah Cho, Ravi Prasher, Sean Lubner, Emory Chan, A. Paul Alivisatos

Abstract: The development of a colloidal synthesis procedure to produce nanomaterials of a specific size with high shape and size purity is often a time consuming, iterative process. This is often due to the time, resource and expertise intensive characterization methods required for quantitative determination of nanomaterial size and shape. Absorption spectroscopy is often the easiest method of colloidal n… ▽ More The development of a colloidal synthesis procedure to produce nanomaterials of a specific size with high shape and size purity is often a time consuming, iterative process. This is often due to the time, resource and expertise intensive characterization methods required for quantitative determination of nanomaterial size and shape. Absorption spectroscopy is often the easiest method of colloidal nanomaterial characterization, however, due to the lack of a reliable method to extract nanoparticle shapes from absorption spectroscopy, it is generally treated as a more qualitative measure for metal nanoparticles. This work demonstrates a gold nanorod (AuNR) spectral morphology analysis (SMA) tool, AuNR-SMA, which is a fast and accurate method to extract quantitative information about an AuNR sample's structural parameters from its absorption spectra. We apply AuNR-SMA in three distinct applications. First, we demonstrate its utility as an automated analysis tool in a high throughput AuNR synthesis procedure by generating quantitative size information from optical spectra. Second, we use the predictions generated by this model to train a machine learning model capable of predicting the resulting AuNR size distributions from the reaction conditions used to synthesize them. Third, we turn this model to spectra extracted from the literature where no size distributions are reported to impute unreported quantitative information of AuNR synthesis. This approach can potentially be extended to any other nanocrystal system where the absorption spectra are size dependent and accurate numerical simulation of the absorption spectra is possible. In addition, this pipeline could be integrated into automated synthesis apparatuses to provide interpretable data from simple measurements and help explore the synthesis science of nanoparticles in a rational manner or facilitate closed-loop workflows. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08766 [pdf, other]

Revisiting $C$ and $CP$ Violation in $η\to π^+π^-π^0$ Decay

Authors: Jun Shi, Jian Liang, Susan Gardner

Abstract: The decay $η\to π^+π^-π^0$ is an ideal process in which to study flavor-conserving $C$ and $CP$ violation beyond the Standard Model. We deduce the $C$- and $CP$-odd quark operators that contribute to $η\to π^+π^-π^0$ originating from the mass-dimension 6 Standard Model effective field theory. The corresponding hadron-level operators that generate a non-vanishing $I=0$ amplitude at order $p^6$ in t… ▽ More The decay $η\to π^+π^-π^0$ is an ideal process in which to study flavor-conserving $C$ and $CP$ violation beyond the Standard Model. We deduce the $C$- and $CP$-odd quark operators that contribute to $η\to π^+π^-π^0$ originating from the mass-dimension 6 Standard Model effective field theory. The corresponding hadron-level operators that generate a non-vanishing $I=0$ amplitude at order $p^6$ in the chiral effective theory are presented for the first time, in addition to the leading order operatorsascribed to the $I=2$ final state. By fitting the KLOE-2 and the most recent BESIII experimental data, we determine the coefficients of the lowest order $I=0$ and $I=2$ amplitudes and estimate the potential new physics energy scale. We also perform an impact study of the future $η\to π^+π^-π^0$ experiments. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 11 pages, 4 figures

arXiv:2407.08764 [pdf, other]

The Radiation Gauge: When is it Valid?

Authors: Jie Zhu, Christopher J. Ryu, Dong-Yeop Na, Weng Cho Chew

Abstract: In this paper, we shall show that the vector-scalar potential ($\mathbf{A}$-$Φ$) formulation, for many problems, can be further simplified by ignoring the scalar potential contribution and setting it to zero. In this paper, we shall show that the vector-scalar potential ($\mathbf{A}$-$Φ$) formulation, for many problems, can be further simplified by ignoring the scalar potential contribution and setting it to zero. △ Less

Submitted 10 July, 2024; originally announced July 2024.

arXiv:2407.08761 [pdf, other]

Bayesian analysis for pretest-posttest binary outcomes with adaptive significance levels

Authors: Alejandra Estefanía Patiño Hoyos, Johnatan Cardona Jiménez

Abstract: Count outcomes in longitudinal studies are frequent in clinical and engineering studies. In frequentist and Bayesian statistical analysis, methods such as Mixed linear models allow the variability or correlation within individuals to be taken into account. However, in more straightforward scenarios, where only two stages of an experiment are observed (pre-treatment vs. post-treatment), there are o… ▽ More Count outcomes in longitudinal studies are frequent in clinical and engineering studies. In frequentist and Bayesian statistical analysis, methods such as Mixed linear models allow the variability or correlation within individuals to be taken into account. However, in more straightforward scenarios, where only two stages of an experiment are observed (pre-treatment vs. post-treatment), there are only a few tools available, mainly for continuous outcomes. Thus, this work introduces a Bayesian statistical methodology for comparing paired samples in binary pretest-posttest scenarios. We establish a Bayesian probabilistic model for the inferential analysis of the unknown quantities, which is validated and refined through simulation analyses, and present an application to a dataset taken from the Television School and Family Smoking Prevention and Cessation Project (TVSFP) (Flay et al., 1995). The application of the Full Bayesian Significance Test (FBST) for precise hypothesis testing, along with the implementation of adaptive significance levels in the decision-making process, is included. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.08760 [pdf, ps, other]

Hydrodynamics as the effective field theory of strong-to-weak spontaneous symmetry breaking

Authors: Xiaoyang Huang, Marvin Qi, Jian-Hao Zhang, Andrew Lucas

Abstract: Inspired by the hunt for new phases of matter in quantum mixed states, it has recently been proposed that the equivalence of microcanonical and canonical ensembles in statistical mechanics is a manifestation of strong-to-weak spontaneous symmetry breaking (SWSSB) in an underlying many-body quantum description. Here, we build an effective field theory for SWSSB of a global U(1) symmetry; the answer… ▽ More Inspired by the hunt for new phases of matter in quantum mixed states, it has recently been proposed that the equivalence of microcanonical and canonical ensembles in statistical mechanics is a manifestation of strong-to-weak spontaneous symmetry breaking (SWSSB) in an underlying many-body quantum description. Here, we build an effective field theory for SWSSB of a global U(1) symmetry; the answer exactly reproduces the Schwinger-Keldysh effective field theory of diffusion for the conserved charge. We conclude that hydrodynamics can be understood as a theory of "superfluidity" for the broken strong symmetry: a non-vanishing susceptibility is a measurable order parameter for SWSSB, the diffusion mode is the Goldstone boson of the spontaneously broken continuous symmetry, and a generalization of Goldstone's Theorem implies that the diffusion mode is always long-lived. This perspective provides a transparent physical explanation for the unusual "reparameterization" symmetries which are a necessary ingredient of Schwinger-Keldysh effective field theories for "normal fluids". △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.08758 [pdf]

Credit Card Fraud Detection in the Nigerian Financial Sector: A Comparison of Unsupervised TensorFlow-Based Anomaly Detection Techniques, Autoencoders and PCA Algorithm

Authors: Jennifer Onyeama

Abstract: Credit card fraud is a major cause of national concern in the Nigerian financial sector, affecting hundreds of transactions per second and impacting international ecommerce negatively. Despite the rapid spread and adoption of online marketing, millions of Nigerians are prevented from transacting in several countries with local credit cards due to bans and policies directed at restricting credit ca… ▽ More Credit card fraud is a major cause of national concern in the Nigerian financial sector, affecting hundreds of transactions per second and impacting international ecommerce negatively. Despite the rapid spread and adoption of online marketing, millions of Nigerians are prevented from transacting in several countries with local credit cards due to bans and policies directed at restricting credit card fraud. Presently, a myriad of technologies exist to detect fraudulent transactions, a few of which are adopted by Nigerian financial institutions to proactively manage the situation. Fraud detection allows institutions to restrict offenders from networks and with a centralized banking identity management system, such as the Bank Verification Number used by the Central Bank of Nigeria, offenders who may have stolen other identities can be backtraced and their bank accounts frozen. This paper aims to compare the effectiveness of two fraud detection technologies that are projected to work fully independent of human intervention to possibly predict and detect fraudulent credit card transactions. Autoencoders as an unsupervised tensorflow based anomaly detection technique generally offers greater performance in dimensionality reduction than the Principal Component Analysis, and this theory was tested out on Nigerian credit card transaction data. Results demonstrate that autoencoders are better suited to analyzing complex and extensive datasets and offer more reliable results with minimal mislabeling than the PCA algorithm. △ Less

Submitted 8 March, 2024; originally announced July 2024.

Comments: Pre-STATA compression, pre-analysis, full-scale raw data-set: https://github.com/Pzalms/Credit-Card-Fraud-Detection-using-Nigerian-Bank-Data/blob/main/dataset.csv

arXiv:2407.08751 [pdf, other]

Latent Diffusion for Neural Spiking Data

Authors: Jaivardhan Kapoor, Auguste Schulz, Julius Vetter, Felix Pei, Richard Gao, Jakob H. Macke

Abstract: Modern datasets in neuroscience enable unprecedented inquiries into the relationship between complex behaviors and the activity of many simultaneously recorded neurons. While latent variable models can successfully extract low-dimensional embeddings from such recordings, using them to generate realistic spiking data, especially in a behavior-dependent manner, still poses a challenge. Here, we pres… ▽ More Modern datasets in neuroscience enable unprecedented inquiries into the relationship between complex behaviors and the activity of many simultaneously recorded neurons. While latent variable models can successfully extract low-dimensional embeddings from such recordings, using them to generate realistic spiking data, especially in a behavior-dependent manner, still poses a challenge. Here, we present Latent Diffusion for Neural Spiking data (LDNS), a diffusion-based generative model with a low-dimensional latent space: LDNS employs an autoencoder with structured state-space (S4) layers to project discrete high-dimensional spiking data into continuous time-aligned latents. On these inferred latents, we train expressive (conditional) diffusion models, enabling us to sample neural activity with realistic single-neuron and population spiking statistics. We validate LDNS on synthetic data, accurately recovering latent structure, firing rates, and spiking statistics. Next, we demonstrate its flexibility by generating variable-length data that mimics human cortical activity during attempted speech. We show how to equip LDNS with an expressive observation model that accounts for single-neuron dynamics not mediated by the latent state, further increasing the realism of generated samples. Finally, conditional LDNS trained on motor cortical activity during diverse reaching behaviors can generate realistic spiking data given reach direction or unseen reach trajectories. In summary, LDNS simultaneously enables inference of low-dimensional latents and realistic conditional generation of neural spiking datasets, opening up further possibilities for simulating experimentally testable hypotheses. △ Less

Submitted 27 June, 2024; originally announced July 2024.

arXiv:2407.08750 [pdf, other]

ROLCH: Regularized Online Learning for Conditional Heteroskedasticity

Authors: Simon Hirsch, Jonathan Berrisch, Florian Ziel

Abstract: Large-scale streaming data are common in modern machine learning applications and have led to the development of online learning algorithms. Many fields, such as supply chain management, weather and meteorology, energy markets, and finance, have pivoted towards using probabilistic forecasts, which yields the need not only for accurate learning of the expected value but also for learning the condit… ▽ More Large-scale streaming data are common in modern machine learning applications and have led to the development of online learning algorithms. Many fields, such as supply chain management, weather and meteorology, energy markets, and finance, have pivoted towards using probabilistic forecasts, which yields the need not only for accurate learning of the expected value but also for learning the conditional heteroskedasticity. Against this backdrop, we present a methodology for online estimation of regularized linear distributional models for conditional heteroskedasticity. The proposed algorithm is based on a combination of recent developments for the online estimation of LASSO models and the well-known GAMLSS framework. We provide a case study on day-ahead electricity price forecasting, in which we show the competitive performance of the adaptive estimation combined with strongly reduced computational effort. Our algorithms are implemented in a computationally efficient Python package. △ Less

Submitted 26 June, 2024; originally announced July 2024.

arXiv:2407.08745 [pdf, other]

Evolutionary Computation for the Design and Enrichment of General-Purpose Artificial Intelligence Systems: Survey and Prospects

Authors: Javier Poyatos, Javier Del Ser, Salvador Garcia, Hisao Ishibuchi, Daniel Molina, Isaac Triguero, Bing Xue, Xin Yao, Francisco Herrera

Abstract: In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal de… ▽ More In Artificial Intelligence, there is an increasing demand for adaptive models capable of dealing with a diverse spectrum of learning tasks, surpassing the limitations of systems devised to cope with a single task. The recent emergence of General-Purpose Artificial Intelligence Systems (GPAIS) poses model configuration and adaptability challenges at far greater complexity scales than the optimal design of traditional Machine Learning models. Evolutionary Computation (EC) has been a useful tool for both the design and optimization of Machine Learning models, endowing them with the capability to configure and/or adapt themselves to the task under consideration. Therefore, their application to GPAIS is a natural choice. This paper aims to analyze the role of EC in the field of GPAIS, exploring the use of EC for their design or enrichment. We also match GPAIS properties to Machine Learning areas in which EC has had a notable contribution, highlighting recent milestones of EC for GPAIS. Furthermore, we discuss the challenges of harnessing the benefits of EC for GPAIS, presenting different strategies to both design and improve GPAIS with EC, covering tangential areas, identifying research niches, and outlining potential research directions for EC and GPAIS. △ Less

Submitted 3 June, 2024; originally announced July 2024.

arXiv:2407.08739 [pdf, other]

MAVIS: Mathematical Visual Instruction Tuning

Authors: Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li

Abstract: Multi-modal Large Language Models (MLLMs) have recently emerged as a significant focus in academia and industry. Despite their proficiency in general multi-modal scenarios, the mathematical problem-solving capabilities in visual contexts remain insufficiently explored. We identify three key areas within MLLMs that need to be improved: visual encoding of math diagrams, diagram-language alignment, a… ▽ More Multi-modal Large Language Models (MLLMs) have recently emerged as a significant focus in academia and industry. Despite their proficiency in general multi-modal scenarios, the mathematical problem-solving capabilities in visual contexts remain insufficiently explored. We identify three key areas within MLLMs that need to be improved: visual encoding of math diagrams, diagram-language alignment, and mathematical reasoning skills. This draws forth an urgent demand for large-scale, high-quality data and training pipelines in visual mathematics. In this paper, we propose MAVIS, the first MAthematical VISual instruction tuning paradigm for MLLMs, involving a series of mathematical visual datasets and specialized MLLMs. Targeting the three issues, MAVIS contains three progressive training stages from scratch. First, we curate MAVIS-Caption, consisting of 558K diagram-caption pairs, to fine-tune a math-specific vision encoder (CLIP-Math) through contrastive learning, tailored for improved diagram visual encoding. Second, we utilize MAVIS-Caption to align the CLIP-Math with a large language model (LLM) by a projection layer, enhancing vision-language alignment in mathematical domains. Third, we introduce MAVIS-Instruct, including 900K meticulously collected and annotated visual math problems, which is adopted to finally instruct-tune the MLLM for robust mathematical reasoning skills. In MAVIS-Instruct, we incorporate complete chain-of-thought (CoT) rationales for each problem, and minimize textual redundancy, thereby concentrating the model towards the visual elements. Data and Models are released at https://github.com/ZrrSkywalker/MAVIS △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Work in progress. Data and Models are released at https://github.com/ZrrSkywalker/MAVIS

arXiv:2407.08734 [pdf, other]

Transformer Circuit Faithfulness Metrics are not Robust

Authors: Joseph Miller, Bilal Chughtai, William Saunders

Abstract: Mechanistic interpretability work attempts to reverse engineer the learned algorithms present inside neural networks. One focus of this work has been to discover 'circuits' -- subgraphs of the full model that explain behaviour on specific tasks. But how do we measure the performance of such circuits? Prior work has attempted to measure circuit 'faithfulness' -- the degree to which the circuit repl… ▽ More Mechanistic interpretability work attempts to reverse engineer the learned algorithms present inside neural networks. One focus of this work has been to discover 'circuits' -- subgraphs of the full model that explain behaviour on specific tasks. But how do we measure the performance of such circuits? Prior work has attempted to measure circuit 'faithfulness' -- the degree to which the circuit replicates the performance of the full model. In this work, we survey many considerations for designing experiments that measure circuit faithfulness by ablating portions of the model's computation. Concerningly, we find existing methods are highly sensitive to seemingly insignificant changes in the ablation methodology. We conclude that existing circuit faithfulness scores reflect both the methodological choices of researchers as well as the actual components of the circuit - the task a circuit is required to perform depends on the ablation used to test it. The ultimate goal of mechanistic interpretability work is to understand neural networks, so we emphasize the need for more clarity in the precise claims being made about circuits. We open source a library at https://github.com/UFO-101/auto-circuit that includes highly efficient implementations of a wide range of ablation methodologies and circuit discovery algorithms. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: CoLM 2024 Conference Paper. 11 page main body. 11 page appendix. 12 figures

arXiv:2407.08733 [pdf, other]

Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist

Authors: Zihao Zhou, Shudong Liu, Maizhen Ning, Wei Liu, **dong Wang, Derek F. Wong, Xiaowei Huang, Qiufeng Wang, Kaizhu Huang

Abstract: Exceptional mathematical reasoning ability is one of the key features that demonstrate the power of large language models (LLMs). How to comprehensively define and evaluate the mathematical abilities of LLMs, and even reflect the user experience in real-world scenarios, has emerged as a critical issue. Current benchmarks predominantly concentrate on problem-solving capabilities, which presents a s… ▽ More Exceptional mathematical reasoning ability is one of the key features that demonstrate the power of large language models (LLMs). How to comprehensively define and evaluate the mathematical abilities of LLMs, and even reflect the user experience in real-world scenarios, has emerged as a critical issue. Current benchmarks predominantly concentrate on problem-solving capabilities, which presents a substantial risk of model overfitting and fails to accurately represent genuine mathematical reasoning abilities. In this paper, we argue that if a model really understands a problem, it should be robustly and readily applied across a diverse array of tasks. Motivated by this, we introduce MATHCHECK, a well-designed checklist for testing task generalization and reasoning robustness, as well as an automatic tool to generate checklists efficiently. MATHCHECK includes multiple mathematical reasoning tasks and robustness test types to facilitate a comprehensive evaluation of both mathematical reasoning ability and behavior testing. Utilizing MATHCHECK, we develop MATHCHECK-GSM and MATHCHECK-GEO to assess mathematical textual reasoning and multi-modal reasoning capabilities, respectively, serving as upgraded versions of benchmarks including GSM8k, GeoQA, UniGeo, and Geometry3K. We adopt MATHCHECK-GSM and MATHCHECK-GEO to evaluate over 20 LLMs and 11 MLLMs, assessing their comprehensive mathematical reasoning abilities. Our results demonstrate that while frontier LLMs like GPT-4o continue to excel in various abilities on the checklist, many other model families exhibit a significant decline. Further experiments indicate that, compared to traditional math benchmarks, MATHCHECK better reflects true mathematical abilities and represents mathematical intelligence more linearly, thereby supporting our design. On our MATHCHECK, we can easily conduct detailed behavior analysis to deeply investigate models. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 35 pages, 10 figures, preprint

arXiv:2407.08731 [pdf, other]

Massive-ish Particles from Small-ish Scales: Non-Perturbative Techniques for Cosmological Collider Physics from Large-Scale Structure Surveys

Authors: Samuel Goldstein, Oliver H. E. Philcox, J. Colin Hill, Lam Hui

Abstract: Massive particles produced during inflation impact soft limits of primordial correlators. Searching for these signatures presents an exciting opportunity to uncover the particle spectrum in the inflationary epoch. We present non-perturbative methods to constrain intermediate-mass scalars ($0\leq m/H<3/2$, where $H$ is the inflationary Hubble scale) produced during inflation, which give rise to a p… ▽ More Massive particles produced during inflation impact soft limits of primordial correlators. Searching for these signatures presents an exciting opportunity to uncover the particle spectrum in the inflationary epoch. We present non-perturbative methods to constrain intermediate-mass scalars ($0\leq m/H<3/2$, where $H$ is the inflationary Hubble scale) produced during inflation, which give rise to a power-law scaling in the squeezed primordial bispectrum. Exploiting the large-scale structure consistency relations and the separate universe approach, we derive models for the late-time squeezed matter bispectrum and collapsed matter trispectrum sourced by these fields. To validate our models, we run $N$-body simulations with the "Cosmological Collider" squeezed bispectrum for two different particle masses. Our models yield unbiased constraints on the amplitude of non-Gaussianity, $f_{\rm NL}^Δ$, from the squeezed bispectrum and collapsed trispectrum deep into the non-linear regime ($k_{\rm max}\approx 2~h/{\rm Mpc}$ at $z=0$). We assess the information content of these summary statistics, emphasizing the importance of sample variance cancellation in the matter sector. We also study the scale-dependent halo bias in our simulations. For mass-selected halos, the non-Gaussian bias estimated from our simulations agrees with predictions based on (i) separate universe simulations and (ii) universal mass functions. With further work, these results can be used to search for inflationary massive particle production with upcoming galaxy surveys. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: 25 pages, 11 figures; comments welcome

arXiv:2407.08726 [pdf, other]

Map It Anywhere (MIA): Empowering Bird's Eye View Map** using Large-scale Public Data

Authors: Cherie Ho, Jiaye Zou, Omar Alama, Sai Mitheran Jagadesh Kumar, Benjamin Chiang, Taneesh Gupta, Chen Wang, Nikhil Keetha, Katia Sycara, Sebastian Scherer

Abstract: Top-down Bird's Eye View (BEV) maps are a popular representation for ground robot navigation due to their richness and flexibility for downstream tasks. While recent methods have shown promise for predicting BEV maps from First-Person View (FPV) images, their generalizability is limited to small regions captured by current autonomous vehicle-based datasets. In this context, we show that a more sca… ▽ More Top-down Bird's Eye View (BEV) maps are a popular representation for ground robot navigation due to their richness and flexibility for downstream tasks. While recent methods have shown promise for predicting BEV maps from First-Person View (FPV) images, their generalizability is limited to small regions captured by current autonomous vehicle-based datasets. In this context, we show that a more scalable approach towards generalizable map prediction can be enabled by using two large-scale crowd-sourced map** platforms, Mapillary for FPV images and OpenStreetMap for BEV semantic maps. We introduce Map It Anywhere (MIA), a data engine that enables seamless curation and modeling of labeled map prediction data from existing open-source map platforms. Using our MIA data engine, we display the ease of automatically collecting a dataset of 1.2 million pairs of FPV images & BEV maps encompassing diverse geographies, landscapes, environmental factors, camera models & capture scenarios. We further train a simple camera model-agnostic model on this data for BEV map prediction. Extensive evaluations using established benchmarks and our dataset show that the data curated by MIA enables effective pretraining for generalizable BEV map prediction, with zero-shot performance far exceeding baselines trained on existing datasets by 35%. Our analysis highlights the promise of using large-scale public maps for develo** & testing generalizable BEV perception, paving the way for more robust autonomous navigation. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08725 [pdf, other]

MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces

Authors: Wayne Wu, Honglin He, Yiran Wang, Chenda Duan, Jack He, Zhizheng Liu, Quanyi Li, Bolei Zhou

Abstract: Public urban spaces like streetscapes and plazas serve residents and accommodate social life in all its vibrant variations. Recent advances in Robotics and Embodied AI make public urban spaces no longer exclusive to humans. Food delivery bots and electric wheelchairs have started sharing sidewalks with pedestrians, while diverse robot dogs and humanoids have recently emerged in the street. Ensurin… ▽ More Public urban spaces like streetscapes and plazas serve residents and accommodate social life in all its vibrant variations. Recent advances in Robotics and Embodied AI make public urban spaces no longer exclusive to humans. Food delivery bots and electric wheelchairs have started sharing sidewalks with pedestrians, while diverse robot dogs and humanoids have recently emerged in the street. Ensuring the generalizability and safety of these forthcoming mobile machines is crucial when navigating through the bustling streets in urban spaces. In this work, we present MetaUrban, a compositional simulation platform for Embodied AI research in urban spaces. MetaUrban can construct an infinite number of interactive urban scenes from compositional elements, covering a vast array of ground plans, object placements, pedestrians, vulnerable road users, and other mobile agents' appearances and dynamics. We design point navigation and social navigation tasks as the pilot study using MetaUrban for embodied AI research and establish various baselines of Reinforcement Learning and Imitation Learning. Experiments demonstrate that the compositional nature of the simulated environments can substantially improve the generalizability and safety of the trained mobile agents. MetaUrban will be made publicly available to provide more research opportunities and foster safe and trustworthy embodied AI in urban spaces. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Technical report. Project page: https://metadriverse.github.io/metaurban/

arXiv:2407.08713 [pdf, other]

GTA: A Benchmark for General Tool Agents

Authors: Jize Wang, Zerun Ma, Yining Li, Songyang Zhang, Cailian Chen, Kai Chen, Xinyi Le

Abstract: Significant focus has been placed on integrating large language models (LLMs) with various tools in develo** general-purpose agents. This poses a challenge to LLMs' tool-use capabilities. However, there are evident gaps between existing tool-use evaluations and real-world scenarios. Current evaluations often use AI-generated queries, single-step tasks, dummy tools, and text-only interactions, fa… ▽ More Significant focus has been placed on integrating large language models (LLMs) with various tools in develo** general-purpose agents. This poses a challenge to LLMs' tool-use capabilities. However, there are evident gaps between existing tool-use evaluations and real-world scenarios. Current evaluations often use AI-generated queries, single-step tasks, dummy tools, and text-only interactions, failing to reveal the agents' real-world problem-solving abilities effectively. To address this, we propose GTA, a benchmark for General Tool Agents, featuring three main aspects: (i) Real user queries: human-written queries with simple real-world objectives but implicit tool-use, requiring the LLM to reason the suitable tools and plan the solution steps. (ii) Real deployed tools: an evaluation platform equipped with tools across perception, operation, logic, and creativity categories to evaluate the agents' actual task execution performance. (iii) Real multimodal inputs: authentic image files, such as spatial scenes, web page screenshots, tables, code snippets, and printed/handwritten materials, used as the query contexts to align with real-world scenarios closely. We design 229 real-world tasks and executable tool chains to evaluate mainstream LLMs. Our findings show that real-world user queries are challenging for existing LLMs, with GPT-4 completing less than 50% of the tasks and most LLMs achieving below 25%. This evaluation reveals the bottlenecks in the tool-use capabilities of current LLMs in real-world scenarios, which provides future direction for advancing general-purpose tool agents. The code and dataset are available at https://github.com/open-compass/GTA. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Github repo: https://github.com/open-compass/GTA

arXiv:2407.08711 [pdf, other]

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Authors: Akshay Krishnan, Abhijit Kundu, Kevis-Kokitsi Maninis, James Hays, Matthew Brown

Abstract: We propose OmniNOCS, a large-scale monocular dataset with 3D Normalized Object Coordinate Space (NOCS) maps, object masks, and 3D bounding box annotations for indoor and outdoor scenes. OmniNOCS has 20 times more object classes and 200 times more instances than existing NOCS datasets (NOCS-Real275, Wild6D). We use OmniNOCS to train a novel, transformer-based monocular NOCS prediction model (NOCSfo… ▽ More We propose OmniNOCS, a large-scale monocular dataset with 3D Normalized Object Coordinate Space (NOCS) maps, object masks, and 3D bounding box annotations for indoor and outdoor scenes. OmniNOCS has 20 times more object classes and 200 times more instances than existing NOCS datasets (NOCS-Real275, Wild6D). We use OmniNOCS to train a novel, transformer-based monocular NOCS prediction model (NOCSformer) that can predict accurate NOCS, instance masks and poses from 2D object detections across diverse classes. It is the first NOCS model that can generalize to a broad range of classes when prompted with 2D boxes. We evaluate our model on the task of 3D oriented bounding box prediction, where it achieves comparable results to state-of-the-art 3D detection methods such as Cube R-CNN. Unlike other 3D detection methods, our model also provides detailed and accurate 3D object shape and segmentation. We propose a novel benchmark for the task of NOCS prediction based on OmniNOCS, which we hope will serve as a useful baseline for future work in this area. Our dataset and code will be at the project website: https://omninocs.github.io. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Accepted to ECCV 2024, project website: https://omninocs.github.io

arXiv:2407.08710 [pdf, other]

End-to-End Orchestration of NextG Media Services over the Distributed Compute Continuum

Authors: Alessandro Mauro, Antonia Maria Tulino, Jaime Llorca

Abstract: NextG (5G and beyond) networks, through the increasing integration of cloud/edge computing technologies, are becoming highly distributed compute platforms ideally suited to host emerging resource-intensive and latency-sensitive applications (e.g., industrial automation, extended reality, distributed AI). The end-to-end orchestration of such demanding applications, which involves function/data plac… ▽ More NextG (5G and beyond) networks, through the increasing integration of cloud/edge computing technologies, are becoming highly distributed compute platforms ideally suited to host emerging resource-intensive and latency-sensitive applications (e.g., industrial automation, extended reality, distributed AI). The end-to-end orchestration of such demanding applications, which involves function/data placement, flow routing, and joint communication/computation/storage resource allocation, requires new models and algorithms able to capture: (i) their disaggregated microservice-based architecture, (ii) their complex processing graph structures, including multiple-input multiple-output processing stages, and (iii) the opportunities for efficiently sharing and replicating data streams that may be useful for multiple functions and/or end users. To this end, we first identify the technical gaps in existing literature that prevent efficiently addressing the optimal orchestration of emerging applications described by information-aware directed acyclic graphs (DAGs). We then leverage the recently proposed Cloud Network Flow optimization framework and a novel functionally-equivalent DAG-to-Forest graph transformation procedure to design IDAGO (Information-Aware DAG Orchestration), a polynomial-time multi-criteria approximation algorithm for the optimal orchestration of NextG media services over NextG compute-integrated networks. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08709 [pdf, other]

Hierarchical Bayesian estimation of motor-evoked potential recruitment curves yields accurate and robust estimates

Authors: Vishweshwar Tyagi, Lynda M. Murray, Ahmet S. Asan, Christopher Mandigo, Michael S. Virk, Noam Y. Harel, Jason B. Carmel, James R. McIntosh

Abstract: Electromagnetic stimulation probes and modulates the neural systems that control movement. Key to understanding their effects is the muscle recruitment curve, which maps evoked potential size against stimulation intensity. Current methods to estimate curve parameters require large samples; however, obtaining these is often impractical due to experimental constraints. Here, we present a hierarchica… ▽ More Electromagnetic stimulation probes and modulates the neural systems that control movement. Key to understanding their effects is the muscle recruitment curve, which maps evoked potential size against stimulation intensity. Current methods to estimate curve parameters require large samples; however, obtaining these is often impractical due to experimental constraints. Here, we present a hierarchical Bayesian framework that accounts for small samples, handles outliers, simulates high-fidelity data, and returns a posterior distribution over curve parameters that quantify estimation uncertainty. It uses a rectified-logistic function that estimates motor threshold and outperforms conventionally used sigmoidal alternatives in predictive performance, as demonstrated through cross-validation. In simulations, our method outperforms non-hierarchical models by reducing threshold estimation error on sparse data and requires fewer participants to detect shifts in threshold compared to frequentist testing. We present two common use cases involving electrical and electromagnetic stimulation data and provide an open-source library for Python, called hbMEP, for diverse applications. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08706 [pdf, other]

HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models

Authors: Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang

Abstract: High-resolution inputs enable Large Vision-Language Models (LVLMs) to discern finer visual details, enhancing their comprehension capabilities. To reduce the training and computation costs caused by high-resolution input, one promising direction is to use sliding windows to slice the input into uniform patches, each matching the input size of the well-trained vision encoder. Although efficient, th… ▽ More High-resolution inputs enable Large Vision-Language Models (LVLMs) to discern finer visual details, enhancing their comprehension capabilities. To reduce the training and computation costs caused by high-resolution input, one promising direction is to use sliding windows to slice the input into uniform patches, each matching the input size of the well-trained vision encoder. Although efficient, this slicing strategy leads to the fragmentation of original input, i.e., the continuity of contextual information and spatial geometry is lost across patches, adversely affecting performance in cross-patch context perception and position-specific tasks. To overcome these shortcomings, we introduce HiRes-LLaVA, a novel framework designed to efficiently process any size of high-resolution input without altering the original contextual and geometric information. HiRes-LLaVA comprises two innovative components: (i) a SliceRestore adapter that reconstructs sliced patches into their original form, efficiently extracting both global and local features via down-up-sampling and convolution layers, and (ii) a Self-Mining Sampler to compresses the vision tokens based on themselves, preserving the original context and positional information while reducing training overhead. To assess the ability of handling context fragmentation, we construct a new benchmark, EntityGrid-QA, consisting of edge-related and position-related tasks. Our comprehensive experiments demonstrate the superiority of HiRes-LLaVA on both existing public benchmarks and on EntityGrid-QA, particularly on document-oriented tasks, establishing new standards for handling high-resolution inputs. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08704 [pdf, other]

Towards Efficient Deployment of Hybrid SNNs on Neuromorphic and Edge AI Hardware

Authors: James Seekings, Peyton Chandarana, Mahsa Ardakani, MohammadReza Mohammadi, Ramtin Zand

Abstract: This paper explores the synergistic potential of neuromorphic and edge computing to create a versatile machine learning (ML) system tailored for processing data captured by dynamic vision sensors. We construct and train hybrid models, blending spiking neural networks (SNNs) and artificial neural networks (ANNs) using PyTorch and Lava frameworks. Our hybrid architecture integrates an SNN for tempor… ▽ More This paper explores the synergistic potential of neuromorphic and edge computing to create a versatile machine learning (ML) system tailored for processing data captured by dynamic vision sensors. We construct and train hybrid models, blending spiking neural networks (SNNs) and artificial neural networks (ANNs) using PyTorch and Lava frameworks. Our hybrid architecture integrates an SNN for temporal feature extraction and an ANN for classification. We delve into the challenges of deploying such hybrid structures on hardware. Specifically, we deploy individual components on Intel's Neuromorphic Processor Loihi (for SNN) and Jetson Nano (for ANN). We also propose an accumulator circuit to transfer data from the spiking to the non-spiking domain. Furthermore, we conduct comprehensive performance analyses of hybrid SNN-ANN models on a heterogeneous system of neuromorphic and edge AI hardware, evaluating accuracy, latency, power, and energy consumption. Our findings demonstrate that the hybrid spiking networks surpass the baseline ANN model across all metrics and outperform the baseline SNN model in accuracy and latency. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08702 [pdf, other]

Production and stabilization of a spin mixture of ultracold dipolar Bose gases

Authors: Maxime Lecomte, Alexandre Journeaux, Julie Veschambre, Jean Dalibard, Raphael Lopes

Abstract: Mixtures of ultracold gases with long-range interactions are expected to open new avenues in the study of quantum matter. Natural candidates for this research are spin mixtures of atomic species with large magnetic moments. However, the lifetime of such assemblies can be strongly affected by the dipolar relaxation that occurs in spin-flip collisions. Here we present experimental results for a mixt… ▽ More Mixtures of ultracold gases with long-range interactions are expected to open new avenues in the study of quantum matter. Natural candidates for this research are spin mixtures of atomic species with large magnetic moments. However, the lifetime of such assemblies can be strongly affected by the dipolar relaxation that occurs in spin-flip collisions. Here we present experimental results for a mixture composed of the two lowest Zeeman states of $^{162}$Dy atoms, that act as dark states with respect to a light-induced quadratic Zeeman effect. We show that, due to an interference phenomenon, the rate for such inelastic processes is dramatically reduced with respect to the Wigner threshold law. Additionally, we determine the scattering lengths characterizing the s-wave interaction between these states, providing all necessary data to predict the miscibility range of the mixture, depending on its dimensionality. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08696 [pdf, other]

Reducing the Resources Required by ADAPT-VQE Using Coupled Exchange Operators and Improved Subroutines

Authors: Mafalda Ramôa, Panagiotis G. Anastasiou, Luis Paulo Santos, Nicholas J. Mayhall, Edwin Barnes, Sophia E. Economou

Abstract: Adaptive variational quantum algorithms arguably offer the best prospects for quantum advantage in the NISQ era. Since the inception of the first such algorithm, ADAPT-VQE, many improvements have appeared in the literature. We combine the key improvements along with a novel operator pool -- which we term Coupled Exchange Operator (CEO) pool -- to assess the cost of running state-of-the-art ADAPT-V… ▽ More Adaptive variational quantum algorithms arguably offer the best prospects for quantum advantage in the NISQ era. Since the inception of the first such algorithm, ADAPT-VQE, many improvements have appeared in the literature. We combine the key improvements along with a novel operator pool -- which we term Coupled Exchange Operator (CEO) pool -- to assess the cost of running state-of-the-art ADAPT-VQE on hardware in terms of measurement counts and circuit depth. We show a dramatic reduction of these quantum resources compared to the early versions of the algorithm. We also find that our state-of-the-art CEO-ADAPT-VQE outperforms UCCSD, the most widely regarded static VQE ansatz, in all relevant metrics. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08694 [pdf, other]

Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight

Authors: Zhiqiang Xie, Yujia Zheng, Lizi Ottens, Kun Zhang, Christos Kozyrakis, Jonathan Mace

Abstract: Runtime failure and performance degradation is commonplace in modern cloud systems. For cloud providers, automatically determining the root cause of incidents is paramount to ensuring high reliability and availability as prompt fault localization can enable faster diagnosis and triage for timely resolution. A compelling solution explored in recent work is causal reasoning using causal graphs to ca… ▽ More Runtime failure and performance degradation is commonplace in modern cloud systems. For cloud providers, automatically determining the root cause of incidents is paramount to ensuring high reliability and availability as prompt fault localization can enable faster diagnosis and triage for timely resolution. A compelling solution explored in recent work is causal reasoning using causal graphs to capture relationships between varied cloud system performance metrics. To be effective, however, systems developers must correctly define the causal graph of their system, which is a time-consuming, brittle, and challenging task that increases in difficulty for large and dynamic systems and requires domain expertise. Alternatively, automated data-driven approaches have limited efficacy for cloud systems due to the inherent rarity of incidents. In this work, we present Atlas, a novel approach to automatically synthesizing causal graphs for cloud systems. Atlas leverages large language models (LLMs) to generate causal graphs using system documentation, telemetry, and deployment feedback. Atlas is complementary to data-driven causal discovery techniques, and we further enhance Atlas with a data-driven validation step. We evaluate Atlas across a range of fault localization scenarios and demonstrate that Atlas is capable of generating causal graphs in a scalable and generalizable manner, with performance that far surpasses that of data-driven algorithms and is commensurate to the ground-truth baseline. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2407.08692 [pdf, other]

FAR-Trans: An Investment Dataset for Financial Asset Recommendation

Authors: Javier Sanz-Cruzado, Nikolaos Droukas, Richard McCreadie

Abstract: Financial asset recommendation (FAR) is a sub-domain of recommender systems which identifies useful financial securities for investors, with the expectation that they will invest capital on the recommended assets. FAR solutions analyse and learn from multiple data sources, including time series pricing data, customer profile information and expectations, as well as past investments. However, most… ▽ More Financial asset recommendation (FAR) is a sub-domain of recommender systems which identifies useful financial securities for investors, with the expectation that they will invest capital on the recommended assets. FAR solutions analyse and learn from multiple data sources, including time series pricing data, customer profile information and expectations, as well as past investments. However, most models have been developed over proprietary datasets, making a comparison over a common benchmark impossible. In this paper, we aim to solve this problem by introducing FAR-Trans, the first public dataset for FAR, containing pricing information and retail investor transactions acquired from a large European financial institution. We also provide a bench-marking comparison between eleven FAR algorithms over the data for use as future baselines. The dataset can be downloaded from https://doi.org/10.5525/gla.researchdata.1658 . △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Accepted at the IJCAI-2024 Workshop on Recommender Systems in Finance (Fin-RecSys)

arXiv:2407.08691 [pdf, other]

ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions

Authors: Jiu Feng, Mehmet Hamza Erol, Joon Son Chung, Arda Senocak

Abstract: Transformers have rapidly overtaken CNN-based architectures as the new standard in audio classification. Transformer-based models, such as the Audio Spectrogram Transformers (AST), also inherit the fixed-size input paradigm from CNNs. However, this leads to performance degradation for ASTs in the inference when input lengths vary from the training. This paper introduces an approach that enables th… ▽ More Transformers have rapidly overtaken CNN-based architectures as the new standard in audio classification. Transformer-based models, such as the Audio Spectrogram Transformers (AST), also inherit the fixed-size input paradigm from CNNs. However, this leads to performance degradation for ASTs in the inference when input lengths vary from the training. This paper introduces an approach that enables the use of variable-length audio inputs with AST models during both training and inference. By employing sequence packing, our method ElasticAST, accommodates any audio length during training, thereby offering flexibility across all lengths and resolutions at the inference. This flexibility allows ElasticAST to maintain evaluation capabilities at various lengths or resolutions and achieve similar performance to standard ASTs trained at specific lengths or resolutions. Moreover, experiments demonstrate ElasticAST's better performance when trained and evaluated on native-length audio datasets. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Interspeech 2024. Code is available at https://github.com/JiuFengSC/ElasticAST

arXiv:2407.08687 [pdf, other]

Scattering transforms on the sphere, application to large scale structure modelling

Authors: Louise Mousset, Erwan Allys, Matthew A. Price, Jonathan Aumont, Jean-Marc Delouis, Ludovic Montier, Jason D. McEwen

Abstract: Scattering transforms are a new type of summary statistics recently developed for the study of highly non-Gaussian processes, which have been shown to be very promising for astrophysical studies. In particular, they allow one to build generative models of complex non-linear fields from a limited amount of data. In the context of upcoming cosmological surveys, the extension of these tools to spheri… ▽ More Scattering transforms are a new type of summary statistics recently developed for the study of highly non-Gaussian processes, which have been shown to be very promising for astrophysical studies. In particular, they allow one to build generative models of complex non-linear fields from a limited amount of data. In the context of upcoming cosmological surveys, the extension of these tools to spherical data is necessary. We develop scattering transforms on the sphere and focus on the construction of maximum-entropy generative models of astrophysical fields. The quality of the generative models, both statistically and visually, is very satisfying, which therefore open up a wide range of new applications for future cosmological studies. △ Less

Submitted 11 July, 2024; originally announced July 2024.

Comments: Contribution to the 2024 Cosmology session of the 58th Rencontres de Moriond. For details, please refer to the full article arXiv:2407.07007

Showing 201–250 of 709,815 results for author: J.