Search | arXiv e-print repository

X-ray Polarization of the Eastern Lobe of SS 433

Authors: Philip Kaaret, Riccardo Ferrazzoli, Stefano Silvestri, Michela Negro, Alberto Manfreda, Kinwah Wu, Enrico Costa, Paolo Soffitta, Samar Safi-Harb, Juri Poutanen, Alexandra Veledina, Alessandro Di Marco, Patrick Slane, Stefano Bianchi, Adam Ingram, Roger W. Romani, Nicolo Cibrario, Brydyn Mac Intyre, Romana Mikusincova, Ajay Ratheesh, James F. Steiner, Jiri Svoboda, Stefano Tugliani, Ivan Agudo, Lucio A. Antonelli , et al. (81 additional authors not shown)

Abstract: How astrophysical systems translate the kinetic energy of bulk motion into the acceleration of particles to very high energies is a pressing question. SS 433 is a microquasar that emits TeV gamma-rays indicating the presence of high-energy particles. A region of hard X-ray emission in the eastern lobe of SS 433 was recently identified as an acceleration site. We observed this region with the Imagi… ▽ More How astrophysical systems translate the kinetic energy of bulk motion into the acceleration of particles to very high energies is a pressing question. SS 433 is a microquasar that emits TeV gamma-rays indicating the presence of high-energy particles. A region of hard X-ray emission in the eastern lobe of SS 433 was recently identified as an acceleration site. We observed this region with the Imaging X-ray Polarimetry Explorer and measured a polarization degree in the range 38% to 77%. The high polarization degree indicates the magnetic field has a well ordered component if the X-rays are due to synchrotron emission. The polarization angle is in the range -12 to +10 degrees (east of north) which indicates that the magnetic field is parallel to the jet. Magnetic fields parallel to the bulk flow have also been found in supernova remnants and the jets of powerful radio galaxies. This may be caused by interaction of the flow with the ambient medium. △ Less

Submitted 27 November, 2023; originally announced November 2023.

Comments: 8 pages, accepted in the Astrophysical Journal Letters

arXiv:2311.16057 [pdf, ps, other]

The Power of Adaptivity in Quantum Query Algorithms

Authors: Uma Girish, Makrand Sinha, Avishay Tal, Kewen Wu

Abstract: Motivated by limitations on the depth of near-term quantum devices, we study the depth-computation trade-off in the query model, where the depth corresponds to the number of adaptive query rounds and the computation per layer corresponds to the number of parallel queries per round. We achieve the strongest known separation between quantum algorithms with $r$ versus $r-1$ rounds of adaptivity. We d… ▽ More Motivated by limitations on the depth of near-term quantum devices, we study the depth-computation trade-off in the query model, where the depth corresponds to the number of adaptive query rounds and the computation per layer corresponds to the number of parallel queries per round. We achieve the strongest known separation between quantum algorithms with $r$ versus $r-1$ rounds of adaptivity. We do so by using the $k$-fold Forrelation problem introduced by Aaronson and Ambainis (SICOMP'18). For $k=2r$, this problem can be solved using an $r$ round quantum algorithm with only one query per round, yet we show that any $r-1$ round quantum algorithm needs an exponential (in the number of qubits) number of parallel queries per round. Our results are proven following the Fourier analytic machinery developed in recent works on quantum-classical separations. The key new component in our result are bounds on the Fourier weights of quantum query algorithms with bounded number of rounds of adaptivity. These may be of independent interest as they distinguish the polynomials that arise from such algorithms from arbitrary bounded polynomials of the same degree. △ Less

Submitted 7 December, 2023; v1 submitted 27 November, 2023; originally announced November 2023.

Comments: 35 pages, 9 figures

arXiv:2311.15302 [pdf]

A Quick Response Algorithm for Dynamic Autonomous Mobile Robot Routing Problem with Time Windows

Authors: Lulu Cheng, Ning Zhao, Mengge Yuan, Kan Wu

Abstract: This paper investigates the optimization problem of scheduling autonomous mobile robots (AMRs) in hospital settings, considering dynamic requests with different priorities. The primary objective is to minimize the daily service cost by dynamically planning routes for the limited number of available AMRs. The total cost consists of AMR's purchase cost, transportation cost, delay penalty cost, and l… ▽ More This paper investigates the optimization problem of scheduling autonomous mobile robots (AMRs) in hospital settings, considering dynamic requests with different priorities. The primary objective is to minimize the daily service cost by dynamically planning routes for the limited number of available AMRs. The total cost consists of AMR's purchase cost, transportation cost, delay penalty cost, and loss of denial of service. To address this problem, we have established a two-stage mathematical programming model. In the first stage, a tabu search algorithm is employed to plan prior routes for all known medical requests. The second stage involves planning for real-time received dynamic requests using the efficient insertion algorithm with decision rules, which enables quick response based on the time window and demand constraints of the dynamic requests. One of the main contributions of this study is to make resource allocation decisions based on the present number of service AMRs for dynamic requests with different priorities. Computational experiments using Lackner instances demonstrate the efficient insertion algorithm with decision rules is very fast and robust in solving the dynamic AMR routing problem with time windows and request priority. Additionally, we provide managerial insights concerning the AMR's safety stock settings, which can aid in decision-making processes. △ Less

Submitted 26 November, 2023; originally announced November 2023.

arXiv:2311.13611 [pdf]

Computational-design Enabled Wearable and Tunable Metamaterials via Freeform Auxetics for Magnetic Resonance Imaging

Authors: Ke Wu, Xia Zhu, Thomas G. Bifano, Stephan W. Anderson, Xin Zhang

Abstract: Metamaterials hold significant promise for enhancing the imaging capabilities of MRI machines as an additive technology, due to their unique ability to enhance local magnetic fields. However, despite their potential, the metamaterials reported in the context of MRI applications have often been impractical. This impracticality arises from their predominantly flat configurations and their susceptibi… ▽ More Metamaterials hold significant promise for enhancing the imaging capabilities of MRI machines as an additive technology, due to their unique ability to enhance local magnetic fields. However, despite their potential, the metamaterials reported in the context of MRI applications have often been impractical. This impracticality arises from their predominantly flat configurations and their susceptibility to shifts in resonance frequencies, preventing them from realizing their optimal performance. Here, we introduce a computational method for designing wearable and tunable metamaterials via freeform auxetics. The proposed computational-design tools yield an approach to solving the complex circle packing problems in an interactive and efficient manner, thus facilitating the development of deployable metamaterials configured in freeform shapes. With such tools, the developed metamaterials may readily conform to a patient's kneecap, ankle, head, or any part of the body in need of imaging, and while ensuring an optimal resonance frequency, thereby paving the way for the widespread adoption of metamaterials in clinical MRI applications. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.07512 [pdf, other]

doi 10.21105/astro.2311.07512

Galaxy Clusters Discovered via the Thermal Sunyaev-Zel'dovich Effect in the 500-square-degree SPTpol Survey

Authors: L. E. Bleem, M. Klein, T. M. C. Abbott, P. A. R. Ade, M. Aguena, O. Alves, A. J. Anderson, F. Andrade-Oliveira, B. Ansarinejad, M. Archipley, M. L. N. Ashby, J. E. Austermann, D. Bacon, J. A. Beall, A. N. Bender, B. A. Benson, F. Bianchini, S. Bocquet, D. Brooks, D. L. Burke, M. Calzadilla, J. E. Carlstrom, A. Carnero Rosell, J. Carretero, C. L. Chang , et al. (103 additional authors not shown)

Abstract: We present a catalog of 689 galaxy cluster candidates detected at significance $ξ>4$ via their thermal Sunyaev-Zel'dovich (SZ) effect signature in 95 and 150 GHz data from the 500-square-degree SPTpol survey. We use optical and infrared data from the Dark Energy Camera and the Wide-field Infrared Survey Explorer (WISE) and \spitzer \ satellites, to confirm 544 of these candidates as clusters with… ▽ More We present a catalog of 689 galaxy cluster candidates detected at significance $ξ>4$ via their thermal Sunyaev-Zel'dovich (SZ) effect signature in 95 and 150 GHz data from the 500-square-degree SPTpol survey. We use optical and infrared data from the Dark Energy Camera and the Wide-field Infrared Survey Explorer (WISE) and \spitzer \ satellites, to confirm 544 of these candidates as clusters with $\sim94\%$ purity. The sample has an approximately redshift-independent mass threshold at redshift $z>0.25$ and spans $1.5 \times 10^{14} < M_{500c} < 9.1 \times 10^{14}$ $M_\odot/h_{70}$ \ and $0.03<z\lesssim1.6$ in mass and redshift, respectively; 21\% of the confirmed clusters are at $z>1$. We use external radio data from the Sydney University Molonglo Sky Survey (SUMSS) to estimate contamination to the SZ signal from synchrotron sources. The contamination reduces the recovered $ξ$ by a median value of 0.032, or $\sim0.8\%$ of the $ξ=4$ threshold value, and $\sim7\%$ of candidates have a predicted contamination greater than $Δξ= 1$. With the exception of a small number of systems $(<1\%)$, an analysis of clusters detected in single-frequency 95 and 150 GHz data shows no significant contamination of the SZ signal by emission from dusty or synchrotron sources. This cluster sample will be a key component in upcoming astrophysical and cosmological analyses of clusters. The SPTpol millimeter-wave maps and associated data products used to produce this sample are available at https://pole.uchicago.edu/public/data/sptpol_500d_clusters/index.html, and the NASA LAMBDA website. An interactive sky server with the SPTpol maps and Dark Energy Survey data release 2 images is also available at NCSA https://skyviewer.ncsa.illinois.edu. △ Less

Submitted 8 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: Matches version accepted by OJA. 19 pages + references, 14 figures, cluster candidate table provided in Appendix. Data products available at https://pole.uchicago.edu/public/data/sptpol_500d_clusters/index.html and an interactive sky server at https://skyviewer.ncsa.illinois.edu

Journal ref: Open Journal of Astrophysics, Volume 7, 2024

arXiv:2311.06359 [pdf, other]

doi 10.3847/2041-8213/ad132d

Highly Significant Detection of X-Ray Polarization from the Brightest Accreting Neutron Star Sco X-1

Authors: Fabio La Monaca, Alessandro Di Marco, Juri Poutanen, Matteo Bachetti, Sara E. Motta, Alessandro Papitto, Maura Pilia, Fei Xie, Stefano Bianchi, Anna Bobrikova, Enrico Costa, Wei Deng, Mingyu Ge, Giulia Illiano, Shu-Mei Jia, Henric Krawczynski, Eleonora V. Lai, Kuan Liu, Guglielmo Mastroserio, Fabio Muleri, John Rankin, Paolo Soffitta, Alexandra Veledina, Filippo Ambrosino, Melania Del Santo , et al. (94 additional authors not shown)

Abstract: The Imaging X-ray Polarimetry Explorer (IXPE) measured with high significance the X-ray polarization of the brightest Z-source Scorpius X-1, resulting in the nominal 2-8 keV energy band in a polarization degree of 1.0(0.2)% and a polarization angle of 8(6)° at 90% of confidence level. This observation was strictly simultaneous with observations performed by NICER, NuSTAR, and Insight-HXMT, which a… ▽ More The Imaging X-ray Polarimetry Explorer (IXPE) measured with high significance the X-ray polarization of the brightest Z-source Scorpius X-1, resulting in the nominal 2-8 keV energy band in a polarization degree of 1.0(0.2)% and a polarization angle of 8(6)° at 90% of confidence level. This observation was strictly simultaneous with observations performed by NICER, NuSTAR, and Insight-HXMT, which allowed for a precise characterization of its broad-band spectrum from soft to hard X-rays. The source has been observed mainly in its soft state, with short periods of flaring. We also observed low-frequency quasi-periodic oscillations. From a spectro-polarimetric analysis, we associate a polarization to the accretion disk at <3.2% at 90% of confidence level, compatible with expectations for an electron-scattering dominated optically thick atmosphere at the Sco X-1 inclination of 44°; for the higher-energy Comptonized component, we obtain a polarization of 1.3(0.4)%, in agreement with expectations for a slab of Thomson optical depth of ~7 and an electron temperature of ~3 keV. A polarization rotation with respect to previous observations by OSO-8 and PolarLight, and also with respect to the radio-jet position angle, is observed. This result may indicate a variation of the polarization with the source state that can be related to relativistic precession or to a change in the corona geometry with the accretion flow. △ Less

Submitted 24 January, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

Journal ref: ApJL 960 L11 (2024)

arXiv:2311.05497 [pdf, other]

Tracking the X-ray Polarization of the Black Hole Transient Swift J1727.8-1613 during a State Transition

Authors: Adam Ingram, Niek Bollemeijer, Alexandra Veledina, Michal Dovciak, Juri Poutanen, Elise Egron, Thomas D. Russell, Sergei A. Trushkin, Michela Negro, Ajay Ratheesh, Fiamma Capitanio, Riley Connors, Joseph Neilsen, Alexander Kraus, Maria Noemi Iacolina, Alberto Pellizzoni, Maura Pilia, Francesco Carotenuto, Giorgio Matt, Guglielmo Mastroserio, Philip Kaaret, Stefano Bianchi, Javier A. Garcia, Matteo Bachetti, Kinwah Wu , et al. (98 additional authors not shown)

Abstract: We report on an observational campaign on the bright black hole X-ray binary Swift J1727.8$-$1613 centered around five observations by the Imaging X-ray Polarimetry Explorer (IXPE). These observations track for the first time the evolution of the X-ray polarization of a black hole X-ray binary across a hard to soft state transition. The 2--8 keV polarization degree decreased from $\sim$4\% to… ▽ More We report on an observational campaign on the bright black hole X-ray binary Swift J1727.8$-$1613 centered around five observations by the Imaging X-ray Polarimetry Explorer (IXPE). These observations track for the first time the evolution of the X-ray polarization of a black hole X-ray binary across a hard to soft state transition. The 2--8 keV polarization degree decreased from $\sim$4\% to $\sim$3\% across the five observations, but the polarization angle remained oriented in the North-South direction throughout. Based on observations with the Australia Telescope Compact Array (ATCA), we find that the intrinsic 7.25 GHz radio polarization aligns with the X-ray polarization. Assuming the radio polarization aligns with the jet direction (which can be tested in the future with higher spatial resolution images of the jet), our results imply that the X-ray corona is extended in the disk plane, rather than along the jet axis, for the entire hard intermediate state. This in turn implies that the long ($\gtrsim$10 ms) soft lags that we measure with the Neutron star Interior Composition ExploreR (NICER) are dominated by processes other than pure light-crossing delays. Moreover, we find that the evolution of the soft lag amplitude with spectral state does not follow the trend seen for other sources, implying that Swift J1727.8$-$1613 is a member of a hitherto under-sampled sub-population. △ Less

Submitted 24 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: Accepted for publication in ApJ. 21 pages, 8 figures

arXiv:2311.04980 [pdf, other]

MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine

Authors: Endri Taka, Aman Arora, Kai-Chiang Wu, Diana Marculescu

Abstract: The increasing computational and memory requirements of Deep Learning (DL) workloads has led to outstanding innovations in hardware architectures. An archetype of such architectures is the novel Versal AI Engine (AIE) by AMD/Xilinx. The AIE comprises multiple programmable processors optimized for vector-based algorithms. An AIE array consisting of 400 processor cores, operating at 1.25 GHz is able… ▽ More The increasing computational and memory requirements of Deep Learning (DL) workloads has led to outstanding innovations in hardware architectures. An archetype of such architectures is the novel Versal AI Engine (AIE) by AMD/Xilinx. The AIE comprises multiple programmable processors optimized for vector-based algorithms. An AIE array consisting of 400 processor cores, operating at 1.25 GHz is able to deliver a peak throughput of 8 TFLOPs for 32-bit floating-point (fp32), and 128 TOPs for 8-bit integer (int8) precision. In this work, we propose MaxEVA: a novel framework to efficiently map Matrix Multiplication (MatMul) workloads on Versal AIE devices. Our framework maximizes the performance and energy efficiency of MatMul applications by efficiently exploiting features of the AIE architecture and resolving performance bottlenecks from multiple angles. When demonstrating on the VC1902 device of the VCK190 board, MaxEVA accomplishes up to 5.44 TFLOPs and 77.01 TOPs throughput for fp32 and int8 precisions, respectively. In terms of energy efficiency, MaxEVA attains up to 124.16 GFLOPs/W for fp32, and 1.16 TOPs/W for int8. Our proposed method substantially outperforms the state-of-the-art approach by exhibiting up to 2.19x throughput gain and 20.4% higher energy efficiency. The MaxEVA framework provides notable insights to fill the knowledge gap in effectively designing MatMul-based DL workloads on the new Versal AIE devices. △ Less

Submitted 13 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

Comments: Accepted as full paper at FPT 2023

arXiv:2311.04632 [pdf, other]

doi 10.3847/2041-8213/ad1832

X-Ray Polarized View on the Accretion Geometry in the X-Ray Binary Circinus X-1

Authors: John Rankin, Fabio La Monaca, Alessandro Di Marco, Juri Poutanen, Anna Bobrikova, Vadim Kravtsov, Fabio Muleri, Maura Pilia, Alexandra Veledina, Rob Fender, Philip Kaaret, Dawoon E. Kim, Andrea Marinucci, Herman L. Marshall, Alessandro Papitto, Allyn F. Tennant, Sergey S. Tsygankov, Martin C. Weisskopf, Kinwah Wu, Silvia Zane, Filippo Ambrosino, Ruben Farinelli, Andrea Gnarini, Iván Agudo, Lucio A. Antonelli , et al. (79 additional authors not shown)

Abstract: Cir X-1 is a neutron star X-ray binary characterized by strong variations in flux during its eccentric $\sim$16.6 days orbit. There are also strong variations in the spectral state, and historically it has shown both atoll and Z state properties. We observed the source with the Imaging X-ray Polarimetry Explorer during two orbital segments, 6 days apart, for a total of 263~ks. We find an X-ray pol… ▽ More Cir X-1 is a neutron star X-ray binary characterized by strong variations in flux during its eccentric $\sim$16.6 days orbit. There are also strong variations in the spectral state, and historically it has shown both atoll and Z state properties. We observed the source with the Imaging X-ray Polarimetry Explorer during two orbital segments, 6 days apart, for a total of 263~ks. We find an X-ray polarization degree in these segments of $1.6\%\pm0.3\%$ and $1.4\%\pm0.3\%$ at polarization angles of $37^\circ\pm5^\circ$ and $-12^\circ\pm7^\circ$, respectively. Thus we observed a rotation of the polarization angle by $49^\circ\pm8^\circ$ along the orbit. Because variations of accretion flow, and then of the hardness ratio, are expected during the orbit, we also studied the polarization binned in hardness ratio, and found the polarization angle differing by $67^\circ\pm11^\circ$ between the lowest and highest values of the hardness ratio. We discuss possible interpretations of this result that could indicate a possible misalignment between the symmetry axes of the accretion disk and the Comptonizing region caused by the misalignment of the neutron star's angular momentum with respect to the orbital one. △ Less

Submitted 22 December, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

Comments: 15 pages, 6 figures, Accepted for publication in ApJL

Journal ref: ApJL 961 L8 (2024)

arXiv:2311.04531 [pdf, other]

An Unsupervised Deep Learning Approach for the Wave Equation Inverse Problem

Authors: Xiong-Bin Yan, Keke Wu, Zhi-Qin John Xu, Zheng Ma

Abstract: Full-waveform inversion (FWI) is a powerful geophysical imaging technique that infers high-resolution subsurface physical parameters by solving a non-convex optimization problem. However, due to limitations in observation, e.g., limited shots or receivers, and random noise, conventional inversion methods are confronted with numerous challenges, such as the local-minimum problem. In recent years, a… ▽ More Full-waveform inversion (FWI) is a powerful geophysical imaging technique that infers high-resolution subsurface physical parameters by solving a non-convex optimization problem. However, due to limitations in observation, e.g., limited shots or receivers, and random noise, conventional inversion methods are confronted with numerous challenges, such as the local-minimum problem. In recent years, a substantial body of work has demonstrated that the integration of deep neural networks and partial differential equations for solving full-waveform inversion problems has shown promising performance. In this work, drawing inspiration from the expressive capacity of neural networks, we provide an unsupervised learning approach aimed at accurately reconstructing subsurface physical velocity parameters. This method is founded on a re-parametrization technique for Bayesian inference, achieved through a deep neural network with random weights. Notably, our proposed approach does not hinge upon the requirement of the labeled training dataset, rendering it exceedingly versatile and adaptable to diverse subsurface models. Extensive experiments show that the proposed approach performs noticeably better than existing conventional inversion methods. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Comments: 32 Pages,22 figures, 3 tables

arXiv:2311.04049 [pdf, other]

3D EAGAN: 3D edge-aware attention generative adversarial network for prostate segmentation in transrectal ultrasound images

Authors: Mengqing Liu, Xiao Shao, Li** Jiang, Kaizhi Wu

Abstract: Automatic prostate segmentation in TRUS images has always been a challenging problem, since prostates in TRUS images have ambiguous boundaries and inhomogeneous intensity distribution. Although many prostate segmentation methods have been proposed, they still need to be improved due to the lack of sensibility to edge information. Consequently, the objective of this study is to devise a highly effe… ▽ More Automatic prostate segmentation in TRUS images has always been a challenging problem, since prostates in TRUS images have ambiguous boundaries and inhomogeneous intensity distribution. Although many prostate segmentation methods have been proposed, they still need to be improved due to the lack of sensibility to edge information. Consequently, the objective of this study is to devise a highly effective prostate segmentation method that overcomes these limitations and achieves accurate segmentation of prostates in TRUS images. A 3D edge-aware attention generative adversarial network (3D EAGAN)-based prostate segmentation method is proposed in this paper, which consists of an edge-aware segmentation network (EASNet) that performs the prostate segmentation and a discriminator network that distinguishes predicted prostates from real prostates. The proposed EASNet is composed of an encoder-decoder-based U-Net backbone network, a detail compensation module, four 3D spatial and channel attention modules, an edge enhance module, and a global feature extractor. The detail compensation module is proposed to compensate for the loss of detailed information caused by the down-sampling process of the encoder. The features of the detail compensation module are selectively enhanced by the 3D spatial and channel attention module. Furthermore, an edge enhance module is proposed to guide shallow layers in the EASNet to focus on contour and edge information in prostates. Finally, features from shallow layers and hierarchical features from the decoder module are fused through the global feature extractor to predict the segmentation prostates. △ Less

Submitted 7 November, 2023; originally announced November 2023.

arXiv:2311.03912 [pdf, other]

FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer

Authors: Chi-Chih Chang, Yuan-Yao Sung, Shixing Yu, Ning-Chi Huang, Diana Marculescu, Kai-Chiang Wu

Abstract: Vision Transformers (ViT) have recently demonstrated success across a myriad of computer vision tasks. However, their elevated computational demands pose significant challenges for real-world deployment. While low-rank approximation stands out as a renowned method to reduce computational loads, efficiently automating the target rank selection in ViT remains a challenge. Drawing from the notable si… ▽ More Vision Transformers (ViT) have recently demonstrated success across a myriad of computer vision tasks. However, their elevated computational demands pose significant challenges for real-world deployment. While low-rank approximation stands out as a renowned method to reduce computational loads, efficiently automating the target rank selection in ViT remains a challenge. Drawing from the notable similarity and alignment between the processes of rank selection and One-Shot NAS, we introduce FLORA, an end-to-end automatic framework based on NAS. To overcome the design challenge of supernet posed by vast search space, FLORA employs a low-rank aware candidate filtering strategy. This method adeptly identifies and eliminates underperforming candidates, effectively alleviating potential undertraining and interference among subnetworks. To further enhance the quality of low-rank supernets, we design a low-rank specific training paradigm. First, we propose weight inheritance to construct supernet and enable gradient sharing among low-rank modules. Secondly, we adopt low-rank aware sampling to strategically allocate training resources, taking into account inherited information from pre-trained models. Empirical results underscore FLORA's efficacy. With our method, a more fine-grained rank configuration can be generated automatically and yield up to 33% extra FLOPs reduction compared to a simple uniform configuration. More specific, FLORA-DeiT-B/FLORA-Swin-B can save up to 55%/42% FLOPs almost without performance degradtion. Importantly, FLORA boasts both versatility and orthogonality, offering an extra 21%-26% FLOPs reduction when integrated with leading compression techniques or compact hybrid structures. Our code is publicly available at https://github.com/shadowpa0327/FLORA. △ Less

Submitted 7 November, 2023; originally announced November 2023.

Comments: Accepted by WACV 2024

arXiv:2311.03667 [pdf, other]

X-ray Polarization Reveals the Precessions of the Neutron Star in Hercules X-1

Authors: Jeremy Heyl, Victor Doroshenko, Denis González-Caniulef, Ilaria Caiazzo, Juri Poutanen, Alexander Mushtukov, Sergey S. Tsygankov, Demet Kirmizibayrak, Matteo Bachetti, George G. Pavlov, Sofia V. Forsblom, Christian Malacaria, Valery F. Suleimanov, Iván Agudo, Lucio Angelo Antonelli, Luca Baldini, Wayne H. Baumgartner, Ronaldo Bellazzini, Stefano Bianchi, Stephen D. Bongiorno, Raffaella Bonino, Alessandro Brez, Niccolò Bucciantini, Fiamma Capitanio, Simone Castellano , et al. (78 additional authors not shown)

Abstract: In an accreting X-ray pulsar, a neutron star accretes matter from a stellar companion through an accretion disk. The high magnetic field of the rotating neutron star disrupts the inner edge of the disc, funneling the gas to flow onto the magnetic poles on its surface. Hercules X-1 is in many ways the prototypical X-ray pulsar; it shows persistent X-ray emission and it resides with its companion HZ… ▽ More In an accreting X-ray pulsar, a neutron star accretes matter from a stellar companion through an accretion disk. The high magnetic field of the rotating neutron star disrupts the inner edge of the disc, funneling the gas to flow onto the magnetic poles on its surface. Hercules X-1 is in many ways the prototypical X-ray pulsar; it shows persistent X-ray emission and it resides with its companion HZ Her, a two-solar-mass star, at about 7~kpc from Earth. Its emission varies on three distinct timescales: the neutron star rotates every 1.2~seconds, it is eclipsed by its companion each 1.7~days, and the system exhibits a superorbital period of 35~days which has remained remarkably stable since its discovery. Several lines of evidence point to the source of this variation as the precession of the accretion disc, the precession of the neutron star or both. Despite the many hints over the past fifty years, the precession of the neutron star itself has yet not been confirmed or refuted. We here present X-ray polarization measurements with the Imaging X-ray Polarimetry Explorer (IXPE) which probe the spin geometry of the neutron star. These observations provide direct evidence that the 35-day-period is set by the free precession of the neutron star crust, which has the important implication that its crust is somewhat asymmetric fractionally by a few parts per ten million. Furthermore, we find indications that the basic spin geometry of the neutron star is altered by torques on timescale of a few hundred days. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 37 pages

arXiv:2311.03637 [pdf, other]

The detection of polarized x-ray emission from the magnetar 1E 2259+586

Authors: Jeremy Heyl, Roberto Taverna, Roberto Turolla, Gian Luca Israel, Mason Ng, Demet Kirmizibayrak, Denis González-Caniulef, Ilaria Caiazzo, Silvia Zane, Steven R. Ehlert, Michela Negro, Iván Agudo, Lucio Angelo Antonelli, Matteo Bachetti, Luca Baldini, Wayne H. Baumgartner, Ronaldo Bellazzini, Stefano Bianchi, Stephen D. Bongiorno, Raffaella Bonino, Alessandro Brez, Niccolò Bucciantini, Fiamma Capitanio, Simone Castellano, Elisabetta Cavazzuti , et al. (76 additional authors not shown)

Abstract: We report on IXPE, NICER and XMM-Newton observations of the magnetar 1E 2259+586. We find that the source is significantly polarized at about or above 20% for all phases except for the secondary peak where it is more weakly polarized. The polarization degree is strongest during the primary minimum which is also the phase where an absorption feature has been identified previously (Pizzocaro et al.… ▽ More We report on IXPE, NICER and XMM-Newton observations of the magnetar 1E 2259+586. We find that the source is significantly polarized at about or above 20% for all phases except for the secondary peak where it is more weakly polarized. The polarization degree is strongest during the primary minimum which is also the phase where an absorption feature has been identified previously (Pizzocaro et al. 2019). The polarization angle of the photons are consistent with a rotating vector model with a mode switch between the primary minimum and the rest of the rotation of the neutron star. We propose a scenario in which the emission at the source is weakly polarized (as in a condensed surface) and, as the radiation passes through a plasma arch, resonant cyclotron scattering off of protons produces the observed polarized radiation. This confirms the magnetar nature of the source with a surface field greater than about 10<sup>15</sup> G △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 13 pages, 13 figures, 5 tables

arXiv:2311.01981 [pdf, other]

ProSG: Using Prompt Synthetic Gradients to Alleviate Prompt Forgetting of RNN-like Language Models

Authors: Haotian Luo, Kunming Wu, Cheng Dai, Sixian Ding, Xinhao Chen

Abstract: RNN-like language models are getting renewed attention from NLP researchers in recent years and several models have made significant progress, which demonstrates performance comparable to traditional transformers. However, due to the recurrent nature of RNNs, this kind of language model can only store information in a set of fixed-length state vectors. As a consequence, they still suffer from forg… ▽ More RNN-like language models are getting renewed attention from NLP researchers in recent years and several models have made significant progress, which demonstrates performance comparable to traditional transformers. However, due to the recurrent nature of RNNs, this kind of language model can only store information in a set of fixed-length state vectors. As a consequence, they still suffer from forgetfulness though after a lot of improvements and optimizations, when given complex instructions or prompts. As the prompted generation is the main and most concerned function of LMs, solving the problem of forgetting in the process of generation is no wonder of vital importance. In this paper, focusing on easing the prompt forgetting during generation, we proposed an architecture to teach the model memorizing prompt during generation by synthetic gradient. To force the model to memorize the prompt, we derive the states that encode the prompt, then transform it into model parameter modification using low-rank gradient approximation, which hard-codes the prompt into model parameters temporarily. We construct a dataset for experiments, and the results have demonstrated the effectiveness of our method in solving the problem of forgetfulness in the process of prompted generation. We will release all the code upon acceptance. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01720 [pdf, other]

Learning Reduced-Order Soft Robot Controller

Authors: Chen Liang, Xifeng Gao, Kui Wu, Zherong Pan

Abstract: Deformable robots are notoriously difficult to model or control due to its high-dimensional configuration spaces. Direct trajectory optimization suffers from the curse-of-dimensionality and incurs a high computational cost, while learning-based controller optimization methods are sensitive to hyper-parameter tuning. To overcome these limitations, we hypothesize that high fidelity soft robots can b… ▽ More Deformable robots are notoriously difficult to model or control due to its high-dimensional configuration spaces. Direct trajectory optimization suffers from the curse-of-dimensionality and incurs a high computational cost, while learning-based controller optimization methods are sensitive to hyper-parameter tuning. To overcome these limitations, we hypothesize that high fidelity soft robots can be both simulated and controlled by restricting to low-dimensional spaces. Under such assumption, we propose a two-stage algorithm to identify such simulation- and control-spaces. Our method first identifies the so-called simulation-space that captures the salient deformation modes, to which the robot's governing equation is restricted. We then identify the control-space, to which control signals are restricted. We propose a multi-fidelity Riemannian Bayesian bilevel optimization to identify task-specific control spaces. We show that the dimension of control-space can be less than $10$ for a high-DOF soft robot to accomplish walking and swimming tasks, allowing low-dimensional MPC controllers to be applied to soft robots with tractable computational complexity. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01717 [pdf, other]

Second-Order Convergent Collision-Constrained Optimization-Based Planner

Authors: Chen Liang, Xifeng Gao, Kui Wu, Zherong Pan

Abstract: Finding robot poses and trajectories represents a foundational aspect of robot motion planning. Despite decades of research, efficiently and robustly addressing these challenges is still difficult. Existing approaches are often plagued by various limitations, such as intricate geometric approximations, violations of collision constraints, or slow first-order convergence. In this paper, we introduc… ▽ More Finding robot poses and trajectories represents a foundational aspect of robot motion planning. Despite decades of research, efficiently and robustly addressing these challenges is still difficult. Existing approaches are often plagued by various limitations, such as intricate geometric approximations, violations of collision constraints, or slow first-order convergence. In this paper, we introduce two novel optimization formulations that offer provable robustness, achieving second-order convergence while requiring only a convex approximation of the robot's links and obstacles. Our first method, known as the Explicit Collision Barrier (ECB) method, employs a barrier function to guarantee separation between convex objects. ECB uses an efficient matrix factorization technique, enabling a second-order Newton's method with an iterative complexity linear in the number of separating planes. Our second method, referred to as the Implicit Collision Barrier (ICB) method, further transforms the separating planes into implicit functions of robot poses. We show such an implicit objective function is twice-differentiable, with derivatives evaluated at a linear complexity. To assess the effectiveness of our approaches, we conduct a comparative study with a first-order baseline algorithm across six testing scenarios. Our results unequivocally justify that our method exhibits significantly faster convergence rates compared to the baseline algorithm. △ Less

Submitted 3 November, 2023; originally announced November 2023.

arXiv:2311.01288 [pdf, other]

Unraveling Diffusion in Fusion Plasma: A Case Study of In Situ Processing and Particle Sorting

Authors: Junmin Gu, Paul Lin, Kesheng Wu, Seung-Hoe Ku, C. S. Chang, R. Michael Churchill, Jong Choi, Norbert Podhorszki, Scott Klasky

Abstract: This work starts an in situ processing capability to study a certain diffusion process in magnetic confinement fusion. This diffusion process involves plasma particles that are likely to escape confinement. Such particles carry a significant amount of energy from the burning plasma inside the tokamak to the diverter and damaging the diverter plate. This study requires in situ processing because of… ▽ More This work starts an in situ processing capability to study a certain diffusion process in magnetic confinement fusion. This diffusion process involves plasma particles that are likely to escape confinement. Such particles carry a significant amount of energy from the burning plasma inside the tokamak to the diverter and damaging the diverter plate. This study requires in situ processing because of the fast changing nature of the particle diffusion process. However, the in situ processing approach is challenging because the amount of data to be retained for the diffusion calculations increases over time, unlike in other in situ processing cases where the amount of data to be processed is constant over time. Here we report our preliminary efforts to control the memory usage while ensuring the necessary analysis tasks are completed in a timely manner. Compared with an earlier naive attempt to directly computing the same diffusion displacements in the simulation code, this in situ version reduces the memory usage from particle information by nearly 60% and computation time by about 20%. △ Less

Submitted 2 November, 2023; originally announced November 2023.

arXiv:2311.00535 [pdf]

Active Noise Control Portable Device Design

Authors: kai Wu, Yuanyuan Chen

Abstract: While our world is filled with its own natural sounds that we can't resist enjoying, it is also chock-full of other sounds that can be irritating, this is noise. Noise not only influences the working efficiency but also the human's health. The problem of reducing noise is one of great importance and great difficulty. The problem has been addressed in many ways over the years. The current methods f… ▽ More While our world is filled with its own natural sounds that we can't resist enjoying, it is also chock-full of other sounds that can be irritating, this is noise. Noise not only influences the working efficiency but also the human's health. The problem of reducing noise is one of great importance and great difficulty. The problem has been addressed in many ways over the years. The current methods for noise reducing mostly rely on the materials and transmission medium, which are only effective to some extent for the high frequency noise. However, the effective reduction noise method especially for low frequency noise is very limited. Here we come up with a noise reduction system consist of a sensor to detect the noise in the environment. Then the noise will be sent to an electronic control system to process the noise, which will generate a reverse phase frequency signal to counteract the disturbance. Finally, the processed smaller noise will be broadcasted by the speaker. Through this smart noise reduction system, even the noise with low-frequency can be eliminated. The system is also integrated with sleep tracking and music player applications. It can also remember and store settings for the same environment, sense temperature, and smart control of home furniture, fire alarm, etc. This smart system can transfer data easily by Wi-Fi or Bluetooth and controlled by its APP. In this project, we will present a model of the above technology which can be used in various environments to prevent noise pollution and provide a solution to the people who have difficulties finding a peaceful and quiet environment for sleep, work or study. △ Less

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2311.00294 [pdf, ps, other]

Multi-step ahead prediction intervals for non-parametric autoregressions via bootstrap: consistency, debiasing and pertinence

Authors: Dimitris N. Politis, Ke** Wu

Abstract: To address the difficult problem of multi-step ahead prediction of non-parametric autoregressions, we consider a forward bootstrap approach. Employing a local constant estimator, we can analyze a general type of non-parametric time series model, and show that the proposed point predictions are consistent with the true optimal predictor. We construct a quantile prediction interval that is asymptoti… ▽ More To address the difficult problem of multi-step ahead prediction of non-parametric autoregressions, we consider a forward bootstrap approach. Employing a local constant estimator, we can analyze a general type of non-parametric time series model, and show that the proposed point predictions are consistent with the true optimal predictor. We construct a quantile prediction interval that is asymptotically valid. Moreover, using a debiasing technique, we can asymptotically approximate the distribution of multi-step ahead non-parametric estimation by bootstrap. As a result, we can build bootstrap prediction intervals that are pertinent, i.e., can capture the model estimation variability, thus improving upon the standard quantile prediction intervals. Simulation studies are given to illustrate the performance of our point predictions and pertinent prediction intervals for finite samples. △ Less

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.18697 [pdf, other]

doi 10.1177/02783649231195380

KernelGPA: A Globally Optimal Solution to Deformable SLAM in Closed-form

Authors: Fang Bai, Kanzhi Wu, Adrien Bartoli

Abstract: We study the generalized Procrustes analysis (GPA), as a minimal formulation to the simultaneous localization and map** (SLAM) problem. We propose KernelGPA, a novel global registration technique to solve SLAM in the deformable environment. We propose the concept of deformable transformation which encodes the entangled pose and deformation. We define deformable transformations using a kernel met… ▽ More We study the generalized Procrustes analysis (GPA), as a minimal formulation to the simultaneous localization and map** (SLAM) problem. We propose KernelGPA, a novel global registration technique to solve SLAM in the deformable environment. We propose the concept of deformable transformation which encodes the entangled pose and deformation. We define deformable transformations using a kernel method, and show that both the deformable transformations and the environment map can be solved globally in closed-form, up to global scale ambiguities. We solve the scale ambiguities by an optimization formulation that maximizes rigidity. We demonstrate KernelGPA using the Gaussian kernel, and validate the superiority of KernelGPA with various datasets. Code and data are available at \url{https://bitbucket.org/FangBai/deformableprocrustes}. △ Less

Submitted 28 October, 2023; originally announced October 2023.

Comments: This paper has been accepted for publication in the International Journal of Robotics Research, 2023. https://doi.org/10.1177/02783649231195380

MSC Class: 68U05 ACM Class: G.1; I.3; I.4

Journal ref: International Journal of Robotics Research, 2023

arXiv:2310.18690 [pdf]

Modeling of an efficient singlet-triplet spin qubit to photon interface assisted by a photonic crystal cavity

Authors: Kui Wu, Sebastian Kindel, Thomas Descamps, Tobias Hangleiter, Jan Christoph Müller, Rebecca Rodrigo, Florian Merget, Hendrik Bluhm, Jeremy Witzens

Abstract: Efficient interconnection between distant semiconductor spin qubits with the help of photonic qubits would offer exciting new prospects for future quantum communication applications. In this paper, we optimize the extraction efficiency of a novel interface between a singlet-triplet spin qubit and a photonic qubit. The interface is based on a 220 nm thick GaAs/AlGaAs heterostructure membrane and co… ▽ More Efficient interconnection between distant semiconductor spin qubits with the help of photonic qubits would offer exciting new prospects for future quantum communication applications. In this paper, we optimize the extraction efficiency of a novel interface between a singlet-triplet spin qubit and a photonic qubit. The interface is based on a 220 nm thick GaAs/AlGaAs heterostructure membrane and consists of a gate-defined double quantum dot (GDQD) supporting a singlet-triplet qubit, an optically active quantum dot (OAQD) consisting of a gate-defined exciton trap, a photonic crystal cavity providing in-plane optical confinement and efficient out-coupling to an ideal free space Gaussian beam while accommodating the gate wiring of the GDQD and OAQD, and a bottom gold reflector to recycle photons and increase the optical extraction efficiency. All essential components can be lithographically defined and deterministically fabricated on the GaAs/AlGaAs heterostructure membrane, which greatly increases the scalability of on-chip integration. According to our simulations, the interface provides an overall coupling efficiency of 28.7% into a free space Gaussian beam, assuming an SiO2 interlayer filling the space between the reflector and the membrane. The performance can be further increased by undercutting this SiO2 interlayer below the photonic crystal. In this case, the overall efficiency is calculated to be 48.5%. △ Less

Submitted 28 October, 2023; originally announced October 2023.

arXiv:2310.18313 [pdf, other]

FP8-LM: Training FP8 Large Language Models

Authors: Houwen Peng, Kan Wu, Yixuan Wei, Guoshuai Zhao, Yuxiang Yang, Ze Liu, Yifan Xiong, Ziyue Yang, Bolin Ni, **gcheng Hu, Ruihang Li, Miaosen Zhang, Chen Li, Jia Ning, Ruizhe Wang, Zheng Zhang, Shuguang Liu, Joe Chau, Han Hu, Peng Cheng

Abstract: In this paper, we explore FP8 low-bit data formats for efficient training of large language models (LLMs). Our key insight is that most variables, such as gradients and optimizer states, in LLM training can employ low-precision data formats without compromising model accuracy and requiring no changes to hyper-parameters. Specifically, we propose a new FP8 automatic mixed-precision framework for tr… ▽ More In this paper, we explore FP8 low-bit data formats for efficient training of large language models (LLMs). Our key insight is that most variables, such as gradients and optimizer states, in LLM training can employ low-precision data formats without compromising model accuracy and requiring no changes to hyper-parameters. Specifically, we propose a new FP8 automatic mixed-precision framework for training LLMs. This framework offers three levels of FP8 utilization to streamline mixed-precision and distributed parallel training for LLMs. It gradually incorporates 8-bit gradients, optimizer states, and distributed learning in an incremental manner. Experiment results show that, during the training of GPT-175B model on H100 GPU platform, our FP8 mixed-precision training framework not only achieved a remarkable 39% reduction in real memory usage but also ran 75% faster than the widely adopted BF16 framework (i.e., Megatron-LM), surpassing the speed of Nvidia Transformer Engine by 37%. This largely reduces the training costs for large foundation models. Furthermore, our FP8 mixed-precision training methodology is generic. It can be seamlessly applied to other tasks such as LLM instruction tuning and reinforcement learning with human feedback, offering savings in fine-tuning expenses. Our FP8 low-precision training framework is open-sourced at {https://github.com/Azure/MS-AMP}{aka.ms/MS.AMP}. △ Less

Submitted 19 December, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

arXiv:2310.17137 [pdf, other]

Large-Scale Gaussian Processes via Alternating Projection

Authors: Kaiwen Wu, Jonathan Wenger, Haydn Jones, Geoff Pleiss, Jacob R. Gardner

Abstract: Training and inference in Gaussian processes (GPs) require solving linear systems with $n\times n$ kernel matrices. To address the prohibitive $\mathcal{O}(n^3)$ time complexity, recent work has employed fast iterative methods, like conjugate gradients (CG). However, as datasets increase in magnitude, the kernel matrices become increasingly ill-conditioned and still require $\mathcal{O}(n^2)$ spac… ▽ More Training and inference in Gaussian processes (GPs) require solving linear systems with $n\times n$ kernel matrices. To address the prohibitive $\mathcal{O}(n^3)$ time complexity, recent work has employed fast iterative methods, like conjugate gradients (CG). However, as datasets increase in magnitude, the kernel matrices become increasingly ill-conditioned and still require $\mathcal{O}(n^2)$ space without partitioning. Thus, while CG increases the size of datasets GPs can be trained on, modern datasets reach scales beyond its applicability. In this work, we propose an iterative method which only accesses subblocks of the kernel matrix, effectively enabling mini-batching. Our algorithm, based on alternating projection, has $\mathcal{O}(n)$ per-iteration time and space complexity, solving many of the practical challenges of scaling GPs to very large datasets. Theoretically, we prove the method enjoys linear convergence. Empirically, we demonstrate its fast convergence in practice and robustness to ill-conditioning. On large-scale benchmark datasets with up to four million data points, our approach accelerates GP training and inference by speed-up factors up to $27\times$ and $72 \times$, respectively, compared to CG. △ Less

Submitted 8 March, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

Comments: AISTATS 2024

arXiv:2310.16869 [pdf]

Single-pixel imaging based on deep learning

Authors: Kai Song, Yaoxing Bian, Ku Wu, Hongrui Liu, Shuang** Han, Jiaming Li, Jiazhao Tian, Chengbin Qin, Jianyong Hu, Liantuan Xiao

Abstract: Single-pixel imaging can collect images at the wavelengths outside the reach of conventional focal plane array detectors. However, the limited image quality and lengthy computational times for iterative reconstruction still impede the practical application of single-pixel imaging. Recently, deep learning has been introduced into single-pixel imaging, which has attracted a lot of attention due to i… ▽ More Single-pixel imaging can collect images at the wavelengths outside the reach of conventional focal plane array detectors. However, the limited image quality and lengthy computational times for iterative reconstruction still impede the practical application of single-pixel imaging. Recently, deep learning has been introduced into single-pixel imaging, which has attracted a lot of attention due to its exceptional reconstruction quality, fast reconstruction speed, and the potential to complete advanced sensing tasks without reconstructing images. Here, this advance is discussed and some opinions are offered. Firstly, based on the fundamental principles of single-pixel imaging and deep learning, the principles and algorithms of single-pixel imaging based on deep learning are described and analyzed. Subsequently, the implementation technologies of single-pixel imaging based on deep learning are reviewed. They are divided into super-resolution single-pixel imaging, single-pixel imaging through scattering media, photon-level single-pixel imaging, optical encryption based on single-pixel imaging, color single-pixel imaging, and image-free sensing according to diverse application fields. Finally, major challenges and corresponding feasible approaches are discussed, as well as more possible applications in the future. △ Less

Submitted 16 November, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

arXiv:2310.13678 [pdf, other]

Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models

Authors: Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke Wu

Abstract: One challenge in speech translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we adapt large language models (LLMs) to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We overcome the tendency of hallucination in LLMs… ▽ More One challenge in speech translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we adapt large language models (LLMs) to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We overcome the tendency of hallucination in LLMs by incorporating finite-state constraints during decoding; these eliminate invalid outputs without requiring additional training. We discover that LLMs are adaptable to transcripts containing ASR errors through prompt-tuning or fine-tuning. Relative to a state-of-the-art automatic punctuation baseline, our best LLM improves the average BLEU by 2.9 points for English-German, English-Spanish, and English-Arabic TED talk translation in 9 test sets, just by improving segmentation. △ Less

Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

Comments: accepted to the Findings of EMNLP 2023. arXiv admin note: text overlap with arXiv:2212.09895

arXiv:2310.11510 [pdf, other]

Observations of Low and Intermediate Spectral Peak Blazars with the Imaging X-ray Polarimetry Explorer

Authors: Herman L. Marshall, Ioannis Liodakis, Alan P. Marscher, Niccolo Di Lalla, Svetlana G. Jorstad, Dawoon E. Kim, Riccardo Middei, Michela Negro, Nicola Omodei, Abel L. Peirson, Matteo Perri, Simonetta Puccetti, Ivan Agudo, Giacomo Bonnoli, Andrei V. Berdyugin, Elisabetta Cavazzuti, Nicole Rodriguez Cavero, Immacolata Donnarumma, Laura Di Gesu, Jenni Jormanainen, Henric Krawczynski, Elina Lindfors, Frederic Marin, Francesco Massaro, Luigi Pacciani , et al. (133 additional authors not shown)

Abstract: We present X-ray polarimetry observations from the Imaging X-ray Polarimetry Explorer (IXPE) of three low spectral peak and one intermediate spectral peak blazars, namely 3C 273, 3C 279, 3C 454.3, and S5 0716+714. For none of these objects was IXPE able to detect X-ray polarization at the 3$σ$ level. However, we placed upper limits on the polarization degree at $\sim$10-30\%. The undetected polari… ▽ More We present X-ray polarimetry observations from the Imaging X-ray Polarimetry Explorer (IXPE) of three low spectral peak and one intermediate spectral peak blazars, namely 3C 273, 3C 279, 3C 454.3, and S5 0716+714. For none of these objects was IXPE able to detect X-ray polarization at the 3$σ$ level. However, we placed upper limits on the polarization degree at $\sim$10-30\%. The undetected polarizations favor models where the X-ray band is dominated by unpolarized photons upscattered by relativistic electrons in the jets of blazars, although hadronic models are not completely eliminated. We discuss the X-ray polarization upper limits in the context of our contemporaneous multiwavelength polarization campaigns. △ Less

Submitted 19 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 19 pages, 10 figures, 6 tables; submitted to the Astrophysical Journal

arXiv:2310.11125 [pdf, other]

IXPE observation confirms a high spin in the accreting black hole 4U 1957+115

Authors: L. Marra, M. Brigitte, N. Rodriguez Cavero, S. Chun, J. F. Steiner, M. Dovčiak, M. Nowak, S. Bianchi, F. Capitanio, A. Ingram, G. Matt, F. Muleri, J. Podgorný, J. Poutanen, J. Svoboda, R. Taverna, F. Ursini, A. Veledina, A. De Rosa, J. A. Garcia, A. A. Lutovinov, I. A. Mereminskiy, R. Farinelli, S. Gunji, P. Kaaret , et al. (91 additional authors not shown)

Abstract: We present the results of the first X-ray polarimetric observation of the low-mass X-ray binary 4U 1957+115, performed with the Imaging X-ray Polarimetry Explorer in May 2023. The binary system has been in a high-soft spectral state since its discovery and is thought to host a black hole. The $\sim$571 ks observation reveals a linear polarisation degree of $1.9\% \pm 0.6\%$ and a polarisation angl… ▽ More We present the results of the first X-ray polarimetric observation of the low-mass X-ray binary 4U 1957+115, performed with the Imaging X-ray Polarimetry Explorer in May 2023. The binary system has been in a high-soft spectral state since its discovery and is thought to host a black hole. The $\sim$571 ks observation reveals a linear polarisation degree of $1.9\% \pm 0.6\%$ and a polarisation angle of $-41^\circ.8 \pm 7^\circ.9$ in the 2-8 keV energy range. Spectral modelling is consistent with the dominant contribution coming from the standard accretion disc, while polarimetric data suggest a significant role of returning radiation: photons that are bent by strong gravity effects and forced to return to the disc surface, where they can be reflected before eventually reaching the observer. In this setting, we find that models with a black hole spin lower than 0.96 and an inclination lower than $50^\circ$ are disfavoured. △ Less

Submitted 8 February, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

Comments: 12 pages, 10 figures, 2 tables, accepted for publication in A&A

arXiv:2310.10957 [pdf, other]

Medical Image Segmentation via Sparse Coding Decoder

Authors: Long Zeng, Kaigui Wu

Abstract: Transformers have achieved significant success in medical image segmentation, owing to its capability to capture long-range dependencies. Previous works incorporate convolutional layers into the encoder module of transformers, thereby enhancing their ability to learn local relationships among pixels. However, transformers may suffer from limited generalization capabilities and reduced robustness,… ▽ More Transformers have achieved significant success in medical image segmentation, owing to its capability to capture long-range dependencies. Previous works incorporate convolutional layers into the encoder module of transformers, thereby enhancing their ability to learn local relationships among pixels. However, transformers may suffer from limited generalization capabilities and reduced robustness, attributed to the insufficient spatial recovery ability of their decoders. To address this issue, A convolution sparse vector coding based decoder is proposed , namely CAScaded multi-layer Convolutional Sparse vector Coding DEcoder (CASCSCDE), which represents features extracted by the encoder using sparse vectors. To prove the effectiveness of our CASCSCDE, The widely-used TransUNet model is chosen for the demonstration purpose, and the CASCSCDE is incorporated with TransUNet to establish the TransCASCSCDE architecture. Our experiments demonstrate that TransUNet with CASCSCDE significantly enhances performance on the Synapse benchmark, obtaining up to 3.15\% and 1.16\% improvements in DICE and mIoU scores, respectively. CASCSCDE opens new ways for constructing decoders based on convolutional sparse vector coding. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: 8 pages, 1 figures

MSC Class: 68T07; 68U10 ACM Class: I.4.6; I.4.7; I.5.1

arXiv:2310.10849 [pdf, other]

doi 10.1007/s10909-024-03100-6

Results and Limits of Time Division Multiplexing for the BICEP Array High Frequency Receivers

Authors: S. Fatigoni, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, V. Buza, J. Cheshire, J. Connors, J. Cornelison, M. Crumrine, A. J. Cukierman, E. V. Denison, M. I. Dierickx, L. Duband, M. Eiben, J. P. Filippini, A. Fortes, M. Gao, C. Giannakopoulos, N. Goeckner-Wald, D. C. Goldfinger , et al. (62 additional authors not shown)

Abstract: Time-Division Multiplexing is the readout architecture of choice for many ground and space experiments, as it is a very mature technology with proven outstanding low-frequency noise stability, which represents a central challenge in multiplexing. Once fully populated, each of the two BICEP Array high frequency receivers, observing at 150GHz and 220/270GHz, will have 7776 TES detectors tiled on the… ▽ More Time-Division Multiplexing is the readout architecture of choice for many ground and space experiments, as it is a very mature technology with proven outstanding low-frequency noise stability, which represents a central challenge in multiplexing. Once fully populated, each of the two BICEP Array high frequency receivers, observing at 150GHz and 220/270GHz, will have 7776 TES detectors tiled on the focal plane. The constraints set by these two receivers required a redesign of the warm readout electronics. The new version of the standard Multi Channel Electronics, developed and built at the University of British Columbia, is presented here for the first time. BICEP Array operates Time Division Multiplexing readout technology to the limits of its capabilities in terms of multiplexing rate, noise and crosstalk, and applies them in rigorously demanding scientific application requiring extreme noise performance and systematic error control. Future experiments like CMB-S4 plan to use TES bolometers with Time Division/SQUID-based readout for an even larger number of detectors. △ Less

Submitted 24 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: 10 pages, 7 figures, Submitted to Journal of Low Temperature Physics

Journal ref: Journal of Low Temperature Physics (2024)

arXiv:2310.07276 [pdf, other]

BioT5: Enriching Cross-modal Integration in Biology with Chemical Knowledge and Natural Language Associations

Authors: Qizhi Pei, Wei Zhang, **hua Zhu, Kehan Wu, Kaiyuan Gao, Lijun Wu, Yingce Xia, Rui Yan

Abstract: Recent advancements in biological research leverage the integration of molecules, proteins, and natural language to enhance drug discovery. However, current models exhibit several limitations, such as the generation of invalid molecular SMILES, underutilization of contextual information, and equal treatment of structured and unstructured knowledge. To address these issues, we propose… ▽ More Recent advancements in biological research leverage the integration of molecules, proteins, and natural language to enhance drug discovery. However, current models exhibit several limitations, such as the generation of invalid molecular SMILES, underutilization of contextual information, and equal treatment of structured and unstructured knowledge. To address these issues, we propose $\mathbf{BioT5}$, a comprehensive pre-training framework that enriches cross-modal integration in biology with chemical knowledge and natural language associations. $\mathbf{BioT5}$ utilizes SELFIES for $100%$ robust molecular representations and extracts knowledge from the surrounding context of bio-entities in unstructured biological literature. Furthermore, $\mathbf{BioT5}$ distinguishes between structured and unstructured knowledge, leading to more effective utilization of information. After fine-tuning, BioT5 shows superior performance across a wide range of tasks, demonstrating its strong capability of capturing underlying relations and properties of bio-entities. Our code is available at $\href{https://github.com/QizhiPei/BioT5}{Github}$. △ Less

Submitted 28 January, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

Comments: Accepted by Empirical Methods in Natural Language Processing 2023 (EMNLP 2023)

arXiv:2310.06788 [pdf, ps, other]

Discovery of a variable energy-dependent X-ray polarization in the accreting neutron star GX 5-1

Authors: Sergio Fabiani, Fiamma Capitanio, Rosario Iaria, Juri Poutanen, Andrea Gnarini, Francesco Ursini, Ruben Farinelli, Anna Bobrikova, James F. Steiner, Jiri Svoboda, Alessio Anitra, Maria C. Baglio, Francesco Carotenuto, Melania Del Santo, Carlo Ferrigno, Fraser Lewis, David M. Russell, Thomas D. Russell, Jakob van den Eijnden, Massimo Cocchi, Alessandro Di Marco, Fabio La Monaca, Kuan Liu, John Rankin, Martin C. Weisskopf , et al. (94 additional authors not shown)

Abstract: We report on the coordinated observations of the neutron star low-mass X-ray binary (NS-LMXB) \gx in X-rays (IXPE, NICER, Nustar and INTEGRAL), optical (REM and LCO), near-infrared (REM), mid-infrared (VLT VISIR), and radio (ATCA). This Z-source was observed by \IXPE twice in March-April 2023 (Obs. 1 and 2). In the radio band, the source was detected, but only upper-limits to the linear polarizati… ▽ More We report on the coordinated observations of the neutron star low-mass X-ray binary (NS-LMXB) \gx in X-rays (IXPE, NICER, Nustar and INTEGRAL), optical (REM and LCO), near-infrared (REM), mid-infrared (VLT VISIR), and radio (ATCA). This Z-source was observed by \IXPE twice in March-April 2023 (Obs. 1 and 2). In the radio band, the source was detected, but only upper-limits to the linear polarization were obtained at a $3σ$ level of $6.1\%$ at 5.5 GHz and $5.9\%$ at 9 GHz in Obs.~1 and $12.5\%$ at 5.5~GHz and $20\%$ at 9~GHz in Obs.~2. The mid-IR, near-IR and optical observations suggest the presence of a compact jet which peaks in the mid- or far-IR. The X-ray polarization degree was found to be $3.7\% \pm 0.4 \%$ (at $90\%$ confidence level) during Obs.~1 when the source was in the horizontal branch of the Z-track and $1.8\% \pm 0.4 \%$ during Obs.~2 when the source was in the normal-flaring branch. These results confirm the variation of polarization degree as a function of the position of the source in the color-color diagram as for previously observed Z-track sources (Cyg~X-2 and XTE~1701$-$462). Evidence for a variation of the polarization angle $\sim 20^\circ$ with energy is found in both observations, likely related to the different, non-orthogonal polarization angles of the disk and Comptonization components which peak at different energies. △ Less

Submitted 9 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: Submitted to Astronomy and Astrophysics on 06 July 2023. Accepted on 21 November 2023

arXiv:2310.06097 [pdf, other]

doi 10.1051/0004-6361/202347408

Magnetic Field Properties inside the Jet of Mrk 421: Multiwavelength Polarimetry Including the Imaging X-ray Polarimetry Explorer

Authors: Dawoon E. Kim, Laura Di Gesu, Ioannis Liodakis, Alan P. Marscher, Svetlana G. Jorstad, Riccardo Midde, Herman L. Marshall, Luigi Pacciani, Iván Agudo, Fabrizio Tavecchio, Nicolò Cibrario, Stefano Tugliani, Raffaella Bonino, Michela Negro, Simonetta Puccetti, Francesco Tombesi, Enrico Costa, Immacolata Donnarumma, Paolo Soffitta, Tsunefumi Mizuno, Yasushi Fukazawa, Koji S. Kawabata, Tatsuya Nakaoka, Makoto Uemura, Ryo Imazawa , et al. (111 additional authors not shown)

Abstract: We conducted a polarimetry campaign from radio to X-ray wavelengths of the high-synchrotron-peak (HSP) blazar Mrk 421, including Imaging X-ray Polarimetry Explorer (IXPE) measurements on 2022 December 6-8. We detected X-ray polarization of Mrk 421 with a degree of $Π_{\rm X}$=14$\pm$1$\%$ and an electric-vector position angle $ψ_{\rm X}$=107$\pm$3$^{\circ}$ in the 2-8 keV band. From the time varia… ▽ More We conducted a polarimetry campaign from radio to X-ray wavelengths of the high-synchrotron-peak (HSP) blazar Mrk 421, including Imaging X-ray Polarimetry Explorer (IXPE) measurements on 2022 December 6-8. We detected X-ray polarization of Mrk 421 with a degree of $Π_{\rm X}$=14$\pm$1$\%$ and an electric-vector position angle $ψ_{\rm X}$=107$\pm$3$^{\circ}$ in the 2-8 keV band. From the time variability analysis, we find a significant episodic variation in $ψ_{\rm X}$. During 7 months from the first IXPE pointing of Mrk 421 in 2022 May, $ψ_{\rm X}$ varied across the range of 0$^{\circ}$ to 180$^{\circ}$, while $Π_{\rm X}$ maintained similar values within $\sim$10-15$\%$. Furthermore, a swing in $ψ_{\rm X}$ in 2022 June was accompanied by simultaneous spectral variations. The results of the multiwavelength polarimetry show that the X-ray polarization degree was generally $\sim$2-3 times greater than that at longer wavelengths, while the polarization angle fluctuated. Additionally, based on radio, infrared, and optical polarimetry, we find that rotation of $ψ$ occurred in the opposite direction with respect to the rotation of $ψ_{\rm X}$ over longer timescales at similar epochs. The polarization behavior observed across multiple wavelengths is consistent with previous IXPE findings for HSP blazars. This result favors the energy-stratified shock model developed to explain variable emission in relativistic jets. The accompanying spectral variation during the $ψ_{\rm X}$ rotation can be explained by a fluctuation in the physical conditions, e.g., in the energy distribution of relativistic electrons. The opposite rotation direction of $ψ$ between the X-ray and longer-wavelength polarization accentuates the conclusion that the X-ray emitting region is spatially separated from that at longer wavelengths. △ Less

Submitted 9 October, 2023; originally announced October 2023.

Comments: 17 pages, 13 figures, 4 tables; Accepted for publication in A&A

Journal ref: A&A 681, A12 (2024)

arXiv:2310.05035 [pdf, other]

Self-Convinced Prompting: Few-Shot Question Answering with Repeated Introspection

Authors: Haodi Zhang, Min Cai, Xinhe Zhang, Chen Jason Zhang, Rui Mao, Kaishun Wu

Abstract: While large language models (LLMs) such as ChatGPT and PaLM have demonstrated remarkable performance in various language understanding and generation tasks, their capabilities in complex reasoning and intricate knowledge utilization still fall short of human-level proficiency. Recent studies have established the effectiveness of prompts in steering LLMs towards generating desired outputs. Building… ▽ More While large language models (LLMs) such as ChatGPT and PaLM have demonstrated remarkable performance in various language understanding and generation tasks, their capabilities in complex reasoning and intricate knowledge utilization still fall short of human-level proficiency. Recent studies have established the effectiveness of prompts in steering LLMs towards generating desired outputs. Building on these insights, we introduce a novel framework that harnesses the potential of large-scale pre-trained language models, to iteratively enhance performance of the LLMs. Our framework incorporates three components: \textit{Normal CoT}, a \textit{Convincer}, and an \textit{Answerer}. It processes the output of a typical few-shot chain-of-thought prompt, assesses the correctness of the response, scrutinizes the answer, refines the reasoning, and ultimately produces a new solution. Experimental results on the 7 datasets of miscellaneous problems validate the efficacy of the Self-Convince framework, achieving substantial improvements compared to the baselines. This study contributes to the burgeoning body of research focused on integrating pre-trained language models with tailored prompts and iterative refinement processes to augment their performance in complex tasks. △ Less

Submitted 10 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

arXiv:2310.04807 [pdf, other]

OEDG: Oscillation-eliminating discontinuous Galerkin method for hyperbolic conservation laws

Authors: Manting Peng, Zheng Sun, Kailiang Wu

Abstract: Controlling spurious oscillations is crucial for designing reliable numerical schemes for hyperbolic conservation laws. This paper proposes a novel, robust, and efficient oscillation-eliminating discontinuous Galerkin (OEDG) method on general meshes, motivated by the dam** technique in [Lu, Liu, and Shu, SIAM J. Numer. Anal., 59:1299-1324, 2021]. The OEDG method incorporates an OE procedure afte… ▽ More Controlling spurious oscillations is crucial for designing reliable numerical schemes for hyperbolic conservation laws. This paper proposes a novel, robust, and efficient oscillation-eliminating discontinuous Galerkin (OEDG) method on general meshes, motivated by the dam** technique in [Lu, Liu, and Shu, SIAM J. Numer. Anal., 59:1299-1324, 2021]. The OEDG method incorporates an OE procedure after each Runge-Kutta stage, devised by alternately evolving conventional semidiscrete DG scheme and a dam** equation. A novel dam** operator is carefully designed to possess scale-invariant and evolution-invariant properties. We rigorously prove optimal error estimates of the fully discrete OEDG method for linear scalar conservation laws. This might be the first generic fully-discrete error estimates for nonlinear DG schemes with automatic oscillation control mechanism. The OEDG method exhibits many notable advantages. It effectively eliminates spurious oscillations for challenging problems across various scales and wave speeds, without problem-specific parameters. It obviates the need for characteristic decomposition in hyperbolic systems. It retains key properties of conventional DG method, such as conservation, optimal convergence rates, and superconvergence. Moreover, it remains stable under normal CFL condition. The OE procedure is non-intrusive, facilitating integration into existing DG codes as an independent module. Its implementation is easy and efficient, involving only simple multiplications of modal coefficients by scalars. The OEDG approach provides new insights into the dam** mechanism for oscillation control. It reveals the role of dam** operator as a modal filter and establishes close relations between the dam** and spectral viscosity techniques. Extensive numerical results confirm the theoretical analysis and validate the effectiveness and advantages of the OEDG method. △ Less

Submitted 7 October, 2023; originally announced October 2023.

Comments: 37 pages, 14 figures, 6 tables

arXiv:2310.04751 [pdf]

FEcMD: A multi-physics and multi-scale computational program for electron emission characteristics dynamically coupled with atomic structure in metal nano-emitters

Authors: Nan Li, Xinyu Gao, Xianghui Feng, Kai Wu, Yonghong Cheng, Bing Xiao

Abstract: Field emission coupled with molecular dynamics simulation (FEcMD) software package is a computational tool for studying the electron emission characteristics and the atomic structure evolution of micro- and nano-protrusions made of pure metals or multi-component alloys by means of multi-physics and multi-scale methodology. The implementations of molecular dynamics, the electrodynamics, and the hea… ▽ More Field emission coupled with molecular dynamics simulation (FEcMD) software package is a computational tool for studying the electron emission characteristics and the atomic structure evolution of micro- and nano-protrusions made of pure metals or multi-component alloys by means of multi-physics and multi-scale methodology. The implementations of molecular dynamics, the electrodynamics, and the heat conduction in FEcMD program are addressed. For molecular dynamics simulation, the Lennard-Jones potentials, embedded atomic method (EAM), and moment tensor potentials (MTP) are fully supported for both alloys and pure metals. In the electrodynamics, the FEcMD program incorporates the space charge fields (space charge potential and exchange-correlation effects) in the Wentzel-Kramers-Brillouin-Jeffreys (WKBJ) approximation to evaluate the field emission current density more reliably for nano-gaps between two metal electrodes. Additionally, the advanced two-temperature heat conduction model is implemented in FEcMD program, and which provides more reliable descriptions for the temperature evolutions of electron and phonon subsystems under the radiofrequency (RF) or pulse electric fields. Comprehensive benchmark tests are performed for each module in FEcMD software to validate the numerical results, and also to access the accuracy and efficiency of the implemented algorithms. Finally, some typical applications of FEcMD program are also demonstrated for understanding the evolution of temperature and electric field coupled with the dynamic changing of atomic structures for metal micro- and nano-protrusions. △ Less

Submitted 7 October, 2023; originally announced October 2023.

arXiv:2310.04696 [pdf, other]

Serving Deep Learning Model in Relational Databases

Authors: Alexandre Eichenberger, Qi Lin, Saif Masood, Hong Min, Alexander Sim, Jie Wang, Yida Wang, Kesheng Wu, Binhang Yuan, Lixi Zhou, Jia Zou

Abstract: Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains, sparking growing interest recently. In this visionary paper, we embark on a comprehensive exploration of representative architectures to address the requirement. We highlight three pivotal paradigms: The state-of-the-artDL-Centricarchitecture offloadsDL computati… ▽ More Serving deep learning (DL) models on relational data has become a critical requirement across diverse commercial and scientific domains, sparking growing interest recently. In this visionary paper, we embark on a comprehensive exploration of representative architectures to address the requirement. We highlight three pivotal paradigms: The state-of-the-artDL-Centricarchitecture offloadsDL computations to dedicated DL frameworks. The potential UDF-Centric architecture encapsulates one or more tensor computations into User Defined Functions (UDFs) within the database system. The potentialRelation-Centricarchitecture aims to represent a large-scale tensor computation through relational operators. While each of these architectures demonstrates promise in specific use scenarios, we identify urgent requirements for seamless integration of these architectures and the middle ground between these architectures. We delve into the gaps that impede the integration and explore innovative strategies to close them. We present a pathway to establish a novel database system for enabling a broad class of data-intensive DL inference applications. △ Less

Submitted 9 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

Comments: Authors are ordered alphabetically; Jia Zou is the corresponding author

arXiv:2310.04482 [pdf, other]

EMOFM: Ensemble MLP mOdel with Feature-based Mixers for Click-Through Rate Prediction

Authors: Yujian Betterest Li, Kai Wu

Abstract: Track one of CTI competition is on click-through rate (CTR) prediction. The dataset contains millions of records and each field-wise feature in a record consists of hashed integers for privacy. For this task, the keys of network-based methods might be type-wise feature extraction and information fusion across different fields. Multi-layer perceptrons (MLPs) are able to extract field feature, but c… ▽ More Track one of CTI competition is on click-through rate (CTR) prediction. The dataset contains millions of records and each field-wise feature in a record consists of hashed integers for privacy. For this task, the keys of network-based methods might be type-wise feature extraction and information fusion across different fields. Multi-layer perceptrons (MLPs) are able to extract field feature, but could not efficiently fuse features. Motivated by the natural fusion characteristic of cross attention and the efficiency of transformer-based structures, we propose simple plug-in mixers for field/type-wise feature fusion, and thus construct an field&type-wise ensemble model, namely EMOFM (Ensemble MLP mOdel with Feature-based Mixers). In the experiments, the proposed model is evaluated on the dataset, the optimization process is visualized and ablation studies are explored. It is shown that EMOFM outperforms compared baselines. In the end, we discuss on future work. WARNING: The comparison might not be fair enough since the proposed method is designed for this data in particular while compared methods are not. For example, EMOFM especially takes different types of interactions into consideration while others do not. Anyway, we do hope that the ideas inside our method could help other developers/learners/researchers/thinkers and so on. △ Less

Submitted 15 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

arXiv:2310.04260 [pdf, other]

An X-rays-to-radio investigation of the nuclear polarization from the radio-galaxy Centaurus A

Authors: Frédéric Marin, Thibault Barnouin, Steven R. Ehlert, Abel Lawrence Peirson, Enrique Lopez-Rodriguez, Maria Petropoulou, Kinwah Wu, Iván Martí-Vidal

Abstract: Centaurus A is one of the closest radio-galaxies to Earth. Its proximity allowed us to extensively study its active galactic nucleus but the core emission mechanism remains elusive because of local strong dust and gas obscuration. The capability of polarimetry to shave-off contaminating emission has been exploited without success in the near-infrared by previous studies but the very recent measure… ▽ More Centaurus A is one of the closest radio-galaxies to Earth. Its proximity allowed us to extensively study its active galactic nucleus but the core emission mechanism remains elusive because of local strong dust and gas obscuration. The capability of polarimetry to shave-off contaminating emission has been exploited without success in the near-infrared by previous studies but the very recent measurement of the 2 - 8 keV polarization by the Imaging X-ray Polarimetry Explorer (IXPE) brought the question back to the fore. To determine what is the prevalent photon generation mechanism to the multi-wavelength emission from the core of Centaurus A, we retrieved from the archives the panchromatic polarization measurements of the central compact component. We built the total and polarized flux spectral energy distributions of the core and demonstrated that synchrotron self-Compton models nicely fit the polarized flux from the radio to the X-ray band. The linear polarization of the synchrotron continuum is perpendicular to the jet radio axis from the optical to the radio band, and parallel to it at higher energies. The observed smooth rotation of the polarization angle in the ultraviolet band is attributed to synchrotron emission from regions that are getting closer to the particle acceleration site, where the orientation of the jet's magnetic fields become perpendicular to the jet axis. This phenomenon support the shock acceleration mechanism for particle acceleration in Centaurus A, in line with IXPE observations of several high-synchrotron peak blazars. △ Less

Submitted 6 October, 2023; originally announced October 2023.

Comments: 10 pages, 6 figures, accepted for publication in MNRAS

MSC Class: 85-06 ACM Class: J.2.3; J.2.9

arXiv:2310.03297 [pdf, other]

Passive Respiration Detection via mmWave Communication Signal Under Interference

Authors: Kehan Wu, Renqi Chen, Haiyu Wang, Chenqing Ji, Jiayuan Zhu, Guang Wu

Abstract: Recent research has highlighted the detection of human respiration rate using commodity WiFi devices. Nevertheless, these devices encounter challenges in accurately discerning human respiration amidst the prevailing human motion interference encountered in daily life. To tackle this predicament, this paper introduces a passive sensing and communication system designed specifically for respiration… ▽ More Recent research has highlighted the detection of human respiration rate using commodity WiFi devices. Nevertheless, these devices encounter challenges in accurately discerning human respiration amidst the prevailing human motion interference encountered in daily life. To tackle this predicament, this paper introduces a passive sensing and communication system designed specifically for respiration detection in the presence of robust human motion interference. Operating within the 60.48 GHz band, the proposed system aims to detect human respiration even when confronted with substantial human motion interference within close proximity. Subsequently, a neural network is trained using the collected data by us to enable human respiration detection. The experimental results demonstrate a consistently high accuracy rate over 90\% of the human respiration detection under interference, given an adequate sensing duration. Finally, an empirical model is derived analytically to achieve the respiratory rate counting in 10 seconds. △ Less

Submitted 4 January, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: Submitted to WCNC2024 Workshop

arXiv:2310.02347 [pdf, other]

Strong Mixed-Integer Formulations for Transmission Expansion Planning with FACTS Devices

Authors: Kevin Wu, Mathieu Tanneau, Pascal Van Hentenryck

Abstract: Transmission Network Expansion Planning (TNEP) problems find the most economical way of expanding a given grid given long-term growth in generation capacity and demand patterns. The recent development of Flexible AC Transmission System (FACTS) devices, which can dynamically re-route power flows by adjusting individual branches' impedance, call for their integration into TNEP problems. However, the… ▽ More Transmission Network Expansion Planning (TNEP) problems find the most economical way of expanding a given grid given long-term growth in generation capacity and demand patterns. The recent development of Flexible AC Transmission System (FACTS) devices, which can dynamically re-route power flows by adjusting individual branches' impedance, call for their integration into TNEP problems. However, the resulting TNEP+FACTS formulations are significantly harder to solve than traditional TNEP instances, due to the nonlinearity of FACTS behavior. This paper proposes a new mixed-integer formulation for TNEP+FACTS, which directly represents the change in power flow induced by individual FACTS devices. The proposed formulation uses an extended formulation and facet-defining constraints, which are stronger than big-M constraints typically used in the literature. The paper conducts numerical experiments on a synthetic model of the Texas system with high renewable penetration. The results demonstrate the computational superiority of the proposed approach, which achieves a 4x speedup over state-of-the-art formulations, and highlight the potential of FACTS devices to mitigate congestion. △ Less

Submitted 8 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

arXiv:2310.01635 [pdf, other]

X-ray Polarization of the BL Lac Type Blazar 1ES 0229+200

Authors: Steven R. Ehlert, Ioannis Liodakis, Riccardo Middei, Alan P. Marscher, Fabrizio Tavecchio, Iván Agudo, Pouya M. Kouch, Elina Lindfors, Kari Nilsson, Ioannis Myserlis, Mark Gurwell, Ramprasad Rao, Francisco Jose Aceituno, Giacomo Bonnoli, Victor Casanova, Beatriz Agiz-Gonzalez, Juan Escudero, Jorge Otero Santos, Alfredo Sota, Emmanouil Angelakis, Alexander Kraus, Garrett K. Keating, Lucio A. Antonelli, Matteo Bachetti, Luca Baldini , et al. (88 additional authors not shown)

Abstract: We present polarization measurements in the $2-8 \thinspace \mathrm{keV}$ band from blazar 1ES 0229+200, the first extreme high synchrotron peaked source to be observed by the Imaging X-ray Polarimetry Explorer (IXPE). Combining two exposures separated by about two weeks, we find the degree of polarization to be $Π_{X} = 17.9 \pm 2.8 \%$ at an electric-vector position angle… ▽ More We present polarization measurements in the $2-8 \thinspace \mathrm{keV}$ band from blazar 1ES 0229+200, the first extreme high synchrotron peaked source to be observed by the Imaging X-ray Polarimetry Explorer (IXPE). Combining two exposures separated by about two weeks, we find the degree of polarization to be $Π_{X} = 17.9 \pm 2.8 \%$ at an electric-vector position angle $ψ_X = 25.0 \pm 4.6^{\circ}$ using a spectropolarimetric fit from joint IXPE and XMM-Newton observations. There is no evidence for the polarization degree or angle varying significantly with energy or time on both short time scales (hours) or longer time scales (days). The contemporaneous polarization degree at optical wavelengths was $>$7$\times$ lower, making 1ES 0229+200 the most strongly chromatic blazar yet observed. This high X-ray polarization compared to the optical provides further support that X-ray emission in high-peaked blazars originates in shock-accelerated, energy-stratified electron populations, but is in tension with many recent modeling efforts attempting to reproduce the spectral energy distribution of 1ES 0229+200 which attribute the extremely high energy synchrotron and Compton peaks to Fermi acceleration in the vicinity of strongly turbulent magnetic fields. △ Less

Submitted 2 October, 2023; originally announced October 2023.

Comments: 17 Pages, 6 Figures, Resubmitted to ApJ after addressing referee comments

arXiv:2310.01260 [pdf, other]

SPELL: Semantic Prompt Evolution based on a LLM

Authors: Yujian Betterest Li, Kai Wu

Abstract: Prompt engineering is a new paradigm for enhancing the performance of trained neural network models. For optimizing text-style prompts, existing methods usually individually operate small portions of a text step by step, which either breaks the fluency or could not globally adjust a prompt. Since large language models (LLMs) have powerful ability of generating coherent texts token by token, can we… ▽ More Prompt engineering is a new paradigm for enhancing the performance of trained neural network models. For optimizing text-style prompts, existing methods usually individually operate small portions of a text step by step, which either breaks the fluency or could not globally adjust a prompt. Since large language models (LLMs) have powerful ability of generating coherent texts token by token, can we utilize LLMs for improving prompts? Based on this motivation, in this paper, considering a trained LLM as a text generator, we attempt to design a black-box evolution algorithm for automatically optimizing texts, namely SPELL (Semantic Prompt Evolution based on a LLM). The proposed method is evaluated with different LLMs and evolution parameters in different text tasks. Experimental results show that SPELL could rapidly improve the prompts indeed. We further explore the evolution process and discuss on the limitations, potential possibilities and future work. △ Less

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2310.00902 [pdf, other]

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models

Authors: Yongchan Kwon, Eric Wu, Kevin Wu, James Zou

Abstract: Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image… ▽ More Quantifying the impact of training data points is crucial for understanding the outputs of machine learning models and for improving the transparency of the AI pipeline. The influence function is a principled and popular data attribution method, but its computational cost often makes it challenging to use. This issue becomes more pronounced in the setting of large language models and text-to-image models. In this work, we propose DataInf, an efficient influence approximation method that is practical for large-scale generative AI models. Leveraging an easy-to-compute closed-form expression, DataInf outperforms existing influence computation algorithms in terms of computational and memory efficiency. Our theoretical analysis shows that DataInf is particularly well-suited for parameter-efficient fine-tuning techniques such as LoRA. Through systematic empirical evaluations, we show that DataInf accurately approximates influence scores and is orders of magnitude faster than existing methods. In applications to RoBERTa-large, Llama-2-13B-chat, and stable-diffusion-v1.5 models, DataInf effectively identifies the most influential fine-tuning examples better than other approximate influence scores. Moreover, it can help to identify which data points are mislabeled. △ Less

Submitted 13 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

Comments: ICLR 2024

arXiv:2310.00153 [pdf]

Conformal Metamaterials with Active Tunability and Self-adaptivity for Magnetic Resonance Imaging

Authors: Ke Wu, Xia Zhu, Xiaoguang Zhao, Stephan W. Anderson, Xin Zhang

Abstract: Ongoing effort has been devoted to applying metamaterials to boost the imaging performance of magnetic resonance imaging owing to their unique capacity for electromagnetic field confinement and enhancement. However, there are still major obstacles to widespread clinical adoption of conventional metamaterials due to several notable restrictions, namely: their typically bulky and rigid structures, d… ▽ More Ongoing effort has been devoted to applying metamaterials to boost the imaging performance of magnetic resonance imaging owing to their unique capacity for electromagnetic field confinement and enhancement. However, there are still major obstacles to widespread clinical adoption of conventional metamaterials due to several notable restrictions, namely: their typically bulky and rigid structures, deviations in their optimal resonance frequency, and their inevitable interference with the transmission RF field in MRI. Herein, we address these restrictions and report a conformal, smart metamaterial, which may not only be readily tuned to achieve the desired, precise frequency match with MRI by a controlling circuit, but is also capable of selectively amplifying the magnetic field during the RF reception phase by sensing the excitation signal strength passively, thereby remaining off during the RF transmission phase and thereby ensuring its optimal performance when applied to MRI as an additive technology. By addressing a host of current technological challenges, the metamaterial presented herein paves the way toward the wide-ranging utilization of metamaterials in clinical MRI, thereby translating this promising technology to the MRI bedside. △ Less

Submitted 29 September, 2023; originally announced October 2023.

Comments: 21 pages, 7 figures

arXiv:2309.16067 [pdf, other]

The Polarized Cosmic Hand: IXPE Observations of PSR B1509-58/MSH 15-52

Authors: Roger W. Romani, Josephine Wong, Niccolo Di Lalla, Nicola Omodei, Fei Xie, C. -Y. Ng, Riccardo Ferrazzoli, Alessandro Di Marco, Niccolo Bucciantini, Maura Pilia, Patrick Slane, Martin C. Weisskopf, Simon Johnston, Marta Burgay, Deng Wei, Yi-Jung Yang, Shumeng Zhang, Lucio A. Antonelli, Matteo Bachetti, Luca Baldini, Wayne H. Baumgartner, Ronaldo Bellazzini, Stefano Bianchi, Stephen D. Bongiorno, Raffaella Bonino , et al. (78 additional authors not shown)

Abstract: We describe IXPE polarization observations of the Pulsar Wind Nebula (PWN) MSH15-52, the `Cosmic Hand'. We find X-ray polarization across the PWN, with B field vectors generally aligned with filamentary X-ray structures. High significance polarization is seen in arcs surrounding the pulsar and toward the end of the `jet', with polarization degree PD>70%, thus approaching the maximum allowed synchr… ▽ More We describe IXPE polarization observations of the Pulsar Wind Nebula (PWN) MSH15-52, the `Cosmic Hand'. We find X-ray polarization across the PWN, with B field vectors generally aligned with filamentary X-ray structures. High significance polarization is seen in arcs surrounding the pulsar and toward the end of the `jet', with polarization degree PD>70%, thus approaching the maximum allowed synchrotron value. In contrast, the base of the jet has lower polarization, indicating a complex magnetic field at significant angle to the jet axis. We also detect significant polarization from PSR B1509-58 itself. Although only the central pulse-phase bin of the pulse has high individual significance, flanking bins provide lower significance detections and, in conjunction with the X-ray image and radio polarization, can be used to constrain rotating vector model solutions for the pulsar geometry. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: To appear in the Astrophysical Journal

arXiv:2309.15928 [pdf, other]

Discovery of X-ray Polarization from the Black Hole Transient Swift J1727.8-1613

Authors: Alexandra Veledina, Fabio Muleri, Michal Dovciak, Juri Poutanen, Ajay Ratheesh, Fiamma Capitanio, Giorgio Matt, Paolo Soffitta, Allyn F. Tennant, Michela Negro, Philip Kaaret, Enrico Costa, Adam Ingram, Jiri Svoboda, Henric Krawczynski, Stefano Bianchi, James F. Steiner, Javier A. Garcia, Vadim Kravtsov, Anagha P. Nitindala, Melissa Ewing, Guglielmo Mastroserio, Andrea Marinucci, Francesco Ursini, Francesco Tombesi , et al. (91 additional authors not shown)

Abstract: We report the first detection of the X-ray polarization of the bright transient Swift J1727.8-1613 with the Imaging X-ray Polarimetry Explorer. The observation was performed at the beginning of the 2023 discovery outburst, when the source resided in the bright hard state. We find a time- and energy-averaged polarization degree of 4.1%+/-0.2% and a polarization angle of 2.2+/-1.3 degrees (errors at… ▽ More We report the first detection of the X-ray polarization of the bright transient Swift J1727.8-1613 with the Imaging X-ray Polarimetry Explorer. The observation was performed at the beginning of the 2023 discovery outburst, when the source resided in the bright hard state. We find a time- and energy-averaged polarization degree of 4.1%+/-0.2% and a polarization angle of 2.2+/-1.3 degrees (errors at 68% confidence level; this translates to about 20-sigma significance of the polarization detection). This finding suggests that the hot corona emitting the bulk of the detected X-rays is elongated, rather than spherical. The X-ray polarization angle is consistent with that found in sub-mm wavelengths. Since the sub-mm polarization was found to be aligned with the jet direction in other X-ray binaries, this indicates that the corona is elongated orthogonal to the jet. △ Less

Submitted 27 September, 2023; originally announced September 2023.

Comments: 10 pages, 3 figures, submitted

arXiv:2309.15266 [pdf, ps, other]

A New Spectral Conjugate Subgradient Method with Application in Computed Tomography Image Reconstruction

Authors: Milagros Loreto, Thomas Humphries, Chella Raghavan, Kenneth Wu, Sam Kwak

Abstract: A new spectral conjugate subgradient method is presented to solve nonsmooth unconstrained optimization problems. The method combines the spectral conjugate gradient method for smooth problems with the spectral subgradient method for nonsmooth problems. We study the effect of two different choices of line search, as well as three formulas for determining the conjugate directions. In addition to num… ▽ More A new spectral conjugate subgradient method is presented to solve nonsmooth unconstrained optimization problems. The method combines the spectral conjugate gradient method for smooth problems with the spectral subgradient method for nonsmooth problems. We study the effect of two different choices of line search, as well as three formulas for determining the conjugate directions. In addition to numerical experiments with standard nonsmooth test problems, we also apply the method to several image reconstruction problems in computed tomography, using total variation regularization. Performance profiles are used to compare the performance of the algorithm using different line search strategies and conjugate directions to that of the original spectral subgradient method. Our results show that the spectral conjugate subgradient algorithm outperforms the original spectral subgradient method, and that the use of the Polak-Ribiere formula for conjugate directions provides the best and most robust performance. △ Less

Submitted 5 June, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

Comments: 23 pages, 7 figures

MSC Class: 90C30; 90C56; 94A08

arXiv:2309.13890 [pdf, other]

Bitstream-Corrupted Video Recovery: A Novel Benchmark Dataset and Method

Authors: Tianyi Liu, Kejun Wu, Yi Wang, Wenyang Liu, Kim-Hui Yap, Lap-Pui Chau

Abstract: The past decade has witnessed great strides in video recovery by specialist technologies, like video inpainting, completion, and error concealment. However, they typically simulate the missing content by manual-designed error masks, thus failing to fill in the realistic video loss in video communication (e.g., telepresence, live streaming, and internet video) and multimedia forensics. To address t… ▽ More The past decade has witnessed great strides in video recovery by specialist technologies, like video inpainting, completion, and error concealment. However, they typically simulate the missing content by manual-designed error masks, thus failing to fill in the realistic video loss in video communication (e.g., telepresence, live streaming, and internet video) and multimedia forensics. To address this, we introduce the bitstream-corrupted video (BSCV) benchmark, the first benchmark dataset with more than 28,000 video clips, which can be used for bitstream-corrupted video recovery in the real world. The BSCV is a collection of 1) a proposed three-parameter corruption model for video bitstream, 2) a large-scale dataset containing rich error patterns, multiple corruption levels, and flexible dataset branches, and 3) a plug-and-play module in video recovery framework that serves as a benchmark. We evaluate state-of-the-art video inpainting methods on the BSCV dataset, demonstrating existing approaches' limitations and our framework's advantages in solving the bitstream-corrupted video recovery problem. The benchmark and dataset are released at https://github.com/LIUTIGHE/BSCV-Dataset. △ Less

Submitted 26 September, 2023; v1 submitted 25 September, 2023; originally announced September 2023.

Comments: Accepted by NeurIPS Dataset and Benchmark Track 2023

arXiv:2309.12694 [pdf, other]

Recurrent Temporal Revision Graph Networks

Authors: Yizhou Chen, Anxiang Zeng, Guangda Huzhang, Qingtao Yu, Kerui Zhang, Cao Yuanpeng, Kangle Wu, Han Yu, Zhiming Zhou

Abstract: Temporal graphs offer more accurate modeling of many real-world scenarios than static graphs. However, neighbor aggregation, a critical building block of graph networks, for temporal graphs, is currently straightforwardly extended from that of static graphs. It can be computationally expensive when involving all historical neighbors during such aggregation. In practice, typically only a subset of… ▽ More Temporal graphs offer more accurate modeling of many real-world scenarios than static graphs. However, neighbor aggregation, a critical building block of graph networks, for temporal graphs, is currently straightforwardly extended from that of static graphs. It can be computationally expensive when involving all historical neighbors during such aggregation. In practice, typically only a subset of the most recent neighbors are involved. However, such subsampling leads to incomplete and biased neighbor information. To address this limitation, we propose a novel framework for temporal neighbor aggregation that uses the recurrent neural network with node-wise hidden states to integrate information from all historical neighbors for each node to acquire the complete neighbor information. We demonstrate the superior theoretical expressiveness of the proposed framework as well as its state-of-the-art performance in real-world applications. Notably, it achieves a significant +9.6% improvement on averaged precision in a real-world Ecommerce dataset over existing methods on 2-layer models. △ Less

Submitted 25 September, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

Showing 151–200 of 1,302 results for author: Wu, K