Search | arXiv e-print repository

arXiv:2402.19202 [pdf, other]

doi 10.1051/0004-6361/202449394

VLBI observations of the high-redshift X-ray bright blazar SRGE J170245.3+130104

Authors: Yuanqi Liu, Tao An, Shaoguang Guo, Yingkang Zhang, Ailing Wang, Zhijun Xu, Georgii Khorunzhev, Yulia Sotnikova, Timur Mufakharov, Alexander Mikhailov, Marat Mingaliev

Abstract: Aims. The X-ray luminous and radio-loud AGN SRGE J170245.3+130104, discovered at z $\sim$ 5.5, provides a unique opportunity to probe SMBH growth and evolution with powerful jets in the early Universe. Methods. We present 1.35 - 5.1 GHz Very Long Baseline Array (VLBA) results on the radio continuum emission and spectrum analysis for this quasar in a low flux density state. Results. This source is unresolved at three frequencies, with total flux densities of 8.35$\pm$0.09 mJy beam$^{-1}$, 7.47$\pm$0.08 mJy beam$^{-1}$, and 6.57$\pm$0.02 mJy beam$^{-1}$ at 1.73 GHz, 2.26 GHz, and 4.87 GHz, respectively. Meanwhile, the brightness temperature is higher than $10^{9}$ K. Conclusions. Compared with previous radio observations with arcsec-scale resolution, nearly all the radio emission from this source is concentrated in the very central milli-arcsecond (mas) scale area. We confirm this source is a bright blazar at z > 5. This young AGN provides a great opportunity to understand the first generation of strong jets in the early Universe.

Submitted 29 February, 2024; originally announced February 2024.

Comments: 5 pages, 2 figures, Accepted for publication in Astronomy & Astrophysics

Journal ref: A&A 685, A111 (2024)

arXiv:2402.18965 [pdf, other]

Induced Gravitational Wave interpretation of PTA data: a complete study for general equation of state

Authors: Guillem Domènech, Shi Pi, Ao Wang, Jianing Wang

Abstract: We thoroughly study the induced gravitational wave interpretation of the possible gravitational wave background reported by PTA collaborations, considering the unknown equation of state $w$ of the early universe. We perform a Bayesian analysis of the NANOGrav data using the publicly available \textsc{PTArcade} code together with \textsc{SIGWfast} for the numerical integration of the induced gravitational wave spectrum. We focus on two cases: a monochromatic and a log-normal primordial spectrum of fluctuations. For the log-normal spectrum, we show that, while the results are not very sensitive to $w$ when the GW peak is close to the PTA window, radiation domination is out of the $2σ$ contours when only the infra-red power-law tail contributes. For the monochromatic spectrum, the $2σ$ bounds yield $0.1\lesssim w\lesssim0.9$ so that radiation domination is close to the central value. We also investigate the primordial black hole (PBH) counterpart using the peak formalism. We show that, in general terms, a larger width and a stiffer equation of state alleviate the overproduction of PBHs. No PBH overproduction requires $w\gtrsim0.42$ up to the $2σ$ level for the monochromatic spectrum. Furthermore, including bounds from the cosmic microwave background, we find in general that the mass range of the PBH counterpart is bounded by $10^{-5} M_\odot\lesssim M_{\rm PBH}\lesssim10^{-1} M_\odot$. Lastly, we find that the PTA signal can explain the microlensing events reported by OGLE for $0.42\lesssim w\lesssim 0.50$. Our work showcases a complete treatment of induced gravitational waves and primordial black holes for general $w$ for future data analysis.

Submitted 8 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

Comments: 22 pages, 4 figures, minor revisions, references added

arXiv:2402.18270 [pdf, other]

FPM-WSI: Fourier ptychographic whole slide imaging via feature-domain backdiffraction

Authors: Shuhe Zhang, Aiye Wang, Jinghao Xu, Tianci Feng, Jinhua Zhou, An Pan

Abstract: Fourier ptychographic microscopy (FPM), characterized by high-throughput computational imaging, theoretically provides a cunning solution to the trade-off between spatial resolution and field of view (FOV), which has a promising prospect in the application of digital pathology. However, block reconstruction and then stitching has currently become an unavoidable procedure due to vignetting effects. The stitched image tends to present color inconsistency in different image segments, or even stitching artifacts. In response, we report a computational framework based on feature-domain backdiffraction to realize full-FOV, stitching-free FPM reconstruction. Different from conventional algorithms that establish the loss function in the image domain, our method formulates it in the feature domain, where effective information of images is extracted by a feature extractor to bypass the vignetting effect. The feature-domain error between predicted images based on estimation of model parameters and practically captured images is then digitally diffracted back through the optical system for complex amplitude reconstruction and aberration compensation. Through massive simulations and experiments, the method presents effective elimination of vignetting artifacts, and reduces the requirement of precise knowledge of illumination positions. We also found its great potential to recover the data with a lower overlapping rate of spectrum and to realize automatic blind-digital refocusing without a prior defocus distance.

Submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.18026 [pdf, other]

doi 10.1103/PhysRevD.110.022001

Characterization of the Astrophysical Diffuse Neutrino Flux using Starting Track Events in IceCube

Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, et al. (394 additional authors not shown)

Abstract: A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to not only more effectively reject atmospheric muons but also atmospheric neutrino backgrounds in the southern sky, opening a new window to the sub-100 TeV astrophysical neutrino sky. The event selection is constructed using a dynamic starting track veto and machine learning algorithms. We use this data to measure the astrophysical diffuse flux as a single power law flux (SPL) with a best-fit spectral index of $γ= 2.58 ^{+0.10}_{-0.09}$ and per-flavor normalization of $φ^{\mathrm{Astro}}_{\mathrm{per-flavor}} = 1.68 ^{+0.19}_{-0.22} \times 10^{-18} \times \mathrm{GeV}^{-1} \mathrm{cm}^{-2} \mathrm{s}^{-1} \mathrm{sr}^{-1}$ (at 100 TeV). The sensitive energy range for this dataset is 3 - 550 TeV under the SPL assumption. This data was also used to measure the flux under a broken power law; however, we did not find any evidence of a low energy cutoff.

Submitted 27 February, 2024; originally announced February 2024.

Comments: 27 pages, 28 figures

Journal ref: Phys. Rev. D 110, 022001 (2024)

arXiv:2402.17685 [pdf, other]

Attosecond X-ray Chronoscopy of Core-level Photoemission

Authors: Jia-Bao Ji, Zhaoheng Guo, Taran Driver, Cynthia S. Trevisan, David Cesar, Xinxin Cheng, Joseph Duris, Paris L. Franz, James Glownia, Xiaochun Gong, Daniel Hammerland, Meng Han, Saijoscha Heck, Matthias Hoffmann, Andrei Kamalov, Kirk A. Larsen, Xiang Li, Ming-Fu Lin, Yuchen Liu, C. William McCurdy, Razib Obaid, Jordan T. O'Neal, Thomas N. Rescigno, River R. Robles, Nicholas Sudar, et al. (10 additional authors not shown)

Abstract: Attosecond photoemission or photoionization delays are a unique probe of the structure and the electronic dynamics of matter. However, spectral congestion and spatial delocalization of valence electron wave functions set fundamental limits to the complexity of systems that can be studied and the information that can be retrieved, respectively. Using attosecond X-ray pulses from LCLS, we demonstrate the key advantages of measuring core-level delays: the photoelectron spectra remain atom-like, the measurements become element specific and the observed scattering dynamics originate from a point-like source. We exploit these unique features to reveal the effects of electronegativity and symmetry on attosecond scattering dynamics by measuring the photoionization delays between N-1s and C-1s core shells of a series of aromatic azabenzene molecules. Remarkably, the delays systematically increase with the number of nitrogen atoms in the molecule and reveal multiple resonances. We identify two previously unknown mechanisms regulating the associated attosecond dynamics, namely the enhanced confinement of the trapped wavefunction with increasing electronegativity of the atoms and the decrease of the coupling strength among the photoemitted partial waves with increasing symmetry. This study demonstrates the unique opportunities opened by measurements of core-level photoionization delays for unravelling attosecond electron dynamics in complex matter.

Submitted 8 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.17370 [pdf, other]

An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary

Authors: Guodong Sun, Yuting Peng, Le Cheng, Mengya Xu, An Wang, Bo Wu, Hongliang Ren, Yang Zhang

Abstract: The precise segmentation of ore images is critical to the successful execution of the beneficiation process. Due to the homogeneous appearance of the ores, which leads to low contrast and unclear boundaries, accurate segmentation becomes challenging, and recognition becomes problematic. This paper proposes a lightweight framework based on Multi-Layer Perceptron (MLP), which focuses on solving the problem of edge burring. Specifically, we introduce a lightweight backbone better suited for efficiently extracting low-level features. Besides, we design a feature pyramid network consisting of two MLP structures that balance local and global information, thus enhancing detection accuracy. Furthermore, we propose a novel loss function that guides the prediction points to match the instance edge points to achieve clear object boundaries. We have conducted extensive experiments to validate the efficacy of our proposed method. Our approach achieves a remarkable processing speed of over 27 frames per second (FPS) with a model size of only 73 MB. Moreover, our method delivers a consistently high level of accuracy, with impressive performance scores of 60.4 and 48.9 in $AP_{50}^{box}$ and $AP_{50}^{mask}$ respectively, as compared to the currently available state-of-the-art techniques, when tested on the ore image dataset. The source code will be released at https://github.com/MVME-HBUT/ORENEXT.

Submitted 27 February, 2024; originally announced February 2024.

Comments: 10 pages, 8 figures

arXiv:2402.16641 [pdf, other]

Towards Open-ended Visual Quality Comparison

Authors: Haoning Wu, Hanwei Zhu, Zicheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Chunyi Li, Annan Wang, Wenxiu Sun, Qiong Yan, Xiaohong Liu, Guangtao Zhai, Shiqi Wang, Weisi Lin

Abstract: Comparative settings (e.g. pairwise choice, listwise ranking) have been adopted by a wide range of subjective studies for image quality assessment (IQA), as they inherently standardize the evaluation criteria across different observers and offer more clear-cut responses. In this work, we extend the edge of emerging large multi-modality models (LMMs) to further advance visual quality comparison into open-ended settings that 1) can respond to open-range questions on quality comparison; 2) can provide detailed reasonings beyond direct answers. To this end, we propose the Co-Instruct. To train this first-of-its-kind open-source open-ended visual quality comparer, we collect the Co-Instruct-562K dataset from two sources: (a) LLM-merged single image quality description, (b) GPT-4V "teacher" responses on unlabeled data. Furthermore, to better evaluate this setting, we propose the MICBench, the first benchmark on multi-image comparison for LMMs. We demonstrate that Co-Instruct not only achieves on average 30% higher accuracy than state-of-the-art open-source LMMs, but also outperforms GPT-4V (its teacher), on both existing related benchmarks and the proposed MICBench. Our model is published at https://huggingface.co/q-future/co-instruct.

Submitted 4 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

Comments: Fix typos

arXiv:2402.15631 [pdf, other]

Fine-Grained Self-Endorsement Improves Factuality and Reasoning

Authors: Ante Wang, Linfeng Song, Baolin Peng, Ye Tian, Lifeng Jin, Haitao Mi, Jinsong Su, Dong Yu

Abstract: This work studies improving large language model (LLM) generations at inference time by mitigating fact-conflicting hallucinations. Particularly, we propose a self-endorsement framework that leverages the fine-grained fact-level comparisons across multiple sampled responses. Compared with prior ensemble methods (Wang et al., 2022; Chen et al., 2023) that perform response-level selection, our approach can better alleviate hallucinations, especially for longform generation tasks. Our approach can broadly benefit smaller and open-source LLMs as it mainly conducts simple content-based comparisons. Experiments on Biographies show that our method can effectively improve the factuality of generations with simple and intuitive prompts across different scales of LLMs. Besides, comprehensive analyses on TriviaQA and GSM8K demonstrate the potential of self-endorsement for broader application.

Submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.15605 [pdf, other]

doi 10.1145/3613904.3642813

Do You See What I See? A Qualitative Study Eliciting High-Level Visualization Comprehension

Authors: Ghulam Jilani Quadri, Arran Zeyu Wang, Zhehao Wang, Jennifer Adorno, Paul Rosen, Danielle Albers Szafir

Abstract: Designers often create visualizations to achieve specific high-level analytical or communication goals. These goals require people to naturally extract complex, contextualized, and interconnected patterns in data. While limited prior work has studied general high-level interpretation, prevailing perceptual studies of visualization effectiveness primarily focus on isolated, predefined, low-level tasks, such as estimating statistical quantities. This study more holistically explores visualization interpretation to examine the alignment between designers' communicative goals and what their audience sees in a visualization, which we refer to as their comprehension. We found that statistics people effectively estimate from visualizations in classical graphical perception studies may differ from the patterns people intuitively comprehend in a visualization. We conducted a qualitative study on three types of visualizations -- line graphs, bar graphs, and scatterplots -- to investigate the high-level patterns people naturally draw from a visualization. Participants described a series of graphs using natural language and think-aloud protocols. We found that comprehension varies with a range of factors, including graph complexity and data distribution. Specifically, 1) a visualization's stated objective often does not align with people's comprehension, 2) results from traditional experiments may not predict the knowledge people build with a graph, and 3) chart type alone is insufficient to predict the information people extract from a graph. Our study confirms the importance of defining visualization effectiveness from multiple perspectives to assess and inform visualization practices.

Submitted 23 February, 2024; originally announced February 2024.

Comments: Accepted for publication in ACM CHI 2024

Journal ref: In Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11-16, 2024, Honolulu, HI, USA

arXiv:2402.14477 [pdf, other]

Pressure tunable magnetic skyrmion phase in Co8Zn8Mn4 single crystals

Authors: Zhun Li, Xinrun Mi, Xinming Wang, Jian Lyu, Na Su, Aifeng Wang, Yisheng Chai, Bao Yuan, Wanju Luo, Hui Cheng, Jianxiang Gao, Hongliang Wang, Lijie Hao, Mingquan He, Junying Shen, Young Sun, Xin Tong

Abstract: In a magnetic skyrmion phase, magnetic moments form vortex-like topological textures which are of both fundamental and industrial interest. In $β$-Mn-type Co-Zn-Mn alloys, chiral magnetic skyrmions emerge above room temperature, providing a unique system for studying skyrmion physics and exploring spintronics applications. However, the magnetic skyrmion phase is typically confined to a narrow temperature ($T$) and magnetic field ($H$) range. Here, we demonstrate that hydrostatic pressure can expand the skyrmion phase in the $T-H$ phase diagram of single-crystalline Co$_8$Zn$_8$Mn$_4$. At ambient pressure, signatures of skyrmions are seen within $T\sim302-308$ K and $H\sim50-100$ Oe. Applying a moderate pressure of 6 kbar extends this range to $T\sim300-310$ K and $H\sim50-150$ Oe. However, further escalation of pressure to 10 kbar results in a slight contraction of the skyrmion phase. These findings underscore the sensitivity of the skyrmion phase in Co$_8$Zn$_8$Mn$_4$ to external pressures, and hint at the potential of strain engineering, particularly in $β$-Mn-type Co-Zn-Mn thin films, as a promising avenue to customize the skyrmion phase.

Submitted 22 February, 2024; originally announced February 2024.

Comments: 7 pages, 4 figures

arXiv:2402.14367 [pdf, other]

Representation Learning for Frequent Subgraph Mining

Authors: Rex Ying, Tianyu Fu, Andrew Wang, Jiaxuan You, Yu Wang, Jure Leskovec

Abstract: Identifying frequent subgraphs, also called network motifs, is crucial in analyzing and predicting properties of real-world networks. However, finding large commonly-occurring motifs remains a challenging problem not only due to its NP-hard subroutine of subgraph counting, but also the exponential growth of the number of possible subgraph patterns. Here we present Subgraph Pattern Miner (SPMiner), a novel neural approach for approximately finding frequent subgraphs in a large target graph. SPMiner combines graph neural networks, order embedding space, and an efficient search strategy to identify network subgraph patterns that appear most frequently in the target graph. SPMiner first decomposes the target graph into many overlapping subgraphs and then encodes each subgraph into an order embedding space. SPMiner then uses a monotonic walk in the order embedding space to identify frequent motifs. Compared to existing approaches and possible neural alternatives, SPMiner is more accurate, faster, and more scalable. For 5- and 6-node motifs, we show that SPMiner can almost perfectly identify the most frequent motifs while being 100x faster than exact enumeration methods. In addition, SPMiner can also reliably identify frequent 10-node motifs, which is well beyond the size limit of exact enumeration approaches. Finally, we show that SPMiner can find large motifs of up to 20 nodes with 10-100x higher frequency than those found by current approximate methods.

Submitted 22 February, 2024; originally announced February 2024.

Comments: Oral Presentation in The Graph Representation Learning and Beyond (GRL+) Workshop from The 37th International Conference on Machine Learning, 2020

arXiv:2402.13929 [pdf, other]

SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Authors: Shanchuan Lin, Anran Wang, Xiao Yang

Abstract: We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our distilled SDXL-Lightning models both as LoRA and full UNet weights.

Submitted 2 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

arXiv:2402.13678 [pdf, other]

Weak Poincaré inequality comparisons for ideal and hybrid slice sampling

Authors: Sam Power, Daniel Rudolf, Björn Sprungk, Andi Q. Wang

Abstract: Using the framework of weak Poincaré inequalities, we provide a general comparison between the Hybrid and Ideal Slice Sampling Markov chains in terms of their Dirichlet forms. In particular, under suitable assumptions Hybrid Slice Sampling will inherit fast convergence from Ideal Slice Sampling and conversely. We apply our results to analyse the convergence of the Independent Metropolis--Hastings, Slice Sampling with Stepping-Out and Shrinkage, and Hit-and-Run-within-Slice Sampling algorithms.

Submitted 21 February, 2024; originally announced February 2024.

Comments: 35 pages, 2 figures

MSC Class: 65C05; 60J22

arXiv:2402.12764 [pdf, other]

Attosecond Delays in X-ray Molecular Ionization

Authors: Taran Driver, Miles Mountney, Jun Wang, Lisa Ortmann, Andre Al-Haddad, Nora Berrah, Christoph Bostedt, Elio G. Champenois, Louis F. DiMauro, Joseph Duris, Douglas Garratt, James M. Glownia, Zhaoheng Guo, Daniel Haxton, Erik Isele, Igor Ivanov, Jiabao Ji, Andrei Kamalov, Siqi Li, Ming-Fu Lin, Jon P. Marangos, Razib Obaid, Jordan T. O'Neal, Philipp Rosenberger, Niranjan H. Shivaram, et al. (12 additional authors not shown)

Abstract: The photoelectric effect is not truly instantaneous, but exhibits attosecond delays that can reveal complex molecular dynamics. Sub-femtosecond duration light pulses provide the requisite tools to resolve the dynamics of photoionization. Accordingly, the past decade has produced a large volume of work on photoionization delays following single photon absorption of an extreme ultraviolet (XUV) photon. However, the measurement of time-resolved core-level photoionization remained out of reach. The x-ray photon energies needed for core-level photoionization were not available with attosecond tabletop sources. We have now measured the x-ray photoemission delay of core-level electrons, and here report unexpectedly large delays, ranging up to 700 attoseconds in NO near the oxygen K-shell threshold. These measurements exploit attosecond soft x-ray pulses from a free-electron laser (XFEL) to scan across the entire region near the K-shell threshold. Furthermore, we find the delay spectrum is richly modulated, suggesting several contributions including transient trapping of the photoelectron due to shape resonances, collisions with the Auger-Meitner electron that is emitted in the rapid non-radiative relaxation of the molecule, and multi-electron scattering effects. The results demonstrate how x-ray attosecond experiments, supported by comprehensive theoretical modelling, can unravel the complex correlated dynamics of core-level photoionization.

Submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.12370 [pdf, other]

AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies

Authors: Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Tiyyala, Nicholas Andrews, Daniel Khashabi

Abstract: Humans regularly engage in analogical thinking, relating personal experiences to current situations ($X$ is analogous to $Y$ because of $Z$). Analogical thinking allows humans to solve problems in creative ways, grasp difficult concepts, and articulate ideas more effectively. Can language models (LMs) do the same? To answer this question, we propose ANALOBENCH, a benchmark to determine analogical reasoning ability in LMs. Our benchmarking approach focuses on aspects of this ability that are common among humans: (i) recalling related experiences from a large amount of information, and (ii) applying analogical reasoning to complex and lengthy scenarios. We test a broad collection of proprietary models (e.g., GPT family, Claude V2) and open source models such as LLaMA2. As in prior results, scaling up LMs results in some performance boosts. Surprisingly, scale offers minimal gains when (i) analogies involve lengthy scenarios, or (ii) relevant scenarios must be recalled from a large pool of information, a process analogous to finding a needle in a haystack. We hope these observations encourage further research in this field.

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2402.11588 [pdf, other]

SDiT: Spiking Diffusion Model with Transformer

Authors: Shu Yang, Hanzhi Ma, Chengting Yu, Aili Wang, Er-Ping Li

Abstract: Spiking neural networks (SNNs) have low power consumption and bio-interpretable characteristics, and are considered to have tremendous potential for energy-efficient computing. However, the exploration of SNNs on image generation tasks remains very limited, and a unified and effective structure for SNN-based generative models has yet to be proposed. In this paper, we explore a novel diffusion model architecture within spiking neural networks. We utilize a transformer to replace the commonly used U-net structure in mainstream diffusion models. It can generate higher quality images with relatively lower computational cost and shorter sampling time. It aims to provide an empirical baseline for research of generative models based on SNNs. Experiments on MNIST, Fashion-MNIST, and CIFAR-10 datasets demonstrate that our work is highly competitive compared to existing SNN generative models.

Submitted 24 February, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

arXiv:2402.09750 [pdf, other]

Exploring the Potential of Large Language Models in Artistic Creation: Collaboration and Reflection on Creative Programming

Authors: Anqi Wang, Zhizhuo Yin, Yulu Hu, Yuanyuan Mao, Pan Hui

Abstract: Recently, the potential of large language models (LLMs) has been widely used in assisting programming. However, current research does not explore the artist potential of LLMs in creative coding within artist and AI collaboration. Our work probes the reflection type of artists in the creation process with such collaboration. We compare two common collaboration approaches: invoking the entire program and multiple subtasks. Our findings exhibit artists' different stimulated reflections in two different methods. Our finding also shows the correlation of reflection type with user performance, user satisfaction, and subjective experience in two collaborations through conducting two methods, including experimental data and qualitative interviews. In this sense, our work reveals the artistic potential of LLM in creative coding. Meanwhile, we provide a critical lens of human-AI collaboration from the artists' perspective and expound design suggestions for future work of AI-assisted creative tasks.

Submitted 15 February, 2024; originally announced February 2024.

Comments: 15 pages, 4 figures

ACM Class: J.5

arXiv:2402.09387 [pdf, other]

Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning

Authors: Allen M. Wang, Oswin So, Charles Dawson, Darren T. Garnier, Cristina Rea, Chuchu Fan

Abstract: The tokamak offers a promising path to fusion energy, but plasma disruptions pose a major economic risk, motivating considerable advances in disruption avoidance. This work develops a reinforcement learning approach to this problem by training a policy to safely ramp-down the plasma current while avoiding limits on a number of quantities correlated with disruptions. The policy training environment is a hybrid physics and machine learning model trained on simulations of the SPARC primary reference discharge (PRD) ramp-down, an upcoming burning plasma scenario which we use as a testbed. To address physics uncertainty and model inaccuracies, the simulation environment is massively parallelized on GPU with randomized physics parameters during policy training. The trained policy is then successfully transferred to a higher fidelity simulator where it successfully ramps down the plasma while avoiding user-specified disruptive limits. We also address the crucial issue of safety criticality by demonstrating that a constraint-conditioned policy can be used as a trajectory design assistant to design a library of feed-forward trajectories to handle different physics conditions and user settings. As a library of trajectories is more interpretable and verifiable offline, we argue such an approach is a promising path for leveraging the capabilities of reinforcement learning in the safety-critical context of burning plasma tokamaks. Finally, we demonstrate how the training environment can be a useful platform for other feed-forward optimization approaches by using an evolutionary algorithm to perform optimization of feed-forward trajectories that are robust to physics uncertainty.

Submitted 14 February, 2024; originally announced February 2024.

arXiv:2402.09208 [pdf]

Coexistence of Superconductivity and Antiferromagnetism in Topological Magnet MnBi2Te4 Films

Authors: Wei Yuan, Zi-Jie Yan, Hemian Yi, Zihao Wang, Stephen Paolini, Yi-Fan Zhao, Ling-Jie Zhou, Annie G. Wang, Ke Wang, Thomas Prokscha, Zaher Salman, Andreas Suter, Purnima P. Balakrishnan, Alexander J. Grutter, Laurel E. Winter, John Singleton, Moses H. W. Chan, Cui-Zu Chang

Abstract: The interface of two materials can harbor unexpected emergent phenomena. One example is interface-induced superconductivity. In this work, we employ molecular beam epitaxy to grow a series of heterostructures formed by stacking together two non-superconducting antiferromagnetic materials, an intrinsic antiferromagnetic topological insulator MnBi2Te4 and an antiferromagnetic iron chalcogenide FeTe. Our electrical transport measurements reveal interface-induced superconductivity in these heterostructures. By performing scanning tunneling microscopy and spectroscopy measurements, we observe a proximity-induced superconducting gap on the top surface of the MnBi2Te4 layer, confirming the interaction between superconductivity and antiferromagnetism in the MnBi2Te4 layer. Our findings will advance the fundamental inquiries into the topological superconducting phase in hybrid devices and provide a promising platform for the exploration of chiral Majorana physics in MnBi2Te4-based heterostructures.

Submitted 14 February, 2024; originally announced February 2024.

Comments: 24 pages, 4 figures, comments are welcome

arXiv:2402.08865 [pdf, other]

doi 10.1103/PhysRevD.109.112010

New constraints on ultraheavy dark matter from the LZ experiment

Authors: J. Aalbers, D. S. Akerib, A. K. Al Musalhi, C. S. Amarasinghe, A. Ames, T. J. Anderson, N. Angelides, H. M. Araújo, J. E. Armstrong, M. Arthurs, A. Baker, S. Balashov, J. Bang, J. W. Bargemann, A. Baxter, K. Beattie, T. Benson, A. Bhatti, A. Biekert, T. P. Biesiadzinski, H. J. Birch, E. Bishop, G. M. Blockinger, B. Boxer, C. A. J. Brew , et al. (174 additional authors not shown)

Abstract: Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$.

Submitted 13 February, 2024; originally announced February 2024.

Comments: 9 pages, 7 figures

Journal ref: Phys. Rev. D 109, 112010 (2024)

arXiv:2402.08159 [pdf, other]

Poisson flow consistency models for low-dose CT image denoising

Authors: Dennis Hein, Adam Wang, Ge Wang

Abstract: Diffusion and Poisson flow models have demonstrated remarkable success for a wide range of generative tasks. Nevertheless, their iterative nature results in computationally expensive sampling and the number of function evaluations (NFE) required can be orders of magnitude larger than for single-step methods. Consistency models are a recent class of deep generative models which enable single-step sampling of high quality data without the need for adversarial training. In this paper, we introduce a novel image denoising technique which combines the flexibility afforded in Poisson flow generative models (PFGM)++ with the, high quality, single step sampling of consistency models. The proposed method first learns a trajectory between a noise distribution and the posterior distribution of interest by training PFGM++ in a supervised fashion. These pre-trained PFGM++ are subsequently "distilled" into Poisson flow consistency models (PFCM) via an updated version of consistency distillation. We call this approach posterior sampling Poisson flow consistency models (PS-PFCM). Our results indicate that the added flexibility of tuning the hyperparameter D, the dimensionality of the augmentation variables in PFGM++, allows us to outperform consistency models, a current state-of-the-art diffusion-style model with NFE=1 on clinical low-dose CT images. Notably, PFCM is in itself a novel family of deep generative models and we provide initial results on the CIFAR-10 dataset.

Submitted 12 February, 2024; originally announced February 2024.

arXiv:2402.06985 [pdf, other]

OSSAR: Towards Open-Set Surgical Activity Recognition in Robot-assisted Surgery

Authors: Long Bai, Guankun Wang, Jie Wang, Xiaoxiao Yang, Huxin Gao, Xin Liang, An Wang, Mobarakol Islam, Hongliang Ren

Abstract: In the realm of automated robotic surgery and computer-assisted interventions, understanding robotic surgical activities stands paramount. Existing algorithms dedicated to surgical activity recognition predominantly cater to pre-defined closed-set paradigms, ignoring the challenges of real-world open-set scenarios. Such algorithms often falter in the presence of test samples originating from classes unseen during training phases. To tackle this problem, we introduce an innovative Open-Set Surgical Activity Recognition (OSSAR) framework. Our solution leverages the hyperspherical reciprocal point strategy to enhance the distinction between known and unknown classes in the feature space. Additionally, we address the issue of over-confidence in the closed set by refining model calibration, avoiding misclassification of unknown classes as known ones. To support our assertions, we establish an open-set surgical activity benchmark utilizing the public JIGSAWS dataset. Besides, we also collect a novel dataset on endoscopic submucosal dissection for surgical activity tasks. Extensive comparisons and ablation experiments on these datasets demonstrate the significant outperformance of our method over existing state-of-the-art approaches. Our proposed solution can effectively address the challenges of real-world surgical scenarios. Our code is publicly accessible at https://github.com/longbai1006/OSSAR.

Submitted 10 February, 2024; originally announced February 2024.

Comments: To appear in IEEE ICRA 2024

arXiv:2402.04420 [pdf, other]

Measuring machine learning harms from stereotypes: requires understanding who is being harmed by which errors in what ways

Authors: Angelina Wang, Xuechunzi Bai, Solon Barocas, Su Lin Blodgett

Abstract: As machine learning applications proliferate, we need an understanding of their potential for harm. However, current fairness metrics are rarely grounded in human psychological experiences of harm. Drawing on the social psychology of stereotypes, we use a case study of gender stereotypes in image search to examine how people react to machine learning errors. First, we use survey studies to show that not all machine learning errors reflect stereotypes nor are equally harmful. Then, in experimental studies we randomly expose participants to stereotype-reinforcing, -violating, and -neutral machine learning errors. We find stereotype-reinforcing errors induce more experientially (i.e., subjectively) harmful experiences, while having minimal changes to cognitive beliefs, attitudes, or behaviors. This experiential harm impacts women more than men. However, certain stereotype-violating errors are more experientially harmful for men, potentially due to perceived threats to masculinity. We conclude that harm cannot be the sole guide in fairness mitigation, and propose a nuanced perspective depending on who is experiencing what harm and why.

Submitted 6 February, 2024; originally announced February 2024.

Comments: earlier draft non-archival at EAAMO 2023

arXiv:2402.04105 [pdf, other]

Measuring Implicit Bias in Explicitly Unbiased Large Language Models

Authors: Xuechunzi Bai, Angelina Wang, Ilia Sucholutsky, Thomas L. Griffiths

Abstract: Large language models (LLMs) can pass explicit social bias tests but still harbor implicit biases, similar to humans who endorse egalitarian beliefs yet exhibit subtle biases. Measuring such implicit biases can be a challenge: as LLMs become increasingly proprietary, it may not be possible to access their embeddings and apply existing bias measures; furthermore, implicit biases are primarily a concern if they affect the actual decisions that these systems make. We address both challenges by introducing two new measures of bias: LLM Implicit Bias, a prompt-based method for revealing implicit bias; and LLM Decision Bias, a strategy to detect subtle discrimination in decision-making tasks. Both measures are based on psychological research: LLM Implicit Bias adapts the Implicit Association Test, widely used to study the automatic associations between concepts held in human minds; and LLM Decision Bias operationalizes psychological results indicating that relative evaluations between two candidates, not absolute evaluations assessing each independently, are more diagnostic of implicit biases. Using these measures, we found pervasive stereotype biases mirroring those in society in 8 value-aligned models across 4 social categories (race, gender, religion, health) in 21 stereotypes (such as race and criminality, race and weapons, gender and science, age and negativity). Our prompt-based LLM Implicit Bias measure correlates with existing language model embedding-based bias methods, but better predicts downstream behaviors measured by LLM Decision Bias. These new prompt-based measures draw from psychology's long history of research into measuring stereotype biases based on purely observable behavior; they expose nuanced biases in proprietary value-aligned LLMs that appear unbiased according to standard benchmarks.

Submitted 23 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.03596 [pdf, other]

PandaX-xT: a Multi-ten-tonne Liquid Xenon Observatory at the China Jinping Underground Laboratory

Authors: PandaX Collaboration, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Linhui Gu, Xunan Guo, Xuyuan Guo, Zhichao Guo, Chencheng Han, Ke Han, Changda He, **rong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou , et al. (68 additional authors not shown)

Abstract: We propose a major upgrade to the existing PandaX-4T experiment in the China Jinping Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle physics and astrophysics. The sensitivity of dark matter direct detection will be improved by nearly two orders of magnitude compared to the current best limits, approaching the so-called "neutrino floor" for a dark matter mass above 10 GeV/$c^2$, providing a decisive test to the Weakly Interacting Massive Particle paradigm. By searching for the neutrinoless double beta decay of $^{136}$Xe isotope in the detector, the effective Majorana neutrino mass can be measured to a [10 -- 41] meV/$c^2$ sensitivity, providing a key test to the Dirac/Majorana nature of neutrinos. Astrophysical neutrinos and other ultra-rare interactions can also be measured and searched for with an unprecedented background level, opening up new windows of discovery. Depending on the findings, PandaX-xT will seek the next stage upgrade utilizing isotopic separation on natural xenon.

Submitted 5 February, 2024; originally announced February 2024.

arXiv:2402.02100 [pdf, ps, other]

doi 10.1103/PhysRevA.109.L060601

Weak-measurement-based pseudospin pointer: A cost-effective scheme for precision measurement

Authors: Ling Ye, Lan Luo, An Wang, Rongchun Ge, Zhiyou Zhang

Abstract: As an essential component of state-of-the-art quantum technologies, fast and efficient quantum measurements are in persistent demand over time. We present a proof-of-principle experiment on a new dimensionless pseudo-spin pointer based on weak measurement. In the context of optical parameter estimation, we demonstrate that the parametric distribution's moment is obtained experimentally by employing the dimensionless pointer without measuring the distribution literally. In addition to the sheer liberation of experimental expense, the photon-counting-based pointer is well-calibrated for the detection of weak signals. We show that for signals $3$-$4$ orders of magnitude weaker in strength than the area-array camera method, an order of magnitude improvement in precision is achieved experimentally.

Submitted 12 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

Comments: 6 figures

Journal ref: Phys. Rev. A 109, L060601 (2024)

arXiv:2402.01908 [pdf, other]

Large language models cannot replace human participants because they cannot portray identity groups

Authors: Angelina Wang, Jamie Morgenstern, John P. Dickerson

Abstract: Large language models (LLMs) are increasing in capability and popularity, propelling their application in new domains -- including as replacements for human participants in computational social science, user testing, annotation tasks, and more. Traditionally, in all of these settings survey distributors are careful to find representative samples of the human population to ensure the validity of their results and understand potential demographic differences. This means in order to be a suitable replacement, LLMs will need to be able to capture the influence of positionality (i.e., relevance of social identities like gender and race). However, we show that there are two inherent limitations in the way current LLMs are trained that prevent this. We argue analytically for why LLMs are doomed to both misportray and flatten the representations of demographic groups, then empirically show this to be true on 4 LLMs through a series of human studies with 3200 participants across 16 demographic identities. We also discuss a third consideration about how identity prompts can essentialize identities. Throughout, we connect each of these limitations to a pernicious history that shows why each is harmful for marginalized demographic groups. Overall, we urge caution in use cases where LLMs are intended to replace human participants whose identities are relevant to the task at hand. At the same time, in cases where the goal is to supplement rather than replace (e.g., pilot studies), we provide empirically-better inference-time techniques to reduce, but not remove, these harms.

Submitted 2 February, 2024; originally announced February 2024.

arXiv:2402.00300 [pdf, other]

Self-supervised learning of video representations from a child's perspective

Authors: A. Emin Orhan, Wentao Wang, Alex N. Wang, Mengye Ren, Brenden M. Lake

Abstract: Children learn powerful internal models of the world around them from a few years of egocentric visual experience. Can such internal models be learned from a child's visual experience with highly generic learning algorithms or do they require strong inductive biases? Recent advances in collecting large-scale, longitudinal, developmentally realistic video datasets and generic self-supervised learning (SSL) algorithms are allowing us to begin to tackle this nature vs. nurture question. However, existing work typically focuses on image-based SSL algorithms and visual capabilities that can be learned from static images (e.g. object recognition), thus ignoring temporal aspects of the world. To close this gap, here we train self-supervised video models on longitudinal, egocentric headcam recordings collected from a child over a two year period in their early development (6-31 months). The resulting models are highly effective at facilitating the learning of action concepts from a small number of labeled examples; they have favorable data size scaling properties; and they display emergent video interpolation capabilities. Video models also learn more robust object representations than image-based models trained with the exact same data. These results suggest that important temporal aspects of a child's internal model of the world may be learnable from their visual experience using highly generic learning algorithms and without strong inductive biases.

Submitted 31 January, 2024; originally announced February 2024.

Comments: 7 pages, 6 figures; code & models available from https://github.com/eminorhan/video-models

arXiv:2401.15250 [pdf, other]

Experimental Demonstration of Attosecond Pump-Probe Spectroscopy with an X-ray Free-Electron Laser

Authors: Zhaoheng Guo, Taran Driver, Sandra Beauvarlet, David Cesar, Joseph Duris, Paris L. Franz, Oliver Alexander, Dorian Bohler, Christoph Bostedt, Vitali Averbukh, Xinxin Cheng, Louis F. DiMauro, Gilles Doumy, Ruaridh Forbes, Oliver Gessner, James M. Glownia, Erik Isele, Andrei Kamalov, Kirk A. Larsen, Siqi Li, Xiang Li, Ming-Fu Lin, Gregory A. McCracken, Razib Obaid, Jordan T. ONeal , et al. (25 additional authors not shown)

Abstract: Pump-probe experiments with sub-femtosecond resolution are the key to understanding electronic dynamics in quantum systems. Here we demonstrate the generation and control of sub-femtosecond pulse pairs from a two-colour X-ray free-electron laser (XFEL). By measuring the delay between the two pulses with an angular streaking diagnostic, we characterise the group velocity of the XFEL and demonstrate control of the pulse delay down to 270 as. We demonstrate the application of this technique to a pump-probe measurement in core-excited para-aminophenol. These results demonstrate the ability to perform pump-probe experiments with sub-femtosecond resolution and atomic site specificity.

Submitted 26 January, 2024; originally announced January 2024.

Comments: 55 pages, main manuscript (5 figures) + supplementary materials (25 figures), 30 figures total. Submitted to Nature Photonics

arXiv:2401.14609 [pdf, other]

Physically Informed Synchronic-adaptive Learning for Industrial Systems Modeling in Heterogeneous Media with Unavailable Time-varying Interface

Authors: Aina Wang, Pan Qin, Xi-Ming Sun

Abstract: Partial differential equations (PDEs) are commonly employed to model complex industrial systems characterized by multivariable dependence. Existing physics-informed neural networks (PINNs) excel in solving PDEs in a homogeneous medium. However, their feasibility is diminished when PDE parameters are unknown due to a lack of physical attributions and time-varying interface is unavailable arising from heterogeneous media. To this end, we propose a data-physics-hybrid method, physically informed synchronic-adaptive learning (PISAL), to solve PDEs for industrial systems modeling in heterogeneous media. First, Net1, Net2, and NetI, are constructed to approximate the solutions satisfying PDEs and the interface. Net1 and Net2 are utilized to synchronously learn each solution satisfying PDEs with diverse parameters, while NetI is employed to adaptively learn the unavailable time-varying interface. Then, a criterion combined with NetI is introduced to adaptively distinguish the attributions of measurements and collocation points. Furthermore, NetI is integrated into a data-physics-hybrid loss function. Accordingly, a synchronic-adaptive learning (SAL) strategy is proposed to decompose and optimize each subdomain. Besides, we theoretically prove the approximation capability of PISAL. Extensive experimental results verify that the proposed PISAL can be used for industrial systems modeling in heterogeneous media, which faces the challenges of lack of physical attributions and unavailable time-varying interface.

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.12723 [pdf, other]

A regular MOG black hole's impact on shadows and gravitational weak lensing in the presence of quintessence field

Authors: Ahmad Al-Badawi, Sanjar Shaymatov, Mirzabek Alloqulov, Anzhong Wang

Abstract: We investigate the impact of the modified gravity (MOG) field and the quintessence scalar field on horizon evolution, black hole (BH) shadow and the weak gravitational lensing around a static spherically symmetric BH. We first begin to write the BH metric associated with the MOG parameter and quintessence scalar field. We then determine the BH shadow and obtain numerical solutions for the photon sphere and shadow radius. We show that the MOG ($α$) and the quintessence ($c$) parameters have a significant impact on BH shadow and photon sphere. Based on the analysis, we further show that the combined effects of the MOG parameter and quintessential field can increase the values of BH shadow and photon sphere radii. We also obtain constraints on the BH parameters by applying the observational data of Sgr A$^{\star}$ and M87$^{\star}$. Finally, we consider the weak deflection angle of BH within the context of the Gauss-Bonnet theorem (GBT) and show that the combined effects of the MOG and quintessence parameters do make the value of the deflection angle grow, referring to remarkable property being in well agreement with the physical meaning of both parameters that can maintain the strong gravitational field in the surrounding environment of BH.

Submitted 23 January, 2024; originally announced January 2024.

Comments: 14 pages, 2 tables, 9 captioned figures

arXiv:2401.12158 [pdf]

doi 10.1063/5.0198885

Electron mobility of SnO2 from first principles

Authors: Amanda Wang, Kyle Bushick, Nick Pant, Woncheol Lee, Xiao Zhang, Joshua Leveillee, Feliciano Giustino, Samuel Poncé, Emmanouil Kioupakis

Abstract: The transparent conducting oxide SnO2 is a wide bandgap semiconductor that is easily n-type doped and widely used in various electronic and optoelectronic applications. Experimental reports of the electron mobility of this material vary widely depending on the growth conditions and doping concentrations. In this work, we calculate the electron mobility of SnO2 from first principles to examine the temperature- and doping-concentration dependence, and to elucidate the scattering mechanisms that limit transport. We include both electron-phonon scattering and electron-ionized impurity scattering to accurately model scattering in a doped semiconductor. We find a strongly anisotropic mobility that favors transport in the direction parallel to the c-axis. At room temperature and intrinsic carrier concentrations, the low-energy polar-optical phonon modes dominate scattering, while ionized-impurity scattering dominates above 10^18 cm^-3.

Submitted 22 April, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: 24 pages, 3 figures in main text, 6 figures in supplementary material

Journal ref: Appl. Phys. Lett. 22 April 2024; 124 (17): 172103

arXiv:2401.11770 [pdf, ps, other]

Evidence for Unfolded Fermi Surfaces in the Charge-Density-Wave State of Kagome Metal FeGe Revealed by de Haas-van Alphen Effect

Authors: Kaixin Tang, Hanjing Zhou, Houpu Li, Senyang Pan, Xueliang Wu, Hongyu Li, Nan Zhang, Chuanying Xi, Jinglei Zhang, Aifeng Wang, Xiangang Wan, Ziji Xiang, Xianhui Chen

Abstract: The antiferromagnetic kagome lattice compound FeGe has been revealed to host an emergent charge-density-wave (CDW) state which manifests complex interplay between the spin, charge and lattice degrees of freedom. Here, we present a comprehensive study of the de Haas-van Alphen effect by measuring torque magnetometry under magnetic fields up to 45.2 T to map Fermi surfaces in this unusual CDW state. For field along the $c$ direction, we resolve four cyclotron orbits, the largest one roughly corresponding to the area of the 2$\times$2 folded Brillouin zone. Three smaller orbits are characterized by light effective cyclotron masses ranging from 0.18 to 0.30 $m_e$. Angle-resolved measurements identify one Fermi surface segment with weak anisotropy. Combined with band structure calculations, our results suggest that features of unfolded Fermi surfaces are robust against CDW reconstruction, corroborating the novel effect of a short-ranged CDW on the electronic structure.

Submitted 22 January, 2024; originally announced January 2024.

Comments: 7 pages, 4 figures, to be published in Phys. Rev. Research

arXiv:2401.10862 [pdf, other]

Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning

Authors: Adib Hasan, Ileana Rugina, Alex Wang

Abstract: Large Language Models (LLMs) are susceptible to `jailbreaking' prompts, which can induce the generation of harmful content. This paper demonstrates that moderate WANDA pruning (Sun et al., 2023) can increase their resistance to such attacks without the need for fine-tuning, while maintaining performance on standard benchmarks. Our findings suggest that the benefits of pruning correlate with the initial safety levels of the model, indicating a regularizing effect of WANDA pruning. We introduce a dataset of 225 harmful tasks across five categories to systematically evaluate this safety enhancement. We argue that safety improvements can be understood through a regularization perspective. First, we show that pruning helps LLMs focus more effectively on task-relevant tokens within jailbreaking prompts. Then, we analyze the effects of pruning on the perplexity of malicious prompts before and after their integration into jailbreak templates. Finally, we demonstrate statistically significant performance improvements under domain shifts when applying WANDA to linear models.

Submitted 28 April, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

arXiv:2401.09791 [pdf]

BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images

Authors: Negar Golestani, Aihui Wang, Gregory R Bean, Mirabela Rusu

Abstract: A standard treatment protocol for breast cancer entails administering neoadjuvant therapy followed by surgical removal of the tumor and surrounding tissue. Pathologists typically rely on cabinet X-ray radiographs, known as Faxitron, to examine the excised breast tissue and diagnose the extent of residual disease. However, accurately determining the location, size, and focality of residual cancer can be challenging, and incorrect assessments can lead to clinical consequences. The utilization of automated methods can improve the histopathology process, allowing pathologists to choose regions for sampling more effectively and precisely. Despite the recognized necessity, there are currently no such methods available. Training such automated detection models requires accurate ground truth labels on ex-vivo radiology images, which can be acquired through registering Faxitron and histopathology images and mapping the extent of cancer from histopathology to x-ray images. This study introduces a deep learning-based image registration approach trained on mono-modal synthetic image pairs. The models were trained using data from 50 women who received neoadjuvant chemotherapy and underwent surgery. The results demonstrate that our method is faster and yields significantly lower average landmark error ($2.1\pm1.96$ mm) over the state-of-the-art iterative ($4.43\pm4.1$ mm) and deep learning ($4.02\pm3.15$ mm) approaches.
Improved performance of our approach in integrating radiology and pathology information facilitates generating large datasets, which allows training models for more accurate breast cancer detection.

Submitted 18 January, 2024; originally announced January 2024.

arXiv:2401.08822 [pdf, other]

doi 10.1177/14738716241229437

An Empirical Study of Counterfactual Visualization to Support Visual Causal Inference

Authors: Arran Zeyu Wang, David Borland, David Gotz

Abstract: Counterfactuals -- expressing what might have been true under different circumstances -- have been widely applied in statistics and machine learning to help understand causal relationships. More recently, counterfactuals have begun to emerge as a technique being applied within visualization research. However, it remains unclear to what extent counterfactuals can aid with visual data communication. In this paper, we primarily focus on assessing the quality of users' understanding of data when provided with counterfactual visualizations. We propose a preliminary model of causality comprehension by connecting theories from causal inference and visual data communication. Leveraging this model, we conducted an empirical study to explore how counterfactuals can improve users' understanding of data in static visualizations. Our results indicate that visualizing counterfactuals had a positive impact on participants' interpretations of causal relations within datasets. These results motivate a discussion of how to more effectively incorporate counterfactuals into data visualizations.

Submitted 16 January, 2024; originally announced January 2024.

Comments: Accepted for publication in Information Visualization

Journal ref: Information Visualization 2024; 23(2): 197-214

arXiv:2401.08411 [pdf, other]

doi 10.1109/MCG.2023.3338788

Using Counterfactuals to Improve Causal Inferences from Visualizations

Authors: David Borland, Arran Zeyu Wang, David Gotz

Abstract: Traditional approaches to data visualization have often focused on comparing different subsets of data, and this is reflected in the many techniques developed and evaluated over the years for visual comparison. Similarly, common workflows for exploratory visualization are built upon the idea of users interactively applying various filter and grouping mechanisms in search of new insights. This paradigm has proven effective at helping users identify correlations between variables that can inform thinking and decision-making. However, recent studies show that consumers of visualizations often draw causal conclusions even when not supported by the data. Motivated by these observations, this article highlights recent advances from a growing community of researchers exploring methods that aim to directly support visual causal inference. However, many of these approaches have their own limitations which limit their use in many real-world scenarios. This article therefore also outlines a set of key open challenges and corresponding priorities for new research to advance the state of the art in visual causal inference.

Submitted 16 January, 2024; originally announced January 2024.

Comments: Accepted for publication in IEEE Computer Graphics and Applications, 44(1), Jan/Feb, 2024

Journal ref: IEEE Computer Graphics and Applications 2024; 44(1): 95-104

arXiv:2401.07045 [pdf, other]

Measurement of Solar $pp$ Neutrino Flux using Electron Recoil Data from PandaX-4T Commissioning Run

Authors: PandaX Collaboration, Xiaoying Lu, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Lisheng Geng, Karl Giboni, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Junting Huang, Zhou Huang, Ruquan Hou, Yu Hou, Xiangdong Ji, et al. (67 additional authors not shown)

Abstract: The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning data with 0.63 tonne$\times$year exposure. The $pp$ neutrino flux is determined to be $(8.0 \pm 3.9\,{\rm (stat)} \pm 10.0\,{\rm (syst)}) \times 10^{10}\,{\rm s}^{-1}\,{\rm cm}^{-2}$, consistent with Standard Solar Model and existing measurements, corresponding to a flux upper limit of $23.3\times 10^{10}\,{\rm s}^{-1}\,{\rm cm}^{-2}$ at 90\% C.L.

Submitted 2 July, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

Comments: 6 pages, 5 figures
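
As a rough reading aid for the PandaX-4T flux quoted above, the statistical and systematic uncertainties can be combined into a single figure. This is only a sketch: it assumes the two uncertainties are independent and add in quadrature, which the abstract does not state, and the symbol $\sigma_{\rm tot}$ is introduced here for illustration only.

```latex
% Assumption (not from the abstract): independent errors combined in quadrature.
\sigma_{\rm tot}
  = \sqrt{\sigma_{\rm stat}^{2} + \sigma_{\rm syst}^{2}}
  = \sqrt{3.9^{2} + 10.0^{2}}
  = \sqrt{115.21}
  \approx 10.7
  \quad (\times 10^{10}\,{\rm s}^{-1}\,{\rm cm}^{-2})
```

Under that assumption the combined uncertainty is larger than the central value of 8.0, which fits the abstract reporting an upper limit alongside the measurement.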

arXiv:2401.06530 [pdf, other]

Performance study of the JadePix-3 telescope from a beam test

Authors: Sheng Dong, Zhiliang Chen, Jia Zhou, Xinye Zhai, Anqi Wang, Yunxiang Wang, Hulin Wang, Lailin Xu, Jing Dong, Yang Zhou, Yunpeng Lu, Mingyi Dong, Hongyu Zhang, Qun Ouyang

Abstract: We present the results of a beam test conducted on a telescope using the JadePix-3 pixel sensor, developed with TowerJazz 180 nm CMOS imaging technology. The telescope is composed of five planes, each equipped with a JadePix-3 sensor with pitches of 26 um x 16 um and 23.11 um x 16 um. In addition, it features an FPGA-based synchronous readout system. The telescope underwent testing using an electron beam with energy ranging from 4 to 6 GeV. At an electron energy of 5.4 GeV, the telescope demonstrated superior spatial resolutions of 2.6 and 2.3 um in two dimensions. By designating the central plane as the device under test, we evaluated the JadePix-3 sensor's spatial resolutions as 5.2 and 4.6 um in two dimensions, achieving a detection efficiency of over 99%.

Submitted 26 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

arXiv:2401.05758 [pdf]

Analytical impact excitation of Er/O/B co-doped Si light emitting diodes

Authors: Xiaoming Wang, Jiajing He, Ao Wang, Kun Zhang, Yufei Sheng, Weida Hu, Chaoyuan Jin, Hua Bao, Yaping Dan

Abstract: Er doped Si light emitting diodes may find important applications in the generation and storage of quantum information. These diodes exhibit an emission efficiency two orders of magnitude higher at reverse bias than forward bias due to impact excitation. However, the physics of impact excitation in these devices remains largely unexplored. In this work, we fabricated an Er/O/B co-doped Si light emitting diode which exhibits strong electroluminescence through the impact excitation of electrons inelastically colliding with the Er ions. An analytical impact excitation theory was established to predict the electroluminescence intensity and internal quantum efficiency, which fit well with the experimental data. From the fittings, we find that the excitable Er ions reach a record concentration of 1.9 x 10^19 cm^-3 and up to 45% of them are in the excited state by impact excitation. This work has important implications for developing efficient classical and quantum light sources based on rare earth elements.

Submitted 11 January, 2024; originally announced January 2024.

Comments: 16 pages, 5 figures

Journal ref: Physical Review Letters 2024

arXiv:2401.05654 [pdf, other]

Towards Conversational Diagnostic AI

Authors: Tao Tu, Anil Palepu, Mike Schaekermann, Khaled Saab, Jan Freyberg, Ryutaro Tanno, Amy Wang, Brenna Li, Mohamed Amin, Nenad Tomasev, Shekoofeh Azizi, Karan Singhal, Yong Cheng, Le Hou, Albert Webson, Kavita Kulkarni, S Sara Mahdavi, Christopher Semturs, Juraj Gottweis, Joelle Barral, Katherine Chou, Greg S Corrado, Yossi Matias, Alan Karthikesalingam, Vivek Natarajan

Abstract: At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introduce AMIE (Articulate Medical Intelligence Explorer), a Large Language Model (LLM) based AI system optimized for diagnostic dialogue. AMIE uses a novel self-play based simulated environment with automated feedback mechanisms for scaling learning across diverse disease conditions, specialties, and contexts. We designed a framework for evaluating clinically-meaningful axes of performance including history-taking, diagnostic accuracy, management reasoning, communication skills, and empathy. We compared AMIE's performance to that of primary care physicians (PCPs) in a randomized, double-blind crossover study of text-based consultations with validated patient actors in the style of an Objective Structured Clinical Examination (OSCE). The study included 149 case scenarios from clinical providers in Canada, the UK, and India, 20 PCPs for comparison with AMIE, and evaluations by specialist physicians and patient actors. AMIE demonstrated greater diagnostic accuracy and superior performance on 28 of 32 axes according to specialist physicians and 24 of 26 axes according to patient actors.
Our research has several limitations and should be interpreted with appropriate caution. Clinicians were limited to unfamiliar synchronous text-chat which permits large-scale LLM-patient interactions but is not representative of usual clinical practice. While further research is required before AMIE could be translated to real-world settings, the results represent a milestone towards conversational diagnostic AI.

Submitted 10 January, 2024; originally announced January 2024.

Comments: 46 pages, 5 figures in main text, 19 figures in appendix

arXiv:2401.04929 [pdf, other]

Learning-Based Difficulty Calibration for Enhanced Membership Inference Attacks

Authors: Haonan Shi, Tu Ouyang, An Wang

Abstract: Machine learning models, in particular deep neural networks, are currently an integral part of various applications, from healthcare to finance. However, using sensitive data to train these models raises concerns about privacy and security. One method that has emerged to verify if the trained models are privacy-preserving is Membership Inference Attacks (MIA), which allows adversaries to determine whether a specific data point was part of a model's training dataset. While a series of MIAs have been proposed in the literature, only a few can achieve high True Positive Rates (TPR) in the low False Positive Rate (FPR) region (0.01%~1%). This is a crucial factor to consider for an MIA to be practically useful in real-world settings. In this paper, we present a novel approach to MIA that is aimed at significantly improving TPR at low FPRs. Our method, named learning-based difficulty calibration for MIA (LDC-MIA), characterizes data records by their hardness levels using a neural network classifier to determine membership. The experiment results show that LDC-MIA can improve TPR at low FPR by up to 4x compared to the other difficulty calibration based MIAs. It also has the highest Area Under ROC curve (AUC) across all datasets. Our method's cost is comparable with most of the existing MIAs, but is orders of magnitude more efficient than one of the state-of-the-art methods, LiRA, while achieving similar performance.

Submitted 9 July, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

Comments: Accepted to IEEE Euro S&P 2024
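
The "TPR at low FPR" metric named in the abstract above can be illustrated with a short sketch. This is not the paper's code or the LDC-MIA method itself: the function name `tpr_at_fpr`, the score lists, and the convention that a higher score means "more likely a training member" are all assumptions made for illustration.

```python
import math


def tpr_at_fpr(member_scores, nonmember_scores, target_fpr):
    """Return (tpr, fpr) at the loosest threshold whose FPR stays <= target_fpr.

    Hypothetical helper: scores are attack outputs where higher means
    'more likely a member'; a record is flagged when score > threshold.
    """
    if not member_scores or not nonmember_scores:
        raise ValueError("both score lists must be non-empty")
    ranked = sorted(nonmember_scores, reverse=True)
    # Number of non-members we may wrongly flag without exceeding the budget.
    allowed = math.floor(target_fpr * len(ranked))
    # At most `allowed` non-member scores lie strictly above this threshold.
    threshold = ranked[allowed] if allowed < len(ranked) else -math.inf
    fpr = sum(s > threshold for s in ranked) / len(ranked)
    tpr = sum(s > threshold for s in member_scores) / len(member_scores)
    return tpr, fpr


if __name__ == "__main__":
    members = [0.9, 0.8, 0.7, 0.4]  # made-up attack scores
    nonmembers = [0.85, 0.5, 0.3, 0.2, 0.1, 0.05, 0.04, 0.03, 0.02, 0.01]
    print(tpr_at_fpr(members, nonmembers, 0.1))  # (0.75, 0.1)
    print(tpr_at_fpr(members, nonmembers, 0.0))  # (0.25, 0.0)
```

With only ten non-member scores the smallest non-zero FPR that can be resolved is 10%; evaluating the 0.01% to 1% region mentioned in the abstract would need at least ten thousand non-member records.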

arXiv:2401.04215 [pdf]

A coronary artery phantom for task-based CT performance assessment and a comparative study of clinical CT, photon counting CT, and micro CT

Authors: Jed D. Pack, Paul Fitzgerald, Stephen Araujo, Ying Fan, Grant Stevens, Jonathan Gerdes, Adam Wang, Koen Nieman, Ge Wang, Bruno De Man

Abstract: While drastic improvements in CT technology have occurred in the past 25 years, spatial resolution is one area where progress has been limited until recently. New photon counting CT systems are capable of much better spatial resolution than their (energy integrating) predecessors. These improvements have the potential to improve the evaluation of obstructive coronary artery disease by enabling more accurate delineation between calcified plaque and coronary vessel lumen. A new set of vessel phantoms has been designed and manufactured for quantifying this improvement. Comparisons are made between an existing clinical CT system and a prototype photon counting system, with images from a micro CT system being used as the gold standard. Scans were made of the same objects on all three systems. The resulting images were registered and the luminal cross section areas were compared. Luminal cross-sections near calcified plaques were reduced due to blooming, but this effect was much less pronounced in images from the prototype photon counting system as compared to the images from the clinical CT system.

Submitted 8 January, 2024; originally announced January 2024.

arXiv:2401.00849 [pdf, other]

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Authors: Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou

Abstract: In the evolution of Vision-Language Pre-training, shifting from short-text comprehension to encompassing extended textual contexts is pivotal. Recent autoregressive vision-language models like \cite{flamingo, palme}, leveraging the long-context capability of Large Language Models, have excelled in few-shot text generation tasks but face challenges in alignment tasks. Addressing this gap, we introduce the contrastive loss into text generation models, presenting the COntrastive-Streamlined MultimOdal framework (COSMO), strategically partitioning the language model into dedicated unimodal text processing and adept multimodal data handling components. COSMO, our unified framework, merges unimodal and multimodal elements, enhancing model performance for tasks involving textual and visual data while notably reducing learnable parameters. However, these models demand extensive long-text datasets, yet the availability of high-quality long-text video datasets remains limited. To bridge this gap, this work introduces \VideoDatasetName, an inaugural interleaved video-text dataset featuring comprehensive captions, marking a significant step forward. Demonstrating its impact, we illustrate how \VideoDatasetName{} enhances model performance in image-text tasks. With 34% learnable parameters and utilizing 72% of the available data, our model demonstrates significant superiority over OpenFlamingo \cite{openflamingo}. For instance, in the 4-shot flickr captioning task, performance notably improves from 57.2% to 65.\%.
The contributions of COSMO and \VideoDatasetName{} are underscored by notable performance gains across 14 diverse downstream datasets encompassing both image-text and video-text tasks.

Submitted 1 January, 2024; originally announced January 2024.

Comments: 16 pages; Website: http://fingerrec.github.io/cosmo

arXiv:2401.00496 [pdf, other]

SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge

Authors: Dimitrios Psychogyios, Emanuele Colleoni, Beatrice Van Amsterdam, Chih-Yang Li, Shu-Yu Huang, Yuchong Li, Fucang Jia, Baosheng Zou, Guotai Wang, Yang Liu, Maxence Boels, Jiayu Huo, Rachel Sparks, Prokar Dasgupta, Alejandro Granados, Sebastien Ourselin, Mengya Xu, An Wang, Yanan Wu, Long Bai, Hongliang Ren, Atsushi Yamada, Yuriko Harai, Yuto Ishikawa, Kazuyuki Hayashi, et al. (25 additional authors not shown)

Abstract: Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segmentation algorithms are often trained and make predictions in isolation from each other, without exploiting potential cross-task relationships. With the EndoVis 2022 SAR-RARP50 challenge, we release the first multimodal, publicly available, in-vivo, dataset for surgical action recognition and semantic instrumentation segmentation, containing 50 suturing video segments of Robotic Assisted Radical Prostatectomy (RARP). The aim of the challenge is twofold. First, to enable researchers to leverage the scale of the provided dataset and develop robust and highly accurate single-task action recognition and tool segmentation approaches in the surgical domain. Second, to further explore the potential of multitask-based learning approaches and determine their comparative advantage against their single-task counterparts. A total of 12 teams participated in the challenge, contributing 7 action recognition methods, 9 instrument segmentation techniques, and 4 multitask approaches that integrated both action recognition and instrument segmentation.
The complete SAR-RARP50 dataset is available at: https://rdr.ucl.ac.uk/projects/SARRARP50_Segmentation_of_surgical_instrumentation_and_Action_Recognition_on_Robot-Assisted_Radical_Prostatectomy_Challenge/191091

Submitted 23 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

arXiv:2401.00410 [pdf]

doi 10.1007/s42864-022-00192-z

Electrical and thermal transport properties of kagome metals AV$_3$Sb$_5$ (A=K, Rb, Cs)

Authors: Xinrun Mi, Kunya Yang, Yuhan Gan, Long Zhang, Aifeng Wang, Yisheng Chai, Xiaoyuan Zhou, Mingquan He

Abstract: The interplay between lattice geometry, band topology and electronic correlations in the newly discovered kagome compounds AV$_3$Sb$_5$ (A=K, Rb, Cs) makes this family a novel playground to investigate emergent quantum phenomena, such as unconventional superconductivity, chiral charge density wave and electronic nematicity. These exotic quantum phases naturally leave nontrivial fingerprints in transport properties of AV$_3$Sb$_5$, both in electrical and thermal channels, which are prominent probes to uncover the underlying mechanisms. In this brief review, we highlight the unusual electrical and thermal transport properties observed in the unconventional charge ordered state of AV$_3$Sb$_5$, including giant anomalous Hall, anomalous Nernst, ambipolar Nernst and anomalous thermal Hall effects. Connections of these anomalous transport properties to time-reversal symmetry breaking, topological and multiband fermiology, as well as electronic nematicity, are also discussed. Finally, a perspective on this rapidly growing field, together with its challenges, is given.

Submitted 31 December, 2023; originally announced January 2024.

Comments: 34 pages, 9 figures, a review article published in Tungsten 5, 300 (2023)

Journal ref: Tungsten 5, 300 (2023)

arXiv:2401.00373 [pdf, other]

Improvement on the Linearity Response of PandaX-4T with new Photomultiplier Tubes Bases

Authors: Lingyin Luo, Deqing Fang, Ke Han, Di Huang, Xiaofeng Shang, Anqing Wang, Qiuhong Wang, Shaobo Wang, Siguang Wang, Xiang Xiao, Binbin Yan, Xiyu Yan

Abstract: With the expanding reach of physics, xenon-based detectors such as PandaX-4T in the China Jinping Underground Laboratory aim to cover an energy range from sub-keV to multi-MeV. A linear response of the photomultiplier tubes (PMTs) is required for both scintillation and electroluminescence signals. Through a dedicated bench test, we investigated the cause of the non-linear response in the Hamamatsu R11410-23 PMTs used in PandaX-4T. The saturation and suppression of the PMT waveform observed during the commissioning of PandaX-4T were caused by the high-voltage divider base. The bench test data validated the de-saturation algorithm used in the PandaX-4T data analysis. We also confirmed the improvement in linearity of a new PMT base design, which will be used to upgrade the PMT readout system in PandaX-4T.

Submitted 7 April, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

Comments: 13 pages, 9 figures

arXiv:2312.17242 [pdf, other]

Learning to Generate Text in Arbitrary Writing Styles

Authors: Aleem Khan, Andrew Wang, Sophia Hager, Nicholas Andrews

Abstract: Prior work in style-controlled text generation has focused on tasks such as emulating the style of prolific literary authors, producing formal or informal text, and mitigating toxicity of generated text. Plentiful demonstrations of these styles are available, and as a result modern language models are often able to emulate them, either via prompting or discriminative control. However, in applications such as writing assistants, it is desirable for language models to produce text in an author-specific style on the basis of a potentially small writing sample. For example, someone writing in a particular dialect may prefer writing suggestions that retain the same dialect. We find that instruction-tuned language models can struggle to reproduce author-specific style demonstrated in a prompt. Instead, we propose to guide a language model to generate text in a target style using contrastively-trained representations that capture stylometric features. Our approach (StyleMC) combines an author-adapted language model with sequence-level inference to improve stylistic consistency, and is found to be effective in a variety of conditions, including unconditional generation and style transfer. Additionally, we find that the proposed approach can serve as an effective anonymization method, by editing a document to mask authorship while preserving the original meaning.

Submitted 4 March, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

arXiv:2312.17192 [pdf, other]

HISR: Hybrid Implicit Surface Representation for Photorealistic 3D Human Reconstruction

Authors: Angtian Wang, Yuanlu Xu, Nikolaos Sarafianos, Robert Maier, Edmond Boyer, Alan Yuille, Tony Tung

Abstract: Neural reconstruction and rendering strategies have demonstrated state-of-the-art performances due, in part, to their ability to preserve high level shape details. Existing approaches, however, either represent objects as implicit surface functions or neural volumes and still struggle to recover shapes with heterogeneous materials, in particular human skin, hair or clothes. To this aim, we present a new hybrid implicit surface representation to model human shapes. This representation is composed of two surface layers that represent opaque and translucent regions on the clothed human body. We segment different regions automatically using visual cues and learn to reconstruct two signed distance functions (SDFs). We perform surface-based rendering on opaque regions (e.g., body, face, clothes) to preserve high-fidelity surface normals and volume rendering on translucent regions (e.g., hair). Experiments demonstrate that our approach obtains state-of-the-art results on 3D human reconstructions, and also shows competitive performances on other objects.

Submitted 28 December, 2023; originally announced December 2023.

Comments: Accepted by AAAI 2024 main track

arXiv:2312.17143 [pdf, other]

doi 10.1103/PhysRevB.108.245156

Coexistence of Dirac fermion and charge density wave in square-net-based semimetal LaAuSb2

Authors: Xueliang Wu, Zhixiang Hu, David Graf, Yu Liu, Chaoyue Deng, Huixia Fu, Asish K. Kundu, Tonica Valla, Cedomir Petrovic, Aifeng Wang

Abstract: We report a comprehensive study of magnetotransport properties, angle-resolved photoemission spectroscopy (ARPES), and density functional theory (DFT) calculations on self-flux grown LaAuSb$_2$ single crystals. Resistivity and Hall measurements reveal a charge density wave (CDW) transition at 77 K. MR and de Haas-Van Alphen (dHvA) measurements indicate that the transport properties of LaAuSb$_2$ are dominated by Dirac fermions that arise from Sb square nets. ARPES measurements and DFT calculations reveal an electronic structure with a common feature of the square-net-based topological semimetals, which is in good agreement with the magnetotransport properties. Our results indicate the coexistence of CDW and Dirac fermion in LaAuSb$_2$, both of which are linked to the bands arising from the Sb-square net, suggesting that the square net could serve as a structural motif to explore various electronic orders.

Submitted 28 December, 2023; originally announced December 2023.

Comments: 11 pages, 6 figures

Journal ref: Phys. Rev. B 108, 245156 (2023)

Showing 101–150 of 1,301 results for author: Wang, A