-
VLBI observations of the high-redshift X-ray bright blazar SRGE J170245.3+130104
Authors:
Yuanqi Liu,
Tao An,
Shaoguang Guo,
Yingkang Zhang,
Ailing Wang,
Zhijun Xu,
Georgii Khorunzhev,
Yulia Sotnikova,
Timur Mufakharov,
Alexander Mikhailov,
Marat Mingaliev
Abstract:
Aims. The X-ray luminous and radio-loud AGN SRGE J170245.3+130104 discovered at z $\sim$ 5.5 provides unique chances to probe the SMBH growth and evolution with powerful jets in the early Universe. Methods. We present 1.35 - 5.1 GHz Very Long Baseline Array (VLBA) results on the radio continuum emission and spectrum analysis for this quasar in a low flux density state. Results. This source is unre…
▽ More
Aims. The X-ray luminous and radio-loud AGN SRGE J170245.3+130104 discovered at z $\sim$ 5.5 provides unique chances to probe the SMBH growth and evolution with powerful jets in the early Universe. Methods. We present 1.35 - 5.1 GHz Very Long Baseline Array (VLBA) results on the radio continuum emission and spectrum analysis for this quasar in a low flux density state. Results. This source is unresolved at three frequencies with the total flux densities of 8.35$\pm$0.09 mJy beam-1, 7.47$\pm$0.08 mJy beam-1, and 6.57$\pm$0.02 mJy beam-1 at 1.73 GHz, 2.26 GHz, and 4.87 GHz, respectively. Meanwhile, the brightness temperature is higher than 109 K. Conclusions. Compared with previous radio observations with arcsec-scale resolution, nearly all the radio emission from this source concentrates in the very central milli-arcsecond (mas) scale area. We confirm this source is a bright blazar at z > 5. This young AGN provide us the great chances to understand the first generation of strong jets in the early Universe.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
Induced Gravitational Wave interpretation of PTA data: a complete study for general equation of state
Authors:
Guillem Domènech,
Shi Pi,
Ao Wang,
Jianing Wang
Abstract:
We thoroughly study the induced gravitational wave interpretation of the possible gravitational wave background reported by PTA collaborations, considering the unknown equation of state $w$ of the early universe. We perform a Bayesian analysis of the NANOGrav data using the publicly available \textsc{PTArcade} code together with \textsc{SIGWfast} for the numerical integration of the induced gravit…
▽ More
We thoroughly study the induced gravitational wave interpretation of the possible gravitational wave background reported by PTA collaborations, considering the unknown equation of state $w$ of the early universe. We perform a Bayesian analysis of the NANOGrav data using the publicly available \textsc{PTArcade} code together with \textsc{SIGWfast} for the numerical integration of the induced gravitational wave spectrum. We focus on two cases: a monochromatic and a log-normal primordial spectrum of fluctuations. For the log-normal spectrum, we show that, while the results are not very sensitive to $w$ when the GW peak is close to the PTA window, radiation domination is out of the $2σ$ contours when only the infra-red power-law tail contributes. For the monochromatic spectrum, the $2σ$ bounds yield $0.1\lesssim w\lesssim0.9$ so that radiation domination is close to the central value. We also investigate the primordial black hole (PBH) counterpart using the peak formalism. We show that, in general terms, a larger width and stiffer equation of state alleviates the overproduction of PBHs. No PBH overproduction requires $w\gtrsim0.42$ up to 2-$σ$ level for the monochromatic spectrum. Furthermore, including bounds from the cosmic microwave background, we find in general that the mass range of the PBH counterpart is bounded by $10^{-5} M_\odot\lesssim M_{\rm PBH}\lesssim10^{-1} M_\odot$. Lastly, we find that the PTA signal can explain the microlensing events reported by OGLE for $0.42\lesssim w\lesssim 0.50$. Our work showcases a complete treatment of induced gravitational waves and primordial black holes for general $w$ for future data analysis.
△ Less
Submitted 8 March, 2024; v1 submitted 29 February, 2024;
originally announced February 2024.
-
FPM-WSI: Fourier ptychographic whole slide imaging via feature-domain backdiffraction
Authors:
Shuhe Zhang,
Aiye Wang,
**ghao Xu,
Tianci Feng,
**hua Zhou,
An Pan
Abstract:
Fourier ptychographic microscopy (FPM), characterized by high-throughput computational imaging, theoretically provides a cunning solution to the trade-off between spatial resolution and field of view (FOV), which has a promising prospect in the application of digital pathology. However, block reconstruction and then stitching has currently become an unavoidable procedure due to vignetting effects.…
▽ More
Fourier ptychographic microscopy (FPM), characterized by high-throughput computational imaging, theoretically provides a cunning solution to the trade-off between spatial resolution and field of view (FOV), which has a promising prospect in the application of digital pathology. However, block reconstruction and then stitching has currently become an unavoidable procedure due to vignetting effects. The stitched image tends to present color inconsistency in different image segments, or even stitching artifacts. In response, we reported a computational framework based on feature-domain backdiffraction to realize full-FOV, stitching-free FPM reconstruction. Different from conventional algorithms that establish the loss function in the image domain, our method formulates it in the feature domain, where effective information of images is extracted by a feature extractor to bypass the vignetting effect. The feature-domain error between predicted images based on estimation of model parameters and practically captured images is then digitally diffracted back through the optical system for complex amplitude reconstruction and aberration compensation. Through massive simulations and experiments, the method presents effective elimination of vignetting artifacts, and reduces the requirement of precise knowledge of illumination positions. We also found its great potential to recover the data with a lower overlap** rate of spectrum and to realize automatic blind-digital refocusing without a prior defocus distance.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
Characterization of the Astrophysical Diffuse Neutrino Flux using Starting Track Events in IceCube
Authors:
R. Abbasi,
M. Ackermann,
J. Adams,
S. K. Agarwalla,
J. A. Aguilar,
M. Ahlers,
J. M. Alameddine,
N. M. Amin,
K. Andeen,
G. Anton,
C. Argüelles,
Y. Ashida,
S. Athanasiadou,
L. Ausborm,
S. N. Axani,
X. Bai,
A. Balagopal V.,
M. Baricevic,
S. W. Barwick,
S. Bash,
V. Basu,
R. Bay,
J. J. Beatty,
J. Becker Tjus,
J. Beise
, et al. (394 additional authors not shown)
Abstract:
A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to not only more effectively reject atmospher…
▽ More
A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to not only more effectively reject atmospheric muons but also atmospheric neutrino backgrounds in the southern sky, opening a new window to the sub-100 TeV astrophysical neutrino sky. The event selection is constructed using a dynamic starting track veto and machine learning algorithms. We use this data to measure the astrophysical diffuse flux as a single power law flux (SPL) with a best-fit spectral index of $γ= 2.58 ^{+0.10}_{-0.09}$ and per-flavor normalization of $φ^{\mathrm{Astro}}_{\mathrm{per-flavor}} = 1.68 ^{+0.19}_{-0.22} \times 10^{-18} \times \mathrm{GeV}^{-1} \mathrm{cm}^{-2} \mathrm{s}^{-1} \mathrm{sr}^{-1}$ (at 100 TeV). The sensitive energy range for this dataset is 3 - 550 TeV under the SPL assumption. This data was also used to measure the flux under a broken power law, however we did not find any evidence of a low energy cutoff.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Attosecond X-ray Chronoscopy of Core-level Photoemission
Authors:
Jia-Bao Ji,
Zhaoheng Guo,
Taran Driver,
Cynthia S. Trevisan,
David Cesar,
Xinxin Cheng,
Joseph Duris,
Paris L. Franz,
James Glownia,
Xiaochun Gong,
Daniel Hammerland,
Meng Han,
Saijoscha Heck,
Matthias Hoffmann,
Andrei Kamalov,
Kirk A. Larsen,
Xiang Li,
Ming-Fu Lin,
Yuchen Liu,
C. William McCurdy,
Razib Obaid,
Jordan T. ONeal,
Thomas N. Rescigno,
River R. Robles,
Nicholas Sudar
, et al. (10 additional authors not shown)
Abstract:
Attosecond photoemission or photoionization delays are a unique probe of the structure and the electronic dynamics of matter. However, spectral congestion and spatial delocalization of valence electron wave functions set fundamental limits to the complexity of systems that can be studied and the information that can be retrieved, respectively. Using attosecond X-ray pulses from LCLS, we demonstrat…
▽ More
Attosecond photoemission or photoionization delays are a unique probe of the structure and the electronic dynamics of matter. However, spectral congestion and spatial delocalization of valence electron wave functions set fundamental limits to the complexity of systems that can be studied and the information that can be retrieved, respectively. Using attosecond X-ray pulses from LCLS, we demonstrate the key advantages of measuring core-level delays: the photoelectron spectra remain atom-like, the measurements become element specific and the observed scattering dynamics originate from a point-like source. We exploit these unique features to reveal the effects of electronegativity and symmetry on attosecond scattering dynamics by measuring the photoionization delays between N-1s and C-1s core shells of a series of aromatic azabenzene molecules. Remarkably, the delays systematically increase with the number of nitrogen atoms in the molecule and reveal multiple resonances. We identify two previously unknown mechanisms regulating the associated attosecond dynamics, namely the enhanced confinement of the trapped wavefunction with increasing electronegativity of the atoms and the decrease of the coupling strength among the photoemitted partial waves with increasing symmetry. This study demonstrates the unique opportunities opened by measurements of core-level photoionization delays for unravelling attosecond electron dynamics in complex matter.
△ Less
Submitted 8 April, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
An Efficient MLP-based Point-guided Segmentation Network for Ore Images with Ambiguous Boundary
Authors:
Guodong Sun,
Yuting Peng,
Le Cheng,
Mengya Xu,
An Wang,
Bo Wu,
Hongliang Ren,
Yang Zhang
Abstract:
The precise segmentation of ore images is critical to the successful execution of the beneficiation process. Due to the homogeneous appearance of the ores, which leads to low contrast and unclear boundaries, accurate segmentation becomes challenging, and recognition becomes problematic. This paper proposes a lightweight framework based on Multi-Layer Perceptron (MLP), which focuses on solving the…
▽ More
The precise segmentation of ore images is critical to the successful execution of the beneficiation process. Due to the homogeneous appearance of the ores, which leads to low contrast and unclear boundaries, accurate segmentation becomes challenging, and recognition becomes problematic. This paper proposes a lightweight framework based on Multi-Layer Perceptron (MLP), which focuses on solving the problem of edge burring. Specifically, we introduce a lightweight backbone better suited for efficiently extracting low-level features. Besides, we design a feature pyramid network consisting of two MLP structures that balance local and global information thus enhancing detection accuracy. Furthermore, we propose a novel loss function that guides the prediction points to match the instance edge points to achieve clear object boundaries. We have conducted extensive experiments to validate the efficacy of our proposed method. Our approach achieves a remarkable processing speed of over 27 frames per second (FPS) with a model size of only 73 MB. Moreover, our method delivers a consistently high level of accuracy, with impressive performance scores of 60.4 and 48.9 in~$AP_{50}^{box}$ and~$AP_{50}^{mask}$ respectively, as compared to the currently available state-of-the-art techniques, when tested on the ore image dataset. The source code will be released at \url{https://github.com/MVME-HBUT/ORENEXT}.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Towards Open-ended Visual Quality Comparison
Authors:
Haoning Wu,
Hanwei Zhu,
Zicheng Zhang,
Erli Zhang,
Chaofeng Chen,
Liang Liao,
Chunyi Li,
Annan Wang,
Wenxiu Sun,
Qiong Yan,
Xiaohong Liu,
Guangtao Zhai,
Shiqi Wang,
Weisi Lin
Abstract:
Comparative settings (e.g. pairwise choice, listwise ranking) have been adopted by a wide range of subjective studies for image quality assessment (IQA), as it inherently standardizes the evaluation criteria across different observers and offer more clear-cut responses. In this work, we extend the edge of emerging large multi-modality models (LMMs) to further advance visual quality comparison into…
▽ More
Comparative settings (e.g. pairwise choice, listwise ranking) have been adopted by a wide range of subjective studies for image quality assessment (IQA), as it inherently standardizes the evaluation criteria across different observers and offer more clear-cut responses. In this work, we extend the edge of emerging large multi-modality models (LMMs) to further advance visual quality comparison into open-ended settings, that 1) can respond to open-range questions on quality comparison; 2) can provide detailed reasonings beyond direct answers. To this end, we propose the Co-Instruct. To train this first-of-its-kind open-source open-ended visual quality comparer, we collect the Co-Instruct-562K dataset, from two sources: (a) LLM-merged single image quality description, (b) GPT-4V "teacher" responses on unlabeled data. Furthermore, to better evaluate this setting, we propose the MICBench, the first benchmark on multi-image comparison for LMMs. We demonstrate that Co-Instruct not only achieves in average 30% higher accuracy than state-of-the-art open-source LMMs, but also outperforms GPT-4V (its teacher), on both existing related benchmarks and the proposed MICBench. Our model is published at https://huggingface.co/q-future/co-instruct.
△ Less
Submitted 4 March, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Fine-Grained Self-Endorsement Improves Factuality and Reasoning
Authors:
Ante Wang,
Linfeng Song,
Baolin Peng,
Ye Tian,
Lifeng **,
Haitao Mi,
**song Su,
Dong Yu
Abstract:
This work studies improving large language model (LLM) generations at inference time by mitigating fact-conflicting hallucinations. Particularly, we propose a self-endorsement framework that leverages the fine-grained fact-level comparisons across multiple sampled responses. Compared with prior ensemble methods (Wang et al., 2022;Chen et al., 2023)) that perform response-level selection, our appro…
▽ More
This work studies improving large language model (LLM) generations at inference time by mitigating fact-conflicting hallucinations. Particularly, we propose a self-endorsement framework that leverages the fine-grained fact-level comparisons across multiple sampled responses. Compared with prior ensemble methods (Wang et al., 2022;Chen et al., 2023)) that perform response-level selection, our approach can better alleviate hallucinations, especially for longform generation tasks. Our approach can broadly benefit smaller and open-source LLMs as it mainly conducts simple content-based comparisons. Experiments on Biographies show that our method can effectively improve the factuality of generations with simple and intuitive prompts across different scales of LLMs. Besides, comprehensive analyses on TriviaQA and GSM8K demonstrate the potential of self-endorsement for broader application.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
Do You See What I See? A Qualitative Study Eliciting High-Level Visualization Comprehension
Authors:
Ghulam Jilani Quadri,
Arran Zeyu Wang,
Zhehao Wang,
Jennifer Adorno,
Paul Rosen,
Danielle Albers Szafir
Abstract:
Designers often create visualizations to achieve specific high-level analytical or communication goals. These goals require people to naturally extract complex, contextualized, and interconnected patterns in data. While limited prior work has studied general high-level interpretation, prevailing perceptual studies of visualization effectiveness primarily focus on isolated, predefined, low-level ta…
▽ More
Designers often create visualizations to achieve specific high-level analytical or communication goals. These goals require people to naturally extract complex, contextualized, and interconnected patterns in data. While limited prior work has studied general high-level interpretation, prevailing perceptual studies of visualization effectiveness primarily focus on isolated, predefined, low-level tasks, such as estimating statistical quantities. This study more holistically explores visualization interpretation to examine the alignment between designers' communicative goals and what their audience sees in a visualization, which we refer to as their comprehension. We found that statistics people effectively estimate from visualizations in classical graphical perception studies may differ from the patterns people intuitively comprehend in a visualization. We conducted a qualitative study on three types of visualizations -- line graphs, bar graphs, and scatterplots -- to investigate the high-level patterns people naturally draw from a visualization. Participants described a series of graphs using natural language and think-aloud protocols. We found that comprehension varies with a range of factors, including graph complexity and data distribution. Specifically, 1) a visualization's stated objective often does not align with people's comprehension, 2) results from traditional experiments may not predict the knowledge people build with a graph, and 3) chart type alone is insufficient to predict the information people extract from a graph. Our study confirms the importance of defining visualization effectiveness from multiple perspectives to assess and inform visualization practices.
△ Less
Submitted 23 February, 2024;
originally announced February 2024.
-
Pressure tunable magnetic skyrmion phase in Co8Zn8Mn4 single crystals
Authors:
Zhun Li,
Xinrun Mi,
Xinming Wang,
Jian Lyu,
Na Su,
Aifeng Wang,
Yisheng Chai,
Bao Yuan,
Wanju Luo,
Hui Cheng,
Jianxiang Gao,
Hongliang Wang,
Lijie Hao,
Mingquan He,
Junying Shen,
Young Sun,
Xin Tong
Abstract:
In a magnetic skyrmion phase, magnetic moments form vortex-like topological textures which are of both fundamental and industrial interests. In $β$-Mn-type Co-Zn-Mn alloys, chrial magnetic skyrmions emerge above room temperature, providing a unique system for studying the skrymion physics and exploring spintronics applications. However, the magnetic skyrmion phase is typically confined in a narrow…
▽ More
In a magnetic skyrmion phase, magnetic moments form vortex-like topological textures which are of both fundamental and industrial interests. In $β$-Mn-type Co-Zn-Mn alloys, chrial magnetic skyrmions emerge above room temperature, providing a unique system for studying the skrymion physics and exploring spintronics applications. However, the magnetic skyrmion phase is typically confined in a narrow and limited temperature ($T$) and magnetic field ($H$) range. Here, we demonstrate that hydrostatic pressure can expand the skyrmion phase in the $T-H$ phase diagram of single-crystalline Co$_8$Zn$_8$Mn$_4$. At ambient pressure, signatures of skyrmions are seen within $T\sim302-308$ K and $H\sim50-100$ Oe. Applying a moderate pressure of 6 kbar extends this range to $T\sim300-310$ K and $H\sim50-150$ Oe. However, further escalation of pressure to 10 kbar results in a slight contraction of the skyrmion phase. These findings underscore the sensitivity of the skyrmion phase in Co$_8$Zn$_8$Mn$_4$ to external pressures, and hint at the potential of strain engineering, particularly in $β$-Mn-type Co-Zn-Mn thin films, as a promising avenue to customize the skyrmion phase.
△ Less
Submitted 22 February, 2024;
originally announced February 2024.
-
Representation Learning for Frequent Subgraph Mining
Authors:
Rex Ying,
Tianyu Fu,
Andrew Wang,
Jiaxuan You,
Yu Wang,
Jure Leskovec
Abstract:
Identifying frequent subgraphs, also called network motifs, is crucial in analyzing and predicting properties of real-world networks. However, finding large commonly-occurring motifs remains a challenging problem not only due to its NP-hard subroutine of subgraph counting, but also the exponential growth of the number of possible subgraphs patterns. Here we present Subgraph Pattern Miner (SPMiner)…
▽ More
Identifying frequent subgraphs, also called network motifs, is crucial in analyzing and predicting properties of real-world networks. However, finding large commonly-occurring motifs remains a challenging problem not only due to its NP-hard subroutine of subgraph counting, but also the exponential growth of the number of possible subgraphs patterns. Here we present Subgraph Pattern Miner (SPMiner), a novel neural approach for approximately finding frequent subgraphs in a large target graph. SPMiner combines graph neural networks, order embedding space, and an efficient search strategy to identify network subgraph patterns that appear most frequently in the target graph. SPMiner first decomposes the target graph into many overlap** subgraphs and then encodes each subgraph into an order embedding space. SPMiner then uses a monotonic walk in the order embedding space to identify frequent motifs. Compared to existing approaches and possible neural alternatives, SPMiner is more accurate, faster, and more scalable. For 5- and 6-node motifs, we show that SPMiner can almost perfectly identify the most frequent motifs while being 100x faster than exact enumeration methods. In addition, SPMiner can also reliably identify frequent 10-node motifs, which is well beyond the size limit of exact enumeration approaches. And last, we show that SPMiner can find large up to 20 node motifs with 10-100x higher frequency than those found by current approximate methods.
△ Less
Submitted 22 February, 2024;
originally announced February 2024.
-
SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Authors:
Shanchuan Lin,
Anran Wang,
Xiao Yang
Abstract:
We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our…
▽ More
We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our distilled SDXL-Lightning models both as LoRA and full UNet weights.
△ Less
Submitted 2 March, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
Weak Poincaré inequality comparisons for ideal and hybrid slice sampling
Authors:
Sam Power,
Daniel Rudolf,
Björn Sprungk,
Andi Q. Wang
Abstract:
Using the framework of weak Poincar{é} inequalities, we provide a general comparison between the Hybrid and Ideal Slice Sampling Markov chains in terms of their Dirichlet forms. In particular, under suitable assumptions Hybrid Slice Sampling will inherit fast convergence from Ideal Slice Sampling and conversely. We apply our results to analyse the convergence of the Independent Metropolis--Hasting…
▽ More
Using the framework of weak Poincar{é} inequalities, we provide a general comparison between the Hybrid and Ideal Slice Sampling Markov chains in terms of their Dirichlet forms. In particular, under suitable assumptions Hybrid Slice Sampling will inherit fast convergence from Ideal Slice Sampling and conversely. We apply our results to analyse the convergence of the Independent Metropolis--Hastings, Slice Sampling with Step**-Out and Shrinkage, and Hit-and-Run-within-Slice Sampling algorithms.
△ Less
Submitted 21 February, 2024;
originally announced February 2024.
-
Attosecond Delays in X-ray Molecular Ionization
Authors:
Taran Driver,
Miles Mountney,
Jun Wang,
Lisa Ortmann,
Andre Al-Haddad,
Nora Berrah,
Christoph Bostedt,
Elio G. Champenois,
Louis F. DiMauro,
Joseph Duris,
Douglas Garratt,
James M. Glownia,
Zhaoheng Guo,
Daniel Haxton,
Erik Isele,
Igor Ivanov,
Jiabao Ji,
Andrei Kamalov,
Siqi Li,
Ming-Fu Lin,
Jon P. Marangos,
Razib Obaid,
Jordan T. O'Neal,
Philipp Rosenberger,
Niranjan H. Shivaram
, et al. (12 additional authors not shown)
Abstract:
The photoelectric effect is not truly instantaneous, but exhibits attosecond delays that can reveal complex molecular dynamics. Sub-femtosecond duration light pulses provide the requisite tools to resolve the dynamics of photoionization. Accordingly, the past decade has produced a large volume of work on photoionization delays following single photon absorption of an extreme ultraviolet (XUV) phot…
▽ More
The photoelectric effect is not truly instantaneous, but exhibits attosecond delays that can reveal complex molecular dynamics. Sub-femtosecond duration light pulses provide the requisite tools to resolve the dynamics of photoionization. Accordingly, the past decade has produced a large volume of work on photoionization delays following single photon absorption of an extreme ultraviolet (XUV) photon. However, the measurement of time-resolved core-level photoionization remained out of reach. The required x-ray photon energies needed for core-level photoionization were not available with attosecond tabletop sources. We have now measured the x-ray photoemission delay of core-level electrons, and here report unexpectedly large delays, ranging up to 700 attoseconds in NO near the oxygen K-shell threshold. These measurements exploit attosecond soft x-ray pulses from a free-electron laser (XFEL) to scan across the entire region near the K-shell threshold. Furthermore, we find the delay spectrum is richly modulated, suggesting several contributions including transient trap** of the photoelectron due to shape resonances, collisions with the Auger-Meitner electron that is emitted in the rapid non-radiative relaxation of the molecule, and multi-electron scattering effects. The results demonstrate how x-ray attosecond experiments, supported by comprehensive theoretical modelling, can unravel the complex correlated dynamics of core-level photoionization.
△ Less
Submitted 20 February, 2024;
originally announced February 2024.
-
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies
Authors:
Xiao Ye,
Andrew Wang,
Jacob Choi,
Yining Lu,
Shreya Sharma,
Lingfeng Shen,
Vijay Tiyyala,
Nicholas Andrews,
Daniel Khashabi
Abstract:
Humans regularly engage in analogical thinking, relating personal experiences to current situations ($X$ is analogous to $Y$ because of $Z$). Analogical thinking allows humans to solve problems in creative ways, grasp difficult concepts, and articulate ideas more effectively. Can language models (LMs) do the same? To answer this question, we propose ANALOBENCH, a benchmark to determine analogical…
▽ More
Humans regularly engage in analogical thinking, relating personal experiences to current situations ($X$ is analogous to $Y$ because of $Z$). Analogical thinking allows humans to solve problems in creative ways, grasp difficult concepts, and articulate ideas more effectively. Can language models (LMs) do the same? To answer this question, we propose ANALOBENCH, a benchmark to determine analogical reasoning ability in LMs. Our benchmarking approach focuses on aspects of this ability that are common among humans: (i) recalling related experiences from a large amount of information, and (ii) applying analogical reasoning to complex and lengthy scenarios. We test a broad collection of proprietary models (e.g., GPT family, Claude V2) and open source models such as LLaMA2. As in prior results, scaling up LMs results in some performance boosts. Surprisingly, scale offers minimal gains when, (i) analogies involve lengthy scenarios, or (ii) recalling relevant scenarios from a large pool of information, a process analogous to finding a needle in a haystack. We hope these observations encourage further research in this field.
△ Less
Submitted 19 February, 2024;
originally announced February 2024.
-
SDiT: Spiking Diffusion Model with Transformer
Authors:
Shu Yang,
Hanzhi Ma,
Chengting Yu,
Aili Wang,
Er-** Li
Abstract:
Spiking neural networks (SNNs) have low power consumption and bio-interpretable characteristics, and are considered to have tremendous potential for energy-efficient computing. However, the exploration of SNNs on image generation tasks remains very limited, and a unified and effective structure for SNN-based generative models has yet to be proposed. In this paper, we explore a novel diffusion mode…
▽ More
Spiking neural networks (SNNs) have low power consumption and bio-interpretable characteristics, and are considered to have tremendous potential for energy-efficient computing. However, the exploration of SNNs on image generation tasks remains very limited, and a unified and effective structure for SNN-based generative models has yet to be proposed. In this paper, we explore a novel diffusion model architecture within spiking neural networks. We utilize transformer to replace the commonly used U-net structure in mainstream diffusion models. It can generate higher quality images with relatively lower computational cost and shorter sampling time. It aims to provide an empirical baseline for research of generative models based on SNNs. Experiments on MNIST, Fashion-MNIST, and CIFAR-10 datasets demonstrate that our work is highly competitive compared to existing SNN generative models.
△ Less
Submitted 24 February, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
Exploring the Potential of Large Language Models in Artistic Creation: Collaboration and Reflection on Creative Programming
Authors:
Anqi Wang,
Zhizhuo Yin,
Yulu Hu,
Yuanyuan Mao,
Pan Hui
Abstract:
Recently, the potential of large language models (LLMs) has been widely used in assisting programming. However, current research does not explore the artist potential of LLMs in creative coding within artist and AI collaboration. Our work probes the reflection type of artists in the creation process with such collaboration. We compare two common collaboration approaches: invoking the entire progra…
▽ More
Recently, the potential of large language models (LLMs) has been widely used in assisting programming. However, current research does not explore the artist potential of LLMs in creative coding within artist and AI collaboration. Our work probes the reflection type of artists in the creation process with such collaboration. We compare two common collaboration approaches: invoking the entire program and multiple subtasks. Our findings exhibit artists' different stimulated reflections in two different methods. Our finding also shows the correlation of reflection type with user performance, user satisfaction, and subjective experience in two collaborations through conducting two methods, including experimental data and qualitative interviews. In this sense, our work reveals the artistic potential of LLM in creative coding. Meanwhile, we provide a critical lens of human-AI collaboration from the artists' perspective and expound design suggestions for future work of AI-assisted creative tasks.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning
Authors:
Allen M. Wang,
Oswin So,
Charles Dawson,
Darren T. Garnier,
Cristina Rea,
Chuchu Fan
Abstract:
The tokamak offers a promising path to fusion energy, but plasma disruptions pose a major economic risk, motivating considerable advances in disruption avoidance. This work develops a reinforcement learning approach to this problem by training a policy to safely ramp-down the plasma current while avoiding limits on a number of quantities correlated with disruptions. The policy training environment…
▽ More
The tokamak offers a promising path to fusion energy, but plasma disruptions pose a major economic risk, motivating considerable advances in disruption avoidance. This work develops a reinforcement learning approach to this problem by training a policy to safely ramp-down the plasma current while avoiding limits on a number of quantities correlated with disruptions. The policy training environment is a hybrid physics and machine learning model trained on simulations of the SPARC primary reference discharge (PRD) ramp-down, an upcoming burning plasma scenario which we use as a testbed. To address physics uncertainty and model inaccuracies, the simulation environment is massively parallelized on GPU with randomized physics parameters during policy training. The trained policy is then successfully transferred to a higher fidelity simulator where it successfully ramps down the plasma while avoiding user-specified disruptive limits. We also address the crucial issue of safety criticality by demonstrating that a constraint-conditioned policy can be used as a trajectory design assistant to design a library of feed-forward trajectories to handle different physics conditions and user settings. As a library of trajectories is more interpretable and verifiable offline, we argue such an approach is a promising path for leveraging the capabilities of reinforcement learning in the safety-critical context of burning plasma tokamaks. Finally, we demonstrate how the training environment can be a useful platform for other feed-forward optimization approaches by using an evolutionary algorithm to perform optimization of feed-forward trajectories that are robust to physics uncertainty
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
Coexistence of Superconductivity and Antiferromagnetism in Topological Magnet MnBi2Te4 Films
Authors:
Wei Yuan,
Zi-Jie Yan,
Hemian Yi,
Zihao Wang,
Stephen Paolini,
Yi-Fan Zhao,
Ling-Jie Zhou,
Annie G. Wang,
Ke Wang,
Thomas Prokscha,
Zaher Salman,
Andreas Suter,
Purnima P. Balakrishnan,
Alexander J. Grutter,
Laurel E. Winter,
John Singleton,
Moses H. W. Chan,
Cui-Zu Chang
Abstract:
The interface of two materials can harbor unexpected emergent phenomena. One example is interface-induced superconductivity. In this work, we employ molecular beam epitaxy to grow a series of heterostructures formed by stacking together two non-superconducting antiferromagnetic materials, an intrinsic antiferromagnetic topological insulator MnBi2Te4 and an antiferromagnetic iron chalcogenide FeTe.…
▽ More
The interface of two materials can harbor unexpected emergent phenomena. One example is interface-induced superconductivity. In this work, we employ molecular beam epitaxy to grow a series of heterostructures formed by stacking together two non-superconducting antiferromagnetic materials, an intrinsic antiferromagnetic topological insulator MnBi2Te4 and an antiferromagnetic iron chalcogenide FeTe. Our electrical transport measurements reveal interface-induced superconductivity in these heterostructures. By performing scanning tunneling microscopy and spectroscopy measurements, we observe a proximity-induced superconducting gap on the top surface of the MnBi2Te4 layer, confirming the interaction between superconductivity and antiferromagnetism in the MnBi2Te4 layer. Our findings will advance the fundamental inquiries into the topological superconducting phase in hybrid devices and provide a promising platform for the exploration of chiral Majorana physics in MnBi2Te4-based heterostructures.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
New constraints on ultraheavy dark matter from the LZ experiment
Authors:
J. Aalbers,
D. S. Akerib,
A. K. Al Musalhi,
C. S. Amarasinghe,
A. Ames,
T. J. Anderson,
N. Angelides,
H. M. Araújo,
J. E. Armstrong,
M. Arthurs,
A. Baker,
S. Balashov,
J. Bang,
J. W. Bargemann,
A. Baxter,
K. Beattie,
T. Benson,
A. Bhatti,
A. Biekert,
T. P. Biesiadzinski,
H. J. Birch,
E. Bishop,
G. M. Blockinger,
B. Boxer,
C. A. J. Brew
, et al. (174 additional authors not shown)
Abstract:
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal f…
▽ More
Searches for dark matter with liquid xenon time projection chamber experiments have traditionally focused on the region of the parameter space that is characteristic of weakly interacting massive particles, ranging from a few GeV/$c^2$ to a few TeV/$c^2$. Models of dark matter with a mass much heavier than this are well motivated by early production mechanisms different from the standard thermal freeze-out, but they have generally been less explored experimentally. In this work, we present a re-analysis of the first science run (SR1) of the LZ experiment, with an exposure of $0.9$ tonne$\times$year, to search for ultraheavy particle dark matter. The signal topology consists of multiple energy deposits in the active region of the detector forming a straight line, from which the velocity of the incoming particle can be reconstructed on an event-by-event basis. Zero events with this topology were observed after applying the data selection calibrated on a simulated sample of signal-like events. New experimental constraints are derived, which rule out previously unexplored regions of the dark matter parameter space of spin-independent interactions beyond a mass of 10$^{17}$ GeV/$c^2$.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
Poisson flow consistency models for low-dose CT image denoising
Authors:
Dennis Hein,
Adam Wang,
Ge Wang
Abstract:
Diffusion and Poisson flow models have demonstrated remarkable success for a wide range of generative tasks. Nevertheless, their iterative nature results in computationally expensive sampling and the number of function evaluations (NFE) required can be orders of magnitude larger than for single-step methods. Consistency models are a recent class of deep generative models which enable single-step s…
▽ More
Diffusion and Poisson flow models have demonstrated remarkable success for a wide range of generative tasks. Nevertheless, their iterative nature results in computationally expensive sampling and the number of function evaluations (NFE) required can be orders of magnitude larger than for single-step methods. Consistency models are a recent class of deep generative models which enable single-step sampling of high quality data without the need for adversarial training. In this paper, we introduce a novel image denoising technique which combines the flexibility afforded in Poisson flow generative models (PFGM)++ with the, high quality, single step sampling of consistency models. The proposed method first learns a trajectory between a noise distribution and the posterior distribution of interest by training PFGM++ in a supervised fashion. These pre-trained PFGM++ are subsequently "distilled" into Poisson flow consistency models (PFCM) via an updated version of consistency distillation. We call this approach posterior sampling Poisson flow consistency models (PS-PFCM). Our results indicate that the added flexibility of tuning the hyperparameter D, the dimensionality of the augmentation variables in PFGM++, allows us to outperform consistency models, a current state-of-the-art diffusion-style model with NFE=1 on clinical low-dose CT images. Notably, PFCM is in itself a novel family of deep generative models and we provide initial results on the CIFAR-10 dataset.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
OSSAR: Towards Open-Set Surgical Activity Recognition in Robot-assisted Surgery
Authors:
Long Bai,
Guankun Wang,
Jie Wang,
Xiaoxiao Yang,
Huxin Gao,
Xin Liang,
An Wang,
Mobarakol Islam,
Hongliang Ren
Abstract:
In the realm of automated robotic surgery and computer-assisted interventions, understanding robotic surgical activities stands paramount. Existing algorithms dedicated to surgical activity recognition predominantly cater to pre-defined closed-set paradigms, ignoring the challenges of real-world open-set scenarios. Such algorithms often falter in the presence of test samples originating from class…
▽ More
In the realm of automated robotic surgery and computer-assisted interventions, understanding robotic surgical activities stands paramount. Existing algorithms dedicated to surgical activity recognition predominantly cater to pre-defined closed-set paradigms, ignoring the challenges of real-world open-set scenarios. Such algorithms often falter in the presence of test samples originating from classes unseen during training phases. To tackle this problem, we introduce an innovative Open-Set Surgical Activity Recognition (OSSAR) framework. Our solution leverages the hyperspherical reciprocal point strategy to enhance the distinction between known and unknown classes in the feature space. Additionally, we address the issue of over-confidence in the closed set by refining model calibration, avoiding misclassification of unknown classes as known ones. To support our assertions, we establish an open-set surgical activity benchmark utilizing the public JIGSAWS dataset. Besides, we also collect a novel dataset on endoscopic submucosal dissection for surgical activity tasks. Extensive comparisons and ablation experiments on these datasets demonstrate the significant outperformance of our method over existing state-of-the-art approaches. Our proposed solution can effectively address the challenges of real-world surgical scenarios. Our code is publicly accessible at https://github.com/longbai1006/OSSAR.
△ Less
Submitted 10 February, 2024;
originally announced February 2024.
-
Measuring machine learning harms from stereotypes: requires understanding who is being harmed by which errors in what ways
Authors:
Angelina Wang,
Xuechunzi Bai,
Solon Barocas,
Su Lin Blodgett
Abstract:
As machine learning applications proliferate, we need an understanding of their potential for harm. However, current fairness metrics are rarely grounded in human psychological experiences of harm. Drawing on the social psychology of stereotypes, we use a case study of gender stereotypes in image search to examine how people react to machine learning errors. First, we use survey studies to show th…
▽ More
As machine learning applications proliferate, we need an understanding of their potential for harm. However, current fairness metrics are rarely grounded in human psychological experiences of harm. Drawing on the social psychology of stereotypes, we use a case study of gender stereotypes in image search to examine how people react to machine learning errors. First, we use survey studies to show that not all machine learning errors reflect stereotypes nor are equally harmful. Then, in experimental studies we randomly expose participants to stereotype-reinforcing, -violating, and -neutral machine learning errors. We find stereotype-reinforcing errors induce more experientially (i.e., subjectively) harmful experiences, while having minimal changes to cognitive beliefs, attitudes, or behaviors. This experiential harm impacts women more than men. However, certain stereotype-violating errors are more experientially harmful for men, potentially due to perceived threats to masculinity. We conclude that harm cannot be the sole guide in fairness mitigation, and propose a nuanced perspective depending on who is experiencing what harm and why.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Measuring Implicit Bias in Explicitly Unbiased Large Language Models
Authors:
Xuechunzi Bai,
Angelina Wang,
Ilia Sucholutsky,
Thomas L. Griffiths
Abstract:
Large language models (LLMs) can pass explicit social bias tests but still harbor implicit biases, similar to humans who endorse egalitarian beliefs yet exhibit subtle biases. Measuring such implicit biases can be a challenge: as LLMs become increasingly proprietary, it may not be possible to access their embeddings and apply existing bias measures; furthermore, implicit biases are primarily a con…
▽ More
Large language models (LLMs) can pass explicit social bias tests but still harbor implicit biases, similar to humans who endorse egalitarian beliefs yet exhibit subtle biases. Measuring such implicit biases can be a challenge: as LLMs become increasingly proprietary, it may not be possible to access their embeddings and apply existing bias measures; furthermore, implicit biases are primarily a concern if they affect the actual decisions that these systems make. We address both challenges by introducing two new measures of bias: LLM Implicit Bias, a prompt-based method for revealing implicit bias; and LLM Decision Bias, a strategy to detect subtle discrimination in decision-making tasks. Both measures are based on psychological research: LLM Implicit Bias adapts the Implicit Association Test, widely used to study the automatic associations between concepts held in human minds; and LLM Decision Bias operationalizes psychological results indicating that relative evaluations between two candidates, not absolute evaluations assessing each independently, are more diagnostic of implicit biases. Using these measures, we found pervasive stereotype biases mirroring those in society in 8 value-aligned models across 4 social categories (race, gender, religion, health) in 21 stereotypes (such as race and criminality, race and weapons, gender and science, age and negativity). Our prompt-based LLM Implicit Bias measure correlates with existing language model embedding-based bias methods, but better predicts downstream behaviors measured by LLM Decision Bias. These new prompt-based measures draw from psychology's long history of research into measuring stereotype biases based on purely observable behavior; they expose nuanced biases in proprietary value-aligned LLMs that appear unbiased according to standard benchmarks.
△ Less
Submitted 23 May, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
PandaX-xT: a Multi-ten-tonne Liquid Xenon Observatory at the China **** Underground Laboratory
Authors:
PandaX Collaboration,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Chen Cheng,
Zhaokan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Lisheng Geng,
Karl Giboni,
Linhui Gu,
Xunan Guo,
Xuyuan Guo,
Zhichao Guo,
Chencheng Han,
Ke Han,
Changda He,
**rong He,
Di Huang,
Junting Huang,
Zhou Huang,
Ruquan Hou,
Yu Hou
, et al. (68 additional authors not shown)
Abstract:
We propose a major upgrade to the existing PandaX-4T experiment in the China **** Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle phy…
▽ More
We propose a major upgrade to the existing PandaX-4T experiment in the China **** Underground Laboratory. The new experiment, PandaX-xT, will be a multi-ten-tonne liquid xenon, ultra-low background, and general-purpose observatory. The full-scaled PandaX-xT contains a 43-tonne liquid xenon active target. Such an experiment will significantly advance our fundamental understanding of particle physics and astrophysics. The sensitivity of dark matter direct detection will be improved by nearly two orders of magnitude compared to the current best limits, approaching the so-called "neutrino floor" for a dark matter mass above 10 GeV/$c^2$, providing a decisive test to the Weakly Interacting Massive Particle paradigm. By searching for the neutrinoless double beta decay of $^{136}$Xe isotope in the detector, the effective Majorana neutrino mass can be measured to a [10 -- 41] meV/$c^2$ sensitivity, providing a key test to the Dirac/Majorana nature of neutrino s. Astrophysical neutrinos and other ultra-rare interactions can also be measured and searched for with an unprecedented background level, opening up new windows of discovery. Depending on the findings, PandaX-xT will seek the next stage upgrade utilizing isotopic separation on natural xenon.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
Weak-measurement-based pseudospin pointer: A cost-effective scheme for precision measurement
Authors:
Ling Ye,
Lan Luo,
An Wang,
Rongchun Ge,
Zhiyou Zhang
Abstract:
As an essential component of state-of-the-art quantum technologies, fast and efficient quantum measurements are in persistent demand over time. We present a proof-of-principle experiment on a new dimensionless pseudo-spin pointer based on weak measurement. In the context of optical parameter estimation, we demonstrate that the parametric distribution's moment is obtained experimentally by employin…
▽ More
As an essential component of state-of-the-art quantum technologies, fast and efficient quantum measurements are in persistent demand over time. We present a proof-of-principle experiment on a new dimensionless pseudo-spin pointer based on weak measurement. In the context of optical parameter estimation, we demonstrate that the parametric distribution's moment is obtained experimentally by employing the dimensionless pointer without measuring the distribution literally. In addition to the sheer liberation of experimental expense, the photon-countering-based pointer is well-calibrated for the detection of weak signals. We show that for signals $3$-$4$ orders of weaker in strength than the area-array camera method, an order of improvement in precision is achieved experimentally.
△ Less
Submitted 12 June, 2024; v1 submitted 3 February, 2024;
originally announced February 2024.
-
Large language models cannot replace human participants because they cannot portray identity groups
Authors:
Angelina Wang,
Jamie Morgenstern,
John P. Dickerson
Abstract:
Large language models (LLMs) are increasing in capability and popularity, propelling their application in new domains -- including as replacements for human participants in computational social science, user testing, annotation tasks, and more. Traditionally, in all of these settings survey distributors are careful to find representative samples of the human population to ensure the validity of th…
▽ More
Large language models (LLMs) are increasing in capability and popularity, propelling their application in new domains -- including as replacements for human participants in computational social science, user testing, annotation tasks, and more. Traditionally, in all of these settings survey distributors are careful to find representative samples of the human population to ensure the validity of their results and understand potential demographic differences. This means in order to be a suitable replacement, LLMs will need to be able to capture the influence of positionality (i.e., relevance of social identities like gender and race). However, we show that there are two inherent limitations in the way current LLMs are trained that prevent this. We argue analytically for why LLMs are doomed to both misportray and flatten the representations of demographic groups, then empirically show this to be true on 4 LLMs through a series of human studies with 3200 participants across 16 demographic identities. We also discuss a third consideration about how identity prompts can essentialize identities. Throughout, we connect each of these limitations to a pernicious history that shows why each is harmful for marginalized demographic groups. Overall, we urge caution in use cases where LLMs are intended to replace human participants whose identities are relevant to the task at hand. At the same time, in cases where the goal is to supplement rather than replace (e.g., pilot studies), we provide empirically-better inference-time techniques to reduce, but not remove, these harms.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Self-supervised learning of video representations from a child's perspective
Authors:
A. Emin Orhan,
Wentao Wang,
Alex N. Wang,
Mengye Ren,
Brenden M. Lake
Abstract:
Children learn powerful internal models of the world around them from a few years of egocentric visual experience. Can such internal models be learned from a child's visual experience with highly generic learning algorithms or do they require strong inductive biases? Recent advances in collecting large-scale, longitudinal, developmentally realistic video datasets and generic self-supervised learni…
▽ More
Children learn powerful internal models of the world around them from a few years of egocentric visual experience. Can such internal models be learned from a child's visual experience with highly generic learning algorithms or do they require strong inductive biases? Recent advances in collecting large-scale, longitudinal, developmentally realistic video datasets and generic self-supervised learning (SSL) algorithms are allowing us to begin to tackle this nature vs. nurture question. However, existing work typically focuses on image-based SSL algorithms and visual capabilities that can be learned from static images (e.g. object recognition), thus ignoring temporal aspects of the world. To close this gap, here we train self-supervised video models on longitudinal, egocentric headcam recordings collected from a child over a two year period in their early development (6-31 months). The resulting models are highly effective at facilitating the learning of action concepts from a small number of labeled examples; they have favorable data size scaling properties; and they display emergent video interpolation capabilities. Video models also learn more robust object representations than image-based models trained with the exact same data. These results suggest that important temporal aspects of a child's internal model of the world may be learnable from their visual experience using highly generic learning algorithms and without strong inductive biases.
△ Less
Submitted 31 January, 2024;
originally announced February 2024.
-
Experimental Demonstration of Attosecond Pump-Probe Spectroscopy with an X-ray Free-Electron Laser
Authors:
Zhaoheng Guo,
Taran Driver,
Sandra Beauvarlet,
David Cesar,
Joseph Duris,
Paris L. Franz,
Oliver Alexander,
Dorian Bohler,
Christoph Bostedt,
Vitali Averbukh,
Xinxin Cheng,
Louis F. DiMauro,
Gilles Doumy,
Ruaridh Forbes,
Oliver Gessner,
James M. Glownia,
Erik Isele,
Andrei Kamalov,
Kirk A. Larsen,
Siqi Li,
Xiang Li,
Ming-Fu Lin,
Gregory A. McCracken,
Razib Obaid,
Jordan T. ONeal
, et al. (25 additional authors not shown)
Abstract:
Pump-probe experiments with sub-femtosecond resolution are the key to understanding electronic dynamics in quantum systems. Here we demonstrate the generation and control of sub-femtosecond pulse pairs from a two-colour X-ray free-electron laser (XFEL). By measuring the delay between the two pulses with an angular streaking diagnostic, we characterise the group velocity of the XFEL and demonstrate…
▽ More
Pump-probe experiments with sub-femtosecond resolution are the key to understanding electronic dynamics in quantum systems. Here we demonstrate the generation and control of sub-femtosecond pulse pairs from a two-colour X-ray free-electron laser (XFEL). By measuring the delay between the two pulses with an angular streaking diagnostic, we characterise the group velocity of the XFEL and demonstrate control of the pulse delay down to 270 as. We demonstrate the application of this technique to a pump-probe measurement in core-excited para-aminophenol. These results demonstrate the ability to perform pump-probe experiments with sub-femtosecond resolution and atomic site specificity.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
Physically Informed Synchronic-adaptive Learning for Industrial Systems Modeling in Heterogeneous Media with Unavailable Time-varying Interface
Authors:
Aina Wang,
Pan Qin,
Xi-Ming Sun
Abstract:
Partial differential equations (PDEs) are commonly employed to model complex industrial systems characterized by multivariable dependence. Existing physics-informed neural networks (PINNs) excel in solving PDEs in a homogeneous medium. However, their feasibility is diminished when PDE parameters are unknown due to a lack of physical attributions and time-varying interface is unavailable arising fr…
▽ More
Partial differential equations (PDEs) are commonly employed to model complex industrial systems characterized by multivariable dependence. Existing physics-informed neural networks (PINNs) excel in solving PDEs in a homogeneous medium. However, their feasibility is diminished when PDE parameters are unknown due to a lack of physical attributions and time-varying interface is unavailable arising from heterogeneous media. To this end, we propose a data-physics-hybrid method, physically informed synchronic-adaptive learning (PISAL), to solve PDEs for industrial systems modeling in heterogeneous media. First, Net1, Net2, and NetI, are constructed to approximate the solutions satisfying PDEs and the interface. Net1 and Net2 are utilized to synchronously learn each solution satisfying PDEs with diverse parameters, while NetI is employed to adaptively learn the unavailable time-varying interface. Then, a criterion combined with NetI is introduced to adaptively distinguish the attributions of measurements and collocation points. Furthermore, NetI is integrated into a data-physics-hybrid loss function. Accordingly, a synchronic-adaptive learning (SAL) strategy is proposed to decompose and optimize each subdomain. Besides, we theoretically prove the approximation capability of PISAL. Extensive experimental results verify that the proposed PISAL can be used for industrial systems modeling in heterogeneous media, which faces the challenges of lack of physical attributions and unavailable time-varying interface.
△ Less
Submitted 25 January, 2024;
originally announced January 2024.
-
A regular MOG black hole's impact on shadows and gravitational weak lensing in the presence of quintessence field
Authors:
Ahmad Al-Badawi,
Sanjar Shaymatov,
Mirzabek Alloqulov,
Anzhong Wang
Abstract:
We investigate the impact of the modified gravity (MOG) field and the quintessence scalar field on horizon evolution, black hole (BH) shadow and the weak gravitational lensing around a static spherically symmetric BH. We first begin to write the BH metric associated with the MOG parameter and quintessence scalar field. We then determine the BH shadow and obtain numerical solutions for the photon s…
▽ More
We investigate the impact of the modified gravity (MOG) field and the quintessence scalar field on horizon evolution, black hole (BH) shadow and the weak gravitational lensing around a static spherically symmetric BH. We first begin to write the BH metric associated with the MOG parameter and quintessence scalar field. We then determine the BH shadow and obtain numerical solutions for the photon sphere and shadow radius. We show that the MOG ($α$) and the quintessence ($c$) parameters have a significant impact on BH shadow and photon sphere. Based on the analysis, we further show that the combined effects of the MOG parameter and quintessential field can increase the values of BH shadow and photon sphere radii. We also obtain constraints on the BH parameters by applying the observational data of Sgr A$^{\star}$ and M87$^{\star}$. Finally, we consider the weak deflection angle of BH within the context of the Gauss-Bonnet theorem (GBT) and show that the combined effects of the MOG and quintessence parameters do make the value of the deflection angle grow, referring to remarkable property being in well agreement with the physical meaning of both parameters that can maintain the strong gravitational field in the surrounding environment of BH.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
Electron mobility of SnO2 from first principles
Authors:
Amanda Wang,
Kyle Bushick,
Nick Pant,
Woncheol Lee,
Xiao Zhang,
Joshua Leveillee,
Feliciano Giustino,
Samuel Poncé,
Emmanouil Kioupakis
Abstract:
The transparent conducting oxide SnO2 is a wide bandgap semiconductor that is easily n-type doped and widely used in various electronic and optoelectronic applications. Experimental reports of the electron mobility of this material vary widely depending on the growth conditions and do** concentrations. In this work, we calculate the electron mobility of SnO2 from first principles to examine the…
▽ More
The transparent conducting oxide SnO2 is a wide bandgap semiconductor that is easily n-type doped and widely used in various electronic and optoelectronic applications. Experimental reports of the electron mobility of this material vary widely depending on the growth conditions and do** concentrations. In this work, we calculate the electron mobility of SnO2 from first principles to examine the temperature- and do**-concentration dependence, and to elucidate the scattering mechanisms that limit transport. We include both electron-phonon scattering and electron-ionized impurity scattering to accurately model scattering in a doped semiconductor. We find a strongly anisotropic mobility that favors transport in the direction parallel to the c-axis. At room temperature and intrinsic carrier concentrations, the low-energy polar-optical phonon modes dominate scattering, while ionized-impurity scattering dominates above 10^18 cm^-3.
△ Less
Submitted 22 April, 2024; v1 submitted 22 January, 2024;
originally announced January 2024.
-
Evidence for Unfolded Fermi Surfaces in the Charge-Density-Wave State of Kagome Metal FeGe Revealed by de Haas-van Alphen Effect
Authors:
Kaixin Tang,
Han**g Zhou,
Houpu Li,
Senyang Pan,
Xueliang Wu,
Hongyu Li,
Nan Zhang,
Chuanying Xi,
**glei Zhang,
Aifeng Wang,
Xiangang Wan,
Ziji Xiang,
Xianhui Chen
Abstract:
The antiferromagnetic kagome lattice compound FeGe has been revealed to host an emergent charge-density-wave (CDW) state which manifests complex interplay between the spin, charge and lattice degrees of freedom. Here, we present a comprehensive study of the de Haas-van Alphen effect by measuring torque magnetometry under magnetic fields up to 45.2 T to map Fermi surfaces in this unusual CDW state.…
▽ More
The antiferromagnetic kagome lattice compound FeGe has been revealed to host an emergent charge-density-wave (CDW) state which manifests complex interplay between the spin, charge and lattice degrees of freedom. Here, we present a comprehensive study of the de Haas-van Alphen effect by measuring torque magnetometry under magnetic fields up to 45.2 T to map Fermi surfaces in this unusual CDW state. For field along the $c$ direction, we resolve four cyclotron orbits; the largest one roughly corresponding to the area of the 2$\times$2 folded Brillouin zone. Three smaller orbits are characterized by light effective cyclotron masses range from (0.18-0.30) $m_e$. Angle-resolved measurements identify one Fermi surface segment with weak anisotropy. Combined with band structure calculations, our results suggest that features of unfolded Fermi surfaces are robust against CDW reconstruction, corroborating the novel effect of a short-ranged CDW on the electronic structure.
△ Less
Submitted 22 January, 2024;
originally announced January 2024.
-
Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning
Authors:
Adib Hasan,
Ileana Rugina,
Alex Wang
Abstract:
Large Language Models (LLMs) are susceptible to `jailbreaking' prompts, which can induce the generation of harmful content. This paper demonstrates that moderate WANDA pruning (Sun et al., 2023) can increase their resistance to such attacks without the need for fine-tuning, while maintaining performance on standard benchmarks. Our findings suggest that the benefits of pruning correlate with the in…
▽ More
Large Language Models (LLMs) are susceptible to `jailbreaking' prompts, which can induce the generation of harmful content. This paper demonstrates that moderate WANDA pruning (Sun et al., 2023) can increase their resistance to such attacks without the need for fine-tuning, while maintaining performance on standard benchmarks. Our findings suggest that the benefits of pruning correlate with the initial safety levels of the model, indicating a regularizing effect of WANDA pruning. We introduce a dataset of 225 harmful tasks across five categories to systematically evaluate this safety enhancement. We argue that safety improvements can be understood through a regularization perspective. First, we show that pruning helps LLMs focus more effectively on task-relevant tokens within jailbreaking prompts. Then, we analyze the effects of pruning on the perplexity of malicious prompts before and after their integration into jailbreak templates. Finally, we demonstrate statistically significant performance improvements under domain shifts when applying WANDA to linear models.
△ Less
Submitted 28 April, 2024; v1 submitted 19 January, 2024;
originally announced January 2024.
-
BreastRegNet: A Deep Learning Framework for Registration of Breast Faxitron and Histopathology Images
Authors:
Negar Golestani,
Aihui Wang,
Gregory R Bean,
Mirabela Rusu
Abstract:
A standard treatment protocol for breast cancer entails administering neoadjuvant therapy followed by surgical removal of the tumor and surrounding tissue. Pathologists typically rely on cabinet X-ray radiographs, known as Faxitron, to examine the excised breast tissue and diagnose the extent of residual disease. However, accurately determining the location, size, and focality of residual cancer c…
▽ More
A standard treatment protocol for breast cancer entails administering neoadjuvant therapy followed by surgical removal of the tumor and surrounding tissue. Pathologists typically rely on cabinet X-ray radiographs, known as Faxitron, to examine the excised breast tissue and diagnose the extent of residual disease. However, accurately determining the location, size, and focality of residual cancer can be challenging, and incorrect assessments can lead to clinical consequences. The utilization of automated methods can improve the histopathology process, allowing pathologists to choose regions for sampling more effectively and precisely. Despite the recognized necessity, there are currently no such methods available. Training such automated detection models require accurate ground truth labels on ex-vivo radiology images, which can be acquired through registering Faxitron and histopathology images and map** the extent of cancer from histopathology to x-ray images. This study introduces a deep learning-based image registration approach trained on mono-modal synthetic image pairs. The models were trained using data from 50 women who received neoadjuvant chemotherapy and underwent surgery. The results demonstrate that our method is faster and yields significantly lower average landmark error ($2.1\pm1.96$ mm) over the state-of-the-art iterative ($4.43\pm4.1$ mm) and deep learning ($4.02\pm3.15$ mm) approaches. Improved performance of our approach in integrating radiology and pathology information facilitates generating large datasets, which allows training models for more accurate breast cancer detection.
△ Less
Submitted 18 January, 2024;
originally announced January 2024.
-
An Empirical Study of Counterfactual Visualization to Support Visual Causal Inference
Authors:
Arran Zeyu Wang,
David Borland,
David Gotz
Abstract:
Counterfactuals -- expressing what might have been true under different circumstances -- have been widely applied in statistics and machine learning to help understand causal relationships. More recently, counterfactuals have begun to emerge as a technique being applied within visualization research. However, it remains unclear to what extent counterfactuals can aid with visual data communication.…
▽ More
Counterfactuals -- expressing what might have been true under different circumstances -- have been widely applied in statistics and machine learning to help understand causal relationships. More recently, counterfactuals have begun to emerge as a technique being applied within visualization research. However, it remains unclear to what extent counterfactuals can aid with visual data communication. In this paper, we primarily focus on assessing the quality of users' understanding of data when provided with counterfactual visualizations. We propose a preliminary model of causality comprehension by connecting theories from causal inference and visual data communication. Leveraging this model, we conducted an empirical study to explore how counterfactuals can improve users' understanding of data in static visualizations. Our results indicate that visualizing counterfactuals had a positive impact on participants' interpretations of causal relations within datasets. These results motivate a discussion of how to more effectively incorporate counterfactuals into data visualizations.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Using Counterfactuals to Improve Causal Inferences from Visualizations
Authors:
David Borland,
Arran Zeyu Wang,
David Gotz
Abstract:
Traditional approaches to data visualization have often focused on comparing different subsets of data, and this is reflected in the many techniques developed and evaluated over the years for visual comparison. Similarly, common workflows for exploratory visualization are built upon the idea of users interactively applying various filter and grou** mechanisms in search of new insights. This para…
▽ More
Traditional approaches to data visualization have often focused on comparing different subsets of data, and this is reflected in the many techniques developed and evaluated over the years for visual comparison. Similarly, common workflows for exploratory visualization are built upon the idea of users interactively applying various filter and grou** mechanisms in search of new insights. This paradigm has proven effective at hel** users identify correlations between variables that can inform thinking and decision-making. However, recent studies show that consumers of visualizations often draw causal conclusions even when not supported by the data. Motivated by these observations, this article highlights recent advances from a growing community of researchers exploring methods that aim to directly support visual causal inference. However, many of these approaches have their own limitations which limit their use in many real-world scenarios. This article therefore also outlines a set of key open challenges and corresponding priorities for new research to advance the state of the art in visual causal inference.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Measurement of Solar $pp$ Neutrino Flux using Electron Recoil Data from PandaX-4T Commissioning Run
Authors:
PandaX Collaboration,
Xiaoying Lu,
Abdusalam Abdukerim,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Chen Cheng,
Zhaokan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Lisheng Geng,
Karl Giboni,
Xuyuan Guo,
Chencheng Han,
Ke Han,
Changda He,
**rong He,
Di Huang,
Junting Huang,
Zhou Huang,
Ruquan Hou,
Yu Hou,
Xiangdong Ji
, et al. (67 additional authors not shown)
Abstract:
The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning dat…
▽ More
The proton-proton ($pp$) fusion chain dominates the neutrino production from the Sun. The uncertainty of the predicted $pp$ neutrino flux is at the sub-percent level, whereas that of the best measurement is $\mathcal{O}(10\%)$. In this paper, we present the first result to measure the solar $pp$ neutrinos in the electron recoil energy range from 24 to 144 keV, using the PandaX-4T commissioning data with 0.63 tonne$\times$year exposure. The $pp$ neutrino flux is determined to be $(8.0 \pm 3.9 \,{\rm{(stat)}} \pm 10.0 \,{\rm{(syst)}} )\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$, consistent with Standard Solar Model and existing measurements, corresponding to a flux upper limit of $23.3\times 10^{10}\, $$\rm{s}^{-1} \rm{cm}^{-2}$ at 90\% C.L..
△ Less
Submitted 2 July, 2024; v1 submitted 13 January, 2024;
originally announced January 2024.
-
Performance study of the JadePix-3 telescope from a beam test
Authors:
Sheng Dong,
Zhiliang Chen,
Jia Zhou,
Xinye Zhai,
Anqi Wang,
Yunxiang Wang,
Hulin Wang,
Lailin Xu,
**g Dong,
Yang Zhou,
Yunpeng Lu,
Mingyi Dong,
Hongyu Zhang,
Qun Ouyang
Abstract:
We present the results of a beam test conducted on a telescope using the JadePix-3 pixel sensor, developed with TowerJazz 180 nm CMOS imaging technology. The telescope is composed of five planes, each equipped with a JadePix-3 sensor with pitches of 26 um x 16 um and 23.11 um x 16 um. In addition, it features an FPGA-based synchronous readout system. The telescope underwent testing using an electr…
▽ More
We present the results of a beam test conducted on a telescope using the JadePix-3 pixel sensor, developed with TowerJazz 180 nm CMOS imaging technology. The telescope is composed of five planes, each equipped with a JadePix-3 sensor with pitches of 26 um x 16 um and 23.11 um x 16 um. In addition, it features an FPGA-based synchronous readout system. The telescope underwent testing using an electron beam with energy ranging from 4 to 6 GeV. At an electron energy of 5.4 GeV, the telescope demonstrated superior spatial resolutions of 2.6 and 2.3 um in two dimensions. By designating the central plane as the device under test, we evaluated the JadePix-3 sensor's spatial resolutions as 5.2 and 4.6 um in two dimensions, achieving a detection efficiency of over 99%.
△ Less
Submitted 26 May, 2024; v1 submitted 12 January, 2024;
originally announced January 2024.
-
Analytical impact excitation of Er/O/B co-doped Si light emitting diodes
Authors:
Xiaoming Wang,
Jia**g He,
Ao Wang,
Kun Zhang,
Yufei Sheng,
Weida Hu,
Chaoyuan **,
Hua Bao,
Ya** Dan
Abstract:
Er doped Si light emitting diodes may find important applications in the generation and storage of quantum information. These diodes exhibit an emission efficiency two orders of magnitude higher at reverse bias than forward bias due to impact excitation. However, physics of impact excitation in these devices remains largely unexplored. In this work, we fabricated an Er/O/B co-doped Si light emitti…
▽ More
Er doped Si light emitting diodes may find important applications in the generation and storage of quantum information. These diodes exhibit an emission efficiency two orders of magnitude higher at reverse bias than forward bias due to impact excitation. However, physics of impact excitation in these devices remains largely unexplored. In this work, we fabricated an Er/O/B co-doped Si light emitting diode which exhibits a strong electro-luminescence by the impact excitation of electrons inelastically colliding the Er ions. An analytical impact excitation theory was established to predict the electroluminescence intensity and internal quantum efficiency which fit well with the experimental data. From the fittings, we find that the excitable Er ions reach a record concentration of 1.9 x 10^19 cm-3 and up to 45% of them are in excitation state by impact excitation. This work has important implications for develo** efficient classical and quantum light sources based on rare earth elements.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Towards Conversational Diagnostic AI
Authors:
Tao Tu,
Anil Palepu,
Mike Schaekermann,
Khaled Saab,
Jan Freyberg,
Ryutaro Tanno,
Amy Wang,
Brenna Li,
Mohamed Amin,
Nenad Tomasev,
Shekoofeh Azizi,
Karan Singhal,
Yong Cheng,
Le Hou,
Albert Webson,
Kavita Kulkarni,
S Sara Mahdavi,
Christopher Semturs,
Juraj Gottweis,
Joelle Barral,
Katherine Chou,
Greg S Corrado,
Yossi Matias,
Alan Karthikesalingam,
Vivek Natarajan
Abstract:
At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introdu…
▽ More
At the heart of medicine lies the physician-patient dialogue, where skillful history-taking paves the way for accurate diagnosis, effective management, and enduring trust. Artificial Intelligence (AI) systems capable of diagnostic dialogue could increase accessibility, consistency, and quality of care. However, approximating clinicians' expertise is an outstanding grand challenge. Here, we introduce AMIE (Articulate Medical Intelligence Explorer), a Large Language Model (LLM) based AI system optimized for diagnostic dialogue.
AMIE uses a novel self-play based simulated environment with automated feedback mechanisms for scaling learning across diverse disease conditions, specialties, and contexts. We designed a framework for evaluating clinically-meaningful axes of performance including history-taking, diagnostic accuracy, management reasoning, communication skills, and empathy. We compared AMIE's performance to that of primary care physicians (PCPs) in a randomized, double-blind crossover study of text-based consultations with validated patient actors in the style of an Objective Structured Clinical Examination (OSCE). The study included 149 case scenarios from clinical providers in Canada, the UK, and India, 20 PCPs for comparison with AMIE, and evaluations by specialist physicians and patient actors. AMIE demonstrated greater diagnostic accuracy and superior performance on 28 of 32 axes according to specialist physicians and 24 of 26 axes according to patient actors. Our research has several limitations and should be interpreted with appropriate caution. Clinicians were limited to unfamiliar synchronous text-chat which permits large-scale LLM-patient interactions but is not representative of usual clinical practice. While further research is required before AMIE could be translated to real-world settings, the results represent a milestone towards conversational diagnostic AI.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Learning-Based Difficulty Calibration for Enhanced Membership Inference Attacks
Authors:
Haonan Shi,
Tu Ouyang,
An Wang
Abstract:
Machine learning models, in particular deep neural networks, are currently an integral part of various applications, from healthcare to finance. However, using sensitive data to train these models raises concerns about privacy and security. One method that has emerged to verify if the trained models are privacy-preserving is Membership Inference Attacks (MIA), which allows adversaries to determine…
▽ More
Machine learning models, in particular deep neural networks, are currently an integral part of various applications, from healthcare to finance. However, using sensitive data to train these models raises concerns about privacy and security. One method that has emerged to verify if the trained models are privacy-preserving is Membership Inference Attacks (MIA), which allows adversaries to determine whether a specific data point was part of a model's training dataset. While a series of MIAs have been proposed in the literature, only a few can achieve high True Positive Rates (TPR) in the low False Positive Rate (FPR) region (0.01%~1%). This is a crucial factor to consider for an MIA to be practically useful in real-world settings. In this paper, we present a novel approach to MIA that is aimed at significantly improving TPR at low FPRs. Our method, named learning-based difficulty calibration for MIA(LDC-MIA), characterizes data records by their hardness levels using a neural network classifier to determine membership. The experiment results show that LDC-MIA can improve TPR at low FPR by up to 4x compared to the other difficulty calibration based MIAs. It also has the highest Area Under ROC curve (AUC) across all datasets. Our method's cost is comparable with most of the existing MIAs, but is orders of magnitude more efficient than one of the state-of-the-art methods, LiRA, while achieving similar performance.
△ Less
Submitted 9 July, 2024; v1 submitted 9 January, 2024;
originally announced January 2024.
-
A coronary artery phantom for task-based CT performance assessment and a comparative study of clinical CT, photon counting CT, and micro CT
Authors:
Jed D. Pack,
Paul Fitzgerald,
Stephen Araujo,
Ying Fan,
Grant Stevens,
Jonathan Gerdes,
Adam Wang,
Koen Nieman,
Ge Wang,
Bruno De Man
Abstract:
While drastic improvements in CT technology have occurred in the past 25 years, spatial resolution is one area where progress has been limited until recently. New photon counting CT systems, are capable of much better spatial resolution than their (energy integrating) predecessors. These improvements have the potential to improve the evaluation obstructive coronary artery disease by enabling more…
▽ More
While drastic improvements in CT technology have occurred in the past 25 years, spatial resolution is one area where progress has been limited until recently. New photon counting CT systems, are capable of much better spatial resolution than their (energy integrating) predecessors. These improvements have the potential to improve the evaluation obstructive coronary artery disease by enabling more accurate delineation between calcified plaque and coronary vessel lumen. A new set of vessel phantoms has been designed and manufactured for quantifying this improvement. Comparisons are made between an existing clinical CT system, a prototype photon counting system, with images from a micro CT system being used as the gold standard. Scans were made of the same objects on all three systems. The resulting images were registered and the luminal cross section areas were compared. Luminal cross-sections near calcified plaques were reduced due to blooming, but this effect was much less pronounced in images from the prototype photon counting system as compared to the images from the clinical CT system.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training
Authors:
Alex **peng Wang,
Linjie Li,
Kevin Qinghong Lin,
Jianfeng Wang,
Kevin Lin,
Zhengyuan Yang,
Lijuan Wang,
Mike Zheng Shou
Abstract:
In the evolution of Vision-Language Pre-training, shifting from short-text comprehension to encompassing extended textual contexts is pivotal. Recent autoregressive vision-language models like \cite{flamingo, palme}, leveraging the long-context capability of Large Language Models, have excelled in few-shot text generation tasks but face challenges in alignment tasks. Addressing this gap, we introd…
▽ More
In the evolution of Vision-Language Pre-training, shifting from short-text comprehension to encompassing extended textual contexts is pivotal. Recent autoregressive vision-language models like \cite{flamingo, palme}, leveraging the long-context capability of Large Language Models, have excelled in few-shot text generation tasks but face challenges in alignment tasks. Addressing this gap, we introduce the contrastive loss into text generation models, presenting the COntrastive-Streamlined MultimOdal framework (\ModelName), strategically partitioning the language model into dedicated unimodal text processing and adept multimodal data handling components. \ModelName, our unified framework, merges unimodal and multimodal elements, enhancing model performance for tasks involving textual and visual data while notably reducing learnable parameters. However, these models demand extensive long-text datasets, yet the availability of high-quality long-text video datasets remains limited. To bridge this gap, this work introduces \VideoDatasetName, an inaugural interleaved video-text dataset featuring comprehensive captions, marking a significant step forward. Demonstrating its impact, we illustrate how \VideoDatasetName{} enhances model performance in image-text tasks. With 34% learnable parameters and utilizing 72\% of the available data, our model demonstrates significant superiority over OpenFlamingo~\cite{openflamingo}. For instance, in the 4-shot flickr captioning task, performance notably improves from 57.2% to 65.\%. The contributions of \ModelName{} and \VideoDatasetName{} are underscored by notable performance gains across 14 diverse downstream datasets encompassing both image-text and video-text tasks.
△ Less
Submitted 1 January, 2024;
originally announced January 2024.
-
SAR-RARP50: Segmentation of surgical instrumentation and Action Recognition on Robot-Assisted Radical Prostatectomy Challenge
Authors:
Dimitrios Psychogyios,
Emanuele Colleoni,
Beatrice Van Amsterdam,
Chih-Yang Li,
Shu-Yu Huang,
Yuchong Li,
Fucang Jia,
Baosheng Zou,
Guotai Wang,
Yang Liu,
Maxence Boels,
Jiayu Huo,
Rachel Sparks,
Prokar Dasgupta,
Alejandro Granados,
Sebastien Ourselin,
Mengya Xu,
An Wang,
Yanan Wu,
Long Bai,
Hongliang Ren,
Atsushi Yamada,
Yuriko Harai,
Yuto Ishikawa,
Kazuyuki Hayashi
, et al. (25 additional authors not shown)
Abstract:
Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segme…
▽ More
Surgical tool segmentation and action recognition are fundamental building blocks in many computer-assisted intervention applications, ranging from surgical skills assessment to decision support systems. Nowadays, learning-based action recognition and segmentation approaches outperform classical methods, relying, however, on large, annotated datasets. Furthermore, action recognition and tool segmentation algorithms are often trained and make predictions in isolation from each other, without exploiting potential cross-task relationships. With the EndoVis 2022 SAR-RARP50 challenge, we release the first multimodal, publicly available, in-vivo, dataset for surgical action recognition and semantic instrumentation segmentation, containing 50 suturing video segments of Robotic Assisted Radical Prostatectomy (RARP). The aim of the challenge is twofold. First, to enable researchers to leverage the scale of the provided dataset and develop robust and highly accurate single-task action recognition and tool segmentation approaches in the surgical domain. Second, to further explore the potential of multitask-based learning approaches and determine their comparative advantage against their single-task counterparts. A total of 12 teams participated in the challenge, contributing 7 action recognition methods, 9 instrument segmentation techniques, and 4 multitask approaches that integrated both action recognition and instrument segmentation. The complete SAR-RARP50 dataset is available at: https://rdr.ucl.ac.uk/projects/SARRARP50_Segmentation_of_surgical_instrumentation_and_Action_Recognition_on_Robot-Assisted_Radical_Prostatectomy_Challenge/191091
△ Less
Submitted 23 January, 2024; v1 submitted 31 December, 2023;
originally announced January 2024.
-
Electrical and thermal transport properties of kagome metals AV$_3$Sb$_5$ (A=K, Rb, Cs)
Authors:
Xinrun Mi,
Kunya Yang,
Yuhan Gan,
Long Zhang,
Aifeng Wang,
Yisheng Chai,
Xiaoyuan Zhou,
Mingquan He
Abstract:
The interplay between lattice geometry, band topology and electronic correlations in the newly discovered kagome compounds AV$_3$Sb$_5$ (A=K, Rb, Cs) makes this family a novel playground to investigate emergent quantum phenomena, such as unconventional superconductivity, chiral charge density wave and electronic nematicity. These exotic quantum phases naturally leave nontrivial fingerprints in tra…
▽ More
The interplay between lattice geometry, band topology and electronic correlations in the newly discovered kagome compounds AV$_3$Sb$_5$ (A=K, Rb, Cs) makes this family a novel playground to investigate emergent quantum phenomena, such as unconventional superconductivity, chiral charge density wave and electronic nematicity. These exotic quantum phases naturally leave nontrivial fingerprints in transport properties of AV$_3$Sb$_5$, both in electrical and thermal channels, which are prominent probes to uncover the underlying mechanisms. In this brief review, we highlight the unusual electrical and thermal transport properties observed in the unconventional charge ordered state of AV3Sb5, including giant anomalous Hall, anomalous Nernst, ambipolar Nernst and anomalous thermal Hall effects. Connections of these anomalous transport properties to time-reversal symmetry breaking, topological and multiband fermiology, as well as electronic nematicity, are also discussed. Finally, a perspective together with challenges of this rapid growing field are given.
△ Less
Submitted 31 December, 2023;
originally announced January 2024.
-
Improvement on the Linearity Response of PandaX-4T with new Photomultiplier Tubes Bases
Authors:
Lingyin Luo,
Deqing Fang,
Ke Han,
Di Huang,
Xiaofeng Shang,
Anqing Wang,
Qiuhong Wang,
Shaobo Wang,
Siguang Wang,
Xiang Xiao,
Binbin Yan,
Xiyu Yan
Abstract:
With the expanding reach of physics, xenon-based detectors such as PandaX-4T in the China **** Underground Laboratory aim to cover an energy range from sub-keV to multi-MeV. A linear response of the photomultiplier tubes (PMTs) is required for both scintillation and electroluminescence signals. Through a dedicated bench test, we investigated the cause of the non-linear response in the Hamamatsu…
▽ More
With the expanding reach of physics, xenon-based detectors such as PandaX-4T in the China **** Underground Laboratory aim to cover an energy range from sub-keV to multi-MeV. A linear response of the photomultiplier tubes (PMTs) is required for both scintillation and electroluminescence signals. Through a dedicated bench test, we investigated the cause of the non-linear response in the Hamamatsu R11410-23 PMTs used in PandaX-4T. The saturation and suppression of the PMT waveform observed during the commissioning of PandaX-4T were caused by the high-voltage divider base. The bench test data validated the de-saturation algorithm used in the PandaX-4T data analysis. We also confirmed the improvement in linearity of a new PMT base design, which will be used to upgrade the PMT readout system in PandaX-4T.
△ Less
Submitted 7 April, 2024; v1 submitted 30 December, 2023;
originally announced January 2024.
-
Learning to Generate Text in Arbitrary Writing Styles
Authors:
Aleem Khan,
Andrew Wang,
Sophia Hager,
Nicholas Andrews
Abstract:
Prior work in style-controlled text generation has focused on tasks such as emulating the style of prolific literary authors, producing formal or informal text, and mitigating toxicity of generated text. Plentiful demonstrations of these styles are available, and as a result modern language models are often able to emulate them, either via prompting or discriminative control. However, in applicati…
▽ More
Prior work in style-controlled text generation has focused on tasks such as emulating the style of prolific literary authors, producing formal or informal text, and mitigating toxicity of generated text. Plentiful demonstrations of these styles are available, and as a result modern language models are often able to emulate them, either via prompting or discriminative control. However, in applications such as writing assistants, it is desirable for language models to produce text in an author-specific style on the basis of a potentially small writing sample. For example, someone writing in a particular dialect may prefer writing suggestions that retain the same dialect. We find that instruction-tuned language models can struggle to reproduce author-specific style demonstrated in a prompt. Instead, we propose to guide a language model to generate text in a target style using contrastively-trained representations that capture stylometric features. Our approach (StyleMC) combines an author-adapted language model with sequence-level inference to improve stylistic consistency, and is found to be effective in a variety of conditions, including unconditional generation and style transfer. Additionally, we find that the proposed approach can serve as an effective anonymization method, by editing a document to mask authorship while preserving the original meaning
△ Less
Submitted 4 March, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
HISR: Hybrid Implicit Surface Representation for Photorealistic 3D Human Reconstruction
Authors:
Angtian Wang,
Yuanlu Xu,
Nikolaos Sarafianos,
Robert Maier,
Edmond Boyer,
Alan Yuille,
Tony Tung
Abstract:
Neural reconstruction and rendering strategies have demonstrated state-of-the-art performances due, in part, to their ability to preserve high level shape details. Existing approaches, however, either represent objects as implicit surface functions or neural volumes and still struggle to recover shapes with heterogeneous materials, in particular human skin, hair or clothes. To this aim, we present…
▽ More
Neural reconstruction and rendering strategies have demonstrated state-of-the-art performances due, in part, to their ability to preserve high level shape details. Existing approaches, however, either represent objects as implicit surface functions or neural volumes and still struggle to recover shapes with heterogeneous materials, in particular human skin, hair or clothes. To this aim, we present a new hybrid implicit surface representation to model human shapes. This representation is composed of two surface layers that represent opaque and translucent regions on the clothed human body. We segment different regions automatically using visual cues and learn to reconstruct two signed distance functions (SDFs). We perform surface-based rendering on opaque regions (e.g., body, face, clothes) to preserve high-fidelity surface normals and volume rendering on translucent regions (e.g., hair). Experiments demonstrate that our approach obtains state-of-the-art results on 3D human reconstructions, and also shows competitive performances on other objects.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Coexistence of Dirac fermion and charge density wave in square-net-based semimetal LaAuSb2
Authors:
Xueliang Wu,
Zhixiang Hu,
David Graf,
Yu Liu,
Chaoyue Deng,
Huixia Fu,
Asish K. Kundu,
Tonica Valla,
Cedomir Petrovic,
Aifeng Wang
Abstract:
We report a comprehensive study of magnetotransport properties, angle-resolved photoemission spectroscopy (ARPES), and density functional theory (DFT) calculations on self-flux grown LaAuSb$_2$ single crystals. Resistivity and Hall measurements reveal a charge density wave (CDW) transition at 77 K. MR and de Haas-Van Alphen (dHvA) measurements indicate that the transport properties of LaAuSb$_2$ a…
▽ More
We report a comprehensive study of magnetotransport properties, angle-resolved photoemission spectroscopy (ARPES), and density functional theory (DFT) calculations on self-flux grown LaAuSb$_2$ single crystals. Resistivity and Hall measurements reveal a charge density wave (CDW) transition at 77 K. MR and de Haas-Van Alphen (dHvA) measurements indicate that the transport properties of LaAuSb$_2$ are dominated by Dirac fermions that arise from Sb square nets. ARPES measurements and DFT calculations reveal an electronic structure with a common feature of the square-net-based topological semimetals, which is in good agreement with the magnetotransport properties. Our results indicate the coexistence of CDW and Dirac fermion in LaAuSb$_2$, both of which are linked to the bands arising from the Sb-square net, suggesting that the square net could serve as a structural motif to explore various electronic orders.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.