-
Origin of anomalous magnetotransport in kagome superconductors AV$_{3}$Sb$_{5}$ (A=K,Rb,Cs)
Authors:
A. E. Koshelev,
R. Chapai,
D. Y. Chung,
J. F. Mitchell,
U. Welp
Abstract:
Multiple anomalous features in electronic spectra of metals with kagome lattice structure -- van Hove singularities, Dirac points, and flat bands -- imply that materials containing this structural motif may lie at a nexus of topological and correlated electron physics. Due to the prospects of such exceptional electronic behavior, the recent discovery of superconductivity coexisting with charge-density wave (CDW) order in the layered kagome metals AV$_{3}$Sb$_{5}$ (A=K,Rb,Cs) has attracted considerable attention. Notably, these kagome metals express unconventional magnetotransport behavior, including a linear-in-H diagonal resistivity at low fields, and an even more peculiar, nonmonotonic sign-changing behavior of the Hall resistivity, which has been speculated to arise from a chiral CDW. We argue here that this unusual magnetotransport derives not from such unconventional phenomena, but rather from the unique fermiology of the AV$_{3}$Sb$_{5}$ materials. Specifically, it is caused by a large, concave hexagonal Fermi surface sheet formed in the close proximity to the van Hove singularities, which is backfolded into a small hexagonal sheet and two large triangular sheets in the CDW state. We introduce a model of the electronic structure of these Fermi surface sheets that allows for a full analytical treatment within Boltzmann kinetic theory and that enables semi-quantitative fits of our transport data. Specifically, we find that the anomalous magnetotransport behavior is caused by the confluence of strong reduction of the Fermi velocity near the van Hove singularities located near the vertices of the hexagonal sheet and sharp corners in Fermi surface generated by the CDW reconstruction.
Submitted 3 July, 2024;
originally announced July 2024.
-
From Majority to Minority: A Diffusion-based Augmentation for Underrepresented Groups in Skin Lesion Analysis
Authors:
Janet Wang,
Yunsung Chung,
Zhengming Ding,
Jihun Hamm
Abstract:
AI-based diagnoses have demonstrated dermatologist-level performance in classifying skin cancer. However, such systems are prone to under-performing when tested on data from minority groups that lack sufficient representation in the training sets. Although data collection and annotation offer the best means for promoting minority groups, these processes are costly and time-consuming. Prior works have suggested that data from majority groups may serve as a valuable information source to supplement the training of diagnosis tools for minority groups. In this work, we propose an effective diffusion-based augmentation framework that maximizes the use of rich information from majority groups to benefit minority groups. Using groups with different skin types as a case study, our results show that the proposed framework can generate synthetic images that improve diagnostic results for the minority groups, even when there is little or no reference data from these target groups. The practical value of our work is evident in medical imaging analysis, where under-diagnosis persists as a problem for certain groups due to insufficient representation.
Submitted 26 June, 2024;
originally announced June 2024.
-
What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions
Authors:
Sang Keun Choe,
Hwijeen Ahn,
Juhan Bae,
Kewen Zhao,
Minsoo Kang,
Youngseog Chung,
Adithya Pratapa,
Willie Neiswanger,
Emma Strubell,
Teruko Mitamura,
Jeff Schneider,
Eduard Hovy,
Roger Grosse,
Eric Xing
Abstract:
Large language models (LLMs) are trained on a vast amount of human-written data, but data providers often remain uncredited. In response to this issue, data valuation (or data attribution), which quantifies the contribution or value of each data to the model output, has been discussed as a potential solution. Nevertheless, applying existing data valuation methods to recent LLMs and their vast training datasets has been largely limited by prohibitive compute and memory costs. In this work, we focus on influence functions, a popular gradient-based data valuation method, and significantly improve its scalability with an efficient gradient projection strategy called LoGra that leverages the gradient structure in backpropagation. We then provide a theoretical motivation of gradient projection approaches to influence functions to promote trust in the data valuation process. Lastly, we lower the barrier to implementing data valuation systems by introducing LogIX, a software package that can transform existing training code into data valuation code with minimal effort. In our data valuation experiments, LoGra achieves competitive accuracy against more expensive baselines while showing up to 6,500x improvement in throughput and 5x reduction in GPU memory usage when applied to Llama3-8B-Instruct and the 1B-token dataset.
Submitted 22 May, 2024;
originally announced May 2024.
-
QComp: A QSAR-Based Data Completion Framework for Drug Discovery
Authors:
Bingjia Yang,
Yunsie Chung,
Archer Y. Yang,
Bo Yuan,
Xiang Yu
Abstract:
In drug discovery, in vitro and in vivo experiments reveal biochemical activities related to the efficacy and toxicity of compounds. The experimental data accumulate into massive, ever-evolving, and sparse datasets. Quantitative Structure-Activity Relationship (QSAR) models, which predict biochemical activities using only the structural information of compounds, face challenges in integrating the evolving experimental data as studies progress. We develop QSAR-Complete (QComp), a data completion framework to address this issue. Based on pre-existing QSAR models, QComp utilizes the correlation inherent in experimental data to enhance prediction accuracy across various tasks. Moreover, QComp emerges as a promising tool for guiding the optimal sequence of experiments by quantifying the reduction in statistical uncertainty for specific endpoints, thereby aiding in rational decision-making throughout the drug discovery process.
Submitted 19 May, 2024;
originally announced May 2024.
-
Machine Learning Driven Biomarker Selection for Medical Diagnosis
Authors:
Divyagna Bavikadi,
Ayushi Agarwal,
Shashank Ganta,
Yunro Chung,
Lusheng Song,
Ji Qiu,
Paulo Shakarian
Abstract:
Recent advances in experimental methods have enabled researchers to collect data on thousands of analytes simultaneously. This has led to correlational studies that associated molecular measurements with diseases such as Alzheimer's, liver cancer, and gastric cancer. However, the use of thousands of biomarkers selected from the analytes is not practical for real-world medical diagnosis and is likely undesirable due to potentially formed spurious correlations. In this study, we evaluate 4 different methods for biomarker selection and 4 different machine learning (ML) classifiers for identifying correlations, evaluating 16 approaches in all. We found that contemporary methods outperform previously reported logistic regression in cases where 3 and 10 biomarkers are permitted. When specificity is fixed at 0.9, ML approaches produced a sensitivity of 0.240 (3 biomarkers) and 0.520 (10 biomarkers), while standard logistic regression provided a sensitivity of 0.000 (3 biomarkers) and 0.040 (10 biomarkers). We also noted that causal-based methods for biomarker selection proved to be the most performant when fewer biomarkers were permitted, while univariate feature selection was the most performant when a greater number of biomarkers were permitted.
Submitted 15 May, 2024;
originally announced May 2024.
-
One vs. Many: Comprehending Accurate Information from Multiple Erroneous and Inconsistent AI Generations
Authors:
Yoonjoo Lee,
Kihoon Son,
Tae Soo Kim,
Jisu Kim,
John Joon Young Chung,
Eytan Adar,
Juho Kim
Abstract:
As Large Language Models (LLMs) are nondeterministic, the same input can generate different outputs, some of which may be incorrect or hallucinated. If run again, the LLM may correct itself and produce the correct answer. Unfortunately, most LLM-powered systems resort to single results which, correct or not, users accept. Having the LLM produce multiple outputs may help identify disagreements or alternatives. However, it is not obvious how the user will interpret conflicts or inconsistencies. To this end, we investigate how users perceive the AI model and comprehend the generated information when they receive multiple, potentially inconsistent, outputs. Through a preliminary study, we identified five types of output inconsistencies. Based on these categories, we conducted a study (N=252) in which participants were given one or more LLM-generated passages to an information-seeking question. We found that inconsistency within multiple LLM-generated outputs lowered the participants' perceived AI capacity, while also increasing their comprehension of the given information. Specifically, we observed that this positive effect of inconsistencies was most significant for participants who read two passages, compared to those who read three. Based on these findings, we present design implications that, instead of regarding LLM output inconsistencies as a drawback, we can reveal the potential inconsistencies to transparently indicate the limitations of these models and promote critical LLM usage.
Submitted 9 May, 2024;
originally announced May 2024.
-
Noise Correlations in a 1D Silicon Spin Qubit Array
Authors:
M. B. Donnelly,
J. Rowlands,
L. Kranz,
Y. L. Hsueh,
Y. Chung,
A. V. Timofeev,
H. Geng,
P. Singh-Gregory,
S. K. Gorman,
J. G. Keizer,
R. Rahman,
M. Y. Simmons
Abstract:
Correlated noise across multi-qubit architectures is known to be highly detrimental to the operation of error correcting codes and the long-term feasibility of quantum processors. The recent discovery of spatially dependent correlated noise in multi-qubit architectures of superconducting qubits arising from the impact of cosmic radiation and high-energy particles giving rise to quasiparticle poisoning within the substrate has led to intense investigations of mitigation strategies to address this. In contrast, correlated noise in semiconductor spin qubits as a function of distance has not been reported to date. Here we report the magnitude, frequency and spatial dependence of noise correlations between four silicon quantum dot pairs as a function of inter-dot distance at frequencies from 0.3 mHz to 1 mHz. We find the magnitude of charge noise correlations, quantified by the magnitude-squared coherence $C_{xy}$, is significantly suppressed from $>0.5$ to $<0.1$ as the inter-dot distance increases from 75 nm to 300 nm. Using an analytical model we confirm that, in contrast to superconducting qubits, the dominant source of correlated noise arises from low frequency charge noise from the presence of two level fluctuators (TLFs) at the native silicon-silicon dioxide surface. Knowing this, we conclude with an important and timely discussion of charge noise mitigation strategies.
Submitted 6 May, 2024;
originally announced May 2024.
-
Full Shot Predictions for the DIII-D Tokamak via Deep Recurrent Networks
Authors:
Ian Char,
Youngseog Chung,
Joseph Abbate,
Egemen Kolemen,
Jeff Schneider
Abstract:
Although tokamaks are one of the most promising devices for realizing nuclear fusion as an energy source, there are still key obstacles when it comes to understanding the dynamics of the plasma and controlling it. As such, it is crucial that high quality models are developed to assist in overcoming these obstacles. In this work, we take an entirely data driven approach to learn such a model. In particular, we use historical data from the DIII-D tokamak to train a deep recurrent network that is able to predict the full time evolution of plasma discharges (or "shots"). Following this, we investigate how different training and inference procedures affect the quality and calibration of the shot predictions.
Submitted 17 April, 2024;
originally announced April 2024.
-
Grover's algorithm in a four-qubit silicon processor above the fault-tolerant threshold
Authors:
Ian Thorvaldson,
Dean Poulos,
Christian M. Moehle,
Saiful H. Misha,
Hermann Edlbauer,
Jonathan Reiner,
Helen Geng,
Benoit Voisin,
Michael T. Jones,
Matthew B. Donnelly,
Luis F. Pena,
Charles D. Hill,
Casey R. Myers,
Joris G. Keizer,
Yousun Chung,
Samuel K. Gorman,
Ludwik Kranz,
Michelle Y. Simmons
Abstract:
Spin qubits in silicon are strong contenders for realizing a practical quantum computer. This technology has made remarkable progress with the demonstration of single and two-qubit gates above the fault-tolerant threshold and entanglement of up to three qubits. However, maintaining high fidelity operations while executing multi-qubit algorithms has remained elusive, only being achieved for two spin qubits to date due to the small qubit size, which makes it difficult to control qubits without creating crosstalk errors. Here, we use a four-qubit silicon processor with every operation above the fault tolerant limit and demonstrate Grover's algorithm with a ~95% probability of finding the marked state, one of the most successful implementations to date. Our four-qubit processor is made of three phosphorus atoms and one electron spin precision-patterned into 1.5 nm${}^2$ isotopically pure silicon. The strong resulting confinement potential, without additional confinement gates that can increase cross-talk, leverages the benefits of having both electron and phosphorus nuclear spins. Significantly, the all-to-all connectivity of the nuclear spins provided by the hyperfine interaction not only allows for efficient multi-qubit operations, but also provides individual qubit addressability. Together with the long coherence times of the nuclear and electron spins, this results in all four single qubit fidelities above 99.9% and controlled-Z gates between all pairs of nuclear spins above 99% fidelity. The high control fidelities, combined with >99% fidelity readout of all nuclear spins, allows for the creation of a three-qubit Greenberger-Horne-Zeilinger (GHZ) state with 96.2% fidelity, the highest reported for semiconductor spin qubits so far. Such nuclear spin registers can be coupled via electron exchange, establishing a path for larger scale fault-tolerant quantum processors.
Submitted 12 April, 2024;
originally announced April 2024.
-
Origin of pinning disorder in magnetic-field-induced Wigner solids
Authors:
Matthew L. Freeman,
P. T. Madathil,
L. N. Pfeiffer,
K. W. Baldwin,
Y. J. Chung,
R. Winkler,
M. Shayegan,
L. W. Engel
Abstract:
At low Landau level filling factors ($ν$), Wigner solid phases of two-dimensional electron systems in GaAs are pinned by disorder, and exhibit a pinning mode, whose frequency is a measure of the disorder that pins the Wigner solid. Despite numerous studies spanning the last three decades, the origin of the disorder that causes the pinning and determines the pinning mode frequency remains unknown. Here we present a study of the pinning mode resonance in the low-$ν$ Wigner solid phases of a series of ultralow-disorder GaAs quantum wells which are similar except for their varying well widths, $d$. The pinning mode frequencies, $f_p$, decrease strongly as $d$ increases, with the widest well exhibiting $f_p$ as low as $\simeq$35 MHz. The amount of reduction of $f_p$ with increasing $d$ can be explained remarkably well by tails of the wave function impinging into the alloy-disordered Al$_x$Ga$_{1-x}$As barriers that contain the electrons. However, it is imperative that the model for the confinement and wave function includes the Coulomb repulsion in the growth direction between the electrons as they occupy the quantum well.
Submitted 10 April, 2024;
originally announced April 2024.
-
NLP for Counterspeech against Hate: A Survey and How-To Guide
Authors:
Helena Bonaldi,
Yi-Ling Chung,
Gavin Abercrombie,
Marco Guerini
Abstract:
In recent years, counterspeech has emerged as one of the most promising strategies to fight online hate. These non-escalatory responses tackle online abuse while preserving the freedom of speech of the users, and can have a tangible impact in reducing online and offline violence. Recently, there has been growing interest from the Natural Language Processing (NLP) community in addressing the challenges of analysing, collecting, classifying, and automatically generating counterspeech, to reduce the huge burden of manually producing it. In particular, researchers have taken different directions in addressing these challenges, thus providing a variety of related tasks and resources. In this paper, we provide a guide for doing research on counterspeech, by describing - with detailed examples - the steps to undertake, and providing best practices that can be learnt from the NLP studies on this topic. Finally, we discuss open challenges and future directions of counterspeech research in NLP.
Submitted 29 March, 2024;
originally announced March 2024.
-
A Design Space for Intelligent and Interactive Writing Assistants
Authors:
Mina Lee,
Katy Ilonka Gero,
John Joon Young Chung,
Simon Buckingham Shum,
Vipul Raheja,
Hua Shen,
Subhashini Venugopalan,
Thiemo Wambsganss,
David Zhou,
Emad A. Alghamdi,
Tal August,
Avinash Bhat,
Madiha Zahrah Choksi,
Senjuti Dutta,
** L. C. Guo,
Md Naimul Hoque,
Yewon Kim,
Simon Knight,
Seyed Parsa Neshaei,
Agnia Sergeyuk,
Antonette Shibani,
Disha Shrivastava,
Lila Shroff,
Jessi Stark,
Sarah Sterman
, et al. (11 additional authors not shown)
Abstract:
In our era of rapid technological advancement, the research landscape for writing assistants has become increasingly fragmented across various research communities. We seek to address this challenge by proposing a design space as a structured way to examine and explore the multidimensional space of intelligent and interactive writing assistants. Through a large community collaboration, we explore five aspects of writing assistants: task, user, technology, interaction, and ecosystem. Within each aspect, we define dimensions (i.e., fundamental components of an aspect) and codes (i.e., potential options for each dimension) by systematically reviewing 115 papers. Our design space aims to offer researchers and designers a practical tool to navigate, comprehend, and compare the various possibilities of writing assistants, and aid in the envisioning and design of new writing assistants.
Submitted 26 March, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Basque and Spanish Counter Narrative Generation: Data Creation and Evaluation
Authors:
Jaione Bengoetxea,
Yi-Ling Chung,
Marco Guerini,
Rodrigo Agerri
Abstract:
Counter Narratives (CNs) are non-negative textual responses to Hate Speech (HS) aiming at defusing online hatred and mitigating its spreading across media. Despite the recent increase in HS content posted online, research on automatic CN generation has been relatively scarce and predominantly focused on English. In this paper, we present CONAN-EUS, a new Basque and Spanish dataset for CN generation developed by means of Machine Translation (MT) and professional post-editing. Being a parallel corpus, also with respect to the original English CONAN, it enables novel research on multilingual and crosslingual automatic generation of CNs. Our experiments on CN generation with mT5, a multilingual encoder-decoder model, show that generation greatly benefits from training on post-edited data, as opposed to relying on silver MT data only. These results are confirmed by their correlation with a qualitative manual evaluation, demonstrating that manually revised training data remains crucial for the quality of the generated CNs. Furthermore, multilingual data augmentation improves results over monolingual settings for structurally similar languages such as English and Spanish, while being detrimental for Basque, a language isolate. Similar findings occur in zero-shot crosslingual evaluations, where model transfer (fine-tuning in English and generating in a different target language) outperforms fine-tuning mT5 on machine translated data for Spanish but not for Basque. This provides an interesting insight into the asymmetry in the multilinguality of generative models, a challenging topic which is still open to research.
Submitted 14 March, 2024;
originally announced March 2024.
-
Signatures of correlated defects in an ultra-clean Wigner crystal in the extreme quantum limit
Authors:
P. T. Madathil,
C. Wang,
S. K. Singh,
A. Gupta,
K. A. Villegas Rosales,
Y. J. Chung,
K. W. West,
K. W. Baldwin,
L. N. Pfeiffer,
L. W. Engel,
M. Shayegan
Abstract:
Low-disorder two-dimensional electron systems in the presence of a strong, perpendicular magnetic field terminate at very small Landau level filling factors in a Wigner crystal (WC), where the electrons form an ordered array to minimize the Coulomb repulsion. The nature of this exotic, many-body, quantum phase is yet to be fully understood and experimentally revealed. Here we probe one of WC's most fundamental parameters, namely the energy gap that determines its low-temperature conductivity, in record-mobility, ultra-high-purity, two-dimensional electrons confined to GaAs quantum wells. The WC domains in these samples contain $\simeq$ 1000 electrons. The measured gaps are a factor of three larger than previously reported for lower quality samples, and agree remarkably well with values predicted for the lowest-energy, intrinsic, hyper-correlated bubble defects in a WC made of flux-electron composite fermions, rather than bare electrons. The agreement is particularly noteworthy, given that the calculations are done for disorder-free composite fermion WCs, and there are no adjustable parameters. The results reflect the exceptionally high quality of the samples, and suggest that composite fermion WCs are indeed more stable compared to their electron counterparts.
Submitted 12 March, 2024;
originally announced March 2024.
-
Accurate Spatial Gene Expression Prediction by integrating Multi-resolution features
Authors:
Youngmin Chung,
Ji Hun Ha,
Kyeong Chan Im,
Joo Sang Lee
Abstract:
Recent advancements in Spatial Transcriptomics (ST) technology have facilitated detailed gene expression analysis within tissue contexts. However, the high costs and methodological limitations of ST necessitate a more robust predictive model. In response, this paper introduces TRIPLEX, a novel deep learning framework designed to predict spatial gene expression from Whole Slide Images (WSIs). TRIPLEX uniquely harnesses multi-resolution features, capturing cellular morphology at individual spots, the local context around these spots, and the global tissue organization. By integrating these features through an effective fusion strategy, TRIPLEX achieves accurate gene expression prediction. Our comprehensive benchmark study, conducted on three public ST datasets and supplemented with Visium data from 10X Genomics, demonstrates that TRIPLEX outperforms current state-of-the-art models in Mean Squared Error (MSE), Mean Absolute Error (MAE), and Pearson Correlation Coefficient (PCC). The model's predictions align closely with ground truth gene expression profiles and tumor annotations, underscoring TRIPLEX's potential in advancing cancer diagnosis and treatment.
Submitted 25 April, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
Authors' Values and Attitudes Towards AI-bridged Scalable Personalization of Creative Language Arts
Authors:
Taewook Kim,
Hyomin Han,
Eytan Adar,
Matthew Kay,
John Joon Young Chung
Abstract:
Generative AI has the potential to create a new form of interactive media: AI-bridged creative language arts (CLA), which bridge the author and audience by personalizing the author's vision to the audience's context and taste at scale. However, it is unclear what the authors' values and attitudes would be regarding AI-bridged CLA. To identify these values and attitudes, we conducted an interview study with 18 authors across eight genres (e.g., poetry, comics) by presenting speculative but realistic AI-bridged CLA scenarios. We identified three benefits derived from the dynamics between author, artifact, and audience: those that 1) authors get from the process, 2) audiences get from the artifact, and 3) authors get from the audience. We found how AI-bridged CLA would either promote or reduce these benefits, along with authors' concerns. We hope our investigation hints at how AI can provide intriguing experiences to CLA audiences while promoting authors' values.
Submitted 1 March, 2024;
originally announced March 2024.
-
Anomalous acousto-current within the quantum Hall plateaus
Authors:
Renfei Wang,
Xiao Liu,
Mengmeng Wu,
Yoon Jang Chung,
Adbhut Gupta,
Kirk W. Baldwin,
Mansour Shayegan,
Loren Pfeiffer,
Xi Lin,
Yang Liu
Abstract:
We systematically study the acousto-current of two-dimensional electron systems in the integer and fractional quantum Hall regimes using surface acoustic waves. We are able to separate the co-existing acoustic scattering and drag, when phonons induce drag current and tune the electron conductivity, respectively. At large acoustic power, the drag current is finite when the system is compressible and exhibits minima when incompressible quantum Hall effects appear. Surprisingly, it exhibits anomalously large bipolar spikes within the quantum Hall plateaus while it vanishes linearly with reduced acoustic power at compressible phases. The current peaks reverse their polarity at the two flanks of exact integer or fractional fillings, consistent with the opposite electric charge of the quasiparticle/quasihole.
Submitted 22 March, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
HEAL: Brain-inspired Hyperdimensional Efficient Active Learning
Authors:
Yang Ni,
Zhuowen Zou,
Wenjun Huang,
Hanning Chen,
William Youngwoo Chung,
Samuel Cho,
Ranganath Krishnan,
Pietro Mercati,
Mohsen Imani
Abstract:
Drawing inspiration from the outstanding learning capability of our human brains, Hyperdimensional Computing (HDC) emerges as a novel computing paradigm, and it leverages high-dimensional vector presentation and operations for brain-like lightweight Machine Learning (ML). Practical deployments of HDC have significantly enhanced the learning efficiency compared to current deep ML methods on a broad spectrum of applications. However, boosting the data efficiency of HDC classifiers in supervised learning remains an open question. In this paper, we introduce Hyperdimensional Efficient Active Learning (HEAL), a novel Active Learning (AL) framework tailored for HDC classification. HEAL proactively annotates unlabeled data points via uncertainty and diversity-guided acquisition, leading to a more efficient dataset annotation and lowering labor costs. Unlike conventional AL methods that only support classifiers built upon deep neural networks (DNN), HEAL operates without the need for gradient or probabilistic computations. This allows it to be effortlessly integrated with any existing HDC classifier architecture. The key design of HEAL is a novel approach for uncertainty estimation in HDC classifiers through a lightweight HDC ensemble with prior hypervectors. Additionally, by exploiting hypervectors as prototypes (i.e., compact representations), we develop an extra metric for HEAL to select diverse samples within each batch for annotation. Our evaluation shows that HEAL surpasses a diverse set of baselines in AL quality and achieves notably faster acquisition than many BNN-powered or diversity-guided AL methods, recording 11 times to 40,000 times speedup in acquisition runtime per batch.
Submitted 17 February, 2024;
originally announced February 2024.
-
Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing
Authors:
Jacob Tyo,
Motolani Olarinre,
Youngseog Chung,
Zachary C. Lipton
Abstract:
Despite significant progress in optical character recognition (OCR) and computer vision systems, robustly recognizing text and identifying people in images taken in unconstrained \emph{in-the-wild} environments remain an ongoing challenge. However, such obstacles must be overcome in practical applications of vision systems, such as identifying racers in photos taken during off-road racing events. To this end, we introduce two new challenging real-world datasets - the off-road motorcycle Racer Number Dataset (RND) and the Muddy Racer re-iDentification Dataset (MUDD) - to highlight the shortcomings of current methods and drive advances in OCR and person re-identification (ReID) under extreme conditions. These two datasets feature over 6,300 images taken during off-road competitions which exhibit a variety of factors that undermine even modern vision systems, namely mud, complex poses, and motion blur. We establish benchmark performance on both datasets using state-of-the-art models. Off-the-shelf models transfer poorly, reaching only 15% end-to-end (E2E) F1 score on text spotting, and 33% rank-1 accuracy on ReID. Fine-tuning yields major improvements, bringing model performance to 53% F1 score for E2E text spotting and 79% rank-1 accuracy on ReID, but still falls short of good performance. Our analysis exposes open problems in real-world OCR and ReID that necessitate domain-targeted techniques. With these datasets and analysis of model limitations, we aim to foster innovations in handling real-world conditions like mud and complex poses to drive progress in robust computer vision. All data was sourced from PerformancePhoto.co, a website used by professional motorsports photographers, racers, and fans. The top-performing text spotting and ReID models are deployed on this platform to power real-time race photo search.
Submitted 12 February, 2024;
originally announced February 2024.
-
Moving crystal phases of a quantum Wigner solid in an ultra-high-quality 2D electron system
Authors:
P. T. Madathil,
K. A. Villegas Rosales,
Y. J. Chung,
K. W. West,
K. W. Baldwin,
L. N. Pfeiffer,
L. W. Engel,
M. Shayegan
Abstract:
In low-disorder, two-dimensional electron systems (2DESs), the fractional quantum Hall states at very small Landau level fillings ($ν$) terminate in a Wigner solid (WS) phase, where electrons arrange themselves in a periodic array. The WS is typically pinned by the residual disorder sites and manifests an insulating behavior, with non-linear current-voltage (\textit{I-V}) and noise characteristics. We report here measurements on an ultra-low-disorder, dilute 2DES, confined to a GaAs quantum well. In the $ν< 1/5$ range, superimposed on a highly-insulating longitudinal resistance, the 2DES exhibits a developing fractional quantum Hall state at $ν=1/7$, attesting to its exceptional high quality, and dominance of electron-electron interaction in the low filling regime. In the nearby insulating phases, we observe remarkable non-linear \textit{I-V} and noise characteristics as a function of increasing current, with current thresholds delineating three distinct phases of the WS: a pinned phase (P1) with very small noise, a second phase (P2) in which $dV/dI$ fluctuates between positive and negative values and is accompanied by very high noise, and a third phase (P3) where $dV/dI$ is nearly constant and small, and noise is about an order of magnitude lower than in P2. In the depinned (P2 and P3) phases, the noise spectrum also reveals well-defined peaks at frequencies that vary linearly with the applied current, suggestive of washboard frequencies. We discuss the data in light of a recent theory that proposes different dynamic phases for a driven WS.
Submitted 24 January, 2024;
originally announced January 2024.
-
Cheap Learning: Maximising Performance of Language Models for Social Data Science Using Minimal Data
Authors:
Leonardo Castro-Gonzalez,
Yi-Ling Chung,
Hannah Rose Kirk,
John Francis,
Angus R. Williams,
Pica Johansson,
Jonathan Bright
Abstract:
The field of machine learning has recently made significant progress in reducing the requirements for labelled training data when building new models. These `cheaper' learning techniques hold significant potential for the social sciences, where development of large labelled training datasets is often a significant practical impediment to the use of machine learning for analytical tasks. In this article we review three `cheap' techniques that have developed in recent years: weak supervision, transfer learning and prompt engineering. For the latter, we also review the particular case of zero-shot prompting of large language models. For each technique we provide a guide of how it works and demonstrate its application across six different realistic social science applications (two different tasks paired with three different dataset makeups). We show good performance for all techniques, and in particular we demonstrate how prompting of large language models can achieve high accuracy at very low cost. Our results are accompanied by a code repository to make it easy for others to duplicate our work and use it in their own research. Overall, our article is intended to stimulate further uptake of these techniques in the social sciences.
Submitted 22 January, 2024;
originally announced January 2024.
-
T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis
Authors:
Yoonjin Chung,
Junwon Lee,
Juhan Nam
Abstract:
Foley sound, audio content inserted synchronously with videos, plays a critical role in the user experience of multimedia content. Recently, there has been active research in Foley sound synthesis, leveraging the advancements in deep generative models. However, such works mainly focus on replicating a single sound class or a textual sound description, neglecting temporal information, which is crucial in the practical applications of Foley sound. We present T-Foley, a Temporal-event-guided waveform generation model for Foley sound synthesis. T-Foley generates high-quality audio using two conditions: the sound class and temporal event feature. For temporal conditioning, we devise a temporal event feature and a novel conditioning technique named Block-FiLM. T-Foley achieves superior performance in both objective and subjective evaluation metrics and generates Foley sound well-synchronized with the temporal events. Additionally, we showcase T-Foley's practical applications, particularly in scenarios involving vocal mimicry for temporal event control. We show the demo on our companion website.
Submitted 17 January, 2024;
originally announced January 2024.
-
E2HQV: High-Quality Video Generation from Event Camera via Theory-Inspired Model-Aided Deep Learning
Authors:
Qiang Qu,
Yiran Shen,
Xiaoming Chen,
Yuk Ying Chung,
Tongliang Liu
Abstract:
The bio-inspired event cameras or dynamic vision sensors are capable of asynchronously capturing per-pixel brightness changes (called event-streams) in high temporal resolution and high dynamic range. However, the non-structural spatial-temporal event-streams make it challenging for providing intuitive visualization with rich semantic information for human vision. It calls for events-to-video (E2V) solutions which take event-streams as input and generate high quality video frames for intuitive visualization. However, current solutions are predominantly data-driven without considering the prior knowledge of the underlying statistics relating event-streams and video frames. It highly relies on the non-linearity and generalization capability of the deep neural networks, thus, is struggling on reconstructing detailed textures when the scenes are complex. In this work, we propose \textbf{E2HQV}, a novel E2V paradigm designed to produce high-quality video frames from events. This approach leverages a model-aided deep learning framework, underpinned by a theory-inspired E2V model, which is meticulously derived from the fundamental imaging principles of event cameras. To deal with the issue of state-reset in the recurrent components of E2HQV, we also design a temporal shift embedding module to further improve the quality of the video frames. Comprehensive evaluations on the real world event camera datasets validate our approach, with E2HQV, notably outperforming state-of-the-art approaches, e.g., surpassing the second best by over 40\% for some evaluation metrics.
Submitted 16 January, 2024;
originally announced January 2024.
-
Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution
Authors:
Zeke Zexi Hu,
Xiaoming Chen,
Vera Yuk Ying Chung,
Yiran Shen
Abstract:
The effective extraction of spatial-angular features plays a crucial role in light field image super-resolution (LFSR) tasks, and the introduction of convolution and Transformers leads to significant improvement in this area. Nevertheless, due to the large 4D data volume of light field images, many existing methods opted to decompose the data into a number of lower-dimensional subspaces and perform Transformers in each sub-space individually. As a side effect, these methods inadvertently restrict the self-attention mechanisms to a One-to-One scheme accessing only a limited subset of LF data, explicitly preventing comprehensive optimization on all spatial and angular cues. In this paper, we identify this limitation as subspace isolation and introduce a novel Many-to-Many Transformer (M2MT) to address it. M2MT aggregates angular information in the spatial subspace before performing the self-attention mechanism. It enables complete access to all information across all sub-aperture images (SAIs) in a light field image. Consequently, M2MT is enabled to comprehensively capture long-range correlation dependencies. With M2MT as the pivotal component, we develop a simple yet effective M2MT network for LFSR. Our experimental results demonstrate that M2MT achieves state-of-the-art performance across various public datasets. We further conduct in-depth analysis using local attribution maps (LAM) to obtain visual interpretability, and the results validate that M2MT is empowered with a truly non-local context in both spatial and angular subspaces to mitigate subspace isolation and acquire effective spatial-angular representation.
Submitted 1 January, 2024;
originally announced January 2024.
-
CreativeConnect: Supporting Reference Recombination for Graphic Design Ideation with Generative AI
Authors:
DaEun Choi,
Sumin Hong,
Jeongeon Park,
John Joon Young Chung,
Juho Kim
Abstract:
Graphic designers often get inspiration through the recombination of references. Our formative study (N=6) reveals that graphic designers focus on conceptual keywords during this process, and want support for discovering the keywords, expanding them, and exploring diverse recombination options of them, while still having room for designers' creativity. We propose CreativeConnect, a system with generative AI pipelines that helps users discover useful elements from the reference image using keywords, recommends relevant keywords, generates diverse recombination options with user-selected keywords, and shows recombinations as sketches with text descriptions. Our user study (N=16) showed that CreativeConnect helped users discover keywords from the reference and generate multiple ideas based on them, ultimately helping users produce more design ideas with higher self-reported creativity compared to the baseline system without generative pipelines. While CreativeConnect was shown effective in ideation, we discussed how CreativeConnect can be extended to support other types of tasks in creativity support.
Submitted 6 March, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Regional Correlation Aided Mobile Traffic Prediction with Spatiotemporal Deep Learning
Authors:
JeongJun Park,
Lusungu J. Mwasinga,
Huigyu Yang,
Syed M. Raza,
Duc-Tai Le,
Moonseong Kim,
Min Young Chung,
Hyunseung Choo
Abstract:
Mobile traffic data in urban regions shows differentiated patterns during different hours of the day. The exploitation of these patterns enables highly accurate mobile traffic prediction for proactive network management. However, recent Deep Learning (DL) driven studies have only exploited spatiotemporal features and have ignored the geographical correlations, causing high complexity and erroneous mobile traffic predictions. This paper addresses these limitations by proposing an enhanced mobile traffic prediction scheme that combines the clustering strategy of daily mobile traffic peak time and novel multi Temporal Convolutional Network with a Long Short Term Memory (multi TCN-LSTM) model. The mobile network cells that exhibit peak traffic during the same hour of the day are clustered together. Our experiments on large-scale real-world mobile traffic data show up to 28% performance improvement compared to state-of-the-art studies, which confirms the efficacy and viability of the proposed approach.
Submitted 11 December, 2023;
originally announced December 2023.
-
Extended Kohler's Rule of Magnetoresistance in TaCo$_2$Te$_2$
Authors:
Samuel Pate,
Bowen Chen,
Bing Shen,
Kezhen Li,
Xiuquan Zhou,
Duck Young Chung,
Ralu Divan,
Mercouri G. Kanatzidis,
Ulrich Welp,
Wai-Kwong Kwok,
Zhi-Li Xiao
Abstract:
TaCo$_2$Te$_2$ is recently reported to be an air-stable, high mobility Van der Waals material with probable magnetic order. Here we investigate the scaling behavior of its magnetoresistance. We measured both the longitudinal ($ρ_{xx}$) and Hall ($ρ_{xy}$) magnetoresistivities of TaCo$_2$Te$_2$ crystals in magnetic fields parallel to the c-axis and found that the magnetoresistance violates the Kohler's rule $MR \sim f[H/ρ_0]$ while obeying the extended Kohler's rule $MR \sim f[H/(n_Tρ_0)]$, where $MR \sim [ρ_{xx}(H)-ρ_0]/ρ_0$, $H$ is the magnetic field, $n_T$ is a thermal factor, $ρ_{xx}(H)$ and $ρ_0$ are the resistivities at $H$ and zero field, respectively. While deviating from those of the densities of electrons ($n_e$) and holes ($n_h$) obtained from the two-band model analysis of the magnetoconductivities, the temperature dependence of $n_T$ is close to that of the Hall carrier densities $n_H$ calculated from the slopes of $ρ_{xy}(H)$ curves at low magnetic fields, providing a new way to obtain the thermal factor in the extended Kohler's rule.
Submitted 9 December, 2023;
originally announced December 2023.
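The scaling test described in the abstract above can be sketched numerically. The following is a minimal illustration, not the authors' analysis: it builds synthetic $ρ_{xx}(H)$ curves under the assumption that the scaling function is quadratic, $f(x)=x^2$, and uses made-up values of $ρ_0$ and $n_T$ (arbitrary units). It then checks whether the curves collapse when plotted against the Kohler variable $H/ρ_0$ or against the extended variable $H/(n_Tρ_0)$. The function names are hypothetical.

```python
import numpy as np


def magnetoresistance(rho_xx, rho0):
    """MR = [rho_xx(H) - rho0] / rho0, as defined in the abstract."""
    return (rho_xx - rho0) / rho0


def kohler_variable(H, rho0, n_T=1.0):
    """Scaling variable H / (n_T * rho0); n_T = 1 gives the plain Kohler variable."""
    return H / (n_T * rho0)


# Synthetic data (assumption): three "temperatures", each with its own
# zero-field resistivity rho0 and thermal factor n_T, in arbitrary units.
H = np.linspace(0.0, 9.0, 50)
parameters = [(1.0, 1.0), (2.0, 1.5), (4.0, 2.5)]  # (rho0, n_T), illustrative only

extended_ok = []
plain_ok = []
for rho0, n_T in parameters:
    # Assumed quadratic scaling function f(x) = x**2 generates the curve.
    rho_xx = rho0 * (1.0 + kohler_variable(H, rho0, n_T) ** 2)
    mr = magnetoresistance(rho_xx, rho0)

    # Collapse test: does MR follow the same f(x) for every curve?
    extended_ok.append(np.allclose(mr, kohler_variable(H, rho0, n_T) ** 2))
    plain_ok.append(np.allclose(mr, kohler_variable(H, rho0) ** 2))

extended_collapse = all(extended_ok)  # all curves share one f in the extended variable
plain_collapse = all(plain_ok)        # fails whenever n_T differs from 1
```

With real data the scaling function is unknown, so one would instead overlay MR against each variable for all temperatures and judge the collapse visually or with a spread metric.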
-
Seamless: Multilingual Expressive and Streaming Speech Translation
Authors:
Seamless Communication,
Loïc Barrault,
Yu-An Chung,
Mariano Coria Meglioli,
David Dale,
Ning Dong,
Mark Duppenthaler,
Paul-Ambroise Duquenne,
Brian Ellis,
Hady Elsahar,
Justin Haaheim,
John Hoffman,
Min-Jae Hwang,
Hirofumi Inaguma,
Christopher Klaiber,
Ilia Kulikov,
Pengwei Li,
Daniel Licht,
Jean Maillard,
Ruslan Mavlyutov,
Alice Rakotoarison,
Kaushik Ram Sadagopan,
Abinesh Ramakrishnan,
Tuan Tran,
Guillaume Wenzek
, et al. (40 additional authors not shown)
Abstract:
Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4T model-SeamlessM4T v2. This newer model, incorporating an updated UnitY2 framework, was trained on more low-resource language data. SeamlessM4T v2 provides the foundation on which our next two models are initiated. SeamlessExpressive enables translation that preserves vocal styles and prosody. Compared to previous efforts in expressive speech research, our work addresses certain underexplored aspects of prosody, such as speech rate and pauses, while also preserving the style of one's voice. As for SeamlessStreaming, our model leverages the Efficient Monotonic Multihead Attention mechanism to generate low-latency target translations without waiting for complete source utterances. As the first of its kind, SeamlessStreaming enables simultaneous speech-to-speech/text translation for multiple source and target languages. To ensure that our models can be used safely and responsibly, we implemented the first known red-teaming effort for multimodal machine translation, a system for the detection and mitigation of added toxicity, a systematic evaluation of gender bias, and an inaudible localized watermarking mechanism designed to dampen the impact of deepfakes. Consequently, we bring major components from SeamlessExpressive and SeamlessStreaming together to form Seamless, the first publicly available system that unlocks expressive cross-lingual communication in real-time. The contributions to this work are publicly released and accessible at https://github.com/facebookresearch/seamless_communication
Submitted 8 December, 2023;
originally announced December 2023.
-
Dynamical origin of Type-I Seesaw with large mixing
Authors:
Yi Chung
Abstract:
We investigate Type-I Seesaw models where the right-handed neutrino masses are dynamically generated by strong interactions. Using horizontal gauge symmetry as the source of strong dynamics, a nontrivial flavor structure can also be introduced dynamically. We find that the right-handed neutrino mass matrix with a strongly anti-diagonal structure emerges when the three right-handed neutrinos are in the triplet representation of $SU(2)_H$ horizontal gauge symmetry. With an assumption of a Dirac neutrino mass matrix with hierarchical eigenvalues and small mixing angles, analogous to the up-type quark sector, and certain substructures, the resulting light neutrino mass matrix from the type-I seesaw mechanism can accommodate the large mixing and weak hierarchy observed in low-energy neutrino data. The neutrino puzzles can, therefore, be understood as the consequence of strong horizontal gauge interactions. We also discuss the potential UV completion and the phenomenology that could be tested in the future.
Submitted 18 April, 2024; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Third-generation-philic Hidden Naturalness
Authors:
Yi Chung,
Florian Goertz
Abstract:
We present a solution to the electroweak hierarchy problem, where the relevant new particles are third-generation-philic and hidden in SM processes with third-generation fermions. Due to this feature, the mass bounds from direct searches are much weaker and the required fine-tuning can be reduced drastically. A concrete model is constructed based on a $SU(6)/Sp(6)$ fundamental composite Higgs model with collective symmetry breaking and extended hypercolor mechanism. The construction allows to raise the scale $f$ to $\sim 3\,$TeV, corresponding to resonances at $M_ρ\gtrsim 10$ TeV, without much tuning - employing ingredients that are naturally inherent in the (composite) Goldstone-Higgs framework. The experimental signatures are discussed in detail. It is found that current bounds allow for a model with negligible tuning.
Submitted 28 November, 2023;
originally announced November 2023.
-
Fractional Quantum Hall State at Filling Factor $ν=1/4$ in Ultra-High-Quality GaAs 2D Hole Systems
Authors:
Chengyu Wang,
A. Gupta,
S. K. Singh,
P. T. Madathil,
Y. J. Chung,
L. N. Pfeiffer,
K. W. Baldwin,
R. Winkler,
M. Shayegan
Abstract:
Single-component fractional quantum Hall states (FQHSs) at even-denominator filling factors may host non-Abelian quasiparticles that are considered to be building blocks of topological quantum computers. Such states, however, are rarely observed in the lowest-energy Landau level, namely at filling factors $ν<1$. Here we report evidence for an even-denominator FQHS at $ν=1/4$ in ultra-high-quality two-dimensional hole systems confined to modulation-doped GaAs quantum wells. We observe a deep minimum in the longitudinal resistance at $ν=1/4$, superimposed on a highly insulating background, suggesting a close competition between the $ν=1/4$ FQHS and the magnetic-field-induced, pinned Wigner solid states. Our experimental observations are consistent with the very recent theoretical calculations which predict that substantial Landau level mixing, caused by the large hole effective mass, can induce composite fermion pairing and lead to a non-Abelian FQHS at $ν=1/4$. Our results demonstrate that Landau level mixing can provide a very potent means for tuning the interaction between composite fermions and creating new non-Abelian FQHSs.
Submitted 22 November, 2023;
originally announced November 2023.
-
Reading Between the Mud: A Challenging Motorcycle Racer Number Dataset
Authors:
Jacob Tyo,
Youngseog Chung,
Motolani Olarinre,
Zachary C. Lipton
Abstract:
This paper introduces the off-road motorcycle Racer number Dataset (RnD), a new challenging dataset for optical character recognition (OCR) research. RnD contains 2,411 images from professional motorsports photographers that depict motorcycle racers in off-road competitions. The images exhibit a wide variety of factors that make OCR difficult, including mud occlusions, motion blur, non-standard fonts, glare, complex backgrounds, etc. The dataset has 5,578 manually annotated bounding boxes around visible motorcycle numbers, along with transcribed digits and letters. Our experiments benchmark leading OCR algorithms and reveal an end-to-end F1 score of only 0.527 on RnD, even after fine-tuning. Analysis of performance on different occlusion types shows mud as the primary challenge, degrading accuracy substantially compared to normal conditions. But the models struggle with other factors including glare, blur, shadows, and dust. Analysis exposes substantial room for improvement and highlights failure cases of existing models. RnD represents a valuable new benchmark to drive innovation in real-world OCR capabilities. The authors hope the community will build upon this dataset and baseline experiments to make progress on the open problem of robustly recognizing text in unconstrained natural environments. The dataset is available at https://github.com/JacobTyo/SwinTextSpotter.
Submitted 14 November, 2023;
originally announced November 2023.
-
MUDD: A New Re-Identification Dataset with Efficient Annotation for Off-Road Racers in Extreme Conditions
Authors:
Jacob Tyo,
Motolani Olarinre,
Youngseog Chung,
Zachary C. Lipton
Abstract:
Re-identifying individuals in unconstrained environments remains an open challenge in computer vision. We introduce the Muddy Racer re-IDentification Dataset (MUDD), the first large-scale benchmark for matching identities of motorcycle racers during off-road competitions. MUDD exhibits heavy mud occlusion, motion blurring, complex poses, and extreme lighting conditions previously unseen in existing re-id datasets. We present an annotation methodology incorporating auxiliary information that reduced labeling time by over 65%. We establish benchmark performance using state-of-the-art re-id models including OSNet and ResNet-50. Without fine-tuning, the best models achieve only 33% Rank-1 accuracy. Fine-tuning on MUDD boosts results to 79% Rank-1, but significant room for improvement remains. We analyze the impact of real-world factors including mud, pose, lighting, and more. Our work exposes open problems in re-identifying individuals under extreme conditions. We hope MUDD serves as a diverse and challenging benchmark to spur progress in robust re-id, especially for computer vision applications in emerging sports analytics. All code and data can be found at https://github.com/JacobTyo/MUDD.
Submitted 14 November, 2023;
originally announced November 2023.
-
Impossibility of bipartite full nonlocality, all-versus-nothing proofs, and pseudo-telepathy in small Bell scenarios
Authors:
Yuan Liu,
Ho Yiu Chung,
Emmanuel Zambrini Cruzeiro,
Junior R. Gonzales-Ureta,
Ravishankar Ramanathan,
Adán Cabello
Abstract:
We show that the following statements are equivalent: (i) A quantum correlation p is in a face of the nonsignaling polytope that does not contain local points. (ii) p has local fraction zero; i.e., p has full nonlocality (FN). (iii) p provides an all-versus-nothing (AVN) or Greenberger-Horne-Zeilinger-like proof of nonlocality. (iv) p is a pseudo-telepathy (PT) strategy. These connections imply that a long-standing question posed by Gisin, Méthot, and Scarani of whether quantum PT is possible with minimal requirements is fundamental for quantum information, quantum computation, and foundations of quantum mechanics, and can be addressed by a variety of strategies. Here, by combining different methods, we show that the answer is negative: according to quantum mechanics, nature does not allow for FN/AVN/PT in the (3,3;3,2) Bell scenario. Moreover, we show that FN/AVN/PT is also impossible in (3,2;3,4). We also studied (3,3;3,3) and found no example of FN/AVN/PT. We discuss the implications of these results and further applications of the methods presented.
Submitted 16 October, 2023;
originally announced October 2023.
-
CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders
Authors:
Heng-Jui Chang,
Ning Dong,
Ruslan Mavlyutov,
Sravya Popuri,
Yu-An Chung
Abstract:
Large-scale self-supervised pre-trained speech encoders outperform conventional approaches in speech recognition and translation tasks. Due to the high cost of developing these large models, building new encoders for new tasks and deploying them to on-device applications are infeasible. Prior studies propose model compression methods to address this issue, but those works focus on smaller models and less realistic tasks. Thus, we propose Contrastive Layer-to-layer Distillation (CoLLD), a novel knowledge distillation method to compress pre-trained speech encoders by leveraging masked prediction and contrastive learning to train student models to copy the behavior of a large teacher model. CoLLD outperforms prior methods and closes the gap between small and large models on multilingual speech-to-text translation and recognition benchmarks.
Submitted 27 December, 2023; v1 submitted 14 September, 2023;
originally announced September 2023.
-
(Almost-)Quantum Bell Inequalities and Device-Independent Applications
Authors:
Yuan Liu,
Ho Yiu Chung,
Ravishankar Ramanathan
Abstract:
Investigations of the boundary of the quantum correlation set through the derivation of quantum Bell inequalities have gained increased attention in recent years, which are related to Tsirelson's problem and have significant applications in DI information processing. However, determining quantum Bell inequalities is a notoriously difficult task and only isolated examples are known. In this paper, we present families of (almost-)quantum Bell inequalities and highlight three foundational and DI applications. Firstly, quantum correlations on the non-signaling boundary are crucial in the DI randomness extraction from weak sources. In the practical Bell scenario of two players with two k-outcome measurements, we derive quantum Bell inequalities that show a separation of the quantum boundary from nonlocal faces of the non-signaling polytope of dimension $\leq 4k-4$, extending previous results. As an immediate by-product of this, we give a general proof of Aumann's Agreement theorem for quantum systems and the almost-quantum correlations, which implies Aumann's agreement theorem is a reasonable physical principle in the context of epistemics to pick out both quantum theory and almost-quantum correlations from general no-signaling theories. Secondly, we present a family of quantum Bell inequalities in the two players with m binary measurements scenarios, that serve to self-test the two-qubit singlet and 2m measurements. Interestingly, this claim generalizes the result for m=2 discovered by Tsirelson-Landau-Masanes and shows an improvement over the state-of-the-art DIRA. Lastly, we use our quantum Bell inequalities to derive the general form of the principle of no advantage in nonlocal computation, which is an information-theoretic principle that serves to characterize the quantum correlation set. With this, we provide the most precise characterization of the quantum boundary known so far.
Submitted 18 March, 2024; v1 submitted 12 September, 2023;
originally announced September 2023.
-
Metastable Charge Distribution Between Degenerate Landau Levels
Authors:
Wenlu Lin,
Xing Fan,
Lili Zhao,
Yoon Jang Chung,
Adbhut Gupta,
Kirk W. Baldwin,
Loren Pfeiffer,
Hong Lu,
Yang Liu
Abstract:
We study two-dimensional electron systems confined in wide quantum wells whose subband separation is comparable with the Zeeman energy. Two N = 0 Landau levels from different subbands and with opposite spins are pinned in energy when they cross each other, and electrons can freely transfer between them. When the disorder is strong, we observe clear hysteresis in our data corresponding to instability of the electron distribution in the two crossing levels. When the intra-layer interaction dominates, multiple minima appear when a Landau level is 1/3 or 2/3 filled and the fractional quantum Hall effect can be stabilized.
Submitted 26 February, 2024; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Naturalness-motivated composite Higgs model for generating the top Yukawa coupling
Authors:
Yi Chung
Abstract:
The large top Yukawa coupling results in the top quark contributing significantly to the quantum correction of the Higgs mass term. Traditionally, this effect is canceled by the presence of top partners in symmetry-based models. However, the absence of light top partners poses a challenge to the Naturalness of these models. In this paper, we study a model based on composite Higgs with the top Yukawa coupling originating from dimension-six four-fermion operators. The low cutoff scale of the top quark loop required by the Naturalness principle can be realized with a light gauge boson $E_μ$ which connects the hyperfermions and top quarks. A scalar-less dynamical model with weakly coupled extended $SU(4)_{EC}$ gauge group is presented. The model features an $E_μ$ boson and a $Z'_E$ boson both at the sub-TeV scale, which lead to a rich phenomenology, especially in the top physics.
Submitted 14 May, 2024; v1 submitted 31 August, 2023;
originally announced September 2023.
-
SeamlessM4T: Massively Multilingual & Multimodal Machine Translation
Authors:
Seamless Communication,
Loïc Barrault,
Yu-An Chung,
Mariano Cora Meglioli,
David Dale,
Ning Dong,
Paul-Ambroise Duquenne,
Hady Elsahar,
Hongyu Gong,
Kevin Heffernan,
John Hoffman,
Christopher Klaiber,
Pengwei Li,
Daniel Licht,
Jean Maillard,
Alice Rakotoarison,
Kaushik Ram Sadagopan,
Guillaume Wenzek,
Ethan Ye,
Bapi Akula,
Peng-Jen Chen,
Naji El Hachem,
Brian Ellis,
Gabriel Mejia Gonzalez,
Justin Haaheim
, et al. (43 additional authors not shown)
Abstract:
What does it take to create the Babel Fish, a tool that can help individuals translate speech between any two languages? While recent breakthroughs in text-based models have pushed machine translation coverage beyond 200 languages, unified speech-to-speech translation models have yet to achieve similar strides. More specifically, conventional speech-to-speech translation systems rely on cascaded systems that perform translation progressively, putting high-performing unified systems out of reach. To address these gaps, we introduce SeamlessM4T, a single model that supports speech-to-speech translation, speech-to-text translation, text-to-speech translation, text-to-text translation, and automatic speech recognition for up to 100 languages. To build this, we used 1 million hours of open speech audio data to learn self-supervised speech representations with w2v-BERT 2.0. Subsequently, we created a multimodal corpus of automatically aligned speech translations. Filtered and combined with human-labeled and pseudo-labeled data, we developed the first multilingual system capable of translating from and into English for both speech and text. On FLEURS, SeamlessM4T sets a new standard for translations into multiple target languages, achieving an improvement of 20% BLEU over the previous SOTA in direct speech-to-text translation. Compared to strong cascaded models, SeamlessM4T improves the quality of into-English translation by 1.3 BLEU points in speech-to-text and by 2.6 ASR-BLEU points in speech-to-speech. Tested for robustness, our system performs better against background noises and speaker variations in speech-to-text tasks compared to the current SOTA model. Critically, we evaluated SeamlessM4T on gender bias and added toxicity to assess translation safety. Finally, all contributions in this work are open-sourced and accessible at https://github.com/facebookresearch/seamless_communication
Submitted 24 October, 2023; v1 submitted 22 August, 2023;
originally announced August 2023.
-
PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions
Authors:
John Joon Young Chung,
Eytan Adar
Abstract:
While diffusion-based text-to-image (T2I) models provide a simple and powerful way to generate images, guiding this generation remains a challenge. For concepts that are difficult to describe through language, users may struggle to create prompts. Moreover, many of these models are built as end-to-end systems, lacking support for iterative shaping of the image. In response, we introduce PromptPaint, which combines T2I generation with interactions that model how we use colored paints. PromptPaint allows users to go beyond language to mix prompts that express challenging concepts. Just as we iteratively tune colors through layered placements of paint on a physical canvas, PromptPaint similarly allows users to apply different prompts to different canvas areas and times of the generative process. Through a set of studies, we characterize different approaches for mixing prompts, design trade-offs, and socio-technical challenges for generative models. With PromptPaint we provide insight into future steerable generative tools.
Submitted 9 August, 2023;
originally announced August 2023.
-
DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures
Authors:
Angus R. Williams,
Hannah Rose Kirk,
Liam Burke,
Yi-Ling Chung,
Ivan Debono,
Pica Johansson,
Francesca Stevens,
Jonathan Bright,
Scott A. Hale
Abstract:
Public figures receive a disproportionate amount of abuse on social media, impacting their active participation in public life. Automated systems can identify abuse at scale but labelling training data is expensive, complex and potentially harmful. So, it is desirable that systems are efficient and generalisable, handling both shared and specific aspects of online abuse. We explore the dynamics of cross-group text classification in order to understand how well classifiers trained on one domain or demographic can transfer to others, with a view to building more generalisable abuse classifiers. We fine-tune language models to classify tweets targeted at public figures across DOmains (sport and politics) and DemOgraphics (women and men) using our novel DODO dataset, containing 28,000 labelled entries, split equally across four domain-demographic pairs. We find that (i) small amounts of diverse data are hugely beneficial to generalisation and model adaptation; (ii) models transfer more easily across demographics but models trained on cross-domain data are more generalisable; (iii) some groups contribute more to generalisability than others; and (iv) dataset similarity is a signal of transferability.
Submitted 25 April, 2024; v1 submitted 31 July, 2023;
originally announced July 2023.
-
Photometry of Type II Supernova SN 2023ixf with a Worldwide Citizen Science Network
Authors:
Lauren A. Sgro,
Thomas M. Esposito,
Guillaume Blaclard,
Sebastian Gomez,
Franck Marchis,
Alexei V. Filippenko,
Daniel O'Conner Peluso,
Stephen S. Lawrence,
Aad Verveen,
Andreas Wagner,
Anouchka Nardi,
Barbara Wiart,
Benjamin Mirwald,
Bill Christensen,
Bob Eramia,
Bruce Parker,
Bruno Guillet,
Byungki Kim,
Chelsey A. Logan,
Christopher C. M. Kyba,
Christopher Toulmin,
Claudio G. Vantaggiato,
Dana Adhis,
Dave Gary,
Dave Goodey
, et al. (66 additional authors not shown)
Abstract:
We present highly sampled photometry of the supernova (SN) 2023ixf, a Type II SN in M101, beginning 2 days before its first known detection. To gather these data, we enlisted the global Unistellar Network of citizen scientists. These 252 observations from 115 telescopes show the SN's rising brightness associated with shock emergence followed by gradual decay. We measure a peak $M_{V}$ = -18.18 $\pm$ 0.09 mag at 2023-05-25 21:37 UTC in agreement with previously published analyses.
Submitted 7 July, 2023;
originally announced July 2023.
-
Understanding Counterspeech for Online Harm Mitigation
Authors:
Yi-Ling Chung,
Gavin Abercrombie,
Florence Enock,
Jonathan Bright,
Verena Rieser
Abstract:
Counterspeech offers direct rebuttals to hateful speech by challenging perpetrators of hate and showing support to targets of abuse. It provides a promising alternative to more contentious measures, such as content moderation and deplatforming, by contributing a greater amount of positive online speech rather than attempting to mitigate harmful content through removal. Advances in the development of large language models mean that the process of producing counterspeech could be made more efficient by automating its generation, which would enable large-scale online campaigns. However, we currently lack a systematic understanding of several important factors relating to the efficacy of counterspeech for hate mitigation, such as which types of counterspeech are most effective, what are the optimal conditions for implementation, and which specific effects of hate it can best ameliorate. This paper aims to fill this gap by systematically reviewing counterspeech research in the social sciences and comparing methodologies and findings with computer science efforts in automatic counterspeech generation. By taking this multi-disciplinary view, we identify promising future directions in both fields.
Submitted 1 July, 2023;
originally announced July 2023.
-
Probing quantum phases in ultra-high-mobility two-dimensional electron systems using surface acoustic waves
Authors:
Mengmeng Wu,
Xiao Liu,
Renfei Wang,
Yoon Jang Chung,
Adbhut Gupta,
Kirk W. Baldwin,
Loren Pfeiffer,
Xi Lin,
Yang Liu
Abstract:
Transport measurement, which applies an electric field and studies the migration of charged particles, i.e. the current, is the most widely used technique in condensed matter studies. It is generally assumed that the quantum phase remains unchanged when it hosts a sufficiently small probing current, which is, surprisingly, rarely examined experimentally. In this work, we study the ultra-high mobility two-dimensional electron system using a propagating surface acoustic wave, whose traveling speed is affected by the electrons' compressibility. The acoustic power used in our study is several orders of magnitude lower than previous reports, and its induced perturbation to the system is smaller than the transport current. Therefore we are able to observe the quantum phases become more incompressible when hosting a perturbative current.
Submitted 31 January, 2024; v1 submitted 5 July, 2023;
originally announced July 2023.
-
FBA-Net: Foreground and Background Aware Contrastive Learning for Semi-Supervised Atrium Segmentation
Authors:
Yunsung Chung,
Chanho Lim,
Chao Huang,
Nassir Marrouche,
Jihun Hamm
Abstract:
Medical image segmentation of gadolinium enhancement magnetic resonance imaging (GE MRI) is an important task in clinical applications. However, manual annotation is time-consuming and requires specialized expertise. Semi-supervised segmentation methods that leverage both labeled and unlabeled data have shown promise, with contrastive learning emerging as a particularly effective approach. In this paper, we propose a contrastive learning strategy of foreground and background representations for semi-supervised 3D medical image segmentation (FBA-Net). Specifically, we leverage the contrastive loss to learn representations of both the foreground and background regions in the images. By training the network to distinguish between foreground-background pairs, we aim to learn a representation that can effectively capture the anatomical structures of interest. Experiments on three medical segmentation datasets demonstrate state-of-the-art performance. Notably, our method achieves a Dice score of 91.31% with only 20% labeled data, which is remarkably close to the 91.62% score of the fully supervised method that uses 100% labeled data on the left atrium dataset. Our framework has the potential to advance the field of semi-supervised 3D medical image segmentation and enable more efficient and accurate analysis of medical images with a limited amount of annotated labels.
Submitted 27 June, 2023;
originally announced June 2023.
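The Dice score quoted in the FBA-Net abstract above measures the overlap between a predicted and a reference segmentation mask, computed as twice the intersection divided by the sum of the two mask sizes. The following is a minimal sketch of that standard metric on flattened binary masks; the function name `dice_score` and the example masks are hypothetical illustrations and are not taken from the paper.

```python
def dice_score(pred, target):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two flat binary masks."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")
    # Count voxels labeled foreground in both masks.
    intersection = sum(1 for p, t in zip(pred, target) if p and t)
    # Total foreground voxels across both masks.
    total = sum(1 for p in pred if p) + sum(1 for t in target if t)
    if total == 0:
        # Both masks are empty; treating this as perfect agreement is a convention.
        return 1.0
    return 2.0 * intersection / total


# Hypothetical flattened masks (1 = foreground, 0 = background).
pred = [1, 1, 0, 0, 1]
target = [1, 0, 0, 1, 1]
score = dice_score(pred, target)
print(f"Dice = {score:.4f}")
```

Returning 1.0 when both masks are empty is one common convention; some implementations add a small smoothing constant to the numerator and denominator instead.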
-
Morita equivalence of two $\ell^p$ Roe-type algebras
Authors:
Yeong Chyuan Chung
Abstract:
Given a metric space with bounded geometry, one may associate with it the $\ell^p$ uniform Roe algebra and the $\ell^p$ uniform algebra, both containing information about the large scale geometry of the metric space. We show that these two Banach algebras are Morita equivalent in the sense of Lafforgue for $1\leq p<\infty$. As a consequence, these two Banach algebras have the same $K$-theory. We then define an $\ell^p$ uniform coarse assembly map taking values in the $K$-theory of the $\ell^p$ uniform Roe algebra and show that it is not always surjective.
Submitted 23 December, 2023; v1 submitted 26 June, 2023;
originally announced June 2023.
-
Increasing Diversity While Maintaining Accuracy: Text Data Generation with Large Language Models and Human Interventions
Authors:
John Joon Young Chung,
Ece Kamar,
Saleema Amershi
Abstract:
Large language models (LLMs) can be used to generate text data for training and evaluating other models. However, creating high-quality datasets with LLMs can be challenging. In this work, we explore human-AI partnerships to facilitate high diversity and accuracy in LLM-based text data generation. We first examine two approaches to diversify text generation: 1) logit suppression, which minimizes the generation of languages that have already been frequently generated, and 2) temperature sampling, which flattens the token sampling probability. We found that diversification approaches can increase data diversity but often at the cost of data accuracy (i.e., text and labels being appropriate for the target domain). To address this issue, we examined two human interventions, 1) label replacement (LR), correcting misaligned labels, and 2) out-of-scope filtering (OOSF), removing instances that are out of the user's domain of interest or to which no considered label applies. With oracle studies, we found that LR increases the absolute accuracy of models trained with diversified datasets by 14.4%. Moreover, we found that some models trained with data generated with LR interventions outperformed LLM-based few-shot classification. In contrast, OOSF was not effective in increasing model accuracy, implying the need for future work in human-in-the-loop text data generation.
Submitted 7 June, 2023;
originally announced June 2023.
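The abstract above describes temperature sampling as flattening the token sampling probability. As a rough illustration of that general idea, the sketch below divides a set of logits by a temperature before applying softmax; a temperature above 1 moves probability mass from likely tokens toward unlikely ones. The function name `softmax_with_temperature` and the example logits are hypothetical and do not come from the paper.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Convert logits to probabilities after scaling by 1 / temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the maximum for numerical stability before exponentiating.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for three candidate tokens.
logits = [2.0, 1.0, 0.1]
sharp = softmax_with_temperature(logits, temperature=1.0)
flat = softmax_with_temperature(logits, temperature=2.0)
print("T=1.0:", [round(p, 3) for p in sharp])
print("T=2.0:", [round(p, 3) for p in flat])
```

With the higher temperature, the most likely token receives less probability and the least likely token receives more, which is the diversity-versus-accuracy trade-off the abstract refers to.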
-
Delocalization and Universality of the Fractional Quantum Hall Plateau-to-Plateau Transitions
Authors:
P. T. Madathil,
K. A. Villegas Rosales,
C. T. Tai,
Y. J. Chung,
L. N. Pfeiffer,
K. W. West,
K. W. Baldwin,
M. Shayegan
Abstract:
Disorder and electron-electron interaction play essential roles in the physics of electron systems in condensed matter. In two-dimensional, quantum Hall systems, extensive studies of disorder-induced localization have led to the emergence of a scaling picture with a single extended state, characterized by a power-law divergence of the localization length in the zero-temperature limit. Experimentally, scaling has been investigated via measuring the temperature dependence of plateau-to-plateau transitions between the integer quantum Hall states (IQHSs), yielding a critical exponent $κ\simeq 0.42$. Here we report scaling measurements in the fractional quantum Hall state (FQHS) regime where interaction plays a dominant role. Our study is partly motivated by recent calculations, based on the composite fermion theory, that suggest identical critical exponents in both IQHS and FQHS cases to the extent that the interaction between composite fermions is negligible. The samples used in our experiments are two-dimensional electron systems confined to GaAs quantum wells of exceptionally high quality. We find that $κ$ varies for transitions between different FQHSs observed on the flanks of Landau level filling factor $ν=1/2$, and has a value close to that reported for the IQHS transitions only for a limited number of transitions between high-order FQHSs with intermediate strength. We discuss possible origins of the non-universal $κ$ observed in our experiments.
Submitted 6 June, 2023;
originally announced June 2023.
-
Topological Data Analysis Assisted Automated Sleep Stage Scoring Using Airflow Signals
Authors:
Yu-Min Chung,
Whitney K. Huang,
Hau-Tieng Wu
Abstract:
Objective: Breathing pattern variability (BPV), as a universal physiological feature, encodes rich health information. We aim to show that high-quality automatic sleep stage scoring can be achieved based on a proper quantification of BPV extracted from a single airflow signal.
Methods: Topological data analysis (TDA) is applied to characterize BPV from the intrinsically nonstationary airflow signal, where the extracted features are used to train an automatic sleep stage scoring model using the XGBoost learner. The noise and artifacts commonly present in the airflow signal are recycled to enhance the performance of the trained system. The state-of-the-art approach is implemented for a comparison.
Results: When applied to 30 whole-night polysomnogram signals with standard annotations, the leave-one-subject-out cross-validation shows that the proposed features (overall accuracy 78.8\%$\pm$8.7\% and Cohen's kappa 0.56$\pm$0.15) outperform those considered in the state-of-the-art work (overall accuracy 75.0\%$\pm$9.6\% and Cohen's kappa 0.50$\pm$0.15) when applied to automatically score wake, rapid eye movement (REM) and non-REM (NREM). Examining the feature importance shows that the TDA features contain information complementary to the traditional features commonly used in the literature. The respiratory quality index is found to be essential in the trained system.
Conclusion: The proposed TDA-assisted automatic annotation system can accurately distinguish wake, REM and NREM from the airflow signal.
Significance: Since only one single airflow channel is needed and BPV is universal, the result suggests that the TDA-assisted signal processing has potential to be applied to other biomedical signals and homecare problems other than the sleep stage annotation.
Submitted 5 June, 2023;
originally announced June 2023.
-
Parity Calibration
Authors:
Youngseog Chung,
Aaron Rumack,
Chirag Gupta
Abstract:
In a sequential regression setting, a decision-maker may be primarily concerned with whether the future observation will increase or decrease compared to the current one, rather than the actual value of the future observation. In this context, we introduce the notion of parity calibration, which captures the goal of calibrated forecasting for the increase-decrease (or "parity") event in a timeseries. Parity probabilities can be extracted from a forecasted distribution for the output, but we show that such a strategy leads to theoretical unpredictability and poor practical performance. We then observe that although the original task was regression, parity calibration can be expressed as binary calibration. Drawing on this connection, we use an online binary calibration method to achieve parity calibration. We demonstrate the effectiveness of our approach on real-world case studies in epidemiology, weather forecasting, and model-based control in nuclear fusion.
Submitted 7 June, 2023; v1 submitted 29 May, 2023;
originally announced May 2023.
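The Parity Calibration abstract above notes that a parity probability can be extracted from a forecasted distribution for the next observation. As a rough sketch of that baseline extraction only, the probability of an increase is one minus the forecast CDF evaluated at the current value. The Gaussian forecast, the function names, and the numbers below are assumptions made for illustration. The abstract reports that this direct strategy performs poorly in practice, and the sketch does not implement the authors' online binary calibration method.

```python
import math


def gaussian_cdf(x, mean, std):
    """CDF of a normal distribution, computed with the error function."""
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def parity_probability(current_value, forecast_mean, forecast_std):
    """P(next observation > current value) under an assumed Gaussian forecast."""
    if forecast_std <= 0:
        raise ValueError("forecast_std must be positive")
    return 1.0 - gaussian_cdf(current_value, forecast_mean, forecast_std)


# Hypothetical example: current value 10.0, forecast N(11.0, 2.0^2).
p_up = parity_probability(10.0, forecast_mean=11.0, forecast_std=2.0)
print(f"P(increase) = {p_up:.4f}")
```

Any forecast distribution with an available CDF could replace the Gaussian here; the Gaussian is used only because its CDF is available in the Python standard library.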