-
Proxy Methods for Domain Adaptation
Authors:
Katherine Tsai,
Stephen R. Pfohl,
Olawale Salaudeen,
Nicole Chiou,
Matt J. Kusner,
Alexander D'Amour,
Sanmi Koyejo,
Arthur Gretton
Abstract:
We study the problem of domain adaptation under distribution shift, where the shift is due to a change in the distribution of an unobserved, latent variable that confounds both the covariates and the labels. In this setting, neither the covariate shift nor the label shift assumptions apply. Our approach to adaptation employs proximal causal learning, a technique for estimating causal effects in se…
▽ More
We study the problem of domain adaptation under distribution shift, where the shift is due to a change in the distribution of an unobserved, latent variable that confounds both the covariates and the labels. In this setting, neither the covariate shift nor the label shift assumptions apply. Our approach to adaptation employs proximal causal learning, a technique for estimating causal effects in settings where proxies of unobserved confounders are available. We demonstrate that proxy variables allow for adaptation to distribution shift without explicitly recovering or modeling latent variables. We consider two settings, (i) Concept Bottleneck: an additional ''concept'' variable is observed that mediates the relationship between the covariates and labels; (ii) Multi-domain: training data from multiple source domains is available, where each source domain exhibits a different distribution over the latent confounder. We develop a two-stage kernel estimation approach to adapt to complex distribution shifts in both settings. In our experiments, we show that our approach outperforms other methods, notably those which explicitly recover the latent confounder.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Using ResNet to Utilize 4-class T2-FLAIR Slice Classification Based on the Cholinergic Pathways Hyperintensities Scale for Pathological Aging
Authors:
Wei-Chun Kevin Tsai,
Yi-Chien Liu,
Ming-Chun Yu,
Chia-Ju Chou,
Sui-Hing Yan,
Yang-Teng Fan,
Yan-Hsiang Huang,
Yen-Ling Chiu,
Yi-Fang Chuang,
Ran-Zan Wang,
Yao-Chia Shih
Abstract:
The Cholinergic Pathways Hyperintensities Scale (CHIPS) is a visual rating scale used to assess the extent of cholinergic white matter hyperintensities in T2-FLAIR images, serving as an indicator of dementia severity. However, the manual selection of four specific slices for rating throughout the entire brain is a time-consuming process. Our goal was to develop a deep learning-based model capable…
▽ More
The Cholinergic Pathways Hyperintensities Scale (CHIPS) is a visual rating scale used to assess the extent of cholinergic white matter hyperintensities in T2-FLAIR images, serving as an indicator of dementia severity. However, the manual selection of four specific slices for rating throughout the entire brain is a time-consuming process. Our goal was to develop a deep learning-based model capable of automatically identifying the four slices relevant to CHIPS. To achieve this, we trained a 4-class slice classification model (BSCA) using the ADNI T2-FLAIR dataset (N=150) with the assistance of ResNet. Subsequently, we tested the model's performance on a local dataset (N=30). The results demonstrated the efficacy of our model, with an accuracy of 99.82% and an F1-score of 99.83%. This achievement highlights the potential impact of BSCA as an automatic screening tool, streamlining the selection of four specific T2-FLAIR slices that encompass white matter landmarks along the cholinergic pathways. Clinicians can leverage this tool to assess the risk of clinical dementia development efficiently.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Goodness-of-Fit of Attributed Probabilistic Graph Generative Models
Authors:
Pablo Robles-Granda,
Katherine Tsai,
Oluwasanmi Koyejo
Abstract:
Probabilistic generative models of graphs are important tools that enable representation and sampling. Many recent works have created probabilistic models of graphs that are capable of representing not only entity interactions but also their attributes. However, given a generative model of random attributed graph(s), the general conditions that establish goodness of fit are not clear a-priori. In…
▽ More
Probabilistic generative models of graphs are important tools that enable representation and sampling. Many recent works have created probabilistic models of graphs that are capable of representing not only entity interactions but also their attributes. However, given a generative model of random attributed graph(s), the general conditions that establish goodness of fit are not clear a-priori. In this paper, we define goodness of fit in terms of the mean square contingency coefficient for random binary networks. For this statistic, we outline a procedure for assessing the quality of the structure of a learned attributed graph by ensuring that the discrepancy of the mean square contingency coefficient (constant, or random) is minimal with high probability. We apply these criteria to verify the representation capability of a probabilistic generative model for various popular types of graph models.
△ Less
Submitted 28 July, 2023;
originally announced August 2023.
-
Adapting to Latent Subgroup Shifts via Concepts and Proxies
Authors:
Ibrahim Alabdulmohsin,
Nicole Chiou,
Alexander D'Amour,
Arthur Gretton,
Sanmi Koyejo,
Matt J. Kusner,
Stephen R. Pfohl,
Olawale Salaudeen,
Jessica Schrouff,
Katherine Tsai
Abstract:
We address the problem of unsupervised domain adaptation when the source domain differs from the target domain because of a shift in the distribution of a latent subgroup. When this subgroup confounds all observed data, neither covariate shift nor label shift assumptions apply. We show that the optimal target predictor can be non-parametrically identified with the help of concept and proxy variabl…
▽ More
We address the problem of unsupervised domain adaptation when the source domain differs from the target domain because of a shift in the distribution of a latent subgroup. When this subgroup confounds all observed data, neither covariate shift nor label shift assumptions apply. We show that the optimal target predictor can be non-parametrically identified with the help of concept and proxy variables available only in the source domain, and unlabeled data from the target. The identification results are constructive, immediately suggesting an algorithm for estimating the optimal predictor in the target. For continuous observations, when this algorithm becomes impractical, we propose a latent variable model specific to the data generation process at hand. We show how the approach degrades as the size of the shift changes, and verify that it outperforms both covariate and label shift adjustment.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
Latent Multimodal Functional Graphical Model Estimation
Authors:
Katherine Tsai,
Boxin Zhao,
Sanmi Koyejo,
Mladen Kolar
Abstract:
Joint multimodal functional data acquisition, where functional data from multiple modes are measured simultaneously from the same subject, has emerged as an exciting modern approach enabled by recent engineering breakthroughs in the neurological and biological sciences. One prominent motivation to acquire such data is to enable new discoveries of the underlying connectivity by combining multimodal…
▽ More
Joint multimodal functional data acquisition, where functional data from multiple modes are measured simultaneously from the same subject, has emerged as an exciting modern approach enabled by recent engineering breakthroughs in the neurological and biological sciences. One prominent motivation to acquire such data is to enable new discoveries of the underlying connectivity by combining multimodal signals. Despite the scientific interest, there remains a gap in principled statistical methods for estimating the graph underlying multimodal functional data. To this end, we propose a new integrative framework that models the data generation process and identifies operators map** from the observation space to the latent space. We then develop an estimator that simultaneously estimates the transformation operators and the latent graph. This estimator is based on the partial correlation operator, which we rigorously extend from the multivariate to the functional setting. Our procedure is provably efficient, with the estimator converging to a stationary point with quantifiable statistical error. Furthermore, we show recovery of the latent graph under mild conditions. Our work is applied to analyze simultaneously acquired multimodal brain imaging data where the graph indicates functional connectivity of the brain. We present simulation and empirical results that support the benefits of joint estimation.
△ Less
Submitted 1 October, 2023; v1 submitted 31 October, 2022;
originally announced October 2022.
-
Analyzing Image-based Political Propaganda in Referendum Campaigns: From Elements to Strategies
Authors:
Ming-Hung Wang,
Wei-Yang Chang,
Kuan-Hung Kuo,
Kuo-Yu Tsai
Abstract:
With the increasing popularity of social network services, paradigm-shifting has occurred in political communication. Politicians, candidates, and political organizations establish their fan pages to interact with online citizens. Initially, they publish text-only content on sites; then, they create multimedia content such as photos, images, and videos to approach more people. This paper takes a f…
▽ More
With the increasing popularity of social network services, paradigm-shifting has occurred in political communication. Politicians, candidates, and political organizations establish their fan pages to interact with online citizens. Initially, they publish text-only content on sites; then, they create multimedia content such as photos, images, and videos to approach more people. This paper takes a first look at image-based political propaganda during a national referendum in Taiwan. Unlike elections, a referendum is a vote on policies. We investigated more than 2,000 images posted on Facebook by the two major parties to understand the elements of images and the strategies of political organizations. In addition, we studied the data collection's textual content, objects, and colors. The results suggest the aspects of propaganda materials vary with different political organizations. However, the coloring strategies are similar, using representative colors for consolidation and the opponent's colors for attacks.
△ Less
Submitted 26 May, 2022;
originally announced May 2022.
-
Joint Gaussian Graphical Model Estimation: A Survey
Authors:
Katherine Tsai,
Oluwasanmi Koyejo,
Mladen Kolar
Abstract:
Graphs from complex systems often share a partial underlying structure across domains while retaining individual features. Thus, identifying common structures can shed light on the underlying signal, for instance, when applied to scientific discoveries or clinical diagnoses. Furthermore, growing evidence shows that the shared structure across domains boosts the estimation power of graphs, particul…
▽ More
Graphs from complex systems often share a partial underlying structure across domains while retaining individual features. Thus, identifying common structures can shed light on the underlying signal, for instance, when applied to scientific discoveries or clinical diagnoses. Furthermore, growing evidence shows that the shared structure across domains boosts the estimation power of graphs, particularly for high-dimensional data. However, building a joint estimator to extract the common structure may be more complicated than it seems, most often due to data heterogeneity across sources. This manuscript surveys recent work on statistical inference of joint Gaussian graphical models, identifying model structures that fit various data generation processes. Simulations under different data generation processes are implemented with detailed discussions on the choice of models.
△ Less
Submitted 3 April, 2022; v1 submitted 19 October, 2021;
originally announced October 2021.
-
Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder
Authors:
Kun-Hsi Tsai,
Wei-Chien Wang,
Chui-Hsuan Cheng,
Chan-Yen Tsai,
Jou-Kou Wang,
Tzu-Hao Lin,
Shih-Hau Fang,
Li-Chin Chen,
Yu Tsao
Abstract:
Auscultation is the most efficient way to diagnose cardiovascular and respiratory diseases. To reach accurate diagnoses, a device must be able to recognize heart and lung sounds from various clinical situations. However, the recorded chest sounds are mixed by heart and lung sounds. Thus, effectively separating these two sounds is critical in the pre-processing stage. Recent advances in machine lea…
▽ More
Auscultation is the most efficient way to diagnose cardiovascular and respiratory diseases. To reach accurate diagnoses, a device must be able to recognize heart and lung sounds from various clinical situations. However, the recorded chest sounds are mixed by heart and lung sounds. Thus, effectively separating these two sounds is critical in the pre-processing stage. Recent advances in machine learning have progressed on monaural source separations, but most of the well-known techniques require paired mixed sounds and individual pure sounds for model training. As the preparation of pure heart and lung sounds is difficult, special designs must be considered to derive effective heart and lung sound separation techniques. In this study, we proposed a novel periodicity-coded deep auto-encoder (PC-DAE) approach to separate mixed heart-lung sounds in an unsupervised manner via the assumption of different periodicities between heart rate and respiration rate. The PC-DAE benefits from deep-learning-based models by extracting representative features and considers the periodicity of heart and lung sounds to carry out the separation. We evaluated PC-DAE on two datasets. The first one includes sounds from the Student Auscultation Manikin (SAM), and the second is prepared by recording chest sounds in real-world conditions. Experimental results indicate that PC-DAE outperforms several well-known separations works in terms of standardized evaluation metrics. Moreover, waveforms and spectrograms demonstrate the effectiveness of PC-DAE compared to existing approaches. It is also confirmed that by using the proposed PC-DAE as a pre-processing stage, the heart sound recognition accuracies can be notably boosted. The experimental results confirmed the effectiveness of PC-DAE and its potential to be used in clinical applications.
△ Less
Submitted 11 December, 2020;
originally announced December 2020.
-
A Nonconvex Framework for Structured Dynamic Covariance Recovery
Authors:
Katherine Tsai,
Mladen Kolar,
Oluwasanmi Koyejo
Abstract:
We propose a flexible yet interpretable model for high-dimensional data with time-varying second order statistics, motivated and applied to functional neuroimaging data. Motivated by the neuroscience literature, we factorize the covariances into sparse spatial and smooth temporal components. While this factorization results in both parsimony and domain interpretability, the resulting estimation pr…
▽ More
We propose a flexible yet interpretable model for high-dimensional data with time-varying second order statistics, motivated and applied to functional neuroimaging data. Motivated by the neuroscience literature, we factorize the covariances into sparse spatial and smooth temporal components. While this factorization results in both parsimony and domain interpretability, the resulting estimation problem is nonconvex. To this end, we design a two-stage optimization scheme with a carefully tailored spectral initialization, combined with iteratively refined alternating projected gradient descent. We prove a linear convergence rate up to a nontrivial statistical error for the proposed descent scheme and establish sample complexity guarantees for the estimator. We further quantify the statistical error for the multivariate Gaussian case. Empirical results using simulated and real brain imaging data illustrate that our approach outperforms existing baselines.
△ Less
Submitted 17 July, 2021; v1 submitted 11 November, 2020;
originally announced November 2020.
-
Achieving Correlated Equilibrium by Studying Opponent's Behavior Through Policy-Based Deep Reinforcement Learning
Authors:
Kuo Chun Tsai,
Zhu Han
Abstract:
Game theory is a very profound study on distributed decision-making behavior and has been extensively developed by many scholars. However, many existing works rely on certain strict assumptions such as knowing the opponent's private behaviors, which might not be practical. In this work, we focused on two Nobel winning concepts, the Nash equilibrium and the correlated equilibrium. Specifically, we…
▽ More
Game theory is a very profound study on distributed decision-making behavior and has been extensively developed by many scholars. However, many existing works rely on certain strict assumptions such as knowing the opponent's private behaviors, which might not be practical. In this work, we focused on two Nobel winning concepts, the Nash equilibrium and the correlated equilibrium. Specifically, we successfully reached the correlated equilibrium outside the convex hull of the Nash equilibria with our proposed deep reinforcement learning algorithm. With the correlated equilibrium probability distribution, we also propose a mathematical model to inverse the calculation of the correlated equilibrium probability distribution to estimate the opponent's payoff vector. With those payoffs, deep reinforcement learning learns why and how the rational opponent plays, instead of just learning the regions for corresponding strategies and actions. Through simulations, we showed that our proposed method can achieve the optimal correlated equilibrium and outside the convex hull of the Nash equilibrium with limited interaction among players.
△ Less
Submitted 18 April, 2020;
originally announced April 2020.
-
Unexpected two-fold symmetric superconductivity in few-layer NbSe$_2$
Authors:
Alex Hamill,
Brett Heischmidt,
Egon Sohn,
Daniel Shaffer,
Kan-Ting Tsai,
Xi Zhang,
Xiaoxiang Xi,
Alexey Suslov,
Helmuth Berger,
László Forró,
Fiona J. Burnell,
Jie Shan,
Kin Fai Mak,
Rafael M. Fernandes,
Ke Wang,
Vlad S. Pribiag
Abstract:
Two-dimensional transition metal dichalcogenides (TMDs) have been attracting significant interest due to a range of properties, such as layer-dependent inversion symmetry, valley-contrasted Berry curvatures, and strong spin-orbit coupling (SOC). Of particular interest is niobium diselenide (NbSe2), whose superconducting state in few-layer samples is profoundly affected by an unusual type of SOC ca…
▽ More
Two-dimensional transition metal dichalcogenides (TMDs) have been attracting significant interest due to a range of properties, such as layer-dependent inversion symmetry, valley-contrasted Berry curvatures, and strong spin-orbit coupling (SOC). Of particular interest is niobium diselenide (NbSe2), whose superconducting state in few-layer samples is profoundly affected by an unusual type of SOC called Ising SOC. Combined with the reduced dimensionality, the latter stabilizes the superconducting state against magnetic fields up to ~35 T and could lead to other exotic properties such as nodal and crystalline topological superconductivity. Here, we report transport measurements of few-layer NbSe$_2$ under in-plane external magnetic fields, revealing an unexpected two-fold rotational symmetry of the superconducting state. In contrast to the three-fold symmetry of the lattice, we observe that the magnetoresistance and critical field exhibit a two-fold oscillation with respect to an applied in-plane magnetic field. We find similar two-fold oscillations deep inside the superconducting state in differential conductance measurements on NbSe$_2$/CrBr$_3$ superconductor-magnet junctions. In both cases, the anisotropy vanishes in the normal state, demonstrating that it is an intrinsic property of the superconducting phase. We attribute the behavior to the mixing between two closely competing pairing instabilities, namely, the conventional s-wave instability typical of bulk NbSe$_2$ and an unconventional d- or p-wave channel that emerges in few-layer NbSe2. Our results thus demonstrate the unconventional character of the pairing interaction in a few-layer TMD, opening a new avenue to search for exotic superconductivity in this family of 2D materials.
△ Less
Submitted 6 April, 2020;
originally announced April 2020.
-
Correlated Insulating States and Transport Signature of Superconductivity in Twisted Trilayer Graphene Moiré of Moiré Superlattices
Authors:
Kan-Ting Tsai,
Xi Zhang,
Ziyan Zhu,
Yujie Luo,
Stephen Carr,
Mitchell Luskin,
Efthimios Kaxiras,
Ke Wang
Abstract:
Layers of two-dimensional materials stacked with a small twist-angle give rise to beating periodic patterns on a scale much larger than the original lattice, referred to as a moiré superlattice. When the stacking involves more than two layers with independent twist angles between adjacent layers, it generates moiré of moiré superlattices, with multiple length scales that control the system's behav…
▽ More
Layers of two-dimensional materials stacked with a small twist-angle give rise to beating periodic patterns on a scale much larger than the original lattice, referred to as a moiré superlattice. When the stacking involves more than two layers with independent twist angles between adjacent layers, it generates moiré of moiré superlattices, with multiple length scales that control the system's behavior. Here we demonstrate these effects of a high-order moiré superlattice in twisted trilayer graphene with two consecutive small twist angles. We report correlated insulating states near the half filling of the moiré of moiré superlattice at an extremely low carrier density (~1010 cm-2), near which we also report a zero-resistance transport behavior typically expected in a 2D superconductor. Moreover, the temperature dependence of the measured resistances at full-occupancy (v = -4 and v = 4) states are semi-metallic, distinct from the insulating behavior of twisted bilayer systems, providing the first demonstration of emergent correlated transport behaviors from continuous, non-isolated higher-order moiré flat bands. Our findings shed new insights into the microscopic mechanisms of moiré correlated states and provide the impetus for future studies on this material platform, such as the demonstration of phase coherence and Meissner-like effect.
△ Less
Submitted 1 December, 2020; v1 submitted 6 December, 2019;
originally announced December 2019.
-
Learning from Label Proportions with Consistency Regularization
Authors:
Kuen-Han Tsai,
Hsuan-Tien Lin
Abstract:
The problem of learning from label proportions (LLP) involves training classifiers with weak labels on bags of instances, rather than strong labels on individual instances. The weak labels only contain the label proportion of each bag. The LLP problem is important for many practical applications that only allow label proportions to be collected because of data privacy or annotation cost, and has r…
▽ More
The problem of learning from label proportions (LLP) involves training classifiers with weak labels on bags of instances, rather than strong labels on individual instances. The weak labels only contain the label proportion of each bag. The LLP problem is important for many practical applications that only allow label proportions to be collected because of data privacy or annotation cost, and has recently received lots of research attention. Most existing works focus on extending supervised learning models to solve the LLP problem, but the weak learning nature makes it hard to further improve LLP performance with a supervised angle. In this paper, we take a different angle from semi-supervised learning. In particular, we propose a novel model inspired by consistency regularization, a popular concept in semi-supervised learning that encourages the model to produce a decision boundary that better describes the data manifold. With the introduction of consistency regularization, we further extend our study to non-uniform bag-generation and validation-based parameter-selection procedures that better match practical needs. Experiments not only justify that LLP with consistency regularization achieves superior performance, but also demonstrate the practical usability of the proposed procedures.
△ Less
Submitted 29 October, 2019;
originally announced October 2019.
-
The Physics of the B Factories
Authors:
A. J. Bevan,
B. Golob,
Th. Mannel,
S. Prell,
B. D. Yabsley,
K. Abe,
H. Aihara,
F. Anulli,
N. Arnaud,
T. Aushev,
M. Beneke,
J. Beringer,
F. Bianchi,
I. I. Bigi,
M. Bona,
N. Brambilla,
J. B rodzicka,
P. Chang,
M. J. Charles,
C. H. Cheng,
H. -Y. Cheng,
R. Chistov,
P. Colangelo,
J. P. Coleman,
A. Drutskoy
, et al. (2009 additional authors not shown)
Abstract:
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary…
▽ More
This work is on the Physics of the B Factories. Part A of this book contains a brief description of the SLAC and KEK B Factories as well as their detectors, BaBar and Belle, and data taking related issues. Part B discusses tools and methods used by the experiments in order to obtain results. The results themselves can be found in Part C.
Please note that version 3 on the archive is the auxiliary version of the Physics of the B Factories book. This uses the notation alpha, beta, gamma for the angles of the Unitarity Triangle. The nominal version uses the notation phi_1, phi_2 and phi_3. Please cite this work as Eur. Phys. J. C74 (2014) 3026.
△ Less
Submitted 31 October, 2015; v1 submitted 24 June, 2014;
originally announced June 2014.
-
Anomalous k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures
Authors:
Ikai Lo,
M. H. Gau,
J. K. Tsai,
Y. L. Chen,
Z. J. Chang,
W. T. Wang,
J. C. Chiang,
T. Aggerstam
Abstract:
We have confirmed the k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures. Anomalous beating pattern in Shubnikov-de Haas measurements arises from the interference of Rashba and Dresselhaus spin-orbit interactions. The dominant mechanism for the k-dependent spin splitting at high values of k is attributed to Dresselhaus term which is enhanced by the Delta C1-Delta C3 coupling o…
▽ More
We have confirmed the k-dependent spin splitting in wurtzite AlxGa1-xN/GaN heterostructures. Anomalous beating pattern in Shubnikov-de Haas measurements arises from the interference of Rashba and Dresselhaus spin-orbit interactions. The dominant mechanism for the k-dependent spin splitting at high values of k is attributed to Dresselhaus term which is enhanced by the Delta C1-Delta C3 coupling of wurtzite band folding effect.
△ Less
Submitted 9 November, 2006; v1 submitted 15 September, 2006;
originally announced September 2006.
-
Study of two-subband population in Fe-doped AlxGa1-xN/GaN heterostructures by persistent photoconductivity effect
Authors:
Ikai Lo,
J. K. Tsai,
M. H. Gau,
Y. L. Chen,
Z. J. Chang,
W. T. Wang,
J. C. Chiang,
K. R. Wang,
Chun-Nan Chen,
T. Aggerstam
Abstract:
The electronic properties of Fe-doped Al0.31Ga0.69N/GaN heterostructures have been studied by Shubnikov-de Haas measurement. Two subbands of the two-dimensional electron gas in the hetero-interface were populated. After the low temperature illumination, the electron density increases from 11.99 x 1012 cm-2 to 13.40 x 1012 cm-2 for the first subband and from 0.66 x 1012 cm-2 to 0.94 x 1012 cm-2 f…
▽ More
The electronic properties of Fe-doped Al0.31Ga0.69N/GaN heterostructures have been studied by Shubnikov-de Haas measurement. Two subbands of the two-dimensional electron gas in the hetero-interface were populated. After the low temperature illumination, the electron density increases from 11.99 x 1012 cm-2 to 13.40 x 1012 cm-2 for the first subband and from 0.66 x 1012 cm-2 to 0.94 x 1012 cm-2 for the second subband. The persistent photoconductivity effect (~13% increase) is mostly attributed to the Fe-related deep-donor level in GaN layer. The second subband starts to populate when the first subband is filled at a density n1 = 9.40 x 1012 cm-2. We calculate the energy separation between the first and second subbands to be 105 meV.
△ Less
Submitted 14 September, 2006;
originally announced September 2006.