-
Multi-Objective Optimization for Common-Centroid Placement of Analog Transistors
Authors:
Supriyo Maji,
Hyungjoo Park,
Gi moon Hong,
Souradip Poddar,
David Z. Pan
Abstract:
In analog circuits, process variation can cause unpredictability in circuit performance. Common-centroid (CC) type layouts have been shown to mitigate process-induced variations and are widely used to match circuit elements. Nevertheless, selecting the most suitable CC topology necessitates careful consideration of important layout constraints. Manual handling of these constraints becomes challenging, especially for large problems. State-of-the-art CC placement methods lack an optimization framework to handle important layout constraints collectively. They also require manual effort, and consequently the solutions can be suboptimal. To address this, we propose a unified framework based on multi-objective optimization for CC placement of analog transistors. Our method handles various constraints, including degree of dispersion, routing complexity, diffusion sharing, and layout-dependent effects. The multi-objective optimization provides better handling of the objectives when compared to single-objective optimization. Moreover, compared to existing methods, our method explores more CC topologies. Post-layout simulation results show better performance compared to state-of-the-art techniques in generating CC layouts.
Submitted 30 June, 2024;
originally announced July 2024.
-
A study guide for "On the Hausdorff dimension of Furstenberg sets and orthogonal projections in the plane" after T. Orponen and P. Shmerkin
Authors:
Jacob B. Fiedler,
Guo-Dong Hong,
Donggeun Ryou,
Shukun Wu
Abstract:
This article is a study guide for "On the Hausdorff dimension of Furstenberg sets and orthogonal projections in the plane" by Orponen and Shmerkin. We begin by introducing the Furstenberg set problem and the exceptional set of projections, and then provide a summary of the proof with its core ideas.
Submitted 6 June, 2024;
originally announced June 2024.
-
Are We Done with MMLU?
Authors:
Aryo Pradipta Gema,
Joshua Ong Jun Leang,
Giwon Hong,
Alessio Devoto,
Alberto Carlo Maria Mancino,
Rohit Saxena,
Xuanli He,
Yu Zhao,
Xiaotang Du,
Mohammad Reza Ghasemi Madani,
Claire Barale,
Robert McHardy,
Joshua Harris,
Jean Kaddour,
Emile van Krieken,
Pasquale Minervini
Abstract:
Maybe not. We identify and analyse errors in the popular Massive Multitask Language Understanding (MMLU) benchmark. Even though MMLU is widely adopted, our analysis demonstrates numerous ground truth errors that obscure the true capabilities of LLMs. For example, we find that 57% of the analysed questions in the Virology subset contain errors. To address this issue, we introduce a comprehensive framework for identifying dataset errors using a novel error taxonomy. Then, we create MMLU-Redux, which is a subset of 3,000 manually re-annotated questions across 30 MMLU subjects. Using MMLU-Redux, we demonstrate significant discrepancies with the model performance metrics that were originally reported. Our results strongly advocate for revising MMLU's error-ridden questions to enhance its future utility and reliability as a benchmark. Therefore, we open up MMLU-Redux for additional annotation https://huggingface.co/datasets/edinburgh-dawg/mmlu-redux.
Submitted 7 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
QGait: Toward Accurate Quantization for Gait Recognition with Binarized Input
Authors:
Senmao Tian,
Haoyu Gao,
Gangyi Hong,
Shuyun Wang,
**gJie Wang,
Xin Yu,
Shunli Zhang
Abstract:
Existing deep learning methods have made significant progress in gait recognition. Typically, appearance-based models binarize inputs into silhouette sequences. However, mainstream quantization methods prioritize minimizing task loss over quantization error, which is detrimental to gait recognition with binarized inputs. Minor variations in silhouette sequences can be diminished in the network's intermediate layers due to the accumulation of quantization errors. To address this, we propose a differentiable soft quantizer, which better simulates the gradient of the round function during backpropagation. This enables the network to learn from subtle input perturbations. However, our theoretical analysis and empirical studies reveal that directly applying the soft quantizer can hinder network convergence. We further refine the training strategy to ensure convergence while simulating quantization errors. Additionally, we visualize the distribution of outputs from different samples in the feature space and observe significant changes compared to the full precision network, which harms performance. Based on this, we propose an Inter-class Distance-guided Distillation (IDD) strategy to preserve the relative distance between the embeddings of samples with different labels. Extensive experiments validate the effectiveness of our approach, demonstrating state-of-the-art accuracy across various settings and datasets. The code will be made publicly available.
Submitted 22 May, 2024;
originally announced May 2024.
-
Decomposition of Longitudinal Disparities: an Application to the Fetal Growth-Singletons Study
Authors:
Sang Kyu Lee,
Seon** Kim,
Mi-Ok Kim,
Katherine L. Grantz,
Hyokyoung G. Hong
Abstract:
Addressing health disparities among different demographic groups is a key challenge in public health. Despite many efforts, there is still a gap in understanding how these disparities unfold over time. Our paper focuses on this overlooked longitudinal aspect, which is crucial in both clinical and public health settings. In this paper, we introduce a longitudinal disparity decomposition method that decomposes disparities into three components: the explained disparity linked to differences in the exploratory variables' conditional distribution when the modifier distribution is identical between majority and minority groups, the explained disparity that emerges specifically from the unequal distribution of the modifier and its interaction with covariates, and the unexplained disparity. The proposed method offers a dynamic alternative to the traditional Peters-Belson decomposition approach, tackling both the potential reduction in disparity if the covariate distributions of minority groups matched those of the majority group and the evolving nature of disparity over time. We apply the proposed approach to a fetal growth study to gain insights into disparities between different race/ethnicity groups in fetal developmental progress throughout the course of pregnancy.
Submitted 17 April, 2024;
originally announced April 2024.
-
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
Authors:
Giwon Hong,
Aryo Pradipta Gema,
Rohit Saxena,
Xiaotang Du,
** Nie,
Yu Zhao,
Laura Perez-Beltrachini,
Max Ryabinin,
Xuanli He,
Clémentine Fourrier,
Pasquale Minervini
Abstract:
Large Language Models (LLMs) have transformed the Natural Language Processing (NLP) landscape with their remarkable ability to understand and generate human-like text. However, these models are prone to ``hallucinations'' -- outputs that do not align with factual reality or the input context. This paper introduces the Hallucinations Leaderboard, an open initiative to quantitatively measure and compare the tendency of each model to produce hallucinations. The leaderboard uses a comprehensive set of benchmarks focusing on different aspects of hallucinations, such as factuality and faithfulness, across various tasks, including question-answering, summarisation, and reading comprehension. Our analysis provides insights into the performance of different models, guiding researchers and practitioners in choosing the most reliable models for their applications.
Submitted 17 April, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Edinburgh Clinical NLP at SemEval-2024 Task 2: Fine-tune your model unless you have access to GPT-4
Authors:
Aryo Pradipta Gema,
Giwon Hong,
Pasquale Minervini,
Luke Daines,
Beatrice Alex
Abstract:
The NLI4CT task assesses Natural Language Inference systems in predicting whether hypotheses entail or contradict evidence from Clinical Trial Reports. In this study, we evaluate various Large Language Models (LLMs) with multiple strategies, including Chain-of-Thought, In-Context Learning, and Parameter-Efficient Fine-Tuning (PEFT). We propose a PEFT method to improve the consistency of LLMs by merging adapters that were fine-tuned separately using triplet and language modelling objectives. We found that merging the two PEFT adapters improves the F1 score (+0.0346) and consistency (+0.152) of the LLMs. However, our novel methods did not produce more accurate results than GPT-4 in terms of faithfulness and consistency. Averaging the three metrics, GPT-4 ranks joint-first in the competition with 0.8328. Finally, our contamination analysis with GPT-4 indicates that there was no test data leakage.
Submitted 30 March, 2024;
originally announced April 2024.
-
A noncommutative maximal inequality for Fejér means on totally disconnected non-abelian groups
Authors:
Fugui Ding,
Guixiang Hong,
Xumin Wang
Abstract:
In this paper, we explore Fourier analysis for noncommutative $L_p$ space-valued functions on $G$, where $G$ is a totally disconnected non-abelian compact group. By additionally assuming that the value of these functions remains invariant within each conjugacy class, we establish a noncommutative maximal inequality for Fejér means utilizing the associated character system of $G$. This is an operator-valued version of the classical result due to Gát. We essentially follow the classical sketch, but due to the noncommutativity, many classical arguments have to be revised. Notably, in contrast to the classical results, the bounds of our estimates are explicitly calculated.
Submitted 14 March, 2024;
originally announced March 2024.
-
Closing the AI generalization gap by adjusting for dermatology condition distribution differences across clinical settings
Authors:
Rajeev V. Rikhye,
Aaron Loh,
Grace Eunhae Hong,
Preeti Singh,
Margaret Ann Smith,
Vijaytha Muralidharan,
Doris Wong,
Rory Sayres,
Michelle Phung,
Nicolas Betancourt,
Bradley Fong,
Rachna Sahasrabudhe,
Khoban Nasim,
Alec Eschholz,
Basil Mustafa,
Jan Freyberg,
Terry Spitz,
Yossi Matias,
Greg S. Corrado,
Katherine Chou,
Dale R. Webster,
Peggy Bui,
Yuan Liu,
Yun Liu,
Justin Ko
, et al. (1 additional author not shown)
Abstract:
Recently, there has been great progress in the ability of artificial intelligence (AI) algorithms to classify dermatological conditions from clinical photographs. However, little is known about the robustness of these algorithms in real-world settings where several factors can lead to a loss of generalizability. Understanding and overcoming these limitations will permit the development of generalizable AI that can aid in the diagnosis of skin conditions across a variety of clinical settings. In this retrospective study, we demonstrate that differences in skin condition distribution, rather than in demographics or image capture mode are the main source of errors when an AI algorithm is evaluated on data from a previously unseen source. We demonstrate a series of steps to close this generalization gap, requiring progressively more information about the new source, ranging from the condition distribution to training data enriched for data less frequently seen during training. Our results also suggest comparable performance from end-to-end fine tuning versus fine tuning solely the classification layer on top of a frozen embedding model. Our approach can inform the adaptation of AI algorithms to new settings, based on the information and resources available.
Submitted 23 February, 2024;
originally announced February 2024.
-
Failure of almost uniformly convergence for noncommutative martingales
Authors:
Guixiang Hong,
Éric Ricard
Abstract:
In this paper, we provide a counterexample to show that in sharp contrast to the classical case, the almost uniform convergence may not happen for truly noncommutative $L_p$-martingales when $1\leq p<2$. The same happens to ergodic averages. The proof consists of some sharp estimates of the distributional function of a sequence of matrices and some non standard transference techniques, which might admit further applications.
Submitted 7 July, 2024; v1 submitted 13 February, 2024;
originally announced February 2024.
-
Best constants in the vector-valued Littlewood-Paley-Stein theory
Authors:
Guixiang Hong,
Zhendong Xu,
Hao Zhang
Abstract:
Let $L$ be a sectorial operator of type $α$ ($0 \leq α< π/2$) on $L^2(\mathbb{R}^d)$ with the kernels of $\{e^{-tL}\}_{t>0}$ satisfying certain size and regularity conditions. Define $$ S_{q,L}(f)(x) = \left(\int_0^{\infty}\int_{|y-x| < t} \|tL{e^{-tL}} (f)(y) \|_X^q \,\frac{{\rm d} y{\rm d} t}{t^{d+1}} \right)^{\frac{1}{q}},$$ $$G_{q,{L}}(f)=\left( \int_0^{\infty} \left\|t{L}{e^{-t{L}}} (f)(y) \right\|_X^q \,\frac{{\rm d} t}{t}\right)^{\frac{1}{q}}.$$ We show that for $\underline{\mathrm{any}}$ Banach space $X$, $1 \leq p < \infty$ and $1 < q < \infty$ and $f\in C_c(\mathbb R^d)\otimes X$, there hold \begin{align*}
p^{-\frac{1}{q}}\| S_{q,{\sqrt{Δ}}}(f) \|_p \lesssim_{d, γ, β} \| S_{q,L}(f) \|_p \lesssim_{d, γ, β} p^{\frac{1}{q}}\| S_{q,{\sqrt{Δ}}}(f) \|_p,
\end{align*}
\begin{align*}
p^{-\frac{1}{q}}\| S_{q,L}(f) \|_p \lesssim_{d, γ, β} \| G_{q,L}(f) \|_p \lesssim_{d, γ, β} p^{\frac{1}{q}}\| S_{q,L}(f) \|_p, \end{align*} where $Δ$ is the standard Laplacian; moreover, all the orders appearing above are optimal as $p\rightarrow1$. This, combined with the existing results in [29, 33], allows us to partially resolve Problem 1.8, Problem A.1 and Conjecture A.4 regarding the optimal Lusin type constant and the characterization of martingale type in a recent remarkable work due to Xu [48].
Several difficulties originate from the arbitrariness of $X$, which excludes the use of vector-valued Calderón-Zygmund theory. To surmount the obstacles, we introduce the novel vector-valued Hardy and BMO spaces associated with sectorial operators; in addition to Mei's duality techniques and Wilson's intrinsic square functions developed in this setting, the key new input is the vector-valued tent space theory and its unexpected amalgamation with these `old' techniques.
Submitted 24 January, 2024;
originally announced January 2024.
-
Three term rational function progressions in finite fields
Authors:
Guo-Dong Hong,
Zi Li Lim
Abstract:
Let $F(t),G(t)\in \mathbb{Q}(t)$ be rational functions such that $F(t),G(t)$ and the constant function $1$ are linearly independent over $\mathbb{Q}$. We prove an asymptotic formula for the number of three-term rational function progressions of the form $x,x+F(y),x+G(y)$ in subsets of $\mathbb{F}_p$. The main new ingredient is an algebraic geometry version of PET induction that bypasses Weyl's differencing. This answers a question of Bourgain and Chang.
Submitted 2 January, 2024;
originally announced January 2024.
-
Operator-Valued Hardy spaces and BMO Spaces on Spaces of Homogeneous Type
Authors:
Zhijie Fan,
Guixiang Hong,
Wenhua Wang
Abstract:
Let $\mathcal{M}$ be a von Neumann algebra equipped with a normal semifinite faithful trace, $(\mathbb{X},\,d,\,μ)$ be a space of homogeneous type in the sense of Coifman and Weiss, and $\mathcal{N}=L_\infty(\mathbb{X})\overline{\otimes}\mathcal{M}$. In this paper, we introduce and then conduct a systematic study of the operator-valued Hardy space $\mathcal{H}_p(\mathbb{X},\,\mathcal{M})$ for all $1\leq p<\infty$ and the operator-valued BMO space $\mathcal{BMO}(\mathbb{X},\,\mathcal{M})$. The main results of this paper include the $H_1$--$BMO$ duality theorem, the atomic decomposition of $\mathcal{H}_1(\mathbb{X},\,\mathcal{M})$, interpolation between these Hardy spaces and BMO spaces, and the equivalence between mixture Hardy spaces and $L_p$-spaces. In particular, without the use of non-commutative martingale theory as in Mei's seminal work \cite{m07}, we provide a direct proof for the interpolation theory. Moreover, under our assumption on the Calderón representation formula, these results are new even in the commutative setting for spaces of homogeneous type that fail to satisfy the reverse doubling condition. As an application, we obtain the $L_p(\mathcal{N})$-boundedness of operator-valued Calderón-Zygmund operators.
Submitted 27 November, 2023;
originally announced November 2023.
-
Steady-State Analysis and Online Learning for Queues with Hawkes Arrivals
Authors:
Xinyun Chen,
Guiyu Hong
Abstract:
We investigate the long-run behavior of single-server queues with Hawkes arrivals and general service distributions, and related optimization problems. Specifically, utilizing novel coupling techniques, we establish finite moment bounds for the stationary distribution of the workload and busy period processes. In addition, we show that these queueing processes converge exponentially fast to their stationary distribution. Based on these theoretical results, we develop an efficient numerical algorithm to solve the optimal staffing problem for the Hawkes queues in a data-driven manner. Numerical results indicate a sharp difference in staffing for Hawkes queues, compared to the classic GI/GI/1 model, especially in the heavy-traffic regime.
Submitted 13 November, 2023; v1 submitted 5 November, 2023;
originally announced November 2023.
-
A Representation of Matrix-Valued Harmonic Functions by the Poisson Integral of Non-commutative BMO Functions
Authors:
Cheng Chen,
Guixiang Hong,
Wenhua Wang
Abstract:
In this paper, the authors study the matrix-valued harmonic functions and characterize them by the Poisson integral of functions in non-commutative BMO (bounded mean oscillation) spaces. This provides a very satisfactory non-commutative analogue of the beautiful result due to Fabes, Johnson and Neri [Indiana Univ. Math. J. 25 (1976) 159-170; MR0394172].
Submitted 20 September, 2023;
originally announced September 2023.
-
On the Splash Singularity for the free-boundary problem of the viscous and non-resistive incompressible magnetohydrodynamic equations in 3D
Authors:
Guangyi Hong,
Tao Luo,
Zhonghao Zhao
Abstract:
In this paper, the existence of a finite-time splash singularity is proved for the free-boundary problem of the viscous and non-resistive incompressible magnetohydrodynamic (MHD) equations in $ \mathbb{R}^{3}$, based on a construction of a sequence of initial data alongside delicate estimates of the solutions. The result and analysis in this paper generalize those by Coutand and Shkoller in [14, Ann. Inst. H. Poincaré C Anal. Non Linéaire, 2019] from viscous surface waves to viscous conducting fluids with magnetic effects, for which non-trivial magnetic fields may be present on the free boundary. The arguments in this paper also hold for any space dimension $d\ge 2$.
Submitted 18 September, 2023;
originally announced September 2023.
-
Disorder-induced linear magnetoresistance in Al$_2$O$_3$/SrTiO$_3$ heterostructures
Authors:
Gao Kuang Hong,
Lin Tie,
Ma Xiao Rong,
Li Qiu Lin,
Li Zhi Qing
Abstract:
Unsaturated linear magnetoresistance (LMR) has attracted wide attention because of its potential applications and fundamental interest. By controlling the growth temperature, we realized a metal-to-insulator transition in Al$_2$O$_3$/SrTiO$_3$ heterostructures. The LMR is observed in metallic samples with electron mobility varying over three orders of magnitude. The observed LMR cannot be explained by the guiding center diffusion model, even in samples with very high mobility. The slope of the observed LMR is proportional to the Hall mobility, and the crossover field, indicating a transition from quadratic (at low fields) to linear (at high fields) field dependence, is proportional to the inverse Hall mobility. This signifies that the classical model is valid to explain the observed LMR. More importantly, we develop an analytical expression according to the effective-medium theory that is equivalent to the classical model. This analytical expression describes the LMR data very well, confirming the validity of the classical model.
Submitted 6 January, 2024; v1 submitted 19 August, 2023;
originally announced August 2023.
-
Disposable Transfer Learning for Selective Source Task Unlearning
Authors:
Seunghee Koh,
Hyounguk Shon,
Janghyeon Lee,
Hyeong Gwon Hong,
Junmo Kim
Abstract:
Transfer learning is widely used for training deep neural networks (DNN) for building a powerful representation. Even after the pre-trained model is adapted for the target task, the representation performance of the feature extractor is retained to some extent. As the performance of the pre-trained model can be considered the private property of the owner, it is natural to seek the exclusive right of the generalized performance of the pre-trained weight. To address this issue, we suggest a new paradigm of transfer learning called disposable transfer learning (DTL), which disposes of only the source task without degrading the performance of the target task. To achieve knowledge disposal, we propose a novel loss named Gradient Collision loss (GC loss). GC loss selectively unlearns the source knowledge by leading the gradient vectors of mini-batches in different directions. Whether the model successfully unlearns the source task is measured by piggyback learning accuracy (PL accuracy). PL accuracy estimates the vulnerability of knowledge leakage by retraining the scrubbed model on a subset of source data or new downstream data. We demonstrate that GC loss is an effective approach to the DTL problem by showing that the model trained with GC loss retains the performance on the target task with a significantly reduced PL accuracy.
Submitted 19 August, 2023;
originally announced August 2023.
-
Varying-coefficients for regional quantile via KNN-based LASSO with applications to health outcome study
Authors:
Seyoung Park,
Eun Ryung Lee,
Hyokyoung G. Hong
Abstract:
Health outcomes, such as body mass index and cholesterol levels, are known to be dependent on age and exhibit varying effects with their associated risk factors. In this paper, we propose a novel framework for dynamic modeling of the associations between health outcomes and risk factors using varying-coefficients (VC) regional quantile regression via K-nearest neighbors (KNN) fused Lasso, which captures the time-varying effects of age. The proposed method has strong theoretical properties, including a tight estimation error bound and the ability to detect exact clustered patterns under certain regularity conditions. To efficiently solve the resulting optimization problem, we develop an alternating direction method of multipliers (ADMM) algorithm. Our empirical results demonstrate the efficacy of the proposed method in capturing the complex age-dependent associations between health outcomes and their risk factors.
Submitted 8 August, 2023;
originally announced August 2023.
-
Localization using Multi-Focal Spatial Attention for Masked Face Recognition
Authors:
Yooshin Cho,
Hanbyel Cho,
Hyeong Gwon Hong,
Jaesung Ahn,
Dongmin Cho,
JungWoo Chang,
Junmo Kim
Abstract:
Since the beginning of the worldwide COVID-19 pandemic, facial masks have been recommended to limit the spread of the disease. However, these masks hide certain facial attributes. Hence, it has become difficult for existing face recognition systems to perform identity verification on masked faces. In this context, it is necessary to develop Masked Face Recognition (MFR) for contactless biometric recognition systems. Thus, in this paper, we propose Complementary Attention Learning and Multi-Focal Spatial Attention, which precisely remove the masked region by training complementary spatial attention to focus on two distinct regions: masked regions and backgrounds. In our method, standard spatial attention and networks focus on unmasked regions and extract mask-invariant features while minimizing the loss of conventional Face Recognition (FR) performance. For conventional FR, we evaluate the performance on the IJB-C, Age-DB, CALFW, and CPLFW datasets. We evaluate the MFR performance on the ICCV2021-MFR/Insightface track, and demonstrate improved performance on both the MFR and FR datasets. Additionally, we empirically verify that the spatial attention of the proposed method is more precisely activated in unmasked regions.
Submitted 7 September, 2023; v1 submitted 3 May, 2023;
originally announced May 2023.
-
Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual Noise
Authors:
Giwon Hong,
Jeonghwan Kim,
Junmo Kang,
Sung-Hyon Myaeng,
Joyce Jiyoung Whang
Abstract:
Most existing retrieval-augmented language models (LMs) assume a naive dichotomy within a retrieved document set: query-relevance and irrelevance. Our work investigates a more challenging scenario in which even the "relevant" documents may contain misleading or incorrect information, causing conflict among the retrieved documents and thereby negatively influencing model decisions as noise. We observe that existing LMs are highly brittle to the presence of conflicting information in both the fine-tuning and in-context few-shot learning scenarios. We propose approaches for handling knowledge conflicts among retrieved documents by explicitly fine-tuning a discriminator or prompting GPT-3.5 to elicit its discriminative capability. Our empirical results on open-domain QA show that these approaches significantly enhance model robustness. We also provide our findings on incorporating the fine-tuned discriminator's decision into the in-context learning process, proposing a way to exploit the benefits of two disparate learning schemes. Alongside our findings, we provide MacNoise, a machine-generated, conflict-induced dataset to further encourage research in this direction.
Submitted 9 June, 2024; v1 submitted 2 May, 2023;
originally announced May 2023.
-
Towards Understanding the Effect of Pretraining Label Granularity
Authors:
Guan Zhe Hong,
Yin Cui,
Ariel Fuxman,
Stanley H. Chan,
Enming Luo
Abstract:
In this paper, we study how the granularity of pretraining labels affects the generalization of deep neural networks in image classification tasks. We focus on the "fine-to-coarse" transfer learning setting, where the pretraining label space is more fine-grained than that of the target problem. Empirically, we show that pretraining on the leaf labels of ImageNet21k produces better transfer results on ImageNet1k than pretraining on other coarser granularity levels, which supports the common practice used in the community. Theoretically, we explain the benefit of fine-grained pretraining by proving that, for a data distribution satisfying certain hierarchy conditions, 1) coarse-grained pretraining only allows a neural network to learn the "common" or "easy-to-learn" features well, while 2) fine-grained pretraining helps the network learn the "rarer" or "fine-grained" features in addition to the common ones, thus improving its accuracy on hard downstream test samples in which common features are missing or weak in strength. Furthermore, we perform comprehensive experiments using the label hierarchies of iNaturalist 2021 and observe that the following conditions, in addition to proper choice of label granularity, enable the transfer to work well in practice: 1) the pretraining dataset needs to have a meaningful label hierarchy, and 2) the pretraining and target label functions need to align well.
Submitted 5 October, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
Online Learning and Optimization for Queues with Unknown Demand Curve and Service Distribution
Authors:
Xinyun Chen,
Yunan Liu,
Guiyu Hong
Abstract:
We investigate an optimization problem in a queueing system where the service provider selects the optimal service fee p and service capacity μ to maximize the cumulative expected profit (the service revenue minus the capacity cost and delay penalty). The conventional predict-then-optimize (PTO) approach takes two steps: first, it estimates the model parameters (e.g., arrival rate and service-time distribution) from data; second, it optimizes a model based on the estimated parameters. A major drawback of PTO is that its solution accuracy can often be highly sensitive to the parameter estimation errors because PTO is unable to properly link these errors (step 1) to the quality of the optimized solutions (step 2). To remedy this issue, we develop an online learning framework that automatically incorporates the aforementioned parameter estimation errors in the solution prescription process; it is an integrated method that can "learn" the optimal solution without needing to set up the parameter estimation as a separate step as in PTO. The effectiveness of our online learning approach is substantiated by (i) theoretical results including the algorithm convergence and analysis of the regret ("cost" to pay over time for the algorithm to learn the optimal policy), and (ii) engineering confirmation via simulation experiments of a variety of representative examples. We also provide careful comparisons for PTO and the online learning method.
Submitted 6 March, 2023;
originally announced March 2023.
-
Sharp endpoint $L_p$ estimates of quantum Schrödinger groups
Authors:
Zhijie Fan,
Guixiang Hong,
Liang Wang
Abstract:
In this article, we establish sharp endpoint $L_p$ estimates of Schrödinger groups on general measure spaces which may not be equipped with good metrics but admit submarkovian semigroups satisfying purely algebraic assumptions. One of the key ingredients of our proof is to introduce and investigate a new noncommutative high-cancellation BMO space by constructing an abstract form of P-metric codifying some sort of underlying metric and position. This provides the first form of Schrödinger group theory on arbitrary von Neumann algebras and can be applied to many models, including Schrödinger groups associated with non-negative self-adjoint operators satisfying purely Gaussian upper bounds on doubling metric spaces, standard Schrödinger groups on quantum Euclidean spaces, matrix algebras and group von Neumann algebras with finite dimensional cocycles.
Submitted 1 February, 2023;
originally announced February 2023.
-
Quantitative mean ergodic inequalities: power bounded operators acting on one single noncommutative $L_p$ space
Authors:
Guixiang Hong,
Wei Liu,
Bang Xu
Abstract:
In this paper, we establish the quantitative mean ergodic theorems for two subclasses of power bounded operators on a fixed noncommutative $L_p$-space with $1<p<\infty$, which mainly concerns power bounded invertible operators and Lamperti contractions. Our approach to the quantitative ergodic theorems is the noncommutative square function inequalities. The establishment of the latter involves several new ingredients such as the almost orthogonality and Calderón-Zygmund arguments for non-smooth kernels from semi-commutative harmonic analysis, the extension properties of the operators under consideration from operator theory, and a noncommutative version of the classical transference method due to Coifman and Weiss.
Submitted 30 March, 2023; v1 submitted 31 December, 2022;
originally announced January 2023.
-
Noncommutative maximal strong $L_p$ estimates of Calderón-Zygmund operators
Authors:
Guixiang Hong,
Xudong Lai,
Samya Kumar Ray,
Bang Xu
Abstract:
In this paper, we obtain the desired noncommutative maximal inequalities of the truncated Calderón-Zygmund operators of non-convolution type acting on operator-valued $L_p$-functions for all $1<p<\infty$, answering a question left open in the previous work \cite{HLX}.
Submitted 26 December, 2022;
originally announced December 2022.
-
Digital Pixel Test Structures implemented in a 65 nm CMOS process
Authors:
Gianluca Aglieri Rinella,
Anton Andronic,
Matias Antonelli,
Mauro Aresti,
Roberto Baccomi,
Pascal Becht,
Stefania Beole,
Justus Braach,
Matthew Daniel Buckland,
Eric Buschmann,
Paolo Camerini,
Francesca Carnesecchi,
Leonardo Cecconi,
Edoardo Charbon,
Giacomo Contin,
Dominik Dannheim,
Joao de Melo,
Wen**g Deng,
Antonello di Mauro,
Jan Hasenbichler,
Hartmut Hillemanns,
Geun Hee Hong,
Artem Isakov,
Antoine Junique,
Alex Kluge, et al. (27 additional authors not shown)
Abstract:
The ALICE ITS3 (Inner Tracking System 3) upgrade project and the CERN EP R&D on monolithic pixel sensors are investigating the feasibility of the Tower Partners Semiconductor Co. 65 nm process for use in the next generation of vertex detectors. The ITS3 aims to employ wafer-scale Monolithic Active Pixel Sensors thinned down to 20 to 40 μm and bent to form truly cylindrical half barrels. Among the first critical steps towards the realisation of this detector is to validate the sensor technology through extensive characterisation both in the laboratory and with in-beam measurements. The Digital Pixel Test Structure (DPTS) is one of the prototypes produced in the first sensor submission in this technology and has undergone a systematic measurement campaign whose details are presented in this article.
The results confirm the goals of detection efficiency and non-ionising and ionising radiation hardness up to the expected levels for ALICE ITS3 and also demonstrate operation at +20 °C and a detection efficiency of 99% for a DPTS irradiated with a dose of $10^{15}$ 1 MeV n$_{\mathrm{eq}}/$cm$^2$. Furthermore, spatial, timing and energy resolutions were measured at various settings and irradiation levels.
Submitted 10 July, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Data Poisoning Attack Aiming the Vulnerability of Continual Learning
Authors:
Gyo** Han,
Jaehyun Choi,
Hyeong Gwon Hong,
Junmo Kim
Abstract:
Generally, regularization-based continual learning models limit access to the previous task data to imitate the real-world constraints related to memory and privacy. However, this introduces a problem in these models by not being able to track the performance on each task. In essence, current continual learning methods are susceptible to attacks on previous tasks. We demonstrate the vulnerability of regularization-based continual learning methods by presenting a simple task-specific data poisoning attack that can be used in the learning process of a new task. Training data generated by the proposed attack causes performance degradation on a specific task targeted by the attacker. We experiment with the attack on two representative regularization-based continual learning methods, Elastic Weight Consolidation (EWC) and Synaptic Intelligence (SI), trained with variants of the MNIST dataset. The experimental results substantiate the vulnerability identified in this paper and demonstrate the importance of developing continual learning models that are robust to adversarial attacks.
Submitted 3 July, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
The Kriston AI System for the VoxCeleb Speaker Recognition Challenge 2022
Authors:
Qutang Cai,
Guoqiang Hong,
Zhijian Ye,
Ximin Li,
Haizhou Li
Abstract:
This technical report describes our system for tracks 1, 2 and 4 of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22). By combining several ResNet variants, our submission for track 1 attained a minDCF of 0.090 with EER 1.401%. By further incorporating three fine-tuned pre-trained models, our submission for track 2 achieved a minDCF of 0.072 with EER 1.119%. For track 4, our system consisted of voice activity detection (VAD), speaker embedding extraction, agglomerative hierarchical clustering (AHC) followed by a re-clustering step based on a Bayesian hidden Markov model and overlapped speech detection and handling. Our submission for track 4 achieved a diarisation error rate (DER) of 4.86%. The submissions all ranked second in their corresponding tracks.
Submitted 23 September, 2022;
originally announced September 2022.
-
Fourier restriction estimates on quantum Euclidean spaces
Authors:
Guixiang Hong,
Xudong Lai,
Liang Wang
Abstract:
In this paper, we initiate the study of the Fourier restriction phenomena on quantum Euclidean spaces, and establish the analogues of the Tomas-Stein restriction theorem and the two-dimensional full restriction theorem.
Submitted 4 September, 2022;
originally announced September 2022.
-
Rethinking Efficacy of Softmax for Lightweight Non-Local Neural Networks
Authors:
Yooshin Cho,
Youngsoo Kim,
Hanbyel Cho,
Jaesung Ahn,
Hyeong Gwon Hong,
Junmo Kim
Abstract:
The non-local (NL) block is a popular module that demonstrates the capability to model global contexts. However, the NL block generally has heavy computation and memory costs, so it is impractical to apply the block to high-resolution feature maps. In this paper, to investigate the efficacy of the NL block, we empirically analyze whether the magnitude and direction of input feature vectors properly affect the attention between vectors. The results show the inefficacy of the softmax operation, which is generally used to normalize the attention map of the NL block. Attention maps normalized with the softmax operation rely heavily upon the magnitude of key vectors, and performance degrades if the magnitude information is removed. By replacing the softmax operation with a scaling factor, we demonstrate improved performance on CIFAR-10, CIFAR-100, and Tiny-ImageNet. In addition, our method shows robustness to embedding channel reduction and embedding weight initialization. Notably, our method makes multi-head attention employable without additional computational cost.
Submitted 27 July, 2022;
originally announced July 2022.
-
On Isometric Embeddability of $S_q^m$ into $S_p^n$ as non-commutative Quasi-Banach space
Authors:
Arup Chattopadhyay,
Guixiang Hong,
Chandan Pradhan,
Samya Kumar Ray
Abstract:
The existence of isometric embedding of $S_q^m$ into $S_p^n$, where $1\leq p\neq q\leq \infty$ and $m,n\geq 2$ has been recently studied in \cite{JFA22}. In this article, we extend the study of isometric embeddability beyond the above mentioned range of $p$ and $q$. More precisely, we show that there is no isometric embedding of the commutative quasi-Banach space $\ell_q^m(\R)$ into $\ell_p^n(\R)$, where $(q,p)\in (0,\infty)\times (0,1)$ and $p\neq q$. As non-commutative quasi-Banach spaces, we show that there is no isometric embedding of $S_q^m$ into $S_p^n$, where $(q,p)\in (0,2)\setminus \{1\}\times (0,1)$ $\cup\, \{1\}\times (0,1)\setminus \{\frac{1}{n}:n\in\mathbb{N}\}$ $\cup\, \{\infty\}\times (0,1)\setminus \{\frac{1}{n}:n\in\mathbb{N}\}$ and $p\neq q$. Moreover, in some restrictive cases, we also show that there is no isometric embedding of $S_q^m$ into $S_p^n$, where $(q,p)\in [2, \infty)\times (0,1)$. A new tool in our paper is the non-commutative Clarkson's inequality for Schatten class operators. Other tools involved are the Kato-Rellich theorem and multiple operator integrals in perturbation theory, followed by intricate computations involving power-series analysis.
Submitted 1 June, 2023; v1 submitted 19 July, 2022;
originally announced July 2022.
-
Tail Quantile Estimation for Non-preemptive Priority Queues
Authors:
** Guang,
Guiyu Hong,
Xinyun Chen,
Xi Peng,
Li Chen,
Bo Bai,
Gong Zhang
Abstract:
Motivated by applications in computing and telecommunication systems, we investigate the problem of estimating the p-quantile of steady-state sojourn times in a single-server multi-class queueing system with non-preemptive priorities for p close to 1. The main challenge in this problem lies in efficient sampling from the tail event. To address this issue, we develop a regenerative simulation algorithm with importance sampling. In addition, we establish a central limit theorem for the estimator to construct the confidence interval. Numerical experiments show that our algorithm outperforms benchmark simulation methods. Our result contributes to the literature on rare event simulation for queueing systems.
Submitted 8 July, 2022;
originally announced July 2022.
-
John-Nirenberg inequalities for noncommutative column BMO and Lipschitz martingales
Authors:
Guixiang Hong,
Congbian Ma,
Yu Wang
Abstract:
In this paper, we continue the study of John-Nirenberg theorems for BMO/Lipschitz spaces in the noncommutative martingale setting. As conjectured from the classical case, a desired noncommutative ``stopping time'' argument was discovered to obtain the distribution function inequality form of the John-Nirenberg theorem. This not only provides another approach without using duality and interpolation to the results for spaces $\mathsf{bmo}^c(\mathcal M)$ and ${\Lambda^{c}_{\beta}}(\mathcal{M})$, but also allows us to find the desired version of John-Nirenberg inequalities for spaces $\mathcal{BMO}^c(\mathcal M)$ and ${\mathcal L^{c}_{\beta}}(\mathcal{M})$. And thus we solve two open questions after \cite{ref5, ref3}. As an application, we show that Lipschitz space is also the dual space of noncommutative Hardy space defined via symmetric atoms. Finally, our results for ${\mathcal L^{c}_{\beta}}(\mathcal{M})$ as well as the approach seem new even going back to the classical setting.
Submitted 20 May, 2023; v1 submitted 25 January, 2022;
originally announced January 2022.
-
The Group Action Method and Radial Projection
Authors:
Guo-Dong Hong,
Chun-Yen Shen
Abstract:
The group action methods have been playing an important role in recent studies about the configuration problems inside a compact set $E$ in Euclidean spaces with given Hausdorff dimension. In this paper, we further explore the group action methods to study the radial projection problems for Salem sets.
Submitted 24 November, 2021;
originally announced November 2021.
-
Post-Treatment Confounding in Causal Mediation Studies: A Cutting-Edge Problem and A Novel Solution via Sensitivity Analysis
Authors:
Guanglei Hong,
Fan Yang,
Xu Qin
Abstract:
In causal mediation studies that decompose an average treatment effect into a natural indirect effect (NIE) and a natural direct effect (NDE), examples of post-treatment confounding are abundant. Past research has generally considered it infeasible to adjust for a post-treatment confounder of the mediator-outcome relationship due to incomplete information: it is observed under the actual treatment condition while missing under the counterfactual treatment condition. This study proposes a new sensitivity analysis strategy for handling post-treatment confounding and incorporates it into weighting-based causal mediation analysis without making extra identification assumptions. Under the sequential ignorability of the treatment assignment and of the mediator, we obtain the conditional distribution of the post-treatment confounder under the counterfactual treatment as a function of not just pretreatment covariates but also its counterpart under the actual treatment. The sensitivity analysis then generates a bound for the NIE and that for the NDE over a plausible range of the conditional correlation between the post-treatment confounder under the actual and that under the counterfactual conditions. Implemented through either imputation or integration, the strategy is suitable for binary as well as continuous measures of post-treatment confounders. Simulation results demonstrate major strengths and potential limitations of this new solution. A re-analysis of the National Evaluation of Welfare-to-Work Strategies (NEWWS) Riverside data reveals that the initial analytic results are sensitive to omitted post-treatment confounding.
Submitted 22 July, 2021;
originally announced July 2021.
-
Inference for High Dimensional Censored Quantile Regression
Authors:
Zhe Fei,
Qi Zheng,
Hyokyoung G. Hong,
Yi Li
Abstract:
With the availability of high dimensional genetic biomarkers, it is of interest to identify heterogeneous effects of these predictors on patients' survival, along with proper statistical inference. Censored quantile regression has emerged as a powerful tool for detecting heterogeneous effects of covariates on survival outcomes. To our knowledge, there is little work available to draw inference on the effects of high dimensional predictors for censored quantile regression. This paper proposes a novel procedure to draw inference on all predictors within the framework of global censored quantile regression, which investigates covariate-response associations over an interval of quantile levels, instead of a few discrete values. The proposed estimator combines a sequence of low dimensional model estimates that are based on multi-sample splittings and variable selection. We show that, under some regularity conditions, the estimator is consistent and asymptotically follows a Gaussian process indexed by the quantile level. Simulation studies indicate that our procedure can properly quantify the uncertainty of the estimates in high dimensional settings. We apply our method to analyze the heterogeneous effects of SNPs residing in lung cancer pathways on patients' survival, using the Boston Lung Cancer Survival Cohort, a cancer epidemiology study on the molecular mechanism of lung cancer.
Submitted 22 July, 2021;
originally announced July 2021.
-
Magnetic properties of the quasi two-dimensional centered honeycomb antiferromagnet GdInO$_3$
Authors:
Xunqing Yin,
Yunlong Li,
Guohua Wang,
Jiayuan Hu,
Chenhang Xu,
Qi Lu,
Yunlei Zhong,
Jiawang Zhao,
Xiang Zhao,
Yuanlei Zhang,
Yiming Cao,
Kun Xu,
Zhe Li,
Yoshitomo Kamiya,
Guo Hong,
Dong Qian
Abstract:
The crystal structure and magnetic properties of the single-crystalline hexagonal rare-earth indium oxide GdInO$_3$ have been studied by combining experiments and model calculations. The two inequivalent Gd$^{3+}$ ions form the centered honeycomb lattice, which consists of honeycomb and triangular sublattices. The dc magnetic susceptibility and specific heat measurements suggest two antiferromagnetic phase transitions at $T_\textrm{N1}$ = 2.3 K and $T_\textrm{N2}$ = 1.02 K. An inflection point is observed in the isothermal magnetization curve, which implies an up-up-down phase with a 1/3 magnetization plateau. We also observe a large magnetic entropy change originating from the magnetic frustration in GdInO$_3$. By considering a classical spin Hamiltonian, we establish the ground-state phase diagram, which suggests that GdInO$_3$ has a weak easy-axis anisotropy and is close to the equilateral triangular-lattice system. The theoretical ground-state phase diagram may be used as a reference in NMR, ESR, or $\mu$SR experiments in the future.
Submitted 6 September, 2021; v1 submitted 9 June, 2021;
originally announced June 2021.
-
First demonstration of in-beam performance of bent Monolithic Active Pixel Sensors
Authors:
ALICE ITS project:
G. Aglieri Rinella,
M. Agnello,
B. Alessandro,
F. Agnese,
R. S. Akram,
J. Alme,
E. Anderssen,
D. Andreou,
F. Antinori,
N. Apadula,
P. Atkinson,
R. Baccomi,
A. Badalà,
A. Balbino,
C. Bartels,
R. Barthel,
F. Baruffaldi,
I. Belikov,
S. Beole,
P. Becht,
A. Bhatti,
M. Bhopal,
N. Bianchi, et al. (230 additional authors not shown)
Abstract:
A novel approach for designing the next generation of vertex detectors foresees to employ wafer-scale sensors that can be bent to truly cylindrical geometries after thinning them to thicknesses of 20-40$μ$m. To solidify this concept, the feasibility of operating bent MAPS was demonstrated using 1.5$\times$3cm ALPIDE chips. Already with their thickness of 50$μ$m, they can be successfully bent to radii of about 2cm without any signs of mechanical or electrical damage. During a subsequent characterisation using a 5.4GeV electron beam, it was further confirmed that they preserve their full electrical functionality as well as particle detection performance.
In this article, the bending procedure and the setup used for characterisation are detailed. Furthermore, the analysis of the beam test, including the measurement of the detection efficiency as a function of beam position and local inclination angle, is discussed. The results show that the sensors maintain their excellent performance after bending to radii of 2cm, with detection efficiencies above 99.9% at typical operating conditions, paving the way towards a new class of detectors with unprecedented low material budget and ideal geometrical properties.
Submitted 17 August, 2021; v1 submitted 27 May, 2021;
originally announced May 2021.
-
Ultra-High Dimensional Sparse Representations with Binarization for Efficient Text Retrieval
Authors:
Kyoung-Rok Jang,
Junmo Kang,
Giwon Hong,
Sung-Hyon Myaeng,
Joohee Park,
Taewon Yoon,
Heecheol Seo
Abstract:
The semantic matching capabilities of neural information retrieval can ameliorate synonymy and polysemy problems of symbolic approaches. However, neural models' dense representations are more suitable for re-ranking, due to their inefficiency. Sparse representations, either in symbolic or latent form, are more efficient with an inverted index. Taking the merits of the sparse and dense representations, we propose an ultra-high dimensional (UHD) representation scheme equipped with directly controllable sparsity. UHD's large capacity and minimal noise and interference among the dimensions allow for binarized representations, which are highly efficient for storage and search. Also proposed is a bucketing method, where the embeddings from multiple layers of BERT are selected/merged to represent diverse linguistic aspects. We test our models with MS MARCO and TREC CAR, showing that our models outperform other sparse models.
Submitted 15 October, 2021; v1 submitted 14 April, 2021;
originally announced April 2021.
-
Quantitative ergodic theorems for actions of groups of polynomial growth
Authors:
Guixiang Hong,
Wei Liu
Abstract:
We strengthen the maximal ergodic theorem for actions of groups of polynomial growth to a form involving the jump quantity, which is the sharpest result among the family of variational or maximal ergodic theorems. As a consequence, we deduce in this setting the quantitative ergodic theorem, in particular, the upcrossing inequalities with exponential decay. The techniques involve probability theory, non-doubling Calderón-Zygmund theory, an almost orthogonality argument and a delicate geometric argument involving the balls and the cubes on the group equipped with a not necessarily doubling measure.
Submitted 6 April, 2021;
originally announced April 2021.
-
Student-Teacher Learning from Clean Inputs to Noisy Inputs
Authors:
Guanzhe Hong,
Zhiyuan Mao,
Xiaojun Lin,
Stanley H. Chan
Abstract:
Feature-based student-teacher learning, a training method that encourages the student's hidden features to mimic those of the teacher network, is empirically successful in transferring the knowledge from a pre-trained teacher network to the student network. Furthermore, recent empirical results demonstrate that the teacher's features can boost the student network's generalization even when the student's input sample is corrupted by noise. However, there is a lack of theoretical insight into why and when this method of transferring knowledge can be successful between such heterogeneous tasks. We analyze this method theoretically using deep linear networks, and experimentally using nonlinear networks. We identify three factors vital to the success of the method: (1) whether the student is trained to zero training loss; (2) how knowledgeable the teacher is on the clean-input problem; (3) how the teacher decomposes its knowledge in its hidden features. Lack of proper control in any of the three factors leads to failure of the student-teacher learning method.
Submitted 12 March, 2021;
originally announced March 2021.
-
Asymptotic stability of exogenous chemotaxis systems with physical boundary conditions
Authors:
Guangyi Hong,
Zhian Wang
Abstract:
In this paper, we consider the exogenous chemotaxis system with physical mixed zero-flux and Dirichlet boundary conditions in one dimension. Since the Dirichlet boundary condition cannot contribute the necessary estimates for the cross-diffusion structure in the system, the global-in-time existence and asymptotic behavior of solutions have remained open to date. In this paper, we overcome this difficulty by employing the technique of taking anti-derivatives so that the Dirichlet boundary condition can be fully used, and show that the system admits global strong solutions which exponentially stabilize to the unique stationary solution as time tends to infinity under suitably small perturbations. To the best of our knowledge, this is the first result obtained on the global well-posedness and asymptotic behavior of solutions to the exogenous chemotaxis system with physical boundary conditions.
Submitted 18 January, 2021;
originally announced January 2021.
-
Stay Connected, Leave no Trace: Enhancing Security and Privacy in WiFi via Obfuscating Radiometric Fingerprints
Authors:
Luis F. Abanto-Leon,
Andreas Baeuml,
Gek Hong Sim,
Matthias Hollick,
Arash Asadi
Abstract:
The intrinsic hardware imperfection of WiFi chipsets manifests itself in the transmitted signal, leading to a unique radiometric fingerprint. This fingerprint can be used as an additional means of authentication to enhance security. In fact, recent works propose practical fingerprinting solutions that can be readily implemented in commercial-off-the-shelf devices. In this paper, we prove analytically and experimentally that these solutions are highly vulnerable to impersonation attacks. We also demonstrate that such a unique device-based signature can be abused to violate privacy by tracking the user device, and, as of today, users do not have any means to prevent such privacy attacks other than turning off the device.
We propose RF-Veil, a radiometric fingerprinting solution that not only is robust against impersonation attacks but also protects user privacy by obfuscating the radiometric fingerprint of the transmitter for non-legitimate receivers. Specifically, we introduce a randomized pattern of phase errors to the transmitted signal such that only the intended receiver can extract the original fingerprint of the transmitter. In a series of experiments and analyses, we expose the vulnerability of adopting naive randomization to statistical attacks and introduce countermeasures. Finally, we show the efficacy of RF-Veil experimentally in protecting user privacy and enhancing security. More importantly, our proposed solution allows communicating with other devices, which do not employ RF-Veil.
Submitted 27 November, 2020; v1 submitted 25 November, 2020;
originally announced November 2020.
-
Nonlinear stability of phase transition steady states to a hyperbolic-parabolic system modelling vascular networks
Authors:
Guangyi Hong,
Hongyun Peng,
Zhi-An Wang,
Changjiang Zhu
Abstract:
This paper is concerned with the existence and stability of phase transition steady states to a quasi-linear hyperbolic-parabolic system of chemotactic aggregation, which was proposed in \cite{ambrosi2005review, gamba2003percolation} to describe the coherent vascular network formation observed in {\it in vitro} experiments. Considering the system in the half line $\mathbb{R}_{+}=(0,\infty)$ with Dirichlet boundary conditions, we first prove the existence and uniqueness of non-constant phase transition steady states under some structure conditions on the pressure function. Then we prove that this unique phase transition steady state is nonlinearly asymptotically stable against a small perturbation. We prove our results by the method of energy estimates, the technique of {\it a priori} assumption and a weighted Hardy-type inequality.
Submitted 14 November, 2020;
originally announced November 2020.
-
The $L^2$-boundedness of the variational Calderón-Zygmund operators
Authors:
Y. Chen,
G. Hong
Abstract:
In this paper, we verify the $L^2$-boundedness for the jump functions and variations of Calderón-Zygmund singular integral operators with the underlying kernels satisfying \begin{align*}\int_{\varepsilon\leq |x-y|\leq N} K(x,y)dy=\int_{\varepsilon\leq |x-y|\leq N}K(x,y)dx=0\; \forall 0<\varepsilon\leq N<\infty,\end{align*} in addition to some proper size and smoothness conditions. This result should be the first general criterion for the variational inequalities for kernels not necessarily of convolution type. The $L^2$-boundedness assumption that we verify here is also the starting point of the related results on the (sharp) weighted norm inequalities that have appeared in many recent papers.
Submitted 8 September, 2020;
originally announced September 2020.
-
Maximal singular integral operators acting on noncommutative $L_p$-spaces
Authors:
Guixiang Hong,
Xudong Lai,
Bang Xu
Abstract:
In this paper, we study the boundedness theory for maximal Calderón-Zygmund operators acting on noncommutative $L_p$-spaces. Our first result is a criterion for the weak type $(1,1)$ estimate of noncommutative maximal Calderón-Zygmund operators; as an application, we obtain the weak type $(1,1)$ estimates of operator-valued maximal singular integrals of convolution type under proper regularity conditions. These are the {\it first} noncommutative maximal inequalities for families of linear operators that cannot be reduced to positive ones. For homogeneous singular integrals, the strong type $(p,p)$ ($1<p<\infty$) maximal estimates are shown to be true even for rough kernels.
As a byproduct of the criterion, we obtain the noncommutative weak type $(1,1)$ estimate for Calderón-Zygmund operators with an integral regularity condition that is slightly stronger than the Hörmander condition; this provides some evidence for an affirmative answer to an open question in the noncommutative Calderón-Zygmund theory.
Submitted 20 October, 2020; v1 submitted 7 September, 2020;
originally announced September 2020.
-
An online learning approach to dynamic pricing and capacity sizing in service systems
Authors:
Xinyun Chen,
Yunan Liu,
Guiyu Hong
Abstract:
We study a dynamic pricing and capacity sizing problem in a $GI/GI/1$ queue, where the service provider's objective is to obtain the optimal service fee $p$ and service capacity $μ$ so as to maximize the cumulative expected profit (the service revenue minus the staffing cost and delay penalty). Due to the complex nature of the queueing dynamics, such a problem has no analytic solution, so previous research often resorts to heavy-traffic analysis where both the arrival rate and service rate are sent to infinity. In this work we propose an online learning framework designed for solving this problem which does not require the system's scale to increase. Our framework is dubbed Gradient-based Online Learning in Queue (GOLiQ). GOLiQ organizes the time horizon into successive operational cycles and prescribes an efficient procedure to obtain improved pricing and staffing policies in each cycle using data collected in previous cycles. Data here include the number of customer arrivals, waiting times, and the server's busy times. The ingenuity of this approach lies in its online nature, which allows the service provider to do better by interacting with the environment. Effectiveness of GOLiQ is substantiated by (i) theoretical results including the algorithm convergence and regret analysis (with a logarithmic regret bound), and (ii) engineering confirmation via simulation experiments of a variety of representative $GI/GI/1$ queues.
Submitted 7 September, 2022; v1 submitted 7 September, 2020;
originally announced September 2020.
-
Skewing Quanto with Simplicity
Authors:
George Hong
Abstract:
We present a simple and highly efficient analytical method for solving the Quanto Skew problem in Equities under a framework that accommodates both Equity and FX volatility skew consistently. Ease of implementation and extremely fast performance of this new approach should benefit a wide spectrum of market participants.
Submitted 5 September, 2020;
originally announced September 2020.
-
Isometric Embeddability of $S_q^m$ into $S_p^n$
Authors:
Arup Chattopadhyay,
Guixiang Hong,
Avijit Pal,
Chandan Pradhan,
Samya Kumar Ray
Abstract:
In this paper, we study the existence of isometric embeddings of $S_q^m$ into $S_p^n,$ where $1\leq p\neq q\leq \infty$ and $n\geq m\geq 2.$ We show that for all $n\geq m\geq 2$ if there exists a linear isometry from $S_q^m$ into $S_p^n$, where $(q,p)\in(1,\infty]\times(1,\infty) \cup(1,\infty)\setminus\{3\}\times\{1,\infty\}$ and $p\neq q,$ then we must have $q=2.$ This mostly generalizes a classical result of Lyubich and Vaserstein. We also show that whenever $S_q$ embeds isometrically into $S_p$ for $(q,p)\in \left(1,\infty\right)\times\left[2,\infty \right)\cup[4,\infty)\times\{1\} \cup\{\infty\}\times\left( 1,\infty\right)\cup[2,\infty)\times\{\infty\}$ with $p\neq q,$ we must have $q=2.$ Thus, our work complements the work of Junge, Parcet, Xu and others on isometric and almost isometric embedding theory on non-commutative $L_p$-spaces. Our methods rely on several new ingredients related to the perturbation theory of linear operators, namely the Kato-Rellich theorem, the theory of multiple operator integrals and Birkhoff-James orthogonality, followed by a thorough and careful case-by-case analysis. The question of whether, for $m\geq 2$ and $1<q<2,$ $S_q^m$ embeds isometrically into $S_\infty^n$ was left open in \textit{Bull. London Math. Soc.} 52 (2020) 437-447.
Submitted 28 September, 2021; v1 submitted 30 August, 2020;
originally announced August 2020.