Search | arXiv e-print repository

arXiv:2406.13578 [pdf, other]

Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration

Authors: Han-Cheng Yu, Yu-An Shih, Kin-Man Law, Kai-Yu Hsieh, Yu-Chen Cheng, Hsin-Chih Ho, Zih-An Lin, Wen-Chuan Hsu, Yao-Chung Fan

Abstract: In this paper, we tackle the task of distractor generation (DG) for multiple-choice questions. Our study introduces two key designs. First, we propose \textit{retrieval augmented pretraining}, which involves refining the language model pretraining to align it more closely with the downstream task of DG. Second, we explore the integration of knowledge graphs to enhance the performance of DG. Through experiments with benchmarking datasets, we show that our models significantly outperform the state-of-the-art results. Our best-performing model advances the F1@3 score from 14.80 to 16.47 in MCQ dataset and from 15.92 to 16.50 in Sciq dataset.

Submitted 19 June, 2024; originally announced June 2024.

Comments: Findings at ACL 2024

arXiv:2405.17148 [pdf, other]

Direct view of gate-tunable miniband dispersion in graphene superlattices near the magic twist angle

Authors: Zhihao Jiang, Dongkyu Lee, Alfred J. H. Jones, Youngju Park, Kimberly Hsieh, Paulina Majchrzak, Chakradhar Sahoo, Thomas S. Nielsen, Kenji Watanabe, Takashi Taniguchi, Philip Hofmann, Jill A. Miwa, Yong P. Chen, Jeil Jung, Søren Ulstrup

Abstract: Superlattices from twisted graphene mono- and bi-layer systems give rise to on-demand quantum many-body states such as Mott insulators, unconventional superconductors and the fractional quantum Hall effect. These phenomena are observed in transport experiments when changing the filling of the low-energy electronic bands. Their origin is broadly ascribed to a combination of flat bands and strong Coulomb interactions, yet a comprehensive understanding is lacking. This is primarily because the relevant low-energy band structure is believed to strongly change in a non-trivial way as the electron filling is varied. Here we gain direct access to the filling-dependent low energy bands of twisted bilayer graphene (TBG) and twisted double bilayer graphene (TDBG) by applying micro-focused angle-resolved photoemission spectroscopy to in situ gated devices. Our findings for the two systems are in stark contrast: The doping dependent dispersion for TBG can be described in a simple model, combining a filling-dependent rigid band shift with a many-body related bandwidth change. In TDBG, on the other hand, we find a complex behaviour of the low-energy bands, combining non-monotonous bandwidth changes and tuneable gap openings. Our work establishes the extent of electric field tunability of the low energy electronic states in twisted graphene superlattices and can serve to underpin the theoretical understanding of the resulting phenomena.

Submitted 27 May, 2024; originally announced May 2024.

Comments: 25 pages, 4 main figures and 7 supplementary figures

arXiv:2402.05625 [pdf, other]

Coded Many-User Multiple Access via Approximate Message Passing

Authors: Xiaoqi Liu, Kuan Hsieh, Ramji Venkataramanan

Abstract: We consider communication over the Gaussian multiple-access channel in the regime where the number of users grows linearly with the codelength. In this regime, schemes based on sparse superposition coding can achieve a near-optimal tradeoff between spectral efficiency and signal-to-noise ratio. However, these schemes are feasible only for small values of user payload. This paper investigates efficient schemes for larger user payloads, focusing on coded CDMA schemes where each user's information is encoded via a linear code before being modulated with a signature sequence. We propose an efficient approximate message passing (AMP) decoder that can be tailored to the structure of the linear code, and provide an exact asymptotic characterization of its performance. Based on this result, we consider a decoder that integrates AMP and belief propagation and characterize its tradeoff between spectral efficiency and signal-to-noise ratio, for a given target error rate. Simulation results show that the decoder achieves state-of-the-art performance at finite lengths, with a coded CDMA scheme defined using LDPC codes and a spatially coupled matrix of signature sequences.

Submitted 1 July, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: 23 pages, 8 figures. A shorter version of this paper to appear in the Proceedings of IEEE ISIT 2024

arXiv:2402.02417 [pdf, other]

doi 10.1088/2053-1583/acf775

Revealing flat bands and hybridization gaps in a twisted bilayer graphene device with microARPES

Authors: Zhihao Jiang, Kimberly Hsieh, Alfred J. H. Jones, Paulina Majchrzak, Chakradhar Sahoo, Kenji Watanabe, Takashi Taniguchi, Jill A. Miwa, Yong P. Chen, Søren Ulstrup

Abstract: Controlling the electronic structure of two-dimensional materials using the combination of twist angle and electrostatic doping is an effective means to induce emergent phenomena. In bilayer graphene with an interlayer twist angle near the magic angle, the electronic dispersion is strongly modified by a manifold of hybridizing moiré Dirac cones leading to flat band segments with strong electronic correlations. Numerous technical challenges arising from spatial inhomogeneity of interlayer interactions, twist angle and device functionality have so far limited momentum-resolved electronic structure measurements of these systems to static conditions. Here, we present a detailed characterization of the electronic structure exhibiting miniband dispersions for twisted bilayer graphene, near the magic angle, integrated in a functional device architecture using micro-focused angle-resolved photoemission spectroscopy. The optimum conditions for visualizing the miniband dispersion are determined by exploiting the spatial resolution and photon energy tunability of the light source and applied to extract a hybridization gap size of $(0.14 \pm 0.03)$~eV and flat band segments extending across a moiré mini Brillouin zone. \textit{In situ} electrostatic gating of the sample enables significant electron-doping, causing the conduction band states to shift below the Fermi energy. Our work emphasizes key challenges in probing the electronic structure of magic angle bilayer graphene devices and outlines conditions for exploring the doping-dependent evolution of the dispersion that underpins the ability to control many-body interactions in the material.

Submitted 4 February, 2024; originally announced February 2024.

Comments: 21 pages, 5 figures

Journal ref: 2D Mater. 10, 045027 (2023)

arXiv:2310.01733 [pdf, other]

doi 10.1109/ICDH60066.2023.00019

Health Guardian: Using Multi-modal Data to Understand Individual Health

Authors: Vince S. Siu, Kuan Yu Hsieh, Italo Buleje, Takashi Itoh, Tian Hao, Ben Civjan, Nigel Hinds, Bing Dang, Jeffrey L. Rogers, Bo Wen

Abstract: Artificial intelligence (AI) has shown great promise in revolutionizing the field of digital health by improving disease diagnosis, treatment, and prevention. This paper describes the Health Guardian platform, a non-commercial, scientific research-based platform developed by the IBM Digital Health team to rapidly translate AI research into cloud-based microservices. The platform can collect health-related data from various digital devices, including wearables and mobile applications. Its flexible architecture supports microservices that accept diverse data types such as text, audio, and video, expanding the range of digital health assessments and enabling holistic health evaluations by capturing voice, facial, and motion bio-signals. These microservices can be deployed to a clinical cohort specified through the Clinical Task Manager (CTM). The CTM then collects multi-modal, clinical data that can iteratively improve the accuracy of AI predictive models, discover new disease mechanisms, or identify novel biomarkers. This paper highlights three microservices with different input data types, including a text-based microservice for depression assessment, a video-based microservice for sit-to-stand mobility assessment, and a wearable-based microservice for functional mobility assessment. The CTM is also discussed as a tool to help design and set up clinical studies to unlock the full potential of the platform. Today, the Health Guardian platform is being leveraged in collaboration with research partners to optimize the development of AI models by utilizing a multitude of input sources. This approach streamlines research efforts, enhances efficiency, and facilitates the development and validation of digital health applications.

Submitted 2 October, 2023; originally announced October 2023.

Comments: 10 pages, 6 figures

Journal ref: IEEE International Conference on Digital Health (ICDH), 2023, pp. 65-74

arXiv:2310.01673 [pdf, other]

doi 10.1109/ICDH60066.2023.00021

A Versatile Data Fabric for Advanced IoT-Based Remote Health Monitoring

Authors: Italo Buleje, Vince S. Siu, Kuan Yu Hsieh, Nigel Hinds, Bing Dang, Erhan Bilal, Thanhnha Nguyen, Ellen E. Lee, Colin A. Depp, Jeffrey L. Rogers

Abstract: This paper presents a data-centric and security-focused data fabric designed for digital health applications. With the increasing interest in digital health research, there has been a surge in the volume of Internet of Things (IoT) data derived from smartphones, wearables, and ambient sensors. Managing this vast amount of data, encompassing diverse data types and varying time scales, is crucial. Moreover, compliance with regulatory and contractual obligations is essential. The proposed data fabric comprises an architecture and a toolkit that facilitate the integration of heterogeneous data sources, across different environments, to provide a unified view of the data in dashboards. Furthermore, the data fabric supports the development of reusable and configurable data integration components, which can be shared as open-source or inner-source software. These components are used to generate data pipelines that can be deployed and scheduled to run either in the cloud or on-premises. Additionally, we present the implementation of our data fabric in a home-based telemonitoring research project involving older adults, conducted in collaboration with the University of California, San Diego (UCSD). The study showcases the streamlined integration of data collected from various IoT sensors and mobile applications to create a unified view of older adults' health for further analysis and research.

Submitted 2 October, 2023; originally announced October 2023.

Journal ref: 2023 IEEE International Conference on Digital Health (ICDH), Chicago, IL, USA, 2023, pp. 88-90

arXiv:2309.08404 [pdf, other]

Bayes-Optimal Estimation in Generalized Linear Models via Spatial Coupling

Authors: Pablo Pascual Cobo, Kuan Hsieh, Ramji Venkataramanan

Abstract: We consider the problem of signal estimation in a generalized linear model (GLM). GLMs include many canonical problems in statistical estimation, such as linear regression, phase retrieval, and 1-bit compressed sensing. Recent work has precisely characterized the asymptotic minimum mean-squared error (MMSE) for GLMs with i.i.d. Gaussian sensing matrices. However, in many models there is a significant gap between the MMSE and the performance of the best known feasible estimators. In this work, we address this issue by considering GLMs defined via spatially coupled sensing matrices. We propose an efficient approximate message passing (AMP) algorithm for estimation and prove that with a simple choice of spatially coupled design, the MSE of a carefully tuned AMP estimator approaches the asymptotic MMSE in the high-dimensional limit. To prove the result, we first rigorously characterize the asymptotic performance of AMP for a GLM with a generic spatially coupled design. This characterization is in terms of a deterministic recursion (`state evolution') that depends on the parameters defining the spatial coupling. Then, using a simple spatially coupled design and judicious choice of functions defining the AMP, we analyze the fixed points of the resulting state evolution and show that it achieves the asymptotic MMSE. Numerical results for phase retrieval and rectified linear regression show that spatially coupled designs can yield substantially lower MSE than i.i.d. Gaussian designs at finite dimensions when used with AMP algorithms.

Submitted 15 September, 2023; originally announced September 2023.

Comments: 39 pages, 4 figures. A shorter version of this paper appeared in the proceedings of the 2023 IEEE International Symposium on Information Theory

arXiv:2308.06261 [pdf, other]

Enhancing Network Management Using Code Generated by Large Language Models

Authors: Sathiya Kumaran Mani, Yajie Zhou, Kevin Hsieh, Santiago Segarra, Ranveer Chandra, Srikanth Kandula

Abstract: Analyzing network topologies and communication graphs plays a crucial role in contemporary network management. However, the absence of a cohesive approach leads to a challenging learning curve, heightened errors, and inefficiencies. In this paper, we introduce a novel approach to facilitate a natural-language-based network management experience, utilizing large language models (LLMs) to generate task-specific code from natural language queries. This method tackles the challenges of explainability, scalability, and privacy by allowing network operators to inspect the generated code, eliminating the need to share network data with LLMs, and concentrating on application-specific requests combined with general program synthesis techniques. We design and evaluate a prototype system using benchmark applications, showcasing high accuracy, cost-effectiveness, and the potential for further enhancements using complementary program synthesis techniques.

Submitted 11 August, 2023; originally announced August 2023.

arXiv:2307.12675 [pdf, other]

doi 10.1103/PhysRevB.108.195410

Charge transfer-induced Lifshitz transition and magnetic symmetry breaking in ultrathin CrSBr crystals

Authors: Marco Bianchi, Kimberly Hsieh, Esben Juel Porat, Florian Dirnberger, Julian Klein, Kseniia Mosina, Zdenek Sofer, Alexander N. Rudenko, Mikhail I. Katsnelson, Yong P. Chen, Malte Rösner, Philip Hofmann

Abstract: Ultrathin CrSBr flakes are exfoliated \emph{in situ} on Au(111) and Ag(111) and their electronic structure is studied by angle-resolved photoemission spectroscopy. The thin flakes' electronic properties are drastically different from those of the bulk material and also substrate-dependent. For both substrates, a strong charge transfer to the flakes is observed, partly populating the conduction band and giving rise to a highly anisotropic Fermi contour with an Ohmic contact to the substrate. The fundamental CrSBr band gap is strongly renormalized compared to the bulk. The charge transfer to the CrSBr flake is substantially larger for Ag(111) than for Au(111), but a rigid energy shift of the chemical potential is insufficient to describe the observed band structure modifications. In particular, the Fermi contour shows a Lifshitz transition, the fundamental band gap undergoes a transition from direct on Au(111) to indirect on Ag(111) and a doping-induced symmetry breaking between the intra-layer Cr magnetic moments further modifies the band structure. Electronic structure calculations can account for non-rigid Lifshitz-type band structure changes in thin CrSBr as a function of doping and strain. In contrast to undoped bulk band structure calculations that require self-consistent $GW$ theory, the doped thin film properties are well-approximated by density functional theory if local Coulomb interactions are taken into account on the mean-field level and the charge transfer is considered.

Submitted 24 July, 2023; originally announced July 2023.

arXiv:2306.00838 [pdf, other]

The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI

Authors: Ahmed W. Moawad, Anastasia Janas, Ujjwal Baid, Divya Ramakrishnan, Rachit Saluja, Nader Ashraf, Leon Jekel, Raisa Amiruddin, Maruf Adewole, Jake Albrecht, Udunna Anazodo, Sanjay Aneja, Syed Muhammad Anwar, Timothy Bergquist, Evan Calabrese, Veronica Chiang, Verena Chung, Gian Marco Marco Conte, Farouk Dako, James Eddy, Ivan Ezhov, Ariana Familiar, Keyvan Farahani, Juan Eugenio Iglesias, Zhifan Jiang , et al. (206 additional authors not shown)

Abstract: The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and characterizes the challenging cases that impacted the performance of the winning algorithms. Untreated brain metastases on standard anatomic MRI sequences (T1, T2, FLAIR, T1PG) from eight contributed international datasets were annotated in stepwise method: published UNET algorithms, student, neuroradiologist, final approver neuroradiologist. Segmentations were ranked based on lesion-wise Dice and Hausdorff distance (HD95) scores. False positives (FP) and false negatives (FN) were rigorously penalized, receiving a score of 0 for Dice and a fixed penalty of 374 for HD95. Eight datasets comprising 1303 studies were annotated, with 402 studies (3076 lesions) released on Synapse as publicly available datasets to challenge competitors. Additionally, 31 studies (139 lesions) were held out for validation, and 59 studies (218 lesions) were used for testing. Segmentation accuracy was measured as rank across subjects, with the winning team achieving a LesionWise mean score of 7.9. Common errors among the leading teams included false negatives for small lesions and misregistration of masks in space. The BraTS-METS 2023 challenge successfully curated well-annotated, diverse datasets and identified common errors, facilitating the translation of BM segmentation across varied clinical environments and providing personalized volumetric reports to patients undergoing BM treatment.

Submitted 17 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

arXiv:2305.13792 [pdf, other]

Mitigating the Performance Impact of Network Failures in Public Clouds

Authors: Pooria Namyar, Behnaz Arzani, Daniel Crankshaw, Daniel S. Berger, Kevin Hsieh, Srikanth Kandula, Ramesh Govindan

Abstract: Some faults in data center networks require hours to days to repair because they may need reboots, re-imaging, or manual work by technicians. To reduce traffic impact, cloud providers \textit{mitigate} the effect of faults, for example, by steering traffic to alternate paths. The state-of-art in automatic network mitigations uses simple safety checks and proxy metrics to determine mitigations. SWARM, the approach described in this paper, can pick orders of magnitude better mitigations by estimating end-to-end connection-level performance (CLP) metrics. At its core is a scalable CLP estimator that quickly ranks mitigations with high fidelity and, on failures observed at a large cloud provider, outperforms the state-of-the-art by over 700$\times$ in some cases.

Submitted 23 May, 2023; originally announced May 2023.

arXiv:2305.13750 [pdf, other]

doi 10.1103/PhysRevA.109.023705

Tuning atom-field interaction via phase shaping

Authors: Y. -T. Cheng, C. -H. Chien, K. -M. Hsieh, Y. -H. Huang, P. Y. Wen, W. -J. Lin, Y. Lu, F. Aziz, C. -P. Lee, K. -T. Lin, C. -Y. Chen, J. C. Chen, C. -S. Chuu, A. F. Kockum, G. -D. Lin, Y. -H. Lin, I. -C. Hoi

Abstract: A coherent electromagnetic field can be described by its amplitude, frequency, and phase. All these properties can influence the interaction between the field and an atom. Here we demonstrate the phase shaping of microwaves that are scattered by a superconducting artificial atom coupled to the end of a semi-infinite 1D transmission line. In particular, we input a weak exponentially rising pulse with phase modulation to a transmon qubit. We observe that field-atom interaction can be tuned from nearly full interaction (interaction efficiency, i.e., amount of the field energy interacting with the atom, of 94.5%) to effectively no interaction (interaction efficiency 3.5%).

Submitted 26 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

Journal ref: Physical Review A 109, 023705 (2024)

arXiv:2302.07442 [pdf, other]

Microwave amplification via interfering multi-photon processes in a half-waveguide quantum electrodynamics system

Authors: Fahad Aziz, Kuan Ting Lin, Ping Yi Wen, Samina, Yu Chen Lin, Emely Wiegand, Ching-Ping Lee, Yu-Ting Cheng, Ching-Yeh Chen, Chin-Hsun Chien, Kai-Min Hsieh, Yu-Huan Huang, Ian Hou, Jeng-Chung Chen, Yen-Hsiang Lin, Anton Frisk Kockum, Guin Dar Lin, Io-Chun Hoi

Abstract: We investigate the amplification of a microwave probe signal by a superconducting artificial atom, a transmon, strongly coupled to the end of a one-dimensional semi-infinite transmission line. The end of the transmission line acts as a mirror for microwave fields. Due to the weak anharmonicity of the artificial atom, a strong pump field creates multi-photon excitations among the dressed states. Transitions between these dressed states, Rabi sidebands, give rise to either amplification or attenuation of the weak probe. We obtain a maximum amplitude amplification of about 18 %, higher than in any previous experiment with a single artificial atom, due to constructive interference between Rabi sidebands. We also characterize the noise properties of the system by measuring the spectrum of spontaneous emission.

Submitted 14 February, 2023; originally announced February 2023.

arXiv:2211.06330 [pdf, other]

doi 10.1109/ICDH55609.2022.00015

Health Guardian Platform: A technology stack to accelerate discovery in Digital Health research

Authors: Bo Wen, Vince S. Siu, Italo Buleje, Kuan Yu Hsieh, Takashi Itoh, Lukas Zimmerli, Nigel Hinds, Elif Eyigoz, Bing Dang, Stefan von Cavallar, Jeffrey L. Rogers

Abstract: This paper highlights the design philosophy and architecture of the Health Guardian, a platform developed by the IBM Digital Health team to accelerate discoveries of new digital biomarkers and development of digital health technologies. The Health Guardian allows for rapid translation of artificial intelligence (AI) research into cloud-based microservices that can be tested with data from clinical cohorts to understand disease and enable early prevention. The platform can be connected to mobile applications, wearables, or Internet of things (IoT) devices to collect health-related data into a secure database. When the analytics are created, the researchers can containerize and deploy their code on the cloud using pre-defined templates, and validate the models using the data collected from one or more sensing devices. The Health Guardian platform currently supports time-series, text, audio, and video inputs with 70+ analytic capabilities and is used for non-commercial scientific research. We provide an example of the Alzheimer's disease (AD) assessment microservice which uses AI methods to extract linguistic features from audio recordings to evaluate an individual's mini-mental state, the likelihood of having AD, and to predict the onset of AD before turning the age of 85. Today, IBM research teams across the globe use the Health Guardian internally as a test bed for early-stage research ideas, and externally with collaborators to support and enhance AI model development and clinical study efforts.

Submitted 10 November, 2022; originally announced November 2022.

Comments: 6 pages, 3 figures, https://ieeexplore.ieee.org/document/9861047

Journal ref: IEEE International Conference on Digital Health (ICDH), 2022, pp. 40-46

arXiv:2206.00799 [pdf, other]

Federated Learning under Distributed Concept Drift

Authors: Ellango Jothimurugesan, Kevin Hsieh, Jianyu Wang, Gauri Joshi, Phillip B. Gibbons

Abstract: Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). To the best of our knowledge, this work is the first to explicitly study data heterogeneity in both dimensions. We first demonstrate that prior solutions to drift adaptation that use a single global model are ill-suited to staggered drifts, necessitating multiple-model solutions. We identify the problem of drift adaptation as a time-varying clustering problem, and we propose two new clustering algorithms for reacting to drifts based on local drift detection and hierarchical clustering. Empirical evaluation shows that our solutions achieve significantly higher accuracy than existing baselines, and are comparable to an idealized algorithm with oracle knowledge of the ground-truth clustering of clients to concepts at each time step.

Submitted 27 February, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

Comments: 20 pages. Published in AISTATS 2023

ACM Class: I.2.6

arXiv:2202.01267 [pdf, other]

FedSpace: An Efficient Federated Learning Framework at Satellites and Ground Stations

Authors: Jinhyun So, Kevin Hsieh, Behnaz Arzani, Shadi Noghabi, Salman Avestimehr, Ranveer Chandra

Abstract: Large-scale deployments of low Earth orbit (LEO) satellites collect massive amount of Earth imageries and sensor data, which can empower machine learning (ML) to address global challenges such as real-time disaster navigation and mitigation. However, it is often infeasible to download all the high-resolution images and train these ML models on the ground because of limited downlink bandwidth, sparse connectivity, and regularization constraints on the imagery resolution. To address these challenges, we leverage Federated Learning (FL), where ground stations and satellites collaboratively train a global ML model without sharing the captured images on the satellites. We show fundamental challenges in applying existing FL algorithms among satellites and ground stations, and we formulate an optimization problem which captures a unique trade-off between staleness and idleness. We propose a novel FL framework, named FedSpace, which dynamically schedules model aggregation based on the deterministic and time-varying connectivity according to satellite orbits. Extensive numerical evaluations based on real-world satellite images and satellite networks show that FedSpace reduces the training time by 1.7 days (38.6%) over the state-of-the-art FL algorithms.

Submitted 2 February, 2022; originally announced February 2022.

arXiv:2110.05554 [pdf, other]

Towards a Cost vs. Quality Sweet Spot for Monitoring Networks

Authors: Nofel Yaseen, Behnaz Arzani, Krishna Chintalapudi, Vaishnavi Ranganathan, Felipe Frujeri, Kevin Hsieh, Daniel Berger, Vincent Liu, Srikanth Kandula

Abstract: Continuously monitoring a wide variety of performance and fault metrics has become a crucial part of operating large-scale datacenter networks. In this work, we ask whether we can reduce the costs to monitor -- in terms of collection, storage and analysis -- by judiciously controlling how much and which measurements we collect. By positing that we can treat almost all measured signals as sampled time-series, we show that we can use signal processing techniques such as the Nyquist-Shannon theorem to avoid wasteful data collection. We show that large savings appear possible by analyzing tens of popular measurements from a production datacenter network. We also discuss the technical challenges that must be solved when applying these techniques in practice.

Submitted 11 October, 2021; originally announced October 2021.

arXiv:2102.11267 [pdf, other]

Interpret-able feedback for AutoML systems

Authors: Behnaz Arzani, Kevin Hsieh, Haoxian Chen

Abstract: Automated machine learning (AutoML) systems aim to enable training machine learning (ML) models for non-ML experts. A shortcoming of these systems is that when they fail to produce a model with high accuracy, the user has no path to improve the model other than hiring a data scientist or learning ML -- this defeats the purpose of AutoML and limits its adoption. We introduce an interpretable data feedback solution for AutoML. Our solution suggests new data points for the user to label (without requiring a pool of unlabeled data) to improve the model's accuracy. Our solution analyzes how features influence the prediction among all ML models in an AutoML ensemble, and we suggest more data samples from feature ranges that have high variance in such analysis. Our evaluation shows that our solution can improve the accuracy of AutoML by 7-8% and significantly outperforms popular active learning solutions in data efficiency, all the while providing the added benefit of being interpretable.

Submitted 22 February, 2021; originally announced February 2021.

arXiv:2102.05099 [pdf, other]

doi 10.1103/PhysRevLett.126.206803

Spontaneous time reversal symmetry breaking at individual grain boundaries in graphene

Authors: Kimberly Hsieh, Vidya Kochat, Tathagata Biswas, Chandra Sekhar Tiwary, Abhishek Mishra, Gopalakrishnan Ramalingam, Aditya Jayaraman, Kamanio Chattopadhyay, Srinivasan Raghavan, Manish Jain, Arindam Ghosh

Abstract: Graphene grain boundaries have attracted interest for their ability to host nearly dispersionless electronic bands and magnetic instabilities. Here, we employ quantum transport and universal conductance fluctuations (UCF) measurements to experimentally demonstrate a spontaneous breaking of time reversal symmetry (TRS) across individual GBs of chemical vapour deposited graphene. While quantum transport across the GBs indicate spin-scattering-induced dephasing, and hence formation of local magnetic moments, below $T\lesssim 4$ K, we observe complete lifting of TRS at high carrier densities ($n \gtrsim 5\times 10^{12}$cm$^{-2}$) and low temperature ($T\lesssim 2$ K). An unprecedented thirty times reduction in the UCF magnitude with increasing doping density further supports the possibility of an emergent frozen magnetic state at the GBs. Our experimental results suggest that realistic GBs of graphene can be a promising resource for new electronic phases and spin-based applications.

Submitted 30 March, 2021; v1 submitted 9 February, 2021; originally announced February 2021.

Journal ref: Phys. Rev. Lett. 126, 206803 (2021)

arXiv:2102.04730 [pdf, other]

doi 10.1109/JSAIT.2022.3158827

Near-Optimal Coding for Many-user Multiple Access Channels

Authors: Kuan Hsieh, Cynthia Rush, Ramji Venkataramanan

Abstract: This paper considers the Gaussian multiple-access channel (MAC) in the asymptotic regime where the number of users grows linearly with the code length. We propose efficient coding schemes based on random linear models with approximate message passing (AMP) decoding and derive the asymptotic error rate achieved for a given user density, user payload (in bits), and user energy. The tradeoff between energy-per-bit and achievable user density (for a fixed user payload and target error rate) is studied, and it is demonstrated that in the large system limit, a spatially coupled coding scheme with AMP decoding achieves near-optimal tradeoffs for a wide range of user densities. Furthermore, in the regime where the user payload is large, we also study the tradeoff between energy-per-bit and spectral efficiency and discuss methods to reduce decoding complexity.

Submitted 9 March, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

Comments: 15 pages, 4 figures. To appear in IEEE Journal on Selected Areas in Information Theory

Journal ref: IEEE Journal on Selected Areas in Information Theory, vol. 3, no. 1, pp. 21-36, March 2022

arXiv:2012.10557 [pdf, other]

Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers

Authors: Romil Bhardwaj, Zhengxu Xia, Ganesh Ananthanarayanan, Junchen Jiang, Nikolaos Karianakis, Yuanchao Shu, Kevin Hsieh, Victor Bahl, Ion Stoica

Abstract: Video analytics applications use edge compute servers for the analytics of the videos (for bandwidth and privacy). Compressed models that are deployed on the edge servers for inference suffer from data drift, where the live video data diverges from the training data. Continuous learning handles data drift by periodically retraining the models on new data. Our work addresses the challenge of jointly supporting inference and retraining tasks on edge servers, which requires navigating the fundamental tradeoff between the retrained model's accuracy and the inference accuracy. Our solution Ekya balances this tradeoff across multiple models and uses a micro-profiler to identify the models that will benefit the most by retraining. Ekya's accuracy gain compared to a baseline scheduler is 29% higher, and the baseline requires 4x more GPU resources to achieve the same accuracy as Ekya.

Submitted 18 December, 2020; originally announced December 2020.

arXiv:2009.10931 [pdf]

doi 10.1038/s41598-021-02353-5

Drug repurposing for COVID-19 using graph neural network and harmonizing multiple evidence

Authors: Kanglin Hsieh, Yinyin Wang, Luyao Chen, Zhongming Zhao, Sean Savitz, Xiaoqian Jiang, Jing Tang, Yejin Kim

Abstract: Amid the pandemic of 2019 novel coronavirus disease (COVID-19) infected by SARS-CoV-2, a vast amount of drug research for prevention and treatment has been quickly conducted, but these efforts have been unsuccessful thus far. Our objective is to prioritize repurposable drugs using a drug repurposing pipeline that systematically integrates multiple SARS-CoV-2 and drug interactions, deep graph neural networks, and in-vitro/population-based validations. We first collected all the available drugs (n= 3,635) involved in COVID-19 patient treatment through CTDbase. We built a SARS-CoV-2 knowledge graph based on the interactions among virus baits, host genes, pathways, drugs, and phenotypes. A deep graph neural network approach was used to derive the candidate representation based on the biological interactions. We prioritized the candidate drugs using clinical trial history, and then validated them with their genetic profiles, in vitro experimental efficacy, and electronic health records. We highlight the top 22 drugs including Azithromycin, Atorvastatin, Aspirin, Acetaminophen, and Albuterol. We further pinpointed drug combinations that may synergistically target COVID-19. In summary, we demonstrated that the integration of extensive interactions, deep neural networks, and rigorous validation can facilitate the rapid identification of candidate drugs for COVID-19 treatment.
This is a post-peer-review, pre-copyedit version of an article published in Scientific Reports. The final authenticated version is available online at: https://www.nature.com/articles/s41598-021-02353-5

Submitted 1 February, 2022; v1 submitted 23 September, 2020; originally announced September 2020.

Comments: 13 pages

Journal ref: Sci Rep 11, 23179 (2021)

arXiv:2004.09549 [pdf, other]

doi 10.1109/TIT.2021.3081368

Modulated Sparse Superposition Codes for the Complex AWGN Channel

Authors: Kuan Hsieh, Ramji Venkataramanan

Abstract: This paper studies a generalization of sparse superposition codes (SPARCs) for communication over the complex additive white Gaussian noise (AWGN) channel. In a SPARC, the codebook is defined in terms of a design matrix, and each codeword is generated by multiplying the design matrix with a sparse message vector. In the standard SPARC construction, information is encoded in the locations of the non-zero entries of the message vector. In this paper we generalize the construction and consider modulated SPARCs, where information is encoded in both the locations and the values of the non-zero entries of the message vector. We focus on the case where the non-zero entries take values from a phase-shift keying (PSK) constellation. We propose a computationally efficient approximate message passing (AMP) decoder, and obtain analytical bounds on the state evolution parameters which predict the error performance of the decoder. Using these bounds we show that PSK-modulated SPARCs are asymptotically capacity achieving for the complex AWGN channel, with either spatial coupling or power allocation. We also provide numerical simulation results to demonstrate the error performance at finite code lengths. These results show that introducing modulation to the SPARC design can significantly reduce decoding complexity without sacrificing error performance.

Submitted 11 May, 2021; v1 submitted 20 April, 2020; originally announced April 2020.

Comments: 20 pages, 6 figures. To appear in IEEE Transactions on Information Theory

Journal ref: IEEE Transactions on Information Theory, vol. 67, no. 7, pp. 4385-4404, July 2021

arXiv:2003.02880 [pdf, other]

doi 10.1021/acs.nanolett.0c03586

Evidence of Lifshitz transition in thermoelectric power of ultrahigh mobility bilayer graphene

Authors: Aditya Jayaraman, Kimberly Hsieh, Bhaskar Ghawri, Phanibhusan S. Mahapatra, Arindam Ghosh

Abstract: Resolving low-energy features in the density of states (DOS) holds the key to understanding wide variety of rich novel phenomena in graphene based 2D heterostructures. Lifshitz transition in bilayer graphene (BLG) arising from trigonal warping has been established theoretically and experimentally. Nevertheless, the experimental realization of its effects on the transport properties has been challenging because of its relatively low energy scale ($\sim 1$ meV). In this work, we demonstrate that the thermoelectric power (TEP) can be used as an effective probe to investigate fine changes in the DOS of BLG. We observe additional entropy features in the vicinity of the charge neutrality point (CNP) in gapped BLG. This apparent violation of Mott formula can be explained quantitatively by considering the effects of trigonal warping, thereby serving as a possible evidence of a Lifshitz transition.

Submitted 5 March, 2020; originally announced March 2020.

arXiv:2002.07844 [pdf, other]

doi 10.1109/TIT.2021.3083733

Capacity-achieving Spatially Coupled Sparse Superposition Codes with AMP Decoding

Authors: Cynthia Rush, Kuan Hsieh, Ramji Venkataramanan

Abstract: Sparse superposition codes, also called sparse regression codes (SPARCs), are a class of codes for efficient communication over the AWGN channel at rates approaching the channel capacity. In a standard SPARC, codewords are sparse linear combinations of columns of an i.i.d. Gaussian design matrix, while in a spatially coupled SPARC the design matrix has a block-wise structure, where the variance of the Gaussian entries can be varied across blocks. A well-designed spatial coupling structure can significantly enhance the error performance of iterative decoding algorithms such as Approximate Message Passing (AMP). In this paper, we obtain a non-asymptotic bound on the probability of error of spatially coupled SPARCs with AMP decoding. Applying this bound to a simple band-diagonal design matrix, we prove that spatially coupled SPARCs with AMP decoding achieve the capacity of the AWGN channel. The bound also highlights how the decay of error probability depends on each design parameter of the spatially coupled SPARC. An attractive feature of AMP decoding is that its asymptotic mean squared error (MSE) can be predicted via a deterministic recursion called state evolution. Our result provides the first proof that the MSE concentrates on the state evolution prediction for spatially coupled designs. Combined with the state evolution prediction, this result implies that spatially coupled SPARCs with the proposed band-diagonal design are capacity-achieving. Using the proof technique used to establish the main result, we also obtain a concentration inequality for the MSE of AMP applied to compressed sensing with spatially coupled design matrices. Finally we provide numerical simulation results that demonstrate the finite length error performance of spatially coupled SPARCs. The performance is compared with coded modulation schemes that use LDPC codes from the DVB-S2 standard.

Submitted 8 May, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

Comments: To appear in IEEE Transactions on Information Theory. This version contains proofs of two technical lemmas that were omitted in the journal version

Journal ref: IEEE Transactions on Information Theory, vol. 67, no. 7, pp. 4446-4484, July 2021

arXiv:1910.08663 [pdf, other]

Machine Learning Systems for Highly-Distributed and Rapidly-Growing Data

Authors: Kevin Hsieh

Abstract: The usability and practicality of any machine learning (ML) applications are largely influenced by two critical but hard-to-attain factors: low latency and low cost. Unfortunately, achieving low latency and low cost is very challenging when ML depends on real-world data that are highly distributed and rapidly growing (e.g., data collected by mobile phones and video cameras all over the world). Such real-world data pose many challenges in communication and computation. For example, when training data are distributed across data centers that span multiple continents, communication among data centers can easily overwhelm the limited wide-area network bandwidth, leading to prohibitively high latency and high cost. In this dissertation, we demonstrate that the latency and cost of ML on highly-distributed and rapidly-growing data can be improved by one to two orders of magnitude by designing ML systems that exploit the characteristics of ML algorithms, ML model structures, and ML training/serving data. We support this thesis statement with three contributions. First, we design a system that provides both low-latency and low-cost ML serving (inferencing) over large-scale and continuously-growing datasets, such as videos. Second, we build a system that makes ML training over geo-distributed datasets as fast as training within a single data center. Third, we present a first detailed study and a system-level solution on a fundamental and largely overlooked problem: ML training over non-IID (i.e., not independent and identically distributed) data partitions (e.g., facial images collected by cameras varies according to the demographics of each camera's location).

Submitted 18 October, 2019; originally announced October 2019.

arXiv:1910.00189 [pdf, other]

The Non-IID Data Quagmire of Decentralized Machine Learning

Authors: Kevin Hsieh, Amar Phanishayee, Onur Mutlu, Phillip B. Gibbons

Abstract: Many large-scale machine learning (ML) applications need to perform decentralized learning over datasets generated at different devices and locations. Such datasets pose a significant challenge to decentralized learning because their different contexts result in significant data distribution skew across devices/locations. In this paper, we take a step toward better understanding this challenge by presenting a detailed experimental study of decentralized DNN training on a common type of data skew: skewed distribution of data labels across devices/locations. Our study shows that: (i) skewed data labels are a fundamental and pervasive problem for decentralized learning, causing significant accuracy loss across many ML applications, DNN models, training datasets, and decentralized learning algorithms; (ii) the problem is particularly challenging for DNN models with batch normalization; and (iii) the degree of data skew is a key determinant of the difficulty of the problem. Based on these findings, we present SkewScout, a system-level approach that adapts the communication frequency of decentralized learning algorithms to the (skew-induced) accuracy loss between data partitions. We also show that group normalization can recover much of the accuracy loss of batch normalization.

Submitted 18 August, 2020; v1 submitted 30 September, 2019; originally announced October 2019.

Journal ref: International Conference on Machine Learning (ICML), 2020

arXiv:1903.11312 [pdf, other]

doi 10.1088/1361-6528/ab2d88

Optimising Graphene Visibility in van der Waals Heterostructures

Authors: Thanmay S. Menon, Simli Mishra, Vidhu Catherine Antony, Kiranmayi Dixit, Saloni Kakkar, Tanweer Ahmed, Saurav Islam, Aditya Jayaraman, Kimberly Hsieh, Paritosh Karnatak, Arindam Ghosh

Abstract: Graphene constitutes one of the key elements in many functional van der Waals heterostructures. However, it has negligible optical visibility due to its monolayer nature. Here we study the visibility of graphene in various van der Waals heterostructures and include the effects of the source spectrum, oblique incidence and the spectral sensitivity of the detector to obtain a realistic model. A visibility experiment is performed at different wavelengths, resulting in a very good agreement with our calculations. This allows us to reliably predict the conditions for better visibility of graphene in van der Waals heterostructures. The framework and the codes provided in this work can be extended to study the visibility of any 2D material within an arbitrary van der Waals heterostructure.

Submitted 18 June, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

arXiv:1805.03154 [pdf, other]

Flexible-Latency DRAM: Understanding and Exploiting Latency Variation in Modern DRAM Chips

Authors: Kevin K. Chang, Abhijith Kashyap, Hasan Hassan, Saugata Ghose, Kevin Hsieh, Donghyuk Lee, Tianshi Li, Gennady Pekhimenko, Samira Khan, Onur Mutlu

Abstract: This article summarizes key results of our work on experimental characterization and analysis of latency variation and latency-reliability trade-offs in modern DRAM chips, which was published in SIGMETRICS 2016, and examines the work's significance and future potential. The goal of this work is to (i) experimentally characterize and understand the latency variation across cells within a DRAM chip for these three fundamental DRAM operations, and (ii) develop new mechanisms that exploit our understanding of the latency variation to reliably improve performance. To this end, we comprehensively characterize 240 DRAM chips from three major vendors, and make six major new observations about latency variation within DRAM. Notably, we find that (i) there is large latency variation across the cells for each of the three operations; (ii) variation characteristics exhibit significant spatial locality: slower cells are clustered in certain regions of a DRAM chip; and (iii) the three fundamental operations exhibit different reliability characteristics when the latency of each operation is reduced. Based on our observations, we propose Flexible-LatencY DRAM (FLY-DRAM), a mechanism that exploits latency variation across DRAM cells within a DRAM chip to improve system performance. The key idea of FLY-DRAM is to exploit the spatial locality of slower cells within DRAM, and access the faster DRAM regions with reduced latencies for the fundamental operations. Our evaluations show that FLY-DRAM improves the performance of a wide range of applications by 13.3%, 17.6%, and 19.5%, on average, for each of the three different vendors' real DRAM chips, in a simulated 8-core system.

Submitted 8 May, 2018; originally announced May 2018.

arXiv:1805.02498 [pdf, other]

Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance

Authors: Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha, Saugata Ghose, Adwait Jog, Phillip B. Gibbons, Onur Mutlu

Abstract: The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of modern GPU programming models. This specification determines the parallelism, and hence performance, of the application during execution because the corresponding on-chip hardware resources are allocated and managed based on this specification. This tight-coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance. Zorua is a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to the programmer. We demonstrate that by providing the illusion of more resources than physically available via controlled and coordinated virtualization, Zorua offers several important benefits: (i) Programming Ease. Zorua eases the burden on the programmer to provide code that is tuned to efficiently utilize the physically available on-chip resources. (ii) Portability. Zorua alleviates the necessity of re-tuning an application's resource usage when porting the application across GPU generations. (iii) Performance. By dynamically allocating resources and carefully oversubscribing them when necessary, Zorua improves or retains the performance of applications that are already highly tuned to best utilize the resources.

Submitted 2 May, 2018; originally announced May 2018.

Comments: arXiv admin note: substantial text overlap with arXiv:1802.02573

arXiv:1803.08625 [pdf, other]

A Concept Learning Tool Based On Calculating Version Space Cardinality

Authors: Kuo-Kai Hsieh, Li-C. Wang

Abstract: In this paper, we proposed VeSC-CoL (Version Space Cardinality based Concept Learning) to deal with concept learning on extremely imbalanced datasets, especially when cross-validation is not a viable option. VeSC-CoL uses version space cardinality as a measure for model quality to replace cross-validation. Instead of naive enumeration of the version space, Ordered Binary Decision Diagram and Boolean Satisfiability are used to compute the version space. Experiments show that VeSC-CoL can accurately learn the target concept when computational resource is allowed.

Submitted 22 March, 2018; originally announced March 2018.

arXiv:1802.02573 [pdf, other]

Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management

Authors: Nandita Vijaykumar, Kevin Hsieh, Gennady Pekhimenko, Samira Khan, Ashish Shrestha, Saugata Ghose, Phillip B. Gibbons, Onur Mutlu

Abstract: The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of the existing GPU programming models. This specification determines the performance of the application during execution because the corresponding on-chip hardware resources are allocated and managed purely based on this specification. This tight coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance, as we demonstrate in this work. Our goal in this work is to reduce the dependence of performance on the software-provided resource specification to simultaneously alleviate the above challenges. To this end, we introduce Zorua, a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to the programmer. We demonstrate that by providing the illusion of more resources than physically available, Zorua offers several important benefits: (i) Programming Ease: Zorua eases the burden on the programmer to provide code that is tuned to efficiently utilize the physically available on-chip resources. (ii) Portability: Zorua alleviates the necessity of re-tuning an application's resource usage when porting the application across GPU generations. (iii) Performance: By dynamically allocating resources and carefully oversubscribing them when necessary, Zorua improves or retains the performance of applications that are already highly tuned to best utilize the resources. The holistic virtualization provided by Zorua has many other potential uses which we describe in this paper.

Submitted 7 February, 2018; originally announced February 2018.

Report number: SAFARI Technical Report 2016-005

arXiv:1802.00320 [pdf, other]

Enabling the Adoption of Processing-in-Memory: Challenges, Mechanisms, Future Research Directions

Authors: Saugata Ghose, Kevin Hsieh, Amirali Boroumand, Rachata Ausavarungnirun, Onur Mutlu

Abstract: Poor DRAM technology scaling over the course of many years has caused DRAM-based main memory to increasingly become a larger system bottleneck. A major reason for the bottleneck is that data stored within DRAM must be moved across a pin-limited memory channel to the CPU before any computation can take place. This requires a high latency and energy overhead, and the data often cannot benefit from caching in the CPU, making it difficult to amortize the overhead. Modern 3D-stacked DRAM architectures include a logic layer, where compute logic can be integrated underneath multiple layers of DRAM cell arrays within the same chip. Architects can take advantage of the logic layer to perform processing-in-memory (PIM), or near-data processing. In a PIM architecture, the logic layer within DRAM has access to the high internal bandwidth available within 3D-stacked DRAM (which is much greater than the bandwidth available between DRAM and the CPU). Thus, PIM architectures can effectively free up valuable memory channel bandwidth while reducing system energy consumption. A number of important issues arise when we add compute logic to DRAM. In particular, the logic does not have low-latency access to common CPU structures that are essential for modern application execution, such as the virtual memory and cache coherence mechanisms. To ease the widespread adoption of PIM, we ideally would like to maintain traditional virtual memory abstractions and the shared memory programming model. This requires efficient mechanisms that can provide logic in DRAM with access to CPU structures without having to communicate frequently with the CPU. To this end, we propose and evaluate two general-purpose solutions that minimize unnecessary off-chip communication for PIM architectures. We show that both mechanisms improve the performance and energy consumption of many important memory-intensive applications.

Submitted 1 February, 2018; originally announced February 2018.

arXiv:1801.03493 [pdf, other]

Focus: Querying Large Video Datasets with Low Latency and Low Cost

Authors: Kevin Hsieh, Ganesh Ananthanarayanan, Peter Bodik, Paramvir Bahl, Matthai Philipose, Phillip B. Gibbons, Onur Mutlu

Abstract: Large volumes of videos are continuously recorded from cameras deployed for traffic control and surveillance with the goal of answering "after the fact" queries: identify video frames with objects of certain classes (cars, bags) from many days of recorded video. While advancements in convolutional neural networks (CNNs) have enabled answering such queries with high accuracy, they are too expensive and slow. We build Focus, a system for low-latency and low-cost querying on large video datasets. Focus uses cheap ingestion techniques to index the videos by the objects occurring in them. At ingest-time, it uses compression and video-specific specialization of CNNs. Focus handles the lower accuracy of the cheap CNNs by judiciously leveraging expensive CNNs at query-time. To reduce query time latency, we cluster similar objects and hence avoid redundant processing. Using experiments on video streams from traffic, surveillance and news channels, we see that Focus uses 58X fewer GPU cycles than running expensive ingest processors and is 37X faster than processing all the video at query time.

Submitted 10 January, 2018; originally announced January 2018.

arXiv:1801.01796 [pdf, other]

Spatially Coupled Sparse Regression Codes: Design and State Evolution Analysis

Authors: Kuan Hsieh, Cynthia Rush, Ramji Venkataramanan

Abstract: We consider the design and analysis of spatially coupled sparse regression codes (SC-SPARCs), which were recently introduced by Barbier et al. for efficient communication over the additive white Gaussian noise channel. SC-SPARCs can be efficiently decoded using an Approximate Message Passing (AMP) decoder, whose performance in each iteration can be predicted via a set of equations called state evolution. In this paper, we give an asymptotic characterization of the state evolution equations for SC-SPARCs. For any given base matrix (that defines the coupling structure of the SC-SPARC) and rate, this characterization can be used to predict whether or not AMP decoding will succeed in the large system limit. We then consider a simple base matrix defined by two parameters $(ω, Λ)$, and show that AMP decoding succeeds in the large system limit for all rates $R < \mathcal{C}$. The asymptotic result also indicates how the parameters of the base matrix affect the decoding progression. Simulation results are presented to evaluate the performance of SC-SPARCs defined with the proposed base matrix.

Submitted 26 April, 2018; v1 submitted 5 January, 2018; originally announced January 2018.

Comments: 8 pages, 6 figures. A shorter version of this paper to appear in ISIT 2018

arXiv:1711.03906 [pdf, other]

doi 10.1145/3084041.3084049

D-SLATS: Distributed Simultaneous Localization and Time Synchronization

Authors: Amr Alanwar, Henrique Ferraz, Kevin Hsieh, Rohit Thazhath, Paul Martin, Joao Hespanha, Mani Srivastava

Abstract: Through the last decade, we have witnessed a surge of Internet of Things (IoT) devices, and with that a greater need to choreograph their actions across both time and space. Although these two problems, namely time synchronization and localization, share many aspects in common, they are traditionally treated separately or combined in centralized approaches that result in an inefficient use of resources, or in solutions that are not scalable in terms of the number of IoT devices. Therefore, we propose D-SLATS, a framework comprised of three different and independent algorithms to jointly solve time synchronization and localization problems in a distributed fashion. The first two algorithms are based mainly on the distributed Extended Kalman Filter (EKF) whereas the third one uses optimization techniques. No fusion center is required, and the devices only communicate with their neighbors. The proposed methods are evaluated on a custom ultra-wideband communication testbed and a quadrotor, representing a network of both static and mobile nodes. Our algorithms achieve up to three microseconds time synchronization accuracy and 30 cm localization error.

Submitted 10 November, 2017; originally announced November 2017.

arXiv:1706.03162 [pdf, other]

LazyPIM: Efficient Support for Cache Coherence in Processing-in-Memory Architectures

Authors: Amirali Boroumand, Saugata Ghose, Minesh Patel, Hasan Hassan, Brandon Lucia, Nastaran Hajinazar, Kevin Hsieh, Krishna T. Malladi, Hongzhong Zheng, Onur Mutlu

Abstract: Processing-in-memory (PIM) architectures have seen an increase in popularity recently, as the high internal bandwidth available within 3D-stacked memory provides greater incentive to move some computation into the logic layer of the memory. To maintain program correctness, the portions of a program that are executed in memory must remain coherent with the portions of the program that continue to execute within the processor. Unfortunately, PIM architectures cannot use traditional approaches to cache coherence due to the high off-chip traffic consumed by coherence messages, which, as we illustrate in this work, can undo the benefits of PIM execution for many data-intensive applications. We propose LazyPIM, a new hardware cache coherence mechanism designed specifically for PIM. Prior approaches for coherence in PIM are ill-suited to applications that share a large amount of data between the processor and the PIM logic. LazyPIM uses a combination of speculative cache coherence and compressed coherence signatures to greatly reduce the overhead of keeping PIM coherent with the processor, even when a large amount of sharing exists. We find that LazyPIM improves average performance across a range of data-intensive PIM applications by 19.6%, reduces off-chip traffic by 30.9%, and reduces energy consumption by 18.0%, over the best prior approaches to PIM coherence.

Submitted 9 June, 2017; originally announced June 2017.

arXiv:1102.5068 [pdf, other]

Mathematica with ROOT

Authors: Ken Hsieh, Thomas G. Throwe, Sebastian White

Abstract: We present an open-source Mathematica importer for CERN ROOT files. Taking advantage of Mathematica's import/export plug-in mechanism, the importer offers a simple, unified interface that cleanly wraps around its MathLink-based core that links the ROOT libraries with Mathematica. Among other tests for accuracy and efficiency, the importer has also been tested on a large (~5 Gbyte) file structure, D3PD, used by the ATLAS experiment for offline analysis without problems. In addition to describing the installation and usage of the importer, we discuss how the importer may be further improved and customized. A link to the package can be found at: http://library.wolfram.com/infocenter/Articles/7793/ and a related presentation is at: http://cd-docdb.fnal.gov/cgi-bin/DisplayMeeting?conferenceid=522

Submitted 6 March, 2011; v1 submitted 24 February, 2011; originally announced February 2011.

Comments: 13 pages, 4 figures, corrected typos and updated references

arXiv:1003.3482 [pdf, other]

doi 10.1103/PhysRevD.82.035011

Global Analysis of General SU(2) x SU(2) x U(1) Models with Precision Data

Authors: Ken Hsieh, Kai Schmitz, Jiang-Hao Yu, C. -P. Yuan

Abstract: We present the results of a global analysis of a class of models with an extended electroweak gauge group of the form SU(2) x SU(2) x U(1), often denoted as G(221) models, which include as examples the left-right, the lepto-phobic, the hadro-phobic, the fermio-phobic, the un-unified, and the non-universal models. Using an effective Lagrangian approach, we compute the shifts to the coefficients in the electroweak Lagrangian due to the new heavy gauge bosons, and obtain the lower bounds on the masses of the Z' and W' bosons. The analysis of the electroweak parameter bounds reveals a consistent pattern of several key observables that are especially sensitive to the effects of new physics and thus dominate the overall shape of the respective parameter contours.

Submitted 17 March, 2010; originally announced March 2010.

Comments: 46 pages, 7 figures, and 11 tables

Report number: MSUHEP-091123, DESY 09-205

Journal ref: Phys.Rev.D82:035011,2010

arXiv:0902.3910 [pdf, ps, other]

doi 10.1103/PhysRevD.79.075016

Z to b bbar and Chiral Currents in Higgsless Models

Authors: Tomohiro Abe, R. Sekhar Chivukula, Neil D. Christensen, Ken Hsieh, Shinya Matsuzaki, Elizabeth H. Simmons, Masaharu Tanabashi

Abstract: In this note we compute the flavor-dependent chiral-logarithmic corrections to the decay Z to b bbar in the three site Higgsless model. We compute these corrections diagrammatically in the "gaugeless" limit in which the electroweak couplings vanish. We also compute the chiral-logarithmic corrections to the decay Z to b bbar using an RGE analysis in effective field theory, and show that the results agree. In the process of this computation, we compute the form of the chiral current in the gaugeless limit of the three-site model, and consider the generalization to the N-site case. We elucidate the Ward-Takahashi identities which underlie the gaugeless limit calculation in the three-site model, and describe how the result for the Z to b bbar amplitude is obtained in unitary gauge in the full theory. We find that the phenomenological constraints on the three-site Higgsless model arising from measurements of Z to b bbar are relatively mild, requiring only that the heavy Dirac fermion be heavier than 1 TeV or so, and are satisfied automatically in the range of parameters allowed by other precision electroweak data.

Submitted 25 March, 2009; v1 submitted 23 February, 2009; originally announced February 2009.

Comments: 19 pages, 7 embedded eps figures (additional reference added)

Report number: NTLP 2008-04 and MSUHEP-090223

Journal ref: Phys.Rev.D79:075016,2009

arXiv:0902.3621 [pdf, ps, other]

doi 10.1088/0004-637X/694/1/L79

A Re-interpretation of the STEREO/STE Observations and it's Consequences

Authors: K. C. Hsieh, P. C. Frisch, J. Giacalone, J. R. Jokipii, J. Kota, D. E. Larson, R. P Lin, J. G. Luhmann, L. Wang

Abstract: We present an alternate interpretation of recent STEREO/STE observations that were originally attributed to energetic neutral atoms (ENA) from the heliosheath. The signal attributed to the diffuse ENA source instead shows the characteristics of a point source. We point out that the peak intensity seen by STEREO/STE is centered at the ecliptic longitude of the bright X-ray source Sco X-1. The observed energy spectrum and intensity are also consistent with the X-rays from Sco X-1. The problem of energy dissipation at the solar wind termination shock remains unsolved while current understanding of the interaction between the solar wind and interstellar wind awaits future observations.

Submitted 20 February, 2009; originally announced February 2009.

Comments: Accepted by ApJL

arXiv:0806.2608 [pdf, other]

doi 10.1103/PhysRevD.78.053006

Lone Higgs at the LHC

Authors: Ken Hsieh, C. -P. Yuan

Abstract: We address the possible scenario that the Large Hadron Collider (LHC) discovers only a Higgs boson after 10 fb^{-1} of operation, and attempt to identify this Higgs boson as that of the Standard Model (SM), the minimal universal extra dimension model (MUED), the littlest Higgs model with T-parity (LHT), or the minimal supersymmetric Standard Model (MSSM), using only the measurement of the product of gluon-fusion production cross section and the di-photon branching ratio. In MUED, by decoupling any new physics sufficiently to evade the discovery reach at the LHC, the deviation of the signal from the SM is not statistically significant. However, in LHT and MSSM, it is possible to have a significant deviation in the signal that is consistent with this "lone Higgs scenario", and, in the case of a very large suppression, we can distinguish MSSM and LHT before the discovery of any new resonances. Starting with the lone Higgs scenario and the deviation in this measurement from the Standard Model prediction (whether or not statistically significant), we offer tests that may discriminate the models and search strategies of discovering new physics signatures with increasing integrated luminosity.

Submitted 19 September, 2008; v1 submitted 16 June, 2008; originally announced June 2008.

Comments: 32 pages, 25 figures, PRD version

Report number: MSUHEP-080606

Journal ref: Phys.Rev.D78:053006,2008

arXiv:0805.2623 [pdf, other]

doi 10.1103/PhysRevD.78.055016

Triplet Extended Supersymmetric Standard Model

Authors: Stefano Di Chiara, Ken Hsieh

Abstract: We revisit an extension of the MSSM by adding a hypercharge-neutral, SU(2)-triplet chiral superfield. Similar to the NMSSM, the triplet gives an additional contribution to the quartic coupling in the Higgs potential, and the mass of the lightest CP-even Higgs boson can be greater than the mass of the Z-boson at tree-level. In addition to discussing the perturbativity, fine-tuning, and decoupling issues of this model, we compute the dominant 1-loop corrections to the mass of the lightest CP-even Higgs boson from the triplet sector. When the Higgs-Higgs-Triplet coupling in the superpotential is comparable to the top Yukawa coupling, we find that the Higgs mass can be as heavy as 140 GeV even without the traditional contributions from the top--s-top sector, and at the same time consistent with the precision electroweak constraints. At the expense of having Landau poles before the GUT scale, this opens up a previously forbidden region in the MSSM parameter space where both s-tops are light. In addition to having relatively small fine-tuning (about one part in 30), this leads to a gluo-philic Higgs boson whose production via gluon-gluon fusion at the LHC can be twice as large as the SM prediction.

Submitted 8 September, 2008; v1 submitted 16 May, 2008; originally announced May 2008.

Comments: 26 pages, 19 figures. Errors on the RGEs corrected and consequential changes applied. Plots updated and discussion broadened. Version to appear in PRD

Journal ref: Phys.Rev.D78:055016,2008

arXiv:0708.3970 [pdf, ps, other]

doi 10.1103/PhysRevD.77.015004

Pseudo-Dirac Bino Dark Matter

Authors: Ken Hsieh

Abstract: While the bino-dominated lightest neutralino of the minimal supersymmetric Standard Model (MSSM) is an interesting and widely-studied dark matter candidate, the p-wave suppression of its annihilation cross section requires fine-tunings of the MSSM spectra to be consistent with WMAP observations. We propose a pseudo-Dirac bino that arises in theories with D-type supersymmetry-breaking as an intriguing alternative dark matter candidate. The pseudo-Dirac nature of the bino gives a natural mechanism of enhanced co-annihilation because these two states are degenerate in the absence of electroweak symmetry breaking. In addition, the lightest state can be consistent with limits of direct detection experiments because of the lack of vector interactions, as with the case of the MSSM bino.

Submitted 8 January, 2008; v1 submitted 29 August, 2007; originally announced August 2007.

Comments: 18 pages, 2 figures, REVTEX, to be published in PRD, made minor changes and added comments to match the published version

Report number: UMD-PP-07-004, MSU-HEP-07-08-28

Journal ref: Phys.Rev.D77:015004,2008

arXiv:hep-ph/0610155 [pdf, ps, other]

doi 10.1088/1126-6708/2006/12/067

Mixed Dark Matter in Universal Extra Dimension Models with TeV Scale $W_{R}$ and $Z'$

Authors: Ken Hsieh, R. N. Mohapatra, Salah Nasri

Abstract: We show that in a class of universal extra dimension (UED) models that solves both the neutrino mass and proton decay problems using low scale left-right symmetry, the dark matter of the Universe consists of an admixture of KK photon and KK right-handed neutrinos. We present a full calculation of the dark matter density in these models taking into account the co-annihilation effects due to nearby states such as the scalar partner of the KK photon as well as fermion states near the right-handed KK neutrino. Using the value of the relic CDM density, we obtain upper limits on $R^{-1}$ of about 400-650 GeV and $M_{Z'}\leq 1.5$ TeV, both being accessible to LHC. For a region in this parameter space where the KK right-handed neutrino contributes significantly to the total relic density of dark matter, we obtain a lower bound on the dark matter-nucleon scattering cross section of $10^{-44}$ cm$^2$, which can be probed by the next round of dark matter search experiments.

Submitted 16 October, 2006; v1 submitted 12 October, 2006; originally announced October 2006.

Comments: 28 pages, 7 figures

Report number: UFIFT-HEP-06-16, UMD-HEP-06-055

Journal ref: JHEP0612:067,2006

arXiv:hep-ph/0604256 [pdf, ps, other]

doi 10.1088/1126-6708/2007/06/062

Mixed Gauge and Anomaly Mediation From New Physics at 10 TeV

Authors: Ken Hsieh, Markus A. Luty

Abstract: In the context of anomaly-mediated supersymmetry breaking, it is natural for vectorlike fields and singlets to have supersymmetry breaking masses of order 10 TeV, and therefore act as messengers of supersymmetry breaking. We show that this can give rise to phenomenologically viable spectra compatible with perturbative gauge coupling unification. The minimal model interpolates continuously between pure anomaly mediation and gauge mediation with a messenger scale of order 10 TeV. It is also possible to have non-minimal models with more degenerate spectra, with some squarks lighter than sleptons. These models reduce to the MSSM at low energies and incorporate a natural solution of the mu problem. The minimal model has four continuous parameters and one discrete parameter (the number of messengers). The LEP Higgs mass bound can be satisfied in the minimal model by tuning parameters at the GUT scale to one part in 50.

Submitted 27 April, 2006; originally announced April 2006.

Comments: 17 pages, 4 figures

Report number: UMD-PP-06-007

Journal ref: JHEP 0706:062,2007

arXiv:hep-ph/0604154 [pdf, ps, other]

doi 10.1103/PhysRevD.74.066004

Dark Matter in Universal Extra Dimension Models: $γ_{KK}$ vrs $ν_{R,KK}$

Authors: Ken Hsieh, R. N. Mohapatra, Salah Nasri

Abstract: We show that in a class of universal extra dimension models (UED), which solves both the neutrino mass and proton decay problem, an admixture of KK photon and KK right handed neutrinos can provide the required amount of cold dark matter (CDM). This model has two parameters $R^{-1}$ and $M_{Z'}$ ($R$ is the radius of the extra space dimensions and $Z'$ the extra neutral gauge boson of the model). Using the value of the relic CDM density, combined with the results from the cryogenic searches for CDM, we obtain upper limits on $R^{-1}$ of about 400-650 GeV and $M_{Z'}\leq 1.5$ TeV, both being accessible to LHC. In some regions of the parameter space, the dark matter-nucleon scattering cross section can be as high as $10^{-44}$ cm$^2$, which can be probed by the next round of dark matter search experiments.

Submitted 1 September, 2006; v1 submitted 18 April, 2006; originally announced April 2006.

Comments: 13 pages, 2 figures; minor changes; to appear in Phys. Rev.D

Report number: UMD-PP-06-004

Journal ref: Phys.Rev.D74:066004,2006

Showing 1–47 of 47 results for author: Hsieh, K