-
Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration
Authors:
Han-Cheng Yu,
Yu-An Shih,
Kin-Man Law,
Kai-Yu Hsieh,
Yu-Chen Cheng,
Hsin-Chih Ho,
Zih-An Lin,
Wen-Chuan Hsu,
Yao-Chung Fan
Abstract:
In this paper, we tackle the task of distractor generation (DG) for multiple-choice questions. Our study introduces two key designs. First, we propose \textit{retrieval augmented pretraining}, which involves refining the language model pretraining to align it more closely with the downstream task of DG. Second, we explore the integration of knowledge graphs to enhance the performance of DG. Through experiments with benchmarking datasets, we show that our models significantly outperform the state-of-the-art results. Our best-performing model advances the F1@3 score from 14.80 to 16.47 in MCQ dataset and from 15.92 to 16.50 in Sciq dataset.
Submitted 19 June, 2024;
originally announced June 2024.
-
Direct view of gate-tunable miniband dispersion in graphene superlattices near the magic twist angle
Authors:
Zhihao Jiang,
Dongkyu Lee,
Alfred J. H. Jones,
Youngju Park,
Kimberly Hsieh,
Paulina Majchrzak,
Chakradhar Sahoo,
Thomas S. Nielsen,
Kenji Watanabe,
Takashi Taniguchi,
Philip Hofmann,
Jill A. Miwa,
Yong P. Chen,
Jeil Jung,
Søren Ulstrup
Abstract:
Superlattices from twisted graphene mono- and bi-layer systems give rise to on-demand quantum many-body states such as Mott insulators, unconventional superconductors and the fractional quantum Hall effect. These phenomena are observed in transport experiments when changing the filling of the low-energy electronic bands. Their origin is broadly ascribed to a combination of flat bands and strong Coulomb interactions, yet a comprehensive understanding is lacking. This is primarily because the relevant low-energy band structure is believed to strongly change in a non-trivial way as the electron filling is varied. Here we gain direct access to the filling-dependent low energy bands of twisted bilayer graphene (TBG) and twisted double bilayer graphene (TDBG) by applying micro-focused angle-resolved photoemission spectroscopy to in situ gated devices. Our findings for the two systems are in stark contrast: The doping dependent dispersion for TBG can be described in a simple model, combining a filling-dependent rigid band shift with a many-body related bandwidth change. In TDBG, on the other hand, we find a complex behaviour of the low-energy bands, combining non-monotonous bandwidth changes and tuneable gap openings. Our work establishes the extent of electric field tunability of the low energy electronic states in twisted graphene superlattices and can serve to underpin the theoretical understanding of the resulting phenomena.
Submitted 27 May, 2024;
originally announced May 2024.
-
Coded Many-User Multiple Access via Approximate Message Passing
Authors:
Xiaoqi Liu,
Kuan Hsieh,
Ramji Venkataramanan
Abstract:
We consider communication over the Gaussian multiple-access channel in the regime where the number of users grows linearly with the codelength. In this regime, schemes based on sparse superposition coding can achieve a near-optimal tradeoff between spectral efficiency and signal-to-noise ratio. However, these schemes are feasible only for small values of user payload. This paper investigates efficient schemes for larger user payloads, focusing on coded CDMA schemes where each user's information is encoded via a linear code before being modulated with a signature sequence. We propose an efficient approximate message passing (AMP) decoder that can be tailored to the structure of the linear code, and provide an exact asymptotic characterization of its performance. Based on this result, we consider a decoder that integrates AMP and belief propagation and characterize its tradeoff between spectral efficiency and signal-to-noise ratio, for a given target error rate. Simulation results show that the decoder achieves state-of-the-art performance at finite lengths, with a coded CDMA scheme defined using LDPC codes and a spatially coupled matrix of signature sequences.
Submitted 1 July, 2024; v1 submitted 8 February, 2024;
originally announced February 2024.
-
Revealing flat bands and hybridization gaps in a twisted bilayer graphene device with microARPES
Authors:
Zhihao Jiang,
Kimberly Hsieh,
Alfred J. H. Jones,
Paulina Majchrzak,
Chakradhar Sahoo,
Kenji Watanabe,
Takashi Taniguchi,
Jill A. Miwa,
Yong P. Chen,
Søren Ulstrup
Abstract:
Controlling the electronic structure of two-dimensional materials using the combination of twist angle and electrostatic doping is an effective means to induce emergent phenomena. In bilayer graphene with an interlayer twist angle near the magic angle, the electronic dispersion is strongly modified by a manifold of hybridizing moiré Dirac cones leading to flat band segments with strong electronic correlations. Numerous technical challenges arising from spatial inhomogeneity of interlayer interactions, twist angle and device functionality have so far limited momentum-resolved electronic structure measurements of these systems to static conditions. Here, we present a detailed characterization of the electronic structure exhibiting miniband dispersions for twisted bilayer graphene, near the magic angle, integrated in a functional device architecture using micro-focused angle-resolved photoemission spectroscopy. The optimum conditions for visualizing the miniband dispersion are determined by exploiting the spatial resolution and photon energy tunability of the light source and applied to extract a hybridization gap size of $(0.14 \pm 0.03)$~eV and flat band segments extending across a moiré mini Brillouin zone. \textit{In situ} electrostatic gating of the sample enables significant electron-doping, causing the conduction band states to shift below the Fermi energy. Our work emphasizes key challenges in probing the electronic structure of magic angle bilayer graphene devices and outlines conditions for exploring the doping-dependent evolution of the dispersion that underpins the ability to control many-body interactions in the material.
Submitted 4 February, 2024;
originally announced February 2024.
-
Health Guardian: Using Multi-modal Data to Understand Individual Health
Authors:
Vince S. Siu,
Kuan Yu Hsieh,
Italo Buleje,
Takashi Itoh,
Tian Hao,
Ben Civjan,
Nigel Hinds,
Bing Dang,
Jeffrey L. Rogers,
Bo Wen
Abstract:
Artificial intelligence (AI) has shown great promise in revolutionizing the field of digital health by improving disease diagnosis, treatment, and prevention. This paper describes the Health Guardian platform, a non-commercial, scientific research-based platform developed by the IBM Digital Health team to rapidly translate AI research into cloud-based microservices. The platform can collect health-related data from various digital devices, including wearables and mobile applications. Its flexible architecture supports microservices that accept diverse data types such as text, audio, and video, expanding the range of digital health assessments and enabling holistic health evaluations by capturing voice, facial, and motion bio-signals. These microservices can be deployed to a clinical cohort specified through the Clinical Task Manager (CTM). The CTM then collects multi-modal, clinical data that can iteratively improve the accuracy of AI predictive models, discover new disease mechanisms, or identify novel biomarkers. This paper highlights three microservices with different input data types, including a text-based microservice for depression assessment, a video-based microservice for sit-to-stand mobility assessment, and a wearable-based microservice for functional mobility assessment. The CTM is also discussed as a tool to help design and set up clinical studies to unlock the full potential of the platform. Today, the Health Guardian platform is being leveraged in collaboration with research partners to optimize the development of AI models by utilizing a multitude of input sources. This approach streamlines research efforts, enhances efficiency, and facilitates the development and validation of digital health applications.
Submitted 2 October, 2023;
originally announced October 2023.
-
A Versatile Data Fabric for Advanced IoT-Based Remote Health Monitoring
Authors:
Italo Buleje,
Vince S. Siu,
Kuan Yu Hsieh,
Nigel Hinds,
Bing Dang,
Erhan Bilal,
Thanhnha Nguyen,
Ellen E. Lee,
Colin A. Depp,
Jeffrey L. Rogers
Abstract:
This paper presents a data-centric and security-focused data fabric designed for digital health applications. With the increasing interest in digital health research, there has been a surge in the volume of Internet of Things (IoT) data derived from smartphones, wearables, and ambient sensors. Managing this vast amount of data, encompassing diverse data types and varying time scales, is crucial. Moreover, compliance with regulatory and contractual obligations is essential. The proposed data fabric comprises an architecture and a toolkit that facilitate the integration of heterogeneous data sources, across different environments, to provide a unified view of the data in dashboards. Furthermore, the data fabric supports the development of reusable and configurable data integration components, which can be shared as open-source or inner-source software. These components are used to generate data pipelines that can be deployed and scheduled to run either in the cloud or on-premises. Additionally, we present the implementation of our data fabric in a home-based telemonitoring research project involving older adults, conducted in collaboration with the University of California, San Diego (UCSD). The study showcases the streamlined integration of data collected from various IoT sensors and mobile applications to create a unified view of older adults' health for further analysis and research.
Submitted 2 October, 2023;
originally announced October 2023.
-
Bayes-Optimal Estimation in Generalized Linear Models via Spatial Coupling
Authors:
Pablo Pascual Cobo,
Kuan Hsieh,
Ramji Venkataramanan
Abstract:
We consider the problem of signal estimation in a generalized linear model (GLM). GLMs include many canonical problems in statistical estimation, such as linear regression, phase retrieval, and 1-bit compressed sensing. Recent work has precisely characterized the asymptotic minimum mean-squared error (MMSE) for GLMs with i.i.d. Gaussian sensing matrices. However, in many models there is a significant gap between the MMSE and the performance of the best known feasible estimators. In this work, we address this issue by considering GLMs defined via spatially coupled sensing matrices. We propose an efficient approximate message passing (AMP) algorithm for estimation and prove that with a simple choice of spatially coupled design, the MSE of a carefully tuned AMP estimator approaches the asymptotic MMSE in the high-dimensional limit. To prove the result, we first rigorously characterize the asymptotic performance of AMP for a GLM with a generic spatially coupled design. This characterization is in terms of a deterministic recursion (`state evolution') that depends on the parameters defining the spatial coupling. Then, using a simple spatially coupled design and judicious choice of functions defining the AMP, we analyze the fixed points of the resulting state evolution and show that it achieves the asymptotic MMSE. Numerical results for phase retrieval and rectified linear regression show that spatially coupled designs can yield substantially lower MSE than i.i.d. Gaussian designs at finite dimensions when used with AMP algorithms.
Submitted 15 September, 2023;
originally announced September 2023.
-
Enhancing Network Management Using Code Generated by Large Language Models
Authors:
Sathiya Kumaran Mani,
Yajie Zhou,
Kevin Hsieh,
Santiago Segarra,
Ranveer Chandra,
Srikanth Kandula
Abstract:
Analyzing network topologies and communication graphs plays a crucial role in contemporary network management. However, the absence of a cohesive approach leads to a challenging learning curve, heightened errors, and inefficiencies. In this paper, we introduce a novel approach to facilitate a natural-language-based network management experience, utilizing large language models (LLMs) to generate task-specific code from natural language queries. This method tackles the challenges of explainability, scalability, and privacy by allowing network operators to inspect the generated code, eliminating the need to share network data with LLMs, and concentrating on application-specific requests combined with general program synthesis techniques. We design and evaluate a prototype system using benchmark applications, showcasing high accuracy, cost-effectiveness, and the potential for further enhancements using complementary program synthesis techniques.
Submitted 11 August, 2023;
originally announced August 2023.
-
Charge transfer-induced Lifshitz transition and magnetic symmetry breaking in ultrathin CrSBr crystals
Authors:
Marco Bianchi,
Kimberly Hsieh,
Esben Juel Porat,
Florian Dirnberger,
Julian Klein,
Kseniia Mosina,
Zdenek Sofer,
Alexander N. Rudenko,
Mikhail I. Katsnelson,
Yong P. Chen,
Malte Rösner,
Philip Hofmann
Abstract:
Ultrathin CrSBr flakes are exfoliated \emph{in situ} on Au(111) and Ag(111) and their electronic structure is studied by angle-resolved photoemission spectroscopy. The thin flakes' electronic properties are drastically different from those of the bulk material and also substrate-dependent. For both substrates, a strong charge transfer to the flakes is observed, partly populating the conduction band and giving rise to a highly anisotropic Fermi contour with an Ohmic contact to the substrate. The fundamental CrSBr band gap is strongly renormalized compared to the bulk. The charge transfer to the CrSBr flake is substantially larger for Ag(111) than for Au(111), but a rigid energy shift of the chemical potential is insufficient to describe the observed band structure modifications. In particular, the Fermi contour shows a Lifshitz transition, the fundamental band gap undergoes a transition from direct on Au(111) to indirect on Ag(111) and a doping-induced symmetry breaking between the intra-layer Cr magnetic moments further modifies the band structure. Electronic structure calculations can account for non-rigid Lifshitz-type band structure changes in thin CrSBr as a function of doping and strain. In contrast to undoped bulk band structure calculations that require self-consistent $GW$ theory, the doped thin film properties are well-approximated by density functional theory if local Coulomb interactions are taken into account on the mean-field level and the charge transfer is considered.
Submitted 24 July, 2023;
originally announced July 2023.
-
The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI
Authors:
Ahmed W. Moawad,
Anastasia Janas,
Ujjwal Baid,
Divya Ramakrishnan,
Rachit Saluja,
Nader Ashraf,
Leon Jekel,
Raisa Amiruddin,
Maruf Adewole,
Jake Albrecht,
Udunna Anazodo,
Sanjay Aneja,
Syed Muhammad Anwar,
Timothy Bergquist,
Evan Calabrese,
Veronica Chiang,
Verena Chung,
Gian Marco Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Juan Eugenio Iglesias,
Zhifan Jiang
, et al. (206 additional authors not shown)
Abstract:
The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and characterizes the challenging cases that impacted the performance of the winning algorithms. Untreated brain metastases on standard anatomic MRI sequences (T1, T2, FLAIR, T1PG) from eight contributed international datasets were annotated in stepwise method: published UNET algorithms, student, neuroradiologist, final approver neuroradiologist. Segmentations were ranked based on lesion-wise Dice and Hausdorff distance (HD95) scores. False positives (FP) and false negatives (FN) were rigorously penalized, receiving a score of 0 for Dice and a fixed penalty of 374 for HD95. Eight datasets comprising 1303 studies were annotated, with 402 studies (3076 lesions) released on Synapse as publicly available datasets to challenge competitors. Additionally, 31 studies (139 lesions) were held out for validation, and 59 studies (218 lesions) were used for testing. Segmentation accuracy was measured as rank across subjects, with the winning team achieving a LesionWise mean score of 7.9. Common errors among the leading teams included false negatives for small lesions and misregistration of masks in space. The BraTS-METS 2023 challenge successfully curated well-annotated, diverse datasets and identified common errors, facilitating the translation of BM segmentation across varied clinical environments and providing personalized volumetric reports to patients undergoing BM treatment.
Submitted 17 June, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
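The penalty rule stated in the BraTS-METS abstract (a Dice of 0 and a fixed HD95 of 374 for every false positive or false negative) can be illustrated with a minimal sketch. The function name `lesionwise_scores`, the input layout, and the choice to average over all lesions are assumptions made for illustration; the abstract does not specify how the challenge aggregates per-lesion values.

```python
from statistics import mean

# Penalty values taken from the abstract: FP and FN lesions score
# 0 for Dice and a fixed 374 for HD95.
DICE_PENALTY = 0.0
HD95_PENALTY = 374.0


def lesionwise_scores(matched, n_false_pos, n_false_neg):
    """Average lesion-wise Dice and HD95 for one case (hypothetical helper).

    matched      -- list of (dice, hd95) tuples for lesions that were both
                    annotated and predicted
    n_false_pos  -- number of predicted lesions with no annotated match
    n_false_neg  -- number of annotated lesions with no predicted match

    Assumption: every lesion, including penalized FP/FN lesions,
    contributes equally to a plain mean.
    """
    n_penalized = n_false_pos + n_false_neg
    dice_values = [d for d, _ in matched] + [DICE_PENALTY] * n_penalized
    hd95_values = [h for _, h in matched] + [HD95_PENALTY] * n_penalized
    if not dice_values:
        raise ValueError("no lesions to score")
    return mean(dice_values), mean(hd95_values)


if __name__ == "__main__":
    # Two well-segmented lesions, one missed lesion, one spurious lesion.
    dice, hd95 = lesionwise_scores([(0.8, 2.0), (0.6, 4.0)], 1, 1)
    print(f"lesion-wise Dice: {dice:.3f}, lesion-wise HD95: {hd95:.1f}")
```

Under this assumed averaging, a single missed or spurious lesion pulls the mean HD95 sharply upward, which is consistent with the abstract's note that false negatives on small lesions were a common source of error among leading teams.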
-
Mitigating the Performance Impact of Network Failures in Public Clouds
Authors:
Pooria Namyar,
Behnaz Arzani,
Daniel Crankshaw,
Daniel S. Berger,
Kevin Hsieh,
Srikanth Kandula,
Ramesh Govindan
Abstract:
Some faults in data center networks require hours to days to repair because they may need reboots, re-imaging, or manual work by technicians. To reduce traffic impact, cloud providers \textit{mitigate} the effect of faults, for example, by steering traffic to alternate paths. The state-of-art in automatic network mitigations uses simple safety checks and proxy metrics to determine mitigations. SWARM, the approach described in this paper, can pick orders of magnitude better mitigations by estimating end-to-end connection-level performance (CLP) metrics. At its core is a scalable CLP estimator that quickly ranks mitigations with high fidelity and, on failures observed at a large cloud provider, outperforms the state-of-the-art by over 700$\times$ in some cases.
Submitted 23 May, 2023;
originally announced May 2023.
-
Tuning atom-field interaction via phase shaping
Authors:
Y. -T. Cheng,
C. -H. Chien,
K. -M. Hsieh,
Y. -H. Huang,
P. Y. Wen,
W. -J. Lin,
Y. Lu,
F. Aziz,
C. -P. Lee,
K. -T. Lin,
C. -Y. Chen,
J. C. Chen,
C. -S. Chuu,
A. F. Kockum,
G. -D. Lin,
Y. -H. Lin,
I. -C. Hoi
Abstract:
A coherent electromagnetic field can be described by its amplitude, frequency, and phase. All these properties can influence the interaction between the field and an atom. Here we demonstrate the phase shaping of microwaves that are scattered by a superconducting artificial atom coupled to the end of a semi-infinite 1D transmission line. In particular, we input a weak exponentially rising pulse with phase modulation to a transmon qubit. We observe that field-atom interaction can be tuned from nearly full interaction (interaction efficiency, i.e., amount of the field energy interacting with the atom, of 94.5%) to effectively no interaction (interaction efficiency 3.5%).
Submitted 26 January, 2024; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Microwave amplification via interfering multi-photon processes in a half-waveguide quantum electrodynamics system
Authors:
Fahad Aziz,
Kuan Ting Lin,
Ping Yi Wen,
Samina,
Yu Chen Lin,
Emely Wiegand,
Ching-Ping Lee,
Yu-Ting Cheng,
Ching-Yeh Chen,
Chin-Hsun Chien,
Kai-Min Hsieh,
Yu-Huan Huang,
Ian Hou,
Jeng-Chung Chen,
Yen-Hsiang Lin,
Anton Frisk Kockum,
Guin Dar Lin,
Io-Chun Hoi
Abstract:
We investigate the amplification of a microwave probe signal by a superconducting artificial atom, a transmon, strongly coupled to the end of a one-dimensional semi-infinite transmission line. The end of the transmission line acts as a mirror for microwave fields. Due to the weak anharmonicity of the artificial atom, a strong pump field creates multi-photon excitations among the dressed states. Transitions between these dressed states, Rabi sidebands, give rise to either amplification or attenuation of the weak probe. We obtain a maximum amplitude amplification of about 18 %, higher than in any previous experiment with a single artificial atom, due to constructive interference between Rabi sidebands. We also characterize the noise properties of the system by measuring the spectrum of spontaneous emission.
Submitted 14 February, 2023;
originally announced February 2023.
-
Health Guardian Platform: A technology stack to accelerate discovery in Digital Health research
Authors:
Bo Wen,
Vince S. Siu,
Italo Buleje,
Kuan Yu Hsieh,
Takashi Itoh,
Lukas Zimmerli,
Nigel Hinds,
Elif Eyigoz,
Bing Dang,
Stefan von Cavallar,
Jeffrey L. Rogers
Abstract:
This paper highlights the design philosophy and architecture of the Health Guardian, a platform developed by the IBM Digital Health team to accelerate discoveries of new digital biomarkers and development of digital health technologies. The Health Guardian allows for rapid translation of artificial intelligence (AI) research into cloud-based microservices that can be tested with data from clinical cohorts to understand disease and enable early prevention. The platform can be connected to mobile applications, wearables, or Internet of things (IoT) devices to collect health-related data into a secure database. When the analytics are created, the researchers can containerize and deploy their code on the cloud using pre-defined templates, and validate the models using the data collected from one or more sensing devices. The Health Guardian platform currently supports time-series, text, audio, and video inputs with 70+ analytic capabilities and is used for non-commercial scientific research. We provide an example of the Alzheimer's disease (AD) assessment microservice which uses AI methods to extract linguistic features from audio recordings to evaluate an individual's mini-mental state, the likelihood of having AD, and to predict the onset of AD before turning the age of 85. Today, IBM research teams across the globe use the Health Guardian internally as a test bed for early-stage research ideas, and externally with collaborators to support and enhance AI model development and clinical study efforts.
Submitted 10 November, 2022;
originally announced November 2022.
-
Federated Learning under Distributed Concept Drift
Authors:
Ellango Jothimurugesan,
Kevin Hsieh,
Jianyu Wang,
Gauri Joshi,
Phillip B. Gibbons
Abstract:
Federated Learning (FL) under distributed concept drift is a largely unexplored area. Although concept drift is itself a well-studied phenomenon, it poses particular challenges for FL, because drifts arise staggered in time and space (across clients). To the best of our knowledge, this work is the first to explicitly study data heterogeneity in both dimensions. We first demonstrate that prior solutions to drift adaptation that use a single global model are ill-suited to staggered drifts, necessitating multiple-model solutions. We identify the problem of drift adaptation as a time-varying clustering problem, and we propose two new clustering algorithms for reacting to drifts based on local drift detection and hierarchical clustering. Empirical evaluation shows that our solutions achieve significantly higher accuracy than existing baselines, and are comparable to an idealized algorithm with oracle knowledge of the ground-truth clustering of clients to concepts at each time step.
Submitted 27 February, 2023; v1 submitted 1 June, 2022;
originally announced June 2022.
-
FedSpace: An Efficient Federated Learning Framework at Satellites and Ground Stations
Authors:
Jinhyun So,
Kevin Hsieh,
Behnaz Arzani,
Shadi Noghabi,
Salman Avestimehr,
Ranveer Chandra
Abstract:
Large-scale deployments of low Earth orbit (LEO) satellites collect massive amount of Earth imageries and sensor data, which can empower machine learning (ML) to address global challenges such as real-time disaster navigation and mitigation. However, it is often infeasible to download all the high-resolution images and train these ML models on the ground because of limited downlink bandwidth, sparse connectivity, and regularization constraints on the imagery resolution. To address these challenges, we leverage Federated Learning (FL), where ground stations and satellites collaboratively train a global ML model without sharing the captured images on the satellites. We show fundamental challenges in applying existing FL algorithms among satellites and ground stations, and we formulate an optimization problem which captures a unique trade-off between staleness and idleness. We propose a novel FL framework, named FedSpace, which dynamically schedules model aggregation based on the deterministic and time-varying connectivity according to satellite orbits. Extensive numerical evaluations based on real-world satellite images and satellite networks show that FedSpace reduces the training time by 1.7 days (38.6%) over the state-of-the-art FL algorithms.
Submitted 2 February, 2022;
originally announced February 2022.
-
Towards a Cost vs. Quality Sweet Spot for Monitoring Networks
Authors:
Nofel Yaseen,
Behnaz Arzani,
Krishna Chintalapudi,
Vaishnavi Ranganathan,
Felipe Frujeri,
Kevin Hsieh,
Daniel Berger,
Vincent Liu,
Srikanth Kandula
Abstract:
Continuously monitoring a wide variety of performance and fault metrics has become a crucial part of operating large-scale datacenter networks. In this work, we ask whether we can reduce the costs to monitor -- in terms of collection, storage and analysis -- by judiciously controlling how much and which measurements we collect. By positing that we can treat almost all measured signals as sampled time-series, we show that we can use signal processing techniques such as the Nyquist-Shannon theorem to avoid wasteful data collection. We show that large savings appear possible by analyzing tens of popular measurements from a production datacenter network. We also discuss the technical challenges that must be solved when applying these techniques in practice.
Submitted 11 October, 2021;
originally announced October 2021.
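As a rough illustration of the sampling idea in the abstract above, the sketch below estimates the lowest sampling rate that would still capture most of a measured signal's spectral energy, using the Nyquist criterion (sample at no less than twice the highest significant frequency). The function names, the 99% energy threshold, and the direct DFT are assumptions made for this example; the abstract does not describe the authors' actual procedure.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Return |X[k]| for k = 0 .. N//2 using a direct O(N^2) DFT."""
    n = len(samples)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(samples))
        mags.append(abs(acc))
    return mags


def min_sampling_rate(samples, sample_rate_hz, energy_fraction=0.99):
    """Estimate the lowest rate (Hz) that keeps `energy_fraction` of the
    non-DC spectral energy, via the Nyquist criterion (2 * f_max).

    `energy_fraction` is an illustrative threshold, not a value from the paper.
    """
    n = len(samples)
    power = [m * m for m in dft_magnitudes(samples)[1:]]  # skip the DC bin
    total = sum(power)
    if total < 1e-12:
        return 0.0  # effectively constant signal: nothing to resolve
    running = 0.0
    for i, p in enumerate(power, start=1):
        running += p
        if running >= energy_fraction * total:
            f_max = i * sample_rate_hz / n  # centre frequency of bin i
            return 2.0 * f_max
    return float(sample_rate_hz)


# A metric polled at 64 Hz whose only variation is a 4 Hz oscillation
# could, under this criterion, be polled at about 8 Hz instead.
signal = [math.sin(2 * math.pi * 4 * t / 64) for t in range(64)]
suggested_rate = min_sampling_rate(signal, sample_rate_hz=64)
```

In this toy case the collection rate drops by a factor of eight; real measurements are noisier, so the threshold would need tuning.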
-
Interpret-able feedback for AutoML systems
Authors:
Behnaz Arzani,
Kevin Hsieh,
Haoxian Chen
Abstract:
Automated machine learning (AutoML) systems aim to enable training machine learning (ML) models for non-ML experts. A shortcoming of these systems is that when they fail to produce a model with high accuracy, the user has no path to improve the model other than hiring a data scientist or learning ML -- this defeats the purpose of AutoML and limits its adoption. We introduce an interpretable data feedback solution for AutoML. Our solution suggests new data points for the user to label (without requiring a pool of unlabeled data) to improve the model's accuracy. Our solution analyzes how features influence the prediction among all ML models in an AutoML ensemble, and we suggest more data samples from feature ranges that have high variance in such analysis. Our evaluation shows that our solution can improve the accuracy of AutoML by 7-8% and significantly outperforms popular active learning solutions in data efficiency, all the while providing the added benefit of being interpretable.
Submitted 22 February, 2021;
originally announced February 2021.
-
Spontaneous time reversal symmetry breaking at individual grain boundaries in graphene
Authors:
Kimberly Hsieh,
Vidya Kochat,
Tathagata Biswas,
Chandra Sekhar Tiwary,
Abhishek Mishra,
Gopalakrishnan Ramalingam,
Aditya Jayaraman,
Kamanio Chattopadhyay,
Srinivasan Raghavan,
Manish Jain,
Arindam Ghosh
Abstract:
Graphene grain boundaries have attracted interest for their ability to host nearly dispersionless electronic bands and magnetic instabilities. Here, we employ quantum transport and universal conductance fluctuations (UCF) measurements to experimentally demonstrate a spontaneous breaking of time reversal symmetry (TRS) across individual GBs of chemical vapour deposited graphene. While quantum transport across the GBs indicates spin-scattering-induced dephasing, and hence formation of local magnetic moments, below $T\lesssim 4$ K, we observe complete lifting of TRS at high carrier densities ($n \gtrsim 5\times 10^{12}$cm$^{-2}$) and low temperature ($T\lesssim 2$ K). An unprecedented thirty times reduction in the UCF magnitude with increasing doping density further supports the possibility of an emergent frozen magnetic state at the GBs. Our experimental results suggest that realistic GBs of graphene can be a promising resource for new electronic phases and spin-based applications.
Submitted 30 March, 2021; v1 submitted 9 February, 2021;
originally announced February 2021.
-
Near-Optimal Coding for Many-user Multiple Access Channels
Authors:
Kuan Hsieh,
Cynthia Rush,
Ramji Venkataramanan
Abstract:
This paper considers the Gaussian multiple-access channel (MAC) in the asymptotic regime where the number of users grows linearly with the code length. We propose efficient coding schemes based on random linear models with approximate message passing (AMP) decoding and derive the asymptotic error rate achieved for a given user density, user payload (in bits), and user energy. The tradeoff between energy-per-bit and achievable user density (for a fixed user payload and target error rate) is studied, and it is demonstrated that in the large system limit, a spatially coupled coding scheme with AMP decoding achieves near-optimal tradeoffs for a wide range of user densities. Furthermore, in the regime where the user payload is large, we also study the tradeoff between energy-per-bit and spectral efficiency and discuss methods to reduce decoding complexity.
Submitted 9 March, 2022; v1 submitted 9 February, 2021;
originally announced February 2021.
-
Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers
Authors:
Romil Bhardwaj,
Zhengxu Xia,
Ganesh Ananthanarayanan,
Junchen Jiang,
Nikolaos Karianakis,
Yuanchao Shu,
Kevin Hsieh,
Victor Bahl,
Ion Stoica
Abstract:
Video analytics applications use edge compute servers for the analytics of the videos (for bandwidth and privacy). Compressed models that are deployed on the edge servers for inference suffer from data drift, where the live video data diverges from the training data. Continuous learning handles data drift by periodically retraining the models on new data. Our work addresses the challenge of jointly supporting inference and retraining tasks on edge servers, which requires navigating the fundamental tradeoff between the retrained model's accuracy and the inference accuracy. Our solution Ekya balances this tradeoff across multiple models and uses a micro-profiler to identify the models that will benefit the most by retraining. Ekya's accuracy gain compared to a baseline scheduler is 29% higher, and the baseline requires 4x more GPU resources to achieve the same accuracy as Ekya.
Submitted 18 December, 2020;
originally announced December 2020.
-
Drug repurposing for COVID-19 using graph neural network and harmonizing multiple evidence
Authors:
Kanglin Hsieh,
Yinyin Wang,
Luyao Chen,
Zhongming Zhao,
Sean Savitz,
Xiaoqian Jiang,
Jing Tang,
Yejin Kim
Abstract:
Amid the pandemic of 2019 novel coronavirus disease (COVID-19) caused by SARS-CoV-2, a vast amount of drug research for prevention and treatment has been quickly conducted, but these efforts have been unsuccessful thus far. Our objective is to prioritize repurposable drugs using a drug repurposing pipeline that systematically integrates multiple SARS-CoV-2 and drug interactions, deep graph neural networks, and in-vitro/population-based validations. We first collected all the available drugs (n = 3,635) involved in COVID-19 patient treatment through CTDbase. We built a SARS-CoV-2 knowledge graph based on the interactions among virus baits, host genes, pathways, drugs, and phenotypes. A deep graph neural network approach was used to derive the candidate representation based on the biological interactions. We prioritized the candidate drugs using clinical trial history, and then validated them with their genetic profiles, in vitro experimental efficacy, and electronic health records. We highlight the top 22 drugs including Azithromycin, Atorvastatin, Aspirin, Acetaminophen, and Albuterol. We further pinpointed drug combinations that may synergistically target COVID-19. In summary, we demonstrated that the integration of extensive interactions, deep neural networks, and rigorous validation can facilitate the rapid identification of candidate drugs for COVID-19 treatment. This is a post-peer-review, pre-copyedit version of an article published in Scientific Reports. The final authenticated version is available online at: https://www.nature.com/articles/s41598-021-02353-5
Submitted 1 February, 2022; v1 submitted 23 September, 2020;
originally announced September 2020.
-
Modulated Sparse Superposition Codes for the Complex AWGN Channel
Authors:
Kuan Hsieh,
Ramji Venkataramanan
Abstract:
This paper studies a generalization of sparse superposition codes (SPARCs) for communication over the complex additive white Gaussian noise (AWGN) channel. In a SPARC, the codebook is defined in terms of a design matrix, and each codeword is generated by multiplying the design matrix with a sparse message vector. In the standard SPARC construction, information is encoded in the locations of the non-zero entries of the message vector. In this paper we generalize the construction and consider modulated SPARCs, where information is encoded in both the locations and the values of the non-zero entries of the message vector. We focus on the case where the non-zero entries take values from a phase-shift keying (PSK) constellation. We propose a computationally efficient approximate message passing (AMP) decoder, and obtain analytical bounds on the state evolution parameters which predict the error performance of the decoder. Using these bounds we show that PSK-modulated SPARCs are asymptotically capacity achieving for the complex AWGN channel, with either spatial coupling or power allocation. We also provide numerical simulation results to demonstrate the error performance at finite code lengths. These results show that introducing modulation to the SPARC design can significantly reduce decoding complexity without sacrificing error performance.
Submitted 11 May, 2021; v1 submitted 20 April, 2020;
originally announced April 2020.
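The encoding step described in the abstract above (a codeword is the design matrix times a sparse message vector, with information carried by both the location and the PSK value of each non-zero entry) can be sketched as follows. This is a minimal illustration only: the one-non-zero-per-section layout, the complex Gaussian design matrix with variance 1/n, and all function names are assumptions for the example, and the AMP decoder is not shown.

```python
import cmath
import math
import random


def psk_constellation(k):
    """K-ary PSK symbols: unit-modulus points evenly spaced on the circle."""
    return [cmath.exp(2j * math.pi * i / k) for i in range(k)]


def make_message_vector(num_sections, section_size, k, rng):
    """Sparse message vector with one non-zero entry per section.

    The location carries log2(section_size) bits and the PSK value
    carries log2(k) bits (sectioned layout assumed for illustration).
    """
    symbols = psk_constellation(k)
    beta = [0j] * (num_sections * section_size)
    for sec in range(num_sections):
        loc = rng.randrange(section_size)
        beta[sec * section_size + loc] = rng.choice(symbols)
    return beta


def random_design_matrix(n, num_cols, rng):
    """i.i.d. complex Gaussian entries with variance 1/n (assumed scaling)."""
    s = math.sqrt(0.5 / n)
    return [[complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(num_cols)]
            for _ in range(n)]


def encode(design_matrix, beta):
    """Codeword = design matrix times the sparse message vector."""
    return [sum(a * b for a, b in zip(row, beta) if b != 0)
            for row in design_matrix]


rng = random.Random(0)
beta = make_message_vector(num_sections=4, section_size=8, k=4, rng=rng)
A = random_design_matrix(n=16, num_cols=len(beta), rng=rng)
codeword = encode(A, beta)  # length-16 complex codeword
```

With 4 sections of size 8 and 4-PSK values, each section carries 3 location bits plus 2 value bits, which is the sense in which modulation adds information without enlarging the design matrix.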
-
Evidence of Lifshitz transition in thermoelectric power of ultrahigh mobility bilayer graphene
Authors:
Aditya Jayaraman,
Kimberly Hsieh,
Bhaskar Ghawri,
Phanibhusan S. Mahapatra,
Arindam Ghosh
Abstract:
Resolving low-energy features in the density of states (DOS) holds the key to understanding a wide variety of rich novel phenomena in graphene based 2D heterostructures. Lifshitz transition in bilayer graphene (BLG) arising from trigonal warping has been established theoretically and experimentally. Nevertheless, the experimental realization of its effects on the transport properties has been challenging because of its relatively low energy scale ($\sim 1$ meV). In this work, we demonstrate that the thermoelectric power (TEP) can be used as an effective probe to investigate fine changes in the DOS of BLG. We observe additional entropy features in the vicinity of the charge neutrality point (CNP) in gapped BLG. This apparent violation of Mott formula can be explained quantitatively by considering the effects of trigonal warping, thereby serving as possible evidence of a Lifshitz transition.
Submitted 5 March, 2020;
originally announced March 2020.
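For readers unfamiliar with the Mott formula mentioned in the abstract above, its standard textbook form relates the thermoelectric power $S$ to the energy derivative of the conductivity $\sigma$ at the Fermi energy $E_F$. The abstract does not state which variant the authors use, so the expression below is given only as the commonly quoted form, with $k_B$ the Boltzmann constant, $T$ the temperature and $|e|$ the magnitude of the electron charge.

```latex
S_{\mathrm{Mott}} = -\frac{\pi^{2} k_{B}^{2} T}{3\,|e|}\,
  \frac{1}{\sigma}\,\frac{d\sigma}{dE}\bigg|_{E = E_F}
```

Because $S_{\mathrm{Mott}}$ depends only on the logarithmic derivative of $\sigma$, features in the measured TEP that this expression does not reproduce point to structure in the DOS that the conductivity alone does not resolve.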
-
Capacity-achieving Spatially Coupled Sparse Superposition Codes with AMP Decoding
Authors:
Cynthia Rush,
Kuan Hsieh,
Ramji Venkataramanan
Abstract:
Sparse superposition codes, also called sparse regression codes (SPARCs), are a class of codes for efficient communication over the AWGN channel at rates approaching the channel capacity. In a standard SPARC, codewords are sparse linear combinations of columns of an i.i.d. Gaussian design matrix, while in a spatially coupled SPARC the design matrix has a block-wise structure, where the variance of the Gaussian entries can be varied across blocks. A well-designed spatial coupling structure can significantly enhance the error performance of iterative decoding algorithms such as Approximate Message Passing (AMP).
In this paper, we obtain a non-asymptotic bound on the probability of error of spatially coupled SPARCs with AMP decoding. Applying this bound to a simple band-diagonal design matrix, we prove that spatially coupled SPARCs with AMP decoding achieve the capacity of the AWGN channel. The bound also highlights how the decay of error probability depends on each design parameter of the spatially coupled SPARC. An attractive feature of AMP decoding is that its asymptotic mean squared error (MSE) can be predicted via a deterministic recursion called state evolution. Our result provides the first proof that the MSE concentrates on the state evolution prediction for spatially coupled designs. Combined with the state evolution prediction, this result implies that spatially coupled SPARCs with the proposed band-diagonal design are capacity-achieving. Using the proof technique used to establish the main result, we also obtain a concentration inequality for the MSE of AMP applied to compressed sensing with spatially coupled design matrices. Finally we provide numerical simulation results that demonstrate the finite length error performance of spatially coupled SPARCs. The performance is compared with coded modulation schemes that use LDPC codes from the DVB-S2 standard.
Submitted 8 May, 2021; v1 submitted 18 February, 2020;
originally announced February 2020.
-
Machine Learning Systems for Highly-Distributed and Rapidly-Growing Data
Authors:
Kevin Hsieh
Abstract:
The usability and practicality of any machine learning (ML) applications are largely influenced by two critical but hard-to-attain factors: low latency and low cost. Unfortunately, achieving low latency and low cost is very challenging when ML depends on real-world data that are highly distributed and rapidly growing (e.g., data collected by mobile phones and video cameras all over the world). Such real-world data pose many challenges in communication and computation. For example, when training data are distributed across data centers that span multiple continents, communication among data centers can easily overwhelm the limited wide-area network bandwidth, leading to prohibitively high latency and high cost.
In this dissertation, we demonstrate that the latency and cost of ML on highly-distributed and rapidly-growing data can be improved by one to two orders of magnitude by designing ML systems that exploit the characteristics of ML algorithms, ML model structures, and ML training/serving data. We support this thesis statement with three contributions. First, we design a system that provides both low-latency and low-cost ML serving (inferencing) over large-scale and continuously-growing datasets, such as videos. Second, we build a system that makes ML training over geo-distributed datasets as fast as training within a single data center. Third, we present a first detailed study and a system-level solution on a fundamental and largely overlooked problem: ML training over non-IID (i.e., not independent and identically distributed) data partitions (e.g., facial images collected by cameras vary according to the demographics of each camera's location).
Submitted 18 October, 2019;
originally announced October 2019.
-
The Non-IID Data Quagmire of Decentralized Machine Learning
Authors:
Kevin Hsieh,
Amar Phanishayee,
Onur Mutlu,
Phillip B. Gibbons
Abstract:
Many large-scale machine learning (ML) applications need to perform decentralized learning over datasets generated at different devices and locations. Such datasets pose a significant challenge to decentralized learning because their different contexts result in significant data distribution skew across devices/locations. In this paper, we take a step toward better understanding this challenge by presenting a detailed experimental study of decentralized DNN training on a common type of data skew: skewed distribution of data labels across devices/locations. Our study shows that: (i) skewed data labels are a fundamental and pervasive problem for decentralized learning, causing significant accuracy loss across many ML applications, DNN models, training datasets, and decentralized learning algorithms; (ii) the problem is particularly challenging for DNN models with batch normalization; and (iii) the degree of data skew is a key determinant of the difficulty of the problem. Based on these findings, we present SkewScout, a system-level approach that adapts the communication frequency of decentralized learning algorithms to the (skew-induced) accuracy loss between data partitions. We also show that group normalization can recover much of the accuracy loss of batch normalization.
Submitted 18 August, 2020; v1 submitted 30 September, 2019;
originally announced October 2019.
-
Optimising Graphene Visibility in van der Waals Heterostructures
Authors:
Thanmay S. Menon,
Simli Mishra,
Vidhu Catherine Antony,
Kiranmayi Dixit,
Saloni Kakkar,
Tanweer Ahmed,
Saurav Islam,
Aditya Jayaraman,
Kimberly Hsieh,
Paritosh Karnatak,
Arindam Ghosh
Abstract:
Graphene constitutes one of the key elements in many functional van der Waals heterostructures. However, it has negligible optical visibility due to its monolayer nature. Here we study the visibility of graphene in various van der Waals heterostructures and include the effects of the source spectrum, oblique incidence and the spectral sensitivity of the detector to obtain a realistic model. A visibility experiment is performed at different wavelengths, resulting in a very good agreement with our calculations. This allows us to reliably predict the conditions for better visibility of graphene in van der Waals heterostructures. The framework and the codes provided in this work can be extended to study the visibility of any 2D material within an arbitrary van der Waals heterostructure.
Submitted 18 June, 2019; v1 submitted 27 March, 2019;
originally announced March 2019.
-
Flexible-Latency DRAM: Understanding and Exploiting Latency Variation in Modern DRAM Chips
Authors:
Kevin K. Chang,
Abhijith Kashyap,
Hasan Hassan,
Saugata Ghose,
Kevin Hsieh,
Donghyuk Lee,
Tianshi Li,
Gennady Pekhimenko,
Samira Khan,
Onur Mutlu
Abstract:
This article summarizes key results of our work on experimental characterization and analysis of latency variation and latency-reliability trade-offs in modern DRAM chips, which was published in SIGMETRICS 2016, and examines the work's significance and future potential.
The goal of this work is to (i) experimentally characterize and understand the latency variation across cells within a DRAM chip for these three fundamental DRAM operations, and (ii) develop new mechanisms that exploit our understanding of the latency variation to reliably improve performance. To this end, we comprehensively characterize 240 DRAM chips from three major vendors, and make six major new observations about latency variation within DRAM. Notably, we find that (i) there is large latency variation across the cells for each of the three operations; (ii) variation characteristics exhibit significant spatial locality: slower cells are clustered in certain regions of a DRAM chip; and (iii) the three fundamental operations exhibit different reliability characteristics when the latency of each operation is reduced.
Based on our observations, we propose Flexible-LatencY DRAM (FLY-DRAM), a mechanism that exploits latency variation across DRAM cells within a DRAM chip to improve system performance. The key idea of FLY-DRAM is to exploit the spatial locality of slower cells within DRAM, and access the faster DRAM regions with reduced latencies for the fundamental operations. Our evaluations show that FLY-DRAM improves the performance of a wide range of applications by 13.3%, 17.6%, and 19.5%, on average, for each of the three different vendors' real DRAM chips, in a simulated 8-core system.
Submitted 8 May, 2018;
originally announced May 2018.
-
Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance
Authors:
Nandita Vijaykumar,
Kevin Hsieh,
Gennady Pekhimenko,
Samira Khan,
Ashish Shrestha,
Saugata Ghose,
Adwait Jog,
Phillip B. Gibbons,
Onur Mutlu
Abstract:
The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of modern GPU programming models. This specification determines the parallelism, and hence performance, of the application during execution because the corresponding on-chip hardware resources are allocated and managed based on this specification. This tight-coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance. Zorua is a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to the programmer.
We demonstrate that by providing the illusion of more resources than physically available via controlled and coordinated virtualization, Zorua offers several important benefits: (i) Programming Ease. Zorua eases the burden on the programmer to provide code that is tuned to efficiently utilize the physically available on-chip resources. (ii) Portability. Zorua alleviates the necessity of re-tuning an application's resource usage when porting the application across GPU generations. (iii) Performance. By dynamically allocating resources and carefully oversubscribing them when necessary, Zorua improves or retains the performance of applications that are already highly tuned to best utilize the resources.
Submitted 2 May, 2018;
originally announced May 2018.
-
A Concept Learning Tool Based On Calculating Version Space Cardinality
Authors:
Kuo-Kai Hsieh,
Li-C. Wang
Abstract:
In this paper, we propose VeSC-CoL (Version Space Cardinality based Concept Learning) to deal with concept learning on extremely imbalanced datasets, especially when cross-validation is not a viable option. VeSC-CoL uses version space cardinality as a measure for model quality to replace cross-validation. Instead of naive enumeration of the version space, Ordered Binary Decision Diagram and Boolean Satisfiability are used to compute the version space. Experiments show that VeSC-CoL can accurately learn the target concept when sufficient computational resources are available.
Submitted 22 March, 2018;
originally announced March 2018.
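To make the notion of version space cardinality in the abstract above concrete, the sketch below counts hypotheses consistent with a set of labelled examples by naive enumeration, which is exactly the approach the paper says it avoids in favour of Ordered Binary Decision Diagrams and Boolean Satisfiability. The hypothesis class (conjunctions over Boolean features) and the function names are assumptions chosen for illustration, not taken from the paper.

```python
from itertools import product


def predicts_positive(hypothesis, x):
    """Evaluate a conjunctive hypothesis on one example.

    hypothesis[i] is 1 (feature must be 1), 0 (feature must be 0)
    or None (feature is ignored).
    """
    return all(h is None or h == xi for h, xi in zip(hypothesis, x))


def version_space_cardinality(examples, num_features):
    """Count conjunctive hypotheses consistent with every labelled example.

    Naive enumeration over all 3 ** num_features hypotheses; feasible only
    for a handful of features.
    """
    count = 0
    for hypothesis in product((0, 1, None), repeat=num_features):
        if all(predicts_positive(hypothesis, x) == label
               for x, label in examples):
            count += 1
    return count


# Two Boolean features, one positive and one negative example.
examples = [((1, 1), True), ((0, 1), False)]
remaining = version_space_cardinality(examples, num_features=2)
```

Here the nine possible conjunctions shrink to two consistent ones ("feature 0 is 1" with or without "feature 1 is 1"). A smaller count means the examples pin the concept down more tightly, which is the intuition behind using cardinality as a quality measure.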
-
Zorua: Enhancing Programming Ease, Portability, and Performance in GPUs by Decoupling Programming Models from Resource Management
Authors:
Nandita Vijaykumar,
Kevin Hsieh,
Gennady Pekhimenko,
Samira Khan,
Ashish Shrestha,
Saugata Ghose,
Phillip B. Gibbons,
Onur Mutlu
Abstract:
The application resource specification--a static specification of several parameters such as the number of threads and the scratchpad memory usage per thread block--forms a critical component of the existing GPU programming models. This specification determines the performance of the application during execution because the corresponding on-chip hardware resources are allocated and managed purely based on this specification. This tight coupling between the software-provided resource specification and resource management in hardware leads to significant challenges in programming ease, portability, and performance, as we demonstrate in this work.
Our goal in this work is to reduce the dependence of performance on the software-provided resource specification to simultaneously alleviate the above challenges. To this end, we introduce Zorua, a new resource virtualization framework, that decouples the programmer-specified resource usage of a GPU application from the actual allocation in the on-chip hardware resources. Zorua enables this decoupling by virtualizing each resource transparently to the programmer.
We demonstrate that by providing the illusion of more resources than physically available, Zorua offers several important benefits: (i) Programming Ease: Zorua eases the burden on the programmer to provide code that is tuned to efficiently utilize the physically available on-chip resources. (ii) Portability: Zorua alleviates the necessity of re-tuning an application's resource usage when porting the application across GPU generations. (iii) Performance: By dynamically allocating resources and carefully oversubscribing them when necessary, Zorua improves or retains the performance of applications that are already highly tuned to best utilize the resources. The holistic virtualization provided by Zorua has many other potential uses which we describe in this paper.
Submitted 7 February, 2018;
originally announced February 2018.
-
Enabling the Adoption of Processing-in-Memory: Challenges, Mechanisms, Future Research Directions
Authors:
Saugata Ghose,
Kevin Hsieh,
Amirali Boroumand,
Rachata Ausavarungnirun,
Onur Mutlu
Abstract:
Poor DRAM technology scaling over the course of many years has caused DRAM-based main memory to increasingly become a larger system bottleneck. A major reason for the bottleneck is that data stored within DRAM must be moved across a pin-limited memory channel to the CPU before any computation can take place. This requires a high latency and energy overhead, and the data often cannot benefit from caching in the CPU, making it difficult to amortize the overhead.
Modern 3D-stacked DRAM architectures include a logic layer, where compute logic can be integrated underneath multiple layers of DRAM cell arrays within the same chip. Architects can take advantage of the logic layer to perform processing-in-memory (PIM), or near-data processing. In a PIM architecture, the logic layer within DRAM has access to the high internal bandwidth available within 3D-stacked DRAM (which is much greater than the bandwidth available between DRAM and the CPU). Thus, PIM architectures can effectively free up valuable memory channel bandwidth while reducing system energy consumption.
A number of important issues arise when we add compute logic to DRAM. In particular, the logic does not have low-latency access to common CPU structures that are essential for modern application execution, such as the virtual memory and cache coherence mechanisms. To ease the widespread adoption of PIM, we ideally would like to maintain traditional virtual memory abstractions and the shared memory programming model. This requires efficient mechanisms that can provide logic in DRAM with access to CPU structures without having to communicate frequently with the CPU. To this end, we propose and evaluate two general-purpose solutions that minimize unnecessary off-chip communication for PIM architectures. We show that both mechanisms improve the performance and energy consumption of many important memory-intensive applications.
Submitted 1 February, 2018;
originally announced February 2018.
-
Focus: Querying Large Video Datasets with Low Latency and Low Cost
Authors:
Kevin Hsieh,
Ganesh Ananthanarayanan,
Peter Bodik,
Paramvir Bahl,
Matthai Philipose,
Phillip B. Gibbons,
Onur Mutlu
Abstract:
Large volumes of videos are continuously recorded from cameras deployed for traffic control and surveillance with the goal of answering "after the fact" queries: identify video frames with objects of certain classes (cars, bags) from many days of recorded video. While advancements in convolutional neural networks (CNNs) have enabled answering such queries with high accuracy, they are too expensive and slow. We build Focus, a system for low-latency and low-cost querying on large video datasets. Focus uses cheap ingestion techniques to index the videos by the objects occurring in them. At ingest-time, it uses compression and video-specific specialization of CNNs. Focus handles the lower accuracy of the cheap CNNs by judiciously leveraging expensive CNNs at query-time. To reduce query time latency, we cluster similar objects and hence avoid redundant processing. Using experiments on video streams from traffic, surveillance and news channels, we see that Focus uses 58X fewer GPU cycles than running expensive ingest processors and is 37X faster than processing all the video at query time.
Submitted 10 January, 2018;
originally announced January 2018.
-
Spatially Coupled Sparse Regression Codes: Design and State Evolution Analysis
Authors:
Kuan Hsieh,
Cynthia Rush,
Ramji Venkataramanan
Abstract:
We consider the design and analysis of spatially coupled sparse regression codes (SC-SPARCs), which were recently introduced by Barbier et al. for efficient communication over the additive white Gaussian noise channel. SC-SPARCs can be efficiently decoded using an Approximate Message Passing (AMP) decoder, whose performance in each iteration can be predicted via a set of equations called state evolution. In this paper, we give an asymptotic characterization of the state evolution equations for SC-SPARCs. For any given base matrix (that defines the coupling structure of the SC-SPARC) and rate, this characterization can be used to predict whether or not AMP decoding will succeed in the large system limit. We then consider a simple base matrix defined by two parameters $(ω, Λ)$, and show that AMP decoding succeeds in the large system limit for all rates $R < \mathcal{C}$. The asymptotic result also indicates how the parameters of the base matrix affect the decoding progression. Simulation results are presented to evaluate the performance of SC-SPARCs defined with the proposed base matrix.
Submitted 26 April, 2018; v1 submitted 5 January, 2018;
originally announced January 2018.
-
D-SLATS: Distributed Simultaneous Localization and Time Synchronization
Authors:
Amr Alanwar,
Henrique Ferraz,
Kevin Hsieh,
Rohit Thazhath,
Paul Martin,
Joao Hespanha,
Mani Srivastava
Abstract:
Through the last decade, we have witnessed a surge of Internet of Things (IoT) devices, and with that a greater need to choreograph their actions across both time and space. Although these two problems, namely time synchronization and localization, share many aspects in common, they are traditionally treated separately or combined in centralized approaches that result in an inefficient use of resources, or in solutions that are not scalable in terms of the number of IoT devices. Therefore, we propose D-SLATS, a framework comprised of three different and independent algorithms to jointly solve time synchronization and localization problems in a distributed fashion. The first two algorithms are based mainly on the distributed Extended Kalman Filter (EKF), whereas the third one uses optimization techniques. No fusion center is required, and the devices only communicate with their neighbors. The proposed methods are evaluated on a custom ultra-wideband communication testbed and a quadrotor, representing a network of both static and mobile nodes. Our algorithms achieve up to three microseconds time synchronization accuracy and 30 cm localization error.
Submitted 10 November, 2017;
originally announced November 2017.
-
LazyPIM: Efficient Support for Cache Coherence in Processing-in-Memory Architectures
Authors:
Amirali Boroumand,
Saugata Ghose,
Minesh Patel,
Hasan Hassan,
Brandon Lucia,
Nastaran Hajinazar,
Kevin Hsieh,
Krishna T. Malladi,
Hongzhong Zheng,
Onur Mutlu
Abstract:
Processing-in-memory (PIM) architectures have seen an increase in popularity recently, as the high internal bandwidth available within 3D-stacked memory provides greater incentive to move some computation into the logic layer of the memory. To maintain program correctness, the portions of a program that are executed in memory must remain coherent with the portions of the program that continue to execute within the processor. Unfortunately, PIM architectures cannot use traditional approaches to cache coherence due to the high off-chip traffic consumed by coherence messages, which, as we illustrate in this work, can undo the benefits of PIM execution for many data-intensive applications. We propose LazyPIM, a new hardware cache coherence mechanism designed specifically for PIM. Prior approaches for coherence in PIM are ill-suited to applications that share a large amount of data between the processor and the PIM logic. LazyPIM uses a combination of speculative cache coherence and compressed coherence signatures to greatly reduce the overhead of keeping PIM coherent with the processor, even when a large amount of sharing exists. We find that LazyPIM improves average performance across a range of data-intensive PIM applications by 19.6%, reduces off-chip traffic by 30.9%, and reduces energy consumption by 18.0%, over the best prior approaches to PIM coherence.
Submitted 9 June, 2017;
originally announced June 2017.
-
Mathematica with ROOT
Authors:
Ken Hsieh,
Thomas G. Throwe,
Sebastian White
Abstract:
We present an open-source Mathematica importer for CERN ROOT files. Taking advantage of Mathematica's import/export plug-in mechanism, the importer offers a simple, unified interface that cleanly wraps around its MathLink-based core that links the ROOT libraries with Mathematica. Among other tests for accuracy and efficiency, the importer has also been tested on a large (~5 Gbyte) file structure, D3PD, used by the ATLAS experiment for offline analysis without problems. In addition to describing the installation and usage of the importer, we discuss how the importer may be further improved and customized. A link to the package can be found at: http://library.wolfram.com/infocenter/Articles/7793/ and a related presentation is at: http://cd-docdb.fnal.gov/cgi-bin/DisplayMeeting?conferenceid=522
Submitted 6 March, 2011; v1 submitted 24 February, 2011;
originally announced February 2011.
-
Global Analysis of General SU(2) x SU(2) x U(1) Models with Precision Data
Authors:
Ken Hsieh,
Kai Schmitz,
Jiang-Hao Yu,
C. -P. Yuan
Abstract:
We present the results of a global analysis of a class of models with an extended electroweak gauge group of the form SU(2) x SU(2) x U(1), often denoted as G(221) models, which include as examples the left-right, the lepto-phobic, the hadro-phobic, the fermio-phobic, the un-unified, and the non-universal models. Using an effective Lagrangian approach, we compute the shifts to the coefficients in the electroweak Lagrangian due to the new heavy gauge bosons, and obtain the lower bounds on the masses of the Z' and W' bosons. The analysis of the electroweak parameter bounds reveals a consistent pattern of several key observables that are especially sensitive to the effects of new physics and thus dominate the overall shape of the respective parameter contours.
Submitted 17 March, 2010;
originally announced March 2010.
-
Z to b bbar and Chiral Currents in Higgsless Models
Authors:
Tomohiro Abe,
R. Sekhar Chivukula,
Neil D. Christensen,
Ken Hsieh,
Shinya Matsuzaki,
Elizabeth H. Simmons,
Masaharu Tanabashi
Abstract:
In this note we compute the flavor-dependent chiral-logarithmic corrections to the decay Z to b bbar in the three site Higgsless model. We compute these corrections diagrammatically in the "gaugeless" limit in which the electroweak couplings vanish. We also compute the chiral-logarithmic corrections to the decay Z to b bbar using an RGE analysis in effective field theory, and show that the results agree. In the process of this computation, we compute the form of the chiral current in the gaugeless limit of the three-site model, and consider the generalization to the N-site case. We elucidate the Ward-Takahashi identities which underlie the gaugeless limit calculation in the three-site model, and describe how the result for the Z to b bbar amplitude is obtained in unitary gauge in the full theory. We find that the phenomenological constraints on the three-site Higgsless model arising from measurements of Z to b bbar are relatively mild, requiring only that the heavy Dirac fermion be heavier than 1 TeV or so, and are satisfied automatically in the range of parameters allowed by other precision electroweak data.
Submitted 25 March, 2009; v1 submitted 23 February, 2009;
originally announced February 2009.
-
A Re-interpretation of the STEREO/STE Observations and its Consequences
Authors:
K. C. Hsieh,
P. C. Frisch,
J. Giacalone,
J. R. Jokipii,
J. Kota,
D. E. Larson,
R. P. Lin,
J. G. Luhmann,
L. Wang
Abstract:
We present an alternate interpretation of recent STEREO/STE observations that were originally attributed to energetic neutral atoms (ENA) from the heliosheath. The signal attributed to the diffuse ENA source instead shows the characteristics of a point source. We point out that the peak intensity seen by STEREO/STE is centered at the ecliptic longitude of the bright X-ray source Sco X-1. The observed energy spectrum and intensity are also consistent with the X-rays from Sco X-1. The problem of energy dissipation at the solar wind termination shock remains unsolved while current understanding of the interaction between the solar wind and interstellar wind awaits future observations.
Submitted 20 February, 2009;
originally announced February 2009.
-
Lone Higgs at the LHC
Authors:
Ken Hsieh,
C. -P. Yuan
Abstract:
We address the possible scenario that the Large Hadron Collider (LHC) discovers only a Higgs boson after 10 fb^{-1} of operation, and attempt to identify this Higgs boson as that of the Standard Model (SM), the minimal universal extra dimension model (MUED), the littlest Higgs model with T-parity (LHT), or the minimal supersymmetric Standard Model (MSSM), using only the measurement of the product of gluon-fusion production cross section and the di-photon branching ratio. In MUED, by decoupling any new physics sufficiently to evade the discovery reach at the LHC, the deviation of the signal from the SM is not statistically significant. However, in LHT and MSSM, it is possible to have a significant deviation in the signal that is consistent with this "lone Higgs scenario", and, in the case of a very large suppression, we can distinguish MSSM and LHT before the discovery of any new resonances. Starting with the lone Higgs scenario and the deviation in this measurement from the Standard Model prediction (whether or not statistically significant), we offer tests that may discriminate the models and search strategies of discovering new physics signatures with increasing integrated luminosity.
Submitted 19 September, 2008; v1 submitted 16 June, 2008;
originally announced June 2008.
-
Triplet Extended Supersymmetric Standard Model
Authors:
Stefano Di Chiara,
Ken Hsieh
Abstract:
We revisit an extension of the MSSM by adding a hypercharge-neutral, SU(2)-triplet chiral superfield. Similar to the NMSSM, the triplet gives an additional contribution to the quartic coupling in the Higgs potential, and the mass of the lightest CP-even Higgs boson can be greater than the mass of the Z-boson at tree-level. In addition to discussing the perturbativity, fine-tuning, and decoupling issues of this model, we compute the dominant 1-loop corrections to the mass of the lightest CP-even Higgs boson from the triplet sector. When the Higgs-Higgs-Triplet coupling in the superpotential is comparable to the top Yukawa coupling, we find that the Higgs mass can be as heavy as 140 GeV even without the traditional contributions from the top--s-top sector, and at the same time consistent with the precision electroweak constraints. At the expense of having Landau poles before the GUT scale, this opens up a previously forbidden region in the MSSM parameter space where both s-tops are light. In addition to having relatively small fine-tuning (about one part in 30), this leads to a gluo-philic Higgs boson whose production via gluon-gluon fusion at the LHC can be twice as large as the SM prediction.
Submitted 8 September, 2008; v1 submitted 16 May, 2008;
originally announced May 2008.
-
Pseudo-Dirac Bino Dark Matter
Authors:
Ken Hsieh
Abstract:
While the bino-dominated lightest neutralino of the minimal supersymmetric Standard Model (MSSM) is an interesting and widely-studied dark matter candidate, the p-wave suppression of its annihilation cross section requires fine-tunings of the MSSM spectra to be consistent with WMAP observations. We propose a pseudo-Dirac bino that arises in theories with D-type supersymmetry-breaking as an intriguing alternative dark matter candidate. The pseudo-Dirac nature of the bino gives a natural mechanism of enhanced co-annihilation because these two states are degenerate in the absence of electroweak symmetry breaking. In addition, the lightest state can be consistent with limits of direct detection experiments because of the lack of vector interactions, as with the case of the MSSM bino.
Submitted 8 January, 2008; v1 submitted 29 August, 2007;
originally announced August 2007.
-
Mixed Dark Matter in Universal Extra Dimension Models with TeV Scale $W_{R}$ and $Z'$
Authors:
Ken Hsieh,
R. N. Mohapatra,
Salah Nasri
Abstract:
We show that in a class of universal extra dimension (UED) models that solves both the neutrino mass and proton decay problems using low scale left-right symmetry, the dark matter of the Universe consists of an admixture of KK photon and KK right-handed neutrinos. We present a full calculation of the dark matter density in these models taking into account the co-annihilation effects due to nearby states such as the scalar partner of the KK photon as well as fermion states near the right-handed KK neutrino. Using the value of the relic CDM density, we obtain upper limits on $R^{-1}$ of about 400-650 GeV and $M_{Z'}\leq 1.5$ TeV, both being accessible to LHC. For a region in this parameter space where the KK right-handed neutrino contributes significantly to the total relic density of dark matter, we obtain a lower bound on the dark matter-nucleon scattering cross section of $10^{-44}$ cm$^2$, which can be probed by the next round of dark matter search experiments.
Submitted 16 October, 2006; v1 submitted 12 October, 2006;
originally announced October 2006.
-
Mixed Gauge and Anomaly Mediation From New Physics at 10 TeV
Authors:
Ken Hsieh,
Markus A. Luty
Abstract:
In the context of anomaly-mediated supersymmetry breaking, it is natural for vectorlike fields and singlets to have supersymmetry breaking masses of order 10 TeV, and therefore act as messengers of supersymmetry breaking. We show that this can give rise to phenomenologically viable spectra compatible with perturbative gauge coupling unification. The minimal model interpolates continuously between pure anomaly mediation and gauge mediation with a messenger scale of order 10 TeV. It is also possible to have non-minimal models with more degenerate spectra, with some squarks lighter than sleptons. These models reduce to the MSSM at low energies and incorporate a natural solution of the mu problem. The minimal model has four continuous parameters and one discrete parameter (the number of messengers). The LEP Higgs mass bound can be satisfied in the minimal model by tuning parameters at the GUT scale to one part in 50.
Submitted 27 April, 2006;
originally announced April 2006.
-
Dark Matter in Universal Extra Dimension Models: $γ_{KK}$ vrs $ν_{R,KK}$
Authors:
Ken Hsieh,
R. N. Mohapatra,
Salah Nasri
Abstract:
We show that in a class of universal extra dimension models (UED), which solves both the neutrino mass and proton decay problems, an admixture of KK photon and KK right-handed neutrinos can provide the required amount of cold dark matter (CDM). This model has two parameters, $R^{-1}$ and $M_{Z'}$ ($R$ is the radius of the extra space dimensions and $Z'$ the extra neutral gauge boson of the model). Using the value of the relic CDM density, combined with the results from the cryogenic searches for CDM, we obtain upper limits on $R^{-1}$ of about 400-650 GeV and $M_{Z'}\leq 1.5$ TeV, both being accessible to LHC. In some regions of the parameter space, the dark matter-nucleon scattering cross section can be as high as $10^{-44}$ cm$^2$, which can be probed by the next round of dark matter search experiments.
Submitted 1 September, 2006; v1 submitted 18 April, 2006;
originally announced April 2006.