Showing 1–50 of 64 results for author: Chai, S

  1. arXiv:2407.02797  [pdf, other]

    cs.RO cs.CV

    Solving Motion Planning Tasks with a Scalable Generative Model

    Authors: Yihan Hu, Siqi Chai, Zhening Yang, Jingyu Qian, Kun Li, Wenxin Shao, Haichao Zhang, Wei Xu, Qiang Liu

    Abstract: As autonomous driving systems are being deployed to millions of vehicles, there is a pressing need to improve the system's scalability and safety and to reduce the engineering cost. A realistic, scalable, and practical simulator of the driving world is highly desired. In this paper, we present an efficient solution based on generative models which learns the dynamics of the driving scenes. With this mod…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  2. arXiv:2405.17459  [pdf]

    cs.LG cs.AI cs.CL cs.CV

    Integrating Medical Imaging and Clinical Reports Using Multimodal Deep Learning for Advanced Disease Analysis

    Authors: Ziyan Yao, Fei Lin, Sheng Chai, Weijie He, Lu Dai, Xinghui Fei

    Abstract: In this paper, an innovative multi-modal deep learning model is proposed to deeply integrate heterogeneous information from medical images and clinical reports. First, for medical images, convolutional neural networks were used to extract high-dimensional features and capture key visual information such as focal details, texture and spatial distribution. Secondly, for clinical report text, a two-w…

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2404.04314  [pdf, other]

    cs.LG

    Faraday: Synthetic Smart Meter Generator for the smart grid

    Authors: Sheng Chai, Gus Chadney

    Abstract: Access to smart meter data is essential to rapid and successful transitions to electrified grids, underpinned by flexibility delivered by low carbon technologies, such as electric vehicles (EV) and heat pumps, and powered by renewable energy. Yet little of this data is available for research and modelling purposes due to consumer privacy protections. Whilst many are calling for raw datasets to be unl…

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: Published as a workshop paper at Tackling Climate Change with Machine Learning, ICLR 2024

  4. arXiv:2401.15991  [pdf, other]

    cond-mat.mes-hall

    Measurement of the Chern Number for Non-Hermitian Chern Insulators

    Authors: Hongfang Liu, Ming Lu, Shengdu Chai, Zhi-Qiang Zhang, Hua Jiang

    Abstract: The identification of the topological invariant of a topological system is crucial in experiments. However, due to the inherent non-Hermitian features, such determination is notably challenging in non-Hermitian systems. Here, we propose that the magnetic effect can be utilized to measure the Chern number of the non-Hermitian Chern insulator. We find that the splitting of non-Hermitian bands under…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 15 pages, 10 figures, Comments are welcome

  5. arXiv:2401.02474  [pdf, other]

    hep-ph hep-ex

    From Optimal Observables to Machine Learning: an Effective-Field-Theory Analysis of $e^+e^- \to W^+W^-$ at Future Lepton Colliders

    Authors: Shengdu Chai, Jiayin Gu, Lingfeng Li

    Abstract: We apply machine-learning techniques to the effective-field-theory analysis of the $e^+e^- \to W^+W^-$ processes at future lepton colliders, and demonstrate their advantages in comparison with conventional methods, such as optimal observables. Compared to traditional algorithms, we show that simulation-based inference methods are more robust to detector effects and backgrounds, and could in princi…

    Submitted 30 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

    Comments: 31 pages, 8 figures, minor updates

  6. arXiv:2312.06982  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Residual Stress-Driven Non-Euclidean Morphing in Origami Structures

    Authors: Zihe Liang, Sibo Chai, Qinyun Ding, Kai Xiao, Ke Liu, Jiayao Ma, Jaehyung Ju

    Abstract: Non-Euclidean surfaces are ubiquitous in numerous engineering fields, such as automotive, aerospace, and biomedical engineering domains. Morphing origami has numerous potential engineering applications, including soft robots, mechanical metamaterials, antennas, aerospace structures, and biomedical devices, owing to its intrinsic morphing features from two-dimensional (2D) planes to three-dimension…

    Submitted 11 December, 2023; originally announced December 2023.

  7. arXiv:2309.05569  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    ITI-GEN: Inclusive Text-to-Image Generation

    Authors: Cheng Zhang, Xuanbai Chen, Siqi Chai, Chen Henry Wu, Dmitry Lagun, Thabo Beeler, Fernando De la Torre

    Abstract: Text-to-image generative models often reflect the biases of the training data, leading to unequal representations of underrepresented groups. This study investigates inclusive text-to-image generative models that generate images based on human-written prompts and ensure the resulting images are uniformly distributed across attributes of interest. Unfortunately, directly expressing the desired attr…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Accepted to ICCV 2023 (Oral Presentation)

  8. arXiv:2308.13126  [pdf]

    physics.med-ph physics.ins-det

    A Cryogenic Tune and Match Circuit for Magnetic Resonance Microscopy at 15.2T

    Authors: Benjamin M. Hardy, Gary Drake, Shuyang Chai, Bibek Dhakal, Jonathan B. Martin, Junzhong Xu, Mark D. Does, Adam W. Anderson, Xinqiang Yan, John C. Gore

    Abstract: Signal to noise ratios (SNR) in magnetic resonance microscopy images are limited by acquisition times and the decreasing number of spins in smaller voxels. Significant SNR gains from cooling of the RF receiver are only realized when the Johnson noise generated within the RF hardware is large compared to the electromagnetic noise produced by the sample. Cryogenic cooling of imaging probes is common…

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 33 pages, 10 figures, 1 table, 4 supplemental figures, 1 supplemental table

  9. arXiv:2307.01124  [pdf]

    eess.IV cs.CV

    Cross-modality Attention Adapter: A Glioma Segmentation Fine-tuning Method for SAM Using Multimodal Brain MR Images

    Authors: Xiaoyu Shi, Shurong Chai, Yinhao Li, Jingliang Cheng, Jie Bai, Guohua Zhao, Yen-Wei Chen

    Abstract: According to the 2021 World Health Organization (WHO) Classification scheme for gliomas, glioma segmentation is a very important basis for diagnosis and genotype prediction. In general, 3D multimodal brain MRI is an effective diagnostic tool. In the past decade, there has been an increase in the use of machine learning, particularly deep learning, for medical image processing. Thanks to the devel…

    Submitted 3 July, 2023; originally announced July 2023.

  10. arXiv:2306.14519  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall physics.app-ph

    Towards Sustainable Ultrawide Bandgap Van der Waals Materials: An ab initio Screening Effort

    Authors: Chuin Wei Tan, Linqiang Xu, Chen Chen Er, Siang-Piao Chai, Boris Kozinsky, Hui Ying Yang, Shengyuan A. Yang, Jing Lu, Yee Sin Ang

    Abstract: The sustainable development of next-generation device technology is paramount in the face of climate change and the looming energy crisis. Tremendous efforts have been made in the discovery and design of nanomaterials that achieve device-level sustainability, where high performance and low operational energy cost are prioritized. However, many of such materials are composed of elements that are un…

    Submitted 25 October, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: 17 pages, 8 figures

  11. arXiv:2306.12737  [pdf, other]

    cs.CV

    Ladder Fine-tuning approach for SAM integrating complementary network

    Authors: Shurong Chai, Rahul Kumar Jain, Shiyu Teng, Jiaqing Liu, Yinhao Li, Tomoko Tateyama, Yen-wei Chen

    Abstract: Recently, foundation models have been introduced demonstrating various tasks in the field of computer vision. These models such as Segment Anything Model (SAM) are generalized models trained using huge datasets. Currently, ongoing research focuses on exploring the effective utilization of these generalized models for specific domains, such as medical imaging. However, in medical imaging, the lack…

    Submitted 22 June, 2023; originally announced June 2023.

  12. arXiv:2305.02567  [pdf, other]

    cs.CV

    LayoutDM: Transformer-based Diffusion Model for Layout Generation

    Authors: Shang Chai, Liansheng Zhuang, Fengying Yan

    Abstract: Automatic layout generation that can synthesize high-quality layouts is an important tool for graphic design in many applications. Though existing methods based on generative models such as Generative Adversarial Networks (GANs) and Variational Auto-Encoders (VAEs) have progressed, they still leave much room for improving the quality and diversity of the results. Inspired by the recent success of…

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: Accepted by CVPR 2023

  13. arXiv:2212.10156  [pdf, other]

    cs.CV cs.RO

    Planning-oriented Autonomous Driving

    Authors: Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li

    Abstract: Modern autonomous driving system is characterized as modular tasks in sequential order, i.e., perception, prediction, and planning. In order to perform a wide diversity of tasks and achieve advanced-level intelligence, contemporary approaches either deploy standalone models for individual tasks, or design a multi-task paradigm with separate heads. However, they might suffer from accumulative error…

    Submitted 23 March, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: CVPR 2023 award candidate. Project page: https://opendrivelab.github.io/UniAD/

  14. Accommodating the CDF W-boson Mass Measurement in the Beautiful Mirror Model

    Authors: Shengdu Chai, Jiayin Gu, Lian-Tao Wang

    Abstract: The W-boson mass measurement recently reported by the CDF II experiment exhibits a significant deviation from both the Standard Model prediction and previous measurements. There is also a long-standing deviation between the Standard Model prediction of the forward-backward asymmetry of the bottom quark ($A^{0,b}_{\rm FB}$) and its measurement at the LEP experiment. The Beautiful Mirror model, prop…

    Submitted 19 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: 23 pages, 4 figures. v2: matches published version

    Journal ref: Phys.Rev.D 107 (2023) 9, 095013

  15. Security Defense of Large Scale Networks Under False Data Injection Attacks: An Attack Detection Scheduling Approach

    Authors: Yuhan Suo, Senchun Chai, Runqi Chai, Zhong-Hua Pang, Yuanqing Xia, Guo-Ping Liu

    Abstract: In large-scale networks, communication links between nodes are easily injected with false data by adversaries. This paper proposes a novel security defense strategy from the perspective of attack detection scheduling to ensure the security of the network. Based on the proposed strategy, each sensor can directly exclude suspicious sensors from its neighboring set. First, the problem of selecting su…

    Submitted 17 December, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: 14 pages, 13 figures

  16. arXiv:2211.14837  [pdf, ps, other]

    math.NA math.AP

    Numerical analysis of a time discretized method for nonlinear filtering problem with Lévy process observations

    Authors: Fengshan Zhang, Yongkui Zou, Shimin Chai, Yanzhao Cao

    Abstract: In this paper, we consider a nonlinear filtering model with observations driven by correlated Wiener processes and point processes. We first derive a Zakai equation whose solution is an unnormalized probability density function of the filter solution. Then we apply a splitting-up technique to decompose the Zakai equation into three stochastic differential equations, based on which we construct a sp…

    Submitted 27 November, 2022; originally announced November 2022.

    Report number: 31

  17. arXiv:2208.04785  [pdf, other]

    math.NA

    Weak Galerkin finite element method for linear poroelasticity problems

    Authors: Shanshan Gu, Shimin Chai, Chenguang Zhou

    Abstract: This paper is devoted to a weak Galerkin (WG) finite element method for linear poroelasticity problems where weakly defined divergence and gradient operators over discontinuous functions are introduced. We establish both the continuous and discrete time WG schemes, and obtain their optimal convergence order estimates in a discrete $H^1$ norm for the displacement and in an $H^1$ type and $L^2$ norm…

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: 25 pages, 6 figures

    MSC Class: 65M60; 65M15; 76S05

  18. A Transformer-based Generative Adversarial Network for Brain Tumor Segmentation

    Authors: Liqun Huang, Long Chen, Baihai Zhang, Senchun Chai

    Abstract: Brain tumor segmentation remains a challenge in medical image segmentation tasks. With the application of transformer in various computer vision tasks, transformer blocks show the capability of learning long-distance dependency in global space, which is complementary with CNNs. In this paper, we proposed a novel transformer-based generative adversarial network to automatically segment brain tumors…

    Submitted 28 July, 2022; v1 submitted 28 July, 2022; originally announced July 2022.

    Comments: 11 pages, 2 figures

  19. arXiv:2207.04497  [pdf, other]

    cs.LG cs.AI cs.CR

    One-shot Neural Backdoor Erasing via Adversarial Weight Masking

    Authors: Shuwen Chai, Jinghui Chen

    Abstract: Recent studies show that despite achieving high accuracy on a number of real-world applications, deep neural networks (DNNs) can be backdoored: by injecting triggered data samples into the training dataset, the adversary can mislead the trained model into classifying any test data to the target class as long as the trigger pattern is presented. To nullify such backdoor threats, various methods hav…

    Submitted 1 November, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: Accepted by NeurIPS 2022 (19 pages, 6 figures, 10 tables)

  20. arXiv:2206.14660  [pdf, other]

    cs.CL cs.SD eess.AS

    The THUEE System Description for the IARPA OpenASR21 Challenge

    Authors: Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guan-Bo Wang, Guoguo Chen, Wei-Qiang Zhang

    Abstract: This paper describes the THUEE team's speech recognition system for the IARPA Open Automatic Speech Recognition Challenge (OpenASR21), with further experiment explorations. We achieve outstanding results under both the Constrained and Constrained-plus training conditions. For the Constrained training condition, we construct our basic ASR system based on the standard hybrid architecture. To allevia…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: accepted by INTERSPEECH 2022

  21. arXiv:2206.10118  [pdf, other]

    cs.CV

    HOPE: Hierarchical Spatial-temporal Network for Occupancy Flow Prediction

    Authors: Yihan Hu, Wenxin Shao, Bo Jiang, Jiajie Chen, Siqi Chai, Zhening Yang, Jingyu Qian, Helong Zhou, Qiang Liu

    Abstract: In this report, we introduce our solution to the Occupancy and Flow Prediction challenge in the Waymo Open Dataset Challenges at CVPR 2022, which ranks 1st on the leaderboard. We have developed a novel hierarchical spatial-temporal network featured with spatial-temporal encoders, a multi-scale aggregator enriched with latent variables, and a recursive hierarchical 3D decoder. We use multiple losse…

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: 1st Ranking Solution for the Occupancy and Flow Prediction of the Waymo Open Dataset Challenges 2022 (http://cvpr2022.wad.vision/)

  22. arXiv:2112.10088  [pdf, ps, other]

    physics.atom-ph physics.optics quant-ph

    Enhancing fiber atom interferometer by in-fiber laser cooling

    Authors: Yu Wang, Shijie Chai, Thomas Billotte, Zilong Chen, Mingjie Xin, Wui Seng Leong, Foued Amrani, Benoit Debord, Fetah Benabid, Shau-Yu Lan

    Abstract: We demonstrate an inertia sensitive atom interferometer optically guided inside a 22-cm-long negative curvature hollow-core photonic crystal fiber with an interferometer time of 20 ms. The result prolongs the previous fiber guided atom interferometer time by three orders of magnitude. The improvement arises from the realization of in-fiber Λ-enhanced gray molasses and delta-kick cooling to cool at…

    Submitted 19 December, 2021; originally announced December 2021.

    Journal ref: Phys. Rev. Research 4, L022058 (2022)

  23. arXiv:2108.05817  [pdf, other]

    stat.AP

    Hong Kong Air Traffic: Explanation and Prediction based on Sparse Seasonal ARIMA Model

    Authors: Shuwen Chai

    Abstract: The monthly air traffic of a city is a time series with an obvious seasonal pattern, and is closely related to the economic situation and social environment of the city. In Hong Kong, for example, July, August, and October tend to be the peak season of traffic flow, while there is also a relatively fixed off-season. In the case of a stable social environment, a carefully identified and fitted seas…

    Submitted 12 August, 2021; originally announced August 2021.

  24. arXiv:2106.06909  [pdf, other]

    cs.SD cs.CL eess.AS

    GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio

    Authors: Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan

    Abstract: This paper introduces GigaSpeech, an evolving, multi-domain English speech recognition corpus with 10,000 hours of high quality labeled audio suitable for supervised training, and 40,000 hours of total audio suitable for semi-supervised and unsupervised training. Around 40,000 hours of transcribed audio is first collected from audiobooks, podcasts and YouTube, covering both read and spontaneous sp…

    Submitted 13 June, 2021; originally announced June 2021.

  25. arXiv:2103.06231  [pdf, other]

    cs.LG cs.CV

    Quantization-Guided Training for Compact TinyML Models

    Authors: Sedigh Ghamari, Koray Ozcan, Thu Dinh, Andrey Melnikov, Juan Carvajal, Jan Ernst, Sek Chai

    Abstract: We propose a Quantization Guided Training (QGT) method to guide DNN training towards optimized low-bit-precision targets and reach extreme compression levels below 8-bit precision. Unlike standard quantization-aware training (QAT) approaches, QGT uses customized regularization to encourage weight values towards a distribution that maximizes accuracy while reducing quantization errors. One of the m…

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: TinyML Summit, March 2021

  26. Dark-state sideband cooling in an atomic ensemble

    Authors: Chang Huang, Shijie Chai, Shau-Yu Lan

    Abstract: We utilize the dark state in a Λ-type three-level system to cool an ensemble of 85Rb atoms in an optical lattice [Morigi et al., Phys. Rev. Lett. 85, 4458 (2000)]. The common suppression of the carrier transition of atoms with different vibrational frequencies allows them to reach a subrecoil temperature of 100 nK after being released from the optical lattice. A nearly zero vibrational quantum num…

    Submitted 12 January, 2021; originally announced January 2021.

    Journal ref: Phys. Rev. A 103, 013305 (2021)

  27. arXiv:2011.15093  [pdf, other]

    eess.IV cs.CV

    Reducing Textural Bias Improves Robustness of Deep Segmentation Models

    Authors: Seoin Chai, Daniel Rueckert, Ahmed E. Fetit

    Abstract: Despite advances in deep learning, robustness under domain shift remains a major bottleneck in medical imaging settings. Findings on natural images suggest that deep neural models can show a strong textural bias when carrying out image classification tasks. In this thorough empirical study, we draw inspiration from findings on natural images and investigate ways in which addressing the textural bi…

    Submitted 27 June, 2021; v1 submitted 30 November, 2020; originally announced November 2020.

    Comments: To appear in MIUA 2021 (accepted version)

  28. arXiv:2011.08009  [pdf, other]

    cs.CV cs.LG

    Subtensor Quantization for Mobilenets

    Authors: Thu Dinh, Andrey Melnikov, Vasilios Daskalopoulos, Sek Chai

    Abstract: Quantization for deep neural networks (DNN) has enabled developers to deploy models with less memory and more efficient low-power inference. However, not all DNN designs are friendly to quantization. For example, the popular Mobilenet architecture has been tuned to reduce parameter size and computational latency with separable depth-wise convolutions, but not all quantization algorithms work well…

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: Embedded Vision Workshop, 16th European Conference on Computer Vision (ECCV), Aug 2020

  29. arXiv:2011.02836  [pdf, other]

    cs.LG

    Dynamically Throttleable Neural Networks (TNN)

    Authors: Hengyue Liu, Samyak Parajuli, Jesse Hostetler, Sek Chai, Bir Bhanu

    Abstract: Conditional computation for Deep Neural Networks (DNNs) reduces overall computational load and improves model accuracy by running a subset of the network. In this work, we present a runtime throttleable neural network (TNN) that can adaptively self-regulate its own performance target and computing resources. We designed TNN with several properties that enable more flexibility for dynamic execution b…

    Submitted 1 November, 2020; originally announced November 2020.

    Comments: arXiv admin note: text overlap with arXiv:1905.13179

  30. arXiv:2010.10161  [pdf]

    quant-ph physics.atom-ph

    Large array of Schrödinger cat states facilitated by an optical waveguide

    Authors: Wui Seng Leong, Mingjie Xin, Zilong Chen, Shijie Chai, Yu Wang, Shau-Yu Lan

    Abstract: Quantum engineering using photonic structures offers new capabilities for atom-photon interactions for quantum optics and atomic physics, which could eventually lead to integrated quantum devices. Despite the rapid progress in the variety of structures, coherent excitation of the motional states of atoms in a photonic waveguide using guided modes has yet to be demonstrated. Here, we use the wavegui…

    Submitted 20 October, 2020; originally announced October 2020.

    Journal ref: Nat Commun 11, 5295 (2020)

  31. arXiv:1910.06761  [pdf, other]

    cs.LG eess.SY

    Causal Mechanism Transfer Network for Time Series Domain Adaptation in Mechanical Systems

    Authors: Zijian Li, Ruichu Cai, Kok Soon Chai, Hong Wei Ng, Hoang Dung Vu, Marianne Winslett, Tom Z. J. Fu, Boyan Xu, Xiaoyan Yang, Zhenjie Zhang

    Abstract: Data-driven models are becoming essential parts in modern mechanical systems, commonly used to capture the behavior of various equipment and varying environmental characteristics. Despite the advantages of these data-driven models on excellent adaptivity to high dynamics and aging equipment, they are usually hungry to massive labels over historical data, mostly contributed by human engineers at an…

    Submitted 13 October, 2019; originally announced October 2019.

  32. arXiv:1910.04877  [pdf]

    cs.CV cs.LG cs.PF

    Bit Efficient Quantization for Deep Neural Networks

    Authors: Prateeth Nayak, David Zhang, Sek Chai

    Abstract: Quantization for deep neural networks has afforded models for edge devices that use less on-board memory and enable efficient low-power inference. In this paper, we present a comparison of model-parameter driven quantization approaches that can achieve as low as 3-bit precision without affecting accuracy. The post-training quantization approaches are data-free, and the resulting weight values are…

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: EMC2 - NeurIPS workshop 2019, #latentai

  33. Voting for Distortion Points in Geometric Processing

    Authors: Shuangming Chai, Xiao-Ming Fu, Ligang Liu

    Abstract: Low isometric distortion is often required for mesh parameterizations. A configuration of some vertices, where the distortion is concentrated, provides a way to mitigate isometric distortion, but determining the number and placement of these vertices is non-trivial. We call these vertices distortion points. We present a novel and automatic method to detect distortion points using a voting strategy…

    Submitted 1 November, 2019; v1 submitted 28 September, 2019; originally announced September 2019.

  34. arXiv:1902.08349  [pdf]

    cs.LG cs.AI stat.ML

    Generative Memory for Lifelong Reinforcement Learning

    Authors: Aswin Raghavan, Jesse Hostetler, Sek Chai

    Abstract: Our research is focused on understanding and applying biological memory transfers to new AI systems that can fundamentally improve their performance, throughout their fielded lifetime experience. We leverage current understanding of biological memory transfer to arrive at AI algorithms for memory consolidation and replay. In this paper, we propose the use of generative memory that can be recalled…

    Submitted 21 February, 2019; originally announced February 2019.

    Comments: Abstract NICE 2019 conference

  35. Beam test performance of the highly granular SiW-ECAL technological prototype for the ILC

    Authors: K. Kawagoe, Y. Miura, I. Sekiya, T. Suehara, T. Yoshioka, S. Bilokin, J. Bonis, P. Cornebise, A. Gallas, A. Irles, R. Pöschl, F. Richard, A. Thiebault, D. Zerwas, M. Anduze, V. Balagura, V. Boudry, J-C. Brient, E. Edy, G. Fayolle, M. Frotin, F. Gastaldi, R. Guillaumat, A. Lobanov, M. Louzir, et al. (19 additional authors not shown)

    Abstract: The technological prototype of the CALICE highly granular silicon-tungsten electromagnetic calorimeter (SiW-ECAL) was tested in a beam at DESY in 2017. The setup comprised seven layers of silicon sensors. Each layer comprised four sensors, with each sensor containing an array of 256 $5.5\times5.5$ mm$^2$ silicon PIN diodes. The four sensors covered a total area of $18\times18$ cm$^2$, and comprise…

    Submitted 22 October, 2019; v1 submitted 31 January, 2019; originally announced February 2019.

    Report number: KYUSHU-RCAPP-2019-04

  36. Data Driven Chiller Plant Energy Optimization with Domain Knowledge

    Authors: Hoang Dung Vu, Kok Soon Chai, Bryan Keating, Nurislam Tursynbek, Boyan Xu, Kaige Yang, Xiaoyan Yang, Zhenjie Zhang

    Abstract: Refrigeration and chiller optimization is an important and well studied topic in mechanical engineering, mostly taking advantage of physical models, designed on top of over-simplified assumptions, over the equipment. Conventional optimization techniques using physical models make decisions of online parameter tuning, based on very limited information of hardware specifications and external condit…

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: CIKM2017. Proceedings of the 26th ACM International Conference on Information and Knowledge Management. 2017

  37. arXiv:1811.12108  [pdf]

    cs.CV

    Bootstrapping Deep Neural Networks from Approximate Image Processing Pipelines

    Authors: Kilho Son, Jesse Hostetler, Sek Chai

    Abstract: Complex image processing and computer vision systems often consist of a processing pipeline of functional modules. We intend to replace parts or all of a target pipeline with deep neural networks to achieve benefits such as increased accuracy or reduced computational requirement. To acquire a large amount of labeled data necessary to train the deep neural network, we propose a workflow that levera…

    Submitted 15 February, 2019; v1 submitted 29 November, 2018; originally announced November 2018.

    Comments: 6 pages, 5 figures

  38. arXiv:1811.04985  [pdf, other]

    cs.LG stat.ML

    Generalized Ternary Connect: End-to-End Learning and Compression of Multiplication-Free Deep Neural Networks

    Authors: Samyak Parajuli, Aswin Raghavan, Sek Chai

    Abstract: The use of deep neural networks in edge computing devices hinges on the balance between accuracy and complexity of computations. Ternary Connect (TC) \cite{lin2015neural} addresses this issue by restricting the parameters to three levels $-1, 0$, and $+1$, thus eliminating multiplications in the forward pass of the network during prediction. We propose Generalized Ternary Connect (GTC), which allo…

    Submitted 12 November, 2018; originally announced November 2018.

  39. arXiv:1810.05133  [pdf, other]

    physics.ins-det

    Commissioning of the highly granular SiW-ECAL technological prototype

    Authors: S. Bilokin, J. Bonis, P. Cornebise, A. Gallas, A. Irles, R. Pöschl, F. Richard, A. Thiebault, D. Zerwas, M. Anduze, V. Balagura, V. Boudry, J-C. Brient, E. Edy, G. Fayolle, M. Frotin, F. Gastaldi, A. Lobanov, F. Magniette, J. Nanni, M. Rubio-Roy, K. Shpak, H. Videau, D. Yu, S. Callier, et al. (18 additional authors not shown)

    Abstract: In this article we describe the commissioning and a first analysis of the beam test performance of a small prototype of a highly granular silicon tungsten calorimeter. The prototype features detector elements with a channel number similar to that envisaged for e.g. the ILD Detector of the International Linear Collider (ILC). The analysis demonstrates the capability of the detector to record si…

    Submitted 4 April, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Report number: AIDA-2020-PUB-2019-002

  40. arXiv:1806.06496  [pdf, other]

    cs.CR cs.AI

    Power-Grid Controller Anomaly Detection with Enhanced Temporal Deep Learning

    Authors: Zecheng He, Aswin Raghavan, Guangyuan Hu, Sek Chai, Ruby Lee

    Abstract: Controllers of security-critical cyber-physical systems, like the power grid, are a very important class of computer systems. Attacks against the control code of a power-grid system, especially zero-day attacks, can be catastrophic. Earlier detection of the anomalies can prevent further damage. However, detecting zero-day attacks is extremely challenging because they have no known code and have un…

    Submitted 22 June, 2019; v1 submitted 18 June, 2018; originally announced June 2018.

    Comments: Accepted in the 18th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom'19)

  41. Electromagnon in Y-type hexaferrite BaSrCoZnFe$_{11}$AlO$_{22}$

    Authors: Jakub Vit, Filip Kadlec, Christelle Kadlec, Fedir Borodavka, Yi Sheng Chai, Kun Zhai, Young Sun, Stanislav Kamba

    Abstract: We investigated static and dynamic magnetoelectric properties of single crystalline BaSrCoZnFe$_{11}$AlO$_{22}$ which is a room-temperature multiferroic with Y-type hexaferrite crystal structure. Below $300\,\rm K$, a purely electric-dipole-active electromagnon at $\approx 1.2\,\rm THz$ with the electric polarization oscillating along the hexagonal axis was observed by THz and Raman spectroscopies…

    Submitted 9 April, 2018; originally announced April 2018.

  42. arXiv:1712.05928  [pdf]

    cond-mat.mtrl-sci cond-mat.str-el

    Contrasting magnetoelectric behavior in multiferroic hexaferrites as understood by crystal symmetry analyses

    Authors: Y. S. Chai, S. H. Chun, J. Z. Cong, Kee Hoon Kim

    Abstract: Magnetoelectric (ME) properties under rotating magnetic field H are comparatively investigated in two representative hexaferrites Y-type Ba0.5Sr1.5Zn2(Fe0.92Al0.08)12O22 and Z-type Ba0.52Sr2.48Co2Fe24O41, both of which have exhibited a similar transverse conical spin structure and giant ME coupling near room temperature. When the external H is rotated clockwise by 2pi, in-plane P vector is rotated…

    Submitted 16 December, 2017; originally announced December 2017.

    Comments: 36 pages, 4 figures

    Journal ref: Phys. Rev. B 98, 104416 (2018)

  43. arXiv:1708.04788  [pdf, other]

    cs.LG stat.ML

    BitNet: Bit-Regularized Deep Neural Networks

    Authors: Aswin Raghavan, Mohamed Amer, Sek Chai, Graham Taylor

    Abstract: We present a novel optimization strategy for training neural networks which we call "BitNet". The parameters of neural networks are usually unconstrained and have a dynamic range dispersed over all real values. Our key idea is to limit the expressive power of the network by dynamically controlling the range and set of values that the parameters can take. We formulate this idea using a novel end-to…

    Submitted 16 November, 2018; v1 submitted 16 August, 2017; originally announced August 2017.

  44. arXiv:1703.09146  [pdf, other]

    cs.LG

    GPU Activity Prediction using Representation Learning

    Authors: Aswin Raghavan, Mohamed Amer, Timothy Shields, David Zhang, Sek Chai

    Abstract: GPU activity prediction is an important and complex problem. This is due to the high level of contention among thousands of parallel threads. This problem was mostly addressed using heuristics. We propose a representation learning approach to address this problem. We model any performance metric as a temporal function of the executed instructions with the intuition that the flow of instructions ca…

    Submitted 27 March, 2017; originally announced March 2017.

    Comments: Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016. JMLR: W&CP volume 48. Copyright 2016 by the author(s)

  45. arXiv:1703.08595  [pdf]

    cs.LG

    Low Precision Neural Networks using Subband Decomposition

    Authors: Sek Chai, Aswin Raghavan, David Zhang, Mohamed Amer, Tim Shields

    Abstract: Large-scale deep neural networks (DNN) have been successfully used in a number of tasks from image recognition to natural language processing. They are trained using large training sets on large models, making them computationally and memory intensive. As such, there is much interest in research development for faster training and test time. In this paper, we present a unique approach using lower…

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: Presented at CogArch Workshop, Atlanta, GA, April 2016

  46. arXiv:1612.04257  [pdf, ps, other]

    physics.plasm-ph

    Observation of Toroidal Alfven Eigenmodes during Minor Disruptions in Ohmic Plasmas

    Authors: Yangqing Liu, Yi Tan, Zhe Gao, Yuhong Xu, Youjun Hu, Song Chai, Yanzheng Jiang, Rui Ke, Heng Zhong, Wenhao Wang

    Abstract: Toroidal Alfven eigenmodes (TAEs) excited in purely ohmically heated plasmas without any auxiliary heating have been identified for the first time in the SUNIST spherical tokamak. The TAE modes are observed during minor disruptions and have a frequency range of 150-500 kHz. The mode structure analysis indicates the existence of both m/n=-3/-1 and -4/-1 harmonics, propagating in the electron diamag…

    Submitted 13 December, 2016; originally announced December 2016.

    Comments: 10 pages, 4 figures, accepted by Physics of Plasmas

  47. Resonant transfer of large momenta from finite duration pulse sequences

    Authors: Julia Fekete, Shijie Chai, Simon A. Gardiner, Mikkel F. Andersen

    Abstract: We experimentally investigate the atom optics kicked particle at quantum resonance using finite duration kicks. Even though the underlying process is quantum interference it can be well described by an $ε$-pseudoclassical model. The $ε$-pseudoclassical model agrees well with our experiments for a wide range of parameters. We investigate the parameters yielding maximal momentum transfer to the atom…

    Submitted 1 February, 2017; v1 submitted 26 September, 2016; originally announced September 2016.

    Journal ref: Phys. Rev. A 95, 033601 (2017)

  48. Electromagnon in the Z-type hexaferrite $({\rm Ba}_{x}{\rm Sr}_{1-x})_3\rm Co_2Fe_{24}O_{41}$

    Authors: Filip Kadlec, Christelle Kadlec, Jakub Vit, Fedir Borodavka, Martin Kempa, Jan Prokleska, Josef Bursik, Robert Uhrecky, Stephane Rols, Yi Sheng Chai, Kun Zhai, Young Sun, Jan Drahokoupil, Veronica Goian, Stanislav Kamba

    Abstract: We studied experimentally the high-temperature magnetoelectric $({\rm Ba}_{x}{\rm Sr}_{1-x})_3\rm Co_2Fe_{24}O_{41}$ prepared as ceramics (x = 0, 0.2) and a single crystal (x = 0.5) using inelastic neutron scattering, THz time-domain, Raman and far-infrared spectroscopies. The spectra, measured with varying temperature and magnetic field, reveal rich information about the collective spin and latti…

    Submitted 20 July, 2016; originally announced July 2016.

    Comments: Paper 8 pages + Suppl. Materials 3 pages

    Journal ref: Phys. Rev. B 94, 024419 (2016)

  49. arXiv:1504.05282  [pdf]

    physics.acc-ph physics.ins-det

    Intelligent Low-level RF System by Non-destructive Beam Monitoring Device for Cyclotrons

    Authors: M. S. Sharifi Asadi Malafeh, M. Ghergherehchi, H. Afarideh, J. S. Chai

    Abstract: The project of a 10 MeV PET cyclotron accelerator for medical diagnosis and treatment was started at Amirkabir University of Technology in 2012. The low-level RF system of the cyclotron accelerator is designed to stabilize the acceleration voltage and control the resonance frequency of the cavity. In this work, an Intelligent Low Level Radio Frequency Circuit, or ILLRF, suitable for most of the AVF cyclotron acce…

    Submitted 20 April, 2015; originally announced April 2015.

  50. arXiv:1407.3016  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Magnetic domain-wall motion twisted by nanoscale probe-induced spin transfer

    Authors: J. Wang, L. S. Xie, C. S. Wang, H. Z. Zhang, L. Shu, J. Bai, Y. S. Chai, X. Zhao, J. C. Nie, C. B. Cao, C. Z. Gu, C. M. Xiong, Y. Sun, J. Shi, S. Salahuddin, K. Xia, C. W. Nan, J. X. Zhang

    Abstract: A method for deterministic control of the magnetic order parameter using an electrical stimulus is highly desired for the new generation of spintronic and magnetoelectronic devices. Much effort has been focused on magnetic domain-wall motion manipulated by a successive injection of spin-polarized current into a magnetic nanostructure. However, an integrant high-threshold current density of $10^7$~$10^8$…

    Submitted 10 July, 2014; originally announced July 2014.