Skip to main content

Showing 1–8 of 8 results for author: Che, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05962  [pdf, other

    cs.DC cs.DB

    Data Caching for Enterprise-Grade Petabyte-Scale OLAP

    Authors: Chunxu Tang, Bin Fan, **g Zhao, Chen Liang, Yi Wang, Beinan Wang, Ziyue Qiu, Lu Qiu, Bowen Ding, Shouzhuo Sun, Saiguang Che, Jiaming Mai, Shouwei Chen, Yu Zhu, Jianjian Xie, Yutian, Sun, Yao Li, Yangjun Zhang, Ke Wang, Mingmin Chen

    Abstract: With the exponential growth of data and evolving use cases, petabyte-scale OLAP data platforms are increasingly adopting a model that decouples compute from storage. This shift, evident in organizations like Uber and Meta, introduces operational challenges including massive, read-heavy I/O traffic with potential throttling, as well as skewed and fragmented data access patterns. Addressing these ch… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted to the USENIX Annual Technical Conference (USENIX ATC) 2024

  2. Fermihedral: On the Optimal Compilation for Fermion-to-Qubit Encoding

    Authors: Yuhao Liu, Shize Che, Junyu Zhou, Yunong Shi, Gushu Li

    Abstract: This paper introduces Fermihedral, a compiler framework focusing on discovering the optimal Fermion-to-qubit encoding for targeted Fermionic Hamiltonians. Fermion-to-qubit encoding is a crucial step in harnessing quantum computing for efficient simulation of Fermionic quantum systems. Utilizing Pauli algebra, Fermihedral redefines complex constraints and objectives of Fermion-to-qubit encoding int… ▽ More

    Submitted 26 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

    Journal ref: ASPLOS 2024

  3. arXiv:2309.09825  [pdf, other

    cs.AI

    Bias of AI-Generated Content: An Examination of News Produced by Large Language Models

    Authors: Xiao Fang, Shangkun Che, Minjia Mao, Hongzhe Zhang, Ming Zhao, Xiaohang Zhao

    Abstract: Large language models (LLMs) have the potential to transform our lives and work through the content they generate, known as AI-Generated Content (AIGC). To harness this transformation, we need to understand the limitations of LLMs. Here, we investigate the bias of AIGC produced by seven representative LLMs, including ChatGPT and LLaMA. We collect news articles from The New York Times and Reuters,… ▽ More

    Submitted 3 April, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

  4. arXiv:2308.01320  [pdf, other

    cs.LG cs.AI cs.CL

    DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales

    Authors: Zhewei Yao, Reza Yazdani Aminabadi, Olatunji Ruwase, Samyam Rajbhandari, Xiaoxia Wu, Ammar Ahmad Awan, Jeff Rasley, Minjia Zhang, Conglong Li, Connor Holmes, Zhongzhu Zhou, Michael Wyatt, Molly Smith, Lev Kurilenko, Heyang Qin, Masahiro Tanaka, Shuai Che, Shuaiwen Leon Song, Yuxiong He

    Abstract: ChatGPT-like models have revolutionized various applications in artificial intelligence, from summarization and coding to translation, matching or even surpassing human performance. However, the current landscape lacks an accessible, efficient, and cost-effective end-to-end RLHF (Reinforcement Learning with Human Feedback) training pipeline for these powerful models, particularly when training at… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 14 pages, 7 figures

  5. arXiv:2105.01603  [pdf, other

    cs.AI cs.LG

    Federated Multi-View Learning for Private Medical Data Integration and Analysis

    Authors: Sicong Che, Hao Peng, Lichao Sun, Yong Chen, Lifang He

    Abstract: Along with the rapid expansion of information technology and digitalization of health data, there is an increasing concern on maintaining data privacy while garnering the benefits in medical field. Two critical challenges are identified: Firstly, medical data is naturally distributed across multiple local sites, making it difficult to collectively train machine learning models without data leakage… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

    Comments: 22 pages, 6 figures, journal

    ACM Class: I.2; J.3; I.5.2

  6. arXiv:1908.10834  [pdf, other

    cs.DC cs.LG

    AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing

    Authors: Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steve Reinhardt, Martin Herbordt

    Abstract: Deep learning systems have been successfully applied to Euclidean data such as images, video, and audio. In many applications, however, information and their relationships are better expressed with graphs. Graph Convolutional Networks (GCNs) appear to be a promising approach to efficiently learn from graph data structures, having shown advantages in many critical applications. As with other deep l… ▽ More

    Submitted 10 September, 2020; v1 submitted 23 August, 2019; originally announced August 2019.

  7. Software-defined Design Space Exploration for an Efficient DNN Accelerator Architecture

    Authors: Ye Yu, Yingmin Li, Shuai Che, Niraj K. Jha, Weifeng Zhang

    Abstract: Deep neural networks (DNNs) have been shown to outperform conventional machine learning algorithms across a wide range of applications, e.g., image recognition, object detection, robotics, and natural language processing. However, the high computational complexity of DNNs often necessitates extremely fast and efficient hardware. The problem gets worse as the size of neural networks grows exponenti… ▽ More

    Submitted 16 January, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

  8. arXiv:1901.10997  [pdf, other

    cs.NE

    Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM

    Authors: Hongxu Yin, Guoyang Chen, Yingmin Li, Shuai Che, Weifeng Zhang, Niraj K. Jha

    Abstract: Many long short-term memory (LSTM) applications need fast yet compact models. Neural network compression approaches, such as the grow-and-prune paradigm, have proved to be promising for cutting down network complexity by skip** insignificant weights. However, current compression strategies are mostly hardware-agnostic and network complexity reduction does not always translate into execution effi… ▽ More

    Submitted 30 January, 2019; originally announced January 2019.