Skip to main content

Showing 1–6 of 6 results for author: Weng, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.17297  [pdf, other

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang , et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  2. arXiv:2401.11240  [pdf, other

    cs.DC

    CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference

    Authors: Suyi Li, Hanfeng Lu, Tianyuan Wu, Minchen Yu, Qizhen Weng, Xusheng Chen, Yizhou Shan, Binhang Yuan, Wei Wang

    Abstract: Pre-trained large language models (LLMs) often need specialization for domain-specific tasks. Low-Rank Adaptation (LoRA) is a popular approach that adapts a base model to multiple tasks by adding lightweight trainable adapters. In this paper, we present CaraServe, a system that efficiently serves many LoRA adapters derived from a common base model. CaraServe maintains the base model on GPUs and dy… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

  3. Image Blending Algorithm with Automatic Mask Generation

    Authors: Haochen Xue, Mingyu **, Chong Zhang, Yuxuan Huang, Qian Weng, Xiaobo **

    Abstract: In recent years, image blending has gained popularity for its ability to create visually stunning content. However, the current image blending algorithms mainly have the following problems: manually creating image blending masks requires a lot of manpower and material resources; image blending algorithms cannot effectively solve the problems of brightness distortion and low resolution. To this end… ▽ More

    Submitted 29 November, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: 14 pages, 8 figures

    Journal ref: International Conference on Neural Information Processing 2023

  4. arXiv:2208.11212  [pdf, other

    cs.LG

    DeepPicarMicro: Applying TinyML to Autonomous Cyber Physical Systems

    Authors: Michael Bechtel, QiTao Weng, Heechul Yun

    Abstract: Running deep neural networks (DNNs) on tiny Micro-controller Units (MCUs) is challenging due to their limitations in computing, memory, and storage capacity. Fortunately, recent advances in both MCU hardware and machine learning software frameworks make it possible to run fairly complex neural networks on modern MCUs, resulting in a new field of study widely known as TinyML. However, there have be… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: RTCSA 2022

  5. arXiv:2103.15383  [pdf, other

    cs.CV

    Selective Output Smoothing Regularization: Regularize Neural Networks by Softening Output Distributions

    Authors: Xuan Cheng, Tianshu Xie, Xiaomin Wang, Qifeng Weng, Minghui Liu, Jiali Deng, Ming Liu

    Abstract: In this paper, we propose Selective Output Smoothing Regularization, a novel regularization method for training the Convolutional Neural Networks (CNNs). Inspired by the diverse effects on training from different samples, Selective Output Smoothing Regularization improves the performance by encouraging the model to produce equal logits on incorrect classes when dealing with samples that the model… ▽ More

    Submitted 29 March, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

  6. arXiv:1806.02508  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    Semi-Dynamic Load Balancing: Efficient Distributed Learning in Non-Dedicated Environments

    Authors: Chen Chen, Qizhen Weng, Wei Wang, Baochun Li, Bo Li

    Abstract: Machine learning (ML) models are increasingly trained in clusters with non-dedicated workers possessing heterogeneous resources. In such scenarios, model training efficiency can be negatively affected by stragglers -- workers that run much slower than others. Efficient model training requires eliminating such stragglers, yet for modern ML workloads, existing load balancing strategies are inefficie… ▽ More

    Submitted 8 December, 2020; v1 submitted 7 June, 2018; originally announced June 2018.