Skip to main content

Showing 1–8 of 8 results for author: Yan, F Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.01617  [pdf, other

    cs.NI cs.LG cs.MM

    LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models

    Authors: Zhiyuan He, Aashish Gottipati, Lili Qiu, Francis Y. Yan, Xufang Luo, Kenuo Xu, Yuqing Yang

    Abstract: We present LLM-ABR, the first system that utilizes the generative capabilities of large language models (LLMs) to autonomously design adaptive bitrate (ABR) algorithms tailored for diverse network characteristics. Operating within a reinforcement learning framework, LLM-ABR empowers LLMs to design key components such as states and neural network architectures. We evaluate LLM-ABR across diverse ne… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  2. arXiv:2403.06324  [pdf, other

    cs.NI cs.MM

    ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge

    Authors: Sami Khairy, Gabriel Mittag, Vishak Gopal, Francis Y. Yan, Zhixiong Niu, Ezra Ameri, Scott Inglis, Mehrsa Golestaneh, Ross Cutler

    Abstract: The quality of experience (QoE) delivered by video conferencing systems to end users depends in part on correctly estimating the capacity of the bottleneck link between the sender and the receiver over time. Bandwidth estimation for real-time communications (RTC) remains a significant challenge, primarily due to the continuously evolving heterogeneous network architectures and technologies. From t… ▽ More

    Submitted 15 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  3. arXiv:2305.12333  [pdf, other

    cs.MM cs.AI cs.NI

    GRACE: Loss-Resilient Real-Time Video through Neural Codecs

    Authors: Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, Yue Zhang, Qizheng Zhang, Yuhan Liu, Xu Zhang, Francis Y. Yan, Amrita Mazumdar, Nick Feamster, Junchen Jiang

    Abstract: In real-time video communication, retransmitting lost packets over high-latency networks is not viable due to strict latency requirements. To counter packet losses without retransmission, two primary strategies are employed -- encoder-based forward error correction (FEC) and decoder-based error concealment. The former encodes data with redundancy before transmission, yet determining the optimal re… ▽ More

    Submitted 12 March, 2024; v1 submitted 20 May, 2023; originally announced May 2023.

  4. arXiv:2212.12180  [pdf, other

    cs.DC cs.LG

    Autothrottle: A Practical Bi-Level Approach to Resource Management for SLO-Targeted Microservices

    Authors: Zibo Wang, **he Li, Chieh-Jan Mike Liang, Feng Wu, Francis Y. Yan

    Abstract: Achieving resource efficiency while preserving end-user experience is non-trivial for cloud application operators. As cloud applications progressively adopt microservices, resource managers are faced with two distinct levels of system behavior: end-to-end application latency and per-service resource usage. Translating between the two levels, however, is challenging because user requests traverse h… ▽ More

    Submitted 14 April, 2024; v1 submitted 23 December, 2022; originally announced December 2022.

    Comments: Accepted by USENIX NSDI '24

  5. arXiv:2210.13763  [pdf, other

    cs.NI cs.LG

    Teal: Learning-Accelerated Optimization of WAN Traffic Engineering

    Authors: Zhiying Xu, Francis Y. Yan, Rachee Singh, Justin T. Chiu, Alexander M. Rush, Minlan Yu

    Abstract: The rapid expansion of global cloud wide-area networks (WANs) has posed a challenge for commercial optimization engines to efficiently solve network traffic engineering (TE) problems at scale. Existing acceleration strategies decompose TE optimization into concurrent subproblems but realize limited parallelism due to an inherent tradeoff between run time and allocation performance. We present Te… ▽ More

    Submitted 19 May, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

  6. arXiv:2202.05940  [pdf, other

    cs.NI

    Automatic Curriculum Generation for Learning Adaptation in Networking

    Authors: Zhengxu Xia, Yajie Zhou, Francis Y. Yan, Junchen Jiang

    Abstract: As deep reinforcement learning (RL) showcases its strengths in networking and systems, its pitfalls also come to the public's attention--when trained to handle a wide range of network workloads and previously unseen deployment environments, RL policies often manifest suboptimal performance and poor generalizability. To tackle these problems, we present Genet, a new training framework for learnin… ▽ More

    Submitted 8 September, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: Accepted by SIGCOMM'22

  7. arXiv:2011.09611  [pdf, other

    cs.NI

    Implementing BOLA-BASIC on Puffer: Lessons for the use of SSIM in ABR logic

    Authors: Emily Marx, Francis Y. Yan, Keith Winstein

    Abstract: One ABR algorithm implemented on Puffer is BOLA-BASIC, the simplest variant of BOLA. BOLA finds wide use in industry, notably in the MPEG-DASH reference player used as the basis for video players at Akamai, BBC, Orange, and CBS. The overall goal of BOLA is to maximize each encoded chunk's video quality while minimizing rebuffering. To measure video quality, Puffer uses the structural similarity me… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

  8. arXiv:1906.01113  [pdf, other

    cs.NI

    Learning in situ: a randomized experiment in video streaming

    Authors: Francis Y. Yan, Hudson Ayers, Chenzhi Zhu, Sadjad Fouladi, James Hong, Keyi Zhang, Philip Levis, Keith Winstein

    Abstract: We describe the results of a randomized controlled trial of video-streaming algorithms for bitrate selection and network prediction. Over the last eight months, we have streamed 14.2 years of video to 56,000 users across the Internet. Sessions are randomized in blinded fashion among algorithms, and client telemetry is recorded for analysis. We found that in this real-world setting, it is difficu… ▽ More

    Submitted 23 September, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Journal ref: USENIX NSDI (2020) 495-511