Showing 1–27 of 27 results for author: Weng, S

  1. arXiv:2406.19002  [pdf, ps, other]

    cs.IT

    Coded Cooperative Networks for Semi-Decentralized Federated Learning

    Authors: Shudi Weng, Ming Xiao, Mikael Skoglund

    Abstract: To enhance straggler resilience in federated learning (FL) systems, a semi-decentralized approach has been recently proposed, enabling collaboration between clients. Unlike the existing semi-decentralized schemes, which adaptively adjust the collaboration weight according to the network topology, this letter proposes a deterministic coded network that leverages wireless diversity for semi-decentra…

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.15734  [pdf, other]

    cs.CL cs.AI

    RankAdaptor: Hierarchical Dynamic Low-Rank Adaptation for Structural Pruned LLMs

    Authors: Changhai Zhou, Shijie Han, Shiyang Zhang, Shichao Weng, Zekai Liu, Cheng Jin

    Abstract: The efficient compression of large language models (LLMs) is becoming increasingly popular. However, recovering the accuracy of compressed LLMs is still a major challenge. Structural pruning with standard Low-Rank Adaptation (LoRA) is a common technique in current LLM compression. In structural pruning, the model architecture is modified unevenly, resulting in suboptimal performance in various dow…

    Submitted 22 June, 2024; originally announced June 2024.

  3. arXiv:2405.05001  [pdf, other]

    cs.CV

    HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution

    Authors: Shu-Chuan Chu, Zhi-Chao Dou, Jeng-Shyang Pan, Shaowei Weng, Junbao Li

    Abstract: Transformer-based methods have demonstrated excellent performance on super-resolution visual tasks, surpassing conventional convolutional neural networks. However, existing work typically restricts self-attention computation to non-overlapping windows to save computational costs. This means that Transformer-based networks can only use input information from a limited spatial range. Therefore, a no…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 12 pages, 10 figures, conference

  4. arXiv:2403.06536  [pdf, other]

    cs.CV

    Multi-Scale Implicit Transformer with Re-parameterize for Arbitrary-Scale Super-Resolution

    Authors: Jinchen Zhu, Mingjian Zhang, Ling Zheng, Shizhuang Weng

    Abstract: Recently, the methods based on implicit neural representations have shown excellent capabilities for arbitrary-scale super-resolution (ASSR). Although these methods represent the features of an image by generating latent codes, these latent codes are difficult to adapt for different magnification factors of super-resolution, which seriously affects their performance. Addressing this, we design Mul…

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Super-resolution, Arbitrary-Scale Super-Resolution, Multi-Scale, Transformer

  5. arXiv:2402.18147  [pdf, other]

    eess.IV cs.CV

    A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction

    Authors: Shyang-En Weng, Shaou-Gang Miaou, Ricky Christanto

    Abstract: Human vision relies heavily on available ambient light to perceive objects. Low-light scenes pose two distinct challenges: information loss due to insufficient illumination and undesirable brightness shifts. Low-light image enhancement (LLIE) refers to image enhancement technology tailored to handle this scenario. We introduce CPGA-Net, an innovative LLIE network that combines dark/bright channel…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Preprint of an article submitted for consideration in [International Journal of Pattern Recognition and Artificial Intelligence] © [2024] [copyright World Scientific Publishing Company] [https://www.worldscientific.com/worldscinet/ijprai]

  6. arXiv:2402.12184  [pdf, other]

    cs.CV

    Colorizing Monochromatic Radiance Fields

    Authors: Yean Cheng, Renjie Wan, Shuchen Weng, Chengxuan Zhu, Yakun Chang, Boxin Shi

    Abstract: Though Neural Radiance Fields (NeRF) can produce colorful 3D representations of the world by using a set of 2D images, such ability becomes non-existent when only monochromatic images are provided. Since color is necessary in representing the world, reproducing color from monochromatic radiance fields becomes crucial. To achieve this goal, instead of manipulating the monochromatic radiance fields…

    Submitted 19 February, 2024; originally announced February 2024.

  7. arXiv:2402.11874  [pdf, other]

    cs.CV

    Language-guided Image Reflection Separation

    Authors: Haofeng Zhong, Yuchen Hong, Shuchen Weng, Jinxiu Liang, Boxin Shi

    Abstract: This paper studies the problem of language-guided reflection separation, which aims at addressing the ill-posed reflection separation problem by introducing language descriptions to provide layer content. We propose a unified framework to solve this problem, which leverages the cross-attention mechanism with contrastive learning strategies to construct the correspondence between language descripti…

    Submitted 4 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  8. arXiv:2311.10746  [pdf, other]

    cs.CY cs.CL cs.LG

    EIT: Earnest Insight Toolkit for Evaluating Students' Earnestness in Interactive Lecture Participation Exercises

    Authors: Mihran Miroyan, Shiny Weng, Rahul Shah, Lisa Yan, Narges Norouzi

    Abstract: In today's rapidly evolving educational landscape, traditional modes of passive information delivery are giving way to transformative pedagogical approaches that prioritize active student engagement. Within the context of large-scale hybrid classrooms, the challenge lies in fostering meaningful and active interaction between students and course content. This study delves into the significance of m…

    Submitted 31 October, 2023; originally announced November 2023.

  9. Stringesthesia: Dynamically Shifting Musical Agency Between Audience and Performer Based on Trust in an Interactive and Improvised Performance

    Authors: Torin Hopkins, Emily Doherty, Netta Ofer, Suibi Che Chuan Weng, Peter Gyrory, Chad Tobin, Leanne Hirshfield, Ellen Yi-Luen Do

    Abstract: This paper introduces Stringesthesia, an interactive and improvised performance paradigm. Stringesthesia uses real-time neuroimaging to connect performers and audiences, enabling direct access to the performers' mental state and determining audience participation during the performance. Functional near-infrared spectroscopy, or fNIRS, a noninvasive neuroimaging tool, was used to assess metabolic ac…

    Submitted 11 September, 2023; originally announced September 2023.

    Journal ref: Audio Mostly 2023, Edinburgh, UK

  10. arXiv:2309.00842  [pdf, other]

    cs.HC

    DualStream: Spatially Sharing Selves and Surroundings using Mobile Devices and Augmented Reality

    Authors: Rishi Vanukuru, Suibi Che-Chuan Weng, Krithik Ranjan, Torin Hopkins, Amy Banic, Mark D. Gross, Ellen Yi-Luen Do

    Abstract: In-person human interaction relies on our spatial perception of each other and our surroundings. Current remote communication tools partially address each of these aspects. Video calls convey real user representations but without spatial interactions. Augmented and Virtual Reality (AR/VR) experiences are immersive and spatial but often use virtual environments and characters instead of real-life r…

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: 10 pages, 4 figures, 1 table; To appear in the proceedings of the IEEE International Symposium on Mixed and Augmented Reality (ISMAR) 2023

  11. arXiv:2305.15217  [pdf, other]

    cs.CV cs.AI

    L-CAD: Language-based Colorization with Any-level Descriptions using Diffusion Priors

    Authors: Zheng Chang, Shuchen Weng, Peixuan Zhang, Yu Li, Si Li, Boxin Shi

    Abstract: Language-based colorization produces plausible and visually pleasing colors under the guidance of user-friendly natural language descriptions. Previous methods implicitly assume that users provide comprehensive color descriptions for most of the objects in the image, which leads to suboptimal performance. In this paper, we propose a unified model to perform language-based colorization with any-lev…

    Submitted 23 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  12. arXiv:2305.11403  [pdf, other]

    cs.CV

    Efficient Mixed Transformer for Single Image Super-Resolution

    Authors: Ling Zheng, Jinchen Zhu, Jinpeng Shi, Shizhuang Weng

    Abstract: Recently, Transformer-based methods have achieved impressive results in single image super-resolution (SISR). However, the lack of locality mechanism and high complexity limit their application in the field of super-resolution (SR). To solve these problems, we propose a new method, Efficient Mixed Transformer (EMT) in this study. Specifically, we propose the Mixed Transformer Block (MTB), consisti…

    Submitted 19 June, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Super-resolution, Long-range attention, Transformer, Locality

  13. arXiv:2301.09869  [pdf, other]

    cs.CV

    Image Super-Resolution using Efficient Striped Window Transformer

    Authors: Jinpeng Shi, Hui Li, Tianle Liu, Yulong Liu, Mingjian Zhang, Jinchen Zhu, Ling Zheng, Shizhuang Weng

    Abstract: Transformers have achieved remarkable results in single-image super-resolution (SR). However, the challenge of balancing model performance and complexity has hindered their application in lightweight SR (LSR). To tackle this challenge, we propose an efficient striped window transformer (ESWT). We revisit the normalization layer in the transformer and design a concise and efficient transformer stru…

    Submitted 14 March, 2023; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: SOTA lightweight super-resolution transformer. 8 pages, 9 figures and 6 tables. The Code is available at https://github.com/Fried-Rice-Lab/FriedRiceLab

  14. arXiv:2301.00062  [pdf]

    quant-ph cs.CR

    FIPS Compliant Quantum Secure Communication using Quantum Permutation Pad

    Authors: Alex He, Dafu Lou, Eric She, Shangjie Guo, Hareesh Watson, Sibyl Weng, Maria Perepechaenko, Rand Kuang

    Abstract: Quantum computing has entered fast development track since Shor's algorithm was proposed in 1994. Multi-cloud services of quantum computing farms are currently available. One of which, IBM quantum computing, presented a road map showing their Kookaburra system with over 4158 qubits will be available in 2025. For the standardization of Post-Quantum Cryptography or PQC, the National Institute of Sta…

    Submitted 28 December, 2023; v1 submitted 30 December, 2022; originally announced January 2023.

    Comments: 6 pages, 3 figures, to be submitted for a conference

  15. arXiv:2210.00198  [pdf, other]

    cs.CG

    Closed cap condition under the cap construction algorithm

    Authors: Mercedes Sandu, Shuyi Weng, Jade Zhang

    Abstract: Every polygon $P$ can be companioned by a cap polygon $\hat P$ such that $P$ and $\hat P$ serve as two parts of the boundary surface of a polyhedron $V$. Pairs of vertices on $P$ and $\hat P$ are identified successively to become vertices of $V$. In this paper, we study the cap construction that asserts equal angular defects at these pairings. We exhibit a linear relation that arises from the cap…

    Submitted 11 June, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 13 pages, 8 figures, accepted by Involve

  16. arXiv:2103.05767  [pdf]

    cs.CR cs.AI cs.LG

    ZYELL-NCTU NetTraffic-1.0: A Large-Scale Dataset for Real-World Network Anomaly Detection

    Authors: Lei Chen, Shao-En Weng, Chu-Jun Peng, Hong-Han Shuai, Wen-Huang Cheng

    Abstract: Network security has been an active research topic for long. One critical issue is improving the anomaly detection capability of intrusion detection systems (IDSs), such as firewalls. However, existing network anomaly datasets are out of date (i.e., being collected many years ago) or IP-anonymized, making the data characteristics differ from today's network. Therefore, this work introduces a new,…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 2 pages, 3 tables, 1 figure

  17. arXiv:2006.01189  [pdf, other]

    cs.CL

    An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features

    Authors: Shi-Yan Weng, Tien-Hong Lo, Berlin Chen

    Abstract: Tremendous amounts of multimedia associated with speech information are driving an urgent need to develop efficient and effective automatic summarization methods. To this end, we have seen rapid progress in applying supervised deep neural network-based methods to extractive speech summarization. More recently, the Bidirectional Encoder Representations from Transformers (BERT) model was proposed an…

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted by EUSIPCO 2020

  18. arXiv:2005.08440  [pdf]

    eess.AS cs.CL cs.SD

    An Effective End-to-End Modeling Approach for Mispronunciation Detection

    Authors: Tien-Hong Lo, Shi-Yan Weng, Hsiu-Jui Chang, Berlin Chen

    Abstract: Recently, end-to-end (E2E) automatic speech recognition (ASR) systems have garnered tremendous attention because of their great success and unified modeling paradigms in comparison to conventional hybrid DNN-HMM ASR systems. Despite the widespread adoption of E2E modeling frameworks on ASR, there still is a dearth of work on investigating the E2E frameworks for use in computer-assisted pronunciati…

    Submitted 17 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020

  19. arXiv:2005.08433  [pdf, other]

    eess.AS cs.CL cs.SD

    The NTNU System at the Interspeech 2020 Non-Native Children's Speech ASR Challenge

    Authors: Tien-Hong Lo, Fu-An Chao, Shi-Yan Weng, Berlin Chen

    Abstract: This paper describes the NTNU ASR system participating in the Interspeech 2020 Non-Native Children's Speech ASR Challenge supported by the SIG-CHILD group of ISCA. This ASR shared task is made much more challenging due to the coexisting diversity of non-native and children speaking characteristics. In the setting of closed-track evaluation, all participants were restricted to develop their systems…

    Submitted 2 June, 2020; v1 submitted 17 May, 2020; originally announced May 2020.

    Comments: Submitted to Interspeech 2020 Special Session: Shared Task on Automatic Speech Recognition for Non-Native Children's Speech

  20. arXiv:1908.00966  [pdf, other]

    cs.LG math.CO stat.ML

    Mixed-Integer Optimization Approach to Learning Association Rules for Unplanned ICU Transfer

    Authors: Chun-An Chou, Qingtao Cao, Shao-Jen Weng, Che-Hung Tsai

    Abstract: After admission to emergency department (ED), patients with critical illnesses are transferred to intensive care unit (ICU) due to unexpected clinical deterioration occurrence. Identifying such unplanned ICU transfers is urgently needed for medical physicians to achieve two-fold goals: improving critical care quality and preventing mortality. A priority task is to understand the crucial rationale…

    Submitted 2 August, 2019; originally announced August 2019.

    Journal ref: Artificial Intelligence in Medicine, 2020

  21. arXiv:1805.07740  [pdf, other]

    cs.CV

    STS Classification with Dual-stream CNN

    Authors: Shuchen Weng, Wenbo Li, Yi Zhang, Siwei Lyu

    Abstract: The structured time series (STS) classification problem requires the modeling of interweaved spatiotemporal dependency. Most previous STS classification methods model the spatial and temporal dependencies independently. Due to the complexity of the STS data, we argue that a desirable STS classification method should be a holistic framework that can be made as adaptive and flexible as possible. Thi…

    Submitted 20 May, 2018; originally announced May 2018.

  22. arXiv:1611.00692  [pdf]

    cs.PL

    Towards Automatic Resource Bound Analysis for OCaml

    Authors: Jan Hoffmann, Ankush Das, Shu-Chun Weng

    Abstract: This article presents a resource analysis system for OCaml programs. This system automatically derives worst-case resource bounds for higher-order polymorphic programs with user-defined inductive types. The technique is parametric in the resource and can derive bounds for time, memory allocations and energy usage. The derived bounds are multivariate resource polynomials which are functions of diff…

    Submitted 2 November, 2016; originally announced November 2016.

    Comments: 74 pages, technical report, short version accepted at POPL 2017

  23. arXiv:1511.04519  [pdf, ps, other]

    cs.CE cs.DC math.NA

    MATEX: A Distributed Framework for Transient Simulation of Power Distribution Networks

    Authors: Hao Zhuang, Shih-Hung Weng, Jeng-Hau Lin, Chung-Kuan Cheng

    Abstract: We proposed MATEX, a distributed framework for transient simulation of power distribution networks (PDNs). MATEX utilizes matrix exponential kernel with Krylov subspace approximations to solve differential equations of linear circuit. First, the whole simulation task is divided into subtasks based on decompositions of current sources, in order to reduce the computational overheads. Then these subt…

    Submitted 14 November, 2015; originally announced November 2015.

    Comments: ACM/IEEE DAC 2014. arXiv admin note: substantial text overlap with arXiv:1505.06699

  24. arXiv:1507.06711  [pdf, other]

    cs.SD cs.CL

    The SYSU System for the Interspeech 2015 Automatic Speaker Verification Spoofing and Countermeasures Challenge

    Authors: Shitao Weng, Shushan Chen, Lei Yu, Xuewei Wu, Weicheng Cai, Zhi Liu, Ming Li

    Abstract: Many existing speaker verification systems are reported to be vulnerable against different spoofing attacks, for example speaker-adapted speech synthesis, voice conversion, play back, etc. In order to detect these spoofed speech signals as a countermeasure, we propose a score level fusion approach with several different i-vector subsystems. We show that the acoustic level Mel-frequency cepstral co…

    Submitted 29 July, 2015; v1 submitted 23 July, 2015; originally announced July 2015.

    Comments: 5 pages, 1 figure

  25. arXiv:1505.06699  [pdf, ps, other]

    cs.CE cs.DC math.DS math.NA

    Simulation Algorithms with Exponential Integration for Time-Domain Analysis of Large-Scale Power Delivery Networks

    Authors: Hao Zhuang, Wenjian Yu, Shih-Hung Weng, Ilgweon Kang, Jeng-Hau Lin, Xiang Zhang, Ryan Coutts, Chung-Kuan Cheng

    Abstract: We design an algorithmic framework using matrix exponentials for time-domain simulation of power delivery network (PDN). Our framework can reuse factorized matrices to simulate the large-scale linear PDN system with variable stepsizes. In contrast, current conventional PDN simulation solvers have to use fixed step-size approach in order to reuse factorized matrices generated by the expensive matri…

    Submitted 1 February, 2016; v1 submitted 25 May, 2015; originally announced May 2015.

    Comments: Accepted by IEEE Transactions on Computer Aided Design of Integrated Circuits and Systems (TCAD)

  26. arXiv:1309.5333  [pdf, ps, other]

    cs.CE math.DS math.NA

    Power Grid Simulation using Matrix Exponential Method with Rational Krylov Subspaces

    Authors: Hao Zhuang, Shih-Hung Weng, Chung-Kuan Cheng

    Abstract: One well adopted power grid simulation methodology is to factorize matrix once and perform only backward forward substitution with a deliberately chosen step size along the simulation. Since the required simulation time is usually long for the power grid design, the costly factorization is amortized. However, such fixed step size cannot exploit larger step size for the low frequency response in th…

    Submitted 14 October, 2013; v1 submitted 20 September, 2013; originally announced September 2013.

  27. arXiv:1005.3450  [pdf, ps, other]

    cs.OS cs.DC

    Efficient System-Enforced Deterministic Parallelism

    Authors: Amittai Aviram, Shu-Chun Weng, Sen Hu, Bryan Ford

    Abstract: Deterministic execution offers many benefits for debugging, fault tolerance, and security. Running parallel programs deterministically is usually difficult and costly, however - especially if we desire system-enforced determinism, ensuring precise repeatability of arbitrarily buggy or malicious software. Determinator is a novel operating system that enforces determinism on both multithreaded and m…

    Submitted 19 May, 2010; originally announced May 2010.

    Comments: 14 pages, 12 figures, 3 tables