Showing 1–20 of 20 results for author: Guo, E

Searching in archive cs.
  1. arXiv:2405.10550  [pdf, other]

    eess.IV cs.CV

    LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion

    Authors: Tong Chen, Qingcheng Lyu, Long Bai, Erjian Guo, Huxin Gao, Xiaoxiao Yang, Hongliang Ren, Luping Zhou

    Abstract: Advances in endoscopy use in surgeries face challenges like inadequate lighting. Deep learning, notably the Denoising Diffusion Probabilistic Model (DDPM), holds promise for low-light image enhancement in the medical field. However, DDPMs are computationally demanding and slow, limiting their practical medical applications. To bridge this gap, we propose a lightweight DDPM, dubbed LighTDiff. It ad…

    Submitted 17 May, 2024; originally announced May 2024.

  2. arXiv:2401.05584  [pdf]

    cs.CV cs.AI

    FourCastNeXt: Optimizing FourCastNet Training for Limited Compute

    Authors: Edison Guo, Maruf Ahmed, Yue Sun, Rui Yang, Harrison Cook, Tennessee Leeuwenburg, Ben Evans

    Abstract: FourCastNeXt is an optimization of FourCastNet - a global machine learning weather forecasting model - that performs with a comparable level of accuracy and can be trained using around 5% of the original FourCastNet computational requirements. This technical report presents strategies for model optimization that maintain similar performance as measured by the root-mean-square error (RMSE) of the m…

    Submitted 20 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Major revision. All prior content (text, figures, table) has been updated. Additionally, new text, tables and figures have been added. Updated title. Updated author list

  3. arXiv:2312.12655  [pdf, other]

    cs.LG cs.AI cs.CL

    Can Transformers Learn Sequential Function Classes In Context?

    Authors: Ryan Campbell, Emma Guo, Evan Hu, Reya Vir, Ethan Hsiao

    Abstract: In-context learning (ICL) has revolutionized the capabilities of transformer models in NLP. In our project, we extend the understanding of the mechanisms underpinning ICL by exploring whether transformers can learn from sequential, non-textual function class data distributions. We introduce a novel sliding window sequential function class and employ toy-sized transformers with a GPT-2 architecture…

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 8 pages, 8 figures

  4. arXiv:2311.16580  [pdf, other]

    cs.CV

    Clean Label Disentangling for Medical Image Segmentation with Noisy Labels

    Authors: Zicheng Wang, Zhen Zhao, Erjian Guo, Luping Zhou

    Abstract: Current methods focusing on medical image segmentation suffer from incorrect annotations, which is known as the noisy label issue. Most medical image segmentation with noisy labels methods utilize either noise transition matrix, noise-robust loss functions or pseudo-labeling methods, while none of the current research focuses on clean label disentanglement. We argue that the main reason is that th…

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 13 pages, 6 figures, 11 tables

  5. arXiv:2310.03137  [pdf, other]

    cs.RO cs.HC

    Speech-Based Human-Exoskeleton Interaction for Lower Limb Motion Planning

    Authors: Eddie Guo, Christopher Perlette, Mojtaba Sharifi, Lukas Grasse, Matthew Tata, Vivian K. Mushahwar, Mahdi Tavakoli

    Abstract: This study presents a speech-based motion planning strategy (SBMP) developed for lower limb exoskeletons to facilitate safe and compliant human-robot interaction. A speech processing system, finite state machine, and central pattern generator are the building blocks of the proposed strategy for online planning of the exoskeleton's trajectory. According to experimental evaluations, this speech-proc…

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: 13 pages, 8 figures, 2 tables

  6. arXiv:2306.04879  [pdf, other]

    cs.LG

    Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization

    Authors: Clemens JS Schaefer, Navid Lambert-Shirzad, Xiaofan Zhang, Chiachen Chou, Tom Jablin, Jian Li, Elfie Guo, Caitlin Stanton, Siddharth Joshi, Yu Emma Wang

    Abstract: Efficiently serving neural network models with low latency is becoming more challenging due to increasing model complexity and parameter count. Model quantization offers a solution which simultaneously reduces memory footprint and compute requirements. However, aggressive quantization may lead to an unacceptable loss in model accuracy owing to differences in sensitivity to numerical imperfection a…

    Submitted 7 June, 2023; originally announced June 2023.

  7. arXiv:2305.00844  [pdf, other]

    cs.CL cs.AI

    Automated Paper Screening for Clinical Reviews Using Large Language Models

    Authors: Eddie Guo, Mehul Gupta, Jiawen Deng, Ye-Jean Park, Mike Paget, Christopher Naugler

    Abstract: Objective: To assess the performance of the OpenAI GPT API in accurately and efficiently identifying relevant titles and abstracts from real-world clinical review datasets and compare its performance against ground truth labelling by two independent human reviewers. Methods: We introduce a novel workflow using the OpenAI GPT API for screening titles and abstracts in clinical reviews. A Python sc…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 15 pages, 2 figures, 4 tables

  8. arXiv:2302.11795  [pdf, other]

    eess.IV cs.CV cs.LG

    Bridging Synthetic and Real Images: a Transferable and Multiple Consistency aided Fundus Image Enhancement Framework

    Authors: Erjian Guo, Huazhu Fu, Luping Zhou, Dong Xu

    Abstract: Deep learning based image enhancement models have largely improved the readability of fundus images in order to decrease the uncertainty of clinical observations and the risk of misdiagnosis. However, due to the difficulty of acquiring paired real fundus images at different qualities, most existing methods have to adopt synthetic image pairs as training data. The domain shift between the synthetic…

    Submitted 23 February, 2023; originally announced February 2023.

  9. arXiv:2302.01382  [pdf, other]

    cs.LG

    Mixed Precision Post Training Quantization of Neural Networks with Sensitivity Guided Search

    Authors: Clemens JS Schaefer, Elfie Guo, Caitlin Stanton, Xiaofan Zhang, Tom Jablin, Navid Lambert-Shirzad, Jian Li, Chiachen Chou, Siddharth Joshi, Yu Emma Wang

    Abstract: Serving large-scale machine learning (ML) models efficiently and with low latency has become challenging owing to increasing model size and complexity. Quantizing models can simultaneously reduce memory and compute requirements, facilitating their widespread access. However, for large models not all layers are equally amenable to the same numerical precision and aggressive quantization can lead to…

    Submitted 6 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

  10. arXiv:2212.13621  [pdf, other]

    stat.ML cs.AI cs.CV cs.LG

    Annealing Double-Head: An Architecture for Online Calibration of Deep Neural Networks

    Authors: Erdong Guo, David Draper, Maria De Iorio

    Abstract: Model calibration, which is concerned with how frequently the model predicts correctly, not only plays a vital part in statistical model design, but also has substantial practical applications, such as optimal decision-making in the real world. However, it has been discovered that modern deep neural networks are generally poorly calibrated due to the overestimation (or underestimation) of predicti…

    Submitted 15 January, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

    Comments: Revised Preprint. 19 pages, 10 figures, 4 tables. Typos fixed, and references added

  11. arXiv:2111.14046  [pdf, other]

    stat.ML cs.LG cs.NE quant-ph

    Neural Tangent Kernel of Matrix Product States: Convergence and Applications

    Authors: Erdong Guo, David Draper

    Abstract: In this work, we study the Neural Tangent Kernel (NTK) of Matrix Product States (MPS) and the convergence of its NTK in the infinite bond dimensional limit. We prove that the NTK of MPS asymptotically converges to a constant matrix during the gradient descent (training) process (and also the initialization phase) as the bond dimensions of MPS go to infinity by the observation that the variation of…

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: 19 pages, 1 figure

  12. arXiv:2105.09938  [pdf, other]

    cs.SE cs.CL cs.LG

    Measuring Coding Challenge Competence With APPS

    Authors: Dan Hendrycks, Steven Basart, Saurav Kadavath, Mantas Mazeika, Akul Arora, Ethan Guo, Collin Burns, Samir Puranik, Horace He, Dawn Song, Jacob Steinhardt

    Abstract: While programming is one of the most broadly applicable skills in modern society, modern machine learning models still cannot code solutions to basic problems. Despite its importance, there has been surprisingly little work on evaluating code generation, and it can be difficult to accurately assess code generation performance rigorously. To meet this challenge, we introduce APPS, a benchmark for c…

    Submitted 8 November, 2021; v1 submitted 20 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021. Code and the APPS dataset is available at https://github.com/hendrycks/apps

  13. arXiv:2103.08277  [pdf, ps, other]

    stat.ML cs.LG cs.NE quant-ph

    Representation Theorem for Matrix Product States

    Authors: Erdong Guo, David Draper

    Abstract: In this work, we investigate the universal representation capacity of the Matrix Product States (MPS) from the perspective of boolean functions and continuous functions. We show that MPS can accurately realize arbitrary boolean functions by providing a construction method of the corresponding MPS structure for an arbitrarily given boolean gate. Moreover, we prove that the function space of MPS wit…

    Submitted 15 March, 2021; originally announced March 2021.

    Comments: 19 pages

  14. arXiv:2101.02333  [pdf, ps, other]

    stat.ML cs.LG cs.NE

    Infinitely Wide Tensor Networks as Gaussian Process

    Authors: Erdong Guo, David Draper

    Abstract: Gaussian Process is a non-parametric prior which can be understood as a distribution on the function space intuitively. It is known that by introducing appropriate prior to the weights of the neural networks, Gaussian Process can be obtained by taking the infinite-width limit of the Bayesian neural networks from a Bayesian perspective. In this paper, we explore the infinitely wide Tensor Networks…

    Submitted 6 January, 2021; originally announced January 2021.

    Comments: 20 pages, 4 figures

  15. arXiv:2101.00245  [pdf, other]

    stat.ML cs.CV cs.LG cs.NE

    The Bayesian Method of Tensor Networks

    Authors: Erdong Guo, David Draper

    Abstract: Bayesian learning is a powerful learning framework which combines the external information of the data (background information) with the internal information (training data) in a logically consistent way in inference and prediction. By Bayes rule, the external information (prior distribution) and the internal information (training data likelihood) are combined coherently, and the posterior distrib…

    Submitted 1 January, 2021; originally announced January 2021.

    Comments: 13 pages, 4 figures

  16. arXiv:2006.07113  [pdf, other]

    cs.HC cs.AI cs.LG stat.ML

    Large-scale Hybrid Approach for Predicting User Satisfaction with Conversational Agents

    Authors: Dookun Park, Hao Yuan, Dongmin Kim, Yinglei Zhang, Matsoukas Spyros, Young-Bum Kim, Ruhi Sarikaya, Edward Guo, Yuan Ling, Kevin Quinn, Pham Hung, Benjamin Yao, Sungjin Lee

    Abstract: Measuring user satisfaction level is a challenging task, and a critical component in developing large-scale conversational agent systems serving the needs of real users. A widely used approach to tackle this is to collect human annotation data and use them for evaluation or modeling. Human annotation based approaches are easier to control, but hard to scale. A novel alternative approach is to col…

    Submitted 29 May, 2020; originally announced June 2020.

  17. arXiv:1910.11272  [pdf]

    eess.IV cs.CV physics.optics

    Learning-based real-time method to looking through scattering medium beyond the memory effect

    Authors: Enlai Guo, Shuo Zhu, Yan Sun, Lianfa Bai, Jing Han

    Abstract: Strong scattering medium brings great difficulties to optical imaging, which is also a problem in medical imaging and many other fields. Optical memory effect makes it possible to image through strong random scattering medium. However, this method also has the limitation of limited angle field-of-view (FOV), which prevents it from being applied in practice. In this paper, a kind of practical convo…

    Submitted 4 November, 2019; v1 submitted 19 October, 2019; originally announced October 2019.

    Comments: 15 pages with 9 figures

  18. arXiv:1812.01640  [pdf, other]

    cs.LG

    Overcoming Catastrophic Forgetting by Soft Parameter Pruning

    Authors: Jian Peng, Jiang Hao, Zhuo Li, Enqiang Guo, Xiaohong Wan, Deng Min, Qing Zhu, Haifeng Li

    Abstract: Catastrophic forgetting is a challenge issue in continual learning when a deep neural network forgets the knowledge acquired from the former task after learning on subsequent tasks. However, existing methods try to find the joint distribution of parameters shared with all tasks. This idea can be questionable because this joint distribution may not present when the number of tasks increase. On the…

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: 10 pages, 12 figures

  19. arXiv:1810.09111  [pdf, other]

    cs.CV

    Learning to Measure Change: Fully Convolutional Siamese Metric Networks for Scene Change Detection

    Authors: Enqiang Guo, Xinsha Fu, Jiawei Zhu, Min Deng, Yu Liu, Qing Zhu, Haifeng Li

    Abstract: A critical challenge problem of scene change detection is that noisy changes generated by varying illumination, shadows and camera viewpoint make variances of a scene difficult to define and measure since the noisy changes and semantic ones are entangled. Following the intuitive idea of detecting changes by directly comparing dissimilarities between a pair of features, we propose a novel fully Con…

    Submitted 11 November, 2018; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: 10 pages, 12 figures

  20. arXiv:1607.05447  [pdf, other]

    cs.CV math.OC

    On Differentiating Parameterized Argmin and Argmax Problems with Application to Bi-level Optimization

    Authors: Stephen Gould, Basura Fernando, Anoop Cherian, Peter Anderson, Rodrigo Santa Cruz, Edison Guo

    Abstract: Some recent works in machine learning and computer vision involve the solution of a bi-level optimization problem. Here the solution of a parameterized lower-level problem binds variables that appear in the objective of an upper-level problem. The lower-level problem typically appears as an argmin or argmax optimization problem. Many techniques have been proposed to solve bi-level optimization pro…

    Submitted 20 July, 2016; v1 submitted 19 July, 2016; originally announced July 2016.

    Comments: 16 pages, 6 figures