Skip to main content

Showing 1–34 of 34 results for author: Yu, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2406.15754  [pdf, other

    cs.CV cs.CL cs.LG cs.SD eess.AS

    Multimodal Segmentation for Vocal Tract Modeling

    Authors: Rishi Jain, Bohan Yu, Peter Wu, Tejas Prabhune, Gopala Anumanchipalli

    Abstract: Accurate modeling of the vocal tract is necessary to construct articulatory representations for interpretable speech processing and linguistics. However, vocal tract modeling is challenging because many internal articulators are occluded from external motion capture technologies. Real-time magnetic resonance imaging (RT-MRI) allows measuring precise movements of internal articulators during speech… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  2. arXiv:2405.09548  [pdf, other

    eess.SP

    Efficient Bilevel Source Mask Optimization

    Authors: Guo** Chen, Hongquan He, Peng Xu, Hao Geng, Bei Yu

    Abstract: Resolution Enhancement Techniques (RETs) are critical to meet the demands of advanced technology nodes. Among RETs, Source Mask Optimization (SMO) is pivotal, concurrently optimizing both the source and the mask to expand the process window. Traditional SMO methods, however, are limited by sequential and alternating optimizations, leading to extended runtimes without performance guarantees. This p… ▽ More

    Submitted 7 March, 2024; originally announced May 2024.

    Comments: Accepted by Design Automation Conference (DAC) 2024

  3. arXiv:2403.20091  [pdf, other

    cs.IT eess.SP

    A Signature Based Approach Towards Global Channel Charting with Ultra Low Complexity

    Authors: Longhai Zhao, Yunchuan Yang, Qi Xiong, He Wang, Bin Yu, Feifei Sun, Chengjun Sun

    Abstract: Channel charting, an unsupervised learning method that learns a low-dimensional representation from channel information to preserve geometrical property of physical space of user equipments (UEs), has drawn many attentions from both academic and industrial communities, because it can facilitate many downstream tasks, such as indoor localization, UE handover, beam management, and so on. However, ma… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: accepted by IEEE ICC 2024 Workshops

  4. arXiv:2402.05755  [pdf, other

    cs.CL cs.SD eess.AS

    SpiRit-LM: Interleaved Spoken and Written Language Model

    Authors: Tu Anh Nguyen, Benjamin Muller, Bokai Yu, Marta R. Costa-jussa, Maha Elbayad, Sravya Popuri, Paul-Ambroise Duquenne, Robin Algayres, Ruslan Mavlyutov, Itai Gat, Gabriel Synnaeve, Juan Pino, Benoit Sagot, Emmanuel Dupoux

    Abstract: We introduce SPIRIT-LM, a foundation multimodal language model that freely mixes text and speech. Our model is based on a pretrained text language model that we extend to the speech modality by continuously training it on text and speech units. Speech and text sequences are concatenated as a single set of tokens, and trained with a word-level interleaving method using a small automatically-curated… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  5. arXiv:2402.01808  [pdf, other

    cs.SD eess.AS

    KS-Net: Multi-band joint speech restoration and enhancement network for 2024 ICASSP SSI Challenge

    Authors: Guochen Yu, Runqiang Han, Chenglin Xu, Haoran Zhao, Nan Li, Chen Zhang, Xiguang Zheng, Chao Zhou, Qi Huang, Bing Yu

    Abstract: This paper presents the speech restoration and enhancement system created by the 1024K team for the ICASSP 2024 Speech Signal Improvement (SSI) Challenge. Our system consists of a generative adversarial network (GAN) in complex-domain for speech restoration and a fine-grained multi-band fusion module for speech enhancement. In the blind test set of SSI, the proposed system achieves an overall mean… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to ICASSP 2024; Rank 1st in ICASSP 2024 Speech Signal Improvement (SSI) Challenge

  6. arXiv:2312.13722  [pdf, other

    cs.SD eess.AS

    BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

    Authors: Guochen Yu, Xiguang Zheng, Nan Li, Runqiang Han, Chengshi Zheng, Chen Zhang, Chao Zhou, Qi Huang, Bing Yu

    Abstract: Speech bandwidth extension (BWE) has demonstrated promising performance in enhancing the perceptual speech quality in real communication systems. Most existing BWE researches primarily focus on fixed upsampling ratios, disregarding the fact that the effective bandwidth of captured audio may fluctuate frequently due to various capturing devices and transmission conditions. In this paper, we propose… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024

  7. arXiv:2312.05187  [pdf, other

    cs.CL cs.SD eess.AS

    Seamless: Multilingual Expressive and Streaming Speech Translation

    Authors: Seamless Communication, Loïc Barrault, Yu-An Chung, Mariano Coria Meglioli, David Dale, Ning Dong, Mark Duppenthaler, Paul-Ambroise Duquenne, Brian Ellis, Hady Elsahar, Justin Haaheim, John Hoffman, Min-Jae Hwang, Hirofumi Inaguma, Christopher Klaiber, Ilia Kulikov, Pengwei Li, Daniel Licht, Jean Maillard, Ruslan Mavlyutov, Alice Rakotoarison, Kaushik Ram Sadagopan, Abinesh Ramakrishnan, Tuan Tran, Guillaume Wenzek , et al. (40 additional authors not shown)

    Abstract: Large-scale automatic speech translation systems today lack key features that help machine-mediated communication feel seamless when compared to human-to-human dialogue. In this work, we introduce a family of models that enable end-to-end expressive and multilingual translations in a streaming fashion. First, we contribute an improved version of the massively multilingual and multimodal SeamlessM4… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  8. arXiv:2311.06144  [pdf, other

    cs.RO eess.SY

    Multi-Agent Reinforcement Learning for the Low-Level Control of a Quadrotor UAV

    Authors: Beomyeol Yu, Taeyoung Lee

    Abstract: By leveraging the underlying structures of the quadrotor dynamics, we propose multi-agent reinforcement learning frameworks to innovate the low-level control of a quadrotor, where independent agents operate cooperatively to achieve a common goal. While single-agent reinforcement learning has been successfully applied in quadrotor controls, training a large monolithic network is often data-intensiv… ▽ More

    Submitted 26 February, 2024; v1 submitted 10 November, 2023; originally announced November 2023.

    Comments: 8 pages, 5 figures, 3 tables

  9. arXiv:2310.16287  [pdf, other

    cs.SD cs.GR eess.AS

    Towards Streaming Speech-to-Avatar Synthesis

    Authors: Tejas S. Prabhune, Peter Wu, Bohan Yu, Gopala K. Anumanchipalli

    Abstract: Streaming speech-to-avatar synthesis creates real-time animations for a virtual character from audio data. Accurate avatar representations of speech are important for the visualization of sound in linguistics, phonetics, and phonology, visual feedback to assist second language acquisition, and virtual embodiment for paralyzed patients. Previous works have highlighted the capability of deep articul… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Submitted to ICASSP 2024

  10. arXiv:2310.14355  [pdf

    cs.LG eess.IV

    A global product of fine-scale urban building height based on spaceborne lidar

    Authors: Xiao Ma, Guang Zheng, Chi Xu, L. Monika Moskal, Peng Gong, Qinghua Guo, Huabing Huang, Xuecao Li, Yong Pang, Cheng Wang, Huan Xie, Bailang Yu, Bo Zhao, Yuyu Zhou

    Abstract: Characterizing urban environments with broad coverages and high precision is more important than ever for achieving the UN's Sustainable Development Goals (SDGs) as half of the world's populations are living in cities. Urban building height as a fundamental 3D urban structural feature has far-reaching applications. However, so far, producing readily available datasets of recent urban building heig… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  11. arXiv:2310.02497  [pdf, other

    cs.SD cs.LG eess.AS

    Towards an Interpretable Representation of Speaker Identity via Perceptual Voice Qualities

    Authors: Robin Netzorg, Bohan Yu, Andrea Guzman, Peter Wu, Luna McNulty, Gopala Anumanchipalli

    Abstract: Unlike other data modalities such as text and vision, speech does not lend itself to easy interpretation. While lay people can understand how to describe an image or sentence via perception, non-expert descriptions of speech often end at high-level demographic information, such as gender or age. In this paper, we propose a possible interpretable representation of speaker identity based on perceptu… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  12. arXiv:2307.01229  [pdf, other

    cs.SD cs.AI cs.LG cs.MM eess.AS

    EmoGen: Eliminating Subjective Bias in Emotional Music Generation

    Authors: Chenfei Kang, Peiling Lu, Botao Yu, Xu Tan, Wei Ye, Shikun Zhang, Jiang Bian

    Abstract: Music is used to convey emotions, and thus generating emotional music is important in automatic music generation. Previous work on emotional music generation directly uses annotated emotion labels as control signals, which suffers from subjective bias: different people may annotate different emotions on the same music, and one person may feel different emotions under different situations. Therefor… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 12 pages, 7 pages

  13. arXiv:2306.00110  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    MuseCoco: Generating Symbolic Music from Text

    Authors: Peiling Lu, Xin Xu, Chenfei Kang, Botao Yu, Chengyi Xing, Xu Tan, Jiang Bian

    Abstract: Generating music from text descriptions is a user-friendly mode since the text is a relatively easy interface for user engagement. While some approaches utilize texts to control music audio generation, editing musical elements in generated audio is challenging for users. In contrast, symbolic music offers ease of editing, making it more accessible for users to manipulate specific musical elements.… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

  14. arXiv:2304.09322  [pdf, other

    eess.IV cs.CV cs.LG

    Multi-Modality Multi-Scale Cardiovascular Disease Subtypes Classification Using Raman Image and Medical History

    Authors: Bo Yu, Hechang Chen, Chengyou Jia, Hongren Zhou, Lele Cong, Xiankai Li, Jianhui Zhuang, Xianling Cong

    Abstract: Raman spectroscopy (RS) has been widely used for disease diagnosis, e.g., cardiovascular disease (CVD), owing to its efficiency and component-specific testing capabilities. A series of popular deep learning methods have recently been introduced to learn nuance features from RS for binary classifications and achieved outstanding performance than conventional machine learning methods. However, these… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: [J]. Expert Systems with Applications, 2023: 119965

  15. arXiv:2304.04773  [pdf, other

    eess.IV cs.CV

    HDR Video Reconstruction with a Large Dynamic Dataset in Raw and sRGB Domains

    Authors: Huan**g Yue, Yubo Peng, Biting Yu, Xuanwu Yin, Zhenyu Zhou, **gyu Yang

    Abstract: High dynamic range (HDR) video reconstruction is attracting more and more attention due to the superior visual quality compared with those of low dynamic range (LDR) videos. The availability of LDR-HDR training pairs is essential for the HDR reconstruction quality. However, there are still no real LDR-HDR pairs for dynamic scenes due to the difficulty in capturing LDR-HDR frames simultaneously. In… ▽ More

    Submitted 12 April, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  16. A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU

    Authors: Wenqian Zhao, Qi Sun, Yang Bai, Wenbo Li, Haisheng Zheng, Bei Yu, Martin D. F. Wong

    Abstract: Recent years have witnessed impressive progress in super-resolution (SR) processing. However, its real-time inference requirement sets a challenge not only for the model design but also for the on-chip implementation. In this paper, we implement a full-stack SR acceleration framework on embedded GPU devices. The special dictionary learning algorithm used in SR models was analyzed in detail and acc… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

  17. arXiv:2303.08435  [pdf, other

    cs.CV cs.LG eess.IV

    Physics-Informed Optical Kernel Regression Using Complex-valued Neural Fields

    Authors: Guo** Chen, Zehua Pei, Haoyu Yang, Yuzhe Ma, Bei Yu, Martin D. F. Wong

    Abstract: Lithography is fundamental to integrated circuit fabrication, necessitating large computation overhead. The advancement of machine learning (ML)-based lithography models alleviates the trade-offs between manufacturing process expense and capability. However, all previous methods regard the lithography system as an image-to-image black box map**, utilizing network parameters to learn by rote mapp… ▽ More

    Submitted 9 April, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: Accepted by DAC23

  18. arXiv:2303.03670  [pdf, other

    eess.IV cs.RO

    Weakly Supervised Caveline Detection For AUV Navigation Inside Underwater Caves

    Authors: Boxiao Yu, Reagan Tibbetts, Titon Barua, Ailani Morales, Ioannis Rekleitis, Md Jahidul Islam

    Abstract: Underwater caves are challenging environments that are crucial for water resource management, and for our understanding of hydro-geology and history. Map** underwater caves is a time-consuming, labor-intensive, and hazardous operation. For autonomous cave map** by underwater robots, the major challenge lies in vision-based estimation in the complete absence of ambient light, which results in c… ▽ More

    Submitted 28 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  19. arXiv:2301.10602  [pdf, other

    cs.RO eess.SY

    DreamWaQ: Learning Robust Quadrupedal Locomotion With Implicit Terrain Imagination via Deep Reinforcement Learning

    Authors: I Made Aswin Nahrendra, Byeongho Yu, Hyun Myung

    Abstract: Quadrupedal robots resemble the physical ability of legged animals to walk through unstructured terrains. However, designing a controller for quadrupedal robots poses a significant challenge due to their functional complexity and requires adaptation to various terrains. Recently, deep reinforcement learning, inspired by how legged animals learn to walk from their experiences, has been utilized to… ▽ More

    Submitted 2 March, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

    Comments: Accepted for ICRA 2023

  20. arXiv:2210.10349  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM eess.AS

    Museformer: Transformer with Fine- and Coarse-Grained Attention for Music Generation

    Authors: Botao Yu, Peiling Lu, Rui Wang, Wei Hu, Xu Tan, Wei Ye, Shikun Zhang, Tao Qin, Tie-Yan Liu

    Abstract: Symbolic music generation aims to generate music scores automatically. A recent trend is to use Transformer or its variants in music generation, which is, however, suboptimal, because the full attention cannot efficiently model the typically long music sequences (e.g., over 10,000 tokens), and the existing models have shortcomings in generating musical repetition structures. In this paper, we prop… ▽ More

    Submitted 30 October, 2022; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted by the Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022)

  21. arXiv:2209.12358  [pdf, other

    cs.CV eess.IV

    UDepth: Fast Monocular Depth Estimation for Visually-guided Underwater Robots

    Authors: Boxiao Yu, Jiayi Wu, Md Jahidul Islam

    Abstract: In this paper, we present a fast monocular depth estimation method for enabling 3D perception capabilities of low-cost underwater robots. We formulate a novel end-to-end deep visual learning pipeline named UDepth, which incorporates domain knowledge of image formation characteristics of natural underwater scenes. First, we adapt a new input space from raw RGB image space by exploiting underwater l… ▽ More

    Submitted 2 February, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures

  22. arXiv:2209.00805  [pdf, other

    eess.AS

    Multi-scale temporal-frequency attention for music source separation

    Authors: Lianwu Chen, Xiguang Zheng, Chen Zhang, Liang Guo, Bing Yu

    Abstract: In recent years, deep neural networks (DNNs) based approaches have achieved the start-of-the-art performance for music source separation (MSS). Although previous methods have addressed the large receptive field modeling using various methods, the temporal and frequency correlations of the music spectrogram with repeated patterns have not been explicitly explored for the MSS task. In this paper, a… ▽ More

    Submitted 1 September, 2022; originally announced September 2022.

  23. arXiv:2208.14345  [pdf, other

    cs.SD cs.CL cs.LG cs.MM eess.AS

    MeloForm: Generating Melody with Musical Form based on Expert Systems and Neural Networks

    Authors: Peiling Lu, Xu Tan, Botao Yu, Tao Qin, Sheng Zhao, Tie-Yan Liu

    Abstract: Human usually composes music by organizing elements according to the musical form to express music ideas. However, for neural network-based music generation, it is difficult to do so due to the lack of labelled data on musical form. In this paper, we develop MeloForm, a system that generates melody with musical form using expert systems and neural networks. Specifically, 1) we design an expert sys… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

  24. arXiv:2208.12599  [pdf

    physics.optics eess.IV

    SOFFLFM: Super-resolution optical fluctuation Fourier light-field microscopy

    Authors: Haixin Huang, Haoyuan Qiu, Hanzhe Wu, Yihong Ji, Heng Li, Bin Yu, Danni Chen, Junle Qu

    Abstract: Fourier light-field microscopy (FLFM) uses a micro-lens array (MLA) to segment the Fourier Plane of the microscopic objective lens to generate multiple two-dimensional perspective views, thereby reconstructing the three-dimensional(3D) structure of the sample using 3D deconvolution calculation without scanning. However, the resolution of FLFM is still limited by diffraction, and furthermore, depen… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  25. arXiv:2203.01621  [pdf, other

    cs.IT eess.SP

    Endogenous Security of Computation Offloading in Blockchain-Empowered Internet of Things

    Authors: Yiliang Liu, Zhou Su, Bobo Yu

    Abstract: This paper investigates an endogenous security architecture for computation offloading in the Internet of Things (IoT), where the blockchain technology enables the traceability of malicious behaviors, and the task data uploading link from sensors to small base station (SBS) is protected by intelligent reflecting surface (IRS)-assisted physical layer security (PLS). After receiving task data, the S… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

  26. arXiv:2201.10809  [pdf, other

    eess.AS cs.SD

    A two-step backward compatible fullband speech enhancement system

    Authors: Xu Zhang, Lianwu Chen, Xiguang Zheng, Xinlei Ren, Chen Zhang, Liang Guo, Bing Yu

    Abstract: Speech enhancement methods based on deep learning have surpassed traditional methods. While many of these new approaches are operating on the wideband (16kHz) sample rate, a new fullband (48kHz) speech enhancement system is proposed in this paper. Compared to the existing fullband systems that utilizes perceptually motivated features to train the fullband speech enhancement using a single network… ▽ More

    Submitted 27 January, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  27. arXiv:2110.08634  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Towards Robust Waveform-Based Acoustic Models

    Authors: Dino Oglic, Zoran Cvetkovic, Peter Sollich, Steve Renals, Bin Yu

    Abstract: We study the problem of learning robust acoustic models in adverse environments, characterized by a significant mismatch between training and test conditions. This problem is of paramount importance for the deployment of speech recognition systems that need to perform well in unseen environments. First, we characterize data augmentation theoretically as an instance of vicinal risk minimization, wh… ▽ More

    Submitted 29 June, 2022; v1 submitted 16 October, 2021; originally announced October 2021.

    Journal ref: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022

  28. arXiv:2106.15097  [pdf, other

    eess.IV cs.CV

    IREM: High-Resolution Magnetic Resonance (MR) Image Reconstruction via Implicit Neural Representation

    Authors: Qing Wu, Yuwei Li, Lan Xu, Ruiming Feng, Hongjiang Wei, Qing Yang, Boliang Yu, Xiaozhao Liu, **gyi Yu, Yuyao Zhang

    Abstract: For collecting high-quality high-resolution (HR) MR image, we propose a novel image reconstruction network named IREM, which is trained on multiple low-resolution (LR) MR images and achieve an arbitrary up-sampling rate for HR image reconstruction. In this work, we suppose the desired HR image as an implicit continuous function of the 3D image spatial coordinate and the thick-slice LR images as se… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: 8 pages, 6 figures, conference

  29. arXiv:2102.03357  [pdf, other

    eess.SP cs.AI cs.LG eess.SY

    Machine Learning for Electronic Design Automation: A Survey

    Authors: Guyue Huang, **gbo Hu, Yifan He, Jialong Liu, Mingyuan Ma, Zhaoyang Shen, Juejian Wu, Yuanfan Xu, Hengrui Zhang, Kai Zhong, Xuefei Ning, Yuzhe Ma, Haoyu Yang, Bei Yu, Huazhong Yang, Yu Wang

    Abstract: With the down-scaling of CMOS technology, the design complexity of very large-scale integrated (VLSI) is increasing. Although the application of machine learning (ML) techniques in electronic design automation (EDA) can trace its history back to the 90s, the recent breakthrough of ML and the increasing complexity of EDA tasks have aroused more interests in incorporating ML to solve EDA tasks. In t… ▽ More

    Submitted 8 March, 2021; v1 submitted 10 January, 2021; originally announced February 2021.

    Comments: Accepted by TODAES. The first 10 authors are ordered alphabetically

  30. arXiv:2004.13979  [pdf

    cs.CV cs.LG eess.IV

    Skeleton Focused Human Activity Recognition in RGB Video

    Authors: Bruce X. B. Yu, Yan Liu, Keith C. C. Chan

    Abstract: The data-driven approach that learns an optimal representation of vision features like skeleton frames or RGB videos is currently a dominant paradigm for activity recognition. While great improvements have been achieved from existing single modal approaches with increasingly larger datasets, the fusion of various data modalities at the feature level has seldom been attempted. In this paper, we pro… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: 8 pages

  31. arXiv:2004.13977  [pdf

    cs.CV cs.LG eess.IV

    Effective Human Activity Recognition Based on Small Datasets

    Authors: Bruce X. B. Yu, Yan Liu, Keith C. C. Chan

    Abstract: Most recent work on vision-based human activity recognition (HAR) focuses on designing complex deep learning models for the task. In so doing, there is a requirement for large datasets to be collected. As acquiring and processing large training datasets are usually very expensive, the problem of how dataset size can be reduced without affecting recognition accuracy has to be tackled. To do so, we… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: 7 pages

  32. arXiv:1905.02373  [pdf, other

    eess.IV cs.RO

    PI-BA Bundle Adjustment Acceleration on Embedded FPGAs with Co-observation Optimization

    Authors: Shuzhen Qin, Qiang Liu, Bo Yu, Shaoshan Liu

    Abstract: Bundle adjustment (BA) is a fundamental optimization technique used in many crucial applications, including 3D scene reconstruction, robotic localization, camera calibration, autonomous driving, space exploration, street view map generation etc. Essentially, BA is a joint non-linear optimization problem, and one which can consume a significant amount of time and power, especially for large optimiz… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: in Proceedings of IEEE FCCM 2019

  33. Lesion Segmentation in Ultrasound Using Semi-pixel-wise Cycle Generative Adversarial Nets

    Authors: Jie Xing, Zheren Li, Biyuan Wang, Yuji Qi, Bingbin Yu, Farhad G. Zanjani, Aiwen Zheng, Remco Duits, Tao Tan

    Abstract: Breast cancer is the most common invasive cancer with the highest cancer occurrence in females. Handheld ultrasound is one of the most efficient ways to identify and diagnose the breast cancer. The area and the shape information of a lesion is very helpful for clinicians to make diagnostic decisions. In this study we propose a new deep-learning scheme, semi-pixel-wise cycle generative adversarial… ▽ More

    Submitted 17 October, 2020; v1 submitted 6 May, 2019; originally announced May 2019.

    Journal ref: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 04 March 2020, pp.1-1

  34. arXiv:1807.06446  [pdf, other

    cs.LG eess.IV stat.ML

    Bridging the Gap Between Layout Pattern Sampling and Hotspot Detection via Batch Active Learning

    Authors: Haoyu Yang, Shuhe Li, Cyrus Tabery, Bingqing Lin, Bei Yu

    Abstract: Layout hotpot detection is one of the main steps in modern VLSI design. A typical hotspot detection flow is extremely time consuming due to the computationally expensive mask optimization and lithographic simulation. Recent researches try to facilitate the procedure with a reduced flow including feature extraction, training set generation and hotspot detection, where feature extraction methods and… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 8 pages, 7 figures