Skip to main content

Showing 1–15 of 15 results for author: Heng, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14644  [pdf, other

    cs.CL

    Unveiling the Spectrum of Data Contamination in Language Models: A Survey from Detection to Remediation

    Authors: Chunyuan Deng, Yilun Zhao, Yuzhao Heng, Yitong Li, Jiannan Cao, Xiangru Tang, Arman Cohan

    Abstract: Data contamination has garnered increased attention in the era of large language models (LLMs) due to the reliance on extensive internet-derived training corpora. The issue of training corpus overlap with evaluation benchmarks--referred to as contamination--has been the focus of significant recent research. This body of work aims to identify contamination, understand its impacts, and explore mitig… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Camera-Ready Version

  2. arXiv:2403.16186  [pdf, other

    cs.IT eess.SP

    Site-Specific Beam Alignment in 6G via Deep Learning

    Authors: Yuqiang Heng, Yu Zhang, Ahmed Alkhateeb, Jeffrey G. Andrews

    Abstract: Beam alignment (BA) in modern millimeter wave standards such as 5G NR and WiGig (802.11ay) is based on exhaustive and/or hierarchical beam searches over pre-defined codebooks of wide and narrow beams. This approach is slow and bandwidth/power-intensive, and is a considerable hindrance to the wide deployment of millimeter wave bands. A new approach is needed as we move towards 6G. BA is a promising… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in the IEEE Communications Magazine

  3. arXiv:2403.11103  [pdf, other

    cs.CL cs.LG

    ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models

    Authors: Yuzhao Heng, Chunyuan Deng, Yitong Li, Yue Yu, Yinghao Li, Rongzhi Zhang, Chao Zhang

    Abstract: Although Large Language Models (LLMs) exhibit remarkable adaptability across domains, these models often fall short in structured knowledge extraction tasks such as named entity recognition (NER). This paper explores an innovative, cost-efficient strategy to harness LLMs with modest NER capabilities for producing superior NER datasets. Our approach diverges from the basic class-conditional prompts… ▽ More

    Submitted 9 June, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted to ACL 2024 Findings

  4. arXiv:2311.10042  [pdf, other

    cs.CV

    Depth Insight -- Contribution of Different Features to Indoor Single-image Depth Estimation

    Authors: Yihong Wu, Yuwen Heng, Mahesan Niranjan, Hansung Kim

    Abstract: Depth estimation from a single image is a challenging problem in computer vision because binocular disparity or motion information is absent. Whereas impressive performances have been reported in this area recently using end-to-end trained deep neural architectures, as to what cues in the images that are being exploited by these black box systems is hard to know. To this end, in this work, we quan… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

  5. arXiv:2310.16655  [pdf, other

    cs.LG

    Towards Control-Centric Representations in Reinforcement Learning from Images

    Authors: Chen Liu, Hongyu Zang, Xin Li, Yong Heng, Yifei Wang, Zhen Fang, Yisen Wang, Mingzhong Wang

    Abstract: Image-based Reinforcement Learning is a practical yet challenging task. A major hurdle lies in extracting control-centric representations while disregarding irrelevant information. While approaches that follow the bisimulation principle exhibit the potential in learning state representations to address this issue, they still grapple with the limited expressive capacity of latent dynamics and the i… ▽ More

    Submitted 27 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  6. arXiv:2310.15815  [pdf, other

    cs.LG

    Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations

    Authors: Ye Yuan, Xin Li, Yong Heng, Leiji Zhang, MingZhong Wang

    Abstract: Imitation Learning (IL) aims to discover a policy by minimizing the discrepancy between the agent's behavior and expert demonstrations. However, IL is susceptible to limitations imposed by noisy demonstrations from non-expert behaviors, presenting a significant challenge due to the lack of supplementary information to assess their expertise. In this paper, we introduce Self-Motivated Imitation LEa… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  7. arXiv:2309.13596  [pdf, other

    cs.CV

    Advancements in 3D Lane Detection Using LiDAR Point Clouds: From Data Collection to Model Development

    Authors: Runkai Zhao, Yuwen Heng, Heng Wang, Yuanda Gao, Shilei Liu, Changhao Yao, Jiawen Chen, Weidong Cai

    Abstract: Advanced Driver-Assistance Systems (ADAS) have successfully integrated learning-based techniques into vehicle perception and decision-making. However, their application in 3D lane detection for effective driving environment perception is hindered by the lack of comprehensive LiDAR datasets. The sparse nature of LiDAR point cloud data prevents an efficient manual annotation process. To solve this p… ▽ More

    Submitted 15 March, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted by ICRA2024

  8. arXiv:2308.01857  [pdf, other

    cs.AR

    iEDA: An Open-Source Intelligent Physical Implementation Toolkit and Library

    Authors: Xingquan Li, Simin Tao, Zengrong Huang, Shijian Chen, Zhisheng Zeng, Liwei Ni, Zhipeng Huang, Chunan Zhuang, Hongxi Wu, Weiguo Li1, Xueyan Zhao, He Liu, Shuaiying Long, Wei He, Bojun Liu, Sifeng Gan, Zihao Yu, Tong Liu, Yuchi Miao, Zhiyuan Yan, Hao Wang, Jie Zhao, Yifan Li, Ruizhi Liu, Xiaoze Lin , et al. (31 additional authors not shown)

    Abstract: Open-source EDA shows promising potential in unleashing EDA innovation and lowering the cost of chip design. This paper presents an open-source EDA project, iEDA, aiming for building a basic infrastructure for EDA technology evolution and closing the industrial-academic gap in the EDA area. iEDA now covers the whole flow of physical design (including Floorplan, Placement, CTS, Routing, Timing Opti… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  9. arXiv:2307.11466  [pdf, other

    cs.CV eess.IV

    MatSpectNet: Material Segmentation Network with Domain-Aware and Physically-Constrained Hyperspectral Reconstruction

    Authors: Yuwen Heng, Yihong Wu, Jiawen Chen, Srinandan Dasmahapatra, Hansung Kim

    Abstract: Achieving accurate material segmentation for 3-channel RGB images is challenging due to the considerable variation in a material's appearance. Hyperspectral images, which are sets of spectral measurements sampled at multiple wavelengths, theoretically offer distinct information for material identification, as variations in intensity of electromagnetic radiation reflected by a surface depend on the… ▽ More

    Submitted 17 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: 7 pages main paper

  10. arXiv:2305.16521  [pdf, other

    cs.CL cs.LG

    Label Agnostic Pre-training for Zero-shot Text Classification

    Authors: Christopher Clarke, Yuzhao Heng, Yi** Kang, Krisztian Flautner, Lingjia Tang, Jason Mars

    Abstract: Conventional approaches to text classification typically assume the existence of a fixed set of predefined labels to which a given text can be classified. However, in real-world applications, there exists an infinite label space for describing a given text. In addition, depending on the aspect (sentiment, topic, etc.) and domain of the text (finance, legal, etc.), the interpretation of the label c… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  11. arXiv:2305.03919  [pdf, other

    cs.CV

    DBAT: Dynamic Backward Attention Transformer for Material Segmentation with Cross-Resolution Patches

    Authors: Yuwen Heng, Srinandan Dasmahapatra, Hansung Kim

    Abstract: The objective of dense material segmentation is to identify the material categories for every image pixel. Recent studies adopt image patches to extract material features. Although the trained networks can improve the segmentation performance, their methods choose a fixed patch resolution which fails to take into account the variation in pixel area covered by each material. In this paper, we propo… ▽ More

    Submitted 28 February, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

    Comments: 13 pages

  12. arXiv:2209.08198  [pdf, other

    cs.IT eess.SP

    Grid-Free MIMO Beam Alignment through Site-Specific Deep Learning

    Authors: Yuqiang Heng, Jeffrey G. Andrews

    Abstract: Beam alignment is a critical bottleneck in millimeter wave (mmWave) communication. An ideal beam alignment technique should achieve high beamforming (BF) gain with low latency, scale well to systems with higher carrier frequencies, larger antenna arrays and multiple user equipments (UEs), and not require hard-to-obtain context information (CI). These qualities are collectively lacking in existing… ▽ More

    Submitted 9 July, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: to appear in IEEE Transactions on Wireless Communications, 10.1109/TWC.2023.3283475

  13. Deep Learning-Based Grading of Ductal Carcinoma In Situ in Breast Histopathology Images

    Authors: Suzanne C. Wetstein, Nikolas Stathonikos, Josien P. W. Pluim, Yu**g J. Heng, Natalie D. ter Hoeve, Celien P. H. Vreuls, Paul J. van Diest, Mitko Veta

    Abstract: Ductal carcinoma in situ (DCIS) is a non-invasive breast cancer that can progress into invasive ductal carcinoma (IDC). Studies suggest DCIS is often overtreated since a considerable part of DCIS lesions may never progress into IDC. Lower grade lesions have a lower progression speed and risk, possibly allowing treatment de-escalation. However, studies show significant inter-observer variation in D… ▽ More

    Submitted 7 October, 2020; originally announced October 2020.

    Journal ref: Laboratory Investigation. Published February 19th, 2021

  14. Deep learning assessment of breast terminal duct lobular unit involution: towards automated prediction of breast cancer risk

    Authors: Suzanne C Wetstein, Allison M Onken, Christina Luffman, Gabrielle M Baker, Michael E Pyle, Kevin H Kensler, Ying Liu, Bart Bakker, Ruud Vlutters, Marinus B van Leeuwen, Laura C Collins, Stuart J Schnitt, Josien PW Pluim, Rulla M Tamimi, Yu**g J Heng, Mitko Veta

    Abstract: Terminal ductal lobular unit (TDLU) involution is the regression of milk-producing structures in the breast. Women with less TDLU involution are more likely to develop breast cancer. A major bottleneck in studying TDLU involution in large cohort studies is the need for labor-intensive manual assessment of TDLUs. We developed a computational pathology solution to automatically capture TDLU involuti… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

  15. Predicting breast tumor proliferation from whole-slide images: the TUPAC16 challenge

    Authors: Mitko Veta, Yu**g J. Heng, Nikolas Stathonikos, Babak Ehteshami Bejnordi, Francisco Beca, Thomas Wollmann, Karl Rohr, Manan A. Shah, Dayong Wang, Mikael Rousson, Martin Hedlund, David Tellez, Francesco Ciompi, Erwan Zerhouni, David Lanyi, Matheus Viana, Vassili Kovalev, Vitali Liauchuk, Hady Ahmady Phoulady, Talha Qaiser, Simon Graham, Nasir Rajpoot, Erik Sjöblom, Jesper Molin, Kyunghyun Paeng , et al. (8 additional authors not shown)

    Abstract: Tumor proliferation is an important biomarker indicative of the prognosis of breast cancer patients. Assessment of tumor proliferation in a clinical setting is highly subjective and labor-intensive task. Previous efforts to automate tumor proliferation assessment by image analysis only focused on mitosis detection in predefined tumor regions. However, in a real-world scenario, automatic mitosis de… ▽ More

    Submitted 29 March, 2019; v1 submitted 22 July, 2018; originally announced July 2018.

    Comments: Overview paper of the TUPAC16 challenge: http://tupac.tue-image.nl/