Skip to main content

Showing 1–24 of 24 results for author: Wen, E

.
  1. arXiv:2404.18347  [pdf, other

    physics.app-ph

    Helical Phononic Modes Induced by a Screw Dislocation

    Authors: Yun Zhou, Robert Davis, Li Chen, Erda Wen, Prabhakar Bandaru, Daniel Sievenpiper

    Abstract: In this study, we investigate a one-dimensional (1D) unidirectional phononic waveguide embedded within a three-dimensional (3D) hexagonal close-packed phononic crystal, achieved by the introduction of a screw dislocation. This approach does not rely on the non-trivial topological characteristics of the 3D crystal. We discover that this dislocation induces a pair of helical modes, characterized by… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: 13 pages, 4 figures

  2. arXiv:2403.02545  [pdf, other

    cs.LG cs.AI

    Wukong: Towards a Scaling Law for Large-Scale Recommendation

    Authors: Buyun Zhang, Liang Luo, Yuxin Chen, Jade Nie, Xi Liu, Daifeng Guo, Yanli Zhao, Shen Li, Yuchen Hao, Yantao Yao, Guna Lakshminarayanan, Ellie Dingqiao Wen, Jongsoo Park, Maxim Naumov, Wenlin Chen

    Abstract: Scaling laws play an instrumental role in the sustainable improvement in model quality. Unfortunately, recommendation models to date do not exhibit such laws similar to those observed in the domain of large language models, due to the inefficiencies of their upscaling mechanisms. This limitation poses significant challenges in adapting these models to increasingly more complex real-world datasets.… ▽ More

    Submitted 4 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 12 pages

  3. arXiv:2403.00877  [pdf, other

    cs.LG cs.DC cs.IR

    Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation

    Authors: Liang Luo, Buyun Zhang, Michael Tsang, Yinbin Ma, Ching-Hsiang Chu, Yuxin Chen, Shen Li, Yuchen Hao, Yanli Zhao, Guna Lakshminarayanan, Ellie Dingqiao Wen, Jongsoo Park, Dheevatsa Mudigere, Maxim Naumov

    Abstract: We study a mismatch between the deep learning recommendation models' flat architecture, common distributed training paradigm and hierarchical data center topology. To address the associated inefficiencies, we propose Disaggregated Multi-Tower (DMT), a modeling technique that consists of (1) Semantic-preserving Tower Transform (SPTT), a novel training paradigm that decomposes the monolithic global… ▽ More

    Submitted 2 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  4. arXiv:2308.02511  [pdf, other

    physics.app-ph physics.optics

    Multifunctional Metasurface: Simultaneous Beam Steering, Polarization Conversion and Phase Offset

    Authors: Xiaozhen Yang, Erda Wen, Dinesh Bharadia, Daniel F. Sievenpiper

    Abstract: A varactor-based reconfigurable multifunctional metasurface capable of simultaneous beam steering, polarization conversion and phase offset is proposed in this paper. The unit cell is designed to naturally decompose the incident waves into two equal amplitude orthogonal linear components, and by integrating varactors, the reflection phase of the field components can be engineered from… ▽ More

    Submitted 27 July, 2023; originally announced August 2023.

  5. arXiv:2307.11096  [pdf, other

    cs.IR cs.LG

    Towards the Better Ranking Consistency: A Multi-task Learning Framework for Early Stage Ads Ranking

    Authors: Xuewei Wang, Qiang **, Shengyu Huang, Min Zhang, Xi Liu, Zhengli Zhao, Yukun Chen, Zhengyu Zhang, Jiyan Yang, Ellie Wen, Sagar Chordia, Wenlin Chen, Qin Huang

    Abstract: Dividing ads ranking system into retrieval, early, and final stages is a common practice in large scale ads recommendation to balance the efficiency and accuracy. The early stage ranking often uses efficient models to generate candidates out of a set of retrieved ads. The candidates are then fed into a more computationally intensive but accurate final stage ranking system to produce the final ads… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted by AdKDD 23

  6. arXiv:2306.03381  [pdf, other

    cs.AI

    VR.net: A Real-world Dataset for Virtual Reality Motion Sickness Research

    Authors: Elliott Wen, Chitralekha Gupta, Prasanth Sasikumar, Mark Billinghurst, James Wilmott, Emily Skow, Arindam Dey, Suranga Nanayakkara

    Abstract: Researchers have used machine learning approaches to identify motion sickness in VR experience. These approaches demand an accurately-labeled, real-world, and diverse dataset for high accuracy and generalizability. As a starting point to address this need, we introduce `VR.net', a dataset offering approximately 12-hour gameplay videos from ten real-world games in 10 diverse genres. For each video… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  7. arXiv:2305.11899  [pdf, other

    physics.app-ph eess.SY physics.optics

    Real-data-driven Real-time Reconfigurable Microwave Reflective Surface

    Authors: Erda Wen, Xiaozhen Yang, Daniel F. Sievenpiper

    Abstract: Manipulating the electromagnetic (EM) reflection behavior from an arbitrary surface dynamically on arbitrary design goals is an ultimate ambition for many EM stealth and communication problems, yet it is nearly impossible to accomplish with conventional analysis and optimization techniques. In this paper we present a reconfigurable conformal metasurface prototype as well as a workflow that enables… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  8. arXiv:2305.02207  [pdf, other

    physics.app-ph physics.optics

    All-passive Microwave-Diode Nonreciprocal Metasurface

    Authors: Xiaozhen Yang, Erda Wen, Daniel F. Sievenpiper

    Abstract: Breaking reciprocity in the microwave frequency range will have important implications for modern electronic systems. Since it usually involves bulky biasing magnets or complex spatial-temporal modulations, exploring a lightweight, all-passive approach becomes intriguing. Starting from a circuit model, we theoretically demonstrate the nonreciprocal behaviour on a transmission line building block c… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  9. arXiv:2304.03450  [pdf, other

    cs.HC

    Striving for Authentic and Sustained Technology Use In the Classroom: Lessons Learned from a Longitudinal Evaluation of a Sensor-based Science Education Platform

    Authors: Yvonne Chua, Sankha Cooray, Juan Pablo Forero Cortes, Paul Denny, Sonia Dupuch, Dawn L Garbett, Alaeddin Nassani, Jiashuo Cao, Hannah Qiao, Andrew Reis, Deviana Reis, Philipp M. Scholl, Priyashri Kamlesh Sridhar, Hussel Suriyaarachchi, Fiona Taimana, Vanessa Tanga, Chamod Weerasinghe, Elliott Wen, Michelle Wu, Qin Wu, Haimo Zhang, Suranga Nanayakkara

    Abstract: Technology integration in educational settings has led to the development of novel sensor-based tools that enable students to measure and interact with their environment. Although reports from using such tools can be positive, evaluations are often conducted under controlled conditions and short timeframes. There is a need for longitudinal data collected in realistic classroom settings. However, s… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  10. arXiv:2302.08446  [pdf

    cond-mat.mtrl-sci

    Engineering Robust Metallic Zero-Mode States in Olympicene Graphene Nanoribbons

    Authors: Ryan D. McCurdy, Aidan Delgado, **gwei Jiang, Junmian Zhu, Ethan Chi Ho Wen, Raymond E. Blackwell, Gregory C. Veber, Shenkai Wang, Steven G. Louie, Felix R. Fischer

    Abstract: Metallic graphene nanoribbons (GNRs) represent a critical component in the toolbox of low-dimensional functional materials technolo-gy serving as 1D interconnects capable of both electronic and quantum information transport. The structural constraints imposed by on-surface bottom-up GNR synthesis protocols along with the limited control over orientation and sequence of asymmetric monomer building… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 8 pages, 4 figures

  11. arXiv:2210.02627  [pdf, other

    cs.CL cs.IR

    Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering

    Authors: Shamane Siriwardhana, Rivindu Weerasekera, Elliott Wen, Tharindu Kaluarachchi, Rajib Rana, Suranga Nanayakkara

    Abstract: Retrieval Augment Generation (RAG) is a recent advancement in Open-Domain Question Answering (ODQA). RAG has only been trained and explored with a Wikipedia-based external knowledge base and is not optimized for use in other specialized domains such as healthcare and news. In this paper, we evaluate the impact of joint training of the retriever and generator components of RAG for the task of domai… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: This paper is awaiting publication at Transactions of the Association for Computational Linguistics. This is a pre-MIT Press publication version. For associated huggingface transformers code, see https://github.com/huggingface/transformers/tree/main/examples/research_projects/rag-end2end-retriever

  12. arXiv:2205.07333  [pdf, other

    cs.HC cs.CV

    Trucks Don't Mean Trump: Diagnosing Human Error in Image Analysis

    Authors: J. D. Zamfirescu-Pereira, Jerry Chen, Emily Wen, Allison Koenecke, Nikhil Garg, Emma Pierson

    Abstract: Algorithms provide powerful tools for detecting and dissecting human bias and error. Here, we develop machine learning methods to to analyze how humans err in a particular high-stakes task: image interpretation. We leverage a unique dataset of 16,135,392 human predictions of whether a neighborhood voted for Donald Trump or Joe Biden in the 2020 US election, based on a Google Street View image. We… ▽ More

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: To be published in FAccT 2022

  13. arXiv:2203.11014  [pdf, other

    cs.IR cs.AI cs.LG

    DHEN: A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction

    Authors: Buyun Zhang, Liang Luo, Xi Liu, Jay Li, Zeliang Chen, Weilin Zhang, Xiaohan Wei, Yuchen Hao, Michael Tsang, Wenjun Wang, Yang Liu, Huayu Li, Yasmine Badr, Jongsoo Park, Jiyan Yang, Dheevatsa Mudigere, Ellie Wen

    Abstract: Learning feature interactions is important to the model performance of online advertising services. As a result, extensive efforts have been devoted to designing effective architectures to learn feature interactions. However, we observe that the practical performance of those designs can vary from dataset to dataset, even when the order of interactions claimed to be captured is the same. That indi… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

  14. arXiv:2201.05907  [pdf, ps, other

    physics.app-ph cond-mat.mes-hall physics.optics

    Designing Topological Defect Lines Protected by Gauge-dependent Symmetry Indicators

    Authors: Erda Wen, Dia'aaldin J. Bisharat, Robert J. Davis, Xiaozhen Yang, Daniel F. Sievenpiper

    Abstract: Symmetry indicators are a modern tool for characterizing topological phases that require only minimal computational expense but provide an elegant means of designing practical devices. This paper demonstrates how a rotational symmetry indicator can be used to construct and characterize a topologically robust waveguide, which is then verified experimentally on a printed circuit board (PCB) platform… ▽ More

    Submitted 15 January, 2022; originally announced January 2022.

  15. arXiv:2111.04502  [pdf, other

    physics.app-ph physics.optics

    Power-dependent Reflective Metasurface with Self-induced Bandgap

    Authors: Xiaozhen Yang, Erda Wen, Daniel F. Sievenpiper

    Abstract: A metallic ring based, diode-integrated, low-profile, power-dependent, reflective metasurface working from 3 GHz to 3.6 GHz is proposed in this letter. Unlike the previous study which shifts a band up and down to change the impedance of the surface, the triggering of the diodes directly transforms the structure from a surface wave supportive state to a self-induced bandgap topology if exposed to h… ▽ More

    Submitted 4 November, 2021; originally announced November 2021.

  16. arXiv:2110.12843  [pdf, other

    physics.app-ph

    Broadband time-modulated absorber beyond the Bode-Fano limit by energy trap**

    Authors: Xiaozhen Yang, Erda Wen, Daniel F. Sievenpiper

    Abstract: Wide-band absorption is a popular topic in microwave engineering to protect sensitive devices against broadband sources. However, the Bode-Fano criterion defines the trade-off between bandwidth and efficiency for all passive, linear, time-invariant systems. In this letter, we propose a broadband absorber beyond the Bode-Fano limit by creating an energy trap using time-modulated switch/diodes. This… ▽ More

    Submitted 4 October, 2021; originally announced October 2021.

  17. arXiv:2109.02436  [pdf, other

    eess.IV cs.CV cs.LG

    ReLaX: Retinal Layer Attribution for Guided Explanations of Automated Optical Coherence Tomography Classification

    Authors: Evan Wen, Rebecca Sorenson, Max Ehrlich

    Abstract: 30 million Optical Coherence Tomography (OCT) imaging tests are issued annually to diagnose various retinal diseases, but accurate diagnosis of OCT scans requires trained eye care professionals who are still prone to making errors. With better systems for diagnosis, many cases of vision loss caused by retinal disease could be entirely avoided. In this work, we present ReLaX, a novel deep learning… ▽ More

    Submitted 1 October, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: ECCV 2022 Medical Computer Vision Workshop

  18. arXiv:2106.11517  [pdf, ps, other

    cs.IR cs.CL

    Fine-tune the Entire RAG Architecture (including DPR retriever) for Question-Answering

    Authors: Shamane Siriwardhana, Rivindu Weerasekera, Elliott Wen, Suranga Nanayakkara

    Abstract: In this paper, we illustrate how to fine-tune the entire Retrieval Augment Generation (RAG) architecture in an end-to-end manner. We highlighted the main engineering challenges that needed to be addressed to achieve this objective. We also compare how end-to-end RAG architecture outperforms the original RAG architecture for the task of question answering. We have open-sourced our implementation in… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

    Comments: for associated code, see https://github.com/huggingface/transformers/tree/master/examples/research_projects/rag-end2end-retriever

  19. arXiv:2105.12676  [pdf, other

    cs.LG cs.AR cs.IR cs.PF math.NA

    Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

    Authors: Zhaoxia, Deng, Jongsoo Park, ** Tak Peter Tang, Haixin Liu, Jie, Yang, Hector Yuen, Jianyu Huang, Daya Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Satish Nadathur, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy

    Abstract: Tremendous success of machine learning (ML) and the unabated growth in ML model complexity motivated many ML-specific designs in both CPU and accelerator architectures to speed up the model inference. While these architectures are diverse, highly optimized low-precision arithmetic is a component shared by most. Impressive compute throughputs are indeed often exhibited by these architectures on ben… ▽ More

    Submitted 26 May, 2021; originally announced May 2021.

  20. arXiv:2104.05158  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

    Authors: Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng , et al. (28 additional authors not shown)

    Abstract: Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pa… ▽ More

    Submitted 26 February, 2023; v1 submitted 11 April, 2021; originally announced April 2021.

  21. arXiv:2102.10484  [pdf, other

    cs.CV cs.AI cs.LG

    CheXseg: Combining Expert Annotations with DNN-generated Saliency Maps for X-ray Segmentation

    Authors: Soham Gadgil, Mark Endo, Emily Wen, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Medical image segmentation models are typically supervised by expert annotations at the pixel-level, which can be expensive to acquire. In this work, we propose a method that combines the high quality of pixel-level expert annotations with the scale of coarse DNN-generated saliency maps for training multi-label semantic segmentation models. We demonstrate the application of our semi-supervised met… ▽ More

    Submitted 17 May, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Accepted to Medical Imaging with Deep Learning (MIDL) Conference 2021

  22. arXiv:2010.08655  [pdf, other

    cs.LG

    Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data

    Authors: Mao Ye, Dhruv Choudhary, Jiecao Yu, Ellie Wen, Zeliang Chen, Jiyan Yang, Jongsoo Park, Qiang Liu, Arun Kejariwal

    Abstract: Large scale deep learning provides a tremendous opportunity to improve the quality of content recommendation systems by employing both wider and deeper models, but this comes at great infrastructural cost and carbon footprint in modern data centers. Pruning is an effective technique that reduces both memory and compute demand for model inference. However, pruning for online recommendation systems… ▽ More

    Submitted 21 October, 2020; v1 submitted 16 October, 2020; originally announced October 2020.

  23. arXiv:2003.13593  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    How Not to Give a FLOP: Combining Regularization and Pruning for Efficient Inference

    Authors: Tai Vu, Emily Wen, Roy Nehoran

    Abstract: The challenge of speeding up deep learning models during the deployment phase has been a large, expensive bottleneck in the modern tech industry. In this paper, we examine the use of both regularization and pruning for reduced computational complexity and more efficient inference in Deep Neural Networks (DNNs). In particular, we apply mixup and cutout regularizations and soft filter pruning to the… ▽ More

    Submitted 9 April, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Citations added, typos fixed

  24. arXiv:1902.09451  [pdf, other

    cs.NI cs.AI

    Optimizing Controller Placement for Software-Defined Networks

    Authors: Victoria Huang, Gang Chen, Qiang Fu, Elliott Wen

    Abstract: Controller placement problem (CPP) is a key issue for Software-Defined Networking (SDN) with distributed controller architectures. This problem aims to determine a suitable number of controllers deployed in important locations so as to optimize the overall network performance. In comparison to communication delay, existing literature on the CPP assumes that the influence of controller workload dis… ▽ More

    Submitted 14 February, 2019; originally announced February 2019.

    Journal ref: 2019 IFIP/IEEE Symposium on Integrated Network and Service Management (IM) (2019) 224-232