Skip to main content

Showing 1–19 of 19 results for author: Dong, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16878  [pdf, ps, other

    eess.SP cs.AI cs.IT

    Benchmarking Semantic Communications for Image Transmission Over MIMO Interference Channels

    Authors: Yanhu Wang, Shuaishuai Guo, Anming Dong, Hui Zhao

    Abstract: Semantic communications offer promising prospects for enhancing data transmission efficiency. However, existing schemes have predominantly concentrated on point-to-point transmissions. In this paper, we aim to investigate the validity of this claim in interference scenarios compared to baseline approaches. Specifically, our focus is on general multiple-input multiple-output (MIMO) interference cha… ▽ More

    Submitted 10 April, 2024; originally announced June 2024.

  2. arXiv:2406.10778  [pdf, other

    cs.CE stat.AP

    Heterogeneous Entity Representation for Medicinal Synergy Prediction

    Authors: Jiawei Wu, Jun Wen, Mingyuan Yan, Anqi Dong, Can Chen

    Abstract: Medicinal synergy prediction is a powerful tool in drug discovery and development that harnesses the principles of combination therapy to enhance therapeutic outcomes by improving efficacy, reducing toxicity, and preventing drug resistance. While a myriad of computational methods has emerged for predicting synergistic drug combinations, a large portion of them may overlook the intricate, yet criti… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures

    MSC Class: 92C50; 05C65; 68T07

  3. arXiv:2405.12171  [pdf, other

    cs.SE cs.CV

    State of the Practice for Medical Imaging Software

    Authors: W. Spencer Smith, Ao Dong, Jacques Carette, Michael D. Noseworthy

    Abstract: We selected 29 medical imaging projects from 48 candidates, assessed 10 software qualities by answering 108 questions for each software project, and interviewed 8 of the 29 development teams. Based on the quantitative data, we ranked the MI software with the Analytic Hierarchy Process (AHP). The four top-ranked software products are 3D Slicer, ImageJ, Fiji, and OHIF Viewer. Generally, MI software… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 73 pages, 14 figures, 12 tables

    ACM Class: D.2.7; I.4.0

  4. MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

    Authors: Qi Chen, Xiubo Geng, Corby Rosset, Carolyn Buractaon, **gwen Lu, Tao Shen, Kun Zhou, Chenyan Xiong, Yeyun Gong, Paul Bennett, Nick Craswell, Xing Xie, Fan Yang, Bryan Tower, Nikhil Rao, Anlei Dong, Wenqi Jiang, Zheng Liu, Mingqin Li, Chuanjie Liu, Zengzhong Li, Rangan Majumder, Jennifer Neville, Andy Oakley, Knut Magne Risvik , et al. (6 additional authors not shown)

    Abstract: Recent breakthroughs in large models have highlighted the critical significance of data scale, labels and modals. In this paper, we introduce MS MARCO Web Search, the first large-scale information-rich web dataset, featuring millions of real clicked query-document labels. This dataset closely mimics real-world web document and query distribution, provides rich information for various kinds of down… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, for associated dataset, see http://github.com/microsoft/MS-MARCO-Web-Search

  5. arXiv:2402.01051  [pdf, other

    cs.CL

    Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model

    Authors: Andrew Brown, Jiading Zhu, Mohamed Abdelwahab, Alec Dong, Cindy Wang, Jonathan Rose

    Abstract: Large Foundational Language Models are capable of performing many tasks at a high level but are difficult to deploy in many applications because of their size and proprietary ownership. Many will be motivated to distill specific capabilities of foundational models into smaller models that can be owned and controlled. In the development of a therapeutic chatbot, we wish to distill a capability know… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024 Long Paper

  6. arXiv:2307.07738  [pdf, other

    q-bio.MN cs.LG eess.SY

    Promotion/Inhibition Effects in Networks: A Model with Negative Probabilities

    Authors: Anqi Dong, Tryphon T. Georgiou, Allen Tannenbaum

    Abstract: Biological networks often encapsulate promotion/inhibition as signed edge-weights of a graph. Nodes may correspond to genes assigned expression levels (mass) of respective proteins. The promotion/inhibition nature of co-expression between nodes is encoded in the sign of the corresponding entry of a sign-indefinite adjacency matrix, though the strength of such co-expression (i.e., the precise value… ▽ More

    Submitted 16 August, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: 6 pages

    MSC Class: 92F99; 49M29; 90C30; 93-08; 90C25

  7. arXiv:2305.03793  [pdf, other

    cs.CL cs.LG

    Towards Zero-Shot Frame Semantic Parsing with Task Agnostic Ontologies and Simple Labels

    Authors: Danilo Ribeiro, Omid Abdar, Jack Goetz, Mike Ross, Annie Dong, Kenneth Forbus, Ahmed Mohamed

    Abstract: Frame semantic parsing is an important component of task-oriented dialogue systems. Current models rely on a significant amount training data to successfully identify the intent and slots in the user's input utterance. This creates a significant barrier for adding new domains to virtual assistant capabilities, as creation of this data requires highly specialized NLP expertise. In this work we prop… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    ACM Class: I.2.7; I.2.6

  8. arXiv:2212.14509  [pdf, other

    eess.SY cs.GT math.OC

    Monge-Kantorovich Optimal Transport Through Constrictions and Flow-rate Constraints

    Authors: Anqi Dong, Arthur Stephanovitch, Tryphon T. Georgiou

    Abstract: We consider the problem to transport resources/mass while abiding by constraints on the flow through constrictions along their path between specified terminal distributions. Constrictions, conceptualized as toll stations at specified points, limit the flow rate across. We quantify flow-rate constraints via a bound on a sought probability density of the times that mass-elements cross toll stations… ▽ More

    Submitted 1 May, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: 8 pages, 6 figures

    MSC Class: 49345; 90C08; 26B25

  9. arXiv:2212.09114  [pdf, other

    cs.CL

    CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion

    Authors: Xingwei He, Yeyun Gong, A-Long **, Hang Zhang, Anlei Dong, Jian Jiao, Siu Ming Yiu, Nan Duan

    Abstract: The dual-encoder has become the de facto architecture for dense retrieval. Typically, it computes the latent representations of the query and document independently, thus failing to fully capture the interactions between the query and document. To alleviate this, recent research has focused on obtaining query-informed document representations. During training, it expands the document with a real q… ▽ More

    Submitted 29 October, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: Accetpted to EMNLP 2023

  10. arXiv:2212.05225  [pdf, other

    cs.IR cs.CL

    LEAD: Liberal Feature-based Distillation for Dense Retrieval

    Authors: Hao Sun, Xiao Liu, Yeyun Gong, Anlei Dong, **gwen Lu, Yan Zhang, Linjun Yang, Rangan Majumder, Nan Duan

    Abstract: Knowledge distillation is often used to transfer knowledge from a strong teacher model to a relatively weak student model. Traditional methods include response-based methods and feature-based methods. Response-based methods are widely used but suffer from lower upper limits of performance due to their ignorance of intermediate signals, while feature-based methods have constraints on vocabularies,… ▽ More

    Submitted 11 December, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

    Comments: Accepted by WSDM 2024

  11. arXiv:2211.05457   

    cs.AI

    Syntax-Guided Domain Adaptation for Aspect-based Sentiment Analysis

    Authors: Anguo Dong, Cuiyun Gao, Yan Jia, Qing Liao, Xuan Wang, Lei Wang, **g Xiao

    Abstract: Aspect-based sentiment analysis (ABSA) aims at extracting opinionated aspect terms in review texts and determining their sentiment polarities, which is widely studied in both academia and industry. As a fine-grained classification task, the annotation cost is extremely high. Domain adaptation is a popular solution to alleviate the data deficiency issue in new domains by transferring common knowled… ▽ More

    Submitted 15 August, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: I want to withdraw this article due to personal reason

  12. arXiv:2210.11773  [pdf, other

    cs.CL cs.IR

    SimANS: Simple Ambiguous Negatives Sampling for Dense Text Retrieval

    Authors: Kun Zhou, Yeyun Gong, Xiao Liu, Wayne Xin Zhao, Yelong Shen, Anlei Dong, **gwen Lu, Rangan Majumder, Ji-Rong Wen, Nan Duan, Weizhu Chen

    Abstract: Sampling proper negatives from a large document pool is vital to effectively train a dense retrieval model. However, existing negative sampling strategies suffer from the uninformative or false negative problem. In this work, we empirically show that according to the measured relevance scores, the negatives ranked around the positives are generally more informative and less likely to be false nega… ▽ More

    Submitted 24 October, 2022; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: 12 pages, accepted by EMNLP 2022

  13. arXiv:2210.08634  [pdf, other

    cs.CL cs.SD eess.AS

    SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning

    Authors: Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee

    Abstract: We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised speech representation for better performance, generalization, and efficiency. The challenge builds upon the SUPERB benchmark and implements metrics to measure the computation requirements of self-supervised learning (SSL) representation and to evaluate its generalizability and performance across the diverse SUPERB… ▽ More

    Submitted 29 October, 2022; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: Accepted by 2022 SLT Workshop

  14. arXiv:2209.13335  [pdf, other

    cs.IR cs.CL

    PROD: Progressive Distillation for Dense Retrieval

    Authors: Zhenghao Lin, Yeyun Gong, Xiao Liu, Hang Zhang, Chen Lin, Anlei Dong, Jian Jiao, **gwen Lu, Daxin Jiang, Rangan Majumder, Nan Duan

    Abstract: Knowledge distillation is an effective way to transfer knowledge from a strong teacher to an efficient student model. Ideally, we expect the better the teacher is, the better the student. However, this expectation does not always come true. It is common that a better teacher model results in a bad student via distillation due to the nonnegligible gap between teacher and student. To bridge the gap,… ▽ More

    Submitted 24 June, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted by WWW2023

  15. arXiv:2201.08721  [pdf, other

    cs.IR cs.AI cs.LG

    Less is Less: When Are Snippets Insufficient for Human vs Machine Relevance Estimation?

    Authors: Gabriella Kazai, Bhaskar Mitra, Anlei Dong, Nick Craswell, Linjun Yang

    Abstract: Traditional information retrieval (IR) ranking models process the full text of documents. Newer models based on Transformers, however, would incur a high computational cost when processing long texts, so typically use only snippets from the document instead. The model's input based on a document's URL, title, and snippet (UTS) is akin to the summaries that appear on a search engine results page (S… ▽ More

    Submitted 21 January, 2022; originally announced January 2022.

  16. arXiv:2110.11575  [pdf, other

    cs.SE

    Methodology for Assessing the State of the Practice for Domain X

    Authors: Spencer Smith, Jacques Carette, Peter Michalski, Ao Dong, Olu Owojaiye

    Abstract: To improve software development methods and tools for research software, we first need to understand the current state of the practice. Therefore, we have developed a methodology for assessing the state of the software development practices for a given research software domain. For each domain we wish to answer questions such as: i) What artifacts (documents, code, test cases, etc.) are present? i… ▽ More

    Submitted 21 October, 2021; originally announced October 2021.

    Comments: 35 pages, 3 figures

    ACM Class: D.2.0

  17. arXiv:2106.08888  [pdf, other

    cs.LG cs.AI

    Bandit Modeling of Map Selection in Counter-Strike: Global Offensive

    Authors: Guido Petri, Michael H. Stanley, Alec B. Hon, Alexander Dong, Peter Xenopoulos, Cláudio Silva

    Abstract: Many esports use a pick and ban process to define the parameters of a match before it starts. In Counter-Strike: Global Offensive (CSGO) matches, two teams first pick and ban maps, or virtual worlds, to play. Teams typically ban and pick maps based on a variety of factors, such as banning maps which they do not practice, or choosing maps based on the team's recent performance. We introduce a conte… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 6 pages, 3 figures, IJCAI-AISA 2021

  18. arXiv:2005.09152  [pdf, other

    math.OC cs.DS math.ST stat.AP stat.CO

    Lasso formulation of the shortest path problem

    Authors: Anqi Dong, Amirhossein Taghvaei, Tryphon T. Georgiou

    Abstract: The shortest path problem is formulated as an $l_1$-regularized regression problem, known as lasso. Based on this formulation, a connection is established between Dijkstra's shortest path algorithm and the least angle regression (LARS) for the lasso problem. Specifically, the solution path of the lasso problem, obtained by varying the regularization parameter from infinity to zero (the regularizat… ▽ More

    Submitted 22 May, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 17 pages

    MSC Class: 05C38 (Primary) 62J07; 68R10; 90C25; 90C06(Secondary)

  19. arXiv:1310.1050  [pdf, ps, other

    cs.DC cs.SE eess.SY

    The failure tolerance of mechatronic software systems to random and targeted attacks

    Authors: Dharshana Kasthurirathna, Andy Dong, Mahendrarajah Piraveenan, Irem Y. Tumer

    Abstract: This paper describes a complex networks approach to study the failure tolerance of mechatronic software systems under various types of hardware and/or software failures. We produce synthetic system architectures based on evidence of modular and hierarchical modular product architectures and known motifs for the interconnection of physical components to software. The system architectures are then s… ▽ More

    Submitted 27 September, 2013; originally announced October 2013.

    Comments: Proceedings of the 2013 ASME International Design Engineering Technical Conferences & Computers and Information in Engineering Conference IDETC/CIE 2013 August 4-7, 2013, Portland, Oregon, USA (In Print)