Skip to main content

Showing 1–50 of 75 results for author: Bui, D

.
  1. arXiv:2406.11927  [pdf, other

    cs.SE cs.AI

    REPOEXEC: Evaluate Code Generation with a Repository-Level Executable Benchmark

    Authors: Nam Le Hai, Dung Manh Nguyen, Nghi D. Q. Bui

    Abstract: The ability of CodeLLMs to generate executable and functionally correct code at the repository-level scale remains largely unexplored. We introduce RepoExec, a novel benchmark for evaluating code generation at the repository-level scale. RepoExec focuses on three main aspects: executability, functional correctness through automated test case generation with high coverage rate, and carefully crafte… ▽ More

    Submitted 19 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2406.11912  [pdf, other

    cs.SE cs.AI

    AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile Methodology

    Authors: Minh Huynh Nguyen, Thang Phan Chau, Phong X. Nguyen, Nghi D. Q. Bui

    Abstract: Software agents have emerged as promising tools for addressing complex software engineering tasks. However, existing works oversimplify software development workflows by following the waterfall model. Thus, we propose AgileCoder, a multi-agent system that integrates Agile Methodology (AM) into the framework. This system assigns specific AM roles such as Product Manager, Developer, and Tester to di… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  3. arXiv:2405.02695  [pdf, ps, other

    cs.DS cs.DC

    Improved All-Pairs Approximate Shortest Paths in Congested Clique

    Authors: Hong Duc Bui, Shashwat Chandra, Yi-Jun Chang, Michal Dory, Dean Leitersdorf

    Abstract: In this paper, we present new algorithms for approximating All-Pairs Shortest Paths (APSP) in the Congested Clique model. We present randomized algorithms for weighted undirected graphs. Our first contribution is an $O(1)$-approximate APSP algorithm taking just $O(\log \log \log n)$ rounds. Prior to our work, the fastest algorithms that give an $O(1)$-approximation for APSP take… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  4. arXiv:2405.02010  [pdf, other

    cs.CL

    The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification

    Authors: Minh Duc Bui, Katharina von der Wense

    Abstract: Current natural language processing (NLP) research tends to focus on only one or, less frequently, two dimensions - e.g., performance, privacy, fairness, or efficiency - at a time, which may lead to suboptimal conclusions and often overlooking the broader goal of achieving trustworthy NLP. Work on adapter modules (Houlsby et al., 2019; Hu et al., 2021) focuses on improving performance and efficien… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted to the 4th Workshop on Trustworthy Natural Language Processing (TrustNLP) at NAACL 2024

  5. arXiv:2404.19319  [pdf, other

    cs.CL

    Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget

    Authors: Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, Katharina von der Wense

    Abstract: Compared to standard language model (LM) pretraining (i.e., from scratch), Knowledge Distillation (KD) entails an additional forward pass through a teacher model that is typically substantially larger than the target student model. As such, KD in LM pretraining materially slows down throughput of pretraining instances vis-a-vis pretraining from scratch. Scaling laws of LM pretraining suggest that… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted to the 5th Workshop on Insights from Negative Results in NLP at NAACL 2024

  6. Radial Basis Function Neural Networks for Formation Control of Unmanned Aerial Vehicles

    Authors: Duy-Nam Bui, Manh Duong Phung

    Abstract: This paper addresses the problem of controlling multiple unmanned aerial vehicles (UAVs) cooperating in a formation to carry out a complex task such as surface inspection. We first use the virtual leader-follower model to determine the topology and trajectory of the formation. A double-loop control system combining backstep** and sliding mode control techniques is then designed for the UAVs to t… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Journal ref: Robotica, 2024

  7. arXiv:2403.14592  [pdf, other

    cs.SE cs.AI cs.HC

    Envisioning the Next-Generation AI Coding Assistants: Insights & Proposals

    Authors: Khanh Nghiem, Anh Minh Nguyen, Nghi D. Q. Bui

    Abstract: As a research-product hybrid group in AI for Software Engineering (AI4SE), we present four key takeaways from our experience develo** in-IDE AI coding assistants. AI coding assistants should set clear expectations for usage, integrate with advanced IDE capabilities and existing extensions, use extendable backend designs, and collect app data responsibly for downstream analyses. We propose open q… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  8. arXiv:2403.06119  [pdf, other

    cs.CV

    CLEAR: Cross-Transformers with Pre-trained Language Model is All you need for Person Attribute Recognition and Retrieval

    Authors: Doanh C. Bui, Thinh V. Le, Ba Hung Ngo, Tae Jong Choi

    Abstract: Person attribute recognition and attribute-based retrieval are two core human-centric tasks. In the recognition task, the challenge is specifying attributes depending on a person's appearance, while the retrieval task involves searching for matching persons based on attribute queries. There is a significant relationship between recognition and retrieval tasks. In this study, we demonstrate that if… ▽ More

    Submitted 30 April, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  9. arXiv:2403.06095  [pdf, other

    cs.SE cs.AI

    RepoHyper: Better Context Retrieval Is All You Need for Repository-Level Code Completion

    Authors: Huy N. Phan, Hoang N. Phan, Tien N. Nguyen, Nghi D. Q. Bui

    Abstract: Code Large Language Models (CodeLLMs) have demonstrated impressive proficiency in code completion tasks. However, they often fall short of fully understanding the extensive context of a project repository, such as the intricacies of relevant files and class hierarchies, which can result in less precise completions. To overcome these limitations, we present \tool, a multifaceted framework designed… ▽ More

    Submitted 16 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: Under Review

  10. arXiv:2402.11101  [pdf

    cond-mat.mtrl-sci cs.CE cs.LG

    Physics-based material parameters extraction from perovskite experiments via Bayesian optimization

    Authors: Hualin Zhan, Viqar Ahmad, Azul Mayon, Grace Tabi, Anh Dinh Bui, Zhuofeng Li, Daniel Walter, Hieu Nguyen, Klaus Weber, Thomas White, Kylie Catchpole

    Abstract: The ability to extract material parameters of perovskite from quantitative experimental analysis is essential for rational design of photovoltaic and optoelectronic applications. However, the difficulty of this analysis increases significantly with the complexity of the theoretical model and the number of material parameters for perovskite. Here we use Bayesian optimization to develop an analysis… ▽ More

    Submitted 29 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: The work is published in Energy & Environmental Science (DOI: 10.1039/D4EE00911H). This work is supported by the Australian Centre for Advanced Photovoltaics (ACAP) and received funding from the Australian Renewable Energy Agency (ARENA). H.Z. acknowledges the support of the ACAP Fellowship. H.Z. thanks Pawsey for providing the Nimbus Research Cloud Service

  11. Ant Colony Optimization for Cooperative Inspection Path Planning Using Multiple Unmanned Aerial Vehicles

    Authors: Duy Nam Bui, Thuy Ngan Duong, Manh Duong Phung

    Abstract: This paper presents a new swarm intelligence-based approach to deal with the cooperative path planning problem of unmanned aerial vehicles (UAVs), which is essential for the automatic inspection of infrastructure. The approach uses a 3D model of the structure to generate viewpoints for the UAVs. The calculation of the viewpoints considers the constraints related to the UAV formation model, camera… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Published in: 2024 IEEE/SICE International Symposium on System Integration (SII)

  12. Self-Reconfigurable V-shape Formation of Multiple UAVs in Narrow Space Environments

    Authors: Duy Nam Bui, Manh Duong Phung, Hung Pham Duy

    Abstract: This paper presents the design and implementation of a self-reconfigurable V-shape formation controller for multiple unmanned aerial vehicles (UAVs) navigating through narrow spaces in a dense obstacle environment. The selection of the V-shape formation is motivated by its maneuverability and visibility advantages. The main objective is to develop an effective formation control strategy that allow… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Published in: 2024 IEEE/SICE International Symposium on System Integration (SII)

  13. arXiv:2311.10102  [pdf, other

    cond-mat.stat-mech

    Mechanical Attributes of Fractal Dragons

    Authors: Huy T. Q. Phan, Duc M. Bui, Cong T. Than, Trung V. Phan

    Abstract: Fractals are ubiquitous natural emergences that have gained increased attention in engineering applications, thanks to recent technological advancements enabling the fabrication of structures spanning across many spatial scales. We show how the geometries of fractals can be exploited to determine their important mechanical properties, such as the first and second moments, which physically correspo… ▽ More

    Submitted 12 November, 2023; originally announced November 2023.

  14. arXiv:2311.03366  [pdf, other

    cs.SE cs.AI cs.LG

    Functional Overlap Reranking for Neural Code Generation

    Authors: Hung Quoc To, Minh Huynh Nguyen, Nghi D. Q. Bui

    Abstract: Code Large Language Models (CodeLLMs) have ushered in a new era in code generation advancements. However, selecting the best code solutions from all possible CodeLLM outputs remains a challenge. Previous methods often overlooked the intricate functional similarities and interactions between solution clusters. We introduce SRank, a novel reranking strategy for selecting the best solutions from code… ▽ More

    Submitted 22 June, 2024; v1 submitted 16 October, 2023; originally announced November 2023.

    Comments: EMNLP 2024, Long Findings

  15. arXiv:2311.00737  [pdf

    cs.LG physics.ins-det physics.med-ph

    Real-Time Magnetic Tracking and Diagnosis of COVID-19 via Machine Learning

    Authors: Dang Nguyen, Phat K. Huynh, Vinh Duc An Bui, Kee Young Hwang, Nityanand Jain, Chau Nguyen, Le Huu Nhat Minh, Le Van Truong, Xuan Thanh Nguyen, Dinh Hoang Nguyen, Le Tien Dung, Trung Q. Le, Manh-Huong Phan

    Abstract: The COVID-19 pandemic underscored the importance of reliable, noninvasive diagnostic tools for robust public health interventions. In this work, we fused magnetic respiratory sensing technology (MRST) with machine learning (ML) to create a diagnostic platform for real-time tracking and diagnosis of COVID-19 and other respiratory diseases. The MRST precisely captures breathing patterns through thre… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  16. arXiv:2306.06347  [pdf, other

    cs.SE

    DocChecker: Bootstrap** Code Large Language Model for Detecting and Resolving Code-Comment Inconsistencies

    Authors: Anh T. V. Dau, ** L. C. Guo, Nghi D. Q. Bui

    Abstract: Comments within source code are essential for developers to comprehend the code's purpose and ensure its correct usage. However, as codebases evolve, maintaining an accurate alignment between the comments and the code becomes increasingly challenging. Recognizing the growing interest in automated solutions for detecting and correcting differences between code and its accompanying comments, current… ▽ More

    Submitted 2 February, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

    Journal ref: EACL 2024 - Demonstration track

  17. arXiv:2306.00029  [pdf, other

    cs.SE cs.AI

    CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

    Authors: Nghi D. Q. Bui, Hung Le, Yue Wang, Junnan Li, Akhilesh Deepak Gotmare, Steven C. H. Hoi

    Abstract: Code intelligence plays a key role in transforming modern software engineering. Recently, deep learning-based models, especially Transformer-based large language models (LLMs), have demonstrated remarkable potential in tackling these tasks by leveraging massive open-source code data and programming language features. However, the development and deployment of such models often require expertise in… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Ongoing work - Draft Preview

  18. arXiv:2305.07922  [pdf, other

    cs.CL cs.LG cs.PL

    CodeT5+: Open Code Large Language Models for Code Understanding and Generation

    Authors: Yue Wang, Hung Le, Akhilesh Deepak Gotmare, Nghi D. Q. Bui, Junnan Li, Steven C. H. Hoi

    Abstract: Large language models (LLMs) pretrained on vast source code have achieved prominent progress in code intelligence. However, existing code LLMs have two main limitations in terms of architecture and pretraining tasks. First, they often adopt a specific architecture (encoder-only or decoder-only) or rely on a unified encoder-decoder network for different downstream tasks. The former paradigm is limi… ▽ More

    Submitted 20 May, 2023; v1 submitted 13 May, 2023; originally announced May 2023.

    Comments: 26 pages, preprint

  19. arXiv:2305.06156  [pdf, other

    cs.CL cs.AI cs.PL cs.SE

    The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation

    Authors: Dung Nguyen Manh, Nam Le Hai, Anh T. V. Dau, Anh Minh Nguyen, Khanh Nghiem, ** Guo, Nghi D. Q. Bui

    Abstract: We present The Vault, a dataset of high-quality code-text pairs in multiple programming languages for training large language models to understand and generate code. We present methods for thoroughly extracting samples that use both rule-based and deep learning-based methods to ensure that they contain high-quality pairs of code and text, resulting in a dataset of 43 million high-quality code-text… ▽ More

    Submitted 30 October, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023, Long Findings

  20. Development of a Vision System to Enhance the Reliability of the Pick-and-Place Robot for Autonomous Testing of Camera Module used in Smartphones

    Authors: Hoang-Anh Phan, Duy Nam Bui, Tuan Nguyen Dinh, Bao-Anh Hoang, An Nguyen Ngoc, Dong Tran Huu Quoc, Ha Tran Thi Thuy, Tung Thanh Bui, Van Nguyen Thi Thanh

    Abstract: Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Published to 2021 International Conference on Engineering and Emerging Technologies (ICEET 2021). 6 pages

  21. arXiv:2305.04166  [pdf, other

    cs.CV cs.CL

    UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese

    Authors: Doanh C. Bui, Nghia Hieu Nguyen, Khang Nguyen

    Abstract: Image Captioning is one of the vision-language tasks that still interest the research community worldwide in the 2020s. MS-COCO Caption benchmark is commonly used to evaluate the performance of advanced captioning models, although it was published in 2015. Recent captioning models trained on the MS-COCO Caption dataset only have good performance in language patterns of English; they do not have su… ▽ More

    Submitted 9 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 10 pages, 7 figures, submitted to Elsevier

  22. arXiv:2305.01384  [pdf, other

    cs.CL cs.LG

    Class based Influence Functions for Error Detection

    Authors: Thang Nguyen-Duc, Hoang Thanh-Tung, Quan Hung Tran, Dang Huu-Tien, Hieu Ngoc Nguyen, Anh T. V. Dau, Nghi D. Q. Bui

    Abstract: Influence functions (IFs) are a powerful tool for detecting anomalous examples in large scale datasets. However, they are unstable when applied to deep networks. In this paper, we provide an explanation for the instability of IFs and develop a solution to this problem. We show that IFs are unreliable when the two data points belong to two different classes. Our solution leverages class information… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: Thang Nguyen-Duc, Hoang Thanh-Tung, and Quan Hung Tran are co-first authors of this paper. 12 pages, 12 figures. Accepted to ACL 2023

  23. arXiv:2305.00084  [pdf

    cs.HC

    CarGameAR: An Integrated AR Car Game Authoring Interface for Custom-Built Car Programed on Arduino Board

    Authors: Dang Bui, Wanwan Li, Hong Huang

    Abstract: In this paper, we present CarGameAR: An Integrated AR Car Game Authoring Interface for Custom-Built Car Programed on Arduino Board. The car consists of an Arduino board, an H-bridge, and motors. The objective of the project is to create a system that can move a car in different directions using a computer application. The system uses Unity software to create a virtual environment where the user ca… ▽ More

    Submitted 28 April, 2023; originally announced May 2023.

  24. arXiv:2304.01228  [pdf, other

    cs.CL cs.AI

    Better Language Models of Code through Self-Improvement

    Authors: Hung Quoc To, Nghi D. Q. Bui, ** Guo, Tien N. Nguyen

    Abstract: Pre-trained language models for code (PLMCs) have gained attention in recent research. These models are pre-trained on large-scale datasets using multi-modal objectives. However, fine-tuning them requires extensive supervision and is limited by the size of the dataset provided. We aim to improve this issue by proposing a simple data augmentation framework. Our framework utilizes knowledge gained d… ▽ More

    Submitted 9 May, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Accepted to Findings, ACL 2023

  25. arXiv:2212.13209  [pdf, other

    cs.RO

    Deployment of UAVs for Optimal Multihop Ad-hoc Networks Using Particle Swarm Optimization and Behavior-based Control

    Authors: Ngan Duong Thi Thuy, Duy Nam Bui, Manh Duong Phung, Hung Pham Duy

    Abstract: This study proposes an approach for establishing an optimal multihop ad-hoc network using multiple unmanned aerial vehicles (UAVs) to provide emergency communication in disaster areas. The approach includes two stages, one uses particle swarm optimization (PSO) to find optimal positions to deploy UAVs, and the other uses a behavior-based controller to navigate the UAVs to their assigned positions… ▽ More

    Submitted 26 December, 2022; originally announced December 2022.

    Comments: In the 11th International Conference on Control, Automation and Information Sciences (ICCAIS 2022), Hanoi, Vietnam

  26. arXiv:2211.14875  [pdf, other

    cs.SE cs.CL

    Detect-Localize-Repair: A Unified Framework for Learning to Debug with CodeT5

    Authors: Nghi D. Q. Bui, Yue Wang, Steven Hoi

    Abstract: Automated software debugging is a crucial task for improving the productivity of software developers. Many neural-based techniques have been proven effective for debugging-related tasks such as bug localization and program repair (or bug fixing). However, these techniques often focus only on either one of them or approach them in a stage-wise manner, ignoring the mutual benefits between them. In t… ▽ More

    Submitted 22 December, 2022; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted to EMNLP 2022 Findings Track

  27. Optimal sizing of renewable energy storage: A comparative study of hydrogen and battery system considering degradation and seasonal storage

    Authors: Son Tay Le, Tuan Ngoc Nguyen, Dac-Khuong Bui, Tuan Duc Ngo

    Abstract: Renewable energy storage (RES) is essential to address the intermittence issues of renewable energy systems, thereby enhancing the system stability and reliability. This study presents an optimisation study of sizing and operational strategy parameters of a grid-connected photovoltaic (PV)-hydrogen/battery systems using a Multi-Objective Modified Firefly Algorithm (MOMFA). An operational strategy… ▽ More

    Submitted 14 November, 2022; originally announced November 2022.

  28. arXiv:2211.04773  [pdf, other

    cs.CV

    SG-Shuffle: Multi-aspect Shuffle Transformer for Scene Graph Generation

    Authors: Anh Duc Bui, Soyeon Caren Han, Josiah Poon

    Abstract: Scene Graph Generation (SGG) serves a comprehensive representation of the images for human understanding as well as visual understanding tasks. Due to the long tail bias problem of the object and predicate labels in the available annotated data, the scene graph generated from current methodologies can be biased toward common, non-informative relationship labels. Relationship can sometimes be non-m… ▽ More

    Submitted 9 November, 2022; originally announced November 2022.

  29. Lyapunov-based Nonlinear Model Predictive Control for Attitude Trajectory Tracking of Unmanned Aerial Vehicles

    Authors: Duy Nam Bui, Thi Thanh Van Nguyen, Manh Duong Phung

    Abstract: This paper presents a new Lyapunov-based nonlinear model predictive controller (LNMPC) for the attitude control problem of unmanned aerial vehicles (UAVs), which is essential for their functioning operation. The controller is designed based on a quadratic cost function integrating UAV dynamics and system constraints. An additional contraction constraint is then introduced to ensure closed-loop sys… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

    Journal ref: International Journal of Aeronautical and Space Sciences, 2022

  30. arXiv:2207.04551  [pdf, other

    cs.CV

    Depth Perspective-aware Multiple Object Tracking

    Authors: Kha Gia Quach, Huu Le, Pha Nguyen, Chi Nhan Duong, Tien Dai Bui, Khoa Luu

    Abstract: This paper aims to tackle Multiple Object Tracking (MOT), an important problem in computer vision but remains challenging due to many practical issues, especially occlusions. Indeed, we propose a new real-time Depth Perspective-aware Multiple Object Tracking (DP-MOT) approach to tackle the occlusion problem in MOT. A simple yet efficient Subject-Ordered Depth Estimation (SODE) is first proposed to… ▽ More

    Submitted 27 February, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    Comments: In review PR journal

  31. arXiv:2205.15479  [pdf, other

    cs.SE cs.AI cs.PL

    HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations

    Authors: Minh Huynh Nguyen, Nghi D. Q. Bui, Truong Son Hy, Long Tran-Thanh, Tien N. Nguyen

    Abstract: We propose a novel method for code summarization utilizing Heterogeneous Code Representations (HCRs) and our specially designed HierarchyNet. HCRs effectively capture essential code features at lexical, syntactic, and semantic levels by abstracting coarse-grained code elements and incorporating fine-grained program elements in a hierarchical structure. Our HierarchyNet method processes each layer… ▽ More

    Submitted 9 May, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

  32. arXiv:2205.13022  [pdf, ps, other

    cs.SE cs.AI cs.PL

    Towards Using Data-Influence Methods to Detect Noisy Samples in Source Code Corpora

    Authors: Anh T. V. Dau, Thang Nguyen-Duc, Hoang Thanh-Tung, Nghi D. Q. Bui

    Abstract: Despite the recent trend of develo** and applying neural source code models to software engineering tasks, the quality of such models is insufficient for real-world use. This is because there could be noise in the source code corpora used to train such models. We adapt data-influence methods to detect such noises in this paper. Data-influence methods are used in machine learning to evaluate the… ▽ More

    Submitted 2 October, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: The 37th IEEE/ACM International Conference on Automated Software Engineering

  33. arXiv:2202.12275  [pdf, other

    stat.ML cs.LG

    Partitioned Variational Inference: A Framework for Probabilistic Federated Learning

    Authors: Matthew Ashman, Thang D. Bui, Cuong V. Nguyen, Stratis Markou, Adrian Weller, Siddharth Swaroop, Richard E. Turner

    Abstract: The proliferation of computing devices has brought about an opportunity to deploy machine learning models on new problem domains using previously inaccessible data. Traditional algorithms for training such models often require data to be stored on a single machine with compute performed by a single node, making them unsuitable for decentralised training on multiple devices. This deficiency has mot… ▽ More

    Submitted 28 April, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.11206

  34. arXiv:2201.00590  [pdf

    cs.CG cs.GR

    Two New Algorithms for Line Clip** in E2 and Their Comparison

    Authors: Vaclav Skala, Duc Huy Bui

    Abstract: Many algorithms for clip** a line by a rectangular area or a convex polygon in E2 or by a non-convex or convex polyhedron in E3 have been published. The line segment clip** by the rectangular window in E2 is often restricted to the use of the Cohen-Sutherland (CS) algorithm or its modifications based on some presumptions like small clip** window or more sophisticated coding technique, etc. T… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

    MSC Class: 68U05 ACM Class: I.3

    Journal ref: Machine Graphics & Vision, Vol. 9, no. 1/2, pp. 297-306, 2000

  35. arXiv:2201.00587  [pdf

    cs.CG cs.GR

    A New Algorithm for Pyramidal Clip** of Line Segments in E3

    Authors: Vaclav Skala, Duc Huy Bui

    Abstract: A new algorithm for clip** a line segment against a pyramid in E3 is presented. This algorithm avoids computation of intersection points which are not end-points of the output line segment. It also allows solving all cases more effectively. The performance of this algorithm is shown to be consistently better than existing algorithms, including the Cohen-Sutherland, Liang-Barsky and Cyrus-Beck al… ▽ More

    Submitted 3 January, 2022; originally announced January 2022.

    MSC Class: 68U05 ACM Class: I.3

    Journal ref: Machine GRAPHICS & VISION, Poland Academy of Sciences, Vol. 9, No. 4, 2000, pp. 841-850, ISSN 1230-0535, 2000

  36. arXiv:2112.11226   

    cs.LG cs.AI cs.PL cs.SE

    Energy-bounded Learning for Robust Models of Code

    Authors: Nghi D. Q. Bui, Yijun Yu

    Abstract: In programming, learning code representations has a variety of applications, including code classification, code search, comment generation, bug prediction, and so on. Various representations of code in terms of tokens, syntax trees, dependency graphs, code navigation paths, or a combination of their variants have been proposed, however, existing vanilla learning techniques have a major limitation… ▽ More

    Submitted 9 May, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: There are some flaws in our experiments, we would like to fix it and publish a fixed version again in the very near future

  37. arXiv:2112.03846  [pdf

    physics.med-ph physics.comp-ph

    Monte Carlo calculation of the organ equivalent dose and effective dose due to immersion in a 16N beta source in air using the ICRP Reference Phantoms

    Authors: Jose M. Gomez-Ros, Montserrat Moraleda, Pedro Arce, Duc-Ky Bui, Thi-My-Linh Dang, Laurent Desorgher, Han Sung Kim, Dragana Krstic, Michal Kuc, Ngoc-Thiem Le, Yi-Kang Lee, Ngoc-Quynh Nguyen, Dragoslav Nikezic, Katarzyna Tyminska, Tomas Vrba

    Abstract: This work summarises the results of a comparison organized by EURADOS focused on the usage of the ICRP Reference Computational Phantoms. This activity aimed to provide training for the implementation of voxel phantoms in Monte Carlo radiation transport codes and the calculation of the dose equivalent in organs and the effective dose. This particular case describes a scenario of immersion in a 16N… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 18 pages, 4 figures, 2 tables

    Journal ref: Radiation Measurements 145 (2021) 106612

  38. arXiv:2012.07023  [pdf, other

    cs.SE cs.AI cs.LG cs.PL

    InferCode: Self-Supervised Learning of Code Representations by Predicting Subtrees

    Authors: Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang

    Abstract: Building deep learning models on source code has found many successful software engineering applications, such as code search, code comment generation, bug detection, code migration, and so on. Current learning techniques, however, have a major drawback that these models are mostly trained on datasets labeled for particular downstream tasks, and code representations may not be suitable for other t… ▽ More

    Submitted 15 December, 2020; v1 submitted 13 December, 2020; originally announced December 2020.

    Comments: Accepted at ICSE 2021

  39. arXiv:2012.02463  [pdf, other

    eess.IV cs.CV

    Offset Curves Loss for Imbalanced Problem in Medical Segmentation

    Authors: Ngan Le, Trung Le, Kashu Yamazaki, Toan Duc Bui, Khoa Luu, Marios Savides

    Abstract: Medical image segmentation has played an important role in medical analysis and widely developed for many clinical applications. Deep learning-based approaches have achieved high performance in semantic segmentation but they are limited to pixel-wise setting and imbalanced classes data problem. In this paper, we tackle those limitations by develo** a new deep learning-based model which takes int… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: ICPR 2020

  40. arXiv:2012.01777  [pdf, other

    eess.IV cs.CV

    Flow-based Deformation Guidance for Unpaired Multi-Contrast MRI Image-to-Image Translation

    Authors: Toan Duc Bui, Manh Nguyen, Ngan Le, Khoa Luu

    Abstract: Image synthesis from corrupted contrasts increases the diversity of diagnostic information available for many neurological diseases. Recently the image-to-image translation has experienced significant levels of interest within medical research, beginning with the successful use of the Generative Adversarial Network (GAN) to the introduction of cyclic constraint extended to multiple domains. Howeve… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: Medical Image Computing and Computer Assisted Interventions

  41. arXiv:2009.09777  [pdf, other

    cs.SE cs.AI cs.PL

    TreeCaps: Tree-Based Capsule Networks for Source Code Processing

    Authors: Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang

    Abstract: Recently program learning techniques have been proposed to process source code based on syntactical structures (e.g., Abstract Syntax Trees) and/or semantic information (e.g., Dependency Graphs). Although graphs may be better at capturing various viewpoints of code semantics than trees, constructing graph inputs from code needs static code semantic analysis that may not be accurate and introduces… ▽ More

    Submitted 14 December, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

    Comments: Accepted at AAAI 2021

  42. arXiv:2009.02731  [pdf, other

    cs.SE cs.AI cs.LG cs.PL

    Self-Supervised Contrastive Learning for Code Retrieval and Summarization via Semantic-Preserving Transformations

    Authors: Nghi D. Q. Bui, Yijun Yu, Lingxiao Jiang

    Abstract: We propose Corder, a self-supervised contrastive learning framework for source code model. Corder is designed to alleviate the need of labeled data for code retrieval and code summarization tasks. The pre-trained model of Corder can be used in two ways: (1) it can produce vector representation of code which can be applied to code retrieval tasks that do not have labeled data; (2) it can be used in… ▽ More

    Submitted 23 May, 2021; v1 submitted 6 September, 2020; originally announced September 2020.

    Comments: Accepted at SIGIR 2021

  43. arXiv:2008.08976  [pdf, other

    cs.CV eess.IV

    Improving Text to Image Generation using Mode-seeking Function

    Authors: Naitik Bhise, Zhenfei Zhang, Tien D. Bui

    Abstract: Generative Adversarial Networks (GANs) have long been used to understand the semantic relationship between the text and image. However, there are problems with mode collapsing in the image generation that causes some preferred output modes. Our aim is to improve the training of the network by using a specialized mode-seeking loss function to avoid this issue. In the text to image synthesis, our lo… ▽ More

    Submitted 18 September, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: changes : changed the title of the research for submission to CVIU

  44. arXiv:2008.05250  [pdf, ps, other

    cs.AI math.NA

    Optimizing fire allocation in a NCW-type model

    Authors: Nam Hong Nguyen, My Anh Vu, Dinh Van Bui, Anh Ngoc Ta, Manh Duc Hy

    Abstract: In this paper, we introduce a non-linear Lanchester model of NCW-type and investigate an optimization problem for this model, where only the Red force is supplied by several supply agents. Optimal fire allocation of the Blue force is sought in the form of a piece-wise constant function of time. A threatening rate is computed for the Red force and each of its supply agents at the beginning of each… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: 6 pages on NCW-type model

  45. On the Generalizability of Neural Program Models with respect to Semantic-Preserving Program Transformations

    Authors: Md Rafiqul Islam Rabin, Nghi D. Q. Bui, Ke Wang, Yijun Yu, Lingxiao Jiang, Mohammad Amin Alipour

    Abstract: With the prevalence of publicly available source code repositories to train deep neural network models, neural program models can do well in source code analysis tasks such as predicting method names in given programs that cannot be easily done by traditional program analysis techniques. Although such neural program models have been tested on various existing datasets, the extent to which they gen… ▽ More

    Submitted 18 March, 2021; v1 submitted 31 July, 2020; originally announced August 2020.

    Comments: Information and Software Technology, IST Journal 2021, Elsevier. Related to arXiv:2004.07313

  46. arXiv:2007.02096  [pdf

    eess.IV cs.CV cs.LG

    Multi-Site Infant Brain Segmentation Algorithms: The iSeg-2019 Challenge

    Authors: Yue Sun, Kun Gao, Zhengwang Wu, Zhihao Lei, Ying Wei, Jun Ma, ** Yang, Xue Feng, Li Zhao, Trung Le Phan, Jitae Shin, Tao Zhong, Yu Zhang, Lequan Yu, Caizi Li, Ramesh Basnet, M. Omair Ahmad, M. N. S. Swamy, Wenao Ma, Qi Dou, Toan Duc Bui, Camilo Bermudez Noguera, Bennett Landman, Ian H. Gotlib, Kathryn L. Humphreys , et al. (8 additional authors not shown)

    Abstract: To better understand early brain growth patterns in health and disorder, it is critical to accurately segment infant brain magnetic resonance (MR) images into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF). Deep learning-based methods have achieved state-of-the-art performance; however, one of major limitations is that the learning-based methods may suffer from the multi-site i… ▽ More

    Submitted 11 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

    Journal ref: IEEE Transactions on Medical Imaging, 40(5), 1363-1376, 2021

  47. arXiv:2006.05468  [pdf, other

    stat.ML cs.LG

    Variational Auto-Regressive Gaussian Processes for Continual Learning

    Authors: Sanyam Kapoor, Theofanis Karaletsos, Thang D. Bui

    Abstract: Through sequential construction of posteriors on observing data online, Bayes' theorem provides a natural framework for continual learning. We develop Variational Auto-Regressive Gaussian Processes (VAR-GPs), a principled posterior updating mechanism to solve sequential tasks in continual learning. By relying on sparse inducing point approximations for scalable posteriors, we propose a novel auto-… ▽ More

    Submitted 12 June, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: International Conference on Machine Learning (ICML), 2021

  48. arXiv:2004.05085  [pdf, other

    cs.CV

    LIAAD: Lightweight Attentive Angular Distillation for Large-scale Age-Invariant Face Recognition

    Authors: Thanh-Dat Truong, Chi Nhan Duong, Kha Gia Quach, Ngan Le, Tien D. Bui, Khoa Luu

    Abstract: Disentangled representations have been commonly adopted to Age-invariant Face Recognition (AiFR) tasks. However, these methods have reached some limitations with (1) the requirement of large-scale face recognition (FR) training data with age labels, which is limited in practice; (2) heavy deep network architectures for high performance; and (3) their evaluations are usually taken place on age-rela… ▽ More

    Submitted 11 September, 2022; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:1905.10620

  49. arXiv:2002.04033  [pdf, other

    stat.ML cs.LG

    Hierarchical Gaussian Process Priors for Bayesian Neural Network Weights

    Authors: Theofanis Karaletsos, Thang D. Bui

    Abstract: Probabilistic neural networks are typically modeled with independent weight priors, which do not capture weight correlations in the prior and do not provide a parsimonious interface to express properties in function space. A desirable class of priors would represent weights compactly, capture correlations between weights, facilitate calibrated reasoning about uncertainty, and allow inclusion of pr… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 12 pages main paper, 13 pages appendix

  50. A Fast Template-based Approach to Automatically Identify Primary Text Content of a Web Page

    Authors: Dat Quoc Nguyen, Dai Quoc Nguyen, Son Bao Pham, The Duy Bui

    Abstract: Search engines have become an indispensable tool for browsing information on the Internet. The user, however, is often annoyed by redundant results from irrelevant Web pages. One reason is because search engines also look at non-informative blocks of Web pages such as advertisement, navigation links, etc. In this paper, we propose a fast algorithm called FastContentExtractor to automatically detec… ▽ More

    Submitted 26 November, 2019; originally announced November 2019.

    Comments: In Proceedings of the 2009 International Conference on Knowledge and Systems Engineering (KSE 2009)