-
Unraveling the Complex Structure of AGN-driven Outflows. VI. Strong Ionized Outflows in Type 1 AGNs and the Outflow Size-Luminosity Relation
Authors:
Changseok Kim,
Jong-Hak Woo,
Rongxin Luo,
Aeree Chung,
Junhyun Baek,
Huynh Anh N. Le,
Donghoon Son
Abstract:
We present spatially resolved gas kinematics, ionization, and energetics of 11 type 1 and 5 type 2 active galactic nuclei (AGNs) with strong ionized gas outflows at z $<0.3$ using Gemini Multi-Object Spectrograph Integral Field Unit (GMOS-IFU) data. We find a strongly blueshifted region in [OIII] velocity maps, representing an approaching cone in biconical outflows, and blueshifted and redshifted…
▽ More
We present spatially resolved gas kinematics, ionization, and energetics of 11 type 1 and 5 type 2 active galactic nuclei (AGNs) with strong ionized gas outflows at z $<0.3$ using Gemini Multi-Object Spectrograph Integral Field Unit (GMOS-IFU) data. We find a strongly blueshifted region in [OIII] velocity maps, representing an approaching cone in biconical outflows, and blueshifted and redshifted regions in H$α$ velocity maps, which show gravitationally rotating kinematics. AGN photoionization is dominant in the central region of most targets, and some of them also show ring-like structures of LINER or composite that surround the AGN-dominated center. Following our previous studies, we kinematically determine outflow sizes by the ratio between [OIII] and stellar velocity dispersion. Outflow sizes of type 1 AGNs follow the same kinematic outflow size-[OIII] luminosity relation obtained from the type 2 IFU sample in Kang & Woo and Luo (updated slope $0.29\pm0.04$), while they are limited to the central kpc scales, indicating the lack of global impact of outflows on the interstellar medium. Small mass outflow rates and large star formation rates of the combined sample support that there is no evidence of rapid star formation quenching by outflows, which is consistent with the delayed AGN feedback.
△ Less
Submitted 12 October, 2023; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Open-Fusion: Real-time Open-Vocabulary 3D Map** and Queryable Scene Representation
Authors:
Kashu Yamazaki,
Taisei Hanyu,
Khoa Vo,
Thang Pham,
Minh Tran,
Gianfranco Doretto,
Anh Nguyen,
Ngan Le
Abstract:
Precise 3D environmental map** is pivotal in robotics. Existing methods often rely on predefined concepts during training or are time-intensive when generating semantic maps. This paper presents Open-Fusion, a groundbreaking approach for real-time open-vocabulary 3D map** and queryable scene representation using RGB-D data. Open-Fusion harnesses the power of a pre-trained vision-language found…
▽ More
Precise 3D environmental map** is pivotal in robotics. Existing methods often rely on predefined concepts during training or are time-intensive when generating semantic maps. This paper presents Open-Fusion, a groundbreaking approach for real-time open-vocabulary 3D map** and queryable scene representation using RGB-D data. Open-Fusion harnesses the power of a pre-trained vision-language foundation model (VLFM) for open-set semantic comprehension and employs the Truncated Signed Distance Function (TSDF) for swift 3D scene reconstruction. By leveraging the VLFM, we extract region-based embeddings and their associated confidence maps. These are then integrated with 3D knowledge from TSDF using an enhanced Hungarian-based feature-matching mechanism. Notably, Open-Fusion delivers outstanding annotation-free 3D segmentation for open-vocabulary without necessitating additional 3D training. Benchmark tests on the ScanNet dataset against leading zero-shot methods highlight Open-Fusion's superiority. Furthermore, it seamlessly combines the strengths of region-based VLFM and TSDF, facilitating real-time 3D scene comprehension that includes object concepts and open-world semantics. We encourage the readers to view the demos on our project page: https://uark-aicv.github.io/OpenFusion
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
CORec-Cri: How collaborative and social technologies can help to contextualize crises?
Authors:
Ngoc Luyen Le,
**feng Zhong,
Elsa Negre,
Marie-Hélène Abel
Abstract:
Crisis situations can present complex and multifaceted challenges, often requiring the involvement of multiple organizations and stakeholders with varying areas of expertise, responsibilities, and resources. Acquiring accurate and timely information about impacted areas is crucial to effectively respond to these crises. In this paper, we investigate how collaborative and social technologies help t…
▽ More
Crisis situations can present complex and multifaceted challenges, often requiring the involvement of multiple organizations and stakeholders with varying areas of expertise, responsibilities, and resources. Acquiring accurate and timely information about impacted areas is crucial to effectively respond to these crises. In this paper, we investigate how collaborative and social technologies help to contextualize crises, including identifying impacted areas and real-time needs. To this end, we define CORec-Cri (Contextulized Ontology-based Recommender system for crisis management) based on existing work. Our motivation for this approach is two-fold: first, effective collaboration among stakeholders is essential for efficient and coordinated crisis response; second, social computing facilitates interaction, information flow, and collaboration among stakeholders. We detail the key components of our system design, highlighting its potential to support decision-making, resource allocation, and communication among stakeholders. Finally, we provide examples of how our system can be applied to contextualize crises to improve crisis management.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses
Authors:
Trong Thang Pham,
Jacob Brecheisen,
Anh Nguyen,
Hien Nguyen,
Ngan Le
Abstract:
In the field of chest X-ray (CXR) diagnosis, existing works often focus solely on determining where a radiologist looks, typically through tasks such as detection, segmentation, or classification. However, these approaches are often designed as black-box models, lacking interpretability. In this paper, we introduce Interpretable Artificial Intelligence (I-AI) a novel and unified controllable inter…
▽ More
In the field of chest X-ray (CXR) diagnosis, existing works often focus solely on determining where a radiologist looks, typically through tasks such as detection, segmentation, or classification. However, these approaches are often designed as black-box models, lacking interpretability. In this paper, we introduce Interpretable Artificial Intelligence (I-AI) a novel and unified controllable interpretable pipeline for decoding the intense focus of radiologists in CXR diagnosis. Our I-AI addresses three key questions: where a radiologist looks, how long they focus on specific areas, and what findings they diagnose. By capturing the intensity of the radiologist's gaze, we provide a unified solution that offers insights into the cognitive process underlying radiological interpretation. Unlike current methods that rely on black-box machine learning models, which can be prone to extracting erroneous information from the entire input image during the diagnosis process, we tackle this issue by effectively masking out irrelevant information. Our proposed I-AI leverages a vision-language model, allowing for precise control over the interpretation process while ensuring the exclusion of irrelevant features. To train our I-AI model, we utilize an eye gaze dataset to extract anatomical gaze information and generate ground truth heatmaps. Through extensive experimentation, we demonstrate the efficacy of our method. We showcase that the attention heatmaps, designed to mimic radiologists' focus, encode sufficient and relevant information, enabling accurate classification tasks using only a portion of CXR. The code, checkpoints, and data are at https://github.com/UARK-AICV/IAI
△ Less
Submitted 9 December, 2023; v1 submitted 24 September, 2023;
originally announced September 2023.
-
Evaluating the diversity and utility of materials proposed by generative models
Authors:
Alexander New,
Michael Pekala,
Elizabeth A. Pogue,
Nam Q. Le,
Janna Domenico,
Christine D. Piatko,
Christopher D. Stiles
Abstract:
Generative machine learning models can use data generated by scientific modeling to create large quantities of novel material structures. Here, we assess how one state-of-the-art generative model, the physics-guided crystal generation model (PGCGM), can be used as part of the inverse design process. We show that the default PGCGM's input space is not smooth with respect to parameter variation, mak…
▽ More
Generative machine learning models can use data generated by scientific modeling to create large quantities of novel material structures. Here, we assess how one state-of-the-art generative model, the physics-guided crystal generation model (PGCGM), can be used as part of the inverse design process. We show that the default PGCGM's input space is not smooth with respect to parameter variation, making material optimization difficult and limited. We also demonstrate that most generated structures are predicted to be thermodynamically unstable by a separate property-prediction model, partially due to out-of-domain data challenges. Our findings suggest how generative models might be improved to enable better inverse design.
△ Less
Submitted 9 August, 2023;
originally announced September 2023.
-
Open-Vocabulary Affordance Detection using Knowledge Distillation and Text-Point Correlation
Authors:
Tuan Van Vo,
Minh Nhat Vu,
Baoru Huang,
Toan Nguyen,
Ngan Le,
Thieu Vo,
Anh Nguyen
Abstract:
Affordance detection presents intricate challenges and has a wide range of robotic applications. Previous works have faced limitations such as the complexities of 3D object shapes, the wide range of potential affordances on real-world objects, and the lack of open-vocabulary support for affordance understanding. In this paper, we introduce a new open-vocabulary affordance detection method in 3D po…
▽ More
Affordance detection presents intricate challenges and has a wide range of robotic applications. Previous works have faced limitations such as the complexities of 3D object shapes, the wide range of potential affordances on real-world objects, and the lack of open-vocabulary support for affordance understanding. In this paper, we introduce a new open-vocabulary affordance detection method in 3D point clouds, leveraging knowledge distillation and text-point correlation. Our approach employs pre-trained 3D models through knowledge distillation to enhance feature extraction and semantic understanding in 3D point clouds. We further introduce a new text-point correlation method to learn the semantic links between point cloud features and open-vocabulary labels. The intensive experiments show that our approach outperforms previous works and adapts to new affordance labels and unseen objects. Notably, our method achieves the improvement of 7.96% mIOU score compared to the baselines. Furthermore, it offers real-time inference which is well-suitable for robotic manipulation applications.
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Language-Conditioned Affordance-Pose Detection in 3D Point Clouds
Authors:
Toan Nguyen,
Minh Nhat Vu,
Baoru Huang,
Tuan Van Vo,
Vy Truong,
Ngan Le,
Thieu Vo,
Bac Le,
Anh Nguyen
Abstract:
Affordance detection and pose estimation are of great importance in many robotic applications. Their combination helps the robot gain an enhanced manipulation capability, in which the generated pose can facilitate the corresponding affordance task. Previous methods for affodance-pose joint learning are limited to a predefined set of affordances, thus limiting the adaptability of robots in real-wor…
▽ More
Affordance detection and pose estimation are of great importance in many robotic applications. Their combination helps the robot gain an enhanced manipulation capability, in which the generated pose can facilitate the corresponding affordance task. Previous methods for affodance-pose joint learning are limited to a predefined set of affordances, thus limiting the adaptability of robots in real-world environments. In this paper, we propose a new method for language-conditioned affordance-pose joint learning in 3D point clouds. Given a 3D point cloud object, our method detects the affordance region and generates appropriate 6-DoF poses for any unconstrained affordance label. Our method consists of an open-vocabulary affordance detection branch and a language-guided diffusion model that generates 6-DoF poses based on the affordance text. We also introduce a new high-quality dataset for the task of language-driven affordance-pose joint learning. Intensive experimental results demonstrate that our proposed method works effectively on a wide range of open-vocabulary affordances and outperforms other baselines by a large margin. In addition, we illustrate the usefulness of our method in real-world robotic applications. Our code and dataset are publicly available at https://3DAPNet.github.io
△ Less
Submitted 19 September, 2023;
originally announced September 2023.
-
Averages of completely multiplicative functions over the Gaussian integers -- a dynamical approach
Authors:
Sebastián Donoso,
Anh N. Le,
Joel Moreira,
Wenbo Sun
Abstract:
We prove a pointwise convergence result for additive ergodic averages associated with certain multiplicative actions of the Gaussian integers. We derive several applications in dynamics and number theory, including:
(i) Wirsing's theorem for Gaussian integers: if $f\colon \mathbb{G} \to \mathbb{R}$ is a bounded completely multiplicative function, then the following limit exists:…
▽ More
We prove a pointwise convergence result for additive ergodic averages associated with certain multiplicative actions of the Gaussian integers. We derive several applications in dynamics and number theory, including:
(i) Wirsing's theorem for Gaussian integers: if $f\colon \mathbb{G} \to \mathbb{R}$ is a bounded completely multiplicative function, then the following limit exists: $$\lim_{N \to \infty} \frac{1}{N^2} \sum_{1 \leq m, n \leq N} f(m + {\rm i} n).$$ (ii) An answer to a special case of a question of Frantzikinakis and Host: for any completely multiplicative real-valued function $f: \mathbb{N} \to \mathbb{R}$, the following limit exists: $$\lim_{N \to \infty} \frac{1}{N^2} \sum_{1 \leq m, n \leq N} f(m^2 + n^2).$$ (iii) A variant of a theorem of Bergelson and Richter on ergodic averages along the $Ω$ function: if $(X,T)$ is a uniquely ergodic system with unique invariant measure $μ$, then for any $x\in X$ and $f\in C(X)$, $$\lim_{N\to\infty}\frac{1}{N^2}\sum_{1 \leq m, n \leq N} f(T^{Ω(m^2 + n^2)}x)=\int_Xf \ dμ.$$
△ Less
Submitted 6 March, 2024; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Towards Robust Natural-Looking Mammography Lesion Synthesis on Ipsilateral Dual-Views Breast Cancer Analysis
Authors:
Thanh-Huy Nguyen,
Quang Hien Kha,
Thai Ngoc Toan Truong,
Ba Thinh Lam,
Ba Hung Ngo,
Quang Vinh Dinh,
Nguyen Quoc Khanh Le
Abstract:
In recent years, many mammographic image analysis methods have been introduced for improving cancer classification tasks. Two major issues of mammogram classification tasks are leveraging multi-view mammographic information and class-imbalance handling. In the first problem, many multi-view methods have been released for concatenating features of two or more views for the training and inference st…
▽ More
In recent years, many mammographic image analysis methods have been introduced for improving cancer classification tasks. Two major issues of mammogram classification tasks are leveraging multi-view mammographic information and class-imbalance handling. In the first problem, many multi-view methods have been released for concatenating features of two or more views for the training and inference stage. Having said that, most multi-view existing methods are not explainable in the meaning of feature fusion, and treat many views equally for diagnosing. Our work aims to propose a simple but novel method for enhancing examined view (main view) by leveraging low-level feature information from the auxiliary view (ipsilateral view) before learning the high-level feature that contains the cancerous features. For the second issue, we also propose a simple but novel malignant mammogram synthesis framework for upsampling minor class samples. Our easy-to-implement and no-training framework has eliminated the current limitation of the CutMix algorithm which is unreliable synthesized images with random pasted patches, hard-contour problems, and domain shift problems. Our results on VinDr-Mammo and CMMD datasets show the effectiveness of our two new frameworks for both multi-view training and synthesizing mammographic images, outperforming the previous conventional methods in our experimental settings.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.
-
SAM3D: Segment Anything Model in Volumetric Medical Images
Authors:
Nhat-Tan Bui,
Dinh-Hieu Hoang,
Minh-Triet Tran,
Gianfranco Doretto,
Donald Adjeroh,
Brijesh Patel,
Arabinda Choudhary,
Ngan Le
Abstract:
Image segmentation remains a pivotal component in medical image analysis, aiding in the extraction of critical information for precise diagnostic practices. With the advent of deep learning, automated image segmentation methods have risen to prominence, showcasing exceptional proficiency in processing medical imagery. Motivated by the Segment Anything Model (SAM)-a foundational model renowned for…
▽ More
Image segmentation remains a pivotal component in medical image analysis, aiding in the extraction of critical information for precise diagnostic practices. With the advent of deep learning, automated image segmentation methods have risen to prominence, showcasing exceptional proficiency in processing medical imagery. Motivated by the Segment Anything Model (SAM)-a foundational model renowned for its remarkable precision and robust generalization capabilities in segmenting 2D natural images-we introduce SAM3D, an innovative adaptation tailored for 3D volumetric medical image analysis. Unlike current SAM-based methods that segment volumetric data by converting the volume into separate 2D slices for individual analysis, our SAM3D model processes the entire 3D volume image in a unified approach. Extensive experiments are conducted on multiple medical image datasets to demonstrate that our network attains competitive results compared with other state-of-the-art methods in 3D medical segmentation tasks while being significantly efficient in terms of parameters. Code and checkpoints are available at https://github.com/UARK-AICV/SAM3D.
△ Less
Submitted 5 March, 2024; v1 submitted 7 September, 2023;
originally announced September 2023.
-
MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation
Authors:
Nhat-Tan Bui,
Dinh-Hieu Hoang,
Quang-Thuc Nguyen,
Minh-Triet Tran,
Ngan Le
Abstract:
Efficient polyp segmentation in healthcare plays a critical role in enabling early diagnosis of colorectal cancer. However, the segmentation of polyps presents numerous challenges, including the intricate distribution of backgrounds, variations in polyp sizes and shapes, and indistinct boundaries. Defining the boundary between the foreground (i.e. polyp itself) and the background (surrounding tiss…
▽ More
Efficient polyp segmentation in healthcare plays a critical role in enabling early diagnosis of colorectal cancer. However, the segmentation of polyps presents numerous challenges, including the intricate distribution of backgrounds, variations in polyp sizes and shapes, and indistinct boundaries. Defining the boundary between the foreground (i.e. polyp itself) and the background (surrounding tissue) is difficult. To mitigate these challenges, we propose Multi-Scale Edge-Guided Attention Network (MEGANet) tailored specifically for polyp segmentation within colonoscopy images. This network draws inspiration from the fusion of a classical edge detection technique with an attention mechanism. By combining these techniques, MEGANet effectively preserves high-frequency information, notably edges and boundaries, which tend to erode as neural networks deepen. MEGANet is designed as an end-to-end framework, encompassing three key modules: an encoder, which is responsible for capturing and abstracting the features from the input image, a decoder, which focuses on salient features, and the Edge-Guided Attention module (EGA) that employs the Laplacian Operator to accentuate polyp boundaries. Extensive experiments, both qualitative and quantitative, on five benchmark datasets, demonstrate that our MEGANet outperforms other existing SOTA methods under six evaluation metrics. Our code is available at https://github.com/UARK-AICV/MEGANet.
△ Less
Submitted 4 November, 2023; v1 submitted 6 September, 2023;
originally announced September 2023.
-
The twisted Gan-Gross-Prasad problem for finite classical groups
Authors:
Nhat Hoang Le
Abstract:
In this paper, we study the twisted Gan-Gross-Prasad problem for classical groups over finite fields. We formulate a multiplicity formula for Deligne-Lusztig characters and give a complete answer for cuspidal representations arising from anisotropic torus modulo the center.
In this paper, we study the twisted Gan-Gross-Prasad problem for classical groups over finite fields. We formulate a multiplicity formula for Deligne-Lusztig characters and give a complete answer for cuspidal representations arising from anisotropic torus modulo the center.
△ Less
Submitted 3 September, 2023;
originally announced September 2023.
-
A preliminary study of photometric redshifts based on the Wide Field Survey Telescope
Authors:
Yu Liu,
Xiao-zhi Lin,
Yong-quan Xue,
Huynh Anh N. Le
Abstract:
The Wide Field Survey Telescope (WFST) is a dedicated time-domain multi-band ($u$, $g$, $r$, $i$, and $z$) photometric survey facility under construction. In this paper, we present a preliminary study that assesses the quality of photometric redshifts based on WFST by utilizing mock observations derived with the galaxy catalog in the COSMOS/UltraVISTA field. We apply the template fitting technique…
▽ More
The Wide Field Survey Telescope (WFST) is a dedicated time-domain multi-band ($u$, $g$, $r$, $i$, and $z$) photometric survey facility under construction. In this paper, we present a preliminary study that assesses the quality of photometric redshifts based on WFST by utilizing mock observations derived with the galaxy catalog in the COSMOS/UltraVISTA field. We apply the template fitting technique to estimate photometric redshifts by using the ZEBRA photometric-redshift code and adopting a modified set of adaptive templates. We evaluate the bias (median relative offset between the output photometric redshifts and input redshifts), normalized median absolute deviation ($σ_{\rm NMAD}$) and outlier fraction ($f_{\rm outlier}$) of photometric redshifts in two typical WFST observational cases, the single 30-second exposure observations (hereafter shallow mode) and co-added 50-minute exposure observations (hereafter deep mode). We find bias$\la0.006$, $σ_{\rm NMAD}\la0.03$, and $f_{\rm outlier}\la5\%$ in the shallow mode and bias$\approx 0.005$, $σ_{\rm NMAD}\approx 0.06$, and $f_{\rm outlier}\approx 17\%$--$27\%$ in the deep mode, respectively, under various lunar phases. Combining the WFST mock observational data with that from the upcoming CSST and Euclid surveys, we demonstrate that the $z_{\rm phot}$ results can be significantly improved, with $f_{\rm outlier}\approx 1\%$ and $σ_{\rm NMAD}\approx 0.02$.
△ Less
Submitted 1 September, 2023;
originally announced September 2023.
-
Designing a User Contextual Profile Ontology: A Focus on the Vehicle Sales Domain
Authors:
Ngoc Luyen Le,
Marie-Hélène Abel,
Philippe Gouspillou
Abstract:
In the digital age, it is crucial to understand and tailor experiences for users interacting with systems and applications. This requires the creation of user contextual profiles that combine user profiles with contextual information. However, there is a lack of research on the integration of contextual information with different user profiles. This study aims to address this gap by designing a us…
▽ More
In the digital age, it is crucial to understand and tailor experiences for users interacting with systems and applications. This requires the creation of user contextual profiles that combine user profiles with contextual information. However, there is a lack of research on the integration of contextual information with different user profiles. This study aims to address this gap by designing a user contextual profile ontology that considers both user profiles and contextual information on each profile. Specifically, we present a design and development of the user contextual profile ontology with a focus on the vehicle sales domain. Our designed ontology serves as a structural foundation for standardizing the representation of user profiles and contextual information, enhancing the system's ability to capture user preferences and contextual information of the user accurately. Moreover, we illustrate a case study using the User Contextual Profile Ontology in generating personalized recommendations for vehicle sales domain.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
A Constraint-based Recommender System via RDF Knowledge Graphs
Authors:
Ngoc Luyen Le,
Marie-Hélène Abel,
Philippe Gouspillou
Abstract:
Knowledge graphs, represented in RDF, are able to model entities and their relations by means of ontologies. The use of knowledge graphs for information modeling has attracted interest in recent years. In recommender systems, items and users can be mapped and integrated into the knowledge graph, which can represent more links and relationships between users and items. Constraint-based recommender…
▽ More
Knowledge graphs, represented in RDF, are able to model entities and their relations by means of ontologies. The use of knowledge graphs for information modeling has attracted interest in recent years. In recommender systems, items and users can be mapped and integrated into the knowledge graph, which can represent more links and relationships between users and items. Constraint-based recommender systems are based on the idea of explicitly exploiting deep recommendation knowledge through constraints to identify relevant recommendations. When combined with knowledge graphs, a constraint-based recommender system gains several benefits in terms of constraint sets. In this paper, we investigate and propose the construction of a constraint-based recommender system via RDF knowledge graphs applied to the vehicle purchase/sale domain. The results of our experiments show that the proposed approach is able to efficiently identify recommendations in accordance with user preferences.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
A Personalized Recommender System Based-on Knowledge Graph Embeddings
Authors:
Ngoc Luyen Le,
Marie-Hélène Abel,
Philippe Gouspillou
Abstract:
Knowledge graphs have proven to be effective for modeling entities and their relationships through the use of ontologies. The recent emergence in interest for using knowledge graphs as a form of information modeling has led to their increased adoption in recommender systems. By incorporating users and items into the knowledge graph, these systems can better capture the implicit connections between…
▽ More
Knowledge graphs have proven to be effective for modeling entities and their relationships through the use of ontologies. The recent emergence in interest for using knowledge graphs as a form of information modeling has led to their increased adoption in recommender systems. By incorporating users and items into the knowledge graph, these systems can better capture the implicit connections between them and provide more accurate recommendations. In this paper, we investigate and propose the construction of a personalized recommender system via knowledge graphs embedding applied to the vehicle purchase/sale domain. The results of our experimentation demonstrate the efficacy of the proposed method in providing relevant recommendations that are consistent with individual users.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
Improving Semantic Similarity Measure Within a Recommender System Based-on RDF Graphs
Authors:
Ngoc Luyen Le,
Marie-Hélène Abel,
Philippe Gouspillou
Abstract:
In today's era of information explosion, more users are becoming more reliant upon recommender systems to have better advice, suggestions, or inspire them. The measure of the semantic relatedness or likeness between terms, words, or text data plays an important role in different applications dealing with textual data, as in a recommender system. Over the past few years, many ontologies have been d…
▽ More
In today's era of information explosion, more users are becoming more reliant upon recommender systems to have better advice, suggestions, or inspire them. The measure of the semantic relatedness or likeness between terms, words, or text data plays an important role in different applications dealing with textual data, as in a recommender system. Over the past few years, many ontologies have been developed and used as a form of structured representation of knowledge bases for information systems. The measure of semantic similarity from ontology has developed by several methods. In this paper, we propose and carry on an approach for the improvement of semantic similarity calculations within a recommender system based-on RDF graphs.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
ChatGPT is Good but Bing Chat is Better for Vietnamese Students
Authors:
Xuan-Quy Dao,
Ngoc-Bich Le
Abstract:
This study examines the efficacy of two SOTA large language models (LLMs), namely ChatGPT and Microsoft Bing Chat (BingChat), in catering to the needs of Vietnamese students. Although ChatGPT exhibits proficiency in multiple disciplines, Bing Chat emerges as the more advantageous option. We conduct a comparative analysis of their academic achievements in various disciplines, encompassing mathemati…
▽ More
This study examines the efficacy of two SOTA large language models (LLMs), namely ChatGPT and Microsoft Bing Chat (BingChat), in catering to the needs of Vietnamese students. Although ChatGPT exhibits proficiency in multiple disciplines, Bing Chat emerges as the more advantageous option. We conduct a comparative analysis of their academic achievements in various disciplines, encompassing mathematics, literature, English language, physics, chemistry, biology, history, geography, and civic education. The results of our study suggest that BingChat demonstrates superior performance compared to ChatGPT across a wide range of subjects, with the exception of literature, where ChatGPT exhibits better performance. Additionally, BingChat utilizes the more advanced GPT-4 technology in contrast to ChatGPT, which is built upon GPT-3.5. This allows BingChat to improve to comprehension, reasoning and generation of creative and informative text. Moreover, the fact that BingChat is accessible in Vietnam and its integration of hyperlinks and citations within responses serve to reinforce its superiority. In our analysis, it is evident that while ChatGPT exhibits praiseworthy qualities, BingChat presents a more apdated solutions for Vietnamese students.
△ Less
Submitted 29 July, 2023; v1 submitted 17 July, 2023;
originally announced July 2023.
-
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey
Authors:
Salman Mohamadi,
Ghulam Mujtaba,
Ngan Le,
Gianfranco Doretto,
Donald A. Adjeroh
Abstract:
ChatGPT is a large language model (LLM) created by OpenAI that has been carefully trained on a large amount of data. It has revolutionized the field of natural language processing (NLP) and has pushed the boundaries of LLM capabilities. ChatGPT has played a pivotal role in enabling widespread public interaction with generative artificial intelligence (GAI) on a large scale. It has also sparked res…
▽ More
ChatGPT is a large language model (LLM) created by OpenAI that has been carefully trained on a large amount of data. It has revolutionized the field of natural language processing (NLP) and has pushed the boundaries of LLM capabilities. ChatGPT has played a pivotal role in enabling widespread public interaction with generative artificial intelligence (GAI) on a large scale. It has also sparked research interest in develo** similar technologies and investigating their applications and implications. In this paper, our primary goal is to provide a concise survey on the current lines of research on ChatGPT and its evolution. We considered both the glass box and black box views of ChatGPT, encompassing the components and foundational elements of the technology, as well as its applications, impacts, and implications. The glass box approach focuses on understanding the inner workings of the technology, and the black box approach embraces it as a complex system, and thus examines its inputs, outputs, and effects. This paves the way for a comprehensive exploration of the technology and provides a road map for further research and experimentation. We also lay out essential foundational literature on LLMs and GAI in general and their connection with ChatGPT. This overview sheds light on existing and missing research lines in the emerging field of LLMs, benefiting both public users and developers. Furthermore, the paper delves into the broad spectrum of applications and significant concerns in fields such as education, research, healthcare, finance, etc.
△ Less
Submitted 15 July, 2023; v1 submitted 9 July, 2023;
originally announced July 2023.
-
Advancing Wound Filling Extraction on 3D Faces: Auto-Segmentation and Wound Face Regeneration Approach
Authors:
Duong Q. Nguyen,
Thinh D. Le,
Phuong D. Nguyen,
Nga T. K. Le,
H. Nguyen-Xuan
Abstract:
Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation…
▽ More
Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation with different loss functions. To achieve accurate segmentation, we conducted thorough experiments and selected a high-performing model from the trained models. The selected model demonstrates exceptional segmentation performance for complex 3D facial wounds. Furthermore, based on the segmentation model, we propose an improved approach for extracting 3D facial wound fillers and compare it to the results of the previous study. Our method achieved a remarkable accuracy of 0.9999986\% on the test suite, surpassing the performance of the previous method. From this result, we use 3D printing technology to illustrate the shape of the wound filling. The outcomes of this study have significant implications for physicians involved in preoperative planning and intervention design. By automating facial wound segmentation and improving the accuracy of wound-filling extraction, our approach can assist in carefully assessing and optimizing interventions, leading to enhanced patient outcomes. Additionally, it contributes to advancing facial reconstruction techniques by utilizing machine learning and 3D bioprinting for printing skin tissue implants. Our source code is available at \url{https://github.com/SIMOGroup/WoundFilling3D}.
△ Less
Submitted 12 July, 2023; v1 submitted 4 July, 2023;
originally announced July 2023.
-
Soft Grip**: Specifying for Trustworthiness
Authors:
Dhaminda B. Abeywickrama,
Nguyen Hao Le,
Greg Chance,
Peter D. Winter,
Arianna Manzini,
Alix J. Partridge,
Jonathan Ives,
John Downer,
Graham Deacon,
Jonathan Rossiter,
Kerstin Eder,
Shane Windsor
Abstract:
Soft robotics is an emerging technology in which engineers create flexible devices for use in a variety of applications. In order to advance the wide adoption of soft robots, ensuring their trustworthiness is essential; if soft robots are not trusted, they will not be used to their full potential. In order to demonstrate trustworthiness, a specification needs to be formulated to define what is tru…
▽ More
Soft robotics is an emerging technology in which engineers create flexible devices for use in a variety of applications. In order to advance the wide adoption of soft robots, ensuring their trustworthiness is essential; if soft robots are not trusted, they will not be used to their full potential. In order to demonstrate trustworthiness, a specification needs to be formulated to define what is trustworthy. However, even for soft robotic grippers, which is one of the most mature areas in soft robotics, the soft robotics community has so far given very little attention to formulating specifications. In this work, we discuss the importance of develo** specifications during development of soft robotic systems, and present an extensive example specification for a soft gripper for pick-and-place tasks for grocery items. The proposed specification covers both functional and non-functional requirements, such as reliability, safety, adaptability, predictability, ethics, and regulations. We also highlight the need to promote verifiability as a first-class objective in the design of a soft gripper.
△ Less
Submitted 30 October, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
The Seoul National University AGN Monitoring Project IV: H$α$ reverberation map** of 6 AGNs and the H$α$ Size-Luminosity Relation
Authors:
Ho** Cho,
Jong-Hak Woo,
Shu Wang,
Donghoon Son,
Jae** Shin,
Suvendu Rakshit,
Aaron J. Barth,
Vardha N. Bennert,
Elena Gallo,
Edmund Hodges-Kluck,
Tommaso Treu,
Hyun-** Bae,
Wan** Cho,
Adi Foord,
Jaehyuk Geum,
Yashashree Jadhav,
Yiseul Jeon,
Kyle M. Kabasares,
Daeun Kang,
Wonseok Kang,
Changseok Kim,
Donghwa Kim,
Min** Kim,
Taewoo Kim,
Huynh Anh N. Le
, et al. (7 additional authors not shown)
Abstract:
The broad line region (BLR) size-luminosity relation has paramount importance for estimating the mass of black holes in active galactic nuclei (AGNs). Traditionally, the size of the H$β$ BLR is often estimated from the optical continuum luminosity at 5100\angstrom{} , while the size of the H$α$ BLR and its correlation with the luminosity is much less constrained. As a part of the Seoul National Un…
▽ More
The broad line region (BLR) size-luminosity relation has paramount importance for estimating the mass of black holes in active galactic nuclei (AGNs). Traditionally, the size of the H$β$ BLR is often estimated from the optical continuum luminosity at 5100\angstrom{} , while the size of the H$α$ BLR and its correlation with the luminosity is much less constrained. As a part of the Seoul National University AGN Monitoring Project (SAMP) which provides six-year photometric and spectroscopic monitoring data, we present our measurements of the H$α$ lags of 6 high-luminosity AGNs. Combined with the measurements for 42 AGNs from the literature, we derive the size-luminosity relations of H$α$ BLR against broad H$α$ and 5100\angstrom{} continuum luminosities. We find the slope of the relations to be $0.61\pm0.04$ and $0.59\pm0.04$, respectively, which are consistent with the \hb{} size-luminosity relation. Moreover, we find a linear relation between the 5100\angstrom{} continuum luminosity and the broad H$α$ luminosity across 7 orders of magnitude. Using these results, we propose a new virial mass estimator based on the H$α$ broad emission line, finding that the previous mass estimates based on the scaling relations in the literature are overestimated by up to 0.7 dex at masses lower than $10^7$~M$_{\odot}$.
△ Less
Submitted 29 June, 2023;
originally announced June 2023.
-
Nanoextraction from a flow of a highly diluted solution for much-improved sensitivity in offline chemical detection and quantification
Authors:
Hongyan Wu,
Quynh Nhu Le,
Binglin Zeng,
Xuehua Zhang
Abstract:
Preconcentration of the target compound is a critical step that ensures the accuracy of the subsequent chemical analysis. In this work, we present a straightforward yet effective liquid-liquid extraction approach based on surface nanodroplets (i.e., nanoextraction) for offline analysis of highly diluted sample solutions. The extraction and sample collection were streamlined in a 3-m microcapillary…
▽ More
Preconcentration of the target compound is a critical step that ensures the accuracy of the subsequent chemical analysis. In this work, we present a straightforward yet effective liquid-liquid extraction approach based on surface nanodroplets (i.e., nanoextraction) for offline analysis of highly diluted sample solutions. The extraction and sample collection were streamlined in a 3-m microcapillary tube. The concentration of the target analyte in surface nanodroplets was significantly increased compared to the concentration in the sample solution, reaching several orders of magnitudes. A limit of detection (LOD) was decreased by a factor of $\sim 10^3$ for an organic model compound in Fourier-transform infrared spectroscopy (FTIR) measurements and $\sim 10^5$ for a model fluorescent dye in fluorescence detection. The quantitative analysis of the organic compound was also achieved in a wide concentration region from $10^{-3}$ M to $10^{-4}$ M. The total volume of surface nanodroplets can be manipulated to further enhance extraction efficiency, according to the principle that governs droplet formation by solvent exchange. Additionally, our method exhibited significantly improved sensitivity compared to traditional dispersive liquid-liquid microextraction (DLLME). The LOD of the fluorescent dye and the organic model compound obtained with DLLME was 3 orders of magnitude and 20 times higher than the LOD achieved through nanoextraction approach. The nanoextraction developed in this work can be applied to preconcentrate multi-compounds from river water samples, without clear interference from each other. This can further extend its applicability for the detection and quantification of target analytes in complex aqueous samples by common analytical instruments.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
On a Conjecture of Gezmis and Pellarin
Authors:
Khac Nhuan Le,
Kien Huu Nguyen
Abstract:
In 2022, Gezmis and Pellarin introduced and studied the concept of trivial multiple zeta values, along with a map from the vector space spanned by these values to the vector space spanned by Thakur's multiple zeta values. Their construction allows us to generate some linear relations among the latter values using the former. In our work, we determine the structure of the kernel of the aforemention…
▽ More
In 2022, Gezmis and Pellarin introduced and studied the concept of trivial multiple zeta values, along with a map from the vector space spanned by these values to the vector space spanned by Thakur's multiple zeta values. Their construction allows us to generate some linear relations among the latter values using the former. In our work, we determine the structure of the kernel of the aforementioned map. As a consequence, we give an answer to a conjecture proposed by Gezmis and Pellarin regarding the injectivity of this specific map.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
Numerical analysis of the stochastic Stefan problem
Authors:
Jerome Droniou,
Muhammad Awais Khan,
Kim Ngan Le
Abstract:
The gradient discretisation method (GDM) -- a generic framework encompassing many numerical methods -- is studied for a general stochastic Stefan problem with multiplicative noise. The convergence of the numerical solutions is proved by compactness method using discrete functional analysis tools, Skorohod theorem and the martingale representation theorem. The generic convergence results establishe…
▽ More
The gradient discretisation method (GDM) -- a generic framework encompassing many numerical methods -- is studied for a general stochastic Stefan problem with multiplicative noise. The convergence of the numerical solutions is proved by compactness method using discrete functional analysis tools, Skorohod theorem and the martingale representation theorem. The generic convergence results established in the GDM framework are applicable to a range of different numerical methods, including for example mass-lumped finite elements, but also some finite volume methods, mimetic methods, lowest-order virtual element methods, etc. Theoretical results are complemented by numerical tests based on two methods that fit in GDM framework.
△ Less
Submitted 26 June, 2023; v1 submitted 22 June, 2023;
originally announced June 2023.
-
Can ChatGPT pass the Vietnamese National High School Graduation Examination?
Authors:
Xuan-Quy Dao,
Ngoc-Bich Le,
Xuan-Dung Phan,
Bac-Bien Ngo
Abstract:
This research article highlights the potential of AI-powered chatbots in education and presents the results of using ChatGPT, a large language model, to complete the Vietnamese National High School Graduation Examination (VNHSGE). The study dataset included 30 essays in the literature test case and 1,700 multiple-choice questions designed for other subjects. The results showed that ChatGPT was abl…
▽ More
This research article highlights the potential of AI-powered chatbots in education and presents the results of using ChatGPT, a large language model, to complete the Vietnamese National High School Graduation Examination (VNHSGE). The study dataset included 30 essays in the literature test case and 1,700 multiple-choice questions designed for other subjects. The results showed that ChatGPT was able to pass the examination with an average score of 6-7, demonstrating the technology's potential to revolutionize the educational landscape. The analysis of ChatGPT performance revealed its proficiency in a range of subjects, including mathematics, English, physics, chemistry, biology, history, geography, civic education, and literature, which suggests its potential to provide effective support for learners. However, further research is needed to assess ChatGPT performance on more complex exam questions and its potential to support learners in different contexts. As technology continues to evolve and improve, we can expect to see the use of AI tools like ChatGPT become increasingly common in educational settings, ultimately enhancing the educational experience for both students and educators.
△ Less
Submitted 10 July, 2023; v1 submitted 15 June, 2023;
originally announced June 2023.
-
AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation
Authors:
Kashu Yamazaki,
Taisei Hanyu,
Minh Tran,
Adrian de Luis,
Roy McCann,
Haitao Liao,
Chase Rainwater,
Meredith Adkins,
Jackson Cothren,
Ngan Le
Abstract:
Aerial Image Segmentation is a top-down perspective semantic segmentation and has several challenging characteristics such as strong imbalance in the foreground-background distribution, complex background, intra-class heterogeneity, inter-class homogeneity, and tiny objects. To handle these problems, we inherit the advantages of Transformers and propose AerialFormer, which unifies Transformers at…
▽ More
Aerial Image Segmentation is a top-down perspective semantic segmentation and has several challenging characteristics such as strong imbalance in the foreground-background distribution, complex background, intra-class heterogeneity, inter-class homogeneity, and tiny objects. To handle these problems, we inherit the advantages of Transformers and propose AerialFormer, which unifies Transformers at the contracting path with lightweight Multi-Dilated Convolutional Neural Networks (MD-CNNs) at the expanding path. Our AerialFormer is designed as a hierarchical structure, in which Transformer encoder outputs multi-scale features and MD-CNNs decoder aggregates information from the multi-scales. Thus, it takes both local and global contexts into consideration to render powerful representations and high-resolution segmentation. We have benchmarked AerialFormer on three common datasets including iSAID, LoveDA, and Potsdam. Comprehensive experiments and extensive ablation studies show that our proposed AerialFormer outperforms previous state-of-the-art methods with remarkable performance. Our source code will be publicly available upon acceptance.
△ Less
Submitted 1 October, 2023; v1 submitted 11 June, 2023;
originally announced June 2023.
-
Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination
Authors:
Xuan-Quy Dao,
Ngoc-Bich Le
Abstract:
This study offers a complete analysis of ChatGPT's mathematics abilities in responding to multiple-choice questions for the Vietnamese National High School Graduation Examination (VNHSGE) on a range of subjects and difficulty levels. The dataset included 250 questions divided into four levels: knowledge (K), comprehension (C), application (A), and high application (H), and it included ten themes t…
▽ More
This study offers a complete analysis of ChatGPT's mathematics abilities in responding to multiple-choice questions for the Vietnamese National High School Graduation Examination (VNHSGE) on a range of subjects and difficulty levels. The dataset included 250 questions divided into four levels: knowledge (K), comprehension (C), application (A), and high application (H), and it included ten themes that covered diverse mathematical concepts. The outcomes demonstrate that ChatGPT's performance varies depending on the difficulty level and subject. It performed best on questions at Level (K), with an accuracy rate of $83\%$; but, as the difficulty level rose, it scored poorly, with an accuracy rate of $10\%$. The study has also shown that ChatGPT significantly succeeds in providing responses to questions on subjects including exponential and logarithmic functions, geometric progression, and arithmetic progression. The study found that ChatGPT had difficulty correctly answering questions on topics including derivatives and applications, spatial geometry, and Oxyz spatial calculus. Additionally, this study contrasted ChatGPT outcomes with Vietnamese students in VNHSGE and in other math competitions. ChatGPT dominated in the SAT Math competition with a success rate of $70\%$, followed by VNHSGE mathematics ($58.8\%)$. However, its success rates were lower on other exams, such as AP Statistics, the GRE Quantitative, AMC 10, AMC 12, and AP Calculus BC. These results suggest that ChatGPT has the potential to be an effective teaching tool for mathematics, but more work is needed to enhance its handling of graphical data and address the challenges presented by questions that are getting more challenging.
△ Less
Submitted 31 October, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Investigating the Impact of Metallicity on Star Formation in the Outer Galaxy. I. VLT/KMOS Survey of Young Stellar Objects in Canis Major
Authors:
Dominika Itrich,
Agata Karska,
Marta Sewiło,
Lars E. Kristensen,
Gregory J. Herczeg,
Suzanne Ramsay,
William J. Fischer,
Benoît Tabone,
Will R. M. Rocha,
Maciej Koprowski,
Ngân Lê,
Beata Deka-Szymankiewicz
Abstract:
The effects of metallicity on the evolution of protoplanetary disks may be studied in the outer Galaxy where the metallicity is lower than in the solar neighbourhood. We present the VLT/KMOS integral field spectroscopy in the near-infrared of $\sim$120 candidate young stellar objects (YSOs) in the CMa-$\ell$224 star-forming region located at a Galactocentric distance of 9.1 kpc. We characterise th…
▽ More
The effects of metallicity on the evolution of protoplanetary disks may be studied in the outer Galaxy where the metallicity is lower than in the solar neighbourhood. We present the VLT/KMOS integral field spectroscopy in the near-infrared of $\sim$120 candidate young stellar objects (YSOs) in the CMa-$\ell$224 star-forming region located at a Galactocentric distance of 9.1 kpc. We characterise the YSO accretion luminosities and accretion rates using the hydrogen Br$γ$ emission and find the median accretion luminosity of $\log{(L_{\rm acc})} = -0.82^{+0.80}_{-0.82} L_\odot$. Based on the measured accretion luminosities, we investigate the hypothesis of star formation history in the CMa-$\ell$224. Their median values suggest that Cluster C, where most of YSO candidates have been identified, might be the most evolved part of the region. The accretion luminosities are similar to those observed toward low-mass YSOs in the Perseus and Orion molecular clouds, and do not reveal the impact of lower metallicity. Similar studies in other outer Galaxy clouds covering a wide range of metallicities are critical to gain a complete picture of star formation in the Galaxy.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
Research Impact of Solar Panel Cleaning Robot on Photovoltaic Panel's Deflection
Authors:
Trung Dat Phan,
Minh Duc Nguyen,
Maxence Auffray,
Nhut Thang Le,
Cong Toai Truong,
Van Tu Duong,
Huy Hung Nguyen,
Tan Tien Nguyen
Abstract:
In the last few decades, solar panel cleaning robots (SPCR) have been widely used for sanitizing photovoltaic (PV) panels as an effective solution for ensuring PV efficiency. However, the dynamic load generated by the SPCR during operation might have a negative impact on PV panels. To reduce these effects, this paper presents the utilization of ANSYS software to simulate multiple scenarios involvi…
▽ More
In the last few decades, solar panel cleaning robots (SPCR) have been widely used for sanitizing photovoltaic (PV) panels as an effective solution for ensuring PV efficiency. However, the dynamic load generated by the SPCR during operation might have a negative impact on PV panels. To reduce these effects, this paper presents the utilization of ANSYS software to simulate multiple scenarios involving the impact of SPCR on PV panels. The simulation scenarios provided in the paper are derived from the typical movements of SPCR observed during practical operations. The simulation results show the deformation process of PV panels, and a second-order polynomial is established to describe the deformed amplitude along the centerline of PV panels. This second-order polynomial contributes to the design process of a damper system for SPCR aiming to reduce the influence of SPCR on PV panels. Moreover, the experiments are conducted to examine the correlation between the results of the simulation and the experiment.
△ Less
Submitted 8 June, 2023; v1 submitted 8 June, 2023;
originally announced June 2023.
-
Automatic retrieval of corresponding US views in longitudinal examinations
Authors:
Hamideh Kerdegari,
Tran Huy Nhat Phung1,
Van Hao Nguyen,
Thi Phuong Thao Truong,
Ngoc Minh Thu Le,
Thanh Phuong Le,
Thi Mai Thao Le,
Luigi Pisani,
Linda Denehy,
Vital Consortium,
Reza Razavi,
Louise Thwaites,
Sophie Yacoub,
Andrew P. King,
Alberto Gomez
Abstract:
Skeletal muscle atrophy is a common occurrence in critically ill patients in the intensive care unit (ICU) who spend long periods in bed. Muscle mass must be recovered through physiotherapy before patient discharge and ultrasound imaging is frequently used to assess the recovery process by measuring the muscle size over time. However, these manual measurements are subject to large variability, par…
▽ More
Skeletal muscle atrophy is a common occurrence in critically ill patients in the intensive care unit (ICU) who spend long periods in bed. Muscle mass must be recovered through physiotherapy before patient discharge and ultrasound imaging is frequently used to assess the recovery process by measuring the muscle size over time. However, these manual measurements are subject to large variability, particularly since the scans are typically acquired on different days and potentially by different operators. In this paper, we propose a self-supervised contrastive learning approach to automatically retrieve similar ultrasound muscle views at different scan times. Three different models were compared using data from 67 patients acquired in the ICU. Results indicate that our contrastive model outperformed a supervised baseline model in the task of view retrieval with an AUC of 73.52% and when combined with an automatic segmentation model achieved 5.7%+/-0.24% error in cross-sectional area. Furthermore, a user study survey confirmed the efficacy of our model for muscle view retrieval.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Improved statistical benchmarking of digital pathology models using pairwise frames evaluation
Authors:
Ylaine Gerardin,
John Shamshoian,
Judy Shen,
Nhat Le,
Jamie Prezioso,
John Abel,
Isaac Finberg,
Daniel Borders,
Raymond Biju,
Michael Nercessian,
Vaed Prasad,
Joseph Lee,
Spencer Wyman,
Sid Gupta,
Abigail Emerson,
Bahar Rahsepar,
Darpan Sanghavi,
Ryan Leung,
Limin Yu,
Archit Khosla,
Amaro Taylor-Weiner
Abstract:
Nested pairwise frames is a method for relative benchmarking of cell or tissue digital pathology models against manual pathologist annotations on a set of sampled patches. At a high level, the method compares agreement between a candidate model and pathologist annotations with agreement among pathologists' annotations. This evaluation framework addresses fundamental issues of data size and annotat…
▽ More
Nested pairwise frames is a method for relative benchmarking of cell or tissue digital pathology models against manual pathologist annotations on a set of sampled patches. At a high level, the method compares agreement between a candidate model and pathologist annotations with agreement among pathologists' annotations. This evaluation framework addresses fundamental issues of data size and annotator variability in using manual pathologist annotations as a source of ground truth for model validation. We implemented nested pairwise frames evaluation for tissue classification, cell classification, and cell count prediction tasks and show results for cell and tissue models deployed on an H&E-stained melanoma dataset.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Constraint-based recommender system for crisis management simulations
Authors:
Ngoc Luyen Le,
**feng Zhong,
Elsa Negre,
Marie-Hélène Abel
Abstract:
In the context of the evacuation of populations, some citizens/volunteers may want and be able to participate in the evacuation of populations in difficulty by coming to lend a hand to emergency/evacuation vehicles with their own vehicles. One way of framing these impulses of solidarity would be to be able to list in real-time the citizens/volunteers available with their vehicles (land, sea, air,…
▽ More
In the context of the evacuation of populations, some citizens/volunteers may want and be able to participate in the evacuation of populations in difficulty by coming to lend a hand to emergency/evacuation vehicles with their own vehicles. One way of framing these impulses of solidarity would be to be able to list in real-time the citizens/volunteers available with their vehicles (land, sea, air, etc.), to be able to geolocate them according to the risk areas to be evacuated, and adding them to the evacuation/rescue vehicles. Because it is difficult to propose an effective real-time operational system on the field in a real crisis situation, in this work, we propose to add a module for recommending driver/vehicle pairs (with their specificities) to a system of crisis management simulation. To do that, we chose to model and develop an ontology-supported constraint-based recommender system for crisis management simulations.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Construction d'un système de recommandation basé sur des contraintes via des graphes de connaissances
Authors:
Ngoc Luyen Le,
Marie-Hélène Abel,
Philippe Gouspillou
Abstract:
Knowledge graphs in RDF model entities and their relations using ontologies, and have gained popularity for information modeling. In recommender systems, knowledge graphs help represent more links and relationships between users and items. Constraint-based recommender systems leverage deep recommendation knowledge to identify relevant suggestions. When combined with knowledge graphs, they offer be…
▽ More
Knowledge graphs in RDF model entities and their relations using ontologies, and have gained popularity for information modeling. In recommender systems, knowledge graphs help represent more links and relationships between users and items. Constraint-based recommender systems leverage deep recommendation knowledge to identify relevant suggestions. When combined with knowledge graphs, they offer benefits in constraint sets. This paper explores a constraint-based recommender system using RDF knowledge graphs for the vehicle purchase/sale domain. Our experiments demonstrate that the proposed approach efficiently identifies recommendations based on user preferences.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Système de recommandations basé sur les contraintes pour les simulations de gestion de crise
Authors:
Ngoc Luyen Le,
**feng Zhong,
Elsa Negre,
Marie-Hélène Abel
Abstract:
In the context of the evacuation of populations, some citizens/volunteers may want and be able to participate in the evacuation of populations in difficulty by coming to lend a hand to emergency/evacuation vehicles with their own vehicles. One way of framing these impulses of solidarity would be to be able to list in real-time the citizens/volunteers available with their vehicles (land, sea, air,…
▽ More
In the context of the evacuation of populations, some citizens/volunteers may want and be able to participate in the evacuation of populations in difficulty by coming to lend a hand to emergency/evacuation vehicles with their own vehicles. One way of framing these impulses of solidarity would be to be able to list in real-time the citizens/volunteers available with their vehicles (land, sea, air, etc.), to be able to geolocate them according to the risk areas to be evacuated, and adding them to the evacuation/rescue vehicles. Because it is difficult to propose an effective real-time operational system on the field in a real crisis situation, in this work, we propose to add a module for recommending driver/vehicle pairs (with their specificities) to a system of crisis management simulation. To do that, we chose to model and develop an ontology-supported constraint-based recommender system for crisis management simulations.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
Z-GMOT: Zero-shot Generic Multiple Object Tracking
Authors:
Kim Hoang Tran,
Anh Duy Le Dinh,
Tien Phat Nguyen,
Thinh Phan,
Pha Nguyen,
Khoa Luu,
Donald Adjeroh,
Gianfranco Doretto,
Ngan Hoang Le
Abstract:
Despite recent significant progress, Multi-Object Tracking (MOT) faces limitations such as reliance on prior knowledge and predefined categories and struggles with unseen objects. To address these issues, Generic Multiple Object Tracking (GMOT) has emerged as an alternative approach, requiring less prior information. However, current GMOT methods often rely on initial bounding boxes and struggle t…
▽ More
Despite recent significant progress, Multi-Object Tracking (MOT) faces limitations such as reliance on prior knowledge and predefined categories and struggles with unseen objects. To address these issues, Generic Multiple Object Tracking (GMOT) has emerged as an alternative approach, requiring less prior information. However, current GMOT methods often rely on initial bounding boxes and struggle to handle variations in factors such as viewpoint, lighting, occlusion, and scale, among others. Our contributions commence with the introduction of the \textit{Referring GMOT dataset} a collection of videos, each accompanied by detailed textual descriptions of their attributes. Subsequently, we propose $\mathtt{Z-GMOT}$, a cutting-edge tracking solution capable of tracking objects from \textit{never-seen categories} without the need of initial bounding boxes or predefined categories. Within our $\mathtt{Z-GMOT}$ framework, we introduce two novel components: (i) $\mathtt{iGLIP}$, an improved Grounded language-image pretraining, for accurately detecting unseen objects with specific characteristics. (ii) $\mathtt{MA-SORT}$, a novel object association approach that adeptly integrates motion and appearance-based matching strategies to tackle the complex task of tracking objects with high similarity. Our contributions are benchmarked through extensive experiments conducted on the Referring GMOT dataset for GMOT task. Additionally, to assess the generalizability of the proposed $\mathtt{Z-GMOT}$, we conduct ablation studies on the DanceTrack and MOT20 datasets for the MOT task. Our dataset, code, and models are released at: https://fsoft-aic.github.io/Z-GMOT.
△ Less
Submitted 13 June, 2024; v1 submitted 28 May, 2023;
originally announced May 2023.
-
Are Large Language Models Robust Coreference Resolvers?
Authors:
Nghia T. Le,
Alan Ritter
Abstract:
Recent work on extending coreference resolution across domains and languages relies on annotated data in both the target domain and language. At the same time, pre-trained large language models (LMs) have been reported to exhibit strong zero- and few-shot learning abilities across a wide range of NLP tasks. However, prior work mostly studied this ability using artificial sentence-level datasets su…
▽ More
Recent work on extending coreference resolution across domains and languages relies on annotated data in both the target domain and language. At the same time, pre-trained large language models (LMs) have been reported to exhibit strong zero- and few-shot learning abilities across a wide range of NLP tasks. However, prior work mostly studied this ability using artificial sentence-level datasets such as the Winograd Schema Challenge. In this paper, we assess the feasibility of prompt-based coreference resolution by evaluating instruction-tuned language models on difficult, linguistically-complex coreference benchmarks (e.g., CoNLL-2012). We show that prompting for coreference can outperform current unsupervised coreference systems, although this approach appears to be reliant on high-quality mention detectors. Further investigations reveal that instruction-tuned LMs generalize surprisingly well across domains, languages, and time periods; yet continued fine-tuning of neural models should still be preferred if small amounts of annotated examples are available.
△ Less
Submitted 14 November, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models
Authors:
Xuan-Quy Dao,
Ngoc-Bich Le,
The-Duy Vo,
Xuan-Dung Phan,
Bac-Bien Ngo,
Van-Tien Nguyen,
Thi-My-Thanh Nguyen,
Hong-Phuoc Nguyen
Abstract:
The VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs), is introduced in this article. The dataset, which covers nine subjects, was generated from the Vietnamese National High School Graduation Examination and comparable tests. 300 literary essays have been included, and there are over 19,000 multiple-choice questions o…
▽ More
The VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs), is introduced in this article. The dataset, which covers nine subjects, was generated from the Vietnamese National High School Graduation Examination and comparable tests. 300 literary essays have been included, and there are over 19,000 multiple-choice questions on a range of topics. The dataset assesses LLMs in multitasking situations such as question answering, text generation, reading comprehension, visual question answering, and more by including both textual data and accompanying images. Using ChatGPT and BingChat, we evaluated LLMs on the VNHSGE dataset and contrasted their performance with that of Vietnamese students to see how well they performed. The results show that ChatGPT and BingChat both perform at a human level in a number of areas, including literature, English, history, geography, and civics education. They still have space to grow, though, especially in the areas of mathematics, physics, chemistry, and biology. The VNHSGE dataset seeks to provide an adequate benchmark for assessing the abilities of LLMs with its wide-ranging coverage and variety of activities. We intend to promote future developments in the creation of LLMs by making this dataset available to the scientific community, especially in resolving LLMs' limits in disciplines involving mathematics and the natural sciences.
△ Less
Submitted 20 May, 2023;
originally announced May 2023.
-
A Liouville type result for fractional GJMS equations on higher dimensional spheres
Authors:
Quynh N. T. Lê,
Quôc Anh Ngô,
Tien-Tai Nguyen
Abstract:
Let $n$ be an integer and $s$ be a real number such that $n > 2s \geq 2$. Inspired by the perturbation approach initiated by F. Hang and P. Yang (\textit{Int. Math. Res. Not. IMRN}, 2020), we are interested in non-negative, smooth solution $v$ to the following higher-order fractional equation \[ {\mathbf P}_n^{2s}(v) = Q_n^{2s}(\varepsilon v+v^α) \] on $\mathbf S^n$ with $0<α\leq (n+2s)/(n-2s)$, a…
▽ More
Let $n$ be an integer and $s$ be a real number such that $n > 2s \geq 2$. Inspired by the perturbation approach initiated by F. Hang and P. Yang (\textit{Int. Math. Res. Not. IMRN}, 2020), we are interested in non-negative, smooth solution $v$ to the following higher-order fractional equation \[ {\mathbf P}_n^{2s}(v) = Q_n^{2s}(\varepsilon v+v^α) \] on $\mathbf S^n$ with $0<α\leq (n+2s)/(n-2s)$, and $\varepsilon \geq 0$. Here ${\mathbf P}_n^{2s}$ is the fractional GJMS type operator of order $2s$ on $\mathbf S^n$ and $Q_n^{2s} ={\mathbf P}_n^{2s}(1)$ is constant. We show that if $\varepsilon >0$ and $0<α\leq (n+2s)/(n-2s)$, then any positive, smooth solution $v$ to the above equation must be constant. The same result remains valid if $\varepsilon=0$ but with $0<α< (n+2s)/(n-2s)$.As a by-product, with $0<α\leq (n+2s)/(n-2s)$, we compute the sharp constant of the subcritical/critical Sobolev inequalities \[ \int_{\mathbf S^n} v {\mathbf P}_n^{2s} (v) dμ_{g_{\mathbf S^n}} \geq \frac{Γ(n/2 + s)}{Γ(n/2 - s )} | \mathbf S^n|^\frac{α-1}{α+1} \Big( \int_{\mathbf S^n} v^{α+1} dμ_{g_{\mathbf S^n}} \Big)^\frac{2}{α+1}. \] for the GJMS operator ${\mathbf P}_n^{2s}$ on $\mathbf S^n$ and for all non-negative functions $v\in H^s(\mathbf S^n)$.
△ Less
Submitted 23 October, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.
-
Investigation of Stellar Kinematics and Ionized gas Outflows in Local (U)LIRGs
Authors:
Ashraf Ayubinia,
Yongquan Xue,
Huynh Anh Nguyen Le,
Fan Zou,
Shu Wang,
Zhicheng He,
Ece Kilerci Eser
Abstract:
We explore properties of stellar kinematics and ionized gas in a sample of 1106 local (U)LIRGs from the AKARI telescope. We combine data from $Wide-field\ Infrared\ Survey\ Explorer$ (WISE) and Sloan Digital Sky Survey (SDSS) Data Release 13 (DR13) to fit the spectral energy distribution (SED) of each source to constrain the contribution of active galactic nuclei (AGNs) to the total IR luminosity…
▽ More
We explore properties of stellar kinematics and ionized gas in a sample of 1106 local (U)LIRGs from the AKARI telescope. We combine data from $Wide-field\ Infrared\ Survey\ Explorer$ (WISE) and Sloan Digital Sky Survey (SDSS) Data Release 13 (DR13) to fit the spectral energy distribution (SED) of each source to constrain the contribution of active galactic nuclei (AGNs) to the total IR luminosity and estimate physical parameters such as stellar mass and star-formation rate (SFR). We split our sample into AGNs and weak/non-AGNs. We find that our sample is considerably above the main sequence. The highest SFRs and stellar masses are associated with ULIRGs. We also fit the H$β$ and H$α$ regions to characterize the outflows. We find that the incidence of ionized gas outflows in AGN (U)LIRGs ($\sim$ 72\%) is much higher than that in weak/non-AGN ones ($\sim$ 39\%). The AGN ULIRGs have extreme outflow velocities (up to $\sim$ 2300 km s$^{-1}$) and high mass-outflow rates (up to $\sim$ 60 \solarm~yr$^{-1}$). Our results suggest that starbursts are insufficient to produce such powerful outflows. We explore the correlations of SFR and specific SFR (sSFR) with ionized gas outflows. We find that AGN hosts with the highest SFRs exhibit a negative correlation between outflow velocity and sSFR. Therefore, in AGNs containing large amounts of gas, the negative feedback scenario might be suggested.
△ Less
Submitted 21 June, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Far-infrared line emission from the outer Galaxy cluster Gy 3-7 with SOFIA/FIFI-LS: Physical conditions and UV fields
Authors:
N. Le,
A. Karska,
M. Figueira,
M. Sewiło,
A. Mirocha,
Ch. Fischer,
M. Kaźmierczak-Barthel,
R. Klein,
M. Gawroński,
M. Koprowski,
K. Kowalczyk,
W. J. Fischer,
K. M. Menten,
F. Wyrowski,
C. König,
L. E. Kristensen
Abstract:
(abridged) Far-infrared (FIR) line emission provides key information about the gas cooling and heating due to shocks and UV radiation associated with the early stages of star formation. Gas cooling via FIR lines might, however, depend on metallicity. We aim to quantify the FIR line emission and determine the spatial distribution of the CO rotational temperature, ultraviolet (UV) radiation field, a…
▽ More
(abridged) Far-infrared (FIR) line emission provides key information about the gas cooling and heating due to shocks and UV radiation associated with the early stages of star formation. Gas cooling via FIR lines might, however, depend on metallicity. We aim to quantify the FIR line emission and determine the spatial distribution of the CO rotational temperature, ultraviolet (UV) radiation field, and H2 number density toward the embedded cluster Gy 3-7 in the CMa-l224 star-forming region, whose metallicity is expected to be intermediate between that of the LMC and the Solar neighborhood. By comparing the total luminosities of CO and [O I] toward Gy 3-7 with values found for low- and high-mass protostars extending over a broad range of metallicities, we also aim to identify the possible effects of metallicity on the FIR line cooling within our Galaxy. We studied SOFIA/FIFI-LS spectra of Gy 3-7 covering several FIR lines. The spatial extent of CO high-J (J>14) emission resembles that of the elongated 160 um continuum emission detected with Herschel. The CO transitions from J=14-13 to J=16-15 are detected throughout the cluster and show a median rotational temperature of 170+/-30 K on Boltzmann diagrams. Comparisons to other protostars observed with Herschel show a good agreement with intermediate-mass sources in the inner Galaxy. Assuming an origin of the [O I] and high-J CO emission in UV-irradiated C-shocks, we obtained pre-shock H2 number densities of 10^4-5 cm-3 and UV radiation field strengths of 0.1-10 Habing fields. Far-IR line observations reveal ongoing star formation in Gy 3-7, dominated by intermediate-mass Class 0/I young stellar objects. The ratio of molecular-to-atomic far-IR line emission shows a decreasing trend with bolometric luminosities of the protostars. However, it does not indicate that the low-metallicity has an impact on the line cooling in Gy 3-7.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Translating Simulation Images to X-ray Images via Multi-Scale Semantic Matching
Authors:
**gxuan Kang,
Tudor Jianu,
Baoru Huang,
Binod Bhattarai,
Ngan Le,
Frans Coenen,
Anh Nguyen
Abstract:
Endovascular intervention training is increasingly being conducted in virtual simulators. However, transferring the experience from endovascular simulators to the real world remains an open problem. The key challenge is the virtual environments are usually not realistically simulated, especially the simulation images. In this paper, we propose a new method to translate simulation images from an en…
▽ More
Endovascular intervention training is increasingly being conducted in virtual simulators. However, transferring the experience from endovascular simulators to the real world remains an open problem. The key challenge is the virtual environments are usually not realistically simulated, especially the simulation images. In this paper, we propose a new method to translate simulation images from an endovascular simulator to X-ray images. Previous image-to-image translation methods often focus on visual effects and neglect structure information, which is critical for medical images. To address this gap, we propose a new method that utilizes multi-scale semantic matching. We apply self-domain semantic matching to ensure that the input image and the generated image have the same positional semantic relationships. We further apply cross-domain matching to eliminate the effects of different styles. The intensive experiment shows that our method generates realistic X-ray images and outperforms other state-of-the-art approaches by a large margin. We also collect a new large-scale dataset to serve as the new benchmark for this task. Our source code and dataset will be made publicly available.
△ Less
Submitted 16 April, 2023;
originally announced April 2023.
-
Distributed Coverage Control of Constrained Constant-Speed Unicycle Multi-Agent Systems
Authors:
Qingchen Liu,
Zengjie Zhang,
Nhan Khanh Le,
Jiahu Qin,
Fangzhou Liu,
Sandra Hirche
Abstract:
This paper proposes a novel distributed coverage controller for a multi-agent system with constant-speed unicycle robots (CSUR). The work is motivated by the limitation of the conventional method that does not ensure the satisfaction of hard state- and input-dependent constraints and leads to feasibility issues for multi-CSUR systems. In this paper, we solve these problems by designing a novel cov…
▽ More
This paper proposes a novel distributed coverage controller for a multi-agent system with constant-speed unicycle robots (CSUR). The work is motivated by the limitation of the conventional method that does not ensure the satisfaction of hard state- and input-dependent constraints and leads to feasibility issues for multi-CSUR systems. In this paper, we solve these problems by designing a novel coverage cost function and a saturated gradient-search-based control law. Invariant set theory and Lyapunov-based techniques are used to prove the state-dependent confinement and the convergence of the system state to the optimal coverage configuration, respectively. The controller is implemented in a distributed manner based on a novel communication standard among the agents. A series of simulation case studies are conducted to validate the effectiveness of the proposed coverage controller in different initial conditions and with control parameters. A comparison study in simulation reveals the advantage of the proposed method in terms of avoiding infeasibility. The experiment study verifies the applicability of the method to real robots with uncertainties. The development procedure of the method from theoretical analysis to experimental validation provides a novel framework for multi-agent system coordinate control with complex agent dynamics.
△ Less
Submitted 14 March, 2024; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Unparticle effects at the MUonE experiment
Authors:
Duc Ninh Le,
Van Dung Le,
Duc Truyen Le,
Van Cuong Le
Abstract:
We investigate possible effects of unparticles at the MUonE experiment by considering a general model for unparticle with broken scale invariance, characterized by the scaling dimension $d$ and the energy scale $μ$ at which the scale invariance is broken. Taking into account available relevant constraints on the couplings of the unparticles with the Standard Model (SM) leptons, we found that the M…
▽ More
We investigate possible effects of unparticles at the MUonE experiment by considering a general model for unparticle with broken scale invariance, characterized by the scaling dimension $d$ and the energy scale $μ$ at which the scale invariance is broken. Taking into account available relevant constraints on the couplings of the unparticles with the Standard Model (SM) leptons, we found that the MUonE experiment at the level of 10 ppm systematic accuracy is sensitive to such effects if $1<d\lesssim 1.4$ and $1\le μ\lesssim 12$ GeV for vector unparticles. The effects of scalar unparticles are too feeble to be detected. The vector unparticles can induce a significant shift on the best-fit value of $a_μ^\text{had}$ at the MUonE, thereby providing an opportunity to detect unparticles or to obtain a new bound on the unparticle-SM couplings in the case of no anomaly.
△ Less
Submitted 21 November, 2023; v1 submitted 10 April, 2023;
originally announced April 2023.
-
Hopf algebras and alternating multiple zeta values in positive characteristic
Authors:
Bo-Hae Im,
Ho** Kim,
Khac Nhuan Le,
Tuan Ngo Dac,
Lan Huong Pham
Abstract:
In \cite{IKLNDP23} we presented a systematic study of algebra structures of multiple zeta values in positive characteristic introduced by Thakur as analogues of classical multiple zeta values of Euler. In this paper we construct algebra and Hopf algebra structures of alternating multiple zeta values introduced by Harada, extending our previous work. Our results could be considered as an analogue o…
▽ More
In \cite{IKLNDP23} we presented a systematic study of algebra structures of multiple zeta values in positive characteristic introduced by Thakur as analogues of classical multiple zeta values of Euler. In this paper we construct algebra and Hopf algebra structures of alternating multiple zeta values introduced by Harada, extending our previous work. Our results could be considered as an analogue of those of Hoffman \cite{Hof00} and Racinet \cite{Rac02} in the classical setting. The proof is based on two new ingredients: the first one is a direct and explicit construction of the shuffle Hopf algebra structure, and the second one is the notion of horizontal maps.
△ Less
Submitted 5 April, 2023;
originally announced April 2023.
-
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding
Authors:
Thanh-Dat Truong,
Ngan Le,
Bhiksha Raj,
Jackson Cothren,
Khoa Luu
Abstract:
Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed. In addition, fairness is one of the most critical aspects when deploying the segmentation models into human-related real-world applications, e.g., autonomous driving, as any unfair predictions could inf…
▽ More
Although Domain Adaptation in Semantic Scene Segmentation has shown impressive improvement in recent years, the fairness concerns in the domain adaptation have yet to be well defined and addressed. In addition, fairness is one of the most critical aspects when deploying the segmentation models into human-related real-world applications, e.g., autonomous driving, as any unfair predictions could influence human safety. In this paper, we propose a novel Fairness Domain Adaptation (FREDOM) approach to semantic scene segmentation. In particular, from the proposed formulated fairness objective, a new adaptation framework will be introduced based on the fair treatment of class distributions. Moreover, to generally model the context of structural dependency, a new conditional structural constraint is introduced to impose the consistency of predicted segmentation. Thanks to the proposed Conditional Structure Network, the self-attention mechanism has sufficiently modeled the structural information of segmentation. Through the ablation studies, the proposed method has shown the performance improvement of the segmentation models and promoted fairness in the model predictions. The experimental results on the two standard benchmarks, i.e., SYNTHIA $\to$ Cityscapes and GTA5 $\to$ Cityscapes, have shown that our method achieved State-of-the-Art (SOTA) performance.
△ Less
Submitted 4 April, 2023;
originally announced April 2023.
-
Well-Rounded ideal lattices of cyclic cubic and quartic fields
Authors:
Dat T. Tran,
Nam H. Le,
Ha T. N. Tran
Abstract:
In this paper, we find criteria for when cyclic cubic and cyclic quartic fields have well-rounded ideal lattices. We show that every cyclic cubic field has at least one well-rounded ideal. We also prove that there exist families of cyclic quartic fields which have well-rounded ideals and explicitly construct their minimal bases. In addition, for a given prime number $p$, if a cyclic quartic field…
▽ More
In this paper, we find criteria for when cyclic cubic and cyclic quartic fields have well-rounded ideal lattices. We show that every cyclic cubic field has at least one well-rounded ideal. We also prove that there exist families of cyclic quartic fields which have well-rounded ideals and explicitly construct their minimal bases. In addition, for a given prime number $p$, if a cyclic quartic field has a unique prime ideal above $p$, then we provide the necessary and sufficient conditions for that ideal to be well-rounded. Moreover, in cyclic quartic fields, we provide the prime decomposition of all odd prime numbers and construct an explicit integral basis for every prime ideal.
△ Less
Submitted 13 October, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
MoViT: Memorizing Vision Transformers for Medical Image Analysis
Authors:
Yiqing Shen,
Pengfei Guo,
**gpu Wu,
Qianqi Huang,
Nhat Le,
**yuan Zhou,
Shanshan Jiang,
Mathias Unberath
Abstract:
The synergy of long-range dependencies from transformers and local representations of image content from convolutional neural networks (CNNs) has led to advanced architectures and increased performance for various medical image analysis tasks due to their complementary benefits. However, compared with CNNs, transformers require considerably more training data, due to a larger number of parameters…
▽ More
The synergy of long-range dependencies from transformers and local representations of image content from convolutional neural networks (CNNs) has led to advanced architectures and increased performance for various medical image analysis tasks due to their complementary benefits. However, compared with CNNs, transformers require considerably more training data, due to a larger number of parameters and an absence of inductive bias. The need for increasingly large datasets continues to be problematic, particularly in the context of medical imaging, where both annotation efforts and data protection result in limited data availability. In this work, inspired by the human decision-making process of correlating new evidence with previously memorized experience, we propose a Memorizing Vision Transformer (MoViT) to alleviate the need for large-scale datasets to successfully train and deploy transformer-based architectures. MoViT leverages an external memory structure to cache history attention snapshots during the training stage. To prevent overfitting, we incorporate an innovative memory update scheme, attention temporal moving average, to update the stored external memories with the historical moving average. For inference speedup, we design a prototypical attention learning method to distill the external memory into smaller representative subsets. We evaluate our method on a public histology image dataset and an in-house MRI dataset, demonstrating that MoViT applied to varied medical image analysis tasks, can outperform vanilla transformer models across varied data regimes, especially in cases where only a small amount of annotated data is available. More importantly, MoViT can reach a competitive performance of ViT with only 3.0% of the training data.
△ Less
Submitted 29 September, 2023; v1 submitted 27 March, 2023;
originally announced March 2023.
-
SPONGE: Sequence Planning with Deformable-ON-Rigid Contact Prediction from Geometric Features
Authors:
Tran Nguyen Le,
Fares J. Abu-Dakka,
Ville Kyrki
Abstract:
Planning robotic manipulation tasks, especially those that involve interaction between deformable and rigid objects, is challenging due to the complexity in predicting such interactions. We introduce SPONGE, a sequence planning pipeline powered by a deep learning-based contact prediction model for contacts between deformable and rigid bodies under interactions. The contact prediction model is trai…
▽ More
Planning robotic manipulation tasks, especially those that involve interaction between deformable and rigid objects, is challenging due to the complexity in predicting such interactions. We introduce SPONGE, a sequence planning pipeline powered by a deep learning-based contact prediction model for contacts between deformable and rigid bodies under interactions. The contact prediction model is trained on synthetic data generated by a developed simulation environment to learn the map** from point-cloud observation of a rigid target object and the pose of a deformable tool, to 3D representation of the contact points between the two bodies. We experimentally evaluated the proposed approach for a dish cleaning task both in simulation and on a real \panda with real-world objects. The experimental results demonstrate that in both scenarios the proposed planning pipeline is capable of generating high-quality trajectories that can accomplish the task by achieving more than 90\% area coverage on different objects of varying sizes and curvatures while minimizing travel distance. Code and video are available at: \url{https://irobotics.aalto.fi/sponge/}.
△ Less
Submitted 24 March, 2023;
originally announced March 2023.
-
Music-Driven Group Choreography
Authors:
Nhat Le,
Thang Pham,
Tuong Do,
Erman Tjiputra,
Quang D. Tran,
Anh Nguyen
Abstract:
Music-driven choreography is a challenging problem with a wide variety of industrial applications. Recently, many methods have been proposed to synthesize dance motions from music for a single dancer. However, generating dance motion for a group remains an open problem. In this paper, we present $\rm AIOZ-GDANCE$, a new large-scale dataset for music-driven group dance generation. Unlike existing d…
▽ More
Music-driven choreography is a challenging problem with a wide variety of industrial applications. Recently, many methods have been proposed to synthesize dance motions from music for a single dancer. However, generating dance motion for a group remains an open problem. In this paper, we present $\rm AIOZ-GDANCE$, a new large-scale dataset for music-driven group dance generation. Unlike existing datasets that only support single dance, our new dataset contains group dance videos, hence supporting the study of group choreography. We propose a semi-autonomous labeling method with humans in the loop to obtain the 3D ground truth for our dataset. The proposed dataset consists of 16.7 hours of paired music and 3D motion from in-the-wild videos, covering 7 dance styles and 16 music genres. We show that naively applying single dance generation technique to creating group dance motion may lead to unsatisfactory results, such as inconsistent movements and collisions between dancers. Based on our new dataset, we propose a new method that takes an input music sequence and a set of 3D positions of dancers to efficiently produce multiple group-coherent choreographies. We propose new evaluation metrics for measuring group dance quality and perform intensive experiments to demonstrate the effectiveness of our method. Our project facilitates future research on group dance generation and is available at: https://aioz-ai.github.io/AIOZ-GDANCE/
△ Less
Submitted 26 March, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.