-
Adaptation of Distinct Semantics for Uncertain Areas in Polyp Segmentation
Authors:
Quang Vinh Nguyen,
Van Thong Huynh,
Soo-Hyung Kim
Abstract:
Colonoscopy is a common and practical method for detecting and treating polyps. Segmenting polyps from colonoscopy image is useful for diagnosis and surgery progress. Nevertheless, achieving excellent segmentation performance is still difficult because of polyp characteristics like shape, color, condition, and obvious non-distinction from the surrounding context. This work presents a new novel arc…
▽ More
Colonoscopy is a common and practical method for detecting and treating polyps. Segmenting polyps from colonoscopy image is useful for diagnosis and surgery progress. Nevertheless, achieving excellent segmentation performance is still difficult because of polyp characteristics like shape, color, condition, and obvious non-distinction from the surrounding context. This work presents a new novel architecture namely Adaptation of Distinct Semantics for Uncertain Areas in Polyp Segmentation (ADSNet), which modifies misclassified details and recovers weak features having the ability to vanish and not be detected at the final stage. The architecture consists of a complementary trilateral decoder to produce an early global map. A continuous attention module modifies semantics of high-level features to analyze two separate semantics of the early global map. The suggested method is experienced on polyp benchmarks in learning ability and generalization ability, experimental results demonstrate the great correction and recovery ability leading to better segmentation performance compared to the other state of the art in the polyp image segmentation task. Especially, the proposed architecture could be experimented flexibly for other CNN-based encoders, Transformer-based encoders, and decoder backbones.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement Estimation in Conversation
Authors:
Vu Ngoc Tu,
Van Thong Huynh,
Hyung-Jeong Yang,
M. Zaigham Zaheer,
Shah Nawaz,
Karthik Nandakumar,
Soo-Hyung Kim
Abstract:
Conversational engagement estimation is posed as a regression problem, entailing the identification of the favorable attention and involvement of the participants in the conversation. This task arises as a crucial pursuit to gain insights into human's interaction dynamics and behavior patterns within a conversation. In this research, we introduce a dilated convolutional Transformer for modeling an…
▽ More
Conversational engagement estimation is posed as a regression problem, entailing the identification of the favorable attention and involvement of the participants in the conversation. This task arises as a crucial pursuit to gain insights into human's interaction dynamics and behavior patterns within a conversation. In this research, we introduce a dilated convolutional Transformer for modeling and estimating human engagement in the MULTIMEDIATE 2023 competition. Our proposed system surpasses the baseline models, exhibiting a noteworthy $7$\% improvement on test set and $4$\% on validation set. Moreover, we employ different modality fusion mechanism and show that for this type of data, a simple concatenated method with self-attention fusion gains the best performance.
△ Less
Submitted 31 July, 2023;
originally announced August 2023.
-
Multi-scale Transformer-based Network for Emotion Recognition from Multi Physiological Signals
Authors:
Tu Vu,
Van Thong Huynh,
Soo-Hyung Kim
Abstract:
This paper presents an efficient Multi-scale Transformer-based approach for the task of Emotion recognition from Physiological data, which has gained widespread attention in the research community due to the vast amount of information that can be extracted from these signals using modern sensors and machine learning techniques. Our approach involves applying a Multi-modal technique combined with s…
▽ More
This paper presents an efficient Multi-scale Transformer-based approach for the task of Emotion recognition from Physiological data, which has gained widespread attention in the research community due to the vast amount of information that can be extracted from these signals using modern sensors and machine learning techniques. Our approach involves applying a Multi-modal technique combined with scaling data to establish the relationship between internal body signals and human emotions. Additionally, we utilize Transformer and Gaussian Transformation techniques to improve signal encoding effectiveness and overall performance. Our model achieves decent results on the CASE dataset of the EPiC competition, with an RMSE score of 1.45.
△ Less
Submitted 7 May, 2023; v1 submitted 1 May, 2023;
originally announced May 2023.
-
TextANIMAR: Text-based 3D Animal Fine-Grained Retrieval
Authors:
Trung-Nghia Le,
Tam V. Nguyen,
Minh-Quan Le,
Trong-Thuan Nguyen,
Viet-Tham Huynh,
Trong-Le Do,
Khanh-Duy Le,
Mai-Khiem Tran,
Nhat Hoang-Xuan,
Thang-Long Nguyen-Ho,
Vinh-Tiep Nguyen,
Tuong-Nghiem Diep,
Khanh-Duy Ho,
Xuan-Hieu Nguyen,
Thien-Phuc Tran,
Tuan-Anh Yang,
Kim-Phat Tran,
Nhu-Vinh Hoang,
Minh-Quang Nguyen,
E-Ro Nguyen,
Minh-Khoi Nguyen-Nhat,
Tuan-An To,
Trung-Truc Huynh-Le,
Nham-Tan Nguyen,
Hoang-Chau Luong
, et al. (8 additional authors not shown)
Abstract:
3D object retrieval is an important yet challenging task that has drawn more and more attention in recent years. While existing approaches have made strides in addressing this issue, they are often limited to restricted settings such as image and sketch queries, which are often unfriendly interactions for common users. In order to overcome these limitations, this paper presents a novel SHREC chall…
▽ More
3D object retrieval is an important yet challenging task that has drawn more and more attention in recent years. While existing approaches have made strides in addressing this issue, they are often limited to restricted settings such as image and sketch queries, which are often unfriendly interactions for common users. In order to overcome these limitations, this paper presents a novel SHREC challenge track focusing on text-based fine-grained retrieval of 3D animal models. Unlike previous SHREC challenge tracks, the proposed task is considerably more challenging, requiring participants to develop innovative approaches to tackle the problem of text-based retrieval. Despite the increased difficulty, we believe this task can potentially drive useful applications in practice and facilitate more intuitive interactions with 3D objects. Five groups participated in our competition, submitting a total of 114 runs. While the results obtained in our competition are satisfactory, we note that the challenges presented by this task are far from fully solved. As such, we provide insights into potential areas for future research and improvements. We believe we can help push the boundaries of 3D object retrieval and facilitate more user-friendly interactions via vision-language technologies. https://aichallenge.hcmus.edu.vn/textanimar
△ Less
Submitted 9 August, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.
-
SketchANIMAR: Sketch-based 3D Animal Fine-Grained Retrieval
Authors:
Trung-Nghia Le,
Tam V. Nguyen,
Minh-Quan Le,
Trong-Thuan Nguyen,
Viet-Tham Huynh,
Trong-Le Do,
Khanh-Duy Le,
Mai-Khiem Tran,
Nhat Hoang-Xuan,
Thang-Long Nguyen-Ho,
Vinh-Tiep Nguyen,
Nhat-Quynh Le-Pham,
Huu-Phuc Pham,
Trong-Vu Hoang,
Quang-Binh Nguyen,
Trong-Hieu Nguyen-Mau,
Tuan-Luc Huynh,
Thanh-Danh Le,
Ngoc-Linh Nguyen-Ha,
Tuong-Vy Truong-Thuy,
Truong Hoai Phong,
Tuong-Nghiem Diep,
Khanh-Duy Ho,
Xuan-Hieu Nguyen,
Thien-Phuc Tran
, et al. (9 additional authors not shown)
Abstract:
The retrieval of 3D objects has gained significant importance in recent years due to its broad range of applications in computer vision, computer graphics, virtual reality, and augmented reality. However, the retrieval of 3D objects presents significant challenges due to the intricate nature of 3D models, which can vary in shape, size, and texture, and have numerous polygons and vertices. To this…
▽ More
The retrieval of 3D objects has gained significant importance in recent years due to its broad range of applications in computer vision, computer graphics, virtual reality, and augmented reality. However, the retrieval of 3D objects presents significant challenges due to the intricate nature of 3D models, which can vary in shape, size, and texture, and have numerous polygons and vertices. To this end, we introduce a novel SHREC challenge track that focuses on retrieving relevant 3D animal models from a dataset using sketch queries and expedites accessing 3D models through available sketches. Furthermore, a new dataset named ANIMAR was constructed in this study, comprising a collection of 711 unique 3D animal models and 140 corresponding sketch queries. Our contest requires participants to retrieve 3D models based on complex and detailed sketches. We receive satisfactory results from eight teams and 204 runs. Although further improvement is necessary, the proposed task has the potential to incentivize additional research in the domain of 3D object retrieval, potentially yielding benefits for a wide range of applications. We also provide insights into potential areas of future research, such as improving techniques for feature extraction and matching and creating more diverse datasets to evaluate retrieval performance. https://aichallenge.hcmus.edu.vn/sketchanimar
△ Less
Submitted 9 August, 2023; v1 submitted 12 April, 2023;
originally announced April 2023.
-
Vision Transformer for Action Units Detection
Authors:
Tu Vu,
Van Thong Huynh,
Soo Hyung Kim
Abstract:
Facial Action Units detection (FAUs) represents a fine-grained classification problem that involves identifying different units on the human face, as defined by the Facial Action Coding System. In this paper, we present a simple yet efficient Vision Transformer-based approach for addressing the task of Action Units (AU) detection in the context of Affective Behavior Analysis in-the-wild (ABAW) com…
▽ More
Facial Action Units detection (FAUs) represents a fine-grained classification problem that involves identifying different units on the human face, as defined by the Facial Action Coding System. In this paper, we present a simple yet efficient Vision Transformer-based approach for addressing the task of Action Units (AU) detection in the context of Affective Behavior Analysis in-the-wild (ABAW) competition. We employ the Video Vision Transformer(ViViT) Network to capture the temporal facial change in the video. Besides, to reduce massive size of the Vision Transformers model, we replace the ViViT feature extraction layers with the CNN backbone (Regnet). Our model outperform the baseline model of ABAW 2023 challenge, with a notable 14% difference in result. Furthermore, the achieved results are comparable to those of the top three teams in the previous ABAW 2022 challenge.
△ Less
Submitted 20 March, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Efficient learning of large sets of locally optimal classification rules
Authors:
Van Quoc Phuong Huynh,
Johannes Fürnkranz,
Florian Beck
Abstract:
Conventional rule learning algorithms aim at finding a set of simple rules, where each rule covers as many examples as possible. In this paper, we argue that the rules found in this way may not be the optimal explanations for each of the examples they cover. Instead, we propose an efficient algorithm that aims at finding the best rule covering each training example in a greedy optimization consist…
▽ More
Conventional rule learning algorithms aim at finding a set of simple rules, where each rule covers as many examples as possible. In this paper, we argue that the rules found in this way may not be the optimal explanations for each of the examples they cover. Instead, we propose an efficient algorithm that aims at finding the best rule covering each training example in a greedy optimization consisting of one specialization and one generalization loop. These locally optimal rules are collected and then filtered for a final rule set, which is much larger than the sets learned by conventional rule learning algorithms. A new example is classified by selecting the best among the rules that cover this example. In our experiments on small to very large datasets, the approach's average classification accuracy is higher than that of state-of-the-art rule learning algorithms. Moreover, the algorithm is highly efficient and can inherently be processed in parallel without affecting the learned rule set and so the classification accuracy. We thus believe that it closes an important gap for large-scale classification rule induction.
△ Less
Submitted 26 January, 2023; v1 submitted 24 January, 2023;
originally announced January 2023.
-
Generic Event Boundary Detection in Video with Pyramid Features
Authors:
Van Thong Huynh,
Hyung-Jeong Yang,
Guee-Sang Lee,
Soo-Hyung Kim
Abstract:
Generic event boundary detection (GEBD) aims to split video into chunks at a broad and diverse set of actions as humans naturally perceive event boundaries. In this study, we present an approach that considers the correlation between neighbor frames with pyramid feature maps in both spatial and temporal dimensions to construct a framework for localizing generic events in video. The features at mul…
▽ More
Generic event boundary detection (GEBD) aims to split video into chunks at a broad and diverse set of actions as humans naturally perceive event boundaries. In this study, we present an approach that considers the correlation between neighbor frames with pyramid feature maps in both spatial and temporal dimensions to construct a framework for localizing generic events in video. The features at multiple spatial dimensions of a pre-trained ResNet-50 are exploited with different views in the temporal dimension to form a temporal pyramid feature map. Based on that, the similarity between neighbor frames is calculated and projected to build a temporal pyramid similarity feature vector. A decoder with 1D convolution operations is used to decode these similarities to a new representation that incorporates their temporal relationship for later boundary score estimation. Extensive experiments conducted on the GEBD benchmark dataset show the effectiveness of our system and its variations, in which we outperformed the state-of-the-art approaches. Additional experiments on TAPOS dataset, which contains long-form videos with Olympic sport actions, demonstrated the effectiveness of our study compared to others.
△ Less
Submitted 10 January, 2023;
originally announced January 2023.
-
Facial Expression Classification using Fusion of Deep Neural Network in Video for the 3rd ABAW3 Competition
Authors:
Kim Ngan Phan,
Hong-Hai Nguyen,
Van-Thong Huynh,
Soo-Hyung Kim
Abstract:
For computers to recognize human emotions, expression classification is an equally important problem in the human-computer interaction area. In the 3rd Affective Behavior Analysis In-The-Wild competition, the task of expression classification includes eight classes with six basic expressions of human faces from videos. In this paper, we employ a transformer mechanism to encode the robust represent…
▽ More
For computers to recognize human emotions, expression classification is an equally important problem in the human-computer interaction area. In the 3rd Affective Behavior Analysis In-The-Wild competition, the task of expression classification includes eight classes with six basic expressions of human faces from videos. In this paper, we employ a transformer mechanism to encode the robust representation from the backbone. Fusion of the robust representations plays an important role in the expression classification task. Our approach achieves 30.35\% and 28.60\% for the $F_1$ score on the validation set and the test set, respectively. This result shows the effectiveness of the proposed architecture based on the Aff-Wild2 dataset.
△ Less
Submitted 8 April, 2022; v1 submitted 24 March, 2022;
originally announced March 2022.
-
An Ensemble Approach for Facial Expression Analysis in Video
Authors:
Hong-Hai Nguyen,
Van-Thong Huynh,
Soo-Hyung Kim
Abstract:
Human emotions recognization contributes to the development of human-computer interaction. The machines understanding human emotions in the real world will significantly contribute to life in the future. This paper will introduce the Affective Behavior Analysis in-the-wild (ABAW3) 2022 challenge. The paper focuses on solving the problem of the valence-arousal estimation and action unit detection.…
▽ More
Human emotions recognization contributes to the development of human-computer interaction. The machines understanding human emotions in the real world will significantly contribute to life in the future. This paper will introduce the Affective Behavior Analysis in-the-wild (ABAW3) 2022 challenge. The paper focuses on solving the problem of the valence-arousal estimation and action unit detection. For valence-arousal estimation, we conducted two stages: creating new features from multimodel and temporal learning to predict valence-arousal. First, we make new features; the Gated Recurrent Unit (GRU) and Transformer are combined using a Regular Networks (RegNet) feature, which is extracted from the image. The next step is the GRU combined with Local Attention to predict valence-arousal. The Concordance Correlation Coefficient (CCC) was used to evaluate the model.
△ Less
Submitted 24 March, 2022;
originally announced March 2022.
-
Communication-hiding pipelined BiCGSafe methods for solving large linear systems
Authors:
Viet Q. H. Huynh,
Hiroshi Suito
Abstract:
Recently, a new variant of the BiCGStab method, known as the pipeline BiCGStab, has been proposed. This method can achieve a higher degree of scalability and speed-up rates through a mechanism in which the communication phase for the computation of the inner product can be overlapped with the computation of the matrix-vector product. On the other hand, there exist several generalized iteration met…
▽ More
Recently, a new variant of the BiCGStab method, known as the pipeline BiCGStab, has been proposed. This method can achieve a higher degree of scalability and speed-up rates through a mechanism in which the communication phase for the computation of the inner product can be overlapped with the computation of the matrix-vector product. On the other hand, there exist several generalized iteration methods with better convergence behavior than BiCGStab such as ssBiCGSafe, BiCGSafe, GPBi-CG. Of these methods, ssBiCGSafe, which requires a single phase of computing inner products per one iteration, is best suited for high-performance computing systems. In this paper, inspired by the success of the pipelined BiCGStab method, we propose variations of the ssBiCGSafe method, in which only one phase of inner product computation per iteration is required and this inner product computation phase can be overlapped with the matrix-vector computation. Through numerical experiments, we show that the proposed methods lead to improvements in convergence behavior and execution time compared to the pipelined BiCGStab and ssBiCGSafe methods.
△ Less
Submitted 27 October, 2021; v1 submitted 24 August, 2021;
originally announced August 2021.
-
Temporal Convolution Networks with Positional Encoding for Evoked Expression Estimation
Authors:
VanThong Huynh,
Guee-Sang Lee,
Hyung-Jeong Yang,
Soo-Huyng Kim
Abstract:
This paper presents an approach for Evoked Expressions from Videos (EEV) challenge, which aims to predict evoked facial expressions from video. We take advantage of pre-trained models on large-scale datasets in computer vision and audio signals to extract the deep representation of timestamps in the video. A temporal convolution network, rather than an RNN like architecture, is used to explore tem…
▽ More
This paper presents an approach for Evoked Expressions from Videos (EEV) challenge, which aims to predict evoked facial expressions from video. We take advantage of pre-trained models on large-scale datasets in computer vision and audio signals to extract the deep representation of timestamps in the video. A temporal convolution network, rather than an RNN like architecture, is used to explore temporal relationships due to its advantage in memory consumption and parallelism. Furthermore, to address the missing annotations of some timestamps, positional encoding is employed to ensure continuity of input data when discarding these timestamps during training. We achieved state-of-the-art results on the EEV challenge with a Pearson correlation coefficient of 0.05477, the first ranked performance in the EEV 2021 challenge.
△ Less
Submitted 16 June, 2021;
originally announced June 2021.
-
Text Generation with Deep Variational GAN
Authors:
Mahmoud Hossam,
Trung Le,
Michael Papasimeon,
Viet Huynh,
Dinh Phung
Abstract:
Generating realistic sequences is a central task in many machine learning applications. There has been considerable recent progress on building deep generative models for sequence generation tasks. However, the issue of mode-collapsing remains a main issue for the current models. In this paper we propose a GAN-based generic framework to address the problem of mode-collapse in a principled approach…
▽ More
Generating realistic sequences is a central task in many machine learning applications. There has been considerable recent progress on building deep generative models for sequence generation tasks. However, the issue of mode-collapsing remains a main issue for the current models. In this paper we propose a GAN-based generic framework to address the problem of mode-collapse in a principled approach. We change the standard GAN objective to maximize a variational lower-bound of the log-likelihood while minimizing the Jensen-Shanon divergence between data and model distributions. We experiment our model with text generation task and show that it can generate realistic text with high diversity.
△ Less
Submitted 27 April, 2021;
originally announced April 2021.
-
Improved and Efficient Text Adversarial Attacks using Target Information
Authors:
Mahmoud Hossam,
Trung Le,
He Zhao,
Viet Huynh,
Dinh Phung
Abstract:
There has been recently a growing interest in studying adversarial examples on natural language models in the black-box setting. These methods attack natural language classifiers by perturbing certain important words until the classifier label is changed. In order to find these important words, these methods rank all words by importance by querying the target model word by word for each input sent…
▽ More
There has been recently a growing interest in studying adversarial examples on natural language models in the black-box setting. These methods attack natural language classifiers by perturbing certain important words until the classifier label is changed. In order to find these important words, these methods rank all words by importance by querying the target model word by word for each input sentence, resulting in high query inefficiency. A new interesting approach was introduced that addresses this problem through interpretable learning to learn the word ranking instead of previous expensive search. The main advantage of using this approach is that it achieves comparable attack rates to the state-of-the-art methods, yet faster and with fewer queries, where fewer queries are desirable to avoid suspicion towards the attacking agent. Nonetheless, this approach sacrificed the useful information that could be leveraged from the target classifier for that sake of query efficiency. In this paper we study the effect of leveraging the target model outputs and data on both attack rates and average number of queries, and we show that both can be improved, with a limited overhead of additional queries.
△ Less
Submitted 2 May, 2021; v1 submitted 27 April, 2021;
originally announced April 2021.
-
A Strong Baseline for Vehicle Re-Identification
Authors:
Su V. Huynh,
Nam H. Nguyen,
Ngoc T. Nguyen,
Vinh TQ. Nguyen,
Chau Huynh,
Chuong Nguyen
Abstract:
Vehicle Re-Identification (Re-ID) aims to identify the same vehicle across different cameras, hence plays an important role in modern traffic management systems. The technical challenges require the algorithms must be robust in different views, resolution, occlusion and illumination conditions. In this paper, we first analyze the main factors hindering the Vehicle Re-ID performance. We then presen…
▽ More
Vehicle Re-Identification (Re-ID) aims to identify the same vehicle across different cameras, hence plays an important role in modern traffic management systems. The technical challenges require the algorithms must be robust in different views, resolution, occlusion and illumination conditions. In this paper, we first analyze the main factors hindering the Vehicle Re-ID performance. We then present our solutions, specifically targeting the dataset Track 2 of the 5th AI City Challenge, including (1) reducing the domain gap between real and synthetic data, (2) network modification by stacking multi heads with attention mechanism, (3) adaptive loss weight adjustment. Our method achieves 61.34% mAP on the private CityFlow testset without using external dataset or pseudo labeling, and outperforms all previous works at 87.1% mAP on the Veri benchmark. The code is available at https://github.com/cybercore-co-ltd/track2_aicity_2021.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
Topic Modelling Meets Deep Neural Networks: A Survey
Authors:
He Zhao,
Dinh Phung,
Viet Huynh,
Yuan **,
Lan Du,
Wray Buntine
Abstract:
Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to…
▽ More
Topic modelling has been a successful technique for text analysis for almost twenty years. When topic modelling met deep neural networks, there emerged a new and increasingly popular research area, neural topic models, with over a hundred models developed and a wide range of applications in neural language understanding such as text generation, summarisation and language models. There is a need to summarise research developments and discuss open problems and future directions. In this paper, we provide a focused yet comprehensive overview of neural topic models for interested researchers in the AI community, so as to facilitate them to navigate and innovate in this fast-growing research area. To the best of our knowledge, ours is the first review focusing on this specific topic.
△ Less
Submitted 28 February, 2021;
originally announced March 2021.
-
Artificial intelligence supported anemia control system (AISACS) to prevent anemia in maintenance hemodialysis patients
Authors:
Toshiaki Ohara,
Hiroshi Ikeda,
Yoshiki Sugitani,
Hiroshi Suito,
Viet Quang Huy Huynh,
Masaru Kinomura,
Soichiro Haraguchi,
Kazufumi Sakurama
Abstract:
Anemia, for which erythropoiesis-stimulating agents (ESAs) and iron supplements (ISs) are used as preventive measures, presents important difficulties for hemodialysis patients. Nevertheless, the number of physicians able to manage such medications appropriately is not kee** pace with the rapid increase of hemodialysis patients. Moreover, the high cost of ESAs imposes heavy burdens on medical in…
▽ More
Anemia, for which erythropoiesis-stimulating agents (ESAs) and iron supplements (ISs) are used as preventive measures, presents important difficulties for hemodialysis patients. Nevertheless, the number of physicians able to manage such medications appropriately is not kee** pace with the rapid increase of hemodialysis patients. Moreover, the high cost of ESAs imposes heavy burdens on medical insurance systems. An artificial-intelligence-supported anemia control system (AISACS) trained using administration direction data from experienced physicians has been developed by the authors. For the system, appropriate data selection and rectification techniques play important roles. Decision making related to ESAs poses a multi-class classification problem for which a two-step classification technique is introduced. Several validations have demonstrated that AISACS exhibits high performance with correct classification rates of 72-87% and clinically appropriate classification rates of 92-98%.
△ Less
Submitted 5 October, 2020;
originally announced October 2020.
-
Neural Topic Model via Optimal Transport
Authors:
He Zhao,
Dinh Phung,
Viet Huynh,
Trung Le,
Wray Buntine
Abstract:
Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have obtained increasingly research interest due to their promising results on text analysis. However, it is usually hard for existing NTMs to achieve good document representation and coherent/diverse topics at the same time. Moreover, they often degrade their performance severely on short documents. The requirement of repar…
▽ More
Recently, Neural Topic Models (NTMs) inspired by variational autoencoders have obtained increasingly research interest due to their promising results on text analysis. However, it is usually hard for existing NTMs to achieve good document representation and coherent/diverse topics at the same time. Moreover, they often degrade their performance severely on short documents. The requirement of reparameterisation could also comprise their training quality and model flexibility. To address these shortcomings, we present a new neural topic model via the theory of optimal transport (OT). Specifically, we propose to learn the topic distribution of a document by directly minimising its OT distance to the document's word distributions. Importantly, the cost matrix of the OT distance models the weights between topics and words, which is constructed by the distances between topics and words in an embedding space. Our proposed model can be trained efficiently with a differentiable loss. Extensive experiments show that our framework significantly outperforms the state-of-the-art NTMs on discovering more coherent and diverse topics and deriving better document representations for both regular and short texts.
△ Less
Submitted 31 May, 2022; v1 submitted 12 August, 2020;
originally announced August 2020.
-
State Estimation-Based Robust Optimal Control of Influenza Epidemics in an Interactive Human Society
Authors:
Vahid Azimi,
Mojtaba Sharifi,
Seyed Fakoorian,
Thang Tien Nguyen,
Van Van Huynh
Abstract:
This paper presents a state estimation-based robust optimal control strategy for influenza epidemics in an interactive human society in the presence of modeling uncertainties. Interactive society is influenced by the random entrance of individuals from other human societies whose effects can be modeled as a non-Gaussian noise. Since only the number of exposed and infected humans can be measured, s…
▽ More
This paper presents a state estimation-based robust optimal control strategy for influenza epidemics in an interactive human society in the presence of modeling uncertainties. Interactive society is influenced by the random entrance of individuals from other human societies whose effects can be modeled as a non-Gaussian noise. Since only the number of exposed and infected humans can be measured, states of the influenza epidemics are first estimated by an extended maximum correntropy Kalman filter (EMCKF) to provide a robust state estimation in the presence of the non-Gaussian noise. An online quadratic program (QP) optimization is then synthesized subject to a robust control Lyapunov function (RCLF) to minimize susceptible and infected humans, while minimizing and bounding the rates of vaccination and antiviral treatment. The joint QP-RCLF-EMCKF meets multiple design specifications such as state estimation, tracking, pointwise control optimality, and robustness to parameter uncertainty and state estimation errors that have not been achieved simultaneously in previous studies. The uniform ultimate boundedness (UUB)/convergence of error trajectories is guaranteed using a Lyapunov stability argument. The soundness of the proposed approach is validated on the influenza epidemics of an interactive human society with a population of 16000. Simulation results show that the QP-RCLF-EMCKF achieves appropriate tracking and state estimation performance. The robustness of the proposed controller is finally illustrated in the presence of modeling error and non-Gaussian noise.
△ Less
Submitted 11 November, 2020; v1 submitted 26 May, 2020;
originally announced May 2020.
-
The 1st Agriculture-Vision Challenge: Methods and Results
Authors:
Mang Tik Chiu,
Xingqian Xu,
Kai Wang,
Jennifer Hobbs,
Naira Hovakimyan,
Thomas S. Huang,
Honghui Shi,
Yunchao Wei,
Zilong Huang,
Alexander Schwing,
Robert Brunner,
Ivan Dozier,
Wyatt Dozier,
Karen Ghandilyan,
David Wilson,
Hyunseong Park,
Junhee Kim,
Sungho Kim,
Qinghui Liu,
Michael C. Kampffmeyer,
Robert Jenssen,
Arnt B. Salberg,
Alexandre Barbosa,
Rodrigo Trevisan,
Bingchen Zhao
, et al. (17 additional authors not shown)
Abstract:
The first Agriculture-Vision Challenge aims to encourage research in develo** novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation. The Agricultu…
▽ More
The first Agriculture-Vision Challenge aims to encourage research in develo** novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation. The Agriculture-Vision Challenge Dataset was employed, which comprises of 21,061 aerial and multi-spectral farmland images. This paper provides a summary of notable methods and results in the challenge. Our submission server and leaderboard will continue to open for researchers that are interested in this challenge dataset and task; the link can be found here.
△ Less
Submitted 23 April, 2020; v1 submitted 21 April, 2020;
originally announced April 2020.
-
OptiGAN: Generative Adversarial Networks for Goal Optimized Sequence Generation
Authors:
Mahmoud Hossam,
Trung Le,
Viet Huynh,
Michael Papasimeon,
Dinh Phung
Abstract:
One of the challenging problems in sequence generation tasks is the optimized generation of sequences with specific desired goals. Current sequential generative models mainly generate sequences to closely mimic the training data, without direct optimization of desired goals or properties specific to the task. We introduce OptiGAN, a generative model that incorporates both Generative Adversarial Ne…
▽ More
One of the challenging problems in sequence generation tasks is the optimized generation of sequences with specific desired goals. Current sequential generative models mainly generate sequences to closely mimic the training data, without direct optimization of desired goals or properties specific to the task. We introduce OptiGAN, a generative model that incorporates both Generative Adversarial Networks (GAN) and Reinforcement Learning (RL) to optimize desired goal scores using policy gradients. We apply our model to text and real-valued sequence generation, where our model is able to achieve higher desired scores out-performing GAN and RL baselines, while not sacrificing output sample diversity.
△ Less
Submitted 14 January, 2021; v1 submitted 16 April, 2020;
originally announced April 2020.
-
Deep Learning for The Inverse Design of Mid-infrared Graphene Plasmons
Authors:
Anh D. Phan,
Cuong V. Nguyen,
Pham T. Linh,
Tran V. Huynh,
Vu D. Lam,
Anh-Tuan Le
Abstract:
We theoretically investigate the plasmonic properties of mid-infrared graphene-based metamaterials and apply deep learning of a neural network for the inverse design. These artificial structures have square periodic arrays of graphene plasmonic resonators deposited on dielectric thin films. Optical spectra vary significantly with changes in structural parameters. Our numerical results are in accor…
▽ More
We theoretically investigate the plasmonic properties of mid-infrared graphene-based metamaterials and apply deep learning of a neural network for the inverse design. These artificial structures have square periodic arrays of graphene plasmonic resonators deposited on dielectric thin films. Optical spectra vary significantly with changes in structural parameters. Our numerical results are in accordance with previous experiments. Then, the theoretical approach is employed to generate data for training and testing deep neural networks. By merging the pre-trained neural network with the inverse network, we implement calculations for inverse design of the graphene-based metameterials. We also discuss the limitation of the data-driven approach.
△ Less
Submitted 19 February, 2020; v1 submitted 28 November, 2019;
originally announced November 2019.
-
Eye Semantic Segmentation with a Lightweight Model
Authors:
Van Thong Huynh,
Soo-Hyung Kim,
Guee-Sang Lee,
Hyung-Jeong Yang
Abstract:
In this paper, we present a multi-class eye segmentation method that can run the hardware limitations for real-time inference. Our approach includes three major stages: get a grayscale image from the input, segment three distinct eye region with a deep network, and remove incorrect areas with heuristic filters. Our model based on the encoder decoder structure with the key is the depthwise convolut…
▽ More
In this paper, we present a multi-class eye segmentation method that can run the hardware limitations for real-time inference. Our approach includes three major stages: get a grayscale image from the input, segment three distinct eye region with a deep network, and remove incorrect areas with heuristic filters. Our model based on the encoder decoder structure with the key is the depthwise convolution operation to reduce the computation cost. We experiment on OpenEDS, a large scale dataset of eye images captured by a head-mounted display with two synchronized eye facing cameras. We achieved the mean intersection over union (mIoU) of 94.85% with a model of size 0.4 megabytes. The source code are available https://github.com/th2l/Eye_VR_Segmentation
△ Less
Submitted 4 November, 2019;
originally announced November 2019.
-
Tree-Wasserstein Barycenter for Large-Scale Multilevel Clustering and Scalable Bayes
Authors:
Tam Le,
Viet Huynh,
Nhat Ho,
Dinh Phung,
Makoto Yamada
Abstract:
We study in this paper a variant of Wasserstein barycenter problem, which we refer to as tree-Wasserstein barycenter, by leveraging a specific class of ground metrics, namely tree metrics, for Wasserstein distance. Drawing on the tree structure, we propose an efficient algorithmic approach to solve the tree-Wasserstein barycenter and its variants. The proposed approach is not only fast for computa…
▽ More
We study in this paper a variant of Wasserstein barycenter problem, which we refer to as tree-Wasserstein barycenter, by leveraging a specific class of ground metrics, namely tree metrics, for Wasserstein distance. Drawing on the tree structure, we propose an efficient algorithmic approach to solve the tree-Wasserstein barycenter and its variants. The proposed approach is not only fast for computation but also efficient for memory usage. Exploiting the tree-Wasserstein barycenter and its variants, we scale up multi-level clustering and scalable Bayes, especially for large-scale applications where the number of supports in probability measures is large. Empirically, we test our proposed approach against other baselines on large-scale synthetic and real datasets.
△ Less
Submitted 26 February, 2020; v1 submitted 10 October, 2019;
originally announced October 2019.
-
On Efficient Multilevel Clustering via Wasserstein Distances
Authors:
Viet Huynh,
Nhat Ho,
Nhan Dam,
XuanLong Nguyen,
Mikhail Yurochkin,
Hung Bui,
and Dinh Phung
Abstract:
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grou** patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We p…
▽ More
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grou** patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose several variants of this problem, which admit fast optimization algorithms, by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, experimental results with both synthetic and real data are presented to demonstrate the flexibility and scalability of the proposed approach.
△ Less
Submitted 24 May, 2021; v1 submitted 18 September, 2019;
originally announced September 2019.
-
Probabilistic Multilevel Clustering via Composite Transportation Distance
Authors:
Nhat Ho,
Viet Huynh,
Dinh Phung,
Michael I. Jordan
Abstract:
We propose a novel probabilistic approach to multilevel clustering problems based on composite transportation distance, which is a variant of transportation distance where the underlying metric is Kullback-Leibler divergence. Our method involves solving a joint optimization problem over spaces of probability measures to simultaneously discover grou** structures within groups and among groups. By…
▽ More
We propose a novel probabilistic approach to multilevel clustering problems based on composite transportation distance, which is a variant of transportation distance where the underlying metric is Kullback-Leibler divergence. Our method involves solving a joint optimization problem over spaces of probability measures to simultaneously discover grou** structures within groups and among groups. By exploiting the connection of our method to the problem of finding composite transportation barycenters, we develop fast and efficient optimization algorithms even for potentially large-scale multilevel datasets. Finally, we present experimental results with both synthetic and real data to demonstrate the efficiency and scalability of the proposed approach.
△ Less
Submitted 28 October, 2018;
originally announced October 2018.
-
Multilevel Clustering via Wasserstein Means
Authors:
Nhat Ho,
XuanLong Nguyen,
Mikhail Yurochkin,
Hung Hai Bui,
Viet Huynh,
Dinh Phung
Abstract:
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grou** patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We p…
▽ More
We propose a novel approach to the problem of multilevel clustering, which aims to simultaneously partition data in each group and discover grou** patterns among groups in a potentially large hierarchically structured corpus of data. Our method involves a joint optimization formulation over several spaces of discrete probability measures, which are endowed with Wasserstein distance metrics. We propose a number of variants of this problem, which admit fast optimization algorithms, by exploiting the connection to the problem of finding Wasserstein barycenters. Consistency properties are established for the estimates of both local and global clusters. Finally, experiment results with both synthetic and real data are presented to demonstrate the flexibility and scalability of the proposed approach.
△ Less
Submitted 12 June, 2017;
originally announced June 2017.
-
Ekeland's inverse function theorem in graded Fréchet spaces revisited for multifunctions
Authors:
Van Ngai Huynh,
Michel Théra
Abstract:
In this paper, we present some implicit function theorems for set-valued map**s between Fréchet spaces. The proof relies on Lebesgue's Dominated Convergence Theorem and on Ekeland's variational principle. An application to the existence of solutions of differential equations in Fréchet spaces with non-smooth data is given.
In this paper, we present some implicit function theorems for set-valued map**s between Fréchet spaces. The proof relies on Lebesgue's Dominated Convergence Theorem and on Ekeland's variational principle. An application to the existence of solutions of differential equations in Fréchet spaces with non-smooth data is given.
△ Less
Submitted 22 February, 2017; v1 submitted 27 October, 2016;
originally announced October 2016.
-
Syntactic Phylogenetic Trees
Authors:
Kevin Shu,
Sharjeel Aziz,
Vy-Luan Huynh,
David Warrick,
Matilde Marcolli
Abstract:
In this paper we identify several serious problems that arise in the use of syntactic data from the SSWL database for the purpose of computational phylogenetic reconstruction. We show that the most naive approach fails to produce reliable linguistic phylogenetic trees. We identify some of the sources of the observed problems and we discuss how they may be, at least partly, corrected by using addit…
▽ More
In this paper we identify several serious problems that arise in the use of syntactic data from the SSWL database for the purpose of computational phylogenetic reconstruction. We show that the most naive approach fails to produce reliable linguistic phylogenetic trees. We identify some of the sources of the observed problems and we discuss how they may be, at least partly, corrected by using additional information, such as prior subdivision into language families and subfamilies, and a better use of the information about ancient languages. We also describe how the use of phylogenetic algebraic geometry can help in estimating to what extent the probability distribution at the leaves of the phylogenetic tree obtained from the SSWL data can be considered reliable, by testing it on phylogenetic trees established by other forms of linguistic analysis. In simple examples, we find that, after restricting to smaller language subfamilies and considering only those SSWL parameters that are fully mapped for the whole subfamily, the SSWL data match extremely well reliable phylogenetic trees, according to the evaluation of phylogenetic invariants. This is a promising sign for the use of SSWL data for linguistic phylogenetics.
△ Less
Submitted 10 July, 2016;
originally announced July 2016.
-
Novel primary photoexcitations in $π$-conjugated donor-acceptor copolymers probed by transient magneto-photoinduced-absorption
Authors:
Uyen N. V. Huynh,
Tek P. Basel,
L. Dou,
Karan Aryanpour,
Gang Li,
Sumit Mazumdar,
Eitan Ehrenfreund,
Yang Yang,
Z. Valy Vardeny
Abstract:
The saga of the primary photoexcitations in $π$-conjugated polymers has been a source of extraordinary scientific curiosity that has lasted for more than three decades. From soliton excitations in trans-polyacetylene, to singlet and triplet excitons and polarons in other polymers, to charge transfer excitons in blends of polymers and fullerenes, the field has been rich with a variety of different…
▽ More
The saga of the primary photoexcitations in $π$-conjugated polymers has been a source of extraordinary scientific curiosity that has lasted for more than three decades. From soliton excitations in trans-polyacetylene, to singlet and triplet excitons and polarons in other polymers, to charge transfer excitons in blends of polymers and fullerenes, the field has been rich with a variety of different photoexcitation species. Here we show the photogeneration of a novel primary intrachain photoexcitation species, namely the composite multi-exciton (CME) in $π$-conjugated donor-acceptor (DA)-copolymers used in organic photovoltaic (OPV) solar cells. We utilized the magnetic field response of the transient photoinduced absorption from sub-picosecond to millisecond to show in pristine DA-copolymer early photogeneration of the CME species that is composed of four coupled spin 1/2 particles, having unique optical and magnetic signatures. This species decomposes into two independent triplets in the microsecond time domain. Importantly in copolymer/fullerene blends the CME ionization generates photocarriers by a unique process that may enhance the photocurrent in OPV solar cells.
△ Less
Submitted 15 October, 2015;
originally announced October 2015.
-
Directional Hölder Metric Regularity
Authors:
Van Ngai Huynh,
Huu Tron Nguyen,
Michel Théra
Abstract:
This paper sheds new light on regularity of multifunctions through various characterizations of directional Hölder /Lipschitz metric regularity, which are based on the concepts of slope and coderivative. By using these characterizations, we show that directional Hölder /Lipschitz metric regularity is stable, when the multifunction under consideration is perturbed suitably. Applications of directio…
▽ More
This paper sheds new light on regularity of multifunctions through various characterizations of directional Hölder /Lipschitz metric regularity, which are based on the concepts of slope and coderivative. By using these characterizations, we show that directional Hölder /Lipschitz metric regularity is stable, when the multifunction under consideration is perturbed suitably. Applications of directional Hölder /Lipschitz metric regularity to investigate the stability and the sensitivity analysis of parameterized optimization problems are also discussed.
△ Less
Submitted 8 August, 2015;
originally announced August 2015.
-
Theory of Primary Photoexcitations in Donor-Acceptor Copolymers
Authors:
Karan Aryanpour,
Tirthankar Dutta,
Uyen N. V. Huynh,
Zeev Valy Vardeny,
Sumit Mazumdar
Abstract:
We present a generic theory of primary photoexcitations in low band gap donor-acceptor conjugated copolymers. Because of the combined effects of strong electron correlations and broken symmetry, there is considerable mixing between a charge-transfer exciton and an energetically proximate triplet-triplet state with an overall spin singlet. The triplet-triplet state, optically forbidden in homopolym…
▽ More
We present a generic theory of primary photoexcitations in low band gap donor-acceptor conjugated copolymers. Because of the combined effects of strong electron correlations and broken symmetry, there is considerable mixing between a charge-transfer exciton and an energetically proximate triplet-triplet state with an overall spin singlet. The triplet-triplet state, optically forbidden in homopolymers, is allowed in donor-acceptor copolymers. For an intermediate difference in electron affinities of the donor and the acceptor, the triplet-triplet state can have stronger oscillator strength than the charge-transfer exciton. We discuss the possibility of intramolecular singlet fission from the triplet-triplet state, and how such fission can be detected experimentally.
△ Less
Submitted 30 December, 2015; v1 submitted 31 July, 2015;
originally announced August 2015.
-
A Martingale Approach and Time-Consistent Sampling-based Algorithms for Risk Management in Stochastic Optimal Control
Authors:
Vu Anh Huynh,
Leonid Kogan,
Emilio Frazzoli
Abstract:
In this paper, we consider a class of stochastic optimal control problems with risk constraints that are expressed as bounded probabilities of failure for particular initial states. We present here a martingale approach that diffuses a risk constraint into a martingale to construct time-consistent control policies. The martingale stands for the level of risk tolerance over time. By augmenting the…
▽ More
In this paper, we consider a class of stochastic optimal control problems with risk constraints that are expressed as bounded probabilities of failure for particular initial states. We present here a martingale approach that diffuses a risk constraint into a martingale to construct time-consistent control policies. The martingale stands for the level of risk tolerance over time. By augmenting the system dynamics with the controlled martingale, the original risk-constrained problem is transformed into a stochastic target problem. We extend the incremental Markov Decision Process (iMDP) algorithm to approximate arbitrarily well an optimal feedback policy of the original problem by sampling in the augmented state space and computing proper boundary conditions for the reformulated problem. We show that the algorithm is both probabilistically sound and asymptotically optimal. The performance of the proposed algorithm is demonstrated on motion planning and control problems subject to bounded probability of collision in uncertain cluttered environments.
△ Less
Submitted 8 July, 2015; v1 submitted 29 December, 2013;
originally announced December 2013.
-
Metric regularity of composition set-valued map**s: metric setting and coderivative conditions
Authors:
Marius Durea,
Van Ngai Huynh,
Huu Tron Nguyen,
Radu Strugariu
Abstract:
The paper concerns a new method to obtain a direct proof of the openness at linear rate/metric regularity of composite set-valued maps on metric spaces by the unification and refinement of several methods developed somehow separately in several works of the authors. In fact, this work is a synthesis and a precise specialization to a general situation of some techniques explored in the last years i…
▽ More
The paper concerns a new method to obtain a direct proof of the openness at linear rate/metric regularity of composite set-valued maps on metric spaces by the unification and refinement of several methods developed somehow separately in several works of the authors. In fact, this work is a synthesis and a precise specialization to a general situation of some techniques explored in the last years in the literature. In turn, these techniques are based on several important concepts (like error bounds, lower semicontinuous envelope of a set-valued map, local composition stability of multifunctions) and allow us to obtain two new proofs of a recent result having deep roots in the topic of regularity of map**s. Moreover, we make clear the idea that it is possible to use (co)derivative conditions as tools of proof for openness results in very general situations.
△ Less
Submitted 15 October, 2012;
originally announced October 2012.
-
An Incremental Sampling-based Algorithm for Stochastic Optimal Control
Authors:
Vu Anh Huynh,
Sertac Karaman,
Emilio Frazzoli
Abstract:
In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning, we propose a novel algorithm called the incremental Markov Decision Process (iMDP) to compute incrementally control policies that approximate arbitrarily well a…
▽ More
In this paper, we consider a class of continuous-time, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation methods and sampling-based algorithms for deterministic path planning, we propose a novel algorithm called the incremental Markov Decision Process (iMDP) to compute incrementally control policies that approximate arbitrarily well an optimal policy in terms of the expected cost. The main idea behind the algorithm is to generate a sequence of finite discretizations of the original problem through random sampling of the state space. At each iteration, the discretized problem is a Markov Decision Process that serves as an incrementally refined model of the original problem. We show that with probability one, (i) the sequence of the optimal value functions for each of the discretized problems converges uniformly to the optimal value function of the original stochastic optimal control problem, and (ii) the original optimal value function can be computed efficiently in an incremental manner using asynchronous value iterations. Thus, the proposed algorithm provides an anytime approach to the computation of optimal control policies of the continuous problem. The effectiveness of the proposed approach is demonstrated on motion planning and control problems in cluttered environments in the presence of process noise.
△ Less
Submitted 24 February, 2012;
originally announced February 2012.
-
Non-abelian Reidemeister torsion for twist knots
Authors:
Jérôme Dubois,
Vu Huynh,
Yoshikazu Yamaguchi
Abstract:
This paper gives an explicit formula for the SL_2(C)-non-abelian Reidemeister torsion as defined in [Dub06] in the case of twist knots. For hyperbolic twist knots, we also prove that the non-abelian Reidemeister torsion at the holonomy representation can be expressed as a rational function evaluated at the cusp shape of the knot.
This paper gives an explicit formula for the SL_2(C)-non-abelian Reidemeister torsion as defined in [Dub06] in the case of twist knots. For hyperbolic twist knots, we also prove that the non-abelian Reidemeister torsion at the holonomy representation can be expressed as a rational function evaluated at the cusp shape of the knot.
△ Less
Submitted 9 March, 2008; v1 submitted 14 June, 2007;
originally announced June 2007.
-
Twisted Alexander polynomial of links in the projective space
Authors:
Vu Q. Huynh,
Thang T. Q. Le
Abstract:
We use Reidemeister torsion to study a twisted Alexander polynomial, as defined by Turaev, for links in the projective space. Using sign-refined torsion we derive a skein relation for a normalized form of this polynomial.
We use Reidemeister torsion to study a twisted Alexander polynomial, as defined by Turaev, for links in the projective space. Using sign-refined torsion we derive a skein relation for a normalized form of this polynomial.
△ Less
Submitted 13 June, 2007;
originally announced June 2007.
-
On the Colored Jones Polynomial and the Kashaev invariant
Authors:
Vu Huynh,
Thang T. Q. Le
Abstract:
We express the colored Jones polynomial as the inverse of the quantum determinant of a matrix with entries in the $q$-Weyl algebra of $q$-operators, evaluated at the trivial function (plus simple substitutions). The Kashaev invariant is proved to be equal to another special evaluation of the determinant. We also discuss the similarity between our determinant formula of the Kashaev invariant and…
▽ More
We express the colored Jones polynomial as the inverse of the quantum determinant of a matrix with entries in the $q$-Weyl algebra of $q$-operators, evaluated at the trivial function (plus simple substitutions). The Kashaev invariant is proved to be equal to another special evaluation of the determinant. We also discuss the similarity between our determinant formula of the Kashaev invariant and the determinant formula of the hyperbolic volume of knot complements, ho** it would lead to a proof of the volume conjecture.
△ Less
Submitted 15 March, 2005;
originally announced March 2005.