Search | arXiv e-print repository

MTS-Net: Dual-Enhanced Positional Multi-Head Self-Attention for 3D CT Diagnosis of May-Thurner Syndrome

Authors: Yixin Huang, Yiqi **, Ke Tao, Kaijian Xia, Jianfeng Gu, Lei Yu, Lan Du, Cunjian Chen

Abstract: May-Thurner Syndrome (MTS), also known as iliac vein compression syndrome or Cockett's syndrome, is a condition potentially impacting over 20 percent of the population, leading to an increased risk of iliofemoral deep venous thrombosis. In this paper, we present a 3D-based deep learning approach called MTS-Net for diagnosing May-Thurner Syndrome using CT scans. To effectively capture the spatial-t… ▽ More May-Thurner Syndrome (MTS), also known as iliac vein compression syndrome or Cockett's syndrome, is a condition potentially impacting over 20 percent of the population, leading to an increased risk of iliofemoral deep venous thrombosis. In this paper, we present a 3D-based deep learning approach called MTS-Net for diagnosing May-Thurner Syndrome using CT scans. To effectively capture the spatial-temporal relationship among CT scans and emulate the clinical process of diagnosing MTS, we propose a novel attention module called the dual-enhanced positional multi-head self-attention (DEP-MHSA). The proposed DEP-MHSA reconsiders the role of positional embedding and incorporates a dual-enhanced positional embedding in both attention weights and residual connections. Further, we establish a new dataset, termed MTS-CT, consisting of 747 subjects. Experimental results demonstrate that our proposed approach achieves state-of-the-art MTS diagnosis results, and our self-attention design facilitates the spatial-temporal modeling. We believe that our DEP-MHSA is more suitable to handle CT image sequence modeling and the proposed dataset enables future research on MTS diagnosis. We make our code and dataset publicly available at: https://github.com/Nutingnon/MTS_dep_mhsa. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2403.12982 [pdf]

Knowledge-Reuse Transfer Learning Methods in Molecular and Material Science

Authors: An Chen, Zhilong Wang, Karl Luigi Loza Vidaurre, Yanqiang Han, Simin Ye, Kehao Tao, Shiwei Wang, **g Gao, **** Li

Abstract: Molecules and materials are the foundation for the development of modern advanced industries such as energy storage systems and semiconductor devices. However, traditional trial-and-error methods or theoretical calculations are highly resource-intensive, and extremely long R&D (Research and Development) periods cannot meet the urgent need for molecules/materials in industrial development. Machine… ▽ More Molecules and materials are the foundation for the development of modern advanced industries such as energy storage systems and semiconductor devices. However, traditional trial-and-error methods or theoretical calculations are highly resource-intensive, and extremely long R&D (Research and Development) periods cannot meet the urgent need for molecules/materials in industrial development. Machine learning (ML) methods based on big data are expected to break this dilemma. However, the difficulty in constructing large-scale datasets of new molecules/materials due to the high cost of data acquisition and annotation limits the development of machine learning. The application of transfer learning lowers the data requirements for model training, which makes transfer learning stand out in researches addressing data quality issues. In this review, we summarize recent advances in transfer learning related to molecular and materials science. We focus on the application of transfer learning methods for the discovery of advanced molecules/materials, particularly, the construction of transfer learning frameworks for different systems, and how transfer learning can enhance the performance of models. In addition, the challenges of transfer learning are also discussed. △ Less

Submitted 2 March, 2024; originally announced March 2024.

Comments: 42 pages, 10 figures

arXiv:2401.12761 [pdf, other]

MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty

Authors: Tim Brödermann, David Bruggemann, Christos Sakaridis, Kevin Ta, Odysseas Liagouris, Jason Corkill, Luc Van Gool

Abstract: Achieving level-5 driving automation in autonomous vehicles necessitates a robust semantic visual perception system capable of parsing data from different sensors across diverse conditions. However, existing semantic perception datasets often lack important non-camera modalities typically used in autonomous vehicles, or they do not exploit such modalities to aid and improve semantic annotations in… ▽ More Achieving level-5 driving automation in autonomous vehicles necessitates a robust semantic visual perception system capable of parsing data from different sensors across diverse conditions. However, existing semantic perception datasets often lack important non-camera modalities typically used in autonomous vehicles, or they do not exploit such modalities to aid and improve semantic annotations in challenging conditions. To address this, we introduce MUSES, the MUlti-SEnsor Semantic perception dataset for driving in adverse conditions under increased uncertainty. MUSES includes synchronized multimodal recordings with 2D panoptic annotations for 2500 images captured under diverse weather and illumination. The dataset integrates a frame camera, a lidar, a radar, an event camera, and an IMU/GNSS sensor. Our new two-stage panoptic annotation protocol captures both class-level and instance-level uncertainty in the ground truth and enables the novel task of uncertainty-aware panoptic segmentation we introduce, along with standard semantic and panoptic segmentation. MUSES proves both effective for training and challenging for evaluating models under diverse visual conditions, and it opens new avenues for research in multimodal and uncertainty-aware dense semantic perception. Our dataset and benchmark will be made publicly available. △ Less

Submitted 23 January, 2024; originally announced January 2024.

arXiv:2401.12722 [pdf, other]

Falcon: Fair Active Learning using Multi-armed Bandits

Authors: Ki Hyun Tae, Hantian Zhang, Jaeyoung Park, Kexin Rong, Steven Euijong Whang

Abstract: Biased data can lead to unfair machine learning models, highlighting the importance of embedding fairness at the beginning of data analysis, particularly during dataset curation and labeling. In response, we propose Falcon, a scalable fair active learning framework. Falcon adopts a data-centric approach that improves machine learning model fairness via strategic sample selection. Given a user-spec… ▽ More Biased data can lead to unfair machine learning models, highlighting the importance of embedding fairness at the beginning of data analysis, particularly during dataset curation and labeling. In response, we propose Falcon, a scalable fair active learning framework. Falcon adopts a data-centric approach that improves machine learning model fairness via strategic sample selection. Given a user-specified group fairness measure, Falcon identifies samples from "target groups" (e.g., (attribute=female, label=positive)) that are the most informative for improving fairness. However, a challenge arises since these target groups are defined using ground truth labels that are not available during sample selection. To handle this, we propose a novel trial-and-error method, where we postpone using a sample if the predicted label is different from the expected one and falls outside the target group. We also observe the trade-off that selecting more informative samples results in higher likelihood of postponing due to undesired label prediction, and the optimal balance varies per dataset. We capture the trade-off between informativeness and postpone rate as policies and propose to automatically select the best policy using adversarial multi-armed bandit methods, given their computational efficiency and theoretical guarantees. Experiments show that Falcon significantly outperforms existing fair active learning approaches in terms of fairness and accuracy and is more efficient. In particular, only Falcon supports a proper trade-off between accuracy and fairness where its maximum fairness score is 1.8-4.5x higher than the second-best results. △ Less

Submitted 23 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: Accepted to VLDB 2024

arXiv:2311.13052 [pdf, other]

Novel OCT mosaicking pipeline with Feature- and Pixel-based registration

Authors: Jiacheng Wang, Hao Li, Dewei Hu, Yuankai K. Tao, Ipek Oguz

Abstract: High-resolution Optical Coherence Tomography (OCT) images are crucial for ophthalmology studies but are limited by their relatively narrow field of view (FoV). Image mosaicking is a technique for aligning multiple overlap** images to obtain a larger FoV. Current mosaicking pipelines often struggle with substantial noise and considerable displacement between the input sub-fields. In this paper, w… ▽ More High-resolution Optical Coherence Tomography (OCT) images are crucial for ophthalmology studies but are limited by their relatively narrow field of view (FoV). Image mosaicking is a technique for aligning multiple overlap** images to obtain a larger FoV. Current mosaicking pipelines often struggle with substantial noise and considerable displacement between the input sub-fields. In this paper, we propose a versatile pipeline for stitching multi-view OCT/OCTA \textit{en face} projection images. Our method combines the strengths of learning-based feature matching and robust pixel-based registration to align multiple images effectively. Furthermore, we advance the application of a trained foundational model, Segment Anything Model (SAM), to validate mosaicking results in an unsupervised manner. The efficacy of our pipeline is validated using an in-house dataset and a large public dataset, where our method shows superior performance in terms of both accuracy and computational efficiency. We also made our evaluation tool for image mosaicking and the corresponding pipeline publicly available at \url{https://github.com/MedICL-VU/OCT-mosaicking}. △ Less

Submitted 21 November, 2023; originally announced November 2023.

arXiv:2311.05077 [pdf, other]

POISE: Pose Guided Human Silhouette Extraction under Occlusions

Authors: Arindam Dutta, Rohit Lal, Dripta S. Raychaudhuri, Calvin Khang Ta, Amit K. Roy-Chowdhury

Abstract: Human silhouette extraction is a fundamental task in computer vision with applications in various downstream tasks. However, occlusions pose a significant challenge, leading to incomplete and distorted silhouettes. To address this challenge, we introduce POISE: Pose Guided Human Silhouette Extraction under Occlusions, a novel self-supervised fusion framework that enhances accuracy and robustness i… ▽ More Human silhouette extraction is a fundamental task in computer vision with applications in various downstream tasks. However, occlusions pose a significant challenge, leading to incomplete and distorted silhouettes. To address this challenge, we introduce POISE: Pose Guided Human Silhouette Extraction under Occlusions, a novel self-supervised fusion framework that enhances accuracy and robustness in human silhouette prediction. By combining initial silhouette estimates from a segmentation model with human joint predictions from a 2D pose estimation model, POISE leverages the complementary strengths of both approaches, effectively integrating precise body shape information and spatial information to tackle occlusions. Furthermore, the self-supervised nature of \POISE eliminates the need for costly annotations, making it scalable and practical. Extensive experimental results demonstrate its superiority in improving silhouette extraction under occlusions, with promising results in downstream tasks such as gait recognition. The code for our method is available https://github.com/take2rohit/poise. △ Less

Submitted 8 November, 2023; originally announced November 2023.

Journal ref: Winter Conference on Applications of Computer Vision, 2024

arXiv:2308.03218 [pdf]

The 1/4 occupied O atoms induced ultraflat band and the one dimensional channels in the Pb$_{10-x}$Cu$_{x}$(PO$_4$)$_{6}$O$_{4}$ (x=0,0.5) crystal

Authors: Kun Tao, Rongrong Chen, Lei Yang, ** Gao, Desheng Xue, Chenglong Jia

Abstract: The search for room-temperature superconductors has been a long-standing goal in condensed matter physics. In this study, we investigate the electronic and geometric properties of lead apatite with and without Cu doped within the frame work of the density functional theory. Based on our calculations, we found that without the Cu doped the lead apatite shows an insulator character with flat bands s… ▽ More The search for room-temperature superconductors has been a long-standing goal in condensed matter physics. In this study, we investigate the electronic and geometric properties of lead apatite with and without Cu doped within the frame work of the density functional theory. Based on our calculations, we found that without the Cu doped the lead apatite shows an insulator character with flat bands straddle the Fermi level. Once we introduce the O1 vacancies, the flat bands disappear. Furthermore, we analyze the effects of Cu do** on the crystal structure and electronic band structure of the material. Our calculations reveal the presence of one-dimensional channels induced by fully occupied O1 atoms, that are only 1/4 occupied in the literature, which may play a crucial role in the realization of room-temperature superconductivity. Based on our findings, we propose a possible solution to improve the quality of superconductivity by annealing the material in an oxygen atmosphere. These results contribute to a better understanding of the unusual properties of Cu-doped lead apatite and will pave the way for further exploration of its potential as a room-temperature superconductor. △ Less

Submitted 23 November, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

Comments: 8 pages, 7 figures

arXiv:2307.00245 [pdf, other]

Deep Angiogram: Trivializing Retinal Vessel Segmentation

Authors: Dewei Hu, Xing Yao, Jiacheng Wang, Yuankai K. Tao, Ipek Oguz

Abstract: Among the research efforts to segment the retinal vasculature from fundus images, deep learning models consistently achieve superior performance. However, this data-driven approach is very sensitive to domain shifts. For fundus images, such data distribution changes can easily be caused by variations in illumination conditions as well as the presence of disease-related features such as hemorrhages… ▽ More Among the research efforts to segment the retinal vasculature from fundus images, deep learning models consistently achieve superior performance. However, this data-driven approach is very sensitive to domain shifts. For fundus images, such data distribution changes can easily be caused by variations in illumination conditions as well as the presence of disease-related features such as hemorrhages and drusen. Since the source domain may not include all possible types of pathological cases, a model that can robustly recognize vessels on unseen domains is desirable but remains elusive, despite many proposed segmentation networks of ever-increasing complexity. In this work, we propose a contrastive variational auto-encoder that can filter out irrelevant features and synthesize a latent image, named deep angiogram, representing only the retinal vessels. Then segmentation can be readily accomplished by thresholding the deep angiogram. The generalizability of the synthetic network is improved by the contrastive loss that makes the model less sensitive to variations of image contrast and noisy features. Compared to baseline deep segmentation networks, our model achieves higher segmentation performance via simple thresholding. Our experiments show that the model can generate stable angiograms on different target domains, providing excellent visualization of vessels and a non-invasive, safe alternative to fluorescein angiography. △ Less

Submitted 1 July, 2023; originally announced July 2023.

Comments: 5 pages, 4 figures, SPIE 2023

Journal ref: In Medical Imaging 2023: Image Processing, vol. 12464, pp. 656-660. SPIE, 2023

arXiv:2306.11048 [pdf, other]

UncLe-SLAM: Uncertainty Learning for Dense Neural SLAM

Authors: Erik Sandström, Kevin Ta, Luc Van Gool, Martin R. Oswald

Abstract: We present an uncertainty learning framework for dense neural simultaneous localization and map** (SLAM). Estimating pixel-wise uncertainties for the depth input of dense SLAM methods allows re-weighing the tracking and map** losses towards image regions that contain more suitable information that is more reliable for SLAM. To this end, we propose an online framework for sensor uncertainty est… ▽ More We present an uncertainty learning framework for dense neural simultaneous localization and map** (SLAM). Estimating pixel-wise uncertainties for the depth input of dense SLAM methods allows re-weighing the tracking and map** losses towards image regions that contain more suitable information that is more reliable for SLAM. To this end, we propose an online framework for sensor uncertainty estimation that can be trained in a self-supervised manner from only 2D input data. We further discuss the advantages of the uncertainty learning for the case of multi-sensor input. Extensive analysis, experimentation, and ablations show that our proposed modeling paradigm improves both map** and tracking accuracy and often performs better than alternatives that require ground truth depth or 3D. Our experiments show that we achieve a 38\% and 27\% lower absolute trajectory tracking error (ATE) on the 7-Scenes and TUM-RGBD datasets respectively. On the popular Replica dataset using two types of depth sensors, we report an 11\% F1-score improvement on RGBD SLAM compared to the recent state-of-the-art neural implicit approaches. Source code: https://github.com/kev-in-ta/UncLe-SLAM. △ Less

Submitted 6 September, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

Comments: ICCV 2023 Workshop. 20 pages, 9 figures

arXiv:2305.14032 [pdf, other]

doi 10.21437/Interspeech.2023-1426

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Authors: Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

Abstract: Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study,… ▽ More Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%. △ Less

Submitted 22 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: INTERSPEECH 2023, Code URL: https://github.com/raymin0223/patch-mix_contrastive_learning

arXiv:2303.11651 [pdf]

AlphaMat: A Material Informatics Hub Connecting Data, Features, Models and Applications

Authors: Zhilong Wang, Junfei Cai, An Chen, Yanqiang Han, Kehao Tao, Simin Ye, Shiwei Wang, Imran Ali, **** Li

Abstract: The development of modern civil industry, energy and information technology is inseparable from the rapid explorations of new materials, which are hampered by months to years of painstaking attempts, resulting in only a small fraction of materials being determined in a vast chemical space. Artificial intelligence (AI)-based methods are promising to address this gap, but face many challenges such a… ▽ More The development of modern civil industry, energy and information technology is inseparable from the rapid explorations of new materials, which are hampered by months to years of painstaking attempts, resulting in only a small fraction of materials being determined in a vast chemical space. Artificial intelligence (AI)-based methods are promising to address this gap, but face many challenges such as data scarcity and inaccurate material descriptor coding. Here, we develop an AI platform, AlphaMat, that connects materials and applications. AlphaMat is not limited by the data scale (from 101 to 106) and can design structural and component descriptors that are effective for docking with various AI models. With prediction time of milliseconds and high accuracy, AlphaMat exhibits strong powers to model at least 12 common attributes (formation energy, band gap, ionic conductivity, magnetism, phonon property, bulk modulus, dielectric constant, adsorption energy, etc.), resulting in an unexplored material database with over 117,000 entries. We further demonstrate the ability of AlphaMat to mine and design materials, which successfully discover thousands of new materials in photonics, batteries, catalysts, and capacitors from the largest inorganic compound databases that cover all elements in periodic table. This work proposes the first material informatics hub that does not require users to have strong programming knowledge to build AI models to design materials. Users can either directly retrieve our database or easily build AI models through AlphaMat to discover and design the required materials. AlphaMat can shorten the cycle of database construction and material discovery by at least decades, and its effective use will facilitate the applications of AI technology in material science and lead scientific and technological progress to a new height. △ Less

Submitted 21 March, 2023; originally announced March 2023.

arXiv:2301.01357 [pdf, ps, other]

A noetherian criterion for sequences of modules

Authors: Wee Liang Gan, Khoa Ta

Abstract: We prove a noetherian criterion for a sequence of modules equipped with linear maps between them. This generalizes a noetherian criterion of Gan and Li for infinite EI categories. We apply our criterion to the linear categories associated to certain diagram algebras defined by Patzt. We prove a noetherian criterion for a sequence of modules equipped with linear maps between them. This generalizes a noetherian criterion of Gan and Li for infinite EI categories. We apply our criterion to the linear categories associated to certain diagram algebras defined by Patzt. △ Less

Submitted 25 June, 2024; v1 submitted 3 January, 2023; originally announced January 2023.

arXiv:2209.07047 [pdf, other]

iFlipper: Label Flip** for Individual Fairness

Authors: Hantian Zhang, Ki Hyun Tae, Jaeyoung Park, Xu Chu, Steven Euijong Whang

Abstract: As machine learning becomes prevalent, mitigating any unfairness present in the training data becomes critical. Among the various notions of fairness, this paper focuses on the well-known individual fairness, which states that similar individuals should be treated similarly. While individual fairness can be improved when training a model (in-processing), we contend that fixing the data before mode… ▽ More As machine learning becomes prevalent, mitigating any unfairness present in the training data becomes critical. Among the various notions of fairness, this paper focuses on the well-known individual fairness, which states that similar individuals should be treated similarly. While individual fairness can be improved when training a model (in-processing), we contend that fixing the data before model training (pre-processing) is a more fundamental solution. In particular, we show that label flip** is an effective pre-processing technique for improving individual fairness. Our system iFlipper solves the optimization problem of minimally flip** labels given a limit to the individual fairness violations, where a violation occurs when two similar examples in the training data have different labels. We first prove that the problem is NP-hard. We then propose an approximate linear programming algorithm and provide theoretical guarantees on how close its result is to the optimal solution in terms of the number of label flips. We also propose techniques for making the linear programming solution more optimal without exceeding the violations limit. Experiments on real datasets show that iFlipper significantly outperforms other pre-processing baselines in terms of individual fairness and accuracy on unseen test sets. In addition, iFlipper can be combined with in-processing techniques for even better results. △ Less

Submitted 15 September, 2022; originally announced September 2022.

Comments: 20 pages, 19 figures, 8 tables

arXiv:2207.01009 [pdf, other]

L2E: Lasers to Events for 6-DoF Extrinsic Calibration of Lidars and Event Cameras

Authors: Kevin Ta, David Bruggemann, Tim Brödermann, Christos Sakaridis, Luc Van Gool

Abstract: As neuromorphic technology is maturing, its application to robotics and autonomous vehicle systems has become an area of active research. In particular, event cameras have emerged as a compelling alternative to frame-based cameras in low-power and latency-demanding applications. To enable event cameras to operate alongside staple sensors like lidar in perception tasks, we propose a direct, tempora… ▽ More As neuromorphic technology is maturing, its application to robotics and autonomous vehicle systems has become an area of active research. In particular, event cameras have emerged as a compelling alternative to frame-based cameras in low-power and latency-demanding applications. To enable event cameras to operate alongside staple sensors like lidar in perception tasks, we propose a direct, temporally-decoupled extrinsic calibration method between event cameras and lidars. The high dynamic range, high temporal resolution, and low-latency operation of event cameras are exploited to directly register lidar laser returns, allowing information-based correlation methods to optimize for the 6-DoF extrinsic calibration between the two sensors. This paper presents the first direct calibration method between event cameras and lidars, removing dependencies on frame-based camera intermediaries and/or highly-accurate hand measurements. △ Less

Submitted 20 February, 2023; v1 submitted 3 July, 2022; originally announced July 2022.

Comments: Accepted to ICRA2023

arXiv:2205.08497 [pdf, ps, other]

Feature Aggregation in Zero-Shot Cross-Lingual Transfer Using Multilingual BERT

Authors: Beiduo Chen, Wu Guo, Quan Liu, Kun Tao

Abstract: Multilingual BERT (mBERT), a language model pre-trained on large multilingual corpora, has impressive zero-shot cross-lingual transfer capabilities and performs surprisingly well on zero-shot POS tagging and Named Entity Recognition (NER), as well as on cross-lingual model transfer. At present, the mainstream methods to solve the cross-lingual downstream tasks are always using the last transformer… ▽ More Multilingual BERT (mBERT), a language model pre-trained on large multilingual corpora, has impressive zero-shot cross-lingual transfer capabilities and performs surprisingly well on zero-shot POS tagging and Named Entity Recognition (NER), as well as on cross-lingual model transfer. At present, the mainstream methods to solve the cross-lingual downstream tasks are always using the last transformer layer's output of mBERT as the representation of linguistic information. In this work, we explore the complementary property of lower layers to the last transformer layer of mBERT. A feature aggregation module based on an attention mechanism is proposed to fuse the information contained in different layers of mBERT. The experiments are conducted on four zero-shot cross-lingual transfer datasets, and the proposed method obtains performance improvements on key multilingual benchmark tasks XNLI (+1.5 %), PAWS-X (+2.4 %), NER (+1.2 F1), and POS (+1.5 F1). Through the analysis of the experimental results, we prove that the layers before the last layer of mBERT can provide extra useful information for cross-lingual downstream tasks and explore the interpretability of mBERT empirically. △ Less

Submitted 17 May, 2022; originally announced May 2022.

Comments: Accepted by ICPR 2022

arXiv:2201.11760 [pdf, other]

Unsupervised Denoising of Retinal OCT with Diffusion Probabilistic Model

Authors: Dewei Hu, Yuankai K. Tao, Ipek Oguz

Abstract: Optical coherence tomography (OCT) is a prevalent non-invasive imaging method which provides high resolution volumetric visualization of retina. However, its inherent defect, the speckle noise, can seriously deteriorate the tissue visibility in OCT. Deep learning based approaches have been widely used for image restoration, but most of these require a noise-free reference image for supervision. In… ▽ More Optical coherence tomography (OCT) is a prevalent non-invasive imaging method which provides high resolution volumetric visualization of retina. However, its inherent defect, the speckle noise, can seriously deteriorate the tissue visibility in OCT. Deep learning based approaches have been widely used for image restoration, but most of these require a noise-free reference image for supervision. In this study, we present a diffusion probabilistic model that is fully unsupervised to learn from noise instead of signal. A diffusion process is defined by adding a sequence of Gaussian noise to self-fused OCT b-scans. Then the reverse process of diffusion, modeled by a Markov chain, provides an adjustable level of denoising. Our experiment results demonstrate that our method can significantly improve the image quality with a simple working pipeline and a small amount of training data. △ Less

Submitted 27 January, 2022; originally announced January 2022.

Comments: SPIE medical imaging, 2022

arXiv:2107.04288 [pdf, other]

Retinal OCT Denoising with Pseudo-Multimodal Fusion Network

Authors: Dewei Hu, Joseph D. Malone, Yigit Atay, Yuankai K. Tao, Ipek Oguz

Abstract: Optical coherence tomography (OCT) is a prevalent imaging technique for retina. However, it is affected by multiplicative speckle noise that can degrade the visibility of essential anatomical structures, including blood vessels and tissue layers. Although averaging repeated B-scan frames can significantly improve the signal-to-noise-ratio (SNR), this requires longer acquisition time, which can int… ▽ More Optical coherence tomography (OCT) is a prevalent imaging technique for retina. However, it is affected by multiplicative speckle noise that can degrade the visibility of essential anatomical structures, including blood vessels and tissue layers. Although averaging repeated B-scan frames can significantly improve the signal-to-noise-ratio (SNR), this requires longer acquisition time, which can introduce motion artifacts and cause discomfort to patients. In this study, we propose a learning-based method that exploits information from the single-frame noisy B-scan and a pseudo-modality that is created with the aid of the self-fusion method. The pseudo-modality provides good SNR for layers that are barely perceptible in the noisy B-scan but can over-smooth fine features such as small vessels. By using a fusion network, desired features from each modality can be combined, and the weight of their contribution is adjustable. Evaluated by intensity-based and structural metrics, the result shows that our method can effectively suppress the speckle noise and enhance the contrast between retina layers while the overall structure and small blood vessels are preserved. Compared to the single modality network, our method improves the structural similarity with low noise B-scan from 0.559 +\- 0.033 to 0.576 +\- 0.031. △ Less

Submitted 9 July, 2021; originally announced July 2021.

Comments: Accepted by International Workshop on Ophthalmic Medical Image Analysis (OMIA) 2020

arXiv:2107.04282 [pdf, other]

LIFE: A Generalizable Autodidactic Pipeline for 3D OCT-A Vessel Segmentation

Authors: Dewei Hu, Can Cui, Hao Li, Kathleen E. Larson, Yuankai K. Tao, Ipek Oguz

Abstract: Optical coherence tomography (OCT) is a non-invasive imaging technique widely used for ophthalmology. It can be extended to OCT angiography (OCT-A), which reveals the retinal vasculature with improved contrast. Recent deep learning algorithms produced promising vascular segmentation results; however, 3D retinal vessel segmentation remains difficult due to the lack of manually annotated training da… ▽ More Optical coherence tomography (OCT) is a non-invasive imaging technique widely used for ophthalmology. It can be extended to OCT angiography (OCT-A), which reveals the retinal vasculature with improved contrast. Recent deep learning algorithms produced promising vascular segmentation results; however, 3D retinal vessel segmentation remains difficult due to the lack of manually annotated training data. We propose a learning-based method that is only supervised by a self-synthesized modality named local intensity fusion (LIF). LIF is a capillary-enhanced volume computed directly from the input OCT-A. We then construct the local intensity fusion encoder (LIFE) to map a given OCT-A volume and its LIF counterpart to a shared latent space. The latent space of LIFE has the same dimensions as the input data and it contains features common to both modalities. By binarizing this latent space, we obtain a volumetric vessel segmentation. Our method is evaluated in a human fovea OCT-A and three zebrafish OCT-A volumes with manual labels. It yields a Dice score of 0.7736 on human data and 0.8594 +/- 0.0275 on zebrafish data, a dramatic improvement over existing unsupervised algorithms. △ Less

Submitted 9 July, 2021; originally announced July 2021.

Comments: Accepted by International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2021

arXiv:2102.12851 [pdf, ps, other]

Homogeneous Spectrum of Quasi-periodic Gevrey Schrödinger Operators with Diophantine Frequency

Authors: Yan Yang, Kai Tao

Abstract: We consider the quasi-periodic Schrödinger operator with the non-degenerate Gevrey potential for the Diophantine frequency. We prove that if the coupling number of the potential is large, then the spectrum is homogeneous. We consider the quasi-periodic Schrödinger operator with the non-degenerate Gevrey potential for the Diophantine frequency. We prove that if the coupling number of the potential is large, then the spectrum is homogeneous. △ Less

Submitted 26 November, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

Comments: 25 pages

arXiv:2101.05967 [pdf, other]

Responsible AI Challenges in End-to-end Machine Learning

Authors: Steven Euijong Whang, Ki Hyun Tae, Yuji Roh, Geon Heo

Abstract: Responsible AI is becoming critical as AI is widely used in our everyday lives. Many companies that deploy AI publicly state that when training a model, we not only need to improve its accuracy, but also need to guarantee that the model does not discriminate against users (fairness), is resilient to noisy or poisoned data (robustness), is explainable, and more. In addition, these objectives are no… ▽ More Responsible AI is becoming critical as AI is widely used in our everyday lives. Many companies that deploy AI publicly state that when training a model, we not only need to improve its accuracy, but also need to guarantee that the model does not discriminate against users (fairness), is resilient to noisy or poisoned data (robustness), is explainable, and more. In addition, these objectives are not only relevant to model training, but to all steps of end-to-end machine learning, which include data collection, data cleaning and validation, model training, model evaluation, and model management and serving. Finally, responsible AI is conceptually challenging, and supporting all the objectives must be as easy as possible. We thus propose three key research directions towards this vision - depth, breadth, and usability - to measure progress and introduce our ongoing research. First, responsible AI must be deeply supported where multiple objectives like fairness and robust must be handled together. To this end, we propose FR-Train, a holistic framework for fair and robust model training in the presence of data bias and poisoning. Second, responsible AI must be broadly supported, preferably in all steps of machine learning. Currently we focus on the data pre-processing steps and propose Slice Tuner, a selective data acquisition framework for training fair and accurate models, and MLClean, a data cleaning framework that also improves fairness and robustness. Finally, responsible AI must be usable where the techniques must be easy to deploy and actionable. We propose FairBatch, a batch selection approach for fairness that is effective and simple to use, and Slice Finder, a model evaluation tool that automatically finds problematic slices. We believe we scratched the surface of responsible AI for end-to-end machine learning and suggest research challenges moving forward. △ Less

Submitted 14 January, 2021; originally announced January 2021.

arXiv:2011.04499 [pdf, other]

Synonym Knowledge Enhanced Reader for Chinese Idiom Reading Comprehension

Authors: Siyu Long, Ran Wang, Kun Tao, Jiali Zeng, Xin-Yu Dai

Abstract: Machine reading comprehension (MRC) is the task that asks a machine to answer questions based on a given context. For Chinese MRC, due to the non-literal and non-compositional semantic characteristics, Chinese idioms pose unique challenges for machines to understand. Previous studies tend to treat idioms separately without fully exploiting the relationship among them. In this paper, we first defin… ▽ More Machine reading comprehension (MRC) is the task that asks a machine to answer questions based on a given context. For Chinese MRC, due to the non-literal and non-compositional semantic characteristics, Chinese idioms pose unique challenges for machines to understand. Previous studies tend to treat idioms separately without fully exploiting the relationship among them. In this paper, we first define the concept of literal meaning coverage to measure the consistency between semantics and literal meanings for Chinese idioms. With the definition, we prove that the literal meanings of many idioms are far from their semantics, and we also verify that the synonymic relationship can mitigate this inconsistency, which would be beneficial for idiom comprehension. Furthermore, to fully utilize the synonymic relationship, we propose the synonym knowledge enhanced reader. Specifically, for each idiom, we first construct a synonym graph according to the annotations from a high-quality synonym dictionary or the cosine similarity between the pre-trained idiom embeddings and then incorporate the graph attention network and gate mechanism to encode the graph. Experimental results on ChID, a large-scale Chinese idiom reading comprehension dataset, show that our model achieves state-of-the-art performance. △ Less

Submitted 9 November, 2020; originally announced November 2020.

Comments: 12 pages, 3 figure, accepted by COLING2020

arXiv:2007.06071 [pdf, other]

doi 10.1371/journal.pone.0236452

A Comparative Study on Polyp Classification using Convolutional Neural Networks

Authors: Krushi Patel, Kaidong Li, Ke Tao, Quan Wang, Ajay Bansal, Amit Rastogi, Guanghui Wang

Abstract: Colorectal cancer is the third most common cancer diagnosed in both men and women in the United States. Most colorectal cancers start as a growth on the inner lining of the colon or rectum, called 'polyp'. Not all polyps are cancerous, but some can develop into cancer. Early detection and recognition of the type of polyps is critical to prevent cancer and change outcomes. However, visual classific… ▽ More Colorectal cancer is the third most common cancer diagnosed in both men and women in the United States. Most colorectal cancers start as a growth on the inner lining of the colon or rectum, called 'polyp'. Not all polyps are cancerous, but some can develop into cancer. Early detection and recognition of the type of polyps is critical to prevent cancer and change outcomes. However, visual classification of polyps is challenging due to varying illumination conditions of endoscopy, variant texture, appearance, and overlap** morphology between polyps. More importantly, evaluation of polyp patterns by gastroenterologists is subjective leading to a poor agreement among observers. Deep convolutional neural networks have proven very successful in object classification across various object categories. In this work, we compare the performance of the state-of-the-art general object classification models for polyp classification. We trained a total of six CNN models end-to-end using a dataset of 157 video sequences composed of two types of polyps: hyperplastic and adenomatous. Our results demonstrate that the state-of-the-art CNN models can successfully classify polyps with an accuracy comparable or better than reported among gastroenterologists. The results of this study can guide future research in polyp classification. △ Less

Submitted 12 July, 2020; originally announced July 2020.

arXiv:2004.01251 [pdf, other]

R3: A Reading Comprehension Benchmark Requiring Reasoning Processes

Authors: Ran Wang, Kun Tao, Dingjie Song, Zhilong Zhang, Xiao Ma, Xi'ao Su, Xinyu Dai

Abstract: Existing question answering systems can only predict answers without explicit reasoning processes, which hinder their explainability and make us overestimate their ability of understanding and reasoning over natural language. In this work, we propose a novel task of reading comprehension, in which a model is required to provide final answers and reasoning processes. To this end, we introduce a for… ▽ More Existing question answering systems can only predict answers without explicit reasoning processes, which hinder their explainability and make us overestimate their ability of understanding and reasoning over natural language. In this work, we propose a novel task of reading comprehension, in which a model is required to provide final answers and reasoning processes. To this end, we introduce a formalism for reasoning over unstructured text, namely Text Reasoning Meaning Representation (TRMR). TRMR consists of three phrases, which is expressive enough to characterize the reasoning process to answer reading comprehension questions. We develop an annotation platform to facilitate TRMR's annotation, and release the R3 dataset, a \textbf{R}eading comprehension benchmark \textbf{R}equiring \textbf{R}easoning processes. R3 contains over 60K pairs of question-answer pairs and their TRMRs. Our dataset is available at: \url{http://anonymous}. △ Less

Submitted 2 April, 2020; originally announced April 2020.

Comments: work in progress

arXiv:2003.04549 [pdf, other]

Slice Tuner: A Selective Data Acquisition Framework for Accurate and Fair Machine Learning Models

Authors: Ki Hyun Tae, Steven Euijong Whang

Abstract: As machine learning becomes democratized in the era of Software 2.0, a serious bottleneck is acquiring enough data to ensure accurate and fair models. Recent techniques including crowdsourcing provide cost-effective ways to gather such data. However, simply acquiring data as much as possible is not necessarily an effective strategy for optimizing accuracy and fairness. For example, if an online ap… ▽ More As machine learning becomes democratized in the era of Software 2.0, a serious bottleneck is acquiring enough data to ensure accurate and fair models. Recent techniques including crowdsourcing provide cost-effective ways to gather such data. However, simply acquiring data as much as possible is not necessarily an effective strategy for optimizing accuracy and fairness. For example, if an online app store has enough training data for certain slices of data (say American customers), but not for others, obtaining more American customer data will only bias the model training. Instead, we contend that one needs to selectively acquire data and propose Slice Tuner, which acquires possibly-different amounts of data per slice such that the model accuracy and fairness on all slices are optimized. This problem is different than labeling existing data (as in active learning or weak supervision) because the goal is obtaining the right amounts of new data. At its core, Slice Tuner maintains learning curves of slices that estimate the model accuracies given more data and uses convex optimization to find the best data acquisition strategy. The key challenges of estimating learning curves are that they may be inaccurate if there is not enough data, and there may be dependencies among slices where acquiring data for one slice influences the learning curves of others. We solve these issues by iteratively and efficiently updating the learning curves as more data is acquired. We evaluate Slice Tuner on real datasets using crowdsourcing for data acquisition and show that Slice Tuner significantly outperforms baselines in terms of model accuracy and fairness, even when the learning curves cannot be reliably estimated. △ Less

Submitted 21 August, 2021; v1 submitted 10 March, 2020; originally announced March 2020.

Comments: 15 pages, 11 figures, 11 tables

arXiv:1906.11136 [pdf, ps, other]

Large Deviation theorems for Dirichlet determinants of analytic quasi-periodic Jacobi operators with Brjuno-Rüssmann frequency

Authors: Wenmeng Geng, Kai Tao

Abstract: In this paper, we first study the strong Birkhoff Ergodic Theorem for subharmonic functions with the Brjuno-Rüssmann shift on the Torus. Then, we apply it to prove the large deviation theorems for the finite scale Dirichlet determinants of quasi-periodic analytic Jacobi operators with this frequency. It shows that the Brjuno-Rüssmann function, which reflects the irrationality of the frequency, pla… ▽ More In this paper, we first study the strong Birkhoff Ergodic Theorem for subharmonic functions with the Brjuno-Rüssmann shift on the Torus. Then, we apply it to prove the large deviation theorems for the finite scale Dirichlet determinants of quasi-periodic analytic Jacobi operators with this frequency. It shows that the Brjuno-Rüssmann function, which reflects the irrationality of the frequency, plays the key role in these theorems via the smallest deviation. At last, as an application, we obtain a distribution of the eigenvalues of the Jacobi operators with Dirichlet boundary conditions, which also depends on the smallest deviation, essentially on the irrationality of the frequency. △ Less

Submitted 26 June, 2019; originally announced June 2019.

Comments: 28 pages

MSC Class: 37C55; 37F10; 37C40

arXiv:1904.10761 [pdf, other]

Data Cleaning for Accurate, Fair, and Robust Models: A Big Data - AI Integration Approach

Authors: Ki Hyun Tae, Yuji Roh, Young Hun Oh, Hyunsu Kim, Steven Euijong Whang

Abstract: The wide use of machine learning is fundamentally changing the software development paradigm (a.k.a. Software 2.0) where data becomes a first-class citizen, on par with code. As machine learning is used in sensitive applications, it becomes imperative that the trained model is accurate, fair, and robust to attacks. While many techniques have been proposed to improve the model training process (in-… ▽ More The wide use of machine learning is fundamentally changing the software development paradigm (a.k.a. Software 2.0) where data becomes a first-class citizen, on par with code. As machine learning is used in sensitive applications, it becomes imperative that the trained model is accurate, fair, and robust to attacks. While many techniques have been proposed to improve the model training process (in-processing approach) or the trained model itself (post-processing), we argue that the most effective method is to clean the root cause of error: the data the model is trained on (pre-processing). Historically, there are at least three research communities that have been separately studying this problem: data management, machine learning (model fairness), and security. Although a significant amount of research has been done by each community, ultimately the same datasets must be preprocessed, and there is little understanding how the techniques relate to each other and can possibly be integrated. We contend that it is time to extend the notion of data cleaning for modern machine learning needs. We identify dependencies among the data preprocessing techniques and propose MLClean, a unified data cleaning framework that integrates the techniques and helps train accurate and fair models. This work is part of a broader trend of Big data -- Artificial Intelligence (AI) integration. △ Less

Submitted 22 April, 2019; originally announced April 2019.

Comments: 4 pages

arXiv:1809.01263 [pdf, other]

An Efficient Approach for Polyps Detection in Endoscopic Videos Based on Faster R-CNN

Authors: Xi Mo, Ke Tao, Quan Wang, Guanghui Wang

Abstract: Polyp has long been considered as one of the major etiologies to colorectal cancer which is a fatal disease around the world, thus early detection and recognition of polyps plays a crucial role in clinical routines. Accurate diagnoses of polyps through endoscopes operated by physicians becomes a challenging task not only due to the varying expertise of physicians, but also the inherent nature of e… ▽ More Polyp has long been considered as one of the major etiologies to colorectal cancer which is a fatal disease around the world, thus early detection and recognition of polyps plays a crucial role in clinical routines. Accurate diagnoses of polyps through endoscopes operated by physicians becomes a challenging task not only due to the varying expertise of physicians, but also the inherent nature of endoscopic inspections. To facilitate this process, computer-aid techniques that emphasize fully-conventional image processing and novel machine learning enhanced approaches have been dedicatedly designed for polyp detection in endoscopic videos or images. Among all proposed algorithms, deep learning based methods take the lead in terms of multiple metrics in evolutions for algorithmic performance. In this work, a highly effective model, namely the faster region-based convolutional neural network (Faster R-CNN) is implemented for polyp detection. In comparison with the reported results of the state-of-the-art approaches on polyps detection, extensive experiments demonstrate that the Faster R-CNN achieves very competing results, and it is an efficient approach for clinical practice. △ Less

Submitted 4 September, 2018; originally announced September 2018.

Comments: 6 pages, 10 figures,2018 International Conference on Pattern Recognition

arXiv:1807.06068 [pdf, other]

Automated Data Slicing for Model Validation:A Big data - AI Integration Approach

Authors: Yeounoh Chung, Tim Kraska, Neoklis Polyzotis, Ki Hyun Tae, Steven Euijong Whang

Abstract: As machine learning systems become democratized, it becomes increasingly important to help users easily debug their models. However, current data tools are still primitive when it comes to hel** users trace model performance problems all the way to the data. We focus on the particular problem of slicing data to identify subsets of the validation data where the model performs poorly. This is an i… ▽ More As machine learning systems become democratized, it becomes increasingly important to help users easily debug their models. However, current data tools are still primitive when it comes to hel** users trace model performance problems all the way to the data. We focus on the particular problem of slicing data to identify subsets of the validation data where the model performs poorly. This is an important problem in model validation because the overall model performance can fail to reflect that of the smaller subsets, and slicing allows users to analyze the model performance on a more granular-level. Unlike general techniques (e.g., clustering) that can find arbitrary slices, our goal is to find interpretable slices (which are easier to take action compared to arbitrary subsets) that are problematic and large. We propose Slice Finder, which is an interactive framework for identifying such slices using statistical techniques. Applications include diagnosing model fairness and fraud detection, where identifying slices that are interpretable to humans is crucial. This research is part of a larger trend of Big data and Artificial Intelligence (AI) integration and opens many opportunities for new research. △ Less

Submitted 6 January, 2019; v1 submitted 16 July, 2018; originally announced July 2018.

arXiv:1807.04807 [pdf, other]

Learning-based Regularization for Cardiac Strain Analysis with Ability for Domain Adaptation

Authors: Allen Lu, Nripesh Parajuli, Maria Zontak, John Stendahl, Kevinminh Ta, Zhao Liu, Nabil Boutagy, Geng-Shi Jeng, Imran Alkhalil, Lawrence H. Staib, Matthew O'Donnell, Albert J. Sinusas, James S. Duncan

Abstract: Reliable motion estimation and strain analysis using 3D+time echocardiography (4DE) for localization and characterization of myocardial injury is valuable for early detection and targeted interventions. However, motion estimation is difficult due to the low-SNR that stems from the inherent image properties of 4DE, and intelligent regularization is critical for producing reliable motion estimates.… ▽ More Reliable motion estimation and strain analysis using 3D+time echocardiography (4DE) for localization and characterization of myocardial injury is valuable for early detection and targeted interventions. However, motion estimation is difficult due to the low-SNR that stems from the inherent image properties of 4DE, and intelligent regularization is critical for producing reliable motion estimates. In this work, we incorporated the notion of domain adaptation into a supervised neural network regularization framework. We first propose an unsupervised autoencoder network with biomechanical constraints for learning a latent representation that is shown to have more physiologically plausible displacements. We extended this framework to include a supervised loss term on synthetic data and showed the effects of biomechanical constraints on the network's ability for domain adaptation. We validated both the autoencoder and semi-supervised regularization method on in vivo data with implanted sonomicrometers. Finally, we showed the ability of our semi-supervised learning regularization approach to identify infarcted regions using estimated regional strain maps with good agreement to manually traced infarct regions from postmortem excised hearts. △ Less

Submitted 12 July, 2018; originally announced July 2018.

arXiv:1807.02951 [pdf, other]

Flow Network Tracking for Spatiotemporal and Periodic Point Matching: Applied to Cardiac Motion Analysis

Authors: Nripesh Parajuli, Allen Lu, Kevinminh Ta, John C. Stendahl, Nabil Boutagy, Imran Alkhalil, Melissa Eberle, Geng-Shi Jeng, Maria Zontak, Matthew ODonnell, Albert J. Sinusas, James S. Duncan

Abstract: The accurate quantification of left ventricular (LV) deformation/strain shows significant promise for quantitatively assessing cardiac function for use in diagnosis and therapy planning (Jasaityte et al., 2013). However, accurate estimation of the displacement of myocardial tissue and hence LV strain has been challenging due to a variety of issues, including those related to deriving tracking toke… ▽ More The accurate quantification of left ventricular (LV) deformation/strain shows significant promise for quantitatively assessing cardiac function for use in diagnosis and therapy planning (Jasaityte et al., 2013). However, accurate estimation of the displacement of myocardial tissue and hence LV strain has been challenging due to a variety of issues, including those related to deriving tracking tokens from images and following tissue locations over the entire cardiac cycle. In this work, we propose a point matching scheme where correspondences are modeled as flow through a graphical network. Myocardial surface points are set up as nodes in the network and edges define neighborhood relationships temporally. The novelty lies in the constraints that are imposed on the matching scheme, which render the correspondences one-to-one through the entire cardiac cycle, and not just two consecutive frames. The constraints also encourage motion to be cyclic, which is an important characteristic of LV motion. We validate our method by applying it to the estimation of quantitative LV displacement and strain estimation using 8 synthetic and 8 open-chested canine 4D echocardiographic image sequences, the latter with sonomicrometric crystals implanted on the LV wall. We were able to achieve excellent tracking accuracy on the synthetic dataset and observed a good correlation with crystal-based strains on the in-vivo data. △ Less

Submitted 9 July, 2018; originally announced July 2018.

Comments: Submitted manuscript to Medical Image Analysis Journal

arXiv:1805.00431 [pdf, ps, other]

Strong Birkhoff Ergodic Theorem for subharmonic functions with irrational shift and its application to analytic quasi-periodic cocycles

Authors: Kai Tao

Abstract: In this paper, we first prove the strong Birkhoff Ergodic Theorem for subharmonic functions with the irrational shift on the Torus. Then, it is applied to the analytic quasi-periodic Jacobi cocycles. We show that if the Lyapunov exponent of these cocycles is positive at one point, then it is positive on an interval centered at this point for suitable frequency and coupling numbers. We also prove t… ▽ More In this paper, we first prove the strong Birkhoff Ergodic Theorem for subharmonic functions with the irrational shift on the Torus. Then, it is applied to the analytic quasi-periodic Jacobi cocycles. We show that if the Lyapunov exponent of these cocycles is positive at one point, then it is positive on an interval centered at this point for suitable frequency and coupling numbers. We also prove that the Lyapunov exponent is Hölder continuous in $E$ on this interval and calculate the expression of its length. What's more, if the coupling number of the potential is large, then the Lyapunov exponent is always positive for all irrational frequencies and Hölder continuous in $E$ for all finite Liouville frequencies. We also study the Lyapunov exponent of the Schrödinger cocycles, a special case of the Jacobi ones, and obtain its Hölder continuity in the frequency. △ Less

Submitted 19 July, 2018; v1 submitted 1 May, 2018; originally announced May 2018.

Comments: 29 pages

MSC Class: 37A02; 37C02; 37D02

arXiv:1712.07900 [pdf, ps, other]

Non-perturbative positive Lyapunov exponent of Schrödinger equations and its applications to skew-shift

Authors: Kai Tao

Abstract: We first study the discrete Schrödinger equations with analytic potentials given by a class of transformations. It is shown that if the coupling number is large, then its logarithm equals approximately to the Lyapunov exponents. When the transformation becomes the skew-shift, we prove that the Lyapunov exponent is week Hölder continuous, and the spectrum satisfies Anderson Localization and contain… ▽ More We first study the discrete Schrödinger equations with analytic potentials given by a class of transformations. It is shown that if the coupling number is large, then its logarithm equals approximately to the Lyapunov exponents. When the transformation becomes the skew-shift, we prove that the Lyapunov exponent is week Hölder continuous, and the spectrum satisfies Anderson Localization and contains large intervals. Moreover, all of these conclusions are non-perturbative. △ Less

Submitted 1 May, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

MSC Class: 37A02; 37C02; 37D02

arXiv:1502.07536 [pdf]

doi 10.1007/s11468-015-9901-x

Collective dark states controlled transmission in plasmonic slot waveguide with a stub coupled to a cavity dimer

Authors: Zhenzhen Liu, Jun-Jun Xiao, Qiang Zhang, Xiaoming Zhang, Keyu Tao

Abstract: We report collective dark states controlled transmission in metal-dielectric-metal waveguides with a stub coupled to two twin cavities, namely, plasmonic waveguide-stub-dimer systems. In absence of one individual cavity in the dimer, plasmon induced transparency (PIT) is possible when the cavity and the stub have the same resonance frequency. However, it is shown that the hybridized modes in the d… ▽ More We report collective dark states controlled transmission in metal-dielectric-metal waveguides with a stub coupled to two twin cavities, namely, plasmonic waveguide-stub-dimer systems. In absence of one individual cavity in the dimer, plasmon induced transparency (PIT) is possible when the cavity and the stub have the same resonance frequency. However, it is shown that the hybridized modes in the dimer collectively generate two dark states which make the stub-dimer "invisible" to the straight waveguide, splitting the original PIT peak into two in the transmission spectrum. Simultaneously, the original PIT peak becomes a dip due to dark state interaction, yielding anti-PIT-like modulation of the transmission. With full-wave electromagnetic simulation, we demonstrate that this transition is controlled by the dimer-stub separation and the dimer-stub relative position. All results are analytically described by the temporal coupled mode theory. Our results may be useful in designing densely integrated optical circuits, and in optical sensing and switching applications. △ Less

Submitted 26 February, 2015; originally announced February 2015.

arXiv:1502.03206 [pdf, ps, other]

High order numerical schemes for second-order FBSDEs with applications to stochastic optimal control

Authors: Kong Tao, Weidong Zhao, Tao Zhou

Abstract: This is one of our series papers on multistep schemes for solving forward backward stochastic differential equations (FBSDEs) and related problems. Here we extend (with non-trivial updates) our multistep schemes in [W. Zhao, Y. Fu and T. Zhou, SIAM J. Sci. Comput., 36 (2014), pp. A1731-A1751.] to solve the second order FBSDEs (2FBSDEs). The key feature of the multistep schemes is that the Euler me… ▽ More This is one of our series papers on multistep schemes for solving forward backward stochastic differential equations (FBSDEs) and related problems. Here we extend (with non-trivial updates) our multistep schemes in [W. Zhao, Y. Fu and T. Zhou, SIAM J. Sci. Comput., 36 (2014), pp. A1731-A1751.] to solve the second order FBSDEs (2FBSDEs). The key feature of the multistep schemes is that the Euler method is used to discrete the forward SDE, which dramatically reduces the entire computational complexity. Moreover, it is shown that the usual quantities of interest (e.g., the solution tuple $(Y_t, Z_t, A_t, Γ_t)$ in the 2FBSDEs) are still of high order accuracy. Several numerical examples are given to show the effective of the proposed numerical schemes. Applications of our numerical schemes for stochastic optimal control problems are also presented. △ Less

Submitted 11 February, 2015; originally announced February 2015.

arXiv:1501.01028 [pdf, ps, other]

Hölder continuity of the integrated density of states for quasi-periodic Jacobi operators

Authors: Kai Tao, Mircea Voda

Abstract: We show Hölder continuity for the integrated density of states of a quasi-periodic Jacobi operator with analytic coefficients, in the regime of positive Lyapunov exponent and with a strong Diophantine condition on the frequency. In particular, when the coefficients are trigonometric polynomials we express the Hölder exponent in terms of the degrees of the coefficients. We show Hölder continuity for the integrated density of states of a quasi-periodic Jacobi operator with analytic coefficients, in the regime of positive Lyapunov exponent and with a strong Diophantine condition on the frequency. In particular, when the coefficients are trigonometric polynomials we express the Hölder exponent in terms of the degrees of the coefficients. △ Less

Submitted 29 January, 2015; v1 submitted 5 January, 2015; originally announced January 2015.

Comments: v.2: fixed some typos

arXiv:1108.3747 [pdf, ps, other]

Hölder continuity of Lyapunov exponent for quasi-periodic Jacobi operators

Authors: Kai Tao

Abstract: We consider the quasi-periodic Jacobi operator $H_{x,ω}$ in $l^2(\mathbb{Z})$ $(H_{x,ω}φ)(n) = -b(x+(n+1)ω)φ(n+1) - b(x+nω)φ(n-1) + a(x+nω)φ(n) = Eφ(n),\ n\in\mathbb{Z},$ where $a(x),\ b(x)$ are analytic function on $\mathbb{T}$, $b$ is not identically zero, and $ω$ obeys some strong Diophantine condition. We consider the corresponding unimodular cocycle. We prove that if the Lyapunov exponent… ▽ More We consider the quasi-periodic Jacobi operator $H_{x,ω}$ in $l^2(\mathbb{Z})$ $(H_{x,ω}φ)(n) = -b(x+(n+1)ω)φ(n+1) - b(x+nω)φ(n-1) + a(x+nω)φ(n) = Eφ(n),\ n\in\mathbb{Z},$ where $a(x),\ b(x)$ are analytic function on $\mathbb{T}$, $b$ is not identically zero, and $ω$ obeys some strong Diophantine condition. We consider the corresponding unimodular cocycle. We prove that if the Lyapunov exponent $L(E)$ of the cocycle is positive for some $E=E_0$, then there exists $ρ_0=ρ_0(a,b,ω,E_0)$, $β=β(a,b,ω)$ such that $|L(E)-L(E')|<|E-E'|^β$ for any $E,E'\in (E_0-ρ_0,E_0+ρ_0)$. If $L(E)>0$ for all $E$ in some compact interval $I$ then $L(E)$ is Hölder continuous on $I$ with a Hölder exponent $β=β(a,b,ω,I)$. In our derivation we follow the refined version of the Goldstein-Schlag method \cite{GS} developed by Bourgain and Jitomirskaya \cite{BJ}. △ Less

Submitted 18 August, 2011; originally announced August 2011.

arXiv:0808.0585 [pdf, ps, other]

doi 10.1103/PhysRevLett.101.216802

Kondo effect in single atom contacts: the importance of the atomic geometry

Authors: Lucia Vitali, Robin Ohmann, Sebastian Stepanow, Pietro Gambardella, Kun Tao, Renzhong Huang, Valeri S. Stepanyuk, Patrick Bruno, Klaus Kern

Abstract: Co single atom junctions on copper surfaces are studied by scanning tunneling microscopy and ab-initio calculations. The Kondo temperature of single cobalt atoms on the Cu(111) surface has been measured at various tip-sample distances ranging from tunneling to the point contact regime. The experiments show a constant Kondo temperature for a whole range of tip-substrate distances consistently wit… ▽ More Co single atom junctions on copper surfaces are studied by scanning tunneling microscopy and ab-initio calculations. The Kondo temperature of single cobalt atoms on the Cu(111) surface has been measured at various tip-sample distances ranging from tunneling to the point contact regime. The experiments show a constant Kondo temperature for a whole range of tip-substrate distances consistently with the predicted energy position of the spin-polarized d-levels of Co. This is in striking difference to experiments on Co/Cu(100) junctions, where a substantial increase of the Kondo temperature has been found. Our calculations reveal that the different behavior of the Co adatoms on the two Cu surfaces originates from the interplay between the structural relaxations and the electronic properties in the near-contact regime. △ Less

Submitted 5 August, 2008; originally announced August 2008.

arXiv:0804.3337 [pdf, ps, other]

doi 10.1103/PhysRevB.78.014426

Manipulating magnetism and conductance of an adatom-molecule junction on metal surfaces: ab initio study

Authors: Kun Tao, V. S. Stepanyuk, P. Bruno, D. I. Bazhanov, V. V. Maslyuk, M. Brandbyge, I. Mertig

Abstract: The state of the art ab initio calculations reveal the effect of a scanning tunnelling microscopy tip on magnetic properties and conductance of a benzene-adatom sandwich on Cu(001). We concentrate on a benzene-Co system interacting with a Cr tip. Our studies give a clear evidence that magnetism and conductance in molecule-adatom junctions can be tailored by the STM tip. Varying the tip-substrate… ▽ More The state of the art ab initio calculations reveal the effect of a scanning tunnelling microscopy tip on magnetic properties and conductance of a benzene-adatom sandwich on Cu(001). We concentrate on a benzene-Co system interacting with a Cr tip. Our studies give a clear evidence that magnetism and conductance in molecule-adatom junctions can be tailored by the STM tip. Varying the tip-substrate distance the magnetic moment of the Co adatom can be switched on/off. The interplay between spin-polarized electron transport through the junction and its magnetic properties is demonstrated. A spin-filter effect in the junction is predicted. △ Less

Submitted 21 April, 2008; originally announced April 2008.

Comments: 4 pages, 3 figures

Showing 1–38 of 38 results for author: Tao, K