
LEDetection: A Simple Framework for Semi-Supervised Few-Shot Object Detection

Abstract: Few-shot object detection (FSOD) is a challenging problem aimed at detecting novel concepts from few exemplars. Existing approaches to FSOD all assume abundant base labels to adapt to novel objects. This paper studies the new task of semi-supervised FSOD by considering a realistic scenario in which both base and novel labels are simultaneously scarce. We explore the utility of unlabeled data within our proposed label-efficient detection framework and discover its remarkable ability to boost semi-supervised FSOD by way of region proposals. Motivated by this finding, we introduce SoftER Teacher, a robust detector combining pseudo-labeling with consistency learning on region proposals, to harness unlabeled data for improved FSOD without relying on abundant labels. Rigorous experiments show that SoftER Teacher surpasses the novel performance of a strong supervised detector using only 10% of required base labels, without catastrophic forgetting observed in prior approaches. Our work also sheds light on a potential relationship between semi-supervised and few-shot detection suggesting that a stronger semi-supervised detector leads to a more effective few-shot detector.

Submitted 14 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

Comments: AISTATS 2024. The code is available at https://github.com/lexisnexis-risk-open-source/ledetection

arXiv:2103.13696 [pdf, other]

SSLayout360: Semi-Supervised Indoor Layout Estimation from 360-Degree Panorama

Authors: Phi Vu Tran

Abstract: Recent years have seen flourishing research on both semi-supervised learning and 3D room layout reconstruction. In this work, we explore the intersection of these two fields to advance the research objective of enabling more accurate 3D indoor scene modeling with less labeled data. We propose the first approach to learn representations of room corners and boundaries by using a combination of labeled and unlabeled data for improved layout estimation in a 360-degree panoramic scene. Through extensive comparative experiments, we demonstrate that our approach can advance layout estimation of complex indoor scenes using as few as 20 labeled examples. When coupled with a layout predictor pre-trained on synthetic data, our semi-supervised method matches the fully supervised counterpart using only 12% of the labels. Our work takes an important first step towards robust semi-supervised layout estimation that can enable many applications in 3D perception with limited labeled data.

Submitted 16 May, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

Comments: CVPR 2021. File size 37MB. Project page at https://github.com/FlyreelAI/sslayout360

arXiv:2011.02372 [pdf, other]

A biomimetic kidney tubule model

Authors: Elod Mehes, Tana S Pottorf, Marton Gulyas, Sandor Paku, Pamela V. Tran, Andras Czirok

Abstract: A critical barrier in the nephrology field is the lack of appropriate in vitro renal tubule models that allow manipulation of various mechanical factors, facilitating studies of disease pathophysiology and drug discovery. Here we report development of a novel in vitro assay system comprised of a renal tubule within an elasto-plastic extracellular matrix microenvironment. This in vitro tubule mimetic device consists of a container with two, pipette-accessible ports, filament-deposition (3D-) printed into 35 mm cell culture dishes. The container is filled with a hydrogel, such as a collagen I or fibrin gel, while a narrow masking tube is threaded through the ports. Following gelation, the masking material is pulled out leaving a tunnel within the gel. Seeding of the tunnels with M1 or MDCK renal epithelial cells through the side ports results in a monolayer with apical-basal polarity, such that laminin and fibronectin are present on the basal surface, while primary cilia project from the apical side of cells into the tubular lumen. The device is optically accessible, and can be live-imaged by phase contrast or epifluorescence microscopy. The lumen of the epithelial-lined tube can be connected through the side ports to a circulatory flow. We demonstrate that kidney epithelial cells are able to adjust the diameter of the model tubule by myosin-II dependent contractility. Furthermore, cells of the tubule are also able to remodel the surrounding hydrogel leading to budding from the main tubule. We propose that this versatile in vitro model system can be developed into a future pre-clinical tool to study pathophysiology of kidney diseases and identify therapeutic compounds.

Submitted 4 November, 2020; originally announced November 2020.

Comments: 11 pages, 4 figures

arXiv:1911.13218 [pdf]

ModelHub.AI: Dissemination Platform for Deep Learning Models

Authors: Ahmed Hosny, Michael Schwier, Christoph Berger, Evin P Örnek, Mehmet Turan, Phi V Tran, Leon Weninger, Fabian Isensee, Klaus H Maier-Hein, Richard McKinley, Michael T Lu, Udo Hoffmann, Bjoern Menze, Spyridon Bakas, Andriy Fedorov, Hugo JWL Aerts

Abstract: Recent advances in artificial intelligence research have led to a profusion of studies that apply deep learning to problems in image analysis and natural language processing among others. Additionally, the availability of open-source computational frameworks has lowered the barriers to implementing state-of-the-art methods across multiple domains. Albeit leading to major performance breakthroughs in some tasks, effective dissemination of deep learning algorithms remains challenging, inhibiting reproducibility and benchmarking studies, impeding further validation, and ultimately hindering their effectiveness in the cumulative scientific progress. In developing a platform for sharing research outputs, we present ModelHub.AI (www.modelhub.ai), a community-driven container-based software engine and platform for the structured dissemination of deep learning models. For contributors, the engine controls data flow throughout the inference cycle, while the contributor-facing standard template exposes model-specific functions including inference, as well as pre- and post-processing. Python and RESTful application programming interfaces (APIs) enable users to interact with models hosted on ModelHub.AI and allow both researchers and developers to utilize models out-of-the-box. ModelHub.AI is domain-, data-, and framework-agnostic, catering to different workflows and contributors' preferences.

Submitted 26 November, 2019; originally announced November 2019.

arXiv:1906.10343 [pdf, other]

Exploring Self-Supervised Regularization for Supervised and Semi-Supervised Learning

Authors: Phi Vu Tran

Abstract: Recent advances in semi-supervised learning have shown tremendous potential in overcoming a major barrier to the success of modern machine learning algorithms: access to vast amounts of human-labeled training data. Previous algorithms based on consistency regularization can harness the abundance of unlabeled data to produce impressive results on a number of semi-supervised benchmarks, approaching the performance of strong supervised baselines using only a fraction of the available labeled data. In this work, we challenge the long-standing success of consistency regularization by introducing self-supervised regularization as the basis for combining semantic feature representations from unlabeled data. We perform extensive comparative experiments to demonstrate the effectiveness of self-supervised regularization for supervised and semi-supervised image classification on SVHN, CIFAR-10, and CIFAR-100 benchmark datasets. We present two main results: (1) models augmented with self-supervised regularization significantly improve upon traditional supervised classifiers without the need for unlabeled data; (2) together with unlabeled data, our models yield semi-supervised performance competitive with, and in many cases exceeding, prior state-of-the-art consistency baselines. Lastly, our models have the practical utility of being efficiently trained end-to-end and require no additional hyper-parameters to tune for optimal performance beyond the standard set for training neural networks. Reference code and data are available at https://github.com/vuptran/sesemi

Submitted 21 November, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

Comments: NeurIPS'19 Workshop on Learning with Rich Experience: Integration of Learning Paradigms

arXiv:1811.02798 [pdf, other]

Multi-Task Graph Autoencoders

Authors: Phi Vu Tran

Abstract: We examine two fundamental tasks associated with graph representation learning: link prediction and node classification. We present a new autoencoder architecture capable of learning a joint representation of local graph structure and available node features for the simultaneous multi-task learning of unsupervised link prediction and semi-supervised node classification. Our simple, yet effective and versatile model is efficiently trained end-to-end in a single stage, whereas previous related deep graph embedding methods require multiple training steps that are difficult to optimize. We provide an empirical evaluation of our model on five benchmark relational, graph-structured datasets and demonstrate significant improvement over three strong baselines for graph representation learning. Reference code and data are available at https://github.com/vuptran/graph-representation-learning

Submitted 7 November, 2018; originally announced November 2018.

Comments: NIPS 2018 Workshop on Relational Representation Learning. Short version of arXiv:1802.08352

arXiv:1802.08352 [pdf, other]

doi 10.1109/DSAA.2018.00034

Learning to Make Predictions on Graphs with Autoencoders

Authors: Phi Vu Tran

Abstract: We examine two fundamental tasks associated with graph representation learning: link prediction and semi-supervised node classification. We present a novel autoencoder architecture capable of learning a joint representation of both local graph structure and available node features for the multi-task learning of link prediction and node classification. Our autoencoder architecture is efficiently trained end-to-end in a single learning stage to simultaneously perform link prediction and node classification, whereas previous related methods require multiple training steps that are difficult to optimize. We provide a comprehensive empirical evaluation of our models on nine benchmark graph-structured datasets and demonstrate significant improvement over related methods for graph representation learning. Reference code and data are available at https://github.com/vuptran/graph-representation-learning

Submitted 29 July, 2018; v1 submitted 22 February, 2018; originally announced February 2018.

Comments: Published as a conference paper at IEEE DSAA 2018

arXiv:1604.00494 [pdf, other]

A Fully Convolutional Neural Network for Cardiac Segmentation in Short-Axis MRI

Authors: Phi Vu Tran

Abstract: Automated cardiac segmentation from magnetic resonance imaging datasets is an essential step in the timely diagnosis and management of cardiac pathologies. We propose to tackle the problem of automated left and right ventricle segmentation through the application of a deep fully convolutional neural network architecture. Our model is efficiently trained end-to-end in a single learning stage from whole-image inputs and ground truths to make inference at every pixel. To our knowledge, this is the first application of a fully convolutional neural network architecture for pixel-wise labeling in cardiac magnetic resonance imaging. Numerical experiments demonstrate that our model is robust to outperform previous fully automated methods across multiple evaluation measures on a range of cardiac datasets. Moreover, our model is fast and can leverage commodity compute resources such as the graphics processing unit to enable state-of-the-art cardiac segmentation at massive scales. The models and code are available at https://github.com/vuptran/cardiac-segmentation

Submitted 26 April, 2017; v1 submitted 2 April, 2016; originally announced April 2016.

Comments: Initial Technical Report; Include link to models and code

Showing 1–8 of 8 results for author: Tran, P V