Combining Deep Learning and Street View Imagery to Map Smallholder Crop Types

Authors: Jordi Laguarta Soler, Thomas Friedel, Sherrie Wang

Abstract: Accurate crop type maps are an essential source of information for monitoring yield progress at scale, projecting global crop production, and planning effective policies. To date, however, crop type maps remain challenging to create in low and middle-income countries due to a lack of ground truth labels for training machine learning models. Field surveys are the gold standard in terms of accuracy but require an often-prohibitively large amount of time, money, and statistical capacity. In recent years, street-level imagery, such as Google Street View, KartaView, and Mapillary, has become available around the world. Such imagery contains rich information about crop types grown at particular locations and times. In this work, we develop an automated system to generate crop type ground references using deep learning and Google Street View imagery. The method efficiently curates a set of street view images containing crop fields, trains a model to predict crop type by utilizing weakly-labelled images from disparate out-of-domain sources, and combines predicted labels with remote sensing time series to create a wall-to-wall crop type map. We show that, in Thailand, the resulting country-wide map of rice, cassava, maize, and sugarcane achieves an accuracy of 93%. We publicly release the first-ever crop type map for all of Thailand 2022 at 10m-resolution with no gaps. To our knowledge, this is the first time a 10m-resolution, multi-crop map has been created for any smallholder country.
As the availability of roadside imagery expands, our pipeline provides a way to map crop types at scale around the globe, especially in underserved smallholder regions.

Submitted 31 January, 2024; v1 submitted 11 September, 2023; originally announced September 2023.

Comments: Accepted to AAAI-24: Special Track on AI for Social Impact

arXiv:2111.11859 [pdf]

doi 10.3389/fcomp.2021.624694

Longitudinal Speech Biomarkers for Automated Alzheimer's Detection

Authors: Jordi Laguarta Soler, Brian Subirana

Abstract: We introduce a novel audio processing architecture, the Open Voice Brain Model (OVBM), improving detection accuracy for Alzheimer's (AD) longitudinal discrimination from spontaneous speech. We also outline the OVBM design methodology that led us to this architecture, which in general can incorporate multimodal biomarkers and simultaneously target several diseases and other AI tasks. Key to our methodology is the use of multiple biomarkers complementing each other; when two of them uniquely identify different subjects in a target disease, we say they are orthogonal. We illustrate the methodology by introducing 16 biomarkers, three of which are orthogonal, demonstrating simultaneous above state-of-the-art discrimination for apparently unrelated diseases such as AD and COVID-19. Inspired by research conducted at the MIT Center for Brain Minds and Machines, OVBM combines biomarker implementations of the four modules of intelligence: The brain OS chunks and overlaps audio samples and aggregates biomarker features from the sensory stream and cognitive core, creating a multi-modal graph neural network of symbolic compositional models for the target task. We apply it to AD, achieving above state-of-the-art accuracy of 93.8% on raw audio, while extracting a subject saliency map that longitudinally tracks relative disease progression using multiple biomarkers, 16 in the reported AD task. The ultimate aim is to help medical practice by detecting onset and treatment impact so that intervention options can be longitudinally tested.
Using the OVBM design methodology, we introduce a novel lung and respiratory tract biomarker created using 200,000+ cough samples to pre-train a model discriminating cough cultural origin. This cough dataset sets a new benchmark as the largest audio health dataset, with 30,000+ subjects participating in April 2020, demonstrating cough cultural bias for the first time.

Submitted 22 November, 2021; originally announced November 2021.

ACM Class: I.2.0; I.2.m

Journal ref: Frontiers in Computer Science, 08 April 2021

arXiv:2105.13175 [pdf]

Standalone micro-reformer for on-board hydrogen production from dimethyl ether

Authors: M. Bianchini, N. Alayo, L. Soler, M. Salleras, L. Fonseca, J. Llorca, A. Tarancon

Abstract: Entering a new era of sustainable energy generation and consumption, new solutions for powering consumer electronics are required to tackle the limited capacity provided by the portable power sources employed nowadays. Hydrocarbon-fed micro-fuel cells represent a promising technology for this purpose, and micro-reactor technology can indeed enable their integration for portable applications. In this work, we present the design and fully scalable wafer-level fabrication of a MEMS-based catalytic micro-reactor, paving the way towards on-board hydrogen production for portable power generators. The device consists of an array of thousands of vertically-aligned micro-channels, 500 µm in length and 50 µm in diameter, for an overall superficial area per unit volume of 120 cm² cm⁻³, and it embeds a thin-film heater for efficient reaction start-up. Functionalization of the active area was achieved by atomic layer deposition, resulting in the uniform coating of a Pt/Al2O3 heterogeneous catalyst. The temperature-dependent dimethyl ether (DME)-to-syngas conversion is tested through steam reforming (SR) and partial oxidation (POX) reactions. Here, conversion rates up to 74% and hydrogen selectivity of 60% are obtained by steam reforming at 650 °C, while a specific volumetric hydrogen production of $4.5\ \mathrm{mL_{H_2}\,mL_{DME}^{-1}\,cm_{reactor}^{-3}}$ at 600 °C is obtained from DME POX in a standalone device tested by means of a 3D printed ceramic housing.

Submitted 27 May, 2021; originally announced May 2021.

arXiv:2105.13044 [pdf]

doi 10.1088/1742-6596/1407/1/012048

A Pd/Al2O3-based micro-reformer unit fully integrated in silicon technology for H-rich gas production

Authors: M Bianchini, N Alayo, L Soler, M Salleras, L Fonseca, J Llorca, A Tarancon

Abstract: This work reports the design, manufacturing and catalytic activity characterization of a micro-reformer for hydrogen-rich gas generation integrated in portable-solid oxide fuel cells (µ-SOFCs). The reformer has been designed as a silicon micro monolithic substrate compatible with the mainstream microelectronics fabrication technologies, ensuring cost-effective, high reproducibility and reliability. The design and geometry of the system have been optimized compared with the previous design, and consist of an array of more than 7×10³ perfectly aligned vertical through-silicon micro channels (50 µm diameter) and a 5 W integrated serpentine heater made of three stacked metallic layers (TiW, W and Au) for perfect adhesion and passivation. Traditional fuels for SOFCs, such as ethanol or methanol, have been replaced by dimethyl ether (DME), and the chosen catalyst for DME conversion consists of Pd nanoparticles grafted on an alumina active support. The micro-channels have been coated by atomic layer deposition (ALD) with amorphous Al2O3, and the influence of rapid thermal processing (RTP) on this film has been studied. A customized ceramic 3D-printed holder has been designed to measure the specific hydrogen production rates, DME conversion and selectivity profiles of this catalyst at different temperatures.

Submitted 27 May, 2021; originally announced May 2021.

Journal ref: J. Phys.: Conf. Ser. 1407 012048 (2019)

arXiv:2103.06104 [pdf, other]

U-Net Transformer: Self and Cross Attention for Medical Image Segmentation

Authors: Olivier Petit, Nicolas Thome, Clément Rambour, Luc Soler

Abstract: Medical image segmentation remains particularly challenging for complex and low-contrast anatomical structures. In this paper, we introduce the U-Transformer network, which combines a U-shaped architecture for image segmentation with self- and cross-attention from Transformers. U-Transformer overcomes the inability of U-Nets to model long-range contextual interactions and spatial dependencies, which are arguably crucial for accurate segmentation in challenging contexts. To this end, attention mechanisms are incorporated at two main levels: a self-attention module leverages global interactions between encoder features, while cross-attention in the skip connections allows a fine spatial recovery in the U-Net decoder by filtering out non-semantic features. Experiments on two abdominal CT-image datasets show the large performance gain brought out by U-Transformer compared to U-Net and local Attention U-Nets. We also highlight the importance of using both self- and cross-attention, and the nice interpretability features brought out by U-Transformer.

Submitted 12 March, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

arXiv:1901.04056 [pdf, other]

doi 10.1016/j.media.2022.102680

The Liver Tumor Segmentation Benchmark (LiTS)

Authors: Patrick Bilic, Patrick Christ, Hongwei Bran Li, Eugene Vorontsov, Avi Ben-Cohen, Georgios Kaissis, Adi Szeskin, Colin Jacobs, Gabriel Efrain Humpire Mamani, Gabriel Chartrand, Fabian Lohöfer, Julian Walter Holch, Wieland Sommer, Felix Hofmann, Alexandre Hostettler, Naama Lev-Cohain, Michal Drozdzal, Michal Marianne Amitai, Refael Vivantik, Jacob Sosna, Ivan Ezhov, Anjany Sekuboyina, Fernando Navarro, Florian Kofler, Johannes C. Paetzold, et al. (84 additional authors not shown)

Abstract: In this work, we report the set-up and results of the Liver Tumor Segmentation Benchmark (LiTS), which was organized in conjunction with the IEEE International Symposium on Biomedical Imaging (ISBI) 2017 and the International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2017 and 2018. The image dataset is diverse and contains primary and secondary tumors with varied sizes and appearances with various lesion-to-background levels (hyper-/hypo-dense), created in collaboration with seven hospitals and research institutions. Seventy-five submitted liver and liver tumor segmentation algorithms were trained on a set of 131 computed tomography (CT) volumes and were tested on 70 unseen test images acquired from different patients. We found that not a single algorithm performed best for both liver and liver tumors in the three events. The best liver segmentation algorithm achieved a Dice score of 0.963, whereas, for tumor segmentation, the best algorithms achieved Dice scores of 0.674 (ISBI 2017), 0.702 (MICCAI 2017), and 0.739 (MICCAI 2018). Retrospectively, we performed additional analysis on liver tumor detection and revealed that not all top-performing segmentation algorithms worked well for tumor detection. The best liver tumor detection method achieved a lesion-wise recall of 0.458 (ISBI 2017), 0.515 (MICCAI 2017), and 0.554 (MICCAI 2018), indicating the need for further research.
LiTS remains an active benchmark and resource for research, e.g., contributing the liver-related segmentation tasks in http://medicaldecathlon.com/. In addition, both data and online evaluation are accessible via www.lits-challenge.com.

Submitted 25 November, 2022; v1 submitted 13 January, 2019; originally announced January 2019.

Comments: Patrick Bilic, Patrick Christ, Hongwei Bran Li, and Eugene Vorontsov made equal contributions to this work. Published in Medical Image Analysis

Journal ref: Medical Image Analysis (2022) Pg. 102680

arXiv:1705.09107 [pdf, other]

SLAM based Quasi Dense Reconstruction For Minimally Invasive Surgery Scenes

Authors: Nader Mahmoud, Alexandre Hostettler, Toby Collins, Luc Soler, Christophe Doignon, J. M. M. Montiel

Abstract: Recovering surgical scene structure in laparoscopic surgery is a crucial step for surgical guidance and augmented reality applications. In this paper, a quasi dense reconstruction algorithm for the surgical scene is proposed. It is based on a state-of-the-art SLAM system and exploits the initial exploration phase that is typically performed by the surgeon at the beginning of the surgery. We show how to convert the sparse SLAM map to a quasi dense scene reconstruction, using pairs of keyframe images and correlation-based featureless patch matching. We have validated the approach with a live porcine experiment using Computed Tomography as ground truth, yielding a Root Mean Squared Error of 4.9mm.

Submitted 25 May, 2017; originally announced May 2017.

Comments: ICRA 2017 workshop C4 Surgical Robots: Compliant, Continuum, Cognitive, and Collaborative

arXiv:1610.04097 [pdf, other]

Automatic View-Point Selection for Inter-Operative Endoscopic Surveillance

Authors: Anant S. Vemuri, Stephane A. Nicolau, Jacques Marescaux, Luc Soler, Nicholas Ayache

Abstract: Esophageal adenocarcinoma arises from Barrett's esophagus, which is the most serious complication of gastroesophageal reflux disease. Strategies for screening involve periodic surveillance and tissue biopsies. A major challenge in such regular examinations is to record and track the disease evolution and re-localization of biopsied sites to provide targeted treatments. In this paper, we extend our original inter-operative relocalization framework to provide a constrained image based search for obtaining the best view-point match to the live view. Within this context we investigate the effect of: the choice of feature descriptors and color-space; filtering of uninformative frames and endoscopic modality, for view-point localization. Our experiments indicate an improvement in the best view-point retrieval rate to [92%,87%] from [73%,76%] (in our previous approach) for NBI and WL.

Submitted 13 October, 2016; originally announced October 2016.

Comments: Medical Content-based Retrieval for Clinical Decision Support and Treatment Planning, MICCAI Conference

arXiv:1608.08149 [pdf, other]

ORBSLAM-based Endoscope Tracking and 3D Reconstruction

Authors: Nader Mahmoud, Iñigo Cirauqui, Alexandre Hostettler, Christophe Doignon, Luc Soler, Jacques Marescaux, J. M. M. Montiel

Abstract: We aim to track the endoscope location inside the surgical scene and provide 3D reconstruction, in real-time, from the sole input of the image sequence captured by the monocular endoscope. This information offers new possibilities for developing surgical navigation and augmented reality applications. The main benefit of this approach is the lack of extra tracking elements, which can disturb the surgeon's performance in the clinical routine. Our first contribution is to exploit ORBSLAM, one of the best performing monocular SLAM algorithms, to estimate both the endoscope location and the 3D structure of the surgical scene. However, the reconstructed 3D map poorly describes textureless soft organ surfaces such as the liver. Our second contribution is to extend ORBSLAM to be able to reconstruct a semi-dense map of soft organs. Experimental results on in-vivo pigs show robust endoscope tracking even with organ deformations and partial instrument occlusions. They also show the reconstruction density and accuracy against a ground truth surface obtained from CT.

Submitted 29 August, 2016; originally announced August 2016.

arXiv:1512.02910 [pdf, ps, other]

doi 10.1109/WD.2016.7461500

Latency Evaluation of a Virtualized MME

Authors: Jonathan Prados-Garzon, Juan J. Ramos-Munoz, Pablo Ameigeiras, Pilar Andres-Maldonado, Juan M. Lopez Soler

Abstract: Network Virtualization is one of the key technologies for developing the future mobile networks. However, the performance of virtual mobile entities may not be sufficient for delivering the service required for future networks in terms of throughput or service time. In addition, to take advantage of the virtualization capabilities, a criterion to decide when to scale out the number of instances is a must. In this paper we propose an LTE virtualized Mobility Management Entity queue model to evaluate its service time for a given signaling workload. The estimation of this latency can serve to decide how many processing instances should be deployed to provide a target service. Additionally, we provide a compound data traffic model for the future mobile applications, and we predict theoretically the control workload that it will generate. Finally, we evaluate the virtualized Mobility Management Entity overall delay by simulation, providing insights for selecting the number of virtual instances for a given number of users.

Submitted 9 December, 2015; originally announced December 2015.

arXiv:1110.6492 [pdf, ps, other]

Scattering solution of a ball by a bat

Authors: Alejandro Cabo, Leon Soler, Carlos Gonzalez

Abstract: The problem of the mechanical evolution of a shock between a cylindrically symmetric bat and a spherical ball is solved in the strict rigid approximation for arbitrary values of the initial conditions. The friction during the impact is assumed to satisfy the standard rules. When the only source of energy dissipation is friction, the problem is fully solved by determining the separation point between the bodies. It also follows that whatever the character of any additional form of dissipation is, it only affects the ending value of the net impulse I exerted by the normal force of the bat on the ball at separation, but not the dynamical evolution with the value of I during the shock process. A relation determining whether the contact points of the two bodies slide relative to each other or come to rest (a pure rotation state) at the end of the impact is determined for the case of purely frictional energy dissipation. The solution is also generalized to include losses in addition to the frictional ones and then applied to the description of experimental measurements of the scattering of a ball by a bat. The evaluations satisfactorily reproduce the measured curves for the output center of mass and angular velocities of the ball as functions of the scattering angle and the impact parameter, respectively.

Submitted 28 October, 2011; originally announced October 2011.

Comments: 22 pages, 8 figures

arXiv:0711.2913 [pdf, ps, other]

Cubic-matrix splines and second-order matrix models

Authors: M. M. Tung, L. Soler, E. Defez, A. Hervas

Abstract: We discuss the direct use of cubic-matrix splines to obtain continuous approximations to the unique solution of matrix models of the type $Y''(x) = f(x,Y(x))$. For numerical illustration, an estimation of the approximation error, an algorithm for its implementation, and an example are given.

Submitted 19 November, 2007; originally announced November 2007.

Comments: 5 pages

MSC Class: 41A15;39B42

Journal ref: Progress in Industrial Mathematics at ECMI 2006 (edited by L. L. Bonilla, M. A. Moscoso, G. Platero, and J. M. Vega), vol. 12 of Mathematics in Industry, pp. 949-953 (Springer, Berlin, 2007), ISBN 978-3-540-71991-5

arXiv:math/0612202 [pdf, ps, other]

doi 10.1016/j.mcm.2006.11.027

Numerical Solutions of Matrix Differential Models using Cubic Matrix Splines II

Authors: E. Defez, A. Hervas, L. Soler, M. M. Tung

Abstract: This paper presents the non-linear generalization of a previous work on matrix differential models. It focusses on the construction of approximate solutions of first-order matrix differential equations Y'(x)=f(x,Y(x)) using matrix-cubic splines. An estimation of the approximation error, an algorithm for its implementation and illustrative examples for Sylvester and Riccati matrix differential equations are given.

Submitted 7 December, 2006; originally announced December 2006.

Comments: 14 pages; submitted to Math. Comp. Modelling

Journal ref: Math. Comp. Modelling 46 (5-6), pp. 657-669 (2007)

Showing 1–13 of 13 results for author: Soler, L