
Showing 1–40 of 40 results for author: Lobell, D

  1. arXiv:2406.10973  [pdf, other]

    cs.CV cs.AI

    ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

    Authors: Samar Khanna, Medhanie Irgau, David B. Lobell, Stefano Ermon

    Abstract: Parameter-efficient fine-tuning (PEFT) techniques such as low-rank adaptation (LoRA) can effectively adapt large pre-trained foundation models to downstream tasks using only a small fraction (0.1%-10%) of the original trainable weights. An under-explored question of PEFT is in extending the pre-training phase without supervised labels; that is, can we adapt a pre-trained foundation model to a new…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2405.08159  [pdf]

    econ.GN

    Large increases in public R&D investment are needed to avoid declines of US agricultural productivity

    Authors: Ariel Ortiz-Bobea, Robert G. Chambers, Yurou He, David B. Lobell

    Abstract: Increasing agricultural productivity is a gradual process with significant time lags between research and development (R&D) investment and the resulting gains. We estimate the response of US agricultural Total Factor Productivity (TFP) to both R&D investment and weather, and quantify the public R&D spending required to offset the emerging impacts of climate change. We find that offsetting the clim…

    Submitted 17 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Main text: 19 pages, 4 figures. Supplementary material: 47 pages, 20 figures, 13 tables

  3. arXiv:2402.02680  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG

    Large Language Models are Geographically Biased

    Authors: Rohin Manvi, Samar Khanna, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Large Language Models (LLMs) inherently carry the biases contained in their training corpora, which can lead to the perpetuation of societal harm. As the impact of these foundation models grows, understanding and evaluating their biases becomes crucial to achieving fairness and accuracy. We propose to study what LLMs know about the world we live in through the lens of geography. This approach is p…

    Submitted 4 February, 2024; originally announced February 2024.

  4. arXiv:2312.03606  [pdf, other]

    cs.CV cs.AI cs.LG

    DiffusionSat: A Generative Foundation Model for Satellite Imagery

    Authors: Samar Khanna, Patrick Liu, Linqi Zhou, Chenlin Meng, Robin Rombach, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Diffusion models have achieved state-of-the-art results on many modalities including images, speech, and video. However, existing models are not tailored to support remote sensing data, which is widely used in important applications including environmental monitoring and crop-yield prediction. Satellite images are significantly different from natural images -- they can be multi-spectral, irregular…

    Submitted 25 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Published at ICLR 2024

  5. arXiv:2310.08831  [pdf, other]

    stat.AP

    Biases in estimates of air pollution impacts: the role of omitted variables and measurement errors

    Authors: Dan M. Kluger, David B. Lobell, Art B. Owen

    Abstract: Observational studies often use linear regression to assess the effect of ambient air pollution on outcomes of interest, such as human health indicators or crop yields. Yet pollution datasets are typically noisy and include only a subset of the potentially relevant pollutants, giving rise to both measurement error bias (MEB) and omitted variable bias (OVB). While it is well understood that these b…

    Submitted 6 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Added a data-based validation using a dataset of pollutants and crop yields from the Midwestern United States

  6. arXiv:2310.06213  [pdf, other]

    cs.CL cs.LG

    GeoLLM: Extracting Geospatial Knowledge from Large Language Models

    Authors: Rohin Manvi, Samar Khanna, Gengchen Mai, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: The application of machine learning (ML) in a range of geospatial tasks is increasingly common but often relies on globally available covariates such as satellite imagery that can either be expensive or lack predictive power. Here we explore the question of whether the vast amounts of knowledge found in Internet language corpora, now compressed within large language models (LLMs), can be leveraged…

    Submitted 24 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  7. arXiv:2308.12061  [pdf, other]

    cs.CV cs.LG

    HarvestNet: A Dataset for Detecting Smallholder Farming Activity Using Harvest Piles and Remote Sensing

    Authors: Jonathan Xu, Amna Elmustafa, Liya Weldegebriel, Emnet Negash, Richard Lee, Chenlin Meng, Stefano Ermon, David Lobell

    Abstract: Small farms contribute to a large share of the productive land in developing countries. In regions such as sub-Saharan Africa, where 80% of farms are small (under 2 ha in size), the task of mapping smallholder cropland is an important part of tracking sustainability measures such as crop productivity. However, the visually diverse and nuanced appearance of small farms has limited the effectivenes…

    Submitted 5 March, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: submitted to AAAI24

  8. arXiv:2301.01449  [pdf, other]

    cs.CV

    Building Coverage Estimation with Low-resolution Remote Sensing Imagery

    Authors: Enci Liu, Chenlin Meng, Matthew Kolodner, Eun Jee Sung, Sihang Chen, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Building coverage statistics provide crucial insights into the urbanization, infrastructure, and poverty level of a region, facilitating efforts towards alleviating poverty, building sustainable cities, and allocating infrastructure investments and public service provision. Global mapping of buildings has been made more efficient with the incorporation of deep learning models into the pipeline. Ho…

    Submitted 4 January, 2023; v1 submitted 4 January, 2023; originally announced January 2023.

  9. arXiv:2212.09681  [pdf, other]

    stat.AP cs.CV cs.LG

    Annual field-scale maps of tall and short crops at the global scale using GEDI and Sentinel-2

    Authors: Stefania Di Tommaso, Sherrie Wang, Vivek Vajipey, Noel Gorelick, Rob Strey, David B. Lobell

    Abstract: Crop type maps are critical for tracking agricultural land use and estimating crop production. Remote sensing has proven an efficient and reliable tool for creating these maps in regions with abundant ground labels for model training, yet these labels remain difficult to obtain in many regions and years. NASA's Global Ecosystem Dynamics Investigation (GEDI) spaceborne lidar instrument, originally…

    Submitted 19 December, 2022; originally announced December 2022.

  10. arXiv:2209.14611  [pdf, other]

    econ.EM

    With big data come big problems: pitfalls in measuring basis risk for crop index insurance

    Authors: Matthieu Stigler, Apratim Dey, Andrew Hobbs, David Lobell

    Abstract: New satellite sensors will soon make it possible to estimate field-level crop yields, showing a great potential for agricultural index insurance. This paper identifies an important threat to better insurance from these new technologies: data with many fields and few years can yield downward biased estimates of basis risk, a fundamental metric in index insurance. To demonstrate this bias, we use st…

    Submitted 29 September, 2022; originally announced September 2022.

  11. arXiv:2207.08051  [pdf, other]

    cs.CV cs.AI

    SatMAE: Pre-training Transformers for Temporal and Multi-Spectral Satellite Imagery

    Authors: Yezhen Cong, Samar Khanna, Chenlin Meng, Patrick Liu, Erik Rozi, Yutong He, Marshall Burke, David B. Lobell, Stefano Ermon

    Abstract: Unsupervised pre-training methods for large vision models have been shown to enhance performance on downstream supervised tasks. Developing similar techniques for satellite imagery presents significant opportunities as unlabelled data is plentiful and the inherent temporal and multi-spectral structure provides avenues to further improve existing pre-training strategies. In this paper, we present SatMAE…

    Submitted 15 January, 2023; v1 submitted 16 July, 2022; originally announced July 2022.

    Comments: Published at NeurIPS 2022. The first two listed names contributed equally to this project

  12. arXiv:2204.01736  [pdf, other]

    eess.IV cs.CV

    Tracking Urbanization in Developing Regions with Remote Sensing Spatial-Temporal Super-Resolution

    Authors: Yutong He, William Zhang, Chenlin Meng, Marshall Burke, David B. Lobell, Stefano Ermon

    Abstract: Automated tracking of urban development in areas where construction information is not available became possible with recent advancements in machine learning and remote sensing. Unfortunately, these solutions perform best on high-resolution imagery, which is expensive to acquire and infrequently available, making it difficult to scale over long time spans and across large geographies. In this work…

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: Presented at Workshop on Machine Learning for the Developing World (ML4D) at the 35th Conference on Neural Information Processing Systems (NeurIPS) 2021

  13. arXiv:2201.04771  [pdf, other]

    cs.CV stat.AP

    Unlocking large-scale crop field delineation in smallholder farming systems with transfer learning and weak supervision

    Authors: Sherrie Wang, Francois Waldner, David B. Lobell

    Abstract: Crop field boundaries aid in mapping crop types, predicting yields, and delivering field-scale analytics to farmers. Recent years have seen the successful application of deep learning to delineating field boundaries in industrial agricultural systems, but field boundary datasets remain missing in smallholder systems due to (1) small fields that require high resolution satellite imagery to delineat…

    Submitted 12 January, 2022; originally announced January 2022.

    Comments: Under submission

  14. Combining randomized field experiments with observational satellite data to assess the benefits of crop rotations on yields

    Authors: Dan M. Kluger, Art B. Owen, David B. Lobell

    Abstract: With climate change threatening agricultural productivity and global food demand increasing, it is important to better understand which farm management practices will maximize crop yields in various climatic conditions. To assess the effectiveness of agricultural practices, researchers often turn to randomized field experiments, which are reliable for identifying causal effects but are often limit…

    Submitted 27 December, 2021; originally announced December 2021.

    MSC Class: 62P12; 62D20

    Journal ref: Environmental Research Letters (2022). Volume 17, Number 4

  15. arXiv:2112.09126  [pdf, other]

    cs.CV cs.LG

    IS-COUNT: Large-scale Object Counting from Satellite Images with Covariate-based Importance Sampling

    Authors: Chenlin Meng, Enci Liu, Willie Neiswanger, Jiaming Song, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Object detection in high-resolution satellite imagery is emerging as a scalable alternative to on-the-ground survey data collection in many environmental and socioeconomic monitoring applications. However, performing object detection over large geographies can still be prohibitively expensive due to the high cost of purchasing imagery and compute. Inspired by traditional survey data collection str…

    Submitted 16 December, 2021; originally announced December 2021.

    Comments: AAAI 2022

  16. Optimal index insurance and basis risk decomposition: an application to Kenya

    Authors: Matthieu Stigler, David Lobell

    Abstract: Index insurance is a promising tool to reduce the risk faced by farmers, but high basis risk, which arises from imperfect correlation between the index and individual farm yields, has limited its adoption to date. Basis risk arises from two fundamental sources: the intrinsic heterogeneity within an insurance zone (zonal risk), and the lack of predictive accuracy of the index (design risk). Whereas…

    Submitted 6 March, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Forthcoming in American Journal of Agricultural Economics

  17. arXiv:2111.04724  [pdf, other]

    cs.LG cs.CV

    SustainBench: Benchmarks for Monitoring the Sustainable Development Goals with Machine Learning

    Authors: Christopher Yeh, Chenlin Meng, Sherrie Wang, Anne Driscoll, Erik Rozi, Patrick Liu, Jihyeon Lee, Marshall Burke, David B. Lobell, Stefano Ermon

    Abstract: Progress toward the United Nations Sustainable Development Goals (SDGs) has been hindered by a lack of data on key environmental and socioeconomic indicators, which historically have come from ground surveys with sparse temporal and spatial coverage. Recent advances in machine learning have made it possible to utilize abundant, frequently-updated, and globally available data, such as from satellit…

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021 (Track on Datasets and Benchmarks)

  18. arXiv:2110.10275  [pdf]

    cs.CV cs.LG

    Early- and in-season crop type mapping without current-year ground truth: generating labels from historical information via a topology-based approach

    Authors: Chenxi Lin, Liheng Zhong, Xiao-Peng Song, Jinwei Dong, David B. Lobell, Zhenong Jin

    Abstract: Land cover classification in remote sensing is often faced with the challenge of limited ground truth. Incorporating historical information has the potential to significantly lower the expensive cost associated with collecting ground truth and, more importantly, enable early- and in-season mapping that is helpful to many pre-harvest decisions. In this study, we propose a new approach that can effe…

    Submitted 19 October, 2021; originally announced October 2021.

  19. arXiv:2109.06972  [pdf, ps, other]

    eess.IV cs.CE cs.CV cs.LG

    Combining GEDI and Sentinel-2 for wall-to-wall mapping of tall and short crops

    Authors: Stefania Di Tommaso, Sherrie Wang, David B. Lobell

    Abstract: High resolution crop type maps are an important tool for improving food security, and remote sensing is increasingly used to create such maps in regions that possess ground truth labels for model training. However, these labels are absent in many regions, and models trained in other regions on typical satellite features, such as those from optical sensors, often exhibit low performance when transf…

    Submitted 10 September, 2021; originally announced September 2021.

  20. arXiv:2109.01246  [pdf]

    stat.AP cs.CV cs.LG stat.ML

    Two Shifts for Crop Mapping: Leveraging Aggregate Crop Statistics to Improve Satellite-based Maps in New Regions

    Authors: Dan M. Kluger, Sherrie Wang, David B. Lobell

    Abstract: Crop type mapping at the field level is critical for a variety of applications in agricultural monitoring, and satellite imagery is becoming an increasingly abundant and useful raw input from which to create crop type maps. Still, in many regions crop type mapping with satellite data remains constrained by a scarcity of field-level crop labels for training supervised classification models. When tr…

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: This is the revised version of the paper which was submitted to Remote Sensing of Environment on May 3, 2021, immediately prior to acceptance for publication

    MSC Class: 62P12 (primary) 62H30 ACM Class: I.4.m; I.m; J.2

    Journal ref: Remote Sensing of Environment (2021). Volume 262

  21. arXiv:2106.11485  [pdf, other]

    cs.CV cs.AI

    Spatial-Temporal Super-Resolution of Satellite Imagery via Conditional Pixel Synthesis

    Authors: Yutong He, Dingjie Wang, Nicholas Lai, William Zhang, Chenlin Meng, Marshall Burke, David B. Lobell, Stefano Ermon

    Abstract: High-resolution satellite imagery has proven useful for a broad range of tasks, including measurement of global human population, local economic livelihoods, and biodiversity, among many others. Unfortunately, high-resolution imagery is both infrequently collected and expensive to purchase, making it hard to efficiently and effectively scale these downstream tasks over both time and space. We prop…

    Submitted 4 April, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

    Journal ref: Advances in Neural Information Processing Systems 35 (2021) 27903-27915

  22. arXiv:2011.12544  [pdf, other]

    econ.GN

    On the benefits of index insurance in US agriculture: a large-scale analysis using satellite data

    Authors: Matthieu Stigler, David Lobell

    Abstract: Index insurance has been promoted as a promising solution for reducing agricultural risk compared to traditional farm-based insurance. By linking payouts to a regional factor instead of individual loss, index insurance reduces monitoring costs, and alleviates the problems of moral hazard and adverse selection. Despite its theoretical appeal, demand for index insurance has remained low in many deve…

    Submitted 7 November, 2021; v1 submitted 25 November, 2020; originally announced November 2020.

  23. arXiv:2011.09980  [pdf, other]

    cs.CV

    Geography-Aware Self-Supervised Learning

    Authors: Kumar Ayush, Burak Uzkent, Chenlin Meng, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Contrastive learning methods have significantly narrowed the gap between supervised and unsupervised learning on computer vision tasks. In this paper, we explore their application to geo-located datasets, e.g. remote sensing, where unlabeled data is often abundant but labeled data is scarce. We first show that due to their different characteristics, a non-trivial gap persists between contrastive a…

    Submitted 8 March, 2022; v1 submitted 19 November, 2020; originally announced November 2020.

    Comments: Accepted at ICCV 2021

  24. arXiv:2010.06988  [pdf, other]

    cs.CY cs.CV cs.LG stat.ML

    Using satellite imagery to understand and promote sustainable development

    Authors: Marshall Burke, Anne Driscoll, David B. Lobell, Stefano Ermon

    Abstract: Accurate and comprehensive measurements of a range of sustainable development outcomes are fundamental inputs into both research and policy. We synthesize the growing literature that uses satellite imagery to understand these outcomes, with a focus on approaches that combine imagery with machine learning. We quantify the paucity of ground data on key human-related outcomes and the growing abundanc…

    Submitted 23 September, 2020; originally announced October 2020.

  25. The Historical Impact of Anthropogenic Climate Change on Global Agricultural Productivity

    Authors: Ariel Ortiz-Bobea, Toby R. Ault, Carlos M. Carrillo, Robert G. Chambers, David B. Lobell

    Abstract: Agricultural research has fostered productivity growth, but the historical influence of anthropogenic climate change on that growth has not been quantified. We develop a robust econometric model of weather effects on global agricultural total factor productivity (TFP) and combine this model with counterfactual climate scenarios to evaluate impacts of past climate trends on TFP. Our baseline model…

    Submitted 2 April, 2021; v1 submitted 20 July, 2020; originally announced July 2020.

    Comments: paper (14 pages; 5 figures) and supplemental materials (24 pages; 14 figures and 4 tables)

    Journal ref: Nat. Clim. Chang. 11, 306-312 (2021)

  26. arXiv:2006.08661  [pdf, other]

    cs.CV

    Predicting Livelihood Indicators from Community-Generated Street-Level Imagery

    Authors: Jihyeon Lee, Dylan Grosz, Burak Uzkent, Sicheng Zeng, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Major decisions from governments and other large organizations rely on measurements of the populace's well-being, but making such measurements at a broad scale is expensive and thus infrequent in much of the developing world. We propose an inexpensive, scalable, and interpretable approach to predict key livelihood indicators from public crowd-sourced street-level imagery. Such imagery can be cheap…

    Submitted 26 February, 2021; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: Accepted to AAAI 2021. Code: https://github.com/sustainlab-group/mapillarygcn

    ACM Class: I.2; I.4; K.4; E.1

  27. arXiv:2006.04224  [pdf, other]

    cs.CV

    Efficient Poverty Mapping using Deep Reinforcement Learning

    Authors: Kumar Ayush, Burak Uzkent, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: The combination of high-resolution satellite imagery and machine learning has proven useful in many sustainability-related tasks, including poverty prediction, infrastructure measurement, and forest monitoring. However, the accuracy afforded by high-resolution imagery comes at a cost, as such imagery is extremely expensive to purchase at scale. This creates a substantial hurdle to the efficient s…

    Submitted 5 January, 2021; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Accepted at AAAI 2021

  28. arXiv:2004.13390  [pdf, other]

    cs.LG stat.ML

    Meta-Learning for Few-Shot Land Cover Classification

    Authors: Marc Rußwurm, Sherrie Wang, Marco Körner, David Lobell

    Abstract: The representations of the Earth's surface vary from one geographic region to another. For instance, the appearance of urban areas differs between continents, and seasonality influences the appearance of vegetation. To capture the diversity within a single category, such as urban or vegetation, requires a large model capacity and, consequently, large datasets. In this work, we propose a different…

    Submitted 28 April, 2020; originally announced April 2020.

    Comments: accepted to the CVPR 2020 EarthVision Workshop

  29. arXiv:2004.05471  [pdf, other]

    eess.IV cs.CV

    Farmland Parcel Delineation Using Spatio-temporal Convolutional Networks

    Authors: Han Lin Aung, Burak Uzkent, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Farm parcel delineation provides cadastral data that is important in developing and managing climate change policies. Specifically, farm parcel delineation informs applications in downstream governmental policies of land allocation, irrigation, fertilization, green-house gases (GHG's), etc. This data can also be useful for the agricultural insurance sector for assessing compensations following dam…

    Submitted 20 April, 2020; v1 submitted 11 April, 2020; originally announced April 2020.

  30. arXiv:2002.01612  [pdf, other]

    cs.CV

    Generating Interpretable Poverty Maps using Object Detection in Satellite Images

    Authors: Kumar Ayush, Burak Uzkent, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Accurate local-level poverty measurement is an essential task for governments and humanitarian organizations to track the progress towards improving livelihoods and distribute scarce resources. Recent computer vision advances in using satellite imagery to predict poverty have shown increasing accuracy, but they do not generate features that are interpretable to policymakers, inhibiting adoption by…

    Submitted 17 February, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

  31. arXiv:1905.02506  [pdf, other]

    cs.CV cs.LG

    Learning to Interpret Satellite Images in Global Scale Using Wikipedia

    Authors: Burak Uzkent, Evan Sheehan, Chenlin Meng, Zhongyi Tang, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Despite recent progress in computer vision, finegrained interpretation of satellite images remains challenging because of a lack of labeled training data. To overcome this limitation, we construct a novel dataset called WikiSatNet by pairing georeferenced Wikipedia articles with satellite imagery of their corresponding locations. We then propose two strategies to learn representations of satellite…

    Submitted 11 August, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: Accepted to IJCAI 2019

  32. Mapping Missing Population in Rural India: A Deep Learning Approach with Satellite Imagery

    Authors: Wenjie Hu, Jay Harshadbhai Patel, Zoe-Alanah Robert, Paul Novosad, Samuel Asher, Zhongyi Tang, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Millions of people worldwide are absent from their country's census. Accurate, current, and granular population metrics are critical to improving government allocation of resources, to measuring disease control, to responding to natural disasters, and to studying any aspect of human life in these communities. Satellite imagery can provide sufficient information to build a population map without th…

    Submitted 4 May, 2019; originally announced May 2019.

    Comments: 7 pages

    ACM Class: I.2.10; I.2.6; J.2; J.4

    Journal ref: AAAI/ACM Conference on AI, Ethics, and Society (AIES '19), January 27-28, 2019, Honolulu, HI, USA

  33. arXiv:1905.01627  [pdf, other]

    cs.LG cs.CY

    Predicting Economic Development using Geolocated Wikipedia Articles

    Authors: Evan Sheehan, Chenlin Meng, Matthew Tan, Burak Uzkent, Neal Jean, David Lobell, Marshall Burke, Stefano Ermon

    Abstract: Progress on the UN Sustainable Development Goals (SDGs) is hampered by a persistent lack of data regarding key social, environmental, and economic indicators, particularly in developing countries. For example, data on poverty --- the first of seventeen SDGs --- is both spatially sparse and infrequently collected in Sub-Saharan Africa due to the high cost of surveys. Here we propose a novel method…

    Submitted 11 May, 2019; v1 submitted 5 May, 2019; originally announced May 2019.

    Comments: Accepted to KDD 2019

  34. arXiv:1902.11110  [pdf, other]

    cs.CV cs.LG

    Semi-Supervised Multitask Learning on Multispectral Satellite Images Using Wasserstein Generative Adversarial Networks (GANs) for Predicting Poverty

    Authors: Anthony Perez, Swetava Ganguli, Stefano Ermon, George Azzari, Marshall Burke, David Lobell

    Abstract: Obtaining reliable data describing local poverty metrics at a granularity that is informative to policy-makers requires expensive and logistically difficult surveys, particularly in the developing world. Not surprisingly, the poverty stricken regions are also the ones which have a high probability of being a war zone, have poor infrastructure and sometimes have governments that do not cooperate wi…

    Submitted 25 April, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: This project was recognized as the best two-person project during the Spring 2017 offering of CS 231N Convolutional Neural Networks for Visual Recognition. Second revised version corrects typographical errors and adds a few additional references

    Report number: Final report of research project conducted by the authors as part of the Sustainability and Artificial Intelligence Laboratory (SAIL) at Stanford University and as part of the Spring 2017 offering of CS 231N

  35. arXiv:1809.10236  [pdf, other]

    cs.CV cs.CL

    Learning to Interpret Satellite Images Using Wikipedia

    Authors: Evan Sheehan, Burak Uzkent, Chenlin Meng, Zhongyi Tang, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Despite recent progress in computer vision, fine-grained interpretation of satellite images remains challenging because of a lack of labeled training data. To overcome this limitation, we propose using Wikipedia as a previously untapped source of rich, georeferenced textual information with global coverage. We construct a novel large-scale, multi-modal dataset by pairing geo-referenced Wikipedia a…

    Submitted 19 September, 2018; originally announced September 2018.

  36. arXiv:1806.00894  [pdf, other]

    cs.CY cs.CV stat.ML

    Infrastructure Quality Assessment in Africa using Satellite Imagery and Deep Learning

    Authors: Barak Oshri, Annie Hu, Peter Adelson, Xiao Chen, Pascaline Dupas, Jeremy Weinstein, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: The UN Sustainable Development Goals allude to the importance of infrastructure quality in three of its seventeen goals. However, monitoring infrastructure quality in developing regions remains prohibitively expensive and impedes efforts to measure progress toward these goals. To this end, we investigate the use of widely available remote sensing data for the prediction of infrastructure quality i…

    Submitted 3 June, 2018; originally announced June 2018.

    Journal ref: KDD 2018 Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

  37. arXiv:1805.02855  [pdf, other]

    cs.CV cs.LG stat.ML

    Tile2Vec: Unsupervised representation learning for spatially distributed data

    Authors: Neal Jean, Sherrie Wang, Anshul Samar, George Azzari, David Lobell, Stefano Ermon

    Abstract: Geospatial analysis lacks methods like the word vector representations and pre-trained networks that significantly boost performance across a wide range of natural language and computer vision tasks. To fill this gap, we introduce Tile2Vec, an unsupervised representation learning algorithm that extends the distributional hypothesis from natural language -- words appearing in similar contexts tend…

    Submitted 30 May, 2018; v1 submitted 8 May, 2018; originally announced May 2018.

    Comments: 8 pages, 4 figures in main text; 9 pages, 11 figures in appendix

  38. arXiv:1711.05809  [pdf, other]

    cs.LG stat.ML

    Hierarchical Modeling of Seed Variety Yields and Decision Making for Future Planting Plans

    Authors: Huaiyang Zhong, Xiaocheng Li, David Lobell, Stefano Ermon, Margaret L. Brandeau

    Abstract: Eradicating hunger and malnutrition is a key development goal of the 21st century. We address the problem of optimally identifying seed varieties to reliably increase crop yield within a risk-sensitive decision-making framework. Specifically, we introduce a novel hierarchical machine learning mechanism for predicting crop yield (the yield of different seed varieties of the same crop). We integrate…

    Submitted 15 November, 2017; originally announced November 2017.

  39. arXiv:1711.03654  [pdf, other]

    stat.ML cs.CV cs.LG

    Poverty Prediction with Public Landsat 7 Satellite Imagery and Machine Learning

    Authors: Anthony Perez, Christopher Yeh, George Azzari, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: Obtaining detailed and reliable data about local economic livelihoods in developing countries is expensive, and data are consequently scarce. Previous work has shown that it is possible to measure local-level economic livelihoods using high-resolution satellite imagery. However, such imagery is relatively expensive to acquire, often not updated frequently, and is mainly available for recent years.…

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: Presented at NIPS 2017 Workshop on Machine Learning for the Developing World

  40. arXiv:1510.00098  [pdf, other]

    cs.CV cs.CY

    Transfer Learning from Deep Features for Remote Sensing and Poverty Mapping

    Authors: Michael Xie, Neal Jean, Marshall Burke, David Lobell, Stefano Ermon

    Abstract: The lack of reliable data in developing countries is a major obstacle to sustainable development, food security, and disaster relief. Poverty data, for example, is typically scarce, sparse in coverage, and labor-intensive to obtain. Remote sensing data such as high-resolution satellite imagery, on the other hand, is becoming increasingly available and inexpensive. Unfortunately, such data is highl…

    Submitted 27 February, 2016; v1 submitted 30 September, 2015; originally announced October 2015.

    Comments: In Proc. 30th AAAI Conference on Artificial Intelligence