-
Profiling checkpointing schedules in adjoint ST-AD
Authors:
Laurent Hascoët,
Jean-Luc Bouchot,
Shreyas Sunil Gaikwad,
Sri Hari Krishna Narayanan,
Jan Hückelheim
Abstract:
Checkpointing is a cornerstone of data-flow reversal in adjoint algorithmic differentiation. It is a storage/recomputation trade-off that can be applied at different levels, one of which is the call tree. We look for good placements of checkpoints on the call tree of a given application, to reduce the run time and memory footprint of its adjoint. There is no known optimal solution to this problem other than a combinatorial search over all placements. We propose a heuristic based on run-time profiling of the adjoint code. We describe the implementation of this profiling tool in an existing source-transformation AD tool. We demonstrate the value of this approach on test cases taken from the MITgcm ocean and atmospheric global circulation model. We discuss the limitations of our approach and propose directions to lift them.
Submitted 24 May, 2024;
originally announced May 2024.
-
MITgcm-AD v2: Open source tangent linear and adjoint modeling framework for the oceans and atmosphere enabled by the Automatic Differentiation tool Tapenade
Authors:
Shreyas Sunil Gaikwad,
Sri Hari Krishna Narayanan,
Laurent Hascoet,
Jean-Michel Campin,
Helen Pillar,
An Nguyen,
Jan Hückelheim,
Paul Hovland,
Patrick Heimbach
Abstract:
The Massachusetts Institute of Technology General Circulation Model (MITgcm) is widely used by the climate science community to simulate planetary atmosphere and ocean circulations. A defining feature of the MITgcm is that it has been developed to be compatible with an algorithmic differentiation (AD) tool, TAF, enabling the generation of tangent-linear and adjoint models. These provide gradient information which enables dynamics-based sensitivity and attribution studies, state and parameter estimation, and rigorous uncertainty quantification. Importantly, gradient information is essential for computing comprehensive sensitivities and performing efficient large-scale data assimilation, ensuring that observations collected from satellites and in-situ measuring instruments can be effectively used to optimize a large uncertain control space. As a result, the MITgcm forms the dynamical core of a key data assimilation product employed by the physical oceanography research community: the Estimating the Circulation and Climate of the Ocean (ECCO) state estimate. Although MITgcm and ECCO are used extensively within the research community, the AD tool TAF is proprietary and hence inaccessible to a large proportion of these users. The new version 2 (MITgcm-AD v2) framework introduced here is based on the source-to-source AD tool Tapenade, which has recently been open-sourced. Another feature of Tapenade is that it stores required variables by default (instead of recomputing them), which simplifies the implementation of efficient, AD-compatible code. The framework has been integrated with the MITgcm model main branch and is now freely available.
Submitted 22 January, 2024;
originally announced January 2024.
-
Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms
Authors:
Mark E. Whiting,
Dilrukshi Gamage,
Snehalkumar S. Gaikwad,
Aaron Gilbee,
Shirish Goyal,
Alipta Ballav,
Dinesh Majeti,
Nalin Chhibber,
Angela Richmond-Fuller,
Freddie Vargus,
Tejas Seshadri Sarma,
Varshine Chandrakanthan,
Teogenes Moura,
Mohamed Hashim Salih,
Gabriel Bayomi Tinoco Kalejaiye,
Adam Ginzberg,
Catherine A. Mullings,
Yoni Dayan,
Kristy Milland,
Henrique Orefice,
Jeff Regino,
Sayna Parsi,
Kunz Mainali,
Vibhor Sehgal,
Sekandar Matin
, et al. (3 additional authors not shown)
Abstract:
Crowd workers are distributed and decentralized. While decentralization is designed to utilize independent judgment to promote high-quality results, it paradoxically undercuts behaviors and institutions that are critical to high-quality work. Reputation is one central example: crowdsourcing systems depend on reputation scores from decentralized workers and requesters, but these scores are notoriously inflated and uninformative. In this paper, we draw inspiration from historical worker guilds (e.g., in the silk trade) to design and implement crowd guilds: centralized groups of crowd workers who collectively certify each other's quality through double-blind peer assessment. A two-week field experiment compared crowd guilds to a traditional decentralized crowd work model. Crowd guilds produced reputation signals more strongly correlated with ground-truth worker quality than signals available on current crowd working platforms, and more accurate than in the traditional model.
Submitted 28 February, 2017; v1 submitted 4 November, 2016;
originally announced November 2016.