Search | arXiv e-print repository

doi 10.1093/mnras/stae1602

The JWST Weather Report from the Nearest Brown Dwarfs I: multi-period JWST NIRSpec + MIRI monitoring of the benchmark binary brown dwarf WISE 1049AB

Authors: Beth A. Biller, Johanna M. Vos, Yifan Zhou, Allison M. McCarthy, Xianyu Tan, Ian J. M. Crossfield, Niall Whiteford, Genaro Suarez, Jacqueline Faherty, Elena Manjavacas, Xueqing Chen, Pengyu Liu, Ben J. Sutlieff, Mary Anne Limbach, Paul Molliere, Trent J. Dupuy, Natalia Oliveros-Gomez, Philip S. Muirhead, Thomas Henning, Gregory Mace, Nicolas Crouzet, Theodora Karalidi, Caroline V. Morley, Pascal Tremblin, Tiffany Kataria

Abstract: We report results from 8 hours of JWST/MIRI LRS spectroscopic monitoring directly followed by 7 hours of JWST/NIRSpec prism spectroscopic monitoring of the benchmark binary brown dwarf WISE 1049AB, the closest, brightest brown dwarfs known. We find water, methane, and CO absorption features in both components, including the 3.3 $μ$m methane absorption feature and a tentative detection of small gra… ▽ More We report results from 8 hours of JWST/MIRI LRS spectroscopic monitoring directly followed by 7 hours of JWST/NIRSpec prism spectroscopic monitoring of the benchmark binary brown dwarf WISE 1049AB, the closest, brightest brown dwarfs known. We find water, methane, and CO absorption features in both components, including the 3.3 $μ$m methane absorption feature and a tentative detection of small grain ($<$ 1$μ$m) silicate absorption at $>$8.5 $μ$m in WISE 1049A. Both components vary significantly ($>$1$\%$), with WISE 1049B displaying larger variations than WISE 1049A. Using K-means clustering, we find three main transition points in wavelength for both components of the binary: 1) change in behavior at $\sim$2.3 $μ$m coincident with a CO absorption bandhead, 2) change in behavior at 4.2 $μ$m, close to the CO fundamental band at $λ>$ 4.4 $μ$m, and 3) change in behavior at 8.3-8.5 $μ$m, potentially corresponding to silicate absorption. We interpret the lightcurves observed with both NIRSpec and MIRI as likely stemming from 1) a deep pressure level driving the double-peaked variability seen in WISE 1049B at wavelengths $<$2.3 $μ$m and $>$8.5 $μ$m, 2) an intermediate pressure level sha** the lightcurve morphology between 2.3 and 4.2 $μ$m, and 3) a higher-altitude pressure level producing single-peaked and plateaued lightcurve behavior between 4.2 and 8.5 $μ$m. △ Less

Submitted 12 July, 2024; originally announced July 2024.

Comments: 28 pages, 27 figures, accepted to MNRAS

arXiv:2405.11309 [pdf, other]

Klein-Nishina Corrections to the Spectra and Light Curves of Gamma-ray Burst Afterglows

Authors: George A. McCarthy, Tanmoy Laskar

Abstract: Multi-wavelength modeling of the synchrotron radiation from relativistic transients such as Gamma-ray Burst (GRB) afterglows is a powerful means of exploring the physics of relativistic shocks and of deriving properties of the explosion, such as the kinetic energy of the associated relativistic outflows. Capturing the location and evolution of the synchrotron cooling break is critical to break par… ▽ More Multi-wavelength modeling of the synchrotron radiation from relativistic transients such as Gamma-ray Burst (GRB) afterglows is a powerful means of exploring the physics of relativistic shocks and of deriving properties of the explosion, such as the kinetic energy of the associated relativistic outflows. Capturing the location and evolution of the synchrotron cooling break is critical to break parameter degeneracies associated with such modeling. However, the shape of the spectrum above the cooling break, as well as the location and evolution of the break itself can be significantly altered by synchrotron self-Compton (SSC) cooling. We present an observer's guide to applying SSC cooling with and without Klein-Nishina (KN) corrections to GRB afterglow modeling. We provide a publicly available python code to calculate the Compton $Y$-parameter as a function of electron Lorentz factor, from which we compute changes to the electron distribution, along with KN-corrected afterglow spectra and light curves. In this framework, the canonical synchrotron spectral shapes split into multiple sub-regimes. We summarize each new spectral shape and describe its observational significance. We discuss how KN corrections can account for harder spectra and shallower decline rates observed in some GRB X-ray afterglows. Our overall aim is to provide an easy application of SSC+KN corrections into analytical multi-wavelength modeling frameworks for relativistic transients. △ Less

Submitted 18 May, 2024; originally announced May 2024.

arXiv:2404.02127 [pdf, other]

FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning

Authors: Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning

Abstract: Instruction tuning is an important step in making language models useful for direct user interaction. However, many legal tasks remain out of reach for most open LLMs and there do not yet exist any large scale instruction datasets for the domain. This critically limits research in this application area. In this work, we curate LawInstruct, a large legal instruction dataset, covering 17 jurisdictio… ▽ More Instruction tuning is an important step in making language models useful for direct user interaction. However, many legal tasks remain out of reach for most open LLMs and there do not yet exist any large scale instruction datasets for the domain. This critically limits research in this application area. In this work, we curate LawInstruct, a large legal instruction dataset, covering 17 jurisdictions, 24 languages and a total of 12M examples. We present evidence that domain-specific pretraining and instruction tuning improve performance on LegalBench, including improving Flan-T5 XL by 8 points or 16\% over the baseline. However, the effect does not generalize across all tasks, training regimes, model sizes, and other factors. LawInstruct is a resource for accelerating the development of models with stronger information processing and decision making capabilities in the legal domain. △ Less

Submitted 2 April, 2024; originally announced April 2024.

MSC Class: 68T50 ACM Class: I.2

arXiv:2402.15001 [pdf, other]

Multiple Patchy Cloud Layers in the Planetary Mass Object SIMP0136+0933

Authors: Allison M. McCarthy, Philip S. Muirhead, Patrick Tamburo, Johanna M. Vos, Caroline V. Morley, Jacqueline Faherty, Daniella C. Bardalez Gagliuffi, Eric Agol, Christopher Theissen

Abstract: Multi-wavelength photometry of brown dwarfs and planetary-mass objects provides insight into their atmospheres and cloud layers. We present near-simultaneous $J-$ and $K_s-$band multi-wavelength observations of the highly variable T2.5 planetary-mass object, SIMP J013656.5+093347. We reanalyze observations acquired over a single night in 2015 using a recently developed data reduction pipeline. For… ▽ More Multi-wavelength photometry of brown dwarfs and planetary-mass objects provides insight into their atmospheres and cloud layers. We present near-simultaneous $J-$ and $K_s-$band multi-wavelength observations of the highly variable T2.5 planetary-mass object, SIMP J013656.5+093347. We reanalyze observations acquired over a single night in 2015 using a recently developed data reduction pipeline. For the first time, we detect a phase shift between $J-$ and $K_s-$band light curves, which we measure to be $39.9^{\circ +3.6}_{ -1.1}$. Previously, phase shifts between near-infrared and mid-infrared observations of this object were detected and attributed to probing different depths of the atmosphere, and thus different cloud layers. Using the Sonora Bobcat models, we expand on this idea to show that at least two different patchy cloud layers must be present to explain the measured phase shift. Our results are generally consistent with recent atmospheric retrievals of this object and other similar L/T transition objects. △ Less

Submitted 26 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: 15 pages, 8 figures, Accepted for publication in ApJ

arXiv:2310.13678 [pdf, other]

Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models

Authors: Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Ke Wu

Abstract: One challenge in speech translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we adapt large language models (LLMs) to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We overcome the tendency of hallucination in LLMs… ▽ More One challenge in speech translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we adapt large language models (LLMs) to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We overcome the tendency of hallucination in LLMs by incorporating finite-state constraints during decoding; these eliminate invalid outputs without requiring additional training. We discover that LLMs are adaptable to transcripts containing ASR errors through prompt-tuning or fine-tuning. Relative to a state-of-the-art automatic punctuation baseline, our best LLM improves the average BLEU by 2.9 points for English-German, English-Spanish, and English-Arabic TED talk translation in 9 test sets, just by improving segmentation. △ Less

Submitted 23 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

Comments: accepted to the Findings of EMNLP 2023. arXiv admin note: text overlap with arXiv:2212.09895

arXiv:2302.07912 [pdf, other]

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

Authors: Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

Abstract: Large multilingual models have inspired a new class of word alignment methods, which work well for the model's pretraining languages. However, the languages most in need of automatic alignment are low-resource and, thus, not typically included in the pretraining data. In this work, we ask: How do modern aligners perform on unseen languages, and are they better than traditional methods? We contribu… ▽ More Large multilingual models have inspired a new class of word alignment methods, which work well for the model's pretraining languages. However, the languages most in need of automatic alignment are low-resource and, thus, not typically included in the pretraining data. In this work, we ask: How do modern aligners perform on unseen languages, and are they better than traditional methods? We contribute gold-standard alignments for Bribri--Spanish, Guarani--Spanish, Quechua--Spanish, and Shipibo-Konibo--Spanish. With these, we evaluate state-of-the-art aligners with and without model adaptation to the target language. Finally, we also evaluate the resulting alignments extrinsically through two downstream tasks: named entity recognition and part-of-speech tagging. We find that although transformer-based methods generally outperform traditional models, the two classes of approach remain competitive with each other. △ Less

Submitted 15 February, 2023; originally announced February 2023.

Comments: EACL 2023

arXiv:2302.01528 [pdf]

Four principles for improved statistical ecology

Authors: Gordana Popovic, Tanya J. Mason, Tiago A. Marques, Joanne Potts, Szymon M. Drobniak, Rocío Joo, Res Altwegg, Carolyn C. I. Burns, Michael A. McCarthy, Alison Johnston, Shinichi Nakagawa, Louise McMillan, Kadambari Devarajan, Patrick l. Taggart, Alison C. Wunderlich, Magdalena M. Mair, Juan Andrés Martínez-Lanfranco, Malgorzata Lagisz, Patrice P. Pottier

Abstract: Increasing attention has been drawn to the misuse of statistical methods over recent years, with particular concern about the prevalence of practices such as poor experimental design, cherry-picking and inadequate reporting. These failures are largely unintentional and no more common in ecology than in other scientific disciplines, with many of them easily remedied given the right guidance. Orig… ▽ More Increasing attention has been drawn to the misuse of statistical methods over recent years, with particular concern about the prevalence of practices such as poor experimental design, cherry-picking and inadequate reporting. These failures are largely unintentional and no more common in ecology than in other scientific disciplines, with many of them easily remedied given the right guidance. Originating from a discussion at the 2020 International Statistical Ecology Conference, we show how ecologists can build their research following four guiding principles for impactful statistical research practices: 1. Define a focused research question, then plan sampling and analysis to answer it; 2. Develop a model that accounts for the distribution and dependence of your data; 3. Emphasise effect sizes to replace statistical significance with ecological relevance; 4. Report your methods and findings in sufficient detail so that your research is valid and reproducible. Listed in approximate order of importance, these principles provide a framework for experimental design and reporting that guards against unsound practices. Starting with a well-defined research question allows researchers to create an efficient study to answer it, and guards against poor research practices that lead to false positives and poor replicability. Correct and appropriate statistical models give sound conclusions, good reporting practices and a focus on ecological relevance make results impactful and replicable. Illustrated with an example from a recent study into the impact of disturbance on upland swamps, this paper explains the rationale for the selection and use of effective statistical practices and provides practical guidance for ecologists seeking to improve their use of statistical methods. △ Less

Submitted 2 February, 2023; originally announced February 2023.

Comments: 19 pages, 2 figures

arXiv:2212.09895 [pdf, other]

Improved Long-Form Spoken Language Translation with Large Language Models

Authors: Arya D. McCarthy, Hao Zhang, Shankar Kumar, Felix Stahlberg, Axel H. Ng

Abstract: A challenge in spoken language translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we fine-tune a general-purpose, large language model to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We compare to several segmen… ▽ More A challenge in spoken language translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we fine-tune a general-purpose, large language model to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We compare to several segmentation strategies and find that our approach improves BLEU score on three languages by an average of 2.7 BLEU overall compared to an automatic punctuation baseline. Further, we demonstrate the effectiveness of two constrained decoding strategies to improve well-formedness of the model output from above 99% to 100%. △ Less

Submitted 19 December, 2022; originally announced December 2022.

arXiv:2212.01233 [pdf, other]

Safe machine learning model release from Trusted Research Environments: The AI-SDC package

Authors: Jim Smith, Richard J. Preen, Andrew McCarthy, Alba Crespi-Boixader, James Liley, Simon Rogers

Abstract: We present AI-SDC, an integrated suite of open source Python tools to facilitate Statistical Disclosure Control (SDC) of Machine Learning (ML) models trained on confidential data prior to public release. AI-SDC combines (i) a SafeModel package that extends commonly used ML models to provide ante-hoc SDC by assessing the vulnerability of disclosure posed by the training regime; and (ii) an Attacks… ▽ More We present AI-SDC, an integrated suite of open source Python tools to facilitate Statistical Disclosure Control (SDC) of Machine Learning (ML) models trained on confidential data prior to public release. AI-SDC combines (i) a SafeModel package that extends commonly used ML models to provide ante-hoc SDC by assessing the vulnerability of disclosure posed by the training regime; and (ii) an Attacks package that provides post-hoc SDC by rigorously assessing the empirical disclosure risk of a model through a variety of simulated attacks after training. The AI-SDC code and documentation are available under an MIT license at https://github.com/AI-SDC/AI-SDC. △ Less

Submitted 6 December, 2022; v1 submitted 2 December, 2022; originally announced December 2022.

arXiv:2211.16858 [pdf, other]

A Major Obstacle for NLP Research: Let's Talk about Time Allocation!

Authors: Katharina Kann, Shiran Dudy, Arya D. McCarthy

Abstract: The field of natural language processing (NLP) has grown over the last few years: conferences have become larger, we have published an incredible amount of papers, and state-of-the-art research has been implemented in a large variety of customer-facing products. However, this paper argues that we have been less successful than we should have been and reflects on where and how the field fails to ta… ▽ More The field of natural language processing (NLP) has grown over the last few years: conferences have become larger, we have published an incredible amount of papers, and state-of-the-art research has been implemented in a large variety of customer-facing products. However, this paper argues that we have been less successful than we should have been and reflects on where and how the field fails to tap its full potential. Specifically, we demonstrate that, in recent years, subpar time allocation has been a major obstacle for NLP research. We outline multiple concrete problems together with their negative consequences and, importantly, suggest remedies to improve the status quo. We hope that this paper will be a starting point for discussions around which common practices are -- or are not -- beneficial for NLP research. △ Less

Submitted 30 November, 2022; originally announced November 2022.

Comments: To appear at EMNLP 2022

arXiv:2211.01656 [pdf]

doi 10.5281/zenodo.7089491

GRAIMATTER Green Paper: Recommendations for disclosure control of trained Machine Learning (ML) models from Trusted Research Environments (TREs)

Authors: Emily Jefferson, James Liley, Maeve Malone, Smarti Reel, Alba Crespi-Boixader, Xaroula Kerasidou, Francesco Tava, Andrew McCarthy, Richard Preen, Alberto Blanco-Justicia, Esma Mansouri-Benssassi, Josep Domingo-Ferrer, Jillian Beggs, Antony Chuter, Christian Cole, Felix Ritchie, Angela Daly, Simon Rogers, Jim Smith

Abstract: TREs are widely, and increasingly used to support statistical analysis of sensitive data across a range of sectors (e.g., health, police, tax and education) as they enable secure and transparent research whilst protecting data confidentiality. There is an increasing desire from academia and industry to train AI models in TREs. The field of AI is develo** quickly with applications including spott… ▽ More TREs are widely, and increasingly used to support statistical analysis of sensitive data across a range of sectors (e.g., health, police, tax and education) as they enable secure and transparent research whilst protecting data confidentiality. There is an increasing desire from academia and industry to train AI models in TREs. The field of AI is develo** quickly with applications including spotting human errors, streamlining processes, task automation and decision support. These complex AI models require more information to describe and reproduce, increasing the possibility that sensitive personal data can be inferred from such descriptions. TREs do not have mature processes and controls against these risks. This is a complex topic, and it is unreasonable to expect all TREs to be aware of all risks or that TRE researchers have addressed these risks in AI-specific training. GRAIMATTER has developed a draft set of usable recommendations for TREs to guard against the additional risks when disclosing trained AI models from TREs. The development of these recommendations has been funded by the GRAIMATTER UKRI DARE UK sprint research project. This version of our recommendations was published at the end of the project in September 2022. During the course of the project, we have identified many areas for future investigations to expand and test these recommendations in practice. Therefore, we expect that this document will evolve over time. △ Less

Submitted 3 November, 2022; originally announced November 2022.

arXiv:2210.04462 [pdf, other]

doi 10.3847/1538-3881/ac9a52

The Perkins INfrared Exosatellite Survey (PINES) II. Transit Candidates and Implications for Planet Occurrence around L and T Dwarfs

Authors: Patrick Tamburo, Philip S. Muirhead, Allison M. McCarthy, Murdock Hart, Johanna M. Vos, Eric Agol, Christopher Theissen, David Gracia, Daniella C. Bardalez Gagliuffi, Jacqueline Faherty

Abstract: We describe a new transit detection algorithm designed to detect single transit events in discontinuous Perkins INfrared Exosatellite Survey (PINES) observations of L and T dwarfs. We use this algorithm to search for transits in 131 PINES light curves and identify two transit candidates: 2MASS J18212815+1414010 (2MASS J1821+1414) and 2MASS J08350622+1953050 (2MASS J0835+1953). We disfavor 2MASS J1… ▽ More We describe a new transit detection algorithm designed to detect single transit events in discontinuous Perkins INfrared Exosatellite Survey (PINES) observations of L and T dwarfs. We use this algorithm to search for transits in 131 PINES light curves and identify two transit candidates: 2MASS J18212815+1414010 (2MASS J1821+1414) and 2MASS J08350622+1953050 (2MASS J0835+1953). We disfavor 2MASS J1821+1414 as a genuine transit candidate due to the known variability properties of the source. We cannot rule out the planetary nature of 2MASS J0835+1953's candidate event and perform follow-up observations in an attempt to recover a second transit. A repeat event has yet to be observed, but these observations suggest that target variability is an unlikely cause of the candidate transit. We perform a Markov chain Monte Carlo simulation of the light curve and estimate a planet radius ranging from $4.2^{+3.5}_{-1.6}R_\oplus$ to $5.8^{+4.8}_{-2.1}R_\oplus$, depending on the host's age. Finally, we perform an injection and recovery simulation on our light curve sample. We inject planets into our data using measured M dwarf planet occurrence rates and attempt to recover them using our transit search algorithm. Our detection rates suggest that, assuming M dwarf planet occurrence rates, we should have roughly a 1$\%$ chance of detecting a candidate that could cause the transit depth we observe for 2MASS J0835+1953. If 2MASS J0835+1953 b is confirmed, it would suggest an enhancement in the occurrence of short-period planets around L and T dwarfs in comparison to M dwarfs, which would challenge predictions from planet formation models. △ Less

Submitted 10 October, 2022; originally announced October 2022.

Comments: 23 pages, 15 figures, accepted to AJ

arXiv:2205.03608 [pdf, other]

UniMorph 4.0: Universal Morphology

Authors: Khuyagbaatar Batsuren, Omer Goldman, Salam Khalifa, Nizar Habash, Witold Kieraś, Gábor Bella, Brian Leonard, Garrett Nicolai, Kyle Gorman, Yustinus Ghanggo Ate, Maria Ryskina, Sabrina J. Mielke, Elena Budianskaya, Charbel El-Khaissi, Tiago Pimentel, Michael Gasser, William Lane, Mohit Raj, Matt Coler, Jaime Rafael Montoya Samame, Delio Siticonatzi Camaiteri, Benoît Sagot, Esaú Zumaeta Rojas, Didier López Francis, Arturo Oncevay , et al. (71 additional authors not shown)

Abstract: The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This pa… ▽ More The Universal Morphology (UniMorph) project is a collaborative effort providing broad-coverage instantiated normalized morphological inflection tables for hundreds of diverse world languages. The project comprises two major thrusts: a language-independent feature schema for rich morphological annotation and a type-level resource of annotated data in diverse languages realizing that schema. This paper presents the expansions and improvements made on several fronts over the last couple of years (since McCarthy et al. (2020)). Collaborative efforts by numerous linguists have added 67 new languages, including 30 endangered languages. We have implemented several improvements to the extraction pipeline to tackle some issues, e.g. missing gender and macron information. We have also amended the schema to use a hierarchical structure that is needed for morphological phenomena like multiple-argument agreement and case stacking, while adding some missing morphological features to make the schema more inclusive. In light of the last UniMorph release, we also augmented the database with morpheme segmentation for 16 languages. Lastly, this new release makes a push towards inclusion of derivational morphology in UniMorph by enriching the data and annotation schema with instances representing derivational processes from MorphyNet. △ Less

Submitted 19 June, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

Comments: LREC 2022; The first two authors made equal contributions

arXiv:2203.08909 [pdf, other]

Morphological Processing of Low-Resource Languages: Where We Are and What's Next

Authors: Adam Wiemerslage, Miikka Silfverberg, Changbing Yang, Arya D. McCarthy, Garrett Nicolai, Eliana Colunga, Katharina Kann

Abstract: Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation efforts for endangered languages. Having long been multilingual, the field of computational morphology is increasingly moving towards approaches suitable for languages with minimal or no annotated resources. First, we survey recent… ▽ More Automatic morphological processing can aid downstream natural language processing applications, especially for low-resource languages, and assist language documentation efforts for endangered languages. Having long been multilingual, the field of computational morphology is increasingly moving towards approaches suitable for languages with minimal or no annotated resources. First, we survey recent developments in computational morphology with a focus on low-resource languages. Second, we argue that the field is ready to tackle the logical next challenge: understanding a language's morphology from raw text alone. We perform an empirical study on a truly unsupervised version of the paradigm completion task and show that, while existing state-of-the-art models bridged by two newly proposed models we devise perform reasonably, there is still much room for improvement. The stakes are high: solving this task will increase the language coverage of morphological resources by a number of magnitudes. △ Less

Submitted 16 March, 2022; originally announced March 2022.

Comments: Findings of ACL 2022

arXiv:2203.08850 [pdf, other]

Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

Authors: En-Shiun Annie Lee, Sarubi Thillainathan, Shravan Nayak, Surangika Ranathunga, David Ifeoluwa Adelani, Ruisi Su, Arya D. McCarthy

Abstract: What can pre-trained multilingual sequence-to-sequence models like mBART contribute to translating low-resource languages? We conduct a thorough empirical experiment in 10 languages to ascertain this, considering five factors: (1) the amount of fine-tuning data, (2) the noise in the fine-tuning data, (3) the amount of pre-training data in the model, (4) the impact of domain mismatch, and (5) langu… ▽ More What can pre-trained multilingual sequence-to-sequence models like mBART contribute to translating low-resource languages? We conduct a thorough empirical experiment in 10 languages to ascertain this, considering five factors: (1) the amount of fine-tuning data, (2) the noise in the fine-tuning data, (3) the amount of pre-training data in the model, (4) the impact of domain mismatch, and (5) language typology. In addition to yielding several heuristics, the experiments form a framework for evaluating the data sensitivities of machine translation systems. While mBART is robust to domain differences, its translations for unseen and typologically distant languages remain below 3.0 BLEU. In answer to our title's question, mBART is not a low-resource panacea; we therefore encourage shifting the emphasis from new models to new data. △ Less

Submitted 30 April, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: Accepted to Findings of ACL 2022

arXiv:2201.01794 [pdf, other]

doi 10.3847/1538-3881/ac64aa

The Perkins INfrared Exosatellite Survey (PINES) I. Survey Overview, Reduction Pipeline, and Early Results

Authors: Patrick Tamburo, Philip S. Muirhead, Allison M. McCarthy, Murdock Hart, David Gracia, Johanna M. Vos, Daniella C. Bardalez Gagliuffi, Jacqueline Faherty, Christopher Theissen, Eric Agol, Julie N. Skinner, Sheila Sagear

Abstract: We describe the Perkins INfrared Exosatellite Survey (PINES), a near-infrared photometric search for short-period transiting planets and moons around a sample of 393 spectroscopically confirmed L- and T-type dwarfs. PINES is performed with Boston University's 1.8 m Perkins Telescope Observatory, located on Anderson Mesa, Arizona. We discuss the observational strategy of the survey, which was desig… ▽ More We describe the Perkins INfrared Exosatellite Survey (PINES), a near-infrared photometric search for short-period transiting planets and moons around a sample of 393 spectroscopically confirmed L- and T-type dwarfs. PINES is performed with Boston University's 1.8 m Perkins Telescope Observatory, located on Anderson Mesa, Arizona. We discuss the observational strategy of the survey, which was designed to optimize the number of expected transit detections, and describe custom automated observing procedures for performing PINES observations. We detail the steps of the $\texttt{PINES Analysis Toolkit}$ ($\texttt{PAT}$), software that is used to create light curves from PINES images. We assess the impact of second-order extinction due to changing precipitable water vapor on our observations and find that the magnitude of this effect is minimized in Mauna Kea Observatories $\textit{J}$-band. We demonstrate the validity of $\texttt{PAT}$ through the recovery of a transit of WASP-2 b and known variable brown dwarfs, and use it to identify a new variable L/T transition object: the T2 dwarf WISE J045746.08-020719.2. We report on the measured photometric precision of the survey and use it to estimate our transit detection sensitivity. We find that for our median brightness targets, assuming contributions from white noise only, we are sensitive to the detection of 2.5 $R_\oplus$ planets and larger. PINES will test whether the increase in sub-Neptune-sized planet occurrence with decreasing host mass continues into the L and T dwarf regime. △ Less

Submitted 21 April, 2022; v1 submitted 5 January, 2022; originally announced January 2022.

Comments: 25 pages, 15 figures, accepted to AJ

arXiv:2101.10245 [pdf, other]

AirWare: Utilizing Embedded Audio and Infrared Signals for In-Air Hand-Gesture Recognition

Authors: Nibhrat Lohia, Raunak Mundada, Arya D. McCarthy, Eric C. Larson

Abstract: We introduce AirWare, an in-air hand-gesture recognition system that uses the already embedded speaker and microphone in most electronic devices, together with embedded infrared proximity sensors. Gestures identified by AirWare are performed in the air above a touchscreen or a mobile phone. AirWare utilizes convolutional neural networks to classify a large vocabulary of hand gestures using multi-m… ▽ More We introduce AirWare, an in-air hand-gesture recognition system that uses the already embedded speaker and microphone in most electronic devices, together with embedded infrared proximity sensors. Gestures identified by AirWare are performed in the air above a touchscreen or a mobile phone. AirWare utilizes convolutional neural networks to classify a large vocabulary of hand gestures using multi-modal audio Doppler signatures and infrared (IR) sensor information. As opposed to other systems which use high frequency Doppler radars or depth cameras to uniquely identify in-air gestures, AirWare does not require any external sensors. In our analysis, we use openly available APIs to interface with the Samsung Galaxy S5 audio and proximity sensors for data collection. We find that AirWare is not reliable enough for a deployable interaction system when trying to classify a gesture set of 21 gestures, with an average true positive rate of only 50.5% per gesture. To improve performance, we train AirWare to identify subsets of the 21 gestures vocabulary based on possible usage scenarios. We find that AirWare can identify three gesture sets with average true positive rate greater than 80% using 4--7 gestures per set, which comprises a vocabulary of 16 unique in-air gestures. △ Less

Submitted 25 January, 2021; originally announced January 2021.

arXiv:2012.10295 [pdf]

Twelve years of SAMtools and BCFtools

Authors: Petr Danecek, James K. Bonfield, Jennifer Liddle, John Marshall, Valeriu Ohan, Martin O Pollard, Andrew Whitwham, Thomas Keane, Shane A. McCarthy, Robert M. Davies, Heng Li

Abstract: Background SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. Findings The first version appeared online twelve years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used… ▽ More Background SAMtools and BCFtools are widely used programs for processing and analysing high-throughput sequencing data. Findings The first version appeared online twelve years ago and has been maintained and further developed ever since, with many new features and improvements added over the years. The SAMtools and BCFtools packages represent a unique collection of tools that have been used in numerous other software projects and countless genomic pipelines. Conclusion Both SAMtools and BCFtools are freely available on GitHub under the permissive MIT licence, free for both non-commercial and commercial use. Both packages have been installed over a million times via Bioconda. The source code and documentation are available from http://www.htslib.org. △ Less

Submitted 2 February, 2021; v1 submitted 18 December, 2020; originally announced December 2020.

arXiv:2009.01871 [pdf, other]

doi 10.1007/978-3-030-60548-3_18

Federated Learning for Breast Density Classification: A Real-World Implementation

Authors: Holger R. Roth, Ken Chang, Praveer Singh, Nir Neumark, Wenqi Li, Vikash Gupta, Sharut Gupta, Liangqiong Qu, Alvin Ihsani, Bernardo C. Bizzo, Yuhong Wen, Varun Buch, Meesam Shah, Felipe Kitamura, Matheus Mendonça, Vitor Lavor, Ahmed Harouni, Colin Compas, Jesse Tetreault, Prerna Dogra, Yan Cheng, Selnur Erdal, Richard White, Behrooz Hashemian, Thomas Schultz , et al. (18 additional authors not shown)

Abstract: Building robust deep learning-based models requires large quantities of diverse training data. In this study, we investigate the use of federated learning (FL) to build medical imaging classification models in a real-world collaborative setting. Seven clinical institutions from across the world joined this FL effort to train a model for breast density classification based on Breast Imaging, Report… ▽ More Building robust deep learning-based models requires large quantities of diverse training data. In this study, we investigate the use of federated learning (FL) to build medical imaging classification models in a real-world collaborative setting. Seven clinical institutions from across the world joined this FL effort to train a model for breast density classification based on Breast Imaging, Reporting & Data System (BI-RADS). We show that despite substantial differences among the datasets from all sites (mammography system, class distribution, and data set size) and without centralizing data, we can successfully train AI models in federation. The results show that models trained using FL perform 6.3% on average better than their counterparts trained on an institute's local data alone. Furthermore, we show a 45.8% relative improvement in the models' generalizability when evaluated on the other participating sites' testing data. △ Less

Submitted 20 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

Comments: Accepted at the 1st MICCAI Workshop on "Distributed And Collaborative Learning"; add citation to Fig. 1 & 2 and update Fig. 5; fix typo in affiliations

Journal ref: In: Albarqouni S. et al. (eds) Domain Adaptation and Representation Transfer, and Distributed and Collaborative Learning. DART 2020, DCL 2020. Lecture Notes in Computer Science, vol 12444. Springer, Cham

arXiv:2008.01019 [pdf, other]

Combining Breast Cancer Risk Prediction Models

Authors: Zoe Guan, Theodore Huang, Anne Marie McCarthy, Kevin S. Hughes, Alan Semine, Hajime Uno, Lorenzo Trippa, Giovanni Parmigiani, Danielle Braun

Abstract: Accurate risk stratification is key to reducing cancer morbidity through targeted screening and preventative interventions. Numerous breast cancer risk prediction models have been developed, but they often give predictions with conflicting clinical implications. Integrating information from different models may improve the accuracy of risk predictions, which would be valuable for both clinicians a… ▽ More Accurate risk stratification is key to reducing cancer morbidity through targeted screening and preventative interventions. Numerous breast cancer risk prediction models have been developed, but they often give predictions with conflicting clinical implications. Integrating information from different models may improve the accuracy of risk predictions, which would be valuable for both clinicians and patients. BRCAPRO and BCRAT are two widely used models based on largely complementary sets of risk factors. BRCAPRO is a Bayesian model that uses detailed family history information to estimate the probability of carrying a BRCA1/2 mutation, as well as future risk of breast and ovarian cancer, based on mutation prevalence and penetrance (age-specific probability of develo** cancer given genotype). BCRAT uses a relative hazard model based on first-degree family history and non-genetic risk factors. We consider two approaches for combining BRCAPRO and BCRAT: 1) modifying the penetrance functions in BRCAPRO using relative hazard estimates from BCRAT, and 2) training an ensemble model that takes as input BRCAPRO and BCRAT predictions. We show that the combination models achieve performance gains over BRCAPRO and BCRAT in simulations and data from the Cancer Genetics Network. △ Less

Submitted 31 July, 2020; originally announced August 2020.

arXiv:2005.13756 [pdf, other]

The SIGMORPHON 2020 Shared Task on Unsupervised Morphological Paradigm Completion

Authors: Katharina Kann, Arya McCarthy, Garrett Nicolai, Mans Hulden

Abstract: In this paper, we describe the findings of the SIGMORPHON 2020 shared task on unsupervised morphological paradigm completion (SIGMORPHON 2020 Task 2), a novel task in the field of inflectional morphology. Participants were asked to submit systems which take raw text and a list of lemmas as input, and output all inflected forms, i.e., the entire morphological paradigm, of each lemma. In order to si… ▽ More In this paper, we describe the findings of the SIGMORPHON 2020 shared task on unsupervised morphological paradigm completion (SIGMORPHON 2020 Task 2), a novel task in the field of inflectional morphology. Participants were asked to submit systems which take raw text and a list of lemmas as input, and output all inflected forms, i.e., the entire morphological paradigm, of each lemma. In order to simulate a realistic use case, we first released data for 5 development languages. However, systems were officially evaluated on 9 surprise languages, which were only revealed a few days before the submission deadline. We provided a modular baseline system, which is a pipeline of 4 components. 3 teams submitted a total of 7 systems, but, surprisingly, none of the submitted systems was able to improve over the baseline on average over all 9 test languages. Only on 3 languages did a submitted system obtain the best results. This shows that unsupervised morphological paradigm completion is still largely unsolved. We present an analysis here, so that this shared task will ground further research on the topic. △ Less

Submitted 27 May, 2020; originally announced May 2020.

Comments: SIGMORPHON 2020

arXiv:2005.00970 [pdf, other]

Unsupervised Morphological Paradigm Completion

Authors: Huiming **, Liwei Cai, Yihui Peng, Chen Xia, Arya D. McCarthy, Katharina Kann

Abstract: We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all inflected forms, of the lemmas. From a natural language processing (NLP) perspective, this is a challenging unsupervised task, and high-performing systems have the potential to improve tools for low-resource languages or… ▽ More We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all inflected forms, of the lemmas. From a natural language processing (NLP) perspective, this is a challenging unsupervised task, and high-performing systems have the potential to improve tools for low-resource languages or to assist linguistic annotators. From a cognitive science perspective, this can shed light on how children acquire morphological knowledge. We further introduce a system for the task, which generates morphological paradigms via the following steps: (i) EDIT TREE retrieval, (ii) additional lemma retrieval, (iii) paradigm size discovery, and (iv) inflection generation. We perform an evaluation on 14 typologically diverse languages. Our system outperforms trivial baselines with ease and, for some languages, even obtains a higher accuracy than minimally supervised systems. △ Less

Submitted 20 May, 2020; v1 submitted 2 May, 2020; originally announced May 2020.

Comments: Accepted by ACL 2020

arXiv:2005.00626 [pdf, other]

Predicting Declension Class from Form and Meaning

Authors: Adina Williams, Tiago Pimentel, Arya D. McCarthy, Hagen Blix, Eleanor Chodroff, Ryan Cotterell

Abstract: The noun lexica of many natural languages are divided into several declension classes with characteristic morphological properties. Class membership is far from deterministic, but the phonological form of a noun and/or its meaning can often provide imperfect clues. Here, we investigate the strength of those clues. More specifically, we operationalize this by measuring how much information, in bits… ▽ More The noun lexica of many natural languages are divided into several declension classes with characteristic morphological properties. Class membership is far from deterministic, but the phonological form of a noun and/or its meaning can often provide imperfect clues. Here, we investigate the strength of those clues. More specifically, we operationalize this by measuring how much information, in bits, we can glean about declension class from knowing the form and/or meaning of nouns. We know that form and meaning are often also indicative of grammatical gender---which, as we quantitatively verify, can itself share information with declension class---so we also control for gender. We find for two Indo-European languages (Czech and German) that form and meaning respectively share significant amounts of information with class (and contribute additional information above and beyond gender). The three-way interaction between class, form, and meaning (given gender) is also significant. Our study is important for two reasons: First, we introduce a new method that provides additional quantitative support for a classic linguistic finding that form and meaning are relevant for the classification of nouns into declensions. Secondly, we show not only that individual declensions classes vary in the strength of their clues within a language, but also that these variations themselves vary across languages. △ Less

Submitted 28 May, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

Comments: 14 pages, 2 figures, the is the camera-ready version accepted at the 2020 Annual Conference of the Association for Computational Linguistics (ACL 2020)

arXiv:2004.09211 [pdf, other]

doi 10.1109/TIP.2020.3046882

Robust 3D reconstruction of dynamic scenes from single-photon lidar using Beta-divergences

Authors: Quentin Legros, Julian Tachella, Rachael Tobin, Aongus McCarthy, Sylvain Meignen, Gerald S. Buller, Yoann Altmann, Stephen McLaughlin, Michael E. Davies

Abstract: In this paper, we present a new algorithm for fast, online 3D reconstruction of dynamic scenes using times of arrival of photons recorded by single-photon detector arrays. One of the main challenges in 3D imaging using single-photon lidar in practical applications is the presence of strong ambient illumination which corrupts the data and can jeopardize the detection of peaks/surface in the signals… ▽ More In this paper, we present a new algorithm for fast, online 3D reconstruction of dynamic scenes using times of arrival of photons recorded by single-photon detector arrays. One of the main challenges in 3D imaging using single-photon lidar in practical applications is the presence of strong ambient illumination which corrupts the data and can jeopardize the detection of peaks/surface in the signals. This background noise not only complicates the observation model classically used for 3D reconstruction but also the estimation procedure which requires iterative methods. In this work, we consider a new similarity measure for robust depth estimation, which allows us to use a simple observation model and a non-iterative estimation procedure while being robust to mis-specification of the background illumination model. This choice leads to a computationally attractive depth estimation procedure without significant degradation of the reconstruction performance. This new depth estimation procedure is coupled with a spatio-temporal model to capture the natural correlation between neighboring pixels and successive frames for dynamic scene analysis. The resulting online inference process is scalable and well suited for parallel implementation. The benefits of the proposed method are demonstrated through a series of experiments conducted with simulated and real single-photon lidar videos, allowing the analysis of dynamic scenes at 325 m observed under extreme ambient illumination conditions. △ Less

Submitted 18 December, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

Comments: 12 pages

arXiv:2002.12231 [pdf, other]

SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

Authors: Arya D. McCarthy, Liezl Puzon, Juan Pino

Abstract: We propose autoencoding speaker conversion for training data augmentation in automatic speech translation. This technique directly transforms an audio sequence, resulting in audio synthesized to resemble another speaker's voice. Our method compares favorably to SpecAugment on English$\to$French and English$\to$Romanian automatic speech translation (AST) tasks as well as on a low-resource English a… ▽ More We propose autoencoding speaker conversion for training data augmentation in automatic speech translation. This technique directly transforms an audio sequence, resulting in audio synthesized to resemble another speaker's voice. Our method compares favorably to SpecAugment on English$\to$French and English$\to$Romanian automatic speech translation (AST) tasks as well as on a low-resource English automatic speech recognition (ASR) task. Further, in ablations, we show the benefits of both quantity and diversity in augmented data. Finally, we show that we can combine our approach with augmentation by machine-translated transcripts to obtain a competitive end-to-end AST model that outperforms a very strong cascade model on an English$\to$French AST task. Our method is sufficiently general that it can be applied to other speech generation and analysis tasks. △ Less

Submitted 27 February, 2020; originally announced February 2020.

Comments: Accepted to ICASSP 2020

arXiv:1910.11493 [pdf, ps, other]

doi 10.18653/v1/W19-4226

The SIGMORPHON 2019 Shared Task: Morphological Analysis in Context and Cross-Lingual Transfer for Inflection

Authors: Arya D. McCarthy, Ekaterina Vylomova, Shijie Wu, Chaitanya Malaviya, Lawrence Wolf-Sonkin, Garrett Nicolai, Christo Kirov, Miikka Silfverberg, Sabrina J. Mielke, Jeffrey Heinz, Ryan Cotterell, Mans Hulden

Abstract: The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages. The first task evolves past years' inflection tasks by examining transfer of morphological inflection knowledge from a high-resource language to a low… ▽ More The SIGMORPHON 2019 shared task on cross-lingual transfer and contextual analysis in morphology examined transfer learning of inflection between 100 language pairs, as well as contextual lemmatization and morphosyntactic description in 66 languages. The first task evolves past years' inflection tasks by examining transfer of morphological inflection knowledge from a high-resource language to a low-resource language. This year also presents a new second challenge on lemmatization and morphological feature analysis in context. All submissions featured a neural component and built on either this year's strong baselines or highly ranked systems from previous years' shared tasks. Every participating team improved in accuracy over the baselines for the inflection task (though not Levenshtein distance), and every team in the contextual analysis task improved on both state-of-the-art neural and non-neural baselines. △ Less

Submitted 25 February, 2020; v1 submitted 24 October, 2019; originally announced October 2019.

Comments: Presented at SIGMORPHON 2019

Journal ref: Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology (2019) 229-244

arXiv:1910.01531 [pdf, other]

Modeling Color Terminology Across Thousands of Languages

Authors: Arya D. McCarthy, Winston Wu, Aaron Mueller, Bill Watson, David Yarowsky

Abstract: There is an extensive history of scholarship into what constitutes a "basic" color term, as well as a broadly attested acquisition sequence of basic color terms across many languages, as articulated in the seminal work of Berlin and Kay (1969). This paper employs a set of diverse measures on massively cross-linguistic data to operationalize and critique the Berlin and Kay color term hypotheses. Co… ▽ More There is an extensive history of scholarship into what constitutes a "basic" color term, as well as a broadly attested acquisition sequence of basic color terms across many languages, as articulated in the seminal work of Berlin and Kay (1969). This paper employs a set of diverse measures on massively cross-linguistic data to operationalize and critique the Berlin and Kay color term hypotheses. Collectively, the 14 empirically-grounded computational linguistic metrics we design---as well as their aggregation---correlate strongly with both the Berlin and Kay basic/secondary color term partition (gamma=0.96) and their hypothesized universal acquisition sequence. The measures and result provide further empirical evidence from computational linguistics in support of their claims, as well as additional nuance: they suggest treating the partition as a spectrum instead of a dichotomy. △ Less

Submitted 3 October, 2019; originally announced October 2019.

Comments: Accepted for presentation at EMNLP-IJCNLP 2019

arXiv:1909.09237 [pdf, other]

Improved Variational Neural Machine Translation by Promoting Mutual Information

Authors: Arya D. McCarthy, Xian Li, Jiatao Gu, Ning Dong

Abstract: Posterior collapse plagues VAEs for text, especially for conditional text generation with strong autoregressive decoders. In this work, we address this problem in variational neural machine translation by explicitly promoting mutual information between the latent variables and the data. Our model extends the conditional variational autoencoder (CVAE) with two new ingredients: first, we propose a m… ▽ More Posterior collapse plagues VAEs for text, especially for conditional text generation with strong autoregressive decoders. In this work, we address this problem in variational neural machine translation by explicitly promoting mutual information between the latent variables and the data. Our model extends the conditional variational autoencoder (CVAE) with two new ingredients: first, we propose a modified evidence lower bound (ELBO) objective which explicitly promotes mutual information; second, we regularize the probabilities of the decoder by mixing an auxiliary factorized distribution which is directly predicted by the latent variables. We present empirical results on the Transformer architecture and show the proposed model effectively addressed posterior collapse: latent variables are no longer ignored in the presence of powerful decoder. As a result, the proposed model yields improved translation quality while demonstrating superior performance in terms of data efficiency and robustness. △ Less

Submitted 19 September, 2019; originally announced September 2019.

arXiv:1909.06515 [pdf, other]

Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade

Authors: Juan Pino, Liezl Puzon, Jiatao Gu, Xutai Ma, Arya D. McCarthy, Deepak Gopinath

Abstract: For automatic speech translation (AST), end-to-end approaches are outperformed by cascaded models that transcribe with automatic speech recognition (ASR), then translate with machine translation (MT). A major cause of the performance gap is that, while existing AST corpora are small, massive datasets exist for both the ASR and MT subsystems. In this work, we evaluate several data augmentation and… ▽ More For automatic speech translation (AST), end-to-end approaches are outperformed by cascaded models that transcribe with automatic speech recognition (ASR), then translate with machine translation (MT). A major cause of the performance gap is that, while existing AST corpora are small, massive datasets exist for both the ASR and MT subsystems. In this work, we evaluate several data augmentation and pretraining approaches for AST, by comparing all on the same datasets. Simple data augmentation by translating ASR transcripts proves most effective on the English--French augmented LibriSpeech dataset, closing the performance gap from 8.2 to 1.4 BLEU, compared to a very strong cascade that could directly utilize copious ASR and MT data. The same end-to-end approach plus fine-tuning closes the gap on the English--Romanian MuST-C dataset from 6.7 to 3.7 BLEU. In addition to these results, we present practical recommendations for augmentation and pretraining approaches. Finally, we decrease the performance gap to 0.01 BLEU using a Transformer-based architecture. △ Less

Submitted 22 October, 2019; v1 submitted 13 September, 2019; originally announced September 2019.

Comments: IWSLT 2019

arXiv:1906.05906 [pdf, other]

Meaning to Form: Measuring Systematicity as Information

Authors: Tiago Pimentel, Arya D. McCarthy, Damián E. Blasi, Brian Roark, Ryan Cotterell

Abstract: A longstanding debate in semiotics centers on the relationship between linguistic signs and their corresponding semantics: is there an arbitrary relationship between a word form and its meaning, or does some systematic phenomenon pervade? For instance, does the character bigram \textit{gl} have any systematic relationship to the meaning of words like \textit{glisten}, \textit{gleam} and \textit{gl… ▽ More A longstanding debate in semiotics centers on the relationship between linguistic signs and their corresponding semantics: is there an arbitrary relationship between a word form and its meaning, or does some systematic phenomenon pervade? For instance, does the character bigram \textit{gl} have any systematic relationship to the meaning of words like \textit{glisten}, \textit{gleam} and \textit{glow}? In this work, we offer a holistic quantification of the systematicity of the sign using mutual information and recurrent neural networks. We employ these in a data-driven and massively multilingual approach to the question, examining 106 languages. We find a statistically significant reduction in entropy when modeling a word form conditioned on its semantic representation. Encouragingly, we also recover well-attested English examples of systematic affixes. We conclude with the meta-point: Our approximate effect size (measured in bits) is quite small---despite some amount of systematicity between form and meaning, an arbitrary relationship and its resulting benefits dominate human language. △ Less

Submitted 26 July, 2019; v1 submitted 13 June, 2019; originally announced June 2019.

Comments: Accepted for publication at ACL 2019

arXiv:1905.06700 [pdf, other]

doi 10.1038/s41467-019-12943-7

Real-time 3D reconstruction from single-photon lidar data using plug-and-play point cloud denoisers

Authors: Julián Tachella, Yoann Altmann, Nicolas Mellado, Aongus McCarthy, Rachael Tobin, Gerald S. Buller, Jean-Yves Tourneret, Stephen McLaughlin

Abstract: Single-photon lidar has emerged as a prime candidate technology for depth imaging through challenging environments. Until now, a major limitation has been the significant amount of time required for the analysis of the recorded data. Here we show a new computational framework for real-time three-dimensional (3D) scene reconstruction from single-photon data. By combining statistical models with hig… ▽ More Single-photon lidar has emerged as a prime candidate technology for depth imaging through challenging environments. Until now, a major limitation has been the significant amount of time required for the analysis of the recorded data. Here we show a new computational framework for real-time three-dimensional (3D) scene reconstruction from single-photon data. By combining statistical models with highly scalable computational tools from the computer graphics community, we demonstrate 3D reconstruction of complex outdoor scenes with processing times of the order of 20 ms, where the lidar data was acquired in broad daylight from distances up to 320 metres. The proposed method can handle an unknown number of surfaces in each pixel, allowing for target detection and imaging through cluttered scenes. This enables robust, real-time target reconstruction of complex moving scenes, paving the way for single-photon lidar at video rates for practical 3D imaging applications. △ Less

Submitted 4 October, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

arXiv:1903.10092 [pdf, other]

doi 10.1007/978-3-030-36687-2_15

An Exact No Free Lunch Theorem for Community Detection

Authors: Arya D. McCarthy, Tongfei Chen, Seth Ebner

Abstract: A precondition for a No Free Lunch theorem is evaluation with a loss function which does not assume a priori superiority of some outputs over others. A previous result for community detection by Peel et al. (2017) relies on a mismatch between the loss function and the problem domain. The loss function computes an expectation over only a subset of the universe of possible outputs; thus, it is only… ▽ More A precondition for a No Free Lunch theorem is evaluation with a loss function which does not assume a priori superiority of some outputs over others. A previous result for community detection by Peel et al. (2017) relies on a mismatch between the loss function and the problem domain. The loss function computes an expectation over only a subset of the universe of possible outputs; thus, it is only asymptotically appropriate with respect to the problem size. By using the correct random model for the problem domain, we provide a stronger, exact No Free Lunch theorem for community detection. The claim generalizes to other set-partitioning tasks including core/periphery separation, $k$-clustering, and graph partitioning. Finally, we review the literature of proposed evaluation functions and identify functions which (perhaps with slight modifications) are compatible with an exact No Free Lunch theorem. △ Less

Submitted 24 March, 2019; originally announced March 2019.

Journal ref: Complex Networks and Their Applications VIII. COMPLEX NETWORKS 2019. Studies in Computational Intelligence, vol 881

arXiv:1901.01354 [pdf, other]

doi 10.1007/978-3-030-36687-2_14

Metrics matter in community detection

Authors: Arya D. McCarthy, Tongfei Chen, Rachel Rudinger, David W. Matula

Abstract: We present a critical evaluation of normalized mutual information (NMI) as an evaluation metric for community detection. NMI exaggerates the leximin method's performance on weak communities: Does leximin, in finding the trivial singletons clustering, truly outperform eight other community detection methods? Three NMI improvements from the literature are AMI, rrNMI, and cNMI. We show equivalences u… ▽ More We present a critical evaluation of normalized mutual information (NMI) as an evaluation metric for community detection. NMI exaggerates the leximin method's performance on weak communities: Does leximin, in finding the trivial singletons clustering, truly outperform eight other community detection methods? Three NMI improvements from the literature are AMI, rrNMI, and cNMI. We show equivalences under relevant random models, and for evaluating community detection, we advise one-sided AMI under the $\mathbb{M}_{\mathrm{all}}$ model (all partitions of $n$ nodes). This work seeks (1) to start a conversation on robust measurements, and (2) to advocate evaluations which do not give "free lunch". △ Less

Submitted 4 January, 2019; originally announced January 2019.

Journal ref: Complex Networks and Their Applications VIII. COMPLEX NETWORKS 2019. Studies in Computational Intelligence, vol 881

arXiv:1810.11633 [pdf, other]

doi 10.1137/18M1183972

Bayesian 3D Reconstruction of Complex Scenes from Single-Photon Lidar Data

Authors: Julián Tachella, Yoann Altmann, Ximing Ren, Aongus McCarthy, Gerald S. Buller, Jean-Yves Tourneret, Steve McLaughlin

Abstract: Light detection and ranging (Lidar) data can be used to capture the depth and intensity profile of a 3D scene. This modality relies on constructing, for each pixel, a histogram of time delays between emitted light pulses and detected photon arrivals. In a general setting, more than one surface can be observed in a single pixel. The problem of estimating the number of surfaces, their reflectivity a… ▽ More Light detection and ranging (Lidar) data can be used to capture the depth and intensity profile of a 3D scene. This modality relies on constructing, for each pixel, a histogram of time delays between emitted light pulses and detected photon arrivals. In a general setting, more than one surface can be observed in a single pixel. The problem of estimating the number of surfaces, their reflectivity and position becomes very challenging in the low-photon regime (which equates to short acquisition times) or relatively high background levels (i.e., strong ambient illumination). This paper presents a new approach to 3D reconstruction using single-photon, single-wavelength Lidar data, which is capable of identifying multiple surfaces in each pixel. Adopting a Bayesian approach, the 3D structure to be recovered is modelled as a marked point process and reversible jump Markov chain Monte Carlo (RJ-MCMC) moves are proposed to sample the posterior distribution of interest. In order to promote spatial correlation between points belonging to the same surface, we propose a prior that combines an area interaction process and a Strauss process. New RJ-MCMC dilation and erosion updates are presented to achieve an efficient exploration of the configuration space. To further reduce the computational load, we adopt a multiresolution approach, processing the data from a coarse to the finest scale. The experiments performed with synthetic and real data show that the algorithm obtains better reconstructions than other recently published optimization algorithms for lower execution times. △ Less

Submitted 27 October, 2018; originally announced October 2018.

Journal ref: SIAM Journal on Imaging Sciences 2019 12:1, 521-550

arXiv:1810.11101 [pdf, other]

UniMorph 2.0: Universal Morphology

Authors: Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina J. Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden

Abstract: The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema.… ▽ More The Universal Morphology UniMorph project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema. Additional supporting data and tools are also released on a per-language basis when available. UniMorph is based at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University in Baltimore, Maryland and is sponsored by the DARPA LORELEI program. This paper details advances made to the collection, annotation, and dissemination of project resources since the initial UniMorph release described at LREC 2016. lexical resources} } △ Less

Submitted 25 February, 2020; v1 submitted 25 October, 2018; originally announced October 2018.

Comments: LREC 2018

arXiv:1810.07125 [pdf, other]

The CoNLL--SIGMORPHON 2018 Shared Task: Universal Morphological Reinflection

Authors: Ryan Cotterell, Christo Kirov, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Arya D. McCarthy, Katharina Kann, Sabrina J. Mielke, Garrett Nicolai, Miikka Silfverberg, David Yarowsky, Jason Eisner, Mans Hulden

Abstract: The CoNLL--SIGMORPHON 2018 shared task on supervised learning of morphological generation featured data sets from 103 typologically diverse languages. Apart from extending the number of languages involved in earlier supervised tasks of generating inflected forms, this year the shared task also featured a new second task which asked participants to inflect words in sentential context, similar to a… ▽ More The CoNLL--SIGMORPHON 2018 shared task on supervised learning of morphological generation featured data sets from 103 typologically diverse languages. Apart from extending the number of languages involved in earlier supervised tasks of generating inflected forms, this year the shared task also featured a new second task which asked participants to inflect words in sentential context, similar to a cloze task. This second task featured seven languages. Task 1 received 27 submissions and task 2 received 6 submissions. Both tasks featured a low, medium, and high data condition. Nearly all submissions featured a neural component and built on highly-ranked systems from the earlier 2017 shared task. In the inflection task (task 1), 41 of the 52 languages present in last year's inflection task showed improvement by the best systems in the low-resource setting. The cloze task (task 2) proved to be difficult, and few submissions managed to consistently improve upon both a simple neural baseline system and a lemma-repeating baseline. △ Less

Submitted 25 February, 2020; v1 submitted 16 October, 2018; originally announced October 2018.

Comments: CoNLL 2018. arXiv admin note: text overlap with arXiv:1706.09031

arXiv:1810.06743 [pdf, other]

doi 10.18653/v1/W18-6011

Marrying Universal Dependencies and Universal Morphology

Authors: Arya D. McCarthy, Miikka Silfverberg, Ryan Cotterell, Mans Hulden, David Yarowsky

Abstract: The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. Wi… ▽ More The Universal Dependencies (UD) and Universal Morphology (UniMorph) projects each present schemata for annotating the morphosyntactic details of language. Each project also provides corpora of annotated text in many languages - UD at the token level and UniMorph at the type level. As each corpus is built by different annotators, language-specific decisions hinder the goal of universal schemata. With compatibility of tags, each project's annotations could be used to validate the other's. Additionally, the availability of both type- and token-level resources would be a boon to tasks such as parsing and homograph disambiguation. To ease this interoperability, we present a deterministic map** from Universal Dependencies v2 features into the UniMorph schema. We validate our approach by lookup in the UniMorph corpora and find a macro-average of 64.13% recall. We also note incompatibilities due to paucity of data on either side. Finally, we present a critical evaluation of the foundations, strengths, and weaknesses of the two annotation projects. △ Less

Submitted 15 October, 2018; originally announced October 2018.

Comments: UDW18

Journal ref: Proceedings of the Second Workshop on Universal Dependencies (2018) 91-101

arXiv:1809.05218 [pdf, other]

doi 10.18653/v1/W18-6313

Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation

Authors: Brian Thompson, Huda Khayrallah, Antonios Anastasopoulos, Arya D. McCarthy, Kevin Duh, Rebecca Marvin, Paul McNamee, Jeremy Gwinnup, Tim Anderson, Philipp Koehn

Abstract: To better understand the effectiveness of continued training, we analyze the major components of a neural machine translation system (the encoder, decoder, and each embedding space) and consider each component's contribution to, and capacity for, domain adaptation. We find that freezing any single component during continued training has minimal impact on performance, and that performance is surpri… ▽ More To better understand the effectiveness of continued training, we analyze the major components of a neural machine translation system (the encoder, decoder, and each embedding space) and consider each component's contribution to, and capacity for, domain adaptation. We find that freezing any single component during continued training has minimal impact on performance, and that performance is surprisingly good when a single component is adapted while holding the rest of the model fixed. We also find that continued training does not move the model very far from the out-of-domain model, compared to a sensitivity analysis metric, suggesting that the out-of-domain model can provide a good generic initialization for the new domain. △ Less

Submitted 15 January, 2019; v1 submitted 13 September, 2018; originally announced September 2018.

Comments: presented at WMT 2018. Please cite using the bib entry from here: http://www.statmt.org/wmt18/bib/WMT013.bib

Journal ref: Proceedings of the Third Conference on Machine Translation: Research Papers (2018) 124-132

arXiv:1805.04755 [pdf, other]

A Simple and Effective Model-Based Variable Importance Measure

Authors: Brandon M. Greenwell, Bradley C. Boehmke, Andrew J. McCarthy

Abstract: In the era of "big data", it is becoming more of a challenge to not only build state-of-the-art predictive models, but also gain an understanding of what's really going on in the data. For example, it is often of interest to know which, if any, of the predictors in a fitted model are relatively influential on the predicted outcome. Some modern algorithms---like random forests and gradient boosted… ▽ More In the era of "big data", it is becoming more of a challenge to not only build state-of-the-art predictive models, but also gain an understanding of what's really going on in the data. For example, it is often of interest to know which, if any, of the predictors in a fitted model are relatively influential on the predicted outcome. Some modern algorithms---like random forests and gradient boosted decision trees---have a natural way of quantifying the importance or relative influence of each feature. Other algorithms---like naive Bayes classifiers and support vector machines---are not capable of doing so and model-free approaches are generally used to measure each predictor's importance. In this paper, we propose a standardized, model-based approach to measuring predictor importance across the growing spectrum of supervised learning algorithms. Our proposed method is illustrated through both simulated and real data examples. The R code to reproduce all of the figures in this paper is available in the supplementary materials. △ Less

Submitted 12 May, 2018; originally announced May 2018.

arXiv:1803.02903 [pdf, other]

doi 10.1016/j.nima.2018.07.028

Nuclear-recoil energy scale in CDMS II silicon dark-matter detectors

Authors: R. Agnese, A. J. Anderson, T. Aramaki, W. Baker, D. Balakishiyeva, S. Banik, D. Barker, R. Basu Thakur, D. A. Bauer, T. Binder, A. Borgland, M. A. Bowles, P. L. Brink, R. Bunker, B. Cabrera, D. O. Caldwell, R. Calkins, C. Cartaro, D. G. Cerdeno, H. Chagani, Y. -Y. Chang, Y. Chen, J. Cooley, B. Cornell, P. Cushman , et al. (84 additional authors not shown)

Abstract: The Cryogenic Dark Matter Search (CDMS II) experiment aims to detect dark matter particles that elastically scatter from nuclei in semiconductor detectors. The resulting nuclear-recoil energy depositions are detected by ionization and phonon sensors. Neutrons produce a similar spectrum of low-energy nuclear recoils in such detectors, while most other backgrounds produce electron recoils. The absol… ▽ More The Cryogenic Dark Matter Search (CDMS II) experiment aims to detect dark matter particles that elastically scatter from nuclei in semiconductor detectors. The resulting nuclear-recoil energy depositions are detected by ionization and phonon sensors. Neutrons produce a similar spectrum of low-energy nuclear recoils in such detectors, while most other backgrounds produce electron recoils. The absolute energy scale for nuclear recoils is necessary to interpret results correctly. The energy scale can be determined in CDMS II silicon detectors using neutrons incident from a broad-spectrum $^{252}$Cf source, taking advantage of a prominent resonance in the neutron elastic scattering cross section of silicon at a recoil (neutron) energy near 20 (182) keV. Results indicate that the phonon collection efficiency for nuclear recoils is $4.8^{+0.7}_{-0.9}$% lower than for electron recoils of the same energy. Comparisons of the ionization signals for nuclear recoils to those measured previously by other groups at higher electric fields indicate that the ionization collection efficiency for CDMS II silicon detectors operated at $\sim$4 V/cm is consistent with 100% for nuclear recoils below 20 keV and gradually decreases for larger energies to $\sim$75% at 100 keV. The impact of these measurements on previously published CDMS II silicon results is small. △ Less

Submitted 27 July, 2018; v1 submitted 7 March, 2018; originally announced March 2018.

Comments: 22 pages, 17 figures, 1 table, 1 appendix

arXiv:1801.08428 [pdf, other]

Discrete projective minimal surfaces

Authors: A. McCarthy, W. K. Schief

Abstract: We propose a natural discretisation scheme for classical projective minimal surfaces. We follow the classical geometric characterisation and classification of projective minimal surfaces and introduce at each step canonical discrete models of the associated geometric notions and objects. Thus, we introduce discrete analogues of classical Lie quadrics and their envelopes and classify discrete proje… ▽ More We propose a natural discretisation scheme for classical projective minimal surfaces. We follow the classical geometric characterisation and classification of projective minimal surfaces and introduce at each step canonical discrete models of the associated geometric notions and objects. Thus, we introduce discrete analogues of classical Lie quadrics and their envelopes and classify discrete projective minimal surfaces according to the cardinality of the class of envelopes. This leads to discrete versions of Godeaux-Rozet, Demoulin and Tzitzeica surfaces. The latter class of surfaces requires the introduction of certain discrete line congruences which may also be employed in the classification of discrete projective minimal surfaces. The classification scheme is based on the notion of discrete surfaces which are in asymptotic correspondence. In this context, we set down a discrete analogue of a classical theorem which states that an envelope (of the Lie quadrics) of a surface is in asymptotic correspondence with the surface if and only if the surface is either projective minimal or a Q surface. Accordingly, we present a geometric definition of discrete Q surfaces and their relatives, namely discrete counterparts of classical semi-Q, complex, doubly Q and doubly complex surfaces. △ Less

Submitted 25 January, 2018; originally announced January 2018.

arXiv:1712.03353 [pdf, other]

Variational Inference over Non-differentiable Cardiac Simulators using Bayesian Optimization

Authors: Adam McCarthy, Blanca Rodriguez, Ana Minchole

Abstract: Performing inference over simulators is generally intractable as their runtime means we cannot compute a marginal likelihood. We develop a likelihood-free inference method to infer parameters for a cardiac simulator, which replicates electrical flow through the heart to the body surface. We improve the fit of a state-of-the-art simulator to an electrocardiogram (ECG) recorded from a real patient. Performing inference over simulators is generally intractable as their runtime means we cannot compute a marginal likelihood. We develop a likelihood-free inference method to infer parameters for a cardiac simulator, which replicates electrical flow through the heart to the body surface. We improve the fit of a state-of-the-art simulator to an electrocardiogram (ECG) recorded from a real patient. △ Less

Submitted 9 December, 2017; originally announced December 2017.

Comments: Workshops on Deep Learning for Physical Sciences and Machine Learning 4 Health, NIPS 2017

arXiv:1612.00662 [pdf, other]

Predicting Patient State-of-Health using Sliding Window and Recurrent Classifiers

Authors: Adam McCarthy, Christopher K. I. Williams

Abstract: Bedside monitors in Intensive Care Units (ICUs) frequently sound incorrectly, slowing response times and desensitising nurses to alarms (Chambrin, 2001), causing true alarms to be missed (Hug et al., 2011). We compare sliding window predictors with recurrent predictors to classify patient state-of-health from ICU multivariate time series; we report slightly improved performance for the RNN for thr… ▽ More Bedside monitors in Intensive Care Units (ICUs) frequently sound incorrectly, slowing response times and desensitising nurses to alarms (Chambrin, 2001), causing true alarms to be missed (Hug et al., 2011). We compare sliding window predictors with recurrent predictors to classify patient state-of-health from ICU multivariate time series; we report slightly improved performance for the RNN for three out of four targets. △ Less

Submitted 2 December, 2016; originally announced December 2016.

Comments: NIPS 2016 Workshop on Machine Learning for Health

arXiv:1610.04107 [pdf, other]

Robust spectral unmixing of sparse multispectral Lidar waveforms using gamma Markov random fields

Authors: Yoann Altmann, Aurora Maccarone, Aongus McCarthy, Gregory Newstadt, Gerald S. Buller, Steve McLaughlin, Alfred Hero

Abstract: This paper presents a new Bayesian spectral unmixing algorithm to analyse remote scenes sensed via sparse multispectral Lidar measurements. To a first approximation, in the presence of a target, each Lidar waveform consists of a main peak, whose position depends on the target distance and whose amplitude depends on the wavelength of the laser source considered (i.e, on the target reflectivity). Be… ▽ More This paper presents a new Bayesian spectral unmixing algorithm to analyse remote scenes sensed via sparse multispectral Lidar measurements. To a first approximation, in the presence of a target, each Lidar waveform consists of a main peak, whose position depends on the target distance and whose amplitude depends on the wavelength of the laser source considered (i.e, on the target reflectivity). Besides, these temporal responses are usually assumed to be corrupted by Poisson noise in the low photon count regime. When considering multiple wavelengths, it becomes possible to use spectral information in order to identify and quantify the main materials in the scene, in addition to estimation of the Lidar-based range profiles. Due to its anomaly detection capability, the proposed hierarchical Bayesian model, coupled with an efficient Markov chain Monte Carlo algorithm, allows robust estimation of depth images together with abundance and outlier maps associated with the observed 3D scene. The proposed methodology is illustrated via experiments conducted with real multispectral Lidar data acquired in a controlled environment. The results demonstrate the possibility to unmix spectral responses constructed from extremely sparse photon counts (less than 10 photons per pixel and band). △ Less

Submitted 13 June, 2017; v1 submitted 13 October, 2016; originally announced October 2016.

arXiv:1608.06143 [pdf, other]

Object Depth Profile and Reflectivity Restoration from Sparse Single-Photon Data Acquired in Underwater Environments

Authors: Abderrahim Halimi, Aurora Maccarone, Aongus McCarthy, Steve McLaughlin, Gerald S. Buller

Abstract: This paper presents two new algorithms for the joint restoration of depth and reflectivity (DR) images constructed from time-correlated single-photon counting (TCSPC) measurements. Two extreme cases are considered: (i) a reduced acquisition time that leads to very low photon counts and (ii) a highly attenuating environment (such as a turbid medium) which makes the reflectivity estimation more diff… ▽ More This paper presents two new algorithms for the joint restoration of depth and reflectivity (DR) images constructed from time-correlated single-photon counting (TCSPC) measurements. Two extreme cases are considered: (i) a reduced acquisition time that leads to very low photon counts and (ii) a highly attenuating environment (such as a turbid medium) which makes the reflectivity estimation more difficult at increasing range. Adopting a Bayesian approach, the Poisson distributed observations are combined with prior distributions about the parameters of interest, to build the joint posterior distribution. More precisely, two Markov random field (MRF) priors enforcing spatial correlations are assigned to the DR images. Under some justified assumptions, the restoration problem (regularized likelihood) reduces to a convex formulation with respect to each of the parameters of interest. This problem is first solved using an adaptive Markov chain Monte Carlo (MCMC) algorithm that approximates the minimum mean square parameter estimators. This algorithm is fully automatic since it adjusts the parameters of the MRFs by maximum marginal likelihood estimation. However, the MCMC-based algorithm exhibits a relatively long computational time. The second algorithm deals with this issue and is based on a coordinate descent algorithm. Results on single-photon depth data from laboratory based underwater measurements demonstrate the benefit of the proposed strategy that improves the quality of the estimated DR images. △ Less

Submitted 22 August, 2016; originally announced August 2016.

arXiv:1601.06149 [pdf, other]

Robust Bayesian target detection algorithm for depth imaging from sparse single-photon data

Authors: Yoann Altmann, Ximing Ren, Aongus McCarthy, Gerald S. Buller, Steve McLaughlin

Abstract: This paper presents a new Bayesian model and associated algorithm for depth and intensity profiling using full waveforms from time-correlated single-photon counting (TCSPC) measurements in the limit of very low photon counts (i.e., typically less than 20 photons per pixel). The model represents each Lidar waveform as an unknown constant background level, which is combined in the presence of a targ… ▽ More This paper presents a new Bayesian model and associated algorithm for depth and intensity profiling using full waveforms from time-correlated single-photon counting (TCSPC) measurements in the limit of very low photon counts (i.e., typically less than 20 photons per pixel). The model represents each Lidar waveform as an unknown constant background level, which is combined in the presence of a target, to a known impulse response weighted by the target intensity and finally corrupted by Poisson noise. The joint target detection and depth imaging problem is expressed as a pixel-wise model selection and estimation problem which is solved using Bayesian inference. Prior knowledge about the problem is embedded in a hierarchical model that describes the dependence structure between the model parameters while accounting for their constraints. In particular, Markov random fields (MRFs) are used to model the joint distribution of the background levels and of the target presence labels, which are both expected to exhibit significant spatial correlations. An adaptive Markov chain Monte Carlo algorithm including reversible-jump updates is then proposed to compute the Bayesian estimates of interest. This algorithm is equipped with a stochastic optimization adaptation mechanism that automatically adjusts the parameters of the MRFs by maximum marginal likelihood estimation. Finally, the benefits of the proposed methodology are demonstrated through a series of experiments using real data. △ Less

Submitted 13 October, 2016; v1 submitted 21 January, 2016; originally announced January 2016.

Comments: arXiv admin note: text overlap with arXiv:1507.02511

arXiv:1507.02511 [pdf, other]

doi 10.1109/TIP.2016.2526784

Lidar waveform based analysis of depth images constructed using sparse single-photon data

Authors: Yoann Altmann, Ximing Ren, Aongus McCarthy, Gerald S. Buller, Steve McLaughlin

Abstract: This paper presents a new Bayesian model and algorithm used for depth and intensity profiling using full waveforms from the time-correlated single photon counting (TCSPC) measurement in the limit of very low photon counts. The model proposed represents each Lidar waveform as a combination of a known impulse response, weighted by the target intensity, and an unknown constant background, corrupted b… ▽ More This paper presents a new Bayesian model and algorithm used for depth and intensity profiling using full waveforms from the time-correlated single photon counting (TCSPC) measurement in the limit of very low photon counts. The model proposed represents each Lidar waveform as a combination of a known impulse response, weighted by the target intensity, and an unknown constant background, corrupted by Poisson noise. Prior knowledge about the problem is embedded in a hierarchical model that describes the dependence structure between the model parameters and their constraints. In particular, a gamma Markov random field (MRF) is used to model the joint distribution of the target intensity, and a second MRF is used to model the distribution of the target depth, which are both expected to exhibit significant spatial correlations. An adaptive Markov chain Monte Carlo algorithm is then proposed to compute the Bayesian estimates of interest and perform Bayesian inference. This algorithm is equipped with a stochastic optimization adaptation mechanism that automatically adjusts the parameters of the MRFs by maximum marginal likelihood estimation. Finally, the benefits of the proposed methodology are demonstrated through a serie of experiments using real data. △ Less

Submitted 9 July, 2015; originally announced July 2015.

arXiv:1504.05871 [pdf, other]

doi 10.1103/PhysRevD.92.072003

Improved WIMP-search reach of the CDMS II germanium data

Authors: R. Agnese, A. J. Anderson, M. Asai, D. Balakishiyeva, D. Barker, R. Basu Thakur, D. A. Bauer, J. Billard, A. Borgland, M. A. Bowles, D. Brandt, P. L. Brink, R. Bunker, B. Cabrera, D. O. Caldwell, R. Calkins, D. G. Cerdeño, H. Chagani, Y. Chen, J. Cooley, B. Cornell, C. H. Crewdson, P. Cushman, M. Daal, P. C. F. Di Stefano , et al. (64 additional authors not shown)

Abstract: CDMS II data from the 5-tower runs at the Soudan Underground Laboratory were reprocessed with an improved charge-pulse fitting algorithm. Two new analysis techniques to reject surface-event backgrounds were applied to the 612 kg days germanium-detector WIMP-search exposure. An extended analysis was also completed by decreasing the 10 keV analysis threshold to $\sim$5 keV, to increase sensitivity n… ▽ More CDMS II data from the 5-tower runs at the Soudan Underground Laboratory were reprocessed with an improved charge-pulse fitting algorithm. Two new analysis techniques to reject surface-event backgrounds were applied to the 612 kg days germanium-detector WIMP-search exposure. An extended analysis was also completed by decreasing the 10 keV analysis threshold to $\sim$5 keV, to increase sensitivity near a WIMP mass of 8 GeV/$c^2$. After unblinding, there were zero candidate events above a deposited energy of 10 keV and 6 events in the lower-threshold analysis. This yielded minimum WIMP-nucleon spin-independent scattering cross-section limits of $1.8 \times 10^{-44}$ and $1.18 \times 10 ^{-41}$ cm$^2$ at 90\% confidence for 60 and 8.6 GeV/$c^2$ WIMPs, respectively. This improves the previous CDMS II result by a factor of 2.4 (2.7) for 60 (8.6) GeV/$c^2$ WIMPs. △ Less

Submitted 13 October, 2015; v1 submitted 22 April, 2015; originally announced April 2015.

Comments: 25 pages, 15 figures, slightly updated organization and text consistent with PRD referee process, Fig. 14 updated

Report number: IPPP/15/24, DCTP/15/48

Journal ref: Phys. Rev. D 92, 072003 (2015)

arXiv:1503.09147 [pdf]

doi 10.1186/s12936-015-0751-y

Characterization of the infectious reservoir of malaria with an agent-based model calibrated to age-stratified parasite densities and infectiousness

Authors: Jaline Gerardin, Andre Lin Ouedraogo, Kevin A. McCarthy, Bocar Kouyate, Philip A. Eckhoff, Edward A. Wenger

Abstract: Background Elimination of malaria can only be achieved through removal of all vectors or complete depletion of the infectious reservoir in humans. Mechanistic models can be built to synthesize diverse observations from the field collected under a variety of conditions and subsequently used to query the infectious reservoir in great detail. Methods The EMOD model of malaria transmission was calibra… ▽ More Background Elimination of malaria can only be achieved through removal of all vectors or complete depletion of the infectious reservoir in humans. Mechanistic models can be built to synthesize diverse observations from the field collected under a variety of conditions and subsequently used to query the infectious reservoir in great detail. Methods The EMOD model of malaria transmission was calibrated to prevalence, incidence, asexual parasite density, gametocyte density, infection duration, and infectiousness data from 9 study sites. The infectious reservoir was characterized by diagnostic detection limit and age group over a range of transmission intensities with and without case management and vector control. Mass screen-and-treat drug campaigns were tested for likelihood of achieving elimination. Results The composition of the infectious reservoir by diagnostic threshold is similar over a range of transmission intensities, and higher intensity settings are biased toward infections in children. Recent ramp-ups in case management and use of insecticide-treated bednets reduce the infectious reservoir and shift the composition toward submicroscopic infections. Mass campaigns with antimalarial drugs are highly effective at interrupting transmission if deployed shortly after ITN campaigns. Conclusions Low density infections comprise a substantial portion of the infectious reservoir. Proper timing of vector control, seasonal variation in transmission intensity, and mass drug campaigns allows lingering population immunity to help drive a region toward elimination. △ Less

Submitted 31 March, 2015; originally announced March 2015.

Comments: submitted to Malaria Journal on March 31, 2015

Journal ref: Malaria Journal 2015, 14:231

arXiv:1503.03379 [pdf, other]

doi 10.1103/PhysRevD.91.092004

Dark matter effective field theory scattering in direct detection experiments

Authors: K. Schneck, B. Cabrera, D. G. Cerdeno, V. Mandic, H. E. Rogers, R. Agnese, A. J. Anderson, M. Asai, D. Balakishiyeva, D. Barker, R. Basu Thakur, D. A. Bauer, J. Billard, A. Borgland, D. Brandt, P. L. Brink, R. Bunker, D. O. Caldwell, R. Calkins, H. Chagani, Y. Chen, J. Cooley, B. Cornell, C. H. Crewdson, P. Cushman , et al. (62 additional authors not shown)

Abstract: We examine the consequences of the effective field theory (EFT) of dark matter-nucleon scattering for current and proposed direct detection experiments. Exclusion limits on EFT coupling constants computed using the optimum interval method are presented for SuperCDMS Soudan, CDMS II, and LUX, and the necessity of combining results from multiple experiments in order to determine dark matter paramete… ▽ More We examine the consequences of the effective field theory (EFT) of dark matter-nucleon scattering for current and proposed direct detection experiments. Exclusion limits on EFT coupling constants computed using the optimum interval method are presented for SuperCDMS Soudan, CDMS II, and LUX, and the necessity of combining results from multiple experiments in order to determine dark matter parameters is discussed. We demonstrate that spectral differences between the standard dark matter model and a general EFT interaction can produce a bias when calculating exclusion limits and when develo** signal models for likelihood and machine learning techniques. We also discuss the implications of the EFT for the next-generation (G2) direct detection experiments and point out regions of complementarity in the EFT parameter space. △ Less

Submitted 16 August, 2016; v1 submitted 11 March, 2015; originally announced March 2015.

Comments: Newest version includes erratum

Journal ref: Phys. Rev. D 91, 092004 (2015)

Showing 1–50 of 86 results for author: McCarthy, A