Skip to main content

Showing 1–29 of 29 results for author: Hyland, L

.
  1. arXiv:2406.04449  [pdf, other

    cs.CL cs.CV

    MAIRA-2: Grounded Radiology Report Generation

    Authors: Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Anton Schwaighofer, Sam Bond-Taylor, Maximilian Ilse, Fernando Pérez-García, Valentina Salvatelli, Harshita Sharma, Felix Meissen, Mercy Ranjit, Shaury Srivastav, Julia Gong, Fabian Falck, Ozan Oktay, Anja Thieme, Matthew P. Lungren, Maria Teodora Wetscherek, Javier Alvarez-Valle, Stephanie L. Hyland

    Abstract: Radiology reporting is a complex task that requires detailed image understanding, integration of multiple inputs, including comparison with prior imaging, and precise language generation. This makes it ideal for the development and use of generative multimodal models. Here, we extend report generation to include the localisation of individual findings on the image - a task we call grounded report… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 44 pages, 20 figures

  2. arXiv:2406.02660  [pdf, other

    astro-ph.GA astro-ph.HE

    First VLBI detection of Fornax A

    Authors: G. F. Paraschos, M. Wielgus, P. Benke, V. Mpisketzis, F. Rösch, K. Dasyra, E. Ros, M. Kadler, R. Ojha, P. G. Edwards, L. Hyland, J. F. H. Quick, S. Weston

    Abstract: Radio galaxies harbouring jetted active galactic nuclei are a frequent target of very-long-baseline interferometry (VLBI) because they play an essential role in exploring how jets form and propagate. Hence, only few have not been detected with VLBI yet; Fornax A is one of the most famous examples. Here we present the first detection of the compact core region of Fornax A with VLBI. At 8.4 GHz the… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 4 pages, 2 figures, accepted for publication in A&A

    Journal ref: A&A 687, L6 (2024)

  3. arXiv:2405.12370  [pdf, other

    astro-ph.HE

    Swift J1727.8-1613 has the Largest Resolved Continuous Jet Ever Seen in an X-ray Binary

    Authors: Callan M. Wood, James C. A. Miller-Jones, Arash Bahramian, Steven J. Tingay, Steve Prabu, Thomas D. Russell, Pikky Atri, Francesco Carotenuto, Diego Altamirano, Sara E. Motta, Lucas Hyland, Cormac Reynolds, Stuart Weston, Rob Fender, Elmar Körding, Dipankar Maitra, Sera Markoff, Simone Migliari, David M. Russell, Craig L. Sarazin, Gregory R. Sivakoff, Roberto Soria, Alexandra J. Tetarenko, Valeriu Tudose

    Abstract: Multi-wavelength polarimetry and radio observations of Swift J1727.8-1613 at the beginning of its recent 2023 outburst suggested the presence of a bright compact jet aligned in the north-south direction, which could not be confirmed without high angular resolution images. Using the Very Long Baseline Array and the Long Baseline Array, we imaged Swift J1727.8-1613, during the hard/hard-intermediate… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Submitted to ApJL

  4. arXiv:2405.05299  [pdf, other

    cs.HC cs.AI

    Challenges for Responsible AI Design and Workflow Integration in Healthcare: A Case Study of Automatic Feeding Tube Qualification in Radiology

    Authors: Anja Thieme, Abhijith Rajamohan, Benjamin Cooper, Heather Groombridge, Robert Simister, Barney Wong, Nicholas Woznitza, Mark Ames Pinnock, Maria Teodora Wetscherek, Cecily Morrison, Hannah Richardson, Fernando Pérez-García, Stephanie L. Hyland, Shruthi Bannur, Daniel C. Castro, Kenza Bouzid, Anton Schwaighofer, Mercy Ranjit, Harshita Sharma, Matthew P. Lungren, Ozan Oktay, Javier Alvarez-Valle, Aditya Nori, Stephen Harris, Joseph Jacob

    Abstract: Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delay… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    ACM Class: H.5.m; I.2.m

  5. Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

    Authors: Nur Yildirim, Hannah Richardson, Maria T. Wetscherek, Junaid Bajwa, Joseph Jacob, Mark A. Pinnock, Stephen Harris, Daniel Coelho de Castro, Shruthi Bannur, Stephanie L. Hyland, Pratik Ghosh, Mercy Ranjit, Kenza Bouzid, Anton Schwaighofer, Fernando Pérez-García, Harshita Sharma, Ozan Oktay, Matthew Lungren, Javier Alvarez-Valle, Aditya Nori, Anja Thieme

    Abstract: Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient's medical image, or answering visual que… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: to appear at CHI 2024

  6. arXiv:2401.10815  [pdf, other

    cs.CV

    RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision

    Authors: Fernando Pérez-García, Harshita Sharma, Sam Bond-Taylor, Kenza Bouzid, Valentina Salvatelli, Maximilian Ilse, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Matthew P. Lungren, Maria Wetscherek, Noel Codella, Stephanie L. Hyland, Javier Alvarez-Valle, Ozan Oktay

    Abstract: Language-supervised pre-training has proven to be a valuable method for extracting semantically meaningful features from images, serving as a foundational element in multimodal systems within the computer vision and medical imaging domains. However, resulting features are limited by the information contained within the text. This is particularly problematic in medical imaging, where radiologists'… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  7. arXiv:2311.13668  [pdf, other

    cs.CL cs.AI cs.CV

    MAIRA-1: A specialised large multimodal model for radiology report generation

    Authors: Stephanie L. Hyland, Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Mercy Ranjit, Anton Schwaighofer, Fernando Pérez-García, Valentina Salvatelli, Shaury Srivastav, Anja Thieme, Noel Codella, Matthew P. Lungren, Maria Teodora Wetscherek, Ozan Oktay, Javier Alvarez-Valle

    Abstract: We present a radiology-specific multimodal model for the task for generating radiological reports from chest X-rays (CXRs). Our work builds on the idea that large language model(s) can be equipped with multimodal capabilities through alignment with pre-trained vision encoders. On natural images, this has been shown to allow multimodal models to gain image understanding and description capabilities… ▽ More

    Submitted 26 April, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 18 pages, 9 tables, 5 figures. v2 adds test IDs and image encoder citation. v3 fixes error in NPV/specificity

  8. TANAMI: Tracking Active Galactic Nuclei with Austral Milliarcsecond Interferometry. III. First-epoch S band images

    Authors: Petra Benke, Florian Rösch, Eduardo Ros, Matthias Kadler, Roopesh Ojha, Philip G. Edwards, Shinji Horiuchi, Lucas J. Hyland, Chris Phillips, Jonathan F. H. Quick, Jamie Stevens, Anastasios K. Tzioumis, Stuart Weston

    Abstract: With the emergence of very high energy astronomy (VHE; E>100 GeV), new open questions were presented to astronomers studying the multi-wavelength emission from blazars. Answers to these open questions, such as the Doppler crisis, and finding the location of the high-energy activity have eluded us thus far. Recently, quasi-simultaneous multi-wavelength monitoring programs have shown considerable su… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Journal ref: A&A 681, A69 (2024)

  9. arXiv:2304.14740  [pdf, other

    astro-ph.SR astro-ph.GA

    A Keplerian disk with a four-arm spiral birthing an episodically accreting high-mass protostar

    Authors: R. A. Burns, Y. Uno, N. Sakai, J. Blanchard, Z. Rosli, G. Orosz, Y. Yonekura, Y. Tanabe, K. Sugiyama, T. Hirota, Kee-Tae Kim, A. Aberfelds, A. E. Volvach, A. Bartkiewicz, A. Caratti o Garatti, A. M. Sobolev, B. Stecklum, C. Brogan, C. Phillips, D. A. Ladeyschikov, D. Johnstone, G. Surcis, G. C. MacLeod, H. Linz, J. O. Chibueze , et al. (12 additional authors not shown)

    Abstract: High-mass protostars (M$_{\star} >$ 8 M$_{\odot}$) are thought to gain the majority of their mass via short, intense bursts of growth. This episodic accretion is thought to be facilitated by gravitationally unstable and subsequently inhomogeneous accretion disks. Limitations of observational capabilities, paired with a lack of observed accretion burst events has withheld affirmative confirmation o… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Published in Nature Astronomy in 2023

  10. arXiv:2304.14739  [pdf, other

    astro-ph.SR astro-ph.GA

    A heat-wave of accretion energy traced by masers in the G358-MM1 high-mass protostar

    Authors: R. A. Burns, K. Sugiyama, T. Hirota, Kee-Tae Kim, A. M. Sobolev, B. Stecklum, G. C. MacLeod, Y. Yonekura, M. Olech, G. Orosz, S. P. Ellingsen, L. Hyland, A. Caratti o Garatti, C. Brogan, T. R. Hunter, C. Phillips, S. P. van den Heever, J. Eislöffel, H. Linz, G. Surcis, J. O. Chibueze, W. Baan, B. Kramer

    Abstract: High-mass stars are thought to accumulate much of their mass via short, infrequent bursts of disk-aided accretion. Such accretion events are rare and difficult to observe directly but are known to drive enhanced maser emission. In this Letter we report high-resolution, multi-epoch methanol maser observations toward G358.93-0.03 which reveal an interesting phenomenon; the sub-luminal propagation of… ▽ More

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: Published in Nature Astronomy in 2020

  11. arXiv:2303.13386  [pdf, other

    cs.CL cs.LG

    Compositional Zero-Shot Domain Transfer with Text-to-Text Models

    Authors: Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland

    Abstract: Label scarcity is a bottleneck for improving task performance in specialised domains. We propose a novel compositional transfer learning framework (DoT5 - domain compositional zero-shot T5) for zero-shot domain transfer. Without access to in-domain labels, DoT5 jointly learns domain knowledge (from MLM of unlabelled in-domain free text) and task knowledge (from task training on more readily availa… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at TACL, pre-MIT Press publication version. 16 pages, 4 figures

  12. arXiv:2301.04558  [pdf, other

    cs.CV cs.CL

    Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

    Authors: Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Maximilian Ilse, Daniel C. Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anja Thieme, Anton Schwaighofer, Maria Wetscherek, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay

    Abstract: Self-supervised learning in vision-language processing exploits semantic alignment between imaging and text modalities. Prior work in biomedical VLP has mostly relied on the alignment of single image and report pairs even though clinical notes commonly refer to prior images. This does not only introduce poor alignment between the modalities but also a missed opportunity to exploit rich self-superv… ▽ More

    Submitted 16 March, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: To appear in CVPR 2023

  13. arXiv:2212.03555  [pdf, other

    astro-ph.GA astro-ph.IM

    Inverse MultiView II: Microarcsecond Trigonometric Parallaxes for Southern Hemisphere 6.7~GHz Methanol Masers G232.62+00.99 and G323.74$-$00.26

    Authors: Lucas J. Hyland, Mark J. Reid, Gabor Orosz, Simon P. Ellingsen, Stuart D. Weston, Jayendar Kumar, Richard Dodson, Maria J. Rioja, Warren J. Hankey, Patrick M. Yates-Jones, Tim Natusch, Sergei Gulyaev, Karl M. Menten, Andreas Brunthaler

    Abstract: We present the first results from the Southern Hemisphere Parallax Interferometric Radio Astrometry Legacy Survey (\spirals): $10μ$as-accurate parallaxes and proper motions for two southern hemisphere 6.7 GHz methanol masers obtained using the inverse MultiView calibration method. Using an array of radio telescopes in Australia and New Zealand, we measured the trigonometric parallax and proper mot… ▽ More

    Submitted 16 May, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: 13 pages, 9 figures, 3 tables. Accepted for publication in ApJ

  14. arXiv:2205.13398  [pdf, other

    cs.LG

    Looking for Out-of-Distribution Environments in Multi-center Critical Care Data

    Authors: Dimitris Spathis, Stephanie L. Hyland

    Abstract: Clinical machine learning models show a significant performance drop when tested in settings not seen during training. Domain generalisation models promise to alleviate this problem, however, there is still scepticism about whether they improve over traditional training. In this work, we take a principled approach to identifying Out of Distribution (OoD) environments, motivated by the problem of c… ▽ More

    Submitted 11 November, 2022; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 17 pages

  15. arXiv:2205.00092  [pdf, other

    astro-ph.IM astro-ph.GA

    Inverse Multview I: Multi-Calibrator inverse phase referencing for Microarcsecond VLBI Astrometry

    Authors: Lucas J. Hyland, Mark J. Reid, Simon P. Ellingsen, Maria J. Rioja, Richard Dodson, Gabor Orosz, Colin R. Masson, Jamie M. McCallum

    Abstract: Very Long Baseline Interferometry (VLBI) astrometry is a well established technique for achieving $\pm10~μ$as parallax accuracies at frequencies well above 10~GHz. At lower frequencies, uncompensated interferometer delays associated with the ionosphere play the dominant role in limiting the astrometric accuracy. Multiview is a novel VLBI calibration method, which uses observations of multiple quas… ▽ More

    Submitted 13 February, 2023; v1 submitted 29 April, 2022; originally announced May 2022.

    Comments: 11 pages, 5 figures

    Journal ref: 2022 ApJ 932 52

  16. Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing

    Authors: Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie Hyland, Maria Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez-Valle, Hoifung Poon, Ozan Oktay

    Abstract: Multi-modal data abounds in biomedicine, such as radiology images and reports. Interpreting this data at scale is essential for improving clinical care and accelerating clinical research. Biomedical text with its complex semantics poses additional challenges in vision--language modelling compared to the general domain, and previous work has used insufficiently adapted models that lack domain-speci… ▽ More

    Submitted 21 July, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: To appear in ECCV 2022. Code: https://aka.ms/biovil-code Dataset: https://aka.ms/ms-cxr Demo Notebook: https://aka.ms/biovil-demo-notebook

    Journal ref: Computer Vision - ECCV 2022, LNCS vol 13696, pp 1-21

  17. arXiv:2110.09669  [pdf, ps, other

    astro-ph.GA astro-ph.SR

    Molecular line search toward the flaring 6.7-GHz methanol masers of G24.33+0.13 and G359.6-0.243: rare maser transitions detected

    Authors: Tiege McCarthy, Gabor Orosz, Simon Ellingsen, Shari Breen, Maxim Voronkov, Ross Burns, Mateusz Olech, Yoshinori Yonekura, Tomoya Hirota, Lucas Hyland, Pawel Wolak

    Abstract: We have performed a molecular line search toward the flaring 6.7-GHz masers G24.33+0.13 and G359.62-0.24 using the Australia Telescope Compact Array. We present spectra of the 6.7-GHz class~II methanol and 22.2-GHz water masers toward these sources and provide comparison with other recent flaring events these sources have experienced. We also detect the fourth example of a 23.4-GHz class~I methano… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: Accepted into MNRAS 2021 October 17. 10 pages, 4 figures and 3 tables

  18. arXiv:2105.05728  [pdf, other

    cs.LG stat.ML

    Early prediction of respiratory failure in the intensive care unit

    Authors: Matthias Hüser, Martin Faltys, Xinrui Lyu, Chris Barber, Stephanie L. Hyland, Tobias M. Merz, Gunnar Rätsch

    Abstract: The development of respiratory failure is common among patients in intensive care units (ICU). Large data quantities from ICU patient monitoring systems make timely and comprehensive analysis by clinicians difficult but are ideal for automatic processing by machine learning algorithms. Early prediction of respiratory system failure could alert clinicians to patients at risk of respiratory failure… ▽ More

    Submitted 12 May, 2021; originally announced May 2021.

    Comments: 14 pages, 5 figures

  19. arXiv:2011.11554   

    cs.LG

    ML4H Abstract Track 2020

    Authors: Emily Alsentzer, Matthew B. A. McDermott, Fabian Falck, Suproteem K. Sarkar, Subhrajit Roy, Stephanie L. Hyland

    Abstract: A collection of the accepted abstracts for the Machine Learning for Health (ML4H) workshop at NeurIPS 2020. This index is not complete, as some accepted abstracts chose to opt-out of inclusion.

    Submitted 19 November, 2020; originally announced November 2020.

  20. arXiv:1912.02919  [pdf, other

    cs.LG cs.CR stat.ML

    An Empirical Study on the Intrinsic Privacy of SGD

    Authors: Stephanie L. Hyland, Shruti Tople

    Abstract: Introducing noise in the training of machine learning systems is a powerful way to protect individual privacy via differential privacy guarantees, but comes at a cost to utility. This work looks at whether the inherent randomness of stochastic gradient descent (SGD) could contribute to privacy, effectively reducing the amount of \emph{additional} noise required to achieve a given privacy guarantee… ▽ More

    Submitted 28 February, 2022; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 21 pages, 11 figures, 8 tables

  21. arXiv:1904.12973  [pdf

    cs.LG cs.CL stat.AP stat.ML

    Unsupervised Extraction of Phenotypes from Cancer Clinical Notes for Association Studies

    Authors: Stefan G. Stark, Stephanie L. Hyland, Melanie F. Pradier, Kjong Lehmann, Andreas Wicki, Fernando Perez Cruz, Julia E. Vogt, Gunnar Rätsch

    Abstract: The recent adoption of Electronic Health Records (EHRs) by health care providers has introduced an important source of data that provides detailed and highly specific insights into patient phenotypes over large cohorts. These datasets, in combination with machine learning and statistical approaches, generate new opportunities for research and clinical care. However, many methods require the patien… ▽ More

    Submitted 3 May, 2019; v1 submitted 29 April, 2019; originally announced April 2019.

  22. arXiv:1904.07990  [pdf

    cs.LG stat.AP stat.ML

    Machine learning for early prediction of circulatory failure in the intensive care unit

    Authors: Stephanie L. Hyland, Martin Faltys, Matthias Hüser, Xinrui Lyu, Thomas Gumbsch, Cristóbal Esteban, Christian Bock, Max Horn, Michael Moor, Bastian Rieck, Marc Zimmermann, Dean Bodenham, Karsten Borgwardt, Gunnar Rätsch, Tobias M. Merz

    Abstract: Intensive care clinicians are presented with large quantities of patient information and measurements from a multitude of monitoring systems. The limited ability of humans to process such complex information hinders physicians to readily recognize and act on early signs of patient deterioration. We used machine learning to develop an early warning system for circulatory failure based on a high-res… ▽ More

    Submitted 19 April, 2019; v1 submitted 16 April, 2019; originally announced April 2019.

    Comments: 5 main figures, 1 main table, 13 supplementary figures, 5 supplementary tables; 250ppi images

  23. arXiv:1812.00490  [pdf, other

    cs.LG stat.ML

    Improving Clinical Predictions through Unsupervised Time Series Representation Learning

    Authors: Xinrui Lyu, Matthias Hueser, Stephanie L. Hyland, George Zerveas, Gunnar Raetsch

    Abstract: In this work, we investigate unsupervised representation learning on medical time series, which bears the promise of leveraging copious amounts of existing unlabeled data in order to eventually assist clinical decision making. By evaluating on the prediction of clinically relevant outcomes, we show that in a practical setting, unsupervised representation learning can offer clear performance benefi… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/171

  24. arXiv:1707.03963  [pdf, other

    astro-ph.GA astro-ph.SR

    MALT-45: A 7mm survey of the southern Galaxy - II. ATCA follow-up observations of 44GHz class I methanol masers

    Authors: Christopher H. Jordan, Andrew J. Walsh, Shari L. Breen, Simon P. Ellingsen, Maxim A. Voronkov, Lucas J. Hyland

    Abstract: We detail interferometric observations of 44GHz class I methanol masers detected by MALT-45 (a 7mm unbiased auto-correlated spectral-line Galactic-plane survey) using the Australia Telescope Compact Array. We detect 238 maser spots across 77 maser sites. Using high-resolution positions, we compare the class I CH$_3$OH masers to other star formation maser species, including CS (1-0), SiO $v=0$ and… ▽ More

    Submitted 12 July, 2017; originally announced July 2017.

    Comments: Accepted by MNRAS on 12 July 2017

  25. arXiv:1706.02633  [pdf, other

    stat.ML cs.LG

    Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs

    Authors: Cristóbal Esteban, Stephanie L. Hyland, Gunnar Rätsch

    Abstract: Generative Adversarial Networks (GANs) have shown remarkable success as a framework for training models to produce realistic-looking data. In this work, we propose a Recurrent GAN (RGAN) and Recurrent Conditional GAN (RCGAN) to produce realistic real-valued multi-dimensional time series, with an emphasis on their application to medical data. RGANs make use of recurrent neural networks in the gener… ▽ More

    Submitted 3 December, 2017; v1 submitted 8 June, 2017; originally announced June 2017.

    Comments: 13 pages, 4 figures, 3 tables (update with differential privacy)

  26. arXiv:1612.00467  [pdf, ps, other

    cs.CL

    Neural Document Embeddings for Intensive Care Patient Mortality Prediction

    Authors: Paulina Grnarova, Florian Schmidt, Stephanie L. Hyland, Carsten Eickhoff

    Abstract: We present an automatic mortality prediction scheme based on the unstructured textual content of clinical notes. Proposing a convolutional document embedding approach, our empirical investigation using the MIMIC-III intensive care database shows significant performance gains compared to previously employed methods such as latent topic distributions or generic doc2vec embeddings. These improvements… ▽ More

    Submitted 1 December, 2016; originally announced December 2016.

  27. arXiv:1607.04903  [pdf, other

    stat.ML cs.LG

    Learning Unitary Operators with Help From u(n)

    Authors: Stephanie L. Hyland, Gunnar Rätsch

    Abstract: A major challenge in the training of recurrent neural networks is the so-called vanishing or exploding gradient problem. The use of a norm-preserving transition operator can address this issue, but parametrization is challenging. In this work we focus on unitary operators and describe a parametrization using the Lie algebra $\mathfrak{u}(n)$ associated with the Lie group $U(n)$ of $n \times n$ uni… ▽ More

    Submitted 10 January, 2017; v1 submitted 17 July, 2016; originally announced July 2016.

    Comments: 9 pages, 3 figures, 5 figures inc. subfigures, to appear at AAAI-17

  28. arXiv:1602.03551  [pdf, other

    cs.CL stat.AP

    Knowledge Transfer with Medical Language Embeddings

    Authors: Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

    Abstract: Identifying relationships between concepts is a key aspect of scientific knowledge synthesis. Finding these links often requires a researcher to laboriously search through scien- tific papers and databases, as the size of these resources grows ever larger. In this paper we describe how distributional semantics can be used to unify structured knowledge graphs with unstructured text to predict new r… ▽ More

    Submitted 10 February, 2016; originally announced February 2016.

    Comments: 6 pages, 2 figures, to appear at SDM-DMMH 2016

  29. arXiv:1510.00259  [pdf, other

    cs.CL cs.LG stat.ML

    A Generative Model of Words and Relationships from Multiple Sources

    Authors: Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

    Abstract: Neural language models are a powerful tool to embed words into semantic vector spaces. However, learning such models generally relies on the availability of abundant and diverse training examples. In highly specialised domains this requirement may not be met due to difficulties in obtaining a large corpus, or the limited range of expression in average use. Such domains may encode prior knowledge a… ▽ More

    Submitted 3 December, 2015; v1 submitted 1 October, 2015; originally announced October 2015.

    Comments: 8 pages, 5 figures; incorporated feedback from reviewers; to appear in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence 2016