Search | arXiv e-print repository

Automated HER2 Scoring in Breast Cancer Images Using Deep Learning and Pyramid Sampling

Authors: Sahan Yoruc Selcuk, Xilin Yang, Bijie Bai, Yijie Zhang, Yuzhu Li, Musa Aydin, Aras Firat Unal, Aditya Gomatam, Zhen Guo, Darrow Morgan Angus, Goren Kolodney, Karine Atlan, Tal Keidar Haran, Nir Pillar, Aydogan Ozcan

Abstract: Human epidermal growth factor receptor 2 (HER2) is a critical protein in cancer cell growth that signifies the aggressiveness of breast cancer (BC) and helps predict its prognosis. Accurate assessment of immunohistochemically (IHC) stained tissue slides for HER2 expression levels is essential for both treatment guidance and understanding of cancer mechanisms. Nevertheless, the traditional workflow of manual examination by board-certified pathologists encounters challenges, including inter- and intra-observer inconsistency and extended turnaround times. Here, we introduce a deep learning-based approach utilizing pyramid sampling for the automated classification of HER2 status in IHC-stained BC tissue images. Our approach analyzes morphological features at various spatial scales, efficiently managing the computational load and facilitating a detailed examination of cellular and larger-scale tissue-level details. This method addresses the tissue heterogeneity of HER2 expression by providing a comprehensive view, leading to a blind testing classification accuracy of 84.70%, on a dataset of 523 core images from tissue microarrays. Our automated system, proving reliable as an adjunct pathology tool, has the potential to enhance diagnostic precision and evaluation speed, and might significantly impact cancer treatment planning.

Submitted 31 March, 2024; originally announced April 2024.

Comments: 21 Pages, 7 Figures

arXiv:2311.03221 [pdf, other]

Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet

Authors: Hector Arroyo, Paul Kier, Dylan Angus, Santiago Matalonga, Svetlozar Georgiev, Mehdi Goli, Gerard Dooly, James Riordan

Abstract: The integration of unmanned aerial vehicles (UAVs) into shared airspace for beyond visual line of sight (BVLOS) operations presents significant challenges but holds transformative potential for sectors like transportation, construction, energy and defense. A critical prerequisite for this integration is equipping UAVs with enhanced situational awareness to ensure safe operations. Current approaches mainly target single object detection or classification, or simpler sensing outputs that offer limited perceptual understanding and lack the rapid end-to-end processing needed to convert sensor data into safety-critical insights. In contrast, our study leverages radar technology for novel end-to-end semantic segmentation of aerial point clouds to simultaneously identify multiple collision hazards. By adapting and optimizing the PointNet architecture and integrating aerial domain insights, our framework distinguishes five distinct classes: mobile drones (DJI M300 and DJI Mini) and airplanes (Ikarus C42), and static returns (ground and infrastructure), which results in enhanced situational awareness for UAVs. To our knowledge, this is the first approach addressing simultaneous identification of multiple collision threats in an aerial setting, achieving a robust 94% accuracy. This work highlights the potential of radar technology to advance situational awareness in UAVs, facilitating safe and efficient BVLOS operations.

Submitted 6 November, 2023; originally announced November 2023.

Comments: 16 pages, 13 figures, 4 tables

arXiv:2303.08123 [pdf, other]

Identifying Promising Candidate Radiotherapy Protocols via GPU-GA in-silico

Authors: Wojciech Ozimek, Rafał Banaś, Paweł Gora, Simon D. Angus, Monika J. Piotrowska

Abstract: Around half of all cancer patients, world-wide, will receive some form of radiotherapy (RT) as part of their treatment. And yet, despite the rapid advance of high-throughput screening to identify successful chemotherapy drug candidates, there is no current analogue for RT protocol screening or discovery at any scale. Here we introduce and demonstrate the application of a high-throughput/high-fidelity coupled tumour-irradiation simulation approach, we call "GPU-GA", and apply it to human breast cancer analogue - EMT6/Ro spheroids. By analysing over 9.5 million candidate protocols, GPU-GA yields significant gains in tumour suppression versus prior state-of-the-art high-fidelity/-low-throughput computational search under two clinically relevant benchmarks. By extending the search space to hypofractionated areas (> 2 Gy/day) yet within total dose limits, further tumour suppression of up to 33.7% compared to state-of-the-art is obtained. GPU-GA could be applied to any cell line with sufficient empirical data, and to many clinically relevant RT considerations.

Submitted 6 April, 2023; v1 submitted 24 February, 2023; originally announced March 2023.

arXiv:2206.00397 [pdf, other]

Predicting Political Ideology from Digital Footprints

Authors: Michael Kitchener, Nandini Anantharama, Simon D. Angus, Paul A. Raschky

Abstract: This paper proposes a new method to predict individual political ideology from digital footprints on one of the world's largest online discussion forums. We compiled a unique data set from the online discussion forum reddit that contains information on the political ideology of around 91,000 users as well as records of their comment frequency and the comments' text corpus in over 190,000 different subforums of interest. Applying a set of statistical learning approaches, we show that information about activity in non-political discussion forums alone can very accurately predict a user's political ideology. Depending on the model, we are able to predict the economic dimension of ideology with an accuracy of up to 90.63% and the social dimension with an accuracy of up to 82.02%. In comparison, using the textual features from actual comments does not improve predictive accuracy. Our paper highlights the importance of revealed digital behaviour to complement stated preferences from digital communication when analysing human preferences and behaviour using online data.

Submitted 1 June, 2022; originally announced June 2022.

arXiv:2202.08209 [pdf, other]

An Extension Of Combinatorial Contextuality For Cognitive Protocols

Authors: Abdul Karim Obeid, Peter Bruza, Catarina Moreira, Axel Bruns, Daniel Angus

Abstract: This article extends the combinatorial approach to support the determination of contextuality amidst causal influences. Contextuality is an active field of study in Quantum Cognition, in systems relating to mental phenomena, such as concepts in human memory [Aerts et al., 2013]. In the cognitive field of study, a contemporary challenge facing the determination of whether a phenomenon is contextual has been the identification and management of disturbances [Dzhafarov et al., 2016]. Whether or not said disturbances are identified through the modelling approach, constitute causal influences, or are disregardable as noise is important, as contextuality cannot be adequately determined in the presence of causal influences [Gleason, 1957]. To address this challenge, we first provide a formalisation of necessary elements of the combinatorial approach within the language of canonical causal models. Through this formalisation, we extend the combinatorial approach to support a measurement and treatment of disturbance, and offer techniques to separately distinguish noise and causal influences. Thereafter, we develop a protocol through which these elements may be represented within a cognitive experiment. As human cognition seems rife with causal influences, cognitive modellers may apply the extended combinatorial approach to practically determine the contextuality of cognitive phenomena.

Submitted 14 February, 2022; originally announced February 2022.

Comments: 28 pages, 10 figures, 5 tables

arXiv:2010.08102 [pdf, other]

Estimating Sleep & Work Hours from Alternative Data by Segmented Functional Classification Analysis (SFCA)

Authors: Klaus Ackermann, Simon D. Angus, Paul A. Raschky

Abstract: Alternative data is increasingly adapted to predict human and economic behaviour. This paper introduces a new type of alternative data by re-conceptualising the internet as a data-driven insights platform at global scale. Using data from a unique internet activity and location dataset drawn from over 1.5 trillion observations of end-user internet connections, we construct a functional dataset covering over 1,600 cities during a 7 year period with temporal resolution of just 15min. To predict accurate temporal patterns of sleep and work activity from this data-set, we develop a new technique, Segmented Functional Classification Analysis (SFCA), and compare its performance to a wide array of linear, functional, and classification methods. To confirm the wider applicability of SFCA, in a second application we predict sleep and work activity using SFCA from US city-wide electricity demand functional data. Across both problems, SFCA is shown to out-perform current methods.

Submitted 15 October, 2020; originally announced October 2020.

arXiv:1805.08929 [pdf, ps, other]

Determining the Number of Samples Required to Estimate Entropy in Natural Sequences

Authors: Andrew D. Back, Daniel Angus, Janet Wiles

Abstract: Calculating the Shannon entropy for symbolic sequences has been widely considered in many fields. For descriptive statistical problems such as estimating the N-gram entropy of English language text, a common approach is to use as much data as possible to obtain progressively more accurate estimates. However in some instances, only short sequences may be available. This gives rise to the question of how many samples are needed to compute entropy. In this paper, we examine this problem and propose a method for estimating the number of samples required to compute Shannon entropy for a set of ranked symbolic natural events. The result is developed using a modified Zipf-Mandelbrot law and the Dvoretzky-Kiefer-Wolfowitz inequality, and we propose an algorithm which yields an estimate for the minimum number of samples required to obtain an estimate of entropy with a given confidence level and degree of accuracy.

Submitted 22 May, 2018; originally announced May 2018.

arXiv:1805.06630 [pdf, ps, other]

Fast Entropy Estimation for Natural Sequences

Authors: Andrew D. Back, Daniel Angus, Janet Wiles

Abstract: It is well known that to estimate the Shannon entropy for symbolic sequences accurately requires a large number of samples. When some aspects of the data are known it is plausible to attempt to use this to more efficiently compute entropy. A number of methods having various assumptions have been proposed which can be used to calculate entropy for small sample sizes. In this paper, we examine this problem and propose a method for estimating the Shannon entropy for a set of ranked symbolic natural events. Using a modified Zipf-Mandelbrot-Li law and a new rank-based coincidence counting method, we propose an efficient algorithm which enables the entropy to be estimated with surprising accuracy using only a small number of samples. The algorithm is tested on some natural sequences and shown to yield accurate results with very small amounts of data.

Submitted 17 May, 2018; originally announced May 2018.

arXiv:1701.05632 [pdf, other]

The Internet as Quantitative Social Science Platform: Insights from a Trillion Observations

Authors: Klaus Ackermann, Simon D Angus, Paul A Raschky

Abstract: With the large-scale penetration of the internet, for the first time, humanity has become linked by a single, open, communications platform. Harnessing this fact, we report insights arising from a unified internet activity and location dataset of an unparalleled scope and accuracy drawn from over a trillion ($1.5 \times 10^{12}$) observations of end-user internet connections, with temporal resolution of just 15min over 2006-2012. We first apply this dataset to the expansion of the internet itself over 1,647 urban agglomerations globally. We find that unique IP per capita counts reach saturation at approximately one IP per three people, and take, on average, 16.1 years to achieve; eclipsing the estimated 100- and 60- year saturation times for steam-power and electrification respectively. Next, we use intra-diurnal internet activity features to up-scale traditional over-night sleep observations, producing the first global estimate of over-night sleep duration in 645 cities over 7 years. We find statistically significant variation between continental, national and regional sleep durations including some evidence of global sleep duration convergence. Finally, we estimate the relationship between internet concentration and economic outcomes in 411 OECD regions and find that the internet's expansion is associated with negative or positive productivity gains, depending strongly on sectoral considerations. To our knowledge, our study is the first of its kind to use online/offline activity of the entire internet to infer social science insights, demonstrating the unparalleled potential of the internet as a social data-science platform.

Submitted 19 January, 2017; originally announced January 2017.

Comments: 40 pages, including 4 main figures, and appendix

arXiv:1503.06522 [pdf, other]

doi 10.1371/journal.pcbi.1004587

Shared intentions and the advance of cumulative culture in hunter-gatherers

Authors: Simon D. Angus, Jonathan Newton

Abstract: It has been hypothesized that the evolution of modern human cognition was catalyzed by the development of jointly intentional modes of behaviour. From an early age (1-2 years), human infants outperform apes at tasks that involve collaborative activity. Specifically, human infants excel at joint action motivated by reasoning of the form "we will do X" (shared intentions), as opposed to reasoning of the form "I will do X [because he is doing X]" (individual intentions). The mechanism behind the evolution of shared intentionality is unknown. Here we formally model the evolution of jointly intentional action and show under what conditions it is likely to have emerged in humans. Modelling the interaction of hunter-gatherers as a coordination game, we find that when the benefits from adopting new technologies or norms are low but positive, the sharing of intentions does not evolve, despite being a mutualistic behaviour that directly benefits all participants. When the benefits from adopting new technologies or norms are high, such as may be the case during a period of rapid environmental change, shared intentionality evolves and rapidly becomes dominant in the population. Our results shed new light on the evolution of collaborative behaviours.

Submitted 23 March, 2015; v1 submitted 23 March, 2015; originally announced March 2015.

Comments: 6 pages, 4 figures, 1 table, Supplementary Information not included

arXiv:1312.6520 [pdf, other]

The mass-hierarchy and CP-violation discovery reach of the LBNO long-baseline neutrino experiment

Authors: LAGUNA-LBNO Collaboration, S. K. Agarwalla, L. Agostino, M. Aittola, A. Alekou, B. Andrieu, D. Angus, F. Antoniou, A. Ariga, T. Ariga, R. Asfandiyarov, D. Autiero, P. Ballett, I. Bandac, D. Banerjee, G. J. Barker, G. Barr, W. Bartmann, F. Bay, V. Berardi, I. Bertram, O. Bésida, A. M. Blebea-Apostu, A. Blondel, et al. (193 additional authors not shown)

Abstract: The next generation neutrino observatory proposed by the LBNO collaboration will address fundamental questions in particle and astroparticle physics. The experiment consists of a far detector, in its first stage a 20 kt LAr double phase TPC and a magnetised iron calorimeter, situated at 2300 km from CERN and a near detector based on a high-pressure argon gas TPC. The long baseline provides a unique opportunity to study neutrino flavour oscillations over their 1st and 2nd oscillation maxima exploring the $L/E$ behaviour, and distinguishing effects arising from $\delta_{CP}$ and matter. In this paper we have reevaluated the physics potential of this setup for determining the mass hierarchy (MH) and discovering CP-violation (CPV), using a conventional neutrino beam from the CERN SPS with a power of 750 kW. We use conservative assumptions on the knowledge of oscillation parameter priors and systematic uncertainties. The impact of each systematic error and the precision of oscillation prior is shown. We demonstrate that the first stage of LBNO can determine unambiguously the MH to $>5\sigma$ C.L. over the whole phase space. We show that the statistical treatment of the experiment is of very high importance, resulting in the conclusion that LBNO has $\sim$ 100% probability to determine the MH in at most 4-5 years of running. Since the knowledge of MH is indispensable to extract $\delta_{CP}$ from the data, the first LBNO phase can convincingly give evidence for CPV on the $3\sigma$ C.L. using today's knowledge on oscillation parameters and realistic assumptions on the systematic uncertainties.

Submitted 20 January, 2014; v1 submitted 23 December, 2013; originally announced December 2013.

Comments: 35 pages, 22 figures, added authors

arXiv:1001.0077 [pdf, ps, other]

The LAGUNA design study- towards giant liquid based underground detectors for neutrino physics and astrophysics and proton decay searches

Authors: LAGUNA Collaboration, D. Angus, A. Ariga, D. Autiero, A. Apostu, A. Badertscher, T. Bennet, G. Bertola, P. F. Bertola, O. Besida, A. Bettini, C. Booth, J. L. Borne, I. Brancus, W. Bujakowsky, J. E. Campagne, G. Cata Danil, F. Chipesiu, M. Chorowski, J. Cripps, A. Curioni, S. Davidson, Y. Declais, U. Drost, O. Duliu, et al. (99 additional authors not shown)

Abstract: The feasibility of a next generation neutrino observatory in Europe is being considered within the LAGUNA design study. To accommodate giant neutrino detectors and shield them from cosmic rays, a new very large underground infrastructure is required. Seven potential candidate sites in different parts of Europe and at several distances from CERN are being studied: Boulby (UK), Canfranc (Spain), Fréjus (France/Italy), Pyhäsalmi (Finland), Polkowice-Sieroszowice (Poland), Slanic (Romania) and Umbria (Italy). The design study aims at the comprehensive and coordinated technical assessment of each site, at a coherent cost estimation, and at a prioritization of the sites within the summer 2010.

Submitted 30 December, 2009; originally announced January 2010.

Comments: 5 pages, contribution to the Workshop "European Strategy for Future Neutrino Physics", CERN, Oct. 2009

Showing 1–12 of 12 results for author: Angus, D