Showing 1–15 of 15 results for author: Yaliraki, S N

  1. arXiv:2010.15067

    cs.CL cs.AI cs.LG

    Graph-based Topic Extraction from Vector Embeddings of Text Documents: Application to a Corpus of News Articles

    Authors: M. Tarik Altuncu, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: Production of news content is growing at an astonishing rate. To help manage and monitor the sheer amount of text, there is an increasing need to develop efficient methods that can provide insights into emerging content areas, and stratify unstructured corpora of text into `topics' that stem intrinsically from content similarity. Here we present an unsupervised framework that brings together power…

    Submitted 28 October, 2020; originally announced October 2020.

  2. arXiv:2007.07003

    cs.SI cs.CY

    Data-driven modelling and characterisation of task completion sequences in online courses

    Authors: Robert L. Peach, Sam F. Greenbury, Iain G. Johnston, Sophia N. Yaliraki, David Lefevre, Mauricio Barahona

    Abstract: The intrinsic temporality of learning demands the adoption of methodologies capable of exploiting time-series information. In this study we leverage the sequence data framework and show how data-driven analysis of temporal sequences of task completion in online courses can be used to characterise personal and group learners' behaviors, and to identify critical tasks and course sessions in a given…

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: 18 pages

  3. arXiv:2006.02972

    physics.soc-ph cs.IR cs.SI physics.data-an

    Severability of mesoscale components and local time scales in dynamical networks

    Authors: Yun William Yu, Jean-Charles Delvenne, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: A major goal of dynamical systems theory is the search for simplified descriptions of the dynamics of a large number of interacting states. For overwhelmingly complex dynamical systems, the derivation of a reduced description on the entire dynamics at once is computationally infeasible. Other complex systems are so expansive that despite the continual onslaught of new data only partial information…

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: 24 pages, 13 figures

  4. arXiv:1909.00183

    cs.LG cs.CL cs.IR math.SP stat.ML

    Extracting information from free text through unsupervised graph-based clustering: an application to patient incident records

    Authors: M. Tarik Altuncu, Eloise Sorin, Joshua D. Symons, Erik Mayer, Sophia N. Yaliraki, Francesca Toni, Mauricio Barahona

    Abstract: The large volume of text in electronic healthcare records often remains underused due to a lack of methodologies to extract interpretable content. Here we present an unsupervised framework for the analysis of free text that combines text-embedding with paragraph vectors and graph-theoretical multiscale community detection. We analyse text from a corpus of patient incident reports from the National…

    Submitted 31 August, 2019; originally announced September 2019.

    Comments: To appear as a book chapter

  5. arXiv:1902.04047

    cs.SI physics.data-an

    Data-driven unsupervised clustering of online learner behaviour

    Authors: Robert L. Peach, Sophia N. Yaliraki, David Lefevre, Mauricio Barahona

    Abstract: The widespread adoption of online courses opens opportunities for the analysis of learner behaviour and for the optimisation of web-based material adapted to observed usage. Here we introduce a mathematical framework for the analysis of time series collected from online engagement of learners, which allows the identification of clusters of learners with similar online behaviour directly from the d…

    Submitted 16 July, 2019; v1 submitted 11 February, 2019; originally announced February 2019.

    Comments: 16 pages, 5 figures, journal submission, work presented at various conferences

  6. arXiv:1811.05711

    cs.CL cs.IR cs.LG cs.SI math.SP

    From Free Text to Clusters of Content in Health Records: An Unsupervised Graph Partitioning Approach

    Authors: M. Tarik Altuncu, Erik Mayer, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: Electronic Healthcare records contain large volumes of unstructured data in different forms. Free text constitutes a large portion of such data, yet this source of richly detailed information often remains under-used in practice because of a lack of suitable methodologies to extract interpretable content in a timely manner. Here we apply network-theoretical tools to the analysis of free text in Ho…

    Submitted 14 November, 2018; originally announced November 2018.

    Comments: 25 pages, 2 tables, 8 figures and 5 supplementary figures

  7. arXiv:1808.01175

    cs.CL cs.IR cs.LG math.SP

    Content-driven, unsupervised clustering of news articles through multiscale graph partitioning

    Authors: M. Tarik Altuncu, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: The explosion in the amount of news and journalistic content being generated across the globe, coupled with extended and instantaneous access to information through online media, makes it difficult and time-consuming to monitor news developments and opinion formation in real time. There is an increasing need for tools that can pre-process, analyse and classify raw text to extract interpretable con…

    Submitted 3 August, 2018; originally announced August 2018.

    Comments: 8 pages; 5 figures; To present at KDD 2018: Data Science, Journalism & Media workshop

  8. arXiv:1807.02599

    cs.CL cs.IR cs.LG cs.SI math.SP

    From Text to Topics in Healthcare Records: An Unsupervised Graph Partitioning Methodology

    Authors: M. Tarik Altuncu, Erik Mayer, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: Electronic Healthcare Records contain large volumes of unstructured data, including extensive free text. Yet this source of detailed information often remains under-used because of a lack of methodologies to extract interpretable content in a timely manner. Here we apply network-theoretical tools to analyse free text in Hospital Patient Incident reports from the National Health Service, to find cl…

    Submitted 6 July, 2018; originally announced July 2018.

  9. arXiv:1508.03165

    cs.SI physics.soc-ph

    Community detection and role identification in directed networks: understanding the Twitter network of the care.data debate

    Authors: B. Amor, S. Vuik, R. Callahan, A. Darzi, S. N. Yaliraki, M. Barahona

    Abstract: With the rise of social media as an important channel for the debate and discussion of public affairs, online social networks such as Twitter have become important platforms for public information and engagement by policy makers. To communicate effectively through Twitter, policy makers need to understand how influence and interest propagate within its network of users. In this chapter we use grap…

    Submitted 13 August, 2015; originally announced August 2015.

    Comments: 27 pages, 6 figures, to appear in 'Dynamic Networks and Cyber-Security'

  10. arXiv:1507.05458

    physics.soc-ph cs.SI

    Great cities look small

    Authors: Aaron Sim, Sophia N Yaliraki, Mauricio Barahona, Michael P H Stumpf

    Abstract: Great cities connect people; failed cities isolate people. Despite the fundamental importance of physical, face-to-face social-ties in the functioning of cities, these connectivity networks are not explicitly observed in their entirety. Attempts at estimating them often rely on unrealistic over-simplifications such as the assumption of spatial homogeneity. Here we propose a mathematical model of h…

    Submitted 20 July, 2015; originally announced July 2015.

    Comments: 19 pages, 8 figures

    Journal ref: J. R. Soc. Interface 2015 12 20150315. Published 15 July 2015

  11. arXiv:1311.6785

    physics.soc-ph cs.SI

    Interest communities and flow roles in directed networks: the Twitter network of the UK riots

    Authors: Mariano Beguerisse-Díaz, Guillermo Garduño-Hernández, Borislav Vangelov, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: Directionality is a crucial ingredient in many complex networks in which information, energy or influence are transmitted. In such directed networks, analysing flows (and not only the strength of connections) is crucial to reveal important features of the network that might go undetected if the orientation of connections is ignored. We showcase here a flow-based approach for community detection in…

    Submitted 8 October, 2014; v1 submitted 26 November, 2013; originally announced November 2013.

    Comments: 32 pages, 14 figures. Supplementary Spreadsheet available from: http://www2.imperial.ac.uk/~mbegueri/Docs/riotsCommunities.zip or http://rsif.royalsocietypublishing.org/content/11/101/20140940/suppl/DC1

    Journal ref: J. R. Soc. Interface 6 December 2014 vol. 11 no. 101 20140940

  12. arXiv:1308.1605

    physics.soc-ph cond-mat.stat-mech cs.SI physics.data-an

    The stability of a graph partition: A dynamics-based framework for community detection

    Authors: Jean-Charles Delvenne, Michael T. Schaub, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: Recent years have seen a surge of interest in the analysis of complex networks, facilitated by the availability of relational data and the increasingly powerful computational resources that can be employed for their analysis. Naturally, the study of real-world systems leads to highly complex networks and a current challenge is to extract intelligible, simplified descriptions from the network in te…

    Submitted 7 August, 2013; originally announced August 2013.

    Comments: 3 figures; published as book chapter

    Journal ref: Dynamics On and Of Complex Networks, Volume 2, pp 221-242, Springer 2013

  13. arXiv:1303.6241

    physics.soc-ph cs.SI

    Structure of complex networks: Quantifying edge-to-edge relations by failure-induced flow redistribution

    Authors: Michael T. Schaub, Jörg Lehmann, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: The analysis of complex networks has so far revolved mainly around the role of nodes and communities of nodes. However, the dynamics of interconnected systems is commonly focalised on edge processes, and a dual edge-centric perspective can often prove more natural. Here we present graph-theoretical measures to quantify edge-to-edge relations inspired by the notion of flow redistribution induced by…

    Submitted 7 April, 2014; v1 submitted 25 March, 2013; originally announced March 2013.

    Comments: 24 pages, 6 figures

    Journal ref: Network Science, 2014, 2(1), pp. 66–89

  14. arXiv:1109.5593

    physics.soc-ph cs.SI

    Markov dynamics as a zooming lens for multiscale community detection: non clique-like communities and the field-of-view limit

    Authors: Michael T. Schaub, Jean-Charles Delvenne, Sophia N. Yaliraki, Mauricio Barahona

    Abstract: In recent years, there has been a surge of interest in community detection algorithms for complex networks. A variety of computational heuristics, some with a long history, have been proposed for the identification of communities or, alternatively, of good graph partitions. In most cases, the algorithms maximize a particular objective function, thereby finding the `right' split into communities. A…

    Submitted 17 January, 2012; v1 submitted 26 September, 2011; originally announced September 2011.

    Comments: 20 pages, 6 figures

    Journal ref: PLoS ONE, 2012, 7(2), e32210

  15. arXiv:0812.1811

    physics.soc-ph cs.IR physics.data-an

    Stability of graph communities across time scales

    Authors: J.-C. Delvenne, S. N. Yaliraki, M. Barahona

    Abstract: The complexity of biological, social and engineering networks makes it desirable to find natural partitions into communities that can act as simplified descriptions and provide insight into the structure and function of the overall system. Although community detection methods abound, there is a lack of consensus on how to quantify and rank the quality of partitions. We show here that the quality…

    Submitted 11 March, 2009; v1 submitted 9 December, 2008; originally announced December 2008.

    Comments: submitted; updated bibliography from v3