Skip to main content

Showing 1–21 of 21 results for author: Levene, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10117  [pdf, other

    cs.LG

    Trustworthy Artificial Intelligence in the Context of Metrology

    Authors: Tameem Adel, Sam Bilson, Mark Levene, Andrew Thompson

    Abstract: We review research at the National Physical Laboratory (NPL) in the area of trustworthy artificial intelligence (TAI), and more specifically trustworthy machine learning (TML), in the context of metrology, the science of measurement. We describe three broad themes of TAI: technical, socio-technical and social, which play key roles in ensuring that the developed models are trustworthy and can be re… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Journal ref: In Producing Artificial Intelligent Systems: The roles of Benchmarking, Standardisation and Certification, Studies in Computational Intelligence, edited by M. I. A. Ferreira, 2024, Springer

  2. arXiv:2309.02188  [pdf, other

    cs.CL cs.SI

    Incorporating Dictionaries into a Neural Network Architecture to Extract COVID-19 Medical Concepts From Social Media

    Authors: Abul Hasan, Mark Levene, David Weston

    Abstract: We investigate the potential benefit of incorporating dictionary information into a neural network architecture for natural language processing. In particular, we make use of this architecture to extract several concepts related to COVID-19 from an on-line medical forum. We use a sample from the forum to manually curate one dictionary for each concept. In addition, we use MetaMap, which is a tool… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  3. arXiv:2103.11850  [pdf, other

    cs.CL

    Monitoring Covid-19 on social media using a novel triage and diagnosis approach

    Authors: Abul Hasan, Mark Levene, David Weston, Renate Fromson, Nicolas Koslover, Tamara Levene

    Abstract: Objective: This study aims to develop an end-to-end natural language processing pipeline for triage and diagnosis of COVID-19 from patient-authored social media posts, in order to provide researchers and public health practitioners with additional information on the symptoms, severity and prevalence of the disease rather than to provide an actionable decision at the individual level. Materials and… ▽ More

    Submitted 7 January, 2022; v1 submitted 22 March, 2021; originally announced March 2021.

    Comments: 13 pages, 6 figrues

  4. Potential gain as a centrality measure

    Authors: Pasquale De Meo, Mark Levene, Alessandro Provetti

    Abstract: Navigability is a distinctive features of graphs associated with artificial or natural systems whose primary goal is the transportation of information or goods. We say that a graph $\mathcal{G}$ is navigable when an agent is able to efficiently reach any target node in $\mathcal{G}$ by means of local routing decisions. In a social network navigability translates to the ability of reaching an indiv… ▽ More

    Submitted 17 May, 2020; originally announced May 2020.

    Comments: In Proceedings of Web Intelligence 2019 (WI19), the IEEE/WIC/ACM International Conference on Web Intelligence, pages 418--422. arXiv admin note: text overlap with arXiv:1812.08012

    ACM Class: I.2.8; F.2.1

  5. arXiv:2002.06450  [pdf, ps, other

    cs.CL cs.IR cs.LG

    Supervised Phrase-boundary Embeddings

    Authors: Manni Singh, David Weston, Mark Levene

    Abstract: We propose a new word embedding model, called SPhrase, that incorporates supervised phrase information. Our method modifies traditional word embeddings by ensuring that all target words in a phrase have exactly the same context. We demonstrate that including this information within a context window produces superior embeddings for both intrinsic evaluation tasks and downstream extrinsic tasks.

    Submitted 15 February, 2020; originally announced February 2020.

    Comments: 12 pages, 3 figures, 4 tables

  6. Characterisation of the $χ$-index and the $rec$-index

    Authors: Mark Levene, Trevor Fenner, Judit Bar-Ilan

    Abstract: Axiomatic characterisation of a bibliometric index provides insight into the properties that the index satisfies and facilitates the comparison of different indices. A geometric generalisation of the $h$-index, called the $χ$-index, has recently been proposed to address some of the problems with the $h$-index, in particular, the fact that it is not scale invariant, i.e., multiplying the number of… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: 14 pages, 3 figures. This is a pre-print of an article published in Scientometrics. The final authenticated version is available online at: https://doi.org/10.1007/s11192-019-03151-7

  7. arXiv:1903.05440  [pdf, other

    cs.CL cs.CY

    Market Trend Prediction using Sentiment Analysis: Lessons Learned and Paths Forward

    Authors: Andrius Mudinas, Dell Zhang, Mark Levene

    Abstract: Financial market forecasting is one of the most attractive practical applications of sentiment analysis. In this paper, we investigate the potential of using sentiment \emph{attitudes} (positive vs negative) and also sentiment \emph{emotions} (joy, sadness, etc.) extracted from financial news or tweets to help predict stock price movements. Our extensive experiments using the \emph{Granger-causali… ▽ More

    Submitted 13 March, 2019; originally announced March 2019.

    Comments: 10 pages, 4 figues, 6 tables

  8. arXiv:1812.08012  [pdf, other

    cs.SI physics.soc-ph

    A general centrality framework based on node navigability

    Authors: Pasquale De Meo, Mark Levene, Fabrizio Messina, Alessandro Provetti

    Abstract: Centrality metrics are a popular tool in Network Science to identify important nodes within a graph. We introduce the Potential Gain as a centrality measure that unifies many walk-based centrality metrics in graphs and captures the notion of node navigability, interpreted as the property of being reachable from anywhere else (in the graph) through short walks. Two instances of the Potential Gain (… ▽ More

    Submitted 12 March, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: 26 pages, 11 figures. To be published in IEEE Transactions on Knowledge and Data Engineering

  9. arXiv:1603.07150  [pdf, other

    cs.DL cs.CL cs.IR

    The Anatomy of a Search and Mining System for Digital Archives

    Authors: Martyn Harris, Mark Levene, Dell Zhang, Dan Levene

    Abstract: Samtla (Search And Mining Tools with Linguistic Analysis) is a digital humanities system designed in collaboration with historians and linguists to assist them with their research work in quantifying the content of any textual corpora through approximate phrase search and document comparison. The retrieval engine uses a character-based n-gram language model rather than the conventional word-based… ▽ More

    Submitted 23 March, 2016; originally announced March 2016.

    Comments: 49 pages

  10. arXiv:1511.08712  [pdf, ps, other

    physics.soc-ph cs.SI

    A stochastic evolutionary model generating a mixture of exponential distributions

    Authors: Trevor Fenner, Mark Levene, George Loizou

    Abstract: Recent interest in human dynamics has stimulated the investigation of the stochastic processes that explain human behaviour in various contexts, such as mobile phone networks and social media. In this paper, we extend the stochastic urn-based model proposed in \cite{FENN15} so that it can generate mixture models,in particular, a mixture of exponential distributions. The model is designed to captur… ▽ More

    Submitted 14 January, 2016; v1 submitted 27 November, 2015; originally announced November 2015.

    Comments: 14 pages. arXiv admin note: substantial text overlap with arXiv:1502.07558

  11. arXiv:1304.6945  [pdf, ps, other

    cs.DL

    A bibliometric index based on the complete list of cited publications

    Authors: Mark Levene, Trevor Fenner, Judit Bar-Ilan

    Abstract: We propose a new index, the $j$-index, which is defined for an author as the sum of the square roots of the numbers of citations to each of the author's publications. The idea behind the $j$-index it to remedy a drawback of the $h$-index $-$ that the $h$-index does not take into account the full citation record of a researcher. The square root function is motivated by our desire to avoid the possi… ▽ More

    Submitted 25 April, 2013; originally announced April 2013.

    Comments: 12 pages

  12. arXiv:1103.1530  [pdf, ps, other

    physics.soc-ph cs.AI

    A Discrete Evolutionary Model for Chess Players' Ratings

    Authors: Trevor Fenner, Mark Levene, George Loizou

    Abstract: The Elo system for rating chess players, also used in other games and sports, was adopted by the World Chess Federation over four decades ago. Although not without controversy, it is accepted as generally reliable and provides a method for assessing players' strengths and ranking them in official tournaments. It is generally accepted that the distribution of players' rating data is approximately… ▽ More

    Submitted 30 March, 2011; v1 submitted 8 March, 2011; originally announced March 2011.

    Comments: 17 pages, 4 figures

  13. arXiv:0904.2595  [pdf, ps, other

    cs.AI cs.LG

    A Methodology for Learning Players' Styles from Game Records

    Authors: Mark Levene, Trevor Fenner

    Abstract: We describe a preliminary investigation into learning a Chess player's style from game records. The method is based on attempting to learn features of a player's individual evaluation function using the method of temporal differences, with the aid of a conventional Chess engine architecture. Some encouraging results were obtained in learning the styles of two recent Chess world champions, and we… ▽ More

    Submitted 16 April, 2009; originally announced April 2009.

    Comments: 15 pages, 3 figures

  14. arXiv:cs/0610060  [pdf, ps, other

    cs.AI

    Comparing Typical Opening Move Choices Made by Humans and Chess Engines

    Authors: Mark Levene, Judit Bar-Ilan

    Abstract: The opening book is an important component of a chess engine, and thus computer chess programmers have been develo** automated methods to improve the quality of their books. For chess, which has a very rich opening theory, large databases of high-quality games can be used as the basis of an opening book, from which statistics relating to move choices from given positions can be collected. In o… ▽ More

    Submitted 11 October, 2006; originally announced October 2006.

    Comments: 12 pages, 1 figure, 6 tables

    ACM Class: I.2.0

  15. arXiv:cs/0606115  [pdf, ps, other

    cs.AI cs.IR

    Evaluating Variable Length Markov Chain Models for Analysis of User Web Navigation Sessions

    Authors: Jose Borges, Mark Levene

    Abstract: Markov models have been widely used to represent and analyse user web navigation data. In previous work we have proposed a method to dynamically extend the order of a Markov chain model and a complimentary method for assessing the predictive power of such a variable length Markov chain. Herein, we review these two methods and propose a novel method for measuring the ability of a variable length… ▽ More

    Submitted 28 June, 2006; originally announced June 2006.

  16. arXiv:cs/0505039  [pdf

    cs.IR

    Methods for comparing rankings of search engine results

    Authors: Judit Bar-Ilan, Mazlita Mat-Hassan, Mark Levene

    Abstract: In this paper we present a number of measures that compare rankings of search engine results. We apply these measures to five queries that were monitored daily for two periods of about 21 days each. Rankings of the different search engines (Google, Yahoo and Teoma for text searches and Google, Yahoo and Picsearch for image searches) are compared on a daily basis, in addition to longitudinal comp… ▽ More

    Submitted 14 May, 2005; originally announced May 2005.

    Comments: 19 pages, 4 figures, 8 tables

    ACM Class: H.3.3

  17. arXiv:cs/0503030  [pdf, ps, other

    cs.AI cs.CL

    A Suffix Tree Approach to Email Filtering

    Authors: Rajesh M. Pampapathi, Boris Mirkin, Mark Levene

    Abstract: We present an approach to email filtering based on the suffix tree data structure. A method for the scoring of emails using the suffix tree is developed and a number of scoring and score normalisation functions are tested. Our results show that the character level representation of emails and classes facilitated by the suffix tree can significantly improve classification accuracy when compared w… ▽ More

    Submitted 6 December, 2005; v1 submitted 14 March, 2005; originally announced March 2005.

    Comments: Revisions made in the light of reviewer comments. Main changes: (i) The extension and elaboration of section 4.4 which describes the scoring algorithm; (ii) Favouring the use of false positive and false negative performance measures over the use of precision and recall; (iii) The addition of ROC curves wherever possible; and (iv) Inclusion of performance statistics for algorithm. Re-submitted 5th August 2005

  18. arXiv:cs/0412002  [pdf, ps, other

    cs.AI cs.IR

    Ranking Pages by Topology and Popularity within Web Sites

    Authors: Jose Borges, Mark Levene

    Abstract: We compare two link analysis ranking methods of web pages in a site. The first, called Site Rank, is an adaptation of PageRank to the granularity of a web site and the second, called Popularity Rank, is based on the frequencies of user clicks on the outlinks in a page that are captured by navigation sessions of users through the web site. We ran experiments on artificially created web sites of d… ▽ More

    Submitted 9 December, 2004; v1 submitted 1 December, 2004; originally announced December 2004.

    Comments: 15 pages, 6 figures

  19. arXiv:cs/0406032  [pdf

    cs.IR cs.AI

    A Dynamic Clustering-Based Markov Model for Web Usage Mining

    Authors: José Borges, Mark Levene

    Abstract: Markov models have been widely utilized for modelling user web navigation behaviour. In this work we propose a dynamic clustering-based method to increase a Markov model's accuracy in representing a collection of user web navigation sessions. The method makes use of the state cloning concept to duplicate states in a way that separates in-links whose corresponding second-order probabilities diver… ▽ More

    Submitted 17 June, 2004; originally announced June 2004.

  20. arXiv:cs/0307073  [pdf, ps, other

    cs.DB

    Search and Navigation in Relational Databases

    Authors: Richard Wheeldon, Mark Levene, Kevin Keenoy

    Abstract: We present a new application for keyword search within relational databases, which uses a novel algorithm to solve the join discovery problem by finding Memex-like trails through the graph of foreign key dependencies. It differs from previous efforts in the algorithms used, in the presentation mechanism and in the use of primary-key only database queries at query-time to maintain a fast response… ▽ More

    Submitted 31 July, 2003; originally announced July 2003.

    Comments: 12 pages, 7 figures

    ACM Class: H.3; H.4; H.5

  21. arXiv:cs/0306122  [pdf, ps, other

    cs.DS cs.IR

    The Best Trail Algorithm for Assisted Navigation of Web Sites

    Authors: Richard Wheeldon, Mark Levene

    Abstract: We present an algorithm called the Best Trail Algorithm, which helps solve the hypertext navigation problem by automating the construction of memex-like trails through the corpus. The algorithm performs a probabilistic best-first expansion of a set of navigation trees to find relevant and compact trails. We describe the implementation of the algorithm, scoring methods for trails, filtering algor… ▽ More

    Submitted 22 June, 2003; originally announced June 2003.

    Comments: 11 pages, 11 figures

    ACM Class: H.3.3; H.5.4; G.2.2; F.2.2