Showing 1–38 of 38 results for author: Jacobs, C

  1. arXiv:2401.10543  [pdf, other]

    eess.AS cs.CL cs.SD

    Multilingual acoustic word embeddings for zero-resource languages

    Authors: Christiaan Jacobs

    Abstract: This research addresses the challenge of developing speech applications for zero-resource languages that lack labelled data. It specifically uses acoustic word embedding (AWE) -- fixed-dimensional representations of variable-duration speech segments -- employing multilingual transfer, where labelled data from several well-resourced languages are used for pretraining. The study introduces a new neur…

    Submitted 23 January, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: PhD thesis

  2. arXiv:2401.02192  [pdf]

    eess.IV cs.CV cs.LG

    Nodule detection and generation on chest X-rays: NODE21 Challenge

    Authors: Ecem Sogancioglu, Bram van Ginneken, Finn Behrendt, Marcel Bengs, Alexander Schlaefer, Miron Radu, Di Xu, Ke Sheng, Fabien Scalzo, Eric Marcus, Samuele Papa, Jonas Teuwen, Ernst Th. Scholten, Steven Schalekamp, Nils Hendrix, Colin Jacobs, Ward Hendrix, Clara I Sánchez, Keelin Murphy

    Abstract: Pulmonary nodules may be an early manifestation of lung cancer, the leading cause of cancer-related deaths among both men and women. Numerous studies have established that deep learning methods can yield high-performance levels in the detection of lung nodules in chest X-rays. However, the lack of gold-standard public datasets slows down the progression of the research and prevents benchmarking of…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures

  3. arXiv:2311.05032  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Transfer learning from a sparsely annotated dataset of 3D medical images

    Authors: Gabriel Efrain Humpire-Mamani, Colin Jacobs, Mathias Prokop, Bram van Ginneken, Nikolas Lessmann

    Abstract: Transfer learning leverages pre-trained model features from a large dataset to save time and resources when training new models for various tasks, potentially enhancing performance. Due to the lack of large datasets in the medical imaging domain, transfer learning from one medical imaging model to other medical imaging models has not been widely explored. This study explores the use of transfer le…

    Submitted 8 November, 2023; originally announced November 2023.

  4. Tracking China's cross-strait bot networks against Taiwan

    Authors: Charity S. Jacobs, Lynnette Hui Xian Ng, Kathleen M. Carley

    Abstract: The cross-strait relationship between China and Taiwan is marked by increasing hostility around potential reunification. We analyze an unattributed bot network and how repeater bots engaged in an influence campaign against Taiwan following US House Speaker Nancy Pelosi's visit to Taiwan in 2022. We examine the message amplification tactics employed by four key bot sub-communities, the widespread d…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 10 pages with 5 figures. Published in Conference Proceedings for Social, Cultural, and Behavioral Modeling (SBP-BRiMS 2023)

  5. arXiv:2309.03383  [pdf, other]

    eess.IV cs.CV

    Kidney abnormality segmentation in thorax-abdomen CT scans

    Authors: Gabriel Efrain Humpire Mamani, Nikolas Lessmann, Ernst Th. Scholten, Mathias Prokop, Colin Jacobs, Bram van Ginneken

    Abstract: In this study, we introduce a deep learning approach for segmenting kidney parenchyma and kidney abnormalities to support clinicians in identifying and quantifying renal abnormalities such as cysts, lesions, masses, metastases, and primary tumors. Our end-to-end segmentation method was trained on 215 contrast-enhanced thoracic-abdominal CT scans, with half of these scans containing one or more abn…

    Submitted 6 September, 2023; originally announced September 2023.

  6. arXiv:2309.02576  [pdf, other]

    eess.IV cs.CV cs.LG

    Emphysema Subtyping on Thoracic Computed Tomography Scans using Deep Neural Networks

    Authors: Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Dirk Jan Slebos, Bram van Ginneken

    Abstract: Accurate identification of emphysema subtypes and severity is crucial for effective management of COPD and the study of disease heterogeneity. Manual analysis of emphysema subtypes and severity is laborious and subjective. To address this challenge, we present a deep learning-based approach for automating the Fleischner Society's visual score system for emphysema subtyping and severity analysis. W…

    Submitted 5 September, 2023; originally announced September 2023.

    Journal ref: Sci Rep. 2023 Aug 29;13(1):14147

  7. arXiv:2308.07179  [pdf, other]

    cs.CL

    Incorporating Annotator Uncertainty into Representations of Discourse Relations

    Authors: S. Magalí López Cortez, Cassandra L. Jacobs

    Abstract: Annotation of discourse relations is a known difficult task, especially for non-expert annotators. In this paper, we investigate novice annotators' uncertainty on the annotation of discourse relations on spoken conversational data. We find that dialogue context (single turn, pair of turns within speaker, and pair of turns across speakers) is a significant predictor of confidence scores. We compute…

    Submitted 14 August, 2023; originally announced August 2023.

  8. arXiv:2307.03645  [pdf, other]

    cs.CL

    The distribution of discourse relations within and across turns in spontaneous conversation

    Authors: S. Magalí López Cortez, Cassandra L. Jacobs

    Abstract: Time pressure and topic negotiation may impose constraints on how people leverage discourse relations (DRs) in spontaneous conversational contexts. In this work, we adapt a system of DRs for written language to spontaneous dialogue using crowdsourced annotations from novice annotators. We then test whether discourse relations are used differently across several types of multi-utterance contexts. W…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Proceedings of Computational Approaches to Discourse 2023, collocated with the 2023 meeting of the Association for Computational Linguistics, Toronto, Canada

  9. arXiv:2307.02083  [pdf, other]

    eess.AS cs.CL

    Leveraging multilingual transfer for unsupervised semantic acoustic word embeddings

    Authors: Christiaan Jacobs, Herman Kamper

    Abstract: Acoustic word embeddings (AWEs) are fixed-dimensional vector representations of speech segments that encode phonetic content so that different realisations of the same word have similar embeddings. In this paper we explore semantic AWE modelling. These AWEs should not only capture phonetics but also the meaning of a word (similar to textual word embeddings). We consider the scenario where we only…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Submitted to IEEE SPL

  10. arXiv:2306.00410  [pdf, other]

    cs.CL cs.SD eess.AS

    Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili

    Authors: Christiaan Jacobs, Nathanaël Carraz Rakotonirina, Everlyn Asiko Chimoto, Bruce A. Bassett, Herman Kamper

    Abstract: We consider hate speech detection through keyword spotting on radio broadcasts. One approach is to build an automatic speech recognition (ASR) system for the target low-resource language. We compare this to using acoustic word embedding (AWE) models that map speech segments to a space where matching words have similar vectors. We specifically use a multilingual AWE model trained on labelled data f…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to Interspeech 2023

  11. arXiv:2208.01561  [pdf, other]

    cs.CL

    Lost in Space Marking

    Authors: Cassandra L. Jacobs, Yuval Pinter

    Abstract: We look at a decision taken early in training a subword tokenizer, namely whether it should be the word-initial token that carries a special mark, or the word-final one. Based on surface-level considerations of efficiency and cohesion, as well as morphological coverage, we find that a Unigram LM tokenizer trained on pre-tokenized English text is better off marking the word-initial token, while one…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Submission to SIGMORPHON 2021

  12. arXiv:2201.04532  [pdf, other]

    cs.CV

    Structure and position-aware graph neural network for airway labeling

    Authors: Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Bram van Ginneken

    Abstract: We present a novel graph-based approach for labeling the anatomical branches of a given airway tree segmentation. The proposed method formulates airway labeling as a branch classification problem in the airway tree graph, where branch features are extracted using convolutional neural networks (CNN) and enriched using graph neural networks. Our graph neural network is structure-aware by having each…

    Submitted 12 January, 2022; originally announced January 2022.

  13. arXiv:2106.12834  [pdf, other]

    cs.CL cs.SD eess.AS

    Multilingual transfer of acoustic word embeddings improves when training on languages related to the target zero-resource language

    Authors: Christiaan Jacobs, Herman Kamper

    Abstract: Acoustic word embedding models map variable duration speech segments to fixed dimensional vectors, enabling efficient speech search and discovery. Previous work explored how embeddings can be obtained in zero-resource settings where no labelled data is available in the target language. The current best approach uses transfer learning: a single supervised multilingual model is trained using labelle…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Accepted to Interspeech 2021

  14. arXiv:2106.01351  [pdf, other]

    eess.IV cs.CV

    Deep Clustering Activation Maps for Emphysema Subtyping

    Authors: Weiyi Xie, Colin Jacobs, Bram van Ginneken

    Abstract: We propose a deep learning clustering method that exploits dense features from a segmentation network for emphysema subtyping from computed tomography (CT) scans. Using dense features enables high-resolution visualization of image regions corresponding to the cluster assignment via dense clustering activation maps (dCAMs). This approach provides model interpretability. We evaluated clustering resu…

    Submitted 1 June, 2021; originally announced June 2021.

  15. arXiv:2105.11748  [pdf, other]

    eess.IV cs.CV cs.LG

    Dense Regression Activation Maps For Lesion Segmentation in CT scans of COVID-19 patients

    Authors: Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Bram van Ginneken

    Abstract: Automatic lesion segmentation on thoracic CT enables rapid quantitative analysis of lung involvement in COVID-19 infections. However, obtaining a large amount of voxel-level annotations for training segmentation networks is prohibitively expensive. Therefore, we propose a weakly-supervised segmentation method based on dense regression activation maps (dRAMs). Most weakly-supervised segmentation ap…

    Submitted 18 November, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

  16. arXiv:2103.10731  [pdf, other]

    cs.CL eess.AS

    Acoustic word embeddings for zero-resource languages using self-supervised contrastive learning and multilingual adaptation

    Authors: Christiaan Jacobs, Yevgen Matusevych, Herman Kamper

    Abstract: Acoustic word embeddings (AWEs) are fixed-dimensional representations of variable-length speech segments. For zero-resource languages where labelled data is not available, one AWE approach is to use unsupervised autoencoder-based recurrent models. Another recent approach is to use multilingual transfer: a supervised AWE model is trained on several well-resourced languages and then applied to an un…

    Submitted 19 March, 2021; originally announced March 2021.

    Comments: Accepted to SLT 2021

  17. arXiv:2009.09725  [pdf, other]

    eess.IV cs.CV

    Improving Automated COVID-19 Grading with Convolutional Neural Networks in Computed Tomography Scans: An Ablation Study

    Authors: Coen de Vente, Luuk H. Boulogne, Kiran Vaidhya Venkadesh, Cheryl Sital, Nikolas Lessmann, Colin Jacobs, Clara I. Sánchez, Bram van Ginneken

    Abstract: Amidst the ongoing pandemic, several studies have shown that COVID-19 classification and grading using computed tomography (CT) images can be automated with convolutional neural networks (CNNs). Many of these studies focused on reporting initial results of algorithms that were assembled from commonly used components. The choice of these components was often pragmatic rather than systematic. For in…

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: 9 pages, 6 figures

  18. arXiv:2009.09123  [pdf, other]

    cs.CL cs.AI

    Will it Unblend?

    Authors: Yuval Pinter, Cassandra L. Jacobs, Jacob Eisenstein

    Abstract: Natural language processing systems often struggle with out-of-vocabulary (OOV) terms, which do not appear in training data. Blends, such as "innoventor", are one particularly challenging class of OOV, as they are formed by fusing together two or more bases that relate to the intended meaning in unpredictable manners and degrees. In this work, we run experiments on a novel dataset of English OOV b…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Findings of EMNLP 2020

  19. Relational Modeling for Robust and Efficient Pulmonary Lobe Segmentation in CT Scans

    Authors: Weiyi Xie, Colin Jacobs, Jean-Paul Charbonnier, Bram van Ginneken

    Abstract: Pulmonary lobe segmentation in computed tomography scans is essential for regional assessment of pulmonary diseases. Recent works based on convolutional neural networks have achieved good performance for this task. However, they are still limited in capturing structured relationships due to the nature of convolution. The shapes of the pulmonary lobes affect each other and their borders relate to the…

    Submitted 12 May, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

  20. arXiv:2003.03444  [pdf, ps, other]

    cs.CL

    NYTWIT: A Dataset of Novel Words in the New York Times

    Authors: Yuval Pinter, Cassandra L. Jacobs, Max Bittker

    Abstract: We present the New York Times Word Innovation Types dataset, or NYTWIT, a collection of over 2,500 novel English words published in the New York Times between November 2017 and March 2019, manually annotated for their class of novelty (such as lexical derivation, dialectal variation, blending, or compounding). We present baseline results for both uncontextual and contextual prediction of novelty c…

    Submitted 23 October, 2020; v1 submitted 6 March, 2020; originally announced March 2020.

    Comments: COLING 2020

  21. arXiv:2001.09078  [pdf, other]

    cs.DB

    Adaptive Low-level Storage of Very Large Knowledge Graphs

    Authors: Jacopo Urbani, Ceriel Jacobs

    Abstract: The increasing availability and usage of Knowledge Graphs (KGs) on the Web calls for scalable and general-purpose solutions to store this type of data structures. We propose Trident, a novel storage architecture for very large KGs on centralized systems. Trident uses several interlinked data structures to provide fast access to nodes and edges, with the physical storage changing depending on the t…

    Submitted 24 January, 2020; originally announced January 2020.

    Comments: Accepted WWW 2020

  22. The Liver Tumor Segmentation Benchmark (LiTS)

    Authors: Patrick Bilic, Patrick Christ, Hongwei Bran Li, Eugene Vorontsov, Avi Ben-Cohen, Georgios Kaissis, Adi Szeskin, Colin Jacobs, Gabriel Efrain Humpire Mamani, Gabriel Chartrand, Fabian Lohöfer, Julian Walter Holch, Wieland Sommer, Felix Hofmann, Alexandre Hostettler, Naama Lev-Cohain, Michal Drozdzal, Michal Marianne Amitai, Refael Vivantik, Jacob Sosna, Ivan Ezhov, Anjany Sekuboyina, Fernando Navarro, Florian Kofler, Johannes C. Paetzold, et al. (84 additional authors not shown)

    Abstract: In this work, we report the set-up and results of the Liver Tumor Segmentation Benchmark (LiTS), which was organized in conjunction with the IEEE International Symposium on Biomedical Imaging (ISBI) 2017 and the International Conferences on Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2017 and 2018. The image dataset is diverse and contains primary and secondary tumors with…

    Submitted 25 November, 2022; v1 submitted 13 January, 2019; originally announced January 2019.

    Comments: Patrick Bilic, Patrick Christ, Hongwei Bran Li, and Eugene Vorontsov made equal contributions to this work. Published in Medical Image Analysis

    Journal ref: Medical Image Analysis (2022) Pg. 102680

  23. arXiv:1811.12789  [pdf, other]

    cs.CV

    iW-Net: an automatic and minimalistic interactive lung nodule segmentation deep network

    Authors: Guilherme Aresta, Colin Jacobs, Teresa Araújo, António Cunha, Isabel Ramos, Bram van Ginneken, Aurélio Campilho

    Abstract: We propose iW-Net, a deep learning model that allows for both automatic and interactive segmentation of lung nodules in computed tomography images. iW-Net is composed of two blocks: the first one provides an automatic segmentation and the second one allows to correct it by analyzing 2 points introduced by the user in the nodule's boundary. For this purpose, a physics inspired weight map that takes…

    Submitted 30 November, 2018; originally announced November 2018.

    Comments: Pre-print submitted to IEEE Transactions on Biomedical Engineering

  24. arXiv:1709.09713  [pdf, other]

    cs.MS cs.PF physics.comp-ph physics.flu-dyn

    Energy efficiency of finite difference algorithms on multicore CPUs, GPUs, and Intel Xeon Phi processors

    Authors: Satya P. Jammy, Christian T. Jacobs, David J. Lusher, Neil D. Sandham

    Abstract: In addition to hardware wall-time restrictions commonly seen in high-performance computing systems, it is likely that future systems will also be constrained by energy budgets. In the present work, finite difference algorithms of varying computational and memory intensity are evaluated with respect to both energy efficiency and runtime on an Intel Ivy Bridge CPU node, an Intel Xeon Phi Knights Lan…

    Submitted 27 September, 2017; originally announced September 2017.

    Comments: Submitted to Computers and Fluids

  25. arXiv:1708.03183  [pdf, other]

    cs.CE physics.geo-ph

    Automated Tiling of Unstructured Mesh Computations with Application to Seismological Modelling

    Authors: Fabio Luporini, Michael Lange, Christian T. Jacobs, Gerard J. Gorman, J. Ramanujam, Paul H. J. Kelly

    Abstract: Sparse tiling is a technique to fuse loops that access common data, thus increasing data locality. Unlike traditional loop fusion or blocking, the loops may have different iteration spaces and access shared datasets through indirect memory accesses, such as A[map[i]] -- hence the name "sparse". One notable example of such loops arises in discontinuous-Galerkin finite element methods, because of th…

    Submitted 19 June, 2019; v1 submitted 10 August, 2017; originally announced August 2017.

    Comments: 29 pages including supplementary materials and references

    ACM Class: D.1.2; G.4

  26. arXiv:1704.08368  [pdf, ps, other]

    physics.flu-dyn cs.CE physics.comp-ph

    Surface-sampled simulations of turbulent flow at high Reynolds number

    Authors: Neil D. Sandham, Roderick Johnstone, Christian T. Jacobs

    Abstract: A new approach to turbulence simulation, based on a combination of large-eddy simulation (LES) for the whole flow and an array of non-space-filling quasi-direct numerical simulations (QDNS), which sample the response of near-wall turbulence to large-scale forcing, is proposed and evaluated. The technique overcomes some of the cost limitations of turbulence simulation, since the main flow is treate…

    Submitted 26 April, 2017; originally announced April 2017.

    Comments: Author accepted version. Accepted for publication in the International Journal for Numerical Methods in Fluids on 26 April 2017

    Journal ref: International Journal for Numerical Methods in Fluids 85(9):525-537, 2017

  27. Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: the LUNA16 challenge

    Authors: Arnaud Arindra Adiyoso Setio, Alberto Traverso, Thomas de Bel, Moira S. N. Berens, Cas van den Bogaard, Piergiorgio Cerello, Hao Chen, Qi Dou, Maria Evelina Fantacci, Bram Geurts, Robbert van der Gugten, Pheng Ann Heng, Bart Jansen, Michael M. J. de Kaste, Valentin Kotov, Jack Yu-Hung Lin, Jeroen T. M. C. Manders, Alexander Sónora-Mengana, Juan Carlos García-Naranjo, Evgenia Papavasileiou, Mathias Prokop, Marco Saletta, Cornelia M Schaefer-Prokop, Ernst T. Scholten, Luuk Scholten, et al. (7 additional authors not shown)

    Abstract: Automatic detection of pulmonary nodules in thoracic computed tomography (CT) scans has been an active area of research for the last two decades. However, there have only been few studies that provide a comparative performance evaluation of different systems on a common database. We have therefore set up the LUNA16 challenge, an objective evaluation framework for automatic nodule detection algorit…

    Submitted 15 July, 2017; v1 submitted 23 December, 2016; originally announced December 2016.

  28. Towards automatic pulmonary nodule management in lung cancer screening with deep learning

    Authors: Francesco Ciompi, Kaman Chung, Sarah J. van Riel, Arnaud Arindra Adiyoso Setio, Paul K. Gerke, Colin Jacobs, Ernst Th. Scholten, Cornelia Schaefer-Prokop, Mathilde M. W. Wille, Alfonso Marchiano, Ugo Pastorino, Mathias Prokop, Bram van Ginneken

    Abstract: The introduction of lung cancer screening programs will produce an unprecedented amount of chest CT scans in the near future, which radiologists will have to read in order to decide on a patient follow-up strategy. According to the current guidelines, the workup of screen-detected nodules strongly relies on nodule size and nodule type. In this paper, we present a deep learning system based on mult…

    Submitted 23 May, 2017; v1 submitted 28 October, 2016; originally announced October 2016.

    Comments: Published on Scientific Reports

    Journal ref: Sci. Rep. 7, 46479; (2017)

  29. arXiv:1610.09146  [pdf, other]

    cs.DS cs.DC cs.MS physics.comp-ph physics.flu-dyn

    Performance evaluation of explicit finite difference algorithms with varying amounts of computational and memory intensity

    Authors: Satya P. Jammy, Christian T. Jacobs, Neil D. Sandham

    Abstract: Future architectures designed to deliver exascale performance motivate the need for novel algorithmic changes in order to fully exploit their capabilities. In this paper, the performance of several numerical algorithms, characterised by varying degrees of memory and computational intensity, are evaluated in the context of finite difference methods for fluid dynamics problems. It is shown that, by…

    Submitted 28 October, 2016; originally announced October 2016.

    Comments: Author accepted version. Accepted for publication in Journal of Computational Science on 27 October 2016

  30. arXiv:1609.01277  [pdf, other]

    cs.MS cs.SC cs.SE physics.comp-ph

    OpenSBLI: A framework for the automated derivation and parallel execution of finite difference solvers on a range of computer architectures

    Authors: Christian T. Jacobs, Satya P. Jammy, Neil D. Sandham

    Abstract: Exascale computing will feature novel and potentially disruptive hardware architectures. Exploiting these to their full potential is non-trivial. Numerical modelling frameworks involving finite difference methods are currently limited by the 'static' nature of the hand-coded discretisation schemes and repeatedly may have to be re-written to run efficiently on new hardware. In contrast, OpenSBLI us…

    Submitted 14 November, 2016; v1 submitted 5 September, 2016; originally announced September 2016.

    Comments: Author accepted version, with a small amendment: the link in the "Code Availability" section has been updated, and now refers to the OpenSBLI source code repository on GitHub. Accepted for publication in the Journal of Computational Science on 8 November 2016

    Journal ref: Journal of Computational Science 18 (2017) 12-23

  31. arXiv:1606.05741   

    cs.DL

    Connecting web-based mapping services with scientific data repositories: collaborative curation and retrieval of simulation data via a geospatial interface

    Authors: Christian T. Jacobs, Alexandros Avdis

    Abstract: Increasing quantities of scientific data are becoming readily accessible via online repositories such as those provided by Figshare and Zenodo. Geoscientific simulations in particular generate large quantities of data, with several research groups studying many, often overlapping areas of the world. When studying a particular area, being able to keep track of one's own simulations as well as those…

    Submitted 21 September, 2016; v1 submitted 18 June, 2016; originally announced June 2016.

    Comments: Submission withdrawn from the International Journal of Digital Curation on 9 September 2016 in order to prepare a joint paper with additional colleagues

  32. arXiv:1601.08091  [pdf, other]

    physics.flu-dyn cs.CE math.OC

    On the validity of tidal turbine array configurations obtained from steady-state adjoint optimisation

    Authors: Christian T. Jacobs, Matthew D. Piggott, Stephan C. Kramer, Simon W. Funke

    Abstract: Extracting the optimal amount of power from an array of tidal turbines requires an intricate understanding of tidal dynamics and the effects of turbine placement on the local and regional scale flow. Numerical models have contributed significantly towards this understanding, and more recently, adjoint-based modelling has been employed to optimise the positioning of the turbines in an array in an a…

    Submitted 29 January, 2016; originally announced January 2016.

    Comments: Conference paper comprising 15 pages and 13 figures. Submitted to the Proceedings of the ECCOMAS Congress 2016 (VII European Congress on Computational Methods in Applied Sciences and Engineering), held in Crete, Greece on 5-10 June 2016

  33. arXiv:1511.08915  [pdf, other]

    cs.DB cs.AI

    Column-Oriented Datalog Materialization for Large Knowledge Graphs (Extended Technical Report)

    Authors: Jacopo Urbani, Ceriel Jacobs, Markus Krötzsch

    Abstract: The evaluation of Datalog rules over large Knowledge Graphs (KGs) is essential for many applications. In this paper, we present a new method of materializing Datalog inferences, which combines a column-based memory layout with novel optimization methods that avoid redundant inferences at runtime. The pro-active caching of certain subqueries further increases efficiency. Our empirical evaluation sh…

    Submitted 11 February, 2016; v1 submitted 28 November, 2015; originally announced November 2015.

    ACM Class: I.2.3; H.2.4

  34. arXiv:1510.01560  [pdf, other]

    cs.CG physics.ao-ph physics.comp-ph physics.flu-dyn

    Shoreline and Bathymetry Approximation in Mesh Generation for Tidal Renewable Simulations

    Authors: Alexandros Avdis, Christian T. Jacobs, Jon Hill, Matthew D. Piggott, Gerard J. Gorman

    Abstract: Due to the fractal nature of the domain geometry in geophysical flow simulations, a completely accurate description of the domain in terms of a computational mesh is frequently deemed infeasible. Shoreline and bathymetry simplification methods are used to remove small scale details in the geometry, particularly in areas away from the region of interest. To that end, a novel method for shoreline an…

    Submitted 6 October, 2015; originally announced October 2015.

    Comments: Pre-print of conference publication accepted in the Proceedings of 11th European Wave & Tidal Energy Conference (EWTEC 2015, http://www.ewtec.org/ewtec2015/ ). This paper was presented at the EWTEC 2015 conference on Tuesday 8 September 2015 in Nantes, France. Number of pages: 7. Number of figures: 6

  35. arXiv:1509.04729  [pdf, other]

    cs.DL cs.CE

    Integrating Research Data Management into Geographical Information Systems

    Authors: Christian T. Jacobs, Alexandros Avdis, Simon L. Mouradian, Matthew D. Piggott

    Abstract: Ocean modelling requires the production of high-fidelity computational meshes upon which to solve the equations of motion. The production of such meshes by hand is often infeasible, considering the complexity of the bathymetry and coastlines. The use of Geographical Information Systems (GIS) is therefore a key component to discretising the region of interest and producing a mesh appropriate to res…

    Submitted 15 September, 2015; originally announced September 2015.

    Comments: Accepted, camera-ready version. To appear in the Proceedings of the 5th International Workshop on Semantic Digital Archives (http://sda2015.dke-research.de/), held in Poznań, Poland on 18 September 2015 as part of the 19th International Conference on Theory and Practice of Digital Libraries (http://tpdl2015.info/)

  36. Experiences with efficient methodologies for teaching computer programming to geoscientists

    Authors: Christian T. Jacobs, Gerard J. Gorman, Huw E. Rees, Lorraine Craig

    Abstract: Computer programming was once thought of as a skill required only by professional software developers. But today, given the ubiquitous nature of computation and data science it is quickly becoming necessary for all scientists and engineers to have at least a basic knowledge of how to program. Teaching how to program, particularly to those students with little or no computing background, is well-kn…

    Submitted 9 June, 2016; v1 submitted 20 May, 2015; originally announced May 2015.

    Comments: Second revised version. This version was accepted for publication in the Journal of Geoscience Education on 9 June 2016. Contains 5 figures. The main change is the inclusion of a new section on outlook and future work

    Journal ref: Journal of Geoscience Education 64(3):183-198, 2016

  37. arXiv:1405.7290  [pdf]

    cs.CE cs.DL cs.MS

    PyRDM: A Python-based library for automating the management and online publication of scientific software and data

    Authors: Christian T. Jacobs, Alexandros Avdis, Gerard J. Gorman, Matthew D. Piggott

    Abstract: The recomputability and reproducibility of results from scientific software requires access to both the source code and all associated input and output data. However, the full collection of these resources often does not accompany the key findings published in journal articles, thereby making it difficult or impossible for the wider scientific community to verify the correctness of a result or to…

    Submitted 21 August, 2014; v1 submitted 28 May, 2014; originally announced May 2014.

    Comments: Revised version. The main changes are: Added pdfLaTeX to the dependencies list; Improved Figure 1 to show the 'publish' option selected in Diamond; Added two paragraphs to explain why users would want to use PyRDM; Added some content on the PyRDM roadmap, and also some content regarding engagement with libraries and research software engineers

    Journal ref: Journal of Open Research Software 2:e28 (2014) 1-6

  38. arXiv:cs/0306110  [pdf]

    cs.DC

    Run Control and Monitor System for the CMS Experiment

    Authors: M. Bellato, L. Berti, V. Brigljevic, G. Bruno, E. Cano, S. Cittolin, A. Csilling, S. Erhan, D. Gigi, F. Glege, R. Gomez-Reino, M. Gulmini, J. Gutleber, C. Jacobs, M. Kozlovszky, H. Larsen, I. Magrans, G. Maron, F. Meijers, E. Meschi, S. Murray, A. Oh, L. Orsini, L. Pollet, A. Racz, et al. (8 additional authors not shown)

    Abstract: The Run Control and Monitor System (RCMS) of the CMS experiment is the set of hardware and software components responsible for controlling and monitoring the experiment during data-taking. It provides users with a "virtual counting room", enabling them to operate the experiment and to monitor detector status and data quality from any point in the world. This paper describes the architecture of t…

    Submitted 18 June, 2003; originally announced June 2003.

    Comments: Talk from the 2003 Computing in High Energy and Nuclear Physics (CHEP03), La Jolla, Ca, USA, March 2003, 8 pages, PSN THGT002

    ACM Class: C.2.4; J.2