Skip to main content

Showing 1–50 of 56 results for author: Ward, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13012  [pdf, other

    cs.LG cs.CR stat.ML

    Data Plagiarism Index: Characterizing the Privacy Risk of Data-Copying in Tabular Generative Models

    Authors: Joshua Ward, Chi-Hua Wang, Guang Cheng

    Abstract: The promise of tabular generative models is to produce realistic synthetic data that can be shared and safely used without dangerous leakage of information from the training set. In evaluating these models, a variety of methods have been proposed to measure the tendency to copy data from the training dataset when generating a sample. However, these methods suffer from either not considering data-c… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2404.17144  [pdf

    cs.LG eess.SP

    Sensor Response-Time Reduction using Long-Short Term Memory Network Forecasting

    Authors: Simon J. Ward, Muhamed Baljevic, Sharon M. Weiss

    Abstract: The response time of a biosensor is a crucial metric in safety-critical applications such as medical diagnostics where an earlier diagnosis can markedly improve patient outcomes. However, the speed at which a biosensor reaches a final equilibrium state can be limited by poor mass transport and long molecular diffusion times that increase the time it takes target molecules to reach the active sensi… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 9 pages, 3 figures

  3. arXiv:2403.09057  [pdf, other

    cs.CL cs.AI

    A Continued Pretrained LLM Approach for Automatic Medical Note Generation

    Authors: Dong Yuan, Eti Rastogi, Gautam Naik, Sree Prasanna Rajagopal, Sagar Goyal, Fen Zhao, Bharath Chintagunta, Jeff Ward

    Abstract: LLMs are revolutionizing NLP tasks. However, the use of the most advanced LLMs, such as GPT-4, is often prohibitively expensive for most specialized fields. We introduce HEAL, the first continuously trained 13B LLaMA2-based LLM that is purpose-built for medical conversations and measured on automated scribing. Our results demonstrate that HEAL outperforms GPT-4 and PMC-LLaMA in PubMedQA, with an a… ▽ More

    Submitted 3 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024

  4. arXiv:2403.07780  [pdf, other

    stat.ML cs.LG

    FairRR: Pre-Processing for Group Fairness through Randomized Response

    Authors: Xianli Zeng, Joshua Ward, Guang Cheng

    Abstract: The increasing usage of machine learning models in consequential decision-making processes has spurred research into the fairness of these systems. While significant work has been done to study group fairness in the in-processing and post-processing setting, there has been little that theoretically connects these results to the pre-processing domain. This paper proposes that achieving group fairne… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  5. arXiv:2312.09232  [pdf, other

    cs.CV

    DVQI: A Multi-task, Hardware-integrated Artificial Intelligence System for Automated Visual Inspection in Electronics Manufacturing

    Authors: Audrey Chung, Francis Li, Jeremy Ward, Andrew Hryniowski, Alexander Wong

    Abstract: As electronics manufacturers continue to face pressure to increase production efficiency amid difficulties with supply chains and labour shortages, many printed circuit board assembly (PCBA) manufacturers have begun to invest in automation and technological innovations to remain competitive. One such method is to leverage artificial intelligence (AI) to greatly augment existing manufacturing proce… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 8 pages

  6. arXiv:2308.07467  [pdf, ps, other

    cs.IT eess.SP math.CA

    Sequences with identical autocorrelation spectra

    Authors: Daniel J. Katz, Adeebur Rahman, Michael J Ward

    Abstract: Aperiodic autocorrelation measures the similarity between a finite-length sequence of complex numbers and translates of itself. Autocorrelation is important in communications, remote sensing, and scientific instrumentation. The autocorrelation function reports the aperiodic autocorrelation at every possible translation. Knowing the autocorrelation function of a sequence is equivalent to knowing th… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: 12 pages

    MSC Class: 94A12 42A05 42A38 42A85

  7. arXiv:2303.08774  [pdf, other

    cs.CL cs.AI

    GPT-4 Technical Report

    Authors: OpenAI, Josh Achiam, Steven Adler, Sandhini Agarwal, Lama Ahmad, Ilge Akkaya, Florencia Leoni Aleman, Diogo Almeida, Janko Altenschmidt, Sam Altman, Shyamal Anadkat, Red Avila, Igor Babuschkin, Suchir Balaji, Valerie Balcom, Paul Baltescu, Haiming Bao, Mohammad Bavarian, Jeff Belgum, Irwan Bello, Jake Berdine, Gabriel Bernadett-Shapiro, Christopher Berner, Lenny Bogdonoff, Oleg Boiko , et al. (256 additional authors not shown)

    Abstract: We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based mo… ▽ More

    Submitted 4 March, 2024; v1 submitted 15 March, 2023; originally announced March 2023.

    Comments: 100 pages; updated authors list; fixed author names and added citation

  8. arXiv:2301.07537  [pdf, ps, other

    cs.DS

    An Improved Approximation for Maximum Weighted $k$-Set Packing

    Authors: Theophile Thiery, Justin Ward

    Abstract: We consider the weighted $k$-set packing problem, in which we are given a collection of weighted sets, each with at most $k$ elements and must return a collection of pairwise disjoint sets with maximum total weight. For $k = 3$, this problem generalizes the classical 3-dimensional matching problem listed as one of the Karp's original 21 NP-complete problems. We give an algorithm attaining an appro… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in SODA'23. Comments welcome. 26 pages

  9. arXiv:2208.08562  [pdf, other

    cs.CV cs.AI stat.ML

    Restructurable Activation Networks

    Authors: Kartikeya Bhardwaj, James Ward, Caleb Tung, Dibakar Gope, Lingchuan Meng, Igor Fedorov, Alex Chalfin, Paul Whatmough, Danny Loh

    Abstract: Is it possible to restructure the non-linear activation functions in a deep network to create hardware-efficient models? To address this question, we propose a new paradigm called Restructurable Activation Networks (RANs) that manipulate the amount of non-linearity in models to improve their hardware-awareness and efficiency. First, we propose RAN-explicit (RAN-e) -- a new hardware-aware search sp… ▽ More

    Submitted 7 September, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

    Comments: This work was presented at an Arm AI virtual tech talk. Video is available at https://www.youtube.com/watch?v=EUqFNE28Kq4

  10. arXiv:2206.05802  [pdf, other

    cs.CL cs.LG

    Self-critiquing models for assisting human evaluators

    Authors: William Saunders, Catherine Yeh, Jeff Wu, Steven Bills, Long Ouyang, Jonathan Ward, Jan Leike

    Abstract: We fine-tune large language models to write natural language critiques (natural language critical comments) using behavioral cloning. On a topic-based summarization task, critiques written by our models help humans find flaws in summaries that they would have otherwise missed. Our models help find naturally occurring flaws in both model and human written summaries, and intentional flaws in summari… ▽ More

    Submitted 13 June, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

  11. arXiv:2205.07609  [pdf, other

    physics.data-an cs.LG hep-ex physics.ins-det

    Reduction of detection limit and quantification uncertainty due to interferent by neural classification with abstention

    Authors: Alex Hagen, Ken Jarman, Jesse Ward, Greg Eiden, Charles Barinaga, Emily Mace, Craig Aalseth, Anthony Carado

    Abstract: Many measurements in the physical sciences can be cast as counting experiments, where the number of occurrences of a physical phenomenon informs the prevalence of the phenomenon's source. Often, detection of the physical phenomenon (termed signal) is difficult to distinguish from naturally occurring phenomena (termed background). In this case, the discrimination of signal events from background ca… ▽ More

    Submitted 22 April, 2022; originally announced May 2022.

    Comments: Preprint submitted to Nuclear Instruments and Methods in Physics Research,\ A 12 pages, 10 figures

  12. arXiv:2202.12448  [pdf

    cs.CL

    Deep neural networks for fine-grained surveillance of overdose mortality

    Authors: Patrick J. Ward, April M. Young, Svetla Slavova, Madison Liford, Lara Daniels, Ripley Lucas, Ramakanth Kavuluru

    Abstract: Surveillance of drug overdose deaths relies on death certificates for identification of the substances that caused death. Drugs and drug classes can be identified through the International Classification of Diseases, 10th Revision (ICD-10) codes present on death certificates. However, ICD-10 codes do not always provide high levels of specificity in drug identification. To achieve more fine-grained… ▽ More

    Submitted 6 June, 2022; v1 submitted 24 February, 2022; originally announced February 2022.

    Comments: Accepted to appear in the American Journal of Epidemiology

  13. arXiv:2202.10952  [pdf, other

    physics.soc-ph cs.CY cs.SI

    Assessing the influence of French vaccine critics during the two first years of the COVID-19 pandemic

    Authors: Mauro Faccin, Floriana Gargiulo, Laëtitia Atlani-Duault, Jeremy K. Ward

    Abstract: When the threat of COVID-19 became widely acknowledged, many hoped that this epidemic would squash "the anti-vaccine movement". However, when vaccines started arriving in rich countries at the end of 2020, it appeared that vaccine hesitancy might be an issue even in the context of this major epidemic. Does it mean that the mobilization of vaccine-critical activists on social media is one of the ma… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 18 pages, 7 figures and 2 tables

    Journal ref: PLOS ONE, 17(8) p.1-19 (2022)

  14. arXiv:2201.11671  [pdf

    physics.med-ph cond-mat.mtrl-sci cs.LG physics.bio-ph physics.ins-det

    Capture Agent Free Biosensing using Porous Silicon Arrays and Machine Learning

    Authors: Simon J. Ward, Tengfei Cao, Xiang Zhou, Catie Chang, Sharon M. Weiss

    Abstract: Biosensors are an essential tool for medical diagnostics, environmental monitoring and food safety. Typically, biosensors are designed to detect specific analytes through functionalization with the appropriate capture agents. However, the use of capture agents limits the number of analytes that can be simultaneously detected and reduces the robustness of the biosensor. In this work, we report a ve… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: 15 pages, 3 figures, 2 tables

    Journal ref: Biosensors 13 (2023) 1-12

  15. arXiv:2201.06965  [pdf

    cs.CY

    Reinforcement of vaccine mandates and public attitudes towards vaccines: What can we learn from google search activity ?

    Authors: Florian Cafiero, Jeremy Ward

    Abstract: International public health policies increasingly favor mandatory immunization. If its short-term effects on vaccine coverage are well documented, there has been little consideration to its effects on public attitudes towards vaccines. In this paper, we examine Google searches related to vaccines in five countries (Australia, France, Germany, Italy, Serbia) and two American states (California) whi… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

  16. arXiv:2112.14340  [pdf, other

    eess.IV cs.CV cs.LG

    Super-Efficient Super Resolution for Fast Adversarial Defense at the Edge

    Authors: Kartikeya Bhardwaj, Dibakar Gope, James Ward, Paul Whatmough, Danny Loh

    Abstract: Autonomous systems are highly vulnerable to a variety of adversarial attacks on Deep Neural Networks (DNNs). Training-free model-agnostic defenses have recently gained popularity due to their speed, ease of deployment, and ability to work across many DNNs. To this end, a new technique has emerged for mitigating attacks on image classification DNNs, namely, preprocessing adversarial images using su… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

    Comments: This preprint is for personal use only. The official article will appear in proceedings of Design, Automation & Test in Europe (DATE), 2022, as part of the Special Initiative on Autonomous Systems Design (ASD)

  17. arXiv:2104.09113  [pdf

    cs.CL

    No comments: Addressing commentary sections in websites' analyses

    Authors: Florian Cafiero, Paul Guille-Escuret, Jeremy Ward

    Abstract: Removing or extracting the commentary sections from a series of websites is a tedious task, as no standard way to code them is widely adopted. This operation is thus very rarely performed. In this paper, we show that these commentary sections can induce significant biases in the analyses, especially in the case of controversial Highlights $\bullet$ Commentary sections can induce biases in the anal… ▽ More

    Submitted 19 April, 2021; originally announced April 2021.

    Comments: 6th International Conference on Computational Social Science, Massachusetts Institute of Technology (MIT), Jul 2020, Cambridge, MA, United States

  18. arXiv:2102.09679  [pdf, ps, other

    cs.DS

    Improved Multi-Pass Streaming Algorithms for Submodular Maximization with Matroid Constraints

    Authors: Chien-Chung Huang, Theophile Thiery, Justin Ward

    Abstract: We give improved multi-pass streaming algorithms for the problem of maximizing a monotone or arbitrary non-negative submodular function subject to a general $p$-matchoid constraint in the model in which elements of the ground set arrive one at a time in a stream. The family of constraints we consider generalizes both the intersection of $p$ arbitrary matroid constraints and $p$-uniform hypergraph… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: Accepted at APPROX 2020, 25 pages

  19. arXiv:2102.09644  [pdf, ps, other

    cs.DS

    Two-Sided Weak Submodularity for Matroid Constrained Optimization and Regression

    Authors: Theophile Thiery, Justin Ward

    Abstract: We study the following problem: Given a variable of interest, we would like to find a best linear predictor for it by choosing a subset of $k$ relevant variables obeying a matroid constraint. This problem is a natural generalization of subset selection problems where it is necessary to spread observations amongst multiple different classes. We derive new, strengthened guarantees for this problem b… ▽ More

    Submitted 18 January, 2023; v1 submitted 18 February, 2021; originally announced February 2021.

    Comments: Appeared in COLT'22. 29 pages, 1 figure. Comments welcome. The earlier version of this paper contains a local-search algorithm's analysis which we substitute with the analysis of the residual random greedy algorithm

  20. arXiv:2011.06268  [pdf, ps, other

    cs.DS

    FPT-Algorithms for the \ell-Matchoid Problem with a Coverage Objective

    Authors: Chien-Chung Huang, Justin Ward

    Abstract: We consider the problem of optimizing a coverage function under a $\ell$-matchoid of rank $k$. We design fixed-parameter algorithms as well as streaming algorithms to compute an exact solution. Unlike previous work that presumes linear representativity of matroids, we consider the general oracle model. For the special case where the coverage function is linear, we give a deterministic fixed-parame… ▽ More

    Submitted 13 December, 2022; v1 submitted 12 November, 2020; originally announced November 2020.

  21. Transaction Pricing for Maximizing Throughput in a Sharded Blockchain Ledger

    Authors: James R. Riehl, Jonathan Ward

    Abstract: In this paper, we present a pricing mechanism that aligns incentives of agents who exchange resources on a decentralized ledger with the goal of maximizing transaction throughput. Subdividing a blockchain ledger into shards promises to greatly increase transaction throughput with minimal loss of security. However, the organization and type of the transactions also affects the ledger's efficiency,… ▽ More

    Submitted 1 September, 2020; originally announced September 2020.

    Journal ref: 2020 Crypto Valley Conference on Blockchain Technology (CVCBT), Rotkreuz, Switzerland, 2020, pp. 36-42

  22. arXiv:2006.05390  [pdf, ps, other

    cs.CR cs.DC cs.MA

    Democratising blockchain: A minimal agency consensus model

    Authors: Marcin Abram, David Galindo, Daniel Honerkamp, Jonathan Ward, **-Mann Wong

    Abstract: We propose a novel consensus protocol based on a hybrid approach, that combines a directed acyclic graph (DAG) and a classical chain of blocks. This architecture allows us to enforce collective block construction, minimising the monopolistic power of the round-leader. In this way, we decrease the possibility for collusion among senders and miners, as well as miners themselves, allowing the use of… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  23. arXiv:2003.01871  [pdf, other

    cs.RO cs.CV

    Semantic sensor fusion: from camera to sparse lidar information

    Authors: Julie Stephany Berrio, Mao Shan, Stewart Worrall, James Ward, Eduardo Nebot

    Abstract: To navigate through urban roads, an automated vehicle must be able to perceive and recognize objects in a three-dimensional environment. A high-level contextual understanding of the surroundings is necessary to plan and execute accurate driving maneuvers. This paper presents an approach to fuse different sensory information, Light Detection and Ranging (lidar) scans and camera images. The output o… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 8 pages, this paper was submitted to ITSC 2020

    MSC Class: 00-02 ACM Class: I.4

  24. arXiv:1912.13199  [pdf

    cs.CV cs.LG eess.IV

    Comparison of object detection methods for crop damage assessment using deep learning

    Authors: Ali HamidiSepehr, Seyed Vahid Mirnezami, Jason K. Ward

    Abstract: Severe weather events can cause large financial losses to farmers. Detailed information on the location and severity of damage will assist farmers, insurance companies, and disaster response agencies in making wise post-damage decisions. The goal of this study was a proof-of-concept to detect damaged crop areas from aerial imagery using computer vision and deep learning techniques. A specific obje… ▽ More

    Submitted 21 April, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

  25. Simulating Crowds in Real Time with Agent-Based Modelling and a Particle Filter

    Authors: Nick Malleson, Kevin Minors, Le-Minh Kieu, Jonathan A. Ward, Andrew A. West, Alison Heppenstall

    Abstract: Agent-based modelling is a valuable approach for systems whose behaviour is driven by the interactions between distinct entities. They have shown particular promise as a means of modelling crowds of people in streets, public transport terminals, stadiums, etc. However, the methodology faces a fundamental difficulty: there are no established mechanisms for dynamically incorporating real-time data i… ▽ More

    Submitted 20 September, 2019; originally announced September 2019.

  26. arXiv:1909.08311  [pdf, other

    physics.soc-ph cs.SI

    Asymmetric participation of defenders and critics of vaccines to debates on French-speaking Twitter

    Authors: Floriana Gargiulo, Florian Cafiero, Paul Guille-Escuret, Valerie Seror, Jeremy Ward

    Abstract: For more than a decade, doubt about vaccines has become an increasingly important global issue. Polarization of opinions on this matter, especially through social media, has been repeatedly observed, but details about the balance of forces are left unclear. In this paper, we analyse the flow of information on vaccines on the French-speaking realm of Twitter between 2016 and 2017. Two major asymmet… ▽ More

    Submitted 4 May, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Journal ref: Sci Rep 10, 6599 (2020). https://doi.org/10.1038/s41598-020-62880-5

  27. arXiv:1904.09354  [pdf, other

    cs.DS cs.LG math.OC

    Submodular Maximization Beyond Non-negativity: Guarantees, Fast Algorithms, and Applications

    Authors: Christopher Harshaw, Moran Feldman, Justin Ward, Amin Karbasi

    Abstract: It is generally believed that submodular functions -- and the more general class of $γ$-weakly submodular functions -- may only be optimized under the non-negativity assumption $f(S) \geq 0$. In this paper, we show that once the function is expressed as the difference $f = g - c$, where $g$ is monotone, non-negative, and $γ$-weakly submodular and $c$ is non-negative modular, then strong approximat… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: submitted to ICML 2019

  28. arXiv:1902.02851  [pdf, other

    cs.RO

    Towards Provably Not-at-Fault Control of Autonomous Robots in Arbitrary Dynamic Environments

    Authors: Sean Vaskov, Shreyas Kousik, Hannah Larson, Fan Bu, James Ward, Stewart Worrall, Matthew Johnson-Roberson, Ram Vasudevan

    Abstract: As autonomous robots increasingly become part of daily life, they will often encounter dynamic environments while only having limited information about their surroundings. Unfortunately, due to the possible presence of malicious dynamic actors, it is infeasible to develop an algorithm that can guarantee collision-free operation. Instead, one can attempt to design a control technique that guarantee… ▽ More

    Submitted 7 February, 2019; originally announced February 2019.

    Comments: 10 pages, 3 figures

  29. arXiv:1809.09774  [pdf, other

    cs.RO

    Identifying robust landmarks in feature-based maps

    Authors: Julie Stephany Berrio, James Ward, Stewart Worrall, Eduardo Nebot

    Abstract: To operate in an urban environment, an automated vehicle must be capable of accurately estimating its position within a global map reference frame. This is necessary for optimal path planning and safe navigation. To accomplish this over an extended period of time, the global map requires long-term maintenance. This includes the addition of newly observable features and the removal of transient fea… ▽ More

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: Submitted to ICRA2019

    MSC Class: 62J02; 62J07;

  30. arXiv:1710.03631  [pdf, other

    cs.IT

    Angular Accuracy of Steerable Feature Detectors

    Authors: Zsuzsanna Püspöki, Arash Amini, Julien Fageot, John Paul Ward, Michael Unser

    Abstract: The detection of landmarks or patterns is of interest for extracting features in biological images. Hence, algorithms for finding these keypoints have been extensively investigated in the literature, and their localization and detection properties are well known. In this paper, we study the complementary topic of local orientation estimation, which has not received similar attention. Simply stated… ▽ More

    Submitted 10 October, 2017; originally announced October 2017.

    Comments: 13 pages, 3 figures

  31. arXiv:1612.07925  [pdf, ps, other

    cs.DS

    Better Guarantees for k-Means and Euclidean k-Median by Primal-Dual Algorithms

    Authors: Sara Ahmadian, Ashkan Norouzi-Fard, Ola Svensson, Justin Ward

    Abstract: Clustering is a classic topic in optimization with $k$-means being one of the most fundamental such problems. In the absence of any restrictions on the input, the best known algorithm for $k$-means with a provable guarantee is a simple local search heuristic yielding an approximation guarantee of $9+ε$, a ratio that is known to be tight with respect to such methods. We overcome this barrier by p… ▽ More

    Submitted 10 April, 2017; v1 submitted 23 December, 2016; originally announced December 2016.

  32. arXiv:1604.08618  [pdf, ps, other

    cs.NI cs.DC

    Stringer: Balancing Latency and Resource Usage in Service Function Chain Provisioning

    Authors: Freddy C. Chua, Julie Ward, Ying Zhang, Puneet Sharma, Bernardo A. Huberman

    Abstract: Network Functions Virtualization, or NFV, enables telecommunications infrastructure providers to replace special-purpose networking equipment with commodity servers running virtualized network functions (VNFs). A service provider utilizing NFV technology faces the SFC provisioning problem of assigning VNF instances to nodes in the physical infrastructure (e.g., a datacenter), and routing Service F… ▽ More

    Submitted 9 June, 2016; v1 submitted 28 April, 2016; originally announced April 2016.

  33. On The Continuous Steering of the Scale of Tight Wavelet Frames

    Authors: Zsuzsanna Püspöki, John Paul Ward, Daniel Sage, Michael Unser

    Abstract: In analogy with steerable wavelets, we present a general construction of adaptable tight wavelet frames, with an emphasis on scaling operations. In particular, the derived wavelets can be "dilated" by a procedure comparable to the operation of steering steerable wavelets. The fundamental aspects of the construction are the same: an admissible collection of Fourier multipliers is used to extend a t… ▽ More

    Submitted 7 December, 2015; originally announced December 2015.

  34. arXiv:1507.04227  [pdf, ps, other

    cs.DS

    A bi-criteria approximation algorithm for $k$ Means

    Authors: Konstantin Makarychev, Yury Makarychev, Maxim Sviridenko, Justin Ward

    Abstract: We consider the classical $k$-means clustering problem in the setting bi-criteria approximation, in which an algoithm is allowed to output $βk > k$ clusters, and must produce a clustering with cost at most $α$ times the to the cost of the optimal set of $k$ clusters. We argue that this approach is natural in many settings, for which the exact number of clusters is a priori unknown, or unimportant… ▽ More

    Submitted 3 August, 2015; v1 submitted 15 July, 2015; originally announced July 2015.

  35. arXiv:1507.03719  [pdf, ps, other

    cs.DS cs.AI cs.DC cs.LG

    A New Framework for Distributed Submodular Maximization

    Authors: Rafael da Ponte Barbosa, Alina Ene, Huy L. Nguyen, Justin Ward

    Abstract: A wide variety of problems in machine learning, including exemplar clustering, document summarization, and sensor placement, can be cast as constrained submodular maximization problems. A lot of recent effort has been devoted to develo** distributed algorithms for these problems. However, these results suffer from high number of rounds, suboptimal approximation ratios, or both. We develop a fram… ▽ More

    Submitted 11 August, 2016; v1 submitted 14 July, 2015; originally announced July 2015.

  36. arXiv:1505.02020  [pdf, other

    physics.soc-ph cond-mat.stat-mech cs.SI nlin.AO

    Influence of Luddism on innovation diffusion

    Authors: Andrew Mellor, Mauro Mobilia, Sidney Redner, Alastair M. Rucklidge, Jonathan A. Ward

    Abstract: We generalize the classical Bass model of innovation diffusion to include a new class of agents --- Luddites --- that oppose the spread of innovation. Our model also incorporates ignorants, susceptibles, and adopters. When an ignorant and a susceptible meet, the former is converted to a susceptible at a given rate, while a susceptible spontaneously adopts the innovation at a constant rate. In resp… ▽ More

    Submitted 24 November, 2015; v1 submitted 8 May, 2015; originally announced May 2015.

    Comments: 11 pages, 7 figures

    Journal ref: Phys. Rev. E 92, 012806 (2015)

  37. arXiv:1504.07283  [pdf, other

    cs.DC

    QoS-Based Pricing and Scheduling of Batch Jobs in OpenStack Clouds

    Authors: Thomas Sandholm, Julie Ward, Filippo Balestrieri, Bernardo A. Huberman

    Abstract: The current Cloud infrastructure services (IaaS) market employs a resource-based selling model: customers rent nodes from the provider and pay per-node per-unit-time. This selling model places the burden upon customers to predict their job resource requirements and durations. Inaccurate prediction by customers can result in over-provisioning of resources, or under-provisioning and poor job perform… ▽ More

    Submitted 27 April, 2015; originally announced April 2015.

  38. arXiv:1502.02606  [pdf, other

    cs.LG cs.AI cs.DC

    The Power of Randomization: Distributed Submodular Maximization on Massive Datasets

    Authors: Rafael da Ponte Barbosa, Alina Ene, Huy L. Nguyen, Justin Ward

    Abstract: A wide variety of problems in machine learning, including exemplar clustering, document summarization, and sensor placement, can be cast as constrained submodular maximization problems. Unfortunately, the resulting submodular optimization problems are often too large to be solved on a single machine. We develop a simple distributed algorithm that is embarrassingly parallel and it achieves provable… ▽ More

    Submitted 22 April, 2015; v1 submitted 9 February, 2015; originally announced February 2015.

  39. arXiv:1409.1399  [pdf, ps, other

    cs.DS cs.DM

    Maximizing k-Submodular Functions and Beyond

    Authors: Justin Ward, Stanislav Zivny

    Abstract: We consider the maximization problem in the value oracle model of functions defined on $k$-tuples of sets that are submodular in every orthant and $r$-wise monotone, where $k\geq 2$ and $1\leq r\leq k$. We give an analysis of a deterministic greedy algorithm that shows that any such function can be approximated to a factor of $1/(1+r)$. For $r=k$, we give an analysis of a randomised greedy algorit… ▽ More

    Submitted 23 November, 2015; v1 submitted 4 September, 2014; originally announced September 2014.

    Comments: Full version of a SODA'14 paper, to appear in ACM Transactions on Algorithms (TALG)

    ACM Class: F.2.0

    Journal ref: ACM Transactions on Algorithms 12(4) Article no. 47 (2016)

  40. arXiv:1406.4974  [pdf, ps, other

    cs.DC

    Academic Cloud Computing Research: Five Pitfalls and Five Opportunities

    Authors: Adam Barker, Blesson Varghese, Jonathan Stuart Ward, Ian Sommerville

    Abstract: This discussion paper argues that there are five fundamental pitfalls, which can restrict academics from conducting cloud computing research at the infrastructure level, which is currently where the vast majority of academic research lies. Instead academics should be conducting higher risk research, in order to gain understanding and open up entirely new areas. We call for a renewed mindset and… ▽ More

    Submitted 19 June, 2014; originally announced June 2014.

    Comments: Accepted and presented at the 6th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud'14)

  41. arXiv:1311.4728  [pdf, ps, other

    cs.DS

    Optimal approximation for submodular and supermodular optimization with bounded curvature

    Authors: Maxim Sviridenko, Jan Vondrák, Justin Ward

    Abstract: We design new approximation algorithms for the problems of optimizing submodular and supermodular functions subject to a single matroid constraint. Specifically, we consider the case in which we wish to maximize a nondecreasing submodular function or minimize a nonincreasing supermodular function in the setting of bounded total curvature $c$. In the case of submodular maximization with curvature… ▽ More

    Submitted 12 December, 2014; v1 submitted 19 November, 2013; originally announced November 2013.

  42. arXiv:1310.4415  [pdf, ps, other

    cs.DS

    Submodular Stochastic Probing on Matroids

    Authors: Marek Adamczyk, Maxim Sviridenko, Justin Ward

    Abstract: In a stochastic probing problem we are given a universe $E$, where each element $e \in E$ is active independently with probability $p_e$, and only a probe of e can tell us whether it is active or not. On this universe we execute a process that one by one probes elements --- if a probed element is active, then we have to include it in the solution, which we gradually construct. Throughout the proce… ▽ More

    Submitted 18 February, 2014; v1 submitted 16 October, 2013; originally announced October 2013.

  43. arXiv:1309.5821  [pdf, ps, other

    cs.DB

    Undefined By Data: A Survey of Big Data Definitions

    Authors: Jonathan Stuart Ward, Adam Barker

    Abstract: The term big data has become ubiquitous. Owing to a shared origin between academia, industry and the media there is no single unified definition, and various stakeholders provide diverse and often contradictory definitions. The lack of a consistent definition introduces ambiguity and hampers discourse relating to big data. This short paper attempts to collate the various definitions which have gai… ▽ More

    Submitted 20 September, 2013; originally announced September 2013.

    Comments: Big data definition paper, 2 pages

  44. arXiv:1306.1394  [pdf, ps, other

    cs.DC

    A Cloud Computing Survey: Developments and Future Trends in Infrastructure as a Service Computing

    Authors: Jonathan Stuart Ward, Adam Barker

    Abstract: Cloud computing is a recent paradigm based around the notion of delivery of resources via a service model over the Internet. Despite being a new paradigm of computation, cloud computing owes its origins to a number of previous paradigms. The term cloud computing is well defined and no longer merits rigorous taxonomies to furnish a definition. Instead this survey paper considers the past, present a… ▽ More

    Submitted 6 June, 2013; originally announced June 2013.

  45. arXiv:1305.7403  [pdf, other

    cs.DC

    Monitoring Large-Scale Cloud Systems with Layered Gossip Protocols

    Authors: Jonathan Stuart Ward, Adam Barker

    Abstract: Monitoring is an essential aspect of maintaining and develo** computer systems that increases in difficulty proportional to the size of the system. The need for robust monitoring tools has become more evident with the advent of cloud computing. Infrastructure as a Service (IaaS) clouds allow end users to deploy vast numbers of virtual machines as part of dynamic and transient architectures. Curr… ▽ More

    Submitted 31 May, 2013; originally announced May 2013.

    Comments: Extended Abstract for the ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2013) Poster Track

  46. arXiv:1305.4328  [pdf, ps, other

    physics.soc-ph cs.SI nlin.AO

    Competition-induced criticality in a model of meme popularity

    Authors: James P. Gleeson, Jonathan A. Ward, Kevin P. O'Sullivan, William T. Lee

    Abstract: Heavy-tailed distributions of meme popularity occur naturally in a model of meme diffusion on social networks. Competition between multiple memes for the limited resource of user attention is identified as the mechanism that poises the system at criticality. The popularity growth of each meme is described by a critical branching process, and asymptotic analysis predicts power-law distributions of… ▽ More

    Submitted 21 January, 2014; v1 submitted 19 May, 2013; originally announced May 2013.

    Comments: This version accepted for publication in Physical Review Letters. 6 pages main text, 12 pages Supplementary Material

    Journal ref: Phys. Rev. Lett. 112, 048701 (2014)

  47. arXiv:1302.4347  [pdf, ps, other

    cs.DS

    Large Neighborhood Local Search for the Maximum Set Packing Problem

    Authors: Maxim Sviridenko, Justin Ward

    Abstract: In this paper we consider the classical maximum set packing problem where set cardinality is upper bounded by $k$. We show how to design a variant of a polynomial-time local search algorithm with performance guarantee $(k+2)/3$. This local search algorithm is a special case of a more general procedure that allows to swap up to $Θ(\log n)$ elements per iteration. We also design problem instances wi… ▽ More

    Submitted 18 February, 2013; originally announced February 2013.

  48. arXiv:1302.0164  [pdf, ps, other

    physics.soc-ph cs.SI nlin.AO

    Aperiodic dynamics in a deterministic model of attitude formation in social groups

    Authors: Jonathan Ward, Peter Grindrod

    Abstract: Homophily and social influence are the fundamental mechanisms that drive the evolution of attitudes, beliefs and behaviour within social groups. Homophily relates the similarity between pairs of individuals' attitudinal states to their frequency of interaction, and hence structural tie strength, while social influence causes the convergence of individuals' states during interaction. Building on th… ▽ More

    Submitted 1 February, 2013; originally announced February 2013.

  49. arXiv:1209.3318  [pdf, other

    math.OC cs.CV math.NA

    Hessian Schatten-Norm Regularization for Linear Inverse Problems

    Authors: Stamatios Lefkimmiatis, John Paul Ward, Michael Unser

    Abstract: We introduce a novel family of invariant, convex, and non-quadratic functionals that we employ to derive regularized solutions of ill-posed linear inverse imaging problems. The proposed regularizers involve the Schatten norms of the Hessian matrix, computed at every pixel of the image. They can be viewed as second-order extensions of the popular total-variation (TV) semi-norm since they satisfy th… ▽ More

    Submitted 2 February, 2013; v1 submitted 14 September, 2012; originally announced September 2012.

    Comments: 15 pages double-column format. This manuscript will appear in IEEE Transactions on Image Processing

    Journal ref: IEEE Trans. Image Process. 22 (2013), no. 5, 1873--1888

  50. CloudMonitor: Profiling Power Usage

    Authors: James William Smith, Ali Khajeh-Hosseini, Jonathan Stuart Ward, Ian Sommerville

    Abstract: In Cloud Computing platforms the addition of hardware monitoring devices to gather power usage data can be impractical or uneconomical due to the large number of machines to be metered. CloudMonitor, a monitoring tool that can generate power models for software-based power estimation, can provide insights to the energy costs of deployments without additional hardware. Accurate power usage data lea… ▽ More

    Submitted 11 May, 2012; originally announced May 2012.

    Comments: 2 page submission to appear in IEEE Cloud 2012 Work In Progress Track