
Showing 1–19 of 19 results for author: Dell'Amico, M

  1. arXiv:2401.11849

    cs.LG cs.AI math.CO

    Self-Labeling the Job Shop Scheduling Problem

    Authors: Andrea Corsini, Angelo Porrello, Simone Calderara, Mauro Dell'Amico

    Abstract: In this work, we propose a Self-Supervised training strategy specifically designed for combinatorial problems. One of the main obstacles in applying supervised paradigms to such problems is the requirement of expensive target solutions as ground-truth, often produced with costly exact solvers. Inspired by Semi- and Self-Supervised learning, we show that it is possible to easily train generative mo…

    Submitted 22 January, 2024; originally announced January 2024.

    ACM Class: I.2; G.2

  2. arXiv:2207.03244

    cs.LG cs.NE math.CO

    Learning the Quality of Machine Permutations in Job Shop Scheduling

    Authors: Andrea Corsini, Simone Calderara, Mauro Dell'Amico

    Abstract: In recent years, the power demonstrated by Machine Learning (ML) has increasingly attracted the interest of the optimization community, which is starting to leverage ML for enhancing and automating the design of algorithms. One combinatorial optimization problem recently tackled with ML is the Job Shop Scheduling Problem (JSP). Most of the works on the JSP using ML focus on Deep Reinforcement Learni…

    Submitted 16 September, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    MSC Class: 90B35 ACM Class: I.2.6; G.2.1

  3. Unsupervised Detection and Clustering of Malicious TLS Flows

    Authors: Gibran Gomez, Platon Kotzias, Matteo Dell'Amico, Leyla Bilge, Juan Caballero

    Abstract: Malware abuses TLS to encrypt its malicious traffic, preventing examination by content signatures and deep packet inspection. Network detection of malicious TLS flows is an important, but challenging, problem. Prior works have proposed supervised machine learning detectors using TLS features. However, by trying to represent all malicious traffic, supervised binary detectors produce models that are…

    Submitted 23 December, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

  4. arXiv:2107.13275

    cs.DM

    Benchmark Instances and Optimal Solutions for the Traveling Salesman Problem with Drone

    Authors: Mauro Dell'Amico, Roberto Montemanni, Stefano Novellani

    Abstract: The use of drones in logistics is gaining more and more interest, and drones are becoming a more viable and common way of distributing parcels in an urban environment. As a consequence, there is a flourishing production of articles in the field of operational optimization of the combined use of trucks and drones for fulfilling customer requests. The aim is minimizing the total time required to se…

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: 21 pages, reference to benchmark

  5. arXiv:1910.07283

    cs.LG cs.DS stat.ML

    FISHDBC: Flexible, Incremental, Scalable, Hierarchical Density-Based Clustering for Arbitrary Data and Distance

    Authors: Matteo Dell'Amico

    Abstract: FISHDBC is a flexible, incremental, scalable, and hierarchical density-based clustering algorithm. It is flexible because it empowers users to work on arbitrary data, skipping the feature extraction step that usually transforms raw data into numeric arrays and letting users define an arbitrary distance function instead. It is incremental and scalable: it avoids the $\mathcal O(n^2)$ performance of other…

    Submitted 16 October, 2019; originally announced October 2019.

  6. arXiv:1907.04824

    cs.PF cs.DS

    Scheduling With Inexact Job Sizes: The Merits of Shortest Processing Time First

    Authors: Matteo Dell'Amico

    Abstract: It is well known that size-based scheduling policies, which take into account job size (i.e., the time it takes to run them), can perform very desirably in terms of both response time and fairness. Unfortunately, the requirement of knowing a priori the exact job size is a major obstacle which is frequently insurmountable in practice. Often, it is possible to get a coarse estimation of job size, bu…

    Submitted 10 July, 2019; originally announced July 2019.

  7. The Supermarket Model with Known and Predicted Service Times

    Authors: Michael Mitzenmacher, Matteo Dell'Amico

    Abstract: The supermarket model refers to a system with a large number of queues, where new customers choose d queues at random and join the one with the fewest customers. This model demonstrates the power of even small amounts of choice, as compared to simply joining a queue chosen uniformly at random, for load balancing systems. In this work we perform simulation-based studies to consider variations where…

    Submitted 17 February, 2022; v1 submitted 23 May, 2019; originally announced May 2019.

    Comments: Revision, 16 pages, 22 figures. Published at IEEE TPDS

    ACM Class: F.2.2; C.1.4; G.3; D.4.8

  8. Enhanced arc-flow formulations to minimize weighted completion time on identical parallel machines

    Authors: Arthur Kramer, Mauro Dell'Amico, Manuel Iori

    Abstract: We consider the problem of scheduling a set of jobs on a set of identical parallel machines, with the aim of minimizing the total weighted completion time. The problem has been solved in the literature with a number of mathematical formulations, some of which require the implementation of tailored branch-and-price methods. In our work, we solve the problem instead by means of new arc-flow formulat…

    Submitted 31 August, 2018; originally announced August 2018.

    Comments: 25 pages

  9. arXiv:1506.09158

    cs.DS

    On Fair Size-Based Scheduling

    Authors: Matteo Dell'Amico, Damiano Carra, Pietro Michiardi

    Abstract: By executing jobs serially rather than in parallel, size-based scheduling policies can shorten the time needed to complete jobs; however, major obstacles to their applicability are fairness guarantees and the fact that job sizes are rarely known exactly a priori. Here, we introduce the Pri family of size-based scheduling policies; Pri simulates any reference scheduler and executes jobs in the order of…

    Submitted 30 June, 2015; originally announced June 2015.

  10. arXiv:1410.6122

    cs.DC

    PSBS: Practical Size-Based Scheduling

    Authors: Matteo Dell'Amico, Damiano Carra, Pietro Michiardi

    Abstract: Size-based schedulers have very desirable performance properties: optimal or near-optimal response time can be coupled with strong fairness guarantees. Despite this, such systems are very rarely implemented in practical settings, because they require knowing a priori the amount of work needed to complete jobs: this assumption is very difficult to satisfy in concrete systems. It is definitely more…

    Submitted 6 August, 2015; v1 submitted 22 October, 2014; originally announced October 2014.

    Comments: arXiv admin note: substantial text overlap with arXiv:1403.5996

  11. On User Availability Prediction and Network Applications

    Authors: Matteo Dell'Amico, Maurizio Filippone, Pietro Michiardi, Yves Roudier

    Abstract: User connectivity patterns in network applications are known to be heterogeneous, and to follow periodic (daily and weekly) patterns. In many cases, the regularity and the correlation of those patterns are problematic: for network applications, many connected users create peaks of demand; in contrast, in peer-to-peer scenarios, having few users online results in a scarcity of available resources. O…

    Submitted 30 April, 2014; originally announced April 2014.

    Comments: Accepted for publication in IEEE/ACM Transactions on Networking

  12. arXiv:1403.5996

    cs.DS

    Revisiting Size-Based Scheduling with Estimated Job Sizes

    Authors: Matteo Dell'Amico, Damiano Carra, Mario Pastorelli, Pietro Michiardi

    Abstract: We study size-based schedulers, and focus on the impact of inaccurate job size information on response time and fairness. Our intent is to revisit previous results, which allude to performance degradation for even small errors on job size estimates, thus limiting the applicability of size-based schedulers. We show that scheduling performance is tightly connected to workload characteristics: in t…

    Submitted 25 July, 2014; v1 submitted 24 March, 2014; originally announced March 2014.

    Comments: To be published in the proceedings of IEEE MASCOTS 2014

  13. arXiv:1402.2107

    cs.DC

    OS-Assisted Task Preemption for Hadoop

    Authors: Mario Pastorelli, Matteo Dell'Amico, Pietro Michiardi

    Abstract: This work introduces a new task preemption primitive for Hadoop that allows tasks to be suspended and resumed by exploiting existing memory management mechanisms readily available in modern operating systems. Our technique fills the gap that exists between the two extreme cases of killing tasks (which wastes work) or waiting for their completion (which introduces latency): experimental results indic…

    Submitted 10 February, 2014; originally announced February 2014.

  14. arXiv:1306.6023

    cs.DC

    A Simulator for Data-Intensive Job Scheduling

    Authors: Matteo Dell'Amico

    Abstract: Although size-based schedulers can give excellent results in terms of both average response times and fairness, data-intensive computing execution engines generally do not employ them, mainly because job size is not known a priori. In this work, we perform a simulation-based analysis of the performance of size-based schedulers when they are employed…

    Submitted 21 August, 2013; v1 submitted 25 June, 2013; originally announced June 2013.

    Report number: EURECOM RR-13-282

  15. arXiv:1302.2749

    cs.DC

    Practical Size-based Scheduling for MapReduce Workloads

    Authors: Mario Pastorelli, Antonio Barbuzzi, Damiano Carra, Matteo Dell'Amico, Pietro Michiardi

    Abstract: We present the Hadoop Fair Sojourn Protocol (HFSP) scheduler, which implements a size-based scheduling discipline for Hadoop. The benefits of size-based scheduling disciplines are well recognized in a variety of contexts (computer networks, operating systems, etc.), yet their practical implementation for a system such as Hadoop raises a number of important challenges. With HFSP, which is availa…

    Submitted 3 May, 2013; v1 submitted 12 February, 2013; originally announced February 2013.

    Comments: 12 pages, 8 figures

    ACM Class: C.2.4

  16. arXiv:1201.2360

    cs.DC

    Adaptive Redundancy Management for Durable P2P Backup

    Authors: Matteo Dell'Amico, Pietro Michiardi, Laszlo Toka, Pasquale Cataldi

    Abstract: We design and analyze the performance of a redundancy management mechanism for Peer-to-Peer backup applications. Armed with the realization that a backup system has peculiar requirements -- namely, data is read over the network only during restore processes caused by data loss -- redundancy management targets data durability rather than attempting to make each piece of information available at an…

    Submitted 17 January, 2014; v1 submitted 11 January, 2012; originally announced January 2012.

  17. arXiv:1010.0626

    cs.NI

    Back To The Future: On Predicting User Uptime

    Authors: Matteo Dell'Amico, Pietro Michiardi, Yves Roudier

    Abstract: Correlation in user connectivity patterns is generally considered a problem for system designers, since it results in peaks of demand and also in the scarcity of resources for peer-to-peer applications. The other side of the coin is that these connectivity patterns are often predictable and that, to some extent, they can be dealt with proactively. In this work, we build predictors aiming to dete…

    Submitted 4 October, 2010; originally announced October 2010.

  18. arXiv:1009.1344

    cs.NI cs.DC

    On Scheduling and Redundancy for P2P Backup

    Authors: Laszlo Toka, Matteo Dell'Amico, Pietro Michiardi

    Abstract: An online backup system should be quick and reliable in both saving and restoring users' data. To do so in a peer-to-peer implementation, data transfer scheduling and the amount of redundancy must be chosen wisely. We formalize the problem of exchanging multiple pieces of data with intermittently available peers, and we show that random scheduling completes transfers nearly optimally in terms of…

    Submitted 7 September, 2010; originally announced September 2010.

  19. arXiv:0907.3402

    cs.CR

    Measuring Password Strength: An Empirical Analysis

    Authors: Matteo Dell'Amico, Pietro Michiardi, Yves Roudier

    Abstract: We present an in-depth analysis of the strength of almost 10,000 passwords from users of an instant messaging server in Italy. We estimate the strength of those passwords, and compare the effectiveness of state-of-the-art attack methods such as dictionaries and Markov chain-based techniques. We show that the strength of passwords chosen by users varies enormously, and that the cost of atta…

    Submitted 20 July, 2009; originally announced July 2009.

    Comments: 15 pages, 9 figures