-
Differential reddening in 48 globular clusters: An end to the quest for the intracluster medium
Authors:
E. Pancino,
A. Zocchi,
M. Rainer,
M. Monaci,
D. Massari,
M. Monelli,
L. K. Hunt,
L. Monaco,
C. E. Martínez-Vázquez,
N. Sanna,
S. Bianchi,
P. B. Stetson
Abstract:
For decades, it has been theorized that a tenuous but detectable intracluster medium should be present in globular clusters, which is continuously replenished by the gas and dust ejected by bright giants and periodically cleared by interactions with the Galactic disk. However, dedicated searches, especially in infrared and radio wavelengths, have returned mostly upper limits, which are lower than theoretical expectations by several orders of magnitude. We profited from recent wide-field photometry for 48 Galactic globular clusters to compute high-resolution maps of differential reddening, which can be used to correct any photometric catalog in these areas for reddening variations. Using 3D reddening maps from the literature, we evaluated the amount of foreground extinction. This allowed us to estimate the masses of the intracluster medium in our sample clusters, with an accuracy of one order of magnitude. Our estimates agree with the few available literature detections and with theoretical expectations. Because the discrepancy between observations and expectations only concerns literature upper limits, we explored possible reasons why they could be underestimated and we show that two recent discoveries can explain the discrepancy. The first is the recent discovery that the intracluster medium in 47 Tuc is not centrally concentrated. This is also supported by our maps, which in the majority of cases do not show a central reddening concentration. The second is the discovery that the dust in metal-poor ([Fe/H] less than about -1 dex) globular clusters is dominated by iron grains rather than silicates, which undermines previous dust mass estimates from observed upper limits. We conclude that current evidence, including our maps, does not contradict theoretical expectations and the problem of the missing intracluster medium is no longer an issue.
Submitted 8 April, 2024;
originally announced April 2024.
-
Shear, writhe and filaments: turbulence in the high latitude molecular cloud MBM 40
Authors:
Marco Monaci,
Loris Magnani,
Steven N. Shore,
Henrik Olofsson,
Mackenzie R. Joy
Abstract:
Context. It is almost banal to say that the interstellar medium (ISM) is structurally and thermodynamically complex. But the variety of the governing processes, including stellar feedback, renders the investigation challenging. High latitude molecular clouds (HLMCs) with no evidence of internal star formation, such as MBM 40, are excellent sites for studying the chemistry and dynamic evolution of the cold neutral ISM.
Aims. We used this high latitude cloud as an exemplar for the dynamical and chemical processes in the diffuse interstellar medium.
Methods. We analyzed new and archival $^{12}$CO, $^{13}$CO, CH, HCO$^+$, CS, H$_2$CO, and HCN data from the Five College Radio Astronomy Observatory (FCRAO), Onsala Space Observatory (OSO), Arizona Radio Observatory (ARO), and the W. Gordon telescope (Arecibo), combined with the Galactic Arecibo L-band Feed Array HI (GALFA-HI) 21 cm data set, to study the chemistry, thermal state, and dynamics of MBM 40. A new dynamical analytical approach was adopted by considering each line profile as a line-of-sight Probability Distribution Function (PDF) of the turbulence weighted by gas emissivity.
Results. The atomic and molecular gas are smoothly distributed in space and velocity. No steep transition is seen between circumcloud atomic and cloud molecular gas in either radial velocity or structure. We propose a topology of the cloud from the molecular tracers: a contorted filamentary structure that is shaped by a broad embedding shear flow in the neutral atomic gas. Comparative examination of different molecular tracers shows that $^{13}$CO, H$_2$CO and CS arise only from denser molecular cores, whereas $^{12}$CO, CH and HCO$^+$ trace diffuse gas with a broader range of dynamics.
Submitted 11 June, 2023;
originally announced June 2023.
-
Unboxing Tree Ensembles for interpretability: a hierarchical visualization tool and a multivariate optimal re-built tree
Authors:
Giulia Di Teodoro,
Marta Monaci,
Laura Palagi
Abstract:
The interpretability of models has become a crucial issue in Machine Learning because of algorithmic decisions' growing impact on real-world applications. Tree ensemble methods, such as Random Forests or XGBoost, are powerful learning tools for classification tasks. However, while combining multiple trees may provide higher prediction quality than a single one, it sacrifices the interpretability property, resulting in "black-box" models. In light of this, we aim to develop an interpretable representation of a tree-ensemble model that can provide valuable insights into its behavior. First, given a target tree-ensemble model, we develop a hierarchical visualization tool based on a heatmap representation of the forest's feature use, considering the frequency of a feature and the level at which it is selected as an indicator of importance. Next, we propose a mixed-integer linear programming (MILP) formulation for constructing a single optimal multivariate tree that accurately mimics the target model predictions. The goal is to provide an interpretable surrogate model based on oblique hyperplane splits, which uses only the most relevant features according to the defined forest's importance indicators. The MILP model includes a penalty on feature selection based on their frequency in the forest to further induce sparsity of the splits. The natural formulation has been strengthened to improve the computational performance of mixed-integer software. Computational experiments are carried out on benchmark datasets from the UCI repository using a state-of-the-art off-the-shelf solver. Results show that the proposed model is effective in yielding a shallow interpretable tree approximating the tree-ensemble decision function.
Submitted 18 January, 2024; v1 submitted 15 February, 2023;
originally announced February 2023.
-
The mixing of dust and gas in the high latitude translucent cloud MBM 40
Authors:
Marco Monaci,
Loris Magnani,
Steven N. Shore
Abstract:
Context. High latitude molecular clouds (hereafter HLMCs) permit the study of interstellar gas dynamics and astrochemistry with good accuracy due to their proximity, generally clear lines of sight, and lack of internal star-forming activity which can heavily modify the physical context. MBM 40, one of the nearest HLMCs, has been extensively studied, making it a superb target to infer and study the dust-to-gas mixing ratio (DGMR).
Aims. The mixing of dust and gas in the interstellar medium remains a fundamental issue to keep track of astrochemistry evolution and molecular abundances. Accounting for both molecular and atomic gas is difficult because H$_2$ is not directly observable and HI spectra always show different dynamical profiles blended together which are not directly correlated with the cloud. We used two independent strategies to infer the molecular and atomic gas column densities and compute the dust-to-gas mixing ratio.
Methods. We combined HI 21 cm and $^{12}$CO line observations with the IRAS 100 $\mu$m image to infer the dust-to-gas mixing ratio within the cloud. The cloud 21 cm profile was extracted using a hybrid Gaussian decomposition where $^{12}$CO was used to deduce the total molecular hydrogen column density. Infrared images were used to calculate the dust emission.
Results. The dust-to-gas mixing ratio is nearly uniform within the cloud as outlined by the hairpin structure. The total hydrogen column density and 100 $\mu$m emissivity are linearly correlated over a range in $N(\mathrm{H_{tot}})$ of one order of magnitude.
Submitted 12 January, 2023;
originally announced January 2023.
-
Margin Optimal Classification Trees
Authors:
Federico D'Onofrio,
Giorgio Grani,
Marta Monaci,
Laura Palagi
Abstract:
In recent years, there has been growing attention to interpretable machine learning models which can give explanatory insights on their behaviour. Thanks to their interpretability, decision trees have been intensively studied for classification tasks and, due to the remarkable advances in mixed integer programming (MIP), various approaches have been proposed to formulate the problem of training an Optimal Classification Tree (OCT) as a MIP model. We present a novel mixed integer quadratic formulation for the OCT problem, which exploits the generalization capabilities of Support Vector Machines for binary classification. Our model, denoted as Margin Optimal Classification Tree (MARGOT), encompasses maximum margin multivariate hyperplanes nested in a binary tree structure. To enhance the interpretability of our approach, we analyse two alternative versions of MARGOT, which include feature selection constraints inducing sparsity of the hyperplanes' coefficients. First, MARGOT has been tested on non-linearly separable synthetic datasets in a 2-dimensional feature space to provide a graphical representation of the maximum margin approach. Finally, the proposed models have been tested on benchmark datasets from the UCI repository. The MARGOT formulation turns out to be easier to solve than other OCT approaches, and the generated tree better generalizes on new observations. The two interpretable versions effectively select the most relevant features, maintaining good prediction quality.
Submitted 8 October, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
An Exact Method for Fortification Games
Authors:
Markus Leitner,
Ivana Ljubić,
Michele Monaci,
Markus Sinnl,
Kübra Tanınmış
Abstract:
A fortification game (FG) is a three-level, two-player Stackelberg game, also known as defender-attacker-defender game, in which at the uppermost level, the defender selects some assets to be protected from potential malicious attacks. At the middle level, the attacker solves an interdiction game by depreciating unprotected assets, i.e., reducing the values of such assets for the defender, while at the innermost level the defender solves a recourse problem over the surviving or partially damaged assets. Fortification games have applications in various important areas, such as military operations, design of survivable networks, protection of facilities, or power grid protection. In this work, we present an exact solution algorithm for FGs, in which the recourse problems correspond to (possibly NP-hard) combinatorial optimization problems. The algorithm is based on a new generic mixed-integer linear programming reformulation in the natural space of fortification variables. Our new model makes use of fortification cuts that measure the contribution of a given fortification strategy to the objective function value. These cuts are generated on-the-fly by solving separation problems, which correspond to (modified) middle-level interdiction games. We design a branch-and-cut-based solution algorithm based on fortification cuts, their lifted versions, and other speed-up techniques. We present a computational study using the knapsack fortification game and the shortest path fortification game. For the latter one, we include a comparison with a state-of-the-art solution method from the literature. Our algorithm outperforms this method and allows us to solve previously unsolved instances to optimality.
Submitted 9 February, 2022; v1 submitted 26 November, 2021;
originally announced November 2021.
-
An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents
Authors:
Marta Monaci,
Valerio Agasucci,
Giorgio Grani
Abstract:
There is a growing interest in integrating machine learning techniques and optimization to solve challenging optimization problems. In this work, we propose a deep reinforcement learning methodology for the job shop scheduling problem (JSSP). The aim is to build up a greedy-like heuristic able to learn on some distribution of JSSP instances, different in the number of jobs and machines. The need for fast scheduling methods is well known, and it arises in many areas, from transportation to healthcare. We model the JSSP as a Markov Decision Process and then we exploit the efficacy of reinforcement learning to solve the problem. We adopt an actor-critic scheme, where the action taken by the agent is influenced by policy considerations on the state-value function. The procedures are adapted to take into account the challenging nature of JSSP, where the state and the action space change not only for every instance but also after each decision. To tackle the variability in the number of jobs and operations in the input, we modeled the agent using two incident LSTM models, a special type of deep neural network. Experiments show the algorithm reaches good solutions in a short time, proving that it is possible to generate new greedy heuristics just from learning-based methodologies. Benchmarks have been generated in comparison with the commercial solver CPLEX. As expected, the model can generalize, to some extent, to larger problems or instances originated by a different distribution from the one used in training.
Submitted 21 November, 2023; v1 submitted 18 October, 2021;
originally announced October 2021.
-
Network Design with Service Requirements: Scaling-up the Size of Solvable Problems
Authors:
Naga V. C. Gudapati,
Enrico Malaguti,
Michele Monaci
Abstract:
Network design, a cornerstone of mathematical optimization, is about defining the main characteristics of a network satisfying requirements on connectivity, capacity, and level-of-service. It finds applications in logistics and transportation, telecommunications, data sharing, energy distribution, and distributed computing. In multi-commodity network design, one is required to design a network minimizing the installation cost of its arcs and the operational cost to serve a set of point-to-point connections. The definition of this prototypical problem was recently enriched by additional constraints imposing that each origin-destination of a connection is served by a single path satisfying one or more level-of-service requirements, thus defining the Network Design with Service Requirements [Balakrishnan, Li, and Mirchandani. Operations Research, 2017]. These constraints are crucial, e.g., in telecommunications and computer networks, in order to ensure reliable and low-latency communication. In this paper we provide a new formulation for the problem, where variables are associated with paths satisfying the end-to-end service requirements. We present a fast algorithm for enumerating all the exponentially-many feasible paths and, when this is not viable, we provide a column generation scheme that is embedded into a branch-and-cut-and-price algorithm. Extensive computational experiments on a large set of instances show that our approach is able to move a step further in the solution of the Network Design with Service Requirements, compared with the current state-of-the-art.
Submitted 2 July, 2021;
originally announced July 2021.
-
Exact Solution Techniques for Two-dimensional Cutting and Packing
Authors:
Manuel Iori,
Vinícius L. de Lima,
Silvano Martello,
Flávio K. Miyazawa,
Michele Monaci
Abstract:
We survey the main formulations and solution methods for two-dimensional orthogonal cutting and packing problems, where both items and bins are rectangles. We focus on exact methods and relaxations for the four main problems from the literature: finding a packing with minimum height, packing the items into the minimum number of bins, finding a packing of maximum value, and determining the existence of a feasible packing.
Submitted 27 April, 2020;
originally announced April 2020.
-
In Search of Dense Subgraphs: How Good is Greedy Peeling?
Authors:
Naga V. C. Gudapati,
Enrico Malaguti,
Michele Monaci
Abstract:
The problem of finding the densest subgraph in a given graph has several applications in graph mining, particularly in areas like social network analysis and protein and gene analyses. Depending on the application, finding dense subgraphs can be used to determine regions of high importance, similar characteristics or enhanced interaction. The densest subgraph extraction problem is fundamentally a non-linear optimization problem. Nevertheless, it can be solved in polynomial time by an exact algorithm based on the iterative solution of a series of maximum flow sub-problems. Despite its polynomial time complexity, the computing time required by the exact algorithms on very large graphs could be prohibitive. Thus, to approach graphs with millions of vertices and edges, one has to resort to heuristic algorithms. We provide an efficient implementation of a greedy heuristic from the literature that is extremely fast and has some nice theoretical properties. We also introduce a new heuristic algorithm that is built on top of the greedy and the exact methods. An extensive computational study is presented to evaluate the performance of various solution methods on a benchmark composed of 86 instances taken from the literature. This analysis shows that the proposed heuristic algorithm is very effective on a large number of test instances, often providing either the optimal solution or a near-optimal solution within short computing times.
Submitted 6 November, 2019;
originally announced November 2019.
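For readers unfamiliar with the greedy heuristic named in the title of the entry above, the following is a minimal sketch, not the authors' implementation. It assumes the heuristic is the classic minimum-degree peeling scheme and that density is measured as edges divided by vertices; the function name `greedy_peeling` and the edge-list input format are illustrative choices.

```python
import heapq
from collections import defaultdict


def greedy_peeling(edges):
    """Peel minimum-degree vertices one at a time and keep the densest
    intermediate subgraph. Density is assumed to be |E| / |V|.
    Returns (best_density, best_vertex_set)."""
    # Build an undirected simple graph, ignoring self-loops.
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:
            adj[u].add(v)
            adj[v].add(u)

    degree = {v: len(nbrs) for v, nbrs in adj.items()}
    n = len(adj)
    m = sum(degree.values()) // 2

    # Min-heap of (degree, vertex); outdated entries are skipped lazily.
    heap = [(d, v) for v, d in degree.items()]
    heapq.heapify(heap)

    removed = set()
    order = []                      # vertices in the order they are peeled
    best_density = m / n if n else 0.0
    best_prefix = 0                 # number of peeled vertices at the best point

    while heap:
        d, v = heapq.heappop(heap)
        if v in removed or d != degree[v]:
            continue                # stale heap entry
        removed.add(v)
        order.append(v)
        m -= degree[v]              # edges to remaining neighbours disappear
        n -= 1
        for w in adj[v]:
            if w not in removed:
                degree[w] -= 1
                heapq.heappush(heap, (degree[w], w))
        if n > 0 and m / n > best_density:
            best_density = m / n
            best_prefix = len(order)

    best_set = set(adj) - set(order[:best_prefix])
    return best_density, best_set


# A 4-clique with a two-edge tail attached: the tail is peeled first.
example_edges = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4), (4, 5)]
density, nodes = greedy_peeling(example_edges)
print(density, sorted(nodes))
```

The lazy-deletion heap keeps the sketch short; a bucket queue indexed by degree is a common alternative when linear running time matters on very large graphs.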
-
StreaMon: a data-plane programming abstraction for Software-defined Stream Monitoring
Authors:
Giuseppe Bianchi,
Marco Bonola,
Giulio Picierro,
Salvatore Pontarelli,
Marco Monaci
Abstract:
The fast-evolving nature of modern cyber threats and network monitoring needs calls for new, "software-defined", approaches to simplify and quicken programming and deployment of online (stream-based) traffic analysis functions. StreaMon is a carefully designed data-plane abstraction devised to scalably decouple the "programming logic" of a traffic analysis application (tracked states, features, anomaly conditions, etc.) from elementary primitives (counting and metering, matching, events generation, etc.), efficiently pre-implemented in the probes, and used as common instruction set for supporting the desired logic. Multi-stage multi-step real-time tracking and detection algorithms are supported via the ability to deploy custom states, relevant state transitions, and associated monitoring actions and triggering conditions. Such a separation entails platform-independent, portable, online traffic analysis tasks written in a high level language, without requiring developers to access the monitoring device internals and program their custom monitoring logic via low level compiled languages (e.g., C, assembly, VHDL). We validate our design by developing a prototype and a set of simple (but functionally demanding) use-case applications and by testing them over real traffic traces.
Submitted 11 November, 2013;
originally announced November 2013.