Search | arXiv e-print repository

A critical appraisal of water table depth estimation: Challenges and opportunities within machine learning

Authors: Joseph Janssen, Ardalan Tootchi, Ali A. Ameli

Abstract: Fine-resolution spatial patterns of water table depth (WTD) play a crucial role in shaping ecological resilience, hydrological connectivity, and anthropocentric objectives. Generally, a large-scale (e.g., continental or global) spatial map of static WTD can be simulated using either physically-based (PB) or machine learning-based (ML) models. We construct three fine-resolution (500 m) ML simulations of WTD, using the XGBoost algorithm and more than 20 million real and proxy observations of WTD, across the United States and Canada. The three ML models were constrained using known physical relations between WTD's drivers and WTD and were trained by sequentially adding real and proxy observations of WTD. We interpret the black box of our physically constrained ML models and compare it against available literature in groundwater hydrology. Through an extensive (pixel-by-pixel) evaluation, we demonstrate that our models can more accurately predict unseen real and proxy observations of WTD across most of North America's ecoregions compared to three available PB simulations of WTD. However, we still argue that large-scale WTD estimation is far from being a solved problem. We reason that due to biased observational data mainly collected from low-elevation floodplains, the misspecification of equations within physically-based models, and the over-flexibility of machine learning models, verifiably accurate simulations of WTD do not yet exist. Ultimately, we thoroughly discuss future directions that may help hydrogeologists decide how to proceed with WTD estimations, with a particular focus on the application of machine learning and the use of proxy satellite data.

Submitted 9 June, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

arXiv:2404.08972

Improved Approximations for Flexible Network Design

Authors: Dylan Hyatt-Denesik, Afrouz Jabal Ameli, Laura Sanita

Abstract: Flexible network design deals with building a network that guarantees some connectivity requirements between its vertices, even when some of its elements (like vertices or edges) fail. In particular, the set of edges (resp. vertices) of a given graph is here partitioned into safe and unsafe. The goal is to identify a minimum size subgraph that is 2-edge-connected (resp. 2-vertex-connected), and stays so whenever any of the unsafe elements gets removed. In this paper, we provide improved approximation algorithms for flexible network design problems, considering both edge-connectivity and vertex-connectivity, as well as connectivity values higher than 2. For the vertex-connectivity variant, in particular, our algorithm is the first with approximation factor strictly better than 2.

Submitted 13 April, 2024; originally announced April 2024.

arXiv:2311.17010

Node Connectivity Augmentation of Highly Connected Graphs

Authors: Waldo Galvez, Dylan Hyatt-Denesik, Afrouz Jabal Ameli, Laura Sanita

Abstract: Node-connectivity augmentation is a fundamental network design problem. We are given a $k$-node connected graph $G$ together with an additional set of links, and the goal is to add a cheap subset of links to $G$ to make it $(k+1)$-node connected. In this work, we characterize completely the computational complexity status of the problem, by showing hardness for all values of $k$ which were not addressed previously in the literature. We then focus on $k$-node connectivity augmentation for $k=n-4$, which corresponds to the highest value of $k$ for which the problem is NP-hard. We improve over the previously best known approximation bounds for this problem, by developing a $\frac{3}{2}$-approximation algorithm for the weighted setting, and a $\frac{4}{3}$-approximation algorithm for the unweighted setting.

Submitted 28 November, 2023; originally announced November 2023.

arXiv:2311.11869

A PTAS for Triangle-Free 2-Matching

Authors: Miguel Bosch-Calvo, Fabrizio Grandoni, Afrouz Jabal Ameli

Abstract: In the Triangle-Free (Simple) 2-Matching problem we are given an undirected graph $G=(V,E)$. Our goal is to compute a maximum-cardinality $M\subseteq E$ satisfying the following properties: (1) at most two edges of $M$ are incident on each node (i.e., $M$ is a 2-matching) and (2) $M$ does not induce any triangle. In his Ph.D. thesis from 1984, Hartvigsen presents a complex polynomial-time algorithm for this problem, with a very complex analysis. This result was never published in a journal nor reproved in a different way, to the best of our knowledge. In this paper we have a fresh look at this problem and present a simple PTAS for it based on local search. Our PTAS exploits the fact that, as long as the current solution is far enough from the optimum, there exists a short augmenting trail (similar to the maximum matching case).

Submitted 20 November, 2023; originally announced November 2023.

Comments: 27 pages, 18 figures

MSC Class: 68W25 (Primary) 68W40; 68R10; 05C40; 05C70 (Secondary) ACM Class: F.2.2

arXiv:2305.02240

A $4/3$ Approximation for $2$-Vertex-Connectivity

Authors: Miguel Bosch-Calvo, Fabrizio Grandoni, Afrouz Jabal Ameli

Abstract: The 2-Vertex-Connected Spanning Subgraph problem (2VCSS) is among the most basic NP-hard (Survivable) Network Design problems: we are given an (unweighted) undirected graph $G$. Our goal is to find a subgraph $S$ of $G$ with the minimum number of edges which is $2$-vertex-connected, namely $S$ remains connected after the deletion of an arbitrary node. 2VCSS is well-studied in terms of approximation algorithms, and the current best (polynomial-time) approximation factor is $10/7$ by Heeger and Vygen [SIDMA'17] (improving on earlier results by Khuller and Vishkin [STOC'92] and Garg, Vempala and Singla [SODA'93]).

Submitted 19 December, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

Comments: 33 pages, 20 figures, ICALP 2023

MSC Class: 68W25 (Primary) 68W40; 68R10; 05C40; 05C85 (Secondary) ACM Class: F.2.2

arXiv:2211.12431

Finding Almost Tight Witness Trees

Authors: Dylan Hyatt-Denesik, Afrouz Jabal Ameli, Laura Sanità

Abstract: This paper addresses a graph optimization problem, called the Witness Tree problem, which seeks a spanning tree of a graph minimizing a certain non-linear objective function. This problem is of interest because it plays a crucial role in the analysis of the best approximation algorithms for two fundamental network design problems: Steiner Tree and Node-Tree Augmentation. We will show how a wiser choice of witness trees leads to an improved approximation for Node-Tree Augmentation, and for Steiner Tree in special classes of graphs.

Submitted 5 May, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

Comments: 33 pages, 7 figures, submitted to IPCO 2023

arXiv:2209.10265

Improved Approximation for Two-Edge-Connectivity

Authors: Mohit Garg, Fabrizio Grandoni, Afrouz Jabal Ameli

Abstract: The basic goal of survivable network design is to construct low-cost networks which preserve a sufficient level of connectivity despite the failure or removal of a few nodes or edges. One of the most basic problems in this area is the $2$-Edge-Connected Spanning Subgraph problem (2-ECSS): given an undirected graph $G$, find a $2$-edge-connected spanning subgraph $H$ of $G$ with the minimum number of edges (in particular, $H$ remains connected after the removal of one arbitrary edge). 2-ECSS is NP-hard and the best-known (polynomial-time) approximation factor for this problem is $4/3$. Interestingly, this factor was achieved with drastically different techniques by [Hunkenschröder, Vempala and Vetta '00,'19] and [Sebő and Vygen, '14]. In this paper we present an improved $\frac{118}{89}+\varepsilon<1.326$ approximation for 2-ECSS. The key ingredient in our approach (which might also be helpful in future work) is a reduction to a special type of structured graphs: our reduction preserves approximation factors up to $6/5$. While reducing to 2-vertex-connected graphs is trivial (and heavily used in prior work), our structured graphs are "almost" 3-vertex-connected: more precisely, given any 2-vertex-cut $\{u,v\}$ of a structured graph $G=(V,E)$, $G[V\setminus \{u,v\}]$ has exactly 2 connected components, one of which contains exactly one node of degree $2$ in $G$.

Submitted 12 November, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: SODA 2023 (To Appear)

arXiv:2112.11799

Breaching the 2-Approximation Barrier for the Forest Augmentation Problem

Authors: Fabrizio Grandoni, Afrouz Jabal Ameli, Vera Traub

Abstract: The basic goal of survivable network design is to build cheap networks that guarantee the connectivity of certain pairs of nodes despite the failure of a few edges or nodes. A celebrated result by Jain [Combinatorica'01] provides a 2-approximation for a wide class of these problems. However, nothing better is known even for very basic special cases, raising the natural question whether any improved approximation factor is possible at all. In this paper we address one of the most basic problems in this family for which 2 is still the best-known approximation factor, the Forest Augmentation Problem (FAP): given an undirected unweighted graph (that w.l.o.g. is a forest) and a collection of extra edges (links), compute a minimum cardinality subset of links whose addition to the graph makes it 2-edge-connected. Several better-than-2 approximation algorithms are known for the special case where the input graph is a tree, a.k.a. the Tree Augmentation Problem (TAP). Recently this was achieved also for the weighted version of TAP, and for the k-edge-connectivity generalization of TAP. These results heavily exploit the fact that the input graph is connected, a condition that does not hold in FAP. In this paper we breach the 2-approximation barrier for FAP. Our result is based on two main ingredients. First, we describe a reduction to the Path Augmentation Problem (PAP), the special case of FAP where the input graph is a collection of disjoint paths. Our reduction is not approximation preserving; however, it is sufficiently accurate to improve on a factor 2 approximation. Second, we present a better-than-2 approximation algorithm for PAP, an open problem on its own. Here we exploit a novel notion of implicit credits which might turn out to be helpful in future related work.

Submitted 20 April, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

arXiv:2105.08577

Approximation Algorithms for Demand Strip Packing

Authors: Waldo Gálvez, Fabrizio Grandoni, Afrouz Jabal Ameli, Kamyar Khodamoradi

Abstract: In the Demand Strip Packing problem (DSP), we are given a time interval and a collection of tasks, each characterized by a processing time and a demand for a given resource (such as electricity, computational power, etc.). A feasible solution consists of a schedule of the tasks within the mentioned time interval. Our goal is to minimize the peak resource consumption, i.e. the maximum total demand of tasks executed at any point in time. It is known that DSP is NP-hard to approximate below a factor 3/2, and standard techniques for related problems imply a (polynomial-time) 2-approximation. Our main result is a (5/3+eps)-approximation algorithm for any constant eps>0. We also achieve best-possible approximation factors for some relevant special cases.

Submitted 19 May, 2021; v1 submitted 18 May, 2021; originally announced May 2021.

Comments: Submitted to The 24th International Conference on Approximation Algorithms for Combinatorial Optimization Problems (APPROX 2021)

ACM Class: F.2.2

arXiv:2004.07229

doi 10.1073/pnas.2025581118

Network Medicine Framework for Identifying Drug Repurposing Opportunities for COVID-19

Authors: Deisy Morselli Gysi, Ítalo Do Valle, Marinka Zitnik, Asher Ameli, Xiao Gan, Onur Varol, Susan Dina Ghiassian, JJ Patten, Robert Davey, Joseph Loscalzo, Albert-László Barabási

Abstract: The current pandemic has highlighted the need for methodologies that can quickly and reliably prioritize clinically approved compounds for their potential effectiveness for SARS-CoV-2 infections. In the past decade, network medicine has developed and validated multiple predictive algorithms for drug repurposing, exploiting the sub-cellular network-based relationship between a drug's targets and disease genes. Here, we deployed algorithms relying on artificial intelligence, network diffusion, and network proximity, tasking each of them to rank 6,340 drugs for their expected efficacy against SARS-CoV-2. To test the predictions, we used as ground truth 918 drugs that had been experimentally screened in VeroE6 cells, and the list of drugs under clinical trial, that capture the medical community's assessment of drugs with potential COVID-19 efficacy. We find that while most algorithms offer predictive power for these ground truth data, no single method offers consistently reliable outcomes across all datasets and metrics. This prompted us to develop a multimodal approach that fuses the predictions of all algorithms, showing that a consensus among the different predictive methods consistently exceeds the performance of the best individual pipelines. We find that 76 of the 77 drugs that successfully reduced viral infection do not bind the proteins targeted by SARS-CoV-2, indicating that these drugs rely on network-based actions that cannot be identified using docking-based strategies. These advances offer a methodological pathway to identify repurposable drugs for future pathogens and neglected diseases underserved by the costs and extended timeline of de novo drug development.

Submitted 9 August, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

arXiv:1911.02259

Breaching the 2-Approximation Barrier for Connectivity Augmentation: a Reduction to Steiner Tree

Authors: Jarosław Byrka, Fabrizio Grandoni, Afrouz Jabal Ameli

Abstract: The basic goal of survivable network design is to build a cheap network that maintains the connectivity between given sets of nodes despite the failure of a few edges/nodes. The Connectivity Augmentation Problem (CAP) is arguably one of the most basic problems in this area: given a $k$(-edge)-connected graph $G$ and a set of extra edges (links), select a minimum cardinality subset $A$ of links such that adding $A$ to $G$ increases its edge connectivity to $k+1$. Intuitively, one wants to make an existing network more reliable by augmenting it with extra edges. The best known approximation factor for this NP-hard problem is $2$, and this can be achieved with multiple approaches (the first such result is in [Frederickson and Jájá'81]). It is known [Dinitz et al.'76] that CAP can be reduced to the case $k=1$, a.k.a. the Tree Augmentation Problem (TAP), for odd $k$, and to the case $k=2$, a.k.a. the Cactus Augmentation Problem (CacAP), for even $k$. Several better than $2$ approximation algorithms are known for TAP, culminating with a recent $1.458$ approximation [Grandoni et al.'18]. However, for CacAP the best known approximation is $2$. In this paper we breach the $2$ approximation barrier for CacAP, hence for CAP, by presenting a polynomial-time $2\ln(4)-\frac{967}{1120}+\varepsilon<1.91$ approximation. Previous approaches exploit properties of TAP that do not seem to generalize to CacAP. We instead use a reduction to the Steiner tree problem which was previously used in parameterized algorithms [Basavaraju et al.'14]. This reduction is not approximation preserving, and using the current best approximation factor for Steiner tree [Byrka et al.'13] as a black-box would not be good enough to improve on $2$. To achieve the latter goal, we "open the box" and exploit the specific properties of the instances of Steiner tree arising from CacAP.

Submitted 7 November, 2019; v1 submitted 6 November, 2019; originally announced November 2019.

Comments: Corrected a typo in the abstract (in metadata)

MSC Class: 68W25; 68Q25; 05C85

Showing 1–11 of 11 results for author: Ameli, A