-
Trained Random Forests Completely Reveal your Dataset
Authors:
Julien Ferry,
Ricardo Fukasawa,
Timothée Pascal,
Thibaut Vidal
Abstract:
We introduce an optimization-based reconstruction attack capable of completely or near-completely reconstructing a dataset utilized for training a random forest. Notably, our approach relies solely on information readily available in commonly used libraries such as scikit-learn. To achieve this, we formulate the reconstruction problem as a combinatorial problem under a maximum likelihood objective. We demonstrate that this problem is NP-hard, though solvable at scale using constraint programming -- an approach rooted in constraint propagation and solution-domain reduction. Through an extensive computational investigation, we demonstrate that random forests trained without bootstrap aggregation but with feature randomization are susceptible to a complete reconstruction. This holds true even with a small number of trees. Even with bootstrap aggregation, the majority of the data can also be reconstructed. These findings underscore a critical vulnerability inherent in widely adopted ensemble methods, warranting attention and mitigation. Although the potential for such reconstruction attacks has been discussed in privacy research, our study provides clear empirical evidence of their practicability.
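As a rough illustration of the "information readily available" that the abstract mentions, the sketch below lists structural data that a fitted scikit-learn forest exposes for each tree: split features, thresholds, child links, and the number of training samples reaching each node. It does not implement the paper's attack or its constraint programming model. The toy data, the hyperparameters, and the helper name `describe_tree` are assumptions made for this example only.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Toy binary dataset (an assumption for illustration; not data from the paper).
rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(40, 5))
y = rng.integers(0, 2, size=40)

# No bootstrap aggregation, but feature randomization at each split,
# mirroring the setting the abstract reports as most exposed.
forest = RandomForestClassifier(
    n_estimators=3, bootstrap=False, max_features="sqrt", random_state=0
).fit(X, y)


def describe_tree(estimator):
    """Hypothetical helper: collect the public per-node arrays of one fitted tree."""
    t = estimator.tree_
    return {
        "feature": t.feature.copy(),                # split feature index (-2 at leaves)
        "threshold": t.threshold.copy(),            # split threshold per node
        "children_left": t.children_left.copy(),    # -1 marks a leaf
        "children_right": t.children_right.copy(),
        "n_node_samples": t.n_node_samples.copy(),  # training samples reaching each node
    }


summaries = [describe_tree(est) for est in forest.estimators_]

for i, s in enumerate(summaries):
    leaves = s["children_left"] == -1
    print(f"tree {i}: {leaves.sum()} leaves, root sees {s['n_node_samples'][0]} samples")
```

With `bootstrap=False`, every tree is fit on the full training set, so the per-node counts of each tree describe the same examples. A reconstruction approach could treat such counts as constraints on the unknown data, although how the paper actually encodes them is not shown here.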
Submitted 29 February, 2024;
originally announced February 2024.
-
Optimized Cranial Bandeau Remodeling
Authors:
James Drake,
Marina Drygala,
Ricardo Fukasawa,
Jochen Koenemann,
Andre Linhares,
Thomas Looi,
John Phillips,
David Qian,
Nikoo Saber,
Justin Toth,
Chris Woodbeck,
Jessie Yeung
Abstract:
Craniosynostosis, a condition affecting 1 in 2000 infants, is caused by premature fusing of cranial vault sutures and manifests itself in abnormal skull growth patterns. Left untreated, the condition may lead to severe developmental impairment. Standard practice is to apply corrective cranial bandeau remodeling surgery in the first year of the infant's life. The most frequent type of surgery involves removing the so-called fronto-orbital bar from the patient's forehead and making well-placed incisions to reshape it in order to obtain the desired result. In this paper, we propose a precise optimization model for the above cranial bandeau remodeling problem and its variants. We have developed efficient algorithms that solve the best incision placement problem, and we show hardness for more general cases in the class. To the best of our knowledge, this paper is the first to introduce optimization models for craniofacial surgery applications.
Submitted 12 March, 2021; v1 submitted 22 December, 2019;
originally announced December 2019.
-
A joint routing and speed optimization problem
Authors:
Ricardo Fukasawa,
Qie He,
Fernando Santos,
Yongjia Song
Abstract:
Fuel cost contributes to a significant portion of operating cost in cargo transportation. Though classic routing models usually treat fuel cost as input data, fuel consumption heavily depends on the travel speed, which has led to the study of optimizing speeds over a given fixed route. In this paper, we propose a joint routing and speed optimization problem to minimize the total cost, which includes the fuel consumption cost. The only assumption made on the dependence between the fuel cost and travel speed is that it is a strictly convex differentiable function. This problem is very challenging, with medium-sized instances already difficult for a general mixed-integer convex optimization solver. We propose a novel set partitioning formulation and a branch-cut-and-price algorithm to solve this problem. Our algorithm clearly outperforms the off-the-shelf optimization solver, and is able to solve some benchmark instances to optimality for the first time.
Submitted 29 May, 2017; v1 submitted 26 February, 2016;
originally announced February 2016.
-
On the mixing set with a knapsack constraint
Authors:
Ahmad Abdi,
Ricardo Fukasawa
Abstract:
The mixing set with a knapsack constraint arises as a substructure in mixed-integer programming reformulations of chance-constrained programs with stochastic right-hand sides over a finite discrete distribution. Recently, Luedtke et al. (2010) and Küçükyavuz (2012) studied valid inequalities for such sets. However, most of their results focused on the equal probabilities case (equivalently, when the knapsack reduces to a cardinality constraint), with only minor results in the general case. In this paper, we focus on the general probabilities case (general knapsack constraint). We characterize the valid inequalities that do not come from the knapsack polytope and use this characterization to generalize the inequalities previously derived for the equal probabilities case. We also show that one can separate over a large class of inequalities in polynomial time.
Submitted 4 July, 2012;
originally announced July 2012.