Search | arXiv e-print repository

A Novel Framework for Automated Warehouse Layout Generation

Authors: Atefeh Shahroudnejad, Payam Mousavi, Oleksii Perepelytsia, Sahir, David Staszak, Matthew E. Taylor, Brent Bawel

Abstract: Optimizing warehouse layouts is crucial due to its significant impact on efficiency and productivity. We present an AI-driven framework for automated warehouse layout generation. This framework employs constrained beam search to derive optimal layouts within given spatial parameters, adhering to all functional requirements. The feasibility of the generated layouts is verified based on criteria suc… ▽ More Optimizing warehouse layouts is crucial due to its significant impact on efficiency and productivity. We present an AI-driven framework for automated warehouse layout generation. This framework employs constrained beam search to derive optimal layouts within given spatial parameters, adhering to all functional requirements. The feasibility of the generated layouts is verified based on criteria such as item accessibility, required minimum clearances, and aisle connectivity. A scoring function is then used to evaluate the feasible layouts considering the number of storage locations, access points, and accessibility costs. We demonstrate our method's ability to produce feasible, optimal layouts for a variety of warehouse dimensions and shapes, diverse door placements, and interconnections. This approach, currently being prepared for deployment, will enable human designers to rapidly explore and confirm options, facilitating the selection of the most appropriate layout for their use-case. △ Less

Submitted 11 July, 2024; originally announced July 2024.

arXiv:2302.07170 [pdf, ps, other]

On the degree-Kirchhoff index, Gutman index and the Schultz index of pentagonal cylinder/ Möbius chain

Authors: Md. Abdus Sahir, Sk. Md. Abu Nayeem

Abstract: The degree-Kirchhoff index of a graph is given by the sum of inverses of non-zero eigenvalues of the normalized Laplacian matrix of the graph multiplied with the total degree of the graph. Explicit formulas for the degree-Kirchhoff index of various types of cylinder chain and Möbius chain have been obtained by many researchers in the recent past. In the present paper, we obtain closed-form formula… ▽ More The degree-Kirchhoff index of a graph is given by the sum of inverses of non-zero eigenvalues of the normalized Laplacian matrix of the graph multiplied with the total degree of the graph. Explicit formulas for the degree-Kirchhoff index of various types of cylinder chain and Möbius chain have been obtained by many researchers in the recent past. In the present paper, we obtain closed-form formulas for the degree-Kirchhoff index of pentagonal cylinder/ Möbius chain. Also we find here the Gutman index and the Schultz index for those graphs. △ Less

Submitted 14 February, 2023; originally announced February 2023.

MSC Class: Primary: 05C09; 05C12; Secondary: 05C50

arXiv:2301.06535 [pdf, other]

doi 10.1016/j.mlwa.2024.100535

Case-Base Neural Networks: survival analysis with time-varying, higher-order interactions

Authors: Jesse Islam, Maxime Turgeon, Robert Sladek, Sahir Bhatnagar

Abstract: In the context of survival analysis, data-driven neural network-based methods have been developed to model complex covariate effects. While these methods may provide better predictive performance than regression-based approaches, not all can model time-varying interactions and complex baseline hazards. To address this, we propose Case-Base Neural Networks (CBNNs) as a new approach that combines th… ▽ More In the context of survival analysis, data-driven neural network-based methods have been developed to model complex covariate effects. While these methods may provide better predictive performance than regression-based approaches, not all can model time-varying interactions and complex baseline hazards. To address this, we propose Case-Base Neural Networks (CBNNs) as a new approach that combines the case-base sampling framework with flexible neural network architectures. Using a novel sampling scheme and data augmentation to naturally account for censoring, we construct a feed-forward neural network that includes time as an input. CBNNs predict the probability of an event occurring at a given moment to estimate the full hazard function. We compare the performance of CBNNs to regression and neural network-based survival methods in a simulation and three case studies using two time-dependent metrics. First, we examine performance on a simulation involving a complex baseline hazard and time-varying interactions to assess all methods, with CBNN outperforming competitors. Then, we apply all methods to three real data applications, with CBNNs outperforming the competing models in two studies and showing similar performance in the third. Our results highlight the benefit of combining case-base sampling with deep learning to provide a simple and flexible framework for data-driven modeling of single event survival outcomes that estimates time-varying effects and a complex baseline hazard by design. An R package is available at https://github.com/Jesse-Islam/cbnn. △ Less

Submitted 9 January, 2024; v1 submitted 16 January, 2023; originally announced January 2023.

arXiv:2206.12267 [pdf, other]

Efficient Penalized Generalized Linear Mixed Models for Variable Selection and Genetic Risk Prediction in High-Dimensional Data

Authors: Julien St-Pierre, Karim Oualkacha, Sahir Rai Bhatnagar

Abstract: Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an attractive alternative to principal components (PC) adjustment to account for population structure and relatedness in high-dimensional penalized models. However… ▽ More Sparse regularized regression methods are now widely used in genome-wide association studies (GWAS) to address the multiple testing burden that limits discovery of potentially important predictors. Linear mixed models (LMMs) have become an attractive alternative to principal components (PC) adjustment to account for population structure and relatedness in high-dimensional penalized models. However, their use in binary trait GWAS rely on the invalid assumption that the residual variance does not depend on the estimated regression coefficients. Moreover, LMMs use a single spectral decomposition of the covariance matrix of the responses, which is no longer possible in generalized linear mixed models (GLMMs). We introduce a new method called pglmm, a penalized GLMM that allows to simultaneously select genetic markers and estimate their effects, accounting for between-individual correlations and binary nature of the trait. We develop a computationally efficient algorithm based on PQL estimation that allows to scale regularized mixed models on high-dimensional binary trait GWAS (~300,000 SNPs). We show through simulations that penalized LMM and logistic regression with PC adjustment fail to correctly select important predictors and/or that prediction accuracy decreases for a binary response when the dimensionality of the relatedness matrix is high compared to pglmm. Further, we demonstrate through the analysis of two polygenic binary traits in the UK Biobank data that our method can achieve higher predictive performance, while also selecting fewer predictors than a sparse regularized logistic lasso with PC adjustment. Our method is available as a Julia package PenalizedGLMM.jl. △ Less

Submitted 24 June, 2022; originally announced June 2022.

Comments: 26 pages, 5 figures

arXiv:2205.13609 [pdf, ps, other]

Variable Selection for Individualized Treatment Rules with Discrete Outcomes

Authors: Zeyu Bian, Erica EM Moodie, Susan M Shortreed, Sylvie D Lambert, Sahir Bhatnagar

Abstract: An individualized treatment rule (ITR) is a decision rule that aims to improve individual patients health outcomes by recommending optimal treatments according to patients specific information. In observational studies, collected data may contain many variables that are irrelevant for making treatment decisions. Including all available variables in the statistical model for the ITR could yield a l… ▽ More An individualized treatment rule (ITR) is a decision rule that aims to improve individual patients health outcomes by recommending optimal treatments according to patients specific information. In observational studies, collected data may contain many variables that are irrelevant for making treatment decisions. Including all available variables in the statistical model for the ITR could yield a loss of efficiency and an unnecessarily complicated treatment rule, which is difficult for physicians to interpret or implement. Thus, a data-driven approach to select important tailoring variables with the aim of improving the estimated decision rules is crucial. While there is a growing body of literature on selecting variables in ITRs with continuous outcomes, relatively few methods exist for discrete outcomes, which pose additional computational challenges even in the absence of variable selection. In this paper, we propose a variable selection method for ITRs with discrete outcomes. We show theoretically and empirically that our approach has the double robustness property, and that it compares favorably with other competing approaches. We illustrate the proposed method on data from a study of an adaptive web-based stress management tool to identify which variables are relevant for tailoring treatment. △ Less

Submitted 29 September, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

arXiv:2204.07254 [pdf, other]

Methodical Advice Collection and Reuse in Deep Reinforcement Learning

Authors: Sahir, Ercüment İlhan, Srijita Das, Matthew E. Taylor

Abstract: Reinforcement learning (RL) has shown great success in solving many challenging tasks via use of deep neural networks. Although using deep learning for RL brings immense representational power, it also causes a well-known sample-inefficiency problem. This means that the algorithms are data-hungry and require millions of training samples to converge to an adequate policy. One way to combat this iss… ▽ More Reinforcement learning (RL) has shown great success in solving many challenging tasks via use of deep neural networks. Although using deep learning for RL brings immense representational power, it also causes a well-known sample-inefficiency problem. This means that the algorithms are data-hungry and require millions of training samples to converge to an adequate policy. One way to combat this issue is to use action advising in a teacher-student framework, where a knowledgeable teacher provides action advice to help the student. This work considers how to better leverage uncertainties about when a student should ask for advice and if the student can model the teacher to ask for less advice. The student could decide to ask for advice when it is uncertain or when both it and its model of the teacher are uncertain. In addition to this investigation, this paper introduces a new method to compute uncertainty for a deep RL agent using a secondary neural network. Our empirical results show that using dual uncertainties to drive advice collection and reuse may improve learning performance across several Atari games. △ Less

Submitted 14 April, 2022; originally announced April 2022.

Comments: To be published in ALA2022: Adaptive and Learning Agents Workshop 2022 at AAMAS

arXiv:2201.10858 [pdf, ps, other]

On Kirchhoff index and number of spanning trees of linear pentagonal cylinder and Mobius chain graph

Authors: Md. Abdus Sahir, Sk. Md. Abu Nayeem

Abstract: In this paper, we derive closed-form formulas for Kirchhoff index and Wiener index of linear pentagonal cylinder graph and linear pentagonal Mobius chain graph. We also obtain explicit formulas for finding total number of spanning trees for both the graphs. In this paper, we derive closed-form formulas for Kirchhoff index and Wiener index of linear pentagonal cylinder graph and linear pentagonal Mobius chain graph. We also obtain explicit formulas for finding total number of spanning trees for both the graphs. △ Less

Submitted 26 January, 2022; originally announced January 2022.

MSC Class: 05C09

arXiv:2111.14160 [pdf, other]

Learning To Segment Dominant Object Motion From Watching Videos

Authors: Sahir Shrestha, Mohammad Ali Armin, Hongdong Li, Nick Barnes

Abstract: Existing deep learning based unsupervised video object segmentation methods still rely on ground-truth segmentation masks to train. Unsupervised in this context only means that no annotated frames are used during inference. As obtaining ground-truth segmentation masks for real image scenes is a laborious task, we envision a simple framework for dominant moving object segmentation that neither requ… ▽ More Existing deep learning based unsupervised video object segmentation methods still rely on ground-truth segmentation masks to train. Unsupervised in this context only means that no annotated frames are used during inference. As obtaining ground-truth segmentation masks for real image scenes is a laborious task, we envision a simple framework for dominant moving object segmentation that neither requires annotated data to train nor relies on saliency priors or pre-trained optical flow maps. Inspired by a layered image representation, we introduce a technique to group pixel regions according to their affine parametric motion. This enables our network to learn segmentation of the dominant foreground object using only RGB image pairs as input for both training and inference. We establish a baseline for this novel task using a new MovingCars dataset and show competitive performance against recent methods that require annotated masks to train. △ Less

Submitted 28 November, 2021; originally announced November 2021.

Comments: DICTA 2021

arXiv:2109.08267 [pdf, other]

CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research

Authors: Chris Cummins, Bram Wasti, Jiadong Guo, Brandon Cui, Jason Ansel, Sahir Gomez, Somya Jain, Jia Liu, Olivier Teytaud, Benoit Steiner, Yuandong Tian, Hugh Leather

Abstract: Interest in applying Artificial Intelligence (AI) techniques to compiler optimizations is increasing rapidly, but compiler research has a high entry barrier. Unlike in other domains, compiler and AI researchers do not have access to the datasets and frameworks that enable fast iteration and development of ideas, and getting started requires a significant engineering investment. What is needed is a… ▽ More Interest in applying Artificial Intelligence (AI) techniques to compiler optimizations is increasing rapidly, but compiler research has a high entry barrier. Unlike in other domains, compiler and AI researchers do not have access to the datasets and frameworks that enable fast iteration and development of ideas, and getting started requires a significant engineering investment. What is needed is an easy, reusable experimental infrastructure for real world compiler optimization tasks that can serve as a common benchmark for comparing techniques, and as a platform to accelerate progress in the field. We introduce CompilerGym, a set of environments for real world compiler optimization tasks, and a toolkit for exposing new optimization tasks to compiler researchers. CompilerGym enables anyone to experiment on production compiler optimization problems through an easy-to-use package, regardless of their experience with compilers. We build upon the popular OpenAI Gym interface enabling researchers to interact with compilers using Python and a familiar API. We describe the CompilerGym architecture and implementation, characterize the optimization spaces and computational efficiencies of three included compiler environments, and provide extensive empirical evaluations. Compared to prior works, CompilerGym offers larger datasets and optimization spaces, is 27x more computationally efficient, is fault-tolerant, and capable of detecting reproducibility bugs in the underlying compilers. In making it easy for anyone to experiment with compilers - irrespective of their background - we aim to accelerate progress in the AI and compiler research domains. △ Less

Submitted 22 December, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

Comments: 12 pages. Source code available at https://github.com/facebookresearch/CompilerGym

arXiv:2101.07359 [pdf, other]

Variable Selection in Regression-based Estimation of Dynamic Treatment Regimes

Authors: Zeyu Bian, Erica EM Moodie, Susan M Shortreed, Sahir Bhatnagar

Abstract: Dynamic treatment regimes (DTRs) consist of a sequence of decision rules, one per stage of intervention, that finds effective treatments for individual patients according to patient information history. DTRs can be estimated from models which include the interaction between treatment and a small number of covariates which are often chosen a priori. However, with increasingly large and complex data… ▽ More Dynamic treatment regimes (DTRs) consist of a sequence of decision rules, one per stage of intervention, that finds effective treatments for individual patients according to patient information history. DTRs can be estimated from models which include the interaction between treatment and a small number of covariates which are often chosen a priori. However, with increasingly large and complex data being collected, it is difficult to know which prognostic factors might be relevant in the treatment rule. Therefore, a more data-driven approach of selecting these covariates might improve the estimated decision rules and simplify models to make them easier to interpret. We propose a variable selection method for DTR estimation using penalized dynamic weighted least squares. Our method has the strong heredity property, that is, an interaction term can be included in the model only if the corresponding main terms have also been selected. Through simulations, we show our method has both the double robustness property and the oracle property, and the newly proposed methods compare favorably with other variable selection approaches. △ Less

Submitted 3 December, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

arXiv:2010.03744 [pdf, other]

Maximum Reward Formulation In Reinforcement Learning

Authors: Sai Krishna Gottipati, Yashaswi Pathak, Rohan Nuttall, Sahir, Raviteja Chunduru, Ahmed Touati, Sriram Ganapathi Subramanian, Matthew E. Taylor, Sarath Chandar

Abstract: Reinforcement learning (RL) algorithms typically deal with maximizing the expected cumulative return (discounted or undiscounted, finite or infinite horizon). However, several crucial applications in the real world, such as drug discovery, do not fit within this framework because an RL agent only needs to identify states (molecules) that achieve the highest reward within a trajectory and does not… ▽ More Reinforcement learning (RL) algorithms typically deal with maximizing the expected cumulative return (discounted or undiscounted, finite or infinite horizon). However, several crucial applications in the real world, such as drug discovery, do not fit within this framework because an RL agent only needs to identify states (molecules) that achieve the highest reward within a trajectory and does not need to optimize for the expected cumulative return. In this work, we formulate an objective function to maximize the expected maximum reward along a trajectory, derive a novel functional form of the Bellman equation, introduce the corresponding Bellman operators, and provide a proof of convergence. Using this formulation, we achieve state-of-the-art results on the task of molecule generation that mimics a real-world drug discovery pipeline. △ Less

Submitted 18 December, 2023; v1 submitted 7 October, 2020; originally announced October 2020.

Comments: 14 pages, 5 figures Update based on reviewer feedback

arXiv:2009.10629 [pdf, ps, other]

doi 10.1007/s11222-023-10371-8

Accelerated Gradient Methods for Sparse Statistical Learning with Nonconvex Penalties

Authors: Kai Yang, Masoud Asgharian, Sahir Bhatnagar

Abstract: Nesterov's accelerated gradient (AG) is a popular technique to optimize objective functions comprising two components: a convex loss and a penalty function. While AG methods perform well for convex penalties, such as the LASSO, convergence issues may arise when it is applied to nonconvex penalties, such as SCAD. A recent proposal generalizes Nesterov's AG method to the nonconvex setting. The propo… ▽ More Nesterov's accelerated gradient (AG) is a popular technique to optimize objective functions comprising two components: a convex loss and a penalty function. While AG methods perform well for convex penalties, such as the LASSO, convergence issues may arise when it is applied to nonconvex penalties, such as SCAD. A recent proposal generalizes Nesterov's AG method to the nonconvex setting. The proposed algorithm requires specification of several hyperparameters for its practical application. Aside from some general conditions, there is no explicit rule for selecting the hyperparameters, and how different selection can affect convergence of the algorithm. In this article, we propose a hyperparameter setting based on the complexity upper bound to accelerate convergence, and consider the application of this nonconvex AG algorithm to high-dimensional linear and logistic sparse learning problems. We further establish the rate of convergence and present a simple and useful bound to characterize our proposed optimal dam** sequence. Simulation studies show that convergence can be made, on average, considerably faster than that of the conventional proximal gradient algorithm. Our experiments also show that the proposed method generally outperforms the current state-of-the-art methods in terms of signal recovery. △ Less

Submitted 28 November, 2022; v1 submitted 22 September, 2020; originally announced September 2020.

Comments: 42 pages, 13 figures

Journal ref: Stat Comput 34, 59 (2024)

arXiv:2009.10264 [pdf, other]

casebase: An Alternative Framework For Survival Analysis and Comparison of Event Rates

Authors: Sahir Rai Bhatnagar, Maxime Turgeon, Jesse Islam, James A. Hanley, Olli Saarela

Abstract: In epidemiological studies of time-to-event data, a quantity of interest to the clinician and the patient is the risk of an event given a covariate profile. However, methods relying on time matching or risk-set sampling (including Cox regression) eliminate the baseline hazard from the likelihood expression or the estimating function. The baseline hazard then needs to be estimated separately using… ▽ More In epidemiological studies of time-to-event data, a quantity of interest to the clinician and the patient is the risk of an event given a covariate profile. However, methods relying on time matching or risk-set sampling (including Cox regression) eliminate the baseline hazard from the likelihood expression or the estimating function. The baseline hazard then needs to be estimated separately using a non-parametric approach. This leads to step-wise estimates of the cumulative incidence that are difficult to interpret. Using case-base sampling, Hanley & Miettinen (2009) explained how the parametric hazard functions can be estimated using logistic regression. Their approach naturally leads to estimates of the cumulative incidence that are smooth-in-time. In this paper, we present the casebase R package, a comprehensive and flexible toolkit for parametric survival analysis. We describe how the case-base framework can also be used in more complex settings: competing risks, time-varying exposure, and variable selection. Our package also includes an extensive array of visualization tools to complement the analysis of time-to-event data. We illustrate all these features through four different case studies. *SRB and MT contributed equally to this work. △ Less

Submitted 21 September, 2020; originally announced September 2020.

Comments: 31 pages, 10 figures

arXiv:1108.2438 [pdf]

doi 10.1016/j.electacta.2009.06.068

Measuring Oxygen, Carbon Monoxide and Hydrogen Sulfide Diffusion Coefficient and Solubility in Nafion Membranes

Authors: Vijay A. Sethuraman, Saahir Khan, Jesse S. Jur, Andrew T. Haug, John W. Weidner

Abstract: A Devanathan-Stachurski type diffusion cell made from a fuel cell assembly is designed to evaluate the gas transport properties of a proton exchange membrane as a function of cell temperature and gas pressure. Data obtained on this cell using the electrochemical monitoring technique (EMT) is used to estimate solubility and diffusion coefficient of oxygen (O2), carbon monoxide (CO) and hydrogen sul… ▽ More A Devanathan-Stachurski type diffusion cell made from a fuel cell assembly is designed to evaluate the gas transport properties of a proton exchange membrane as a function of cell temperature and gas pressure. Data obtained on this cell using the electrochemical monitoring technique (EMT) is used to estimate solubility and diffusion coefficient of oxygen (O2), carbon monoxide (CO) and hydrogen sulfide (H2S) in Nafion membranes. Membrane swelling and reverse-gas diffusion due to water flux are accounted for in the parameter estimation procedure. Permeability of all three gases was found to increase with temperature. The estimated activation energies for O2, CO and H2S diffusion in Nafion 112 are 12.58, 20 and 8.85 kJ mol^-1, respectively. The estimated enthalpies of mixing for O2, CO and H2S in Nafion 112 are 5.88, 3.74 and 7.61 kJ mol^-1, respectively. An extensive comparison of transport properties estimated in this study to those reported in the literature suggests good agreement. Oxygen permeability in Nafion 117 was measured as a function of gas pressures between 1 and 3 atm. Oxygen diffusion coefficient in Nafion 117 is invariant with pressure and the solubility increases with pressure and obeys Henry's law. The estimated Henry's constant is 3.5 x 10^3 atm. △ Less

Submitted 11 August, 2011; originally announced August 2011.

Comments: 27 pages, 16 figures

Journal ref: Electrochimica Acta, 54(27), 6850-6860, 2009

Showing 1–14 of 14 results for author: Sahir