-
Efficient Observation Time Window Segmentation for Administrative Data Machine Learning
Authors:
Musa Taib,
Geoffrey G. Messier
Abstract:
Machine learning models benefit when allowed to learn from temporal trends in time-stamped administrative data. These trends can be represented by dividing a model's observation window into time segments or bins. Model training time and performance can be improved by representing each feature with a different time resolution. However, this causes the time bin size hyperparameter search space to gr…
▽ More
Machine learning models benefit when allowed to learn from temporal trends in time-stamped administrative data. These trends can be represented by dividing a model's observation window into time segments or bins. Model training time and performance can be improved by representing each feature with a different time resolution. However, this causes the time bin size hyperparameter search space to grow exponentially with the number of features. The contribution of this paper is to propose a computationally efficient time series analysis to investigate binning (TAIB) technique that determines which subset of data features benefit the most from time bin size hyperparameter tuning. This technique is demonstrated using hospital and housing/homelessness administrative data sets. The results show that TAIB leads to models that are not only more efficient to train but can perform better than models that default to representing all features with the same time bin size.
△ Less
Submitted 12 March, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
The Human Behind the Data: Reflections from an Ongoing Co-Design and Deployment of a Data-Navigation Interface for Front-Line Emergency Housing Shelter Staff
Authors:
Teale W Masrani,
Helen Ai He,
Geoffrey Messier
Abstract:
On any night in Canada, at least 35,000 individuals experience homelessness. These individuals use emergency shelters to transition out of homelessness and into permanent housing. We designed and deployed a technology to support front-line staff at the largest emergency housing shelter in Calgary, Canada. Over a period of five months in 2022, we worked closely with front-line staff to co-design an…
▽ More
On any night in Canada, at least 35,000 individuals experience homelessness. These individuals use emergency shelters to transition out of homelessness and into permanent housing. We designed and deployed a technology to support front-line staff at the largest emergency housing shelter in Calgary, Canada. Over a period of five months in 2022, we worked closely with front-line staff to co-design an interface for supporting a holistic understanding of client context and facilitating decision-making. The tool is currently in-use and our collaboration is ongoing. In this paper, we reflect on preliminary findings regarding the second iteration of the tool. We find that supporting shelter staff in understanding the human behind the data was a critical component of design. This work contributes to literature on how data tools may be integrated into homeless shelters in a way that aligns with shelters' values.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
A Graph Analysis of the Impact of COVID-19 on Emergency Housing Shelter Access Patterns
Authors:
Geoffrey G. Messier
Abstract:
This paper investigates how COVID-19 disrupted emergency housing shelter access patterns in Calgary, Canada and what aspects of these changes persist to the present day. This analysis will utilize aggregated shelter access data for over 40,000 individuals from seven major urban shelters dating from 2018 to the present. A graph theoretic approach will be used to examine the journeys of individuals…
▽ More
This paper investigates how COVID-19 disrupted emergency housing shelter access patterns in Calgary, Canada and what aspects of these changes persist to the present day. This analysis will utilize aggregated shelter access data for over 40,000 individuals from seven major urban shelters dating from 2018 to the present. A graph theoretic approach will be used to examine the journeys of individuals between shelters before, during and after the COVID-19 lockdown period. This approach treats shelters as nodes in a graph and a person's transition between shelter as an arrow or edge between nodes. This perspective is used to create both timeline and network diagrams that visualize shelter use and the flow of people between shelters. Statistical results are also presented that illustrate the differences between the cohorts of people who only used shelter pre/post-lockdown, people who stayed in shelter during lockdown and people who used shelter for the first time during lockdown. The results demonstrate not only how a complex system of care responded to the pandemic but also the characteristics of the people most likely to continue to rely on that system during an emergency.
△ Less
Submitted 15 August, 2023;
originally announced August 2023.
-
The Effect of Epidemiological Cohort Creation on the Machine Learning Prediction of Homelessness and Police Interaction Outcomes Using Administrative Health Care Data
Authors:
Faezehsadat Shahidi,
M. Ethan MacDonald,
Dallas Seitz,
Geoffrey Messier
Abstract:
Background: Mental illness can lead to adverse outcomes such as homelessness and police interaction and understanding of the events leading up to these adverse outcomes is important. Predictive models may help identify individuals at risk of such adverse outcomes. Using a fixed observation window cohort with logistic regression (LR) or machine learning (ML) models can result in lower performance w…
▽ More
Background: Mental illness can lead to adverse outcomes such as homelessness and police interaction and understanding of the events leading up to these adverse outcomes is important. Predictive models may help identify individuals at risk of such adverse outcomes. Using a fixed observation window cohort with logistic regression (LR) or machine learning (ML) models can result in lower performance when compared with adaptive and parcellated windows.
Method: An administrative healthcare dataset was used, comprising of 240,219 individuals in Calgary, Alberta, Canada who were diagnosed with addiction or mental health (AMH) between April 1, 2013, and March 31, 2018. The cohort was followed for 2 years to identify factors associated with homelessness and police interactions. To understand the benefit of flexible windows to predictive models, an alternative cohort was created. Then LR and ML models, including random forests (RF), and extreme gradient boosting (XGBoost) were compared in the two cohorts.
Results: Among 237,602 individuals, 0.8% (1,800) experienced first homelessness, while 0.32% (759) reported initial police interaction among 237,141 individuals. Male sex (AORs: H=1.51, P=2.52), substance disorder (AORs: H=3.70, P=2.83), psychiatrist visits (AORs: H=1.44, P=1.49), and drug abuse (AORs: H=2.67, P=1.83) were associated with initial homelessness (H) and police interaction (P). XGBoost showed superior performance using the flexible method (sensitivity =91%, AUC =90% for initial homelessness, and sensitivity =90%, AUC=89% for initial police interaction)
Conclusion: This study identified key features associated with initial homelessness and police interaction and demonstrated that flexible windows can improve predictive modeling.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
-
A Simpler Method for Understanding Emergency Shelter Access Patterns
Authors:
Geoffrey G. Messier
Abstract:
The Simplified Access Metric (SAM) is a new approach for characterizing emergency shelter access patterns as a measure of shelter client vulnerability. The goal of SAM is to provide shelter operators with an intuitive way to understand access patterns that can be implemented by non-technical staff using spreadsheet operations. Client data from a large North American shelter will be used to demonst…
▽ More
The Simplified Access Metric (SAM) is a new approach for characterizing emergency shelter access patterns as a measure of shelter client vulnerability. The goal of SAM is to provide shelter operators with an intuitive way to understand access patterns that can be implemented by non-technical staff using spreadsheet operations. Client data from a large North American shelter will be used to demonstrate that SAM produces similar results to traditional transitional, episodic and chronic client cluster analysis. Since SAM requires less data than cluster analysis, it is also able to generate a real time picture of how shelter access patterns are affected by external factors. Timelines generated from nine years of shelter client data using SAM demonstrate the impact of Housing First programming and the COVID-19 lockdown on how people access shelter. Finally, SAM allows shelter staff to move beyond assigning transitional, episodic and chronic labels and instead use the "soft" output of SAM directly as a measure of vulnerability.
△ Less
Submitted 24 March, 2023; v1 submitted 24 October, 2022;
originally announced October 2022.
-
A Rule Search Framework for the Early Identification of Chronic Emergency Homeless Shelter Clients
Authors:
Caleb John,
Geoffrey G. Messier
Abstract:
This paper uses rule search techniques for the early identification of emergency homeless shelter clients who are at risk of becoming long term or chronic shelter users. Using a data set from a major North American shelter containing 12 years of service interactions with over 40,000 individuals, the optimized pruning for unordered search (OPUS) algorithm is used to develop rules that are both intu…
▽ More
This paper uses rule search techniques for the early identification of emergency homeless shelter clients who are at risk of becoming long term or chronic shelter users. Using a data set from a major North American shelter containing 12 years of service interactions with over 40,000 individuals, the optimized pruning for unordered search (OPUS) algorithm is used to develop rules that are both intuitive and effective. The rules are evaluated within a framework compatible with the real-time delivery of a housing program meant to transition high risk clients to supportive housing. Results demonstrate that the median time to identification of clients at risk of chronic shelter use drops from 297 days to 162 days when the methods in this paper are applied.
△ Less
Submitted 26 April, 2023; v1 submitted 19 May, 2022;
originally announced May 2022.
-
Predicting Chronic Homelessness: The Importance of Comparing Algorithms using Client Histories
Authors:
Geoffrey G. Messier,
Caleb John,
Ayush Malik
Abstract:
This paper investigates how to best compare algorithms for predicting chronic homelessness for the purpose of identifying good candidates for housing programs. Predictive methods can rapidly refer potentially chronic shelter users to housing but also sometimes incorrectly identify individuals who will not become chronic (false positives). We use shelter access histories to demonstrate that these f…
▽ More
This paper investigates how to best compare algorithms for predicting chronic homelessness for the purpose of identifying good candidates for housing programs. Predictive methods can rapidly refer potentially chronic shelter users to housing but also sometimes incorrectly identify individuals who will not become chronic (false positives). We use shelter access histories to demonstrate that these false positives are often still good candidates for housing. Using this approach, we compare a simple threshold method for predicting chronic homelessness to the more complex logistic regression and neural network algorithms. While traditional binary classification performance metrics show that the machine learning algorithms perform better than the threshold technique, an examination of the shelter access histories of the cohorts identified by the three algorithms show that they select groups with very similar characteristics. This has important implications for resource constrained not-for-profit organizations since the threshold technique can be implemented using much simpler information technology infrastructure than the machine learning algorithms.
△ Less
Submitted 24 March, 2023; v1 submitted 31 May, 2021;
originally announced May 2021.
-
The Best Thresholds for Rapid Identification of Episodic and Chronic Homeless Shelter Use
Authors:
Geoffrey Guy Messier,
Leslie Tutty,
Caleb John
Abstract:
This paper explores how to best identify clients for housing services based on their homeless shelter access patterns. We focus on counting the number of shelter stays and episodes of shelter use for a client within a time window. Thresholds are then applied to these values to determine if that individual is a good candidate for housing support. Using new housing referral impact metrics, we explor…
▽ More
This paper explores how to best identify clients for housing services based on their homeless shelter access patterns. We focus on counting the number of shelter stays and episodes of shelter use for a client within a time window. Thresholds are then applied to these values to determine if that individual is a good candidate for housing support. Using new housing referral impact metrics, we explore a range of threshold and time window values to determine which combination both maximizes impact and identifies good candidates for housing as soon as possible. New insights are also provided regarding the characteristics of the "under-the-radar" client group who are typically not identified for housing support.
△ Less
Submitted 24 March, 2023; v1 submitted 3 May, 2021;
originally announced May 2021.
-
Adaptive M-QAM for Indoor Wireless Environments : Rate & Power Adaptation
Authors:
I. Dey,
G. G. Messier,
S. Magierowski
Abstract:
This letter presents a detailed study for indoor wireless environments, where transmit power, rate and target bit error rate (BER) are varied to increase spectral efficiency. The study is conducted for the recently proposed joint fading and two-path shadowing (JFTS) channel model, which is shown to be accurate for modeling non-Gaussian indoor WLAN environments. Analysis is done for both average an…
▽ More
This letter presents a detailed study for indoor wireless environments, where transmit power, rate and target bit error rate (BER) are varied to increase spectral efficiency. The study is conducted for the recently proposed joint fading and two-path shadowing (JFTS) channel model, which is shown to be accurate for modeling non-Gaussian indoor WLAN environments. Analysis is done for both average and instantaneous BER constraints without channel coding, where only a discrete finite set of constellations is available. Numerical results show that, for a JFTS channel i) varying only the transmission rate (modulation constellation size) achieves more improvement in spectral efficiency compared to varying transmit power only, and ii) varying rate and/or power subject to instantaneous BER (IBER) constraint offers better performance than when subject to average BER (A-BER) constraint.
△ Less
Submitted 20 November, 2017;
originally announced November 2017.
-
Petroleum Refinery Multi-Antenna Propagation Measurements
Authors:
Mohamed Gaafar,
Geoffrey G Messier
Abstract:
This paper presents the results of the first multi- antenna propagation measurement campaign to be conducted at an operating petroleum refining facility. The measurement equipment transmits pseudo-random noise test signals from two antennas at a 2.47 GHz carrier with a signal bandwidth of approximately 25 MHz. The measurement data is analyzed to extract path loss exponent, shadowing distribution,…
▽ More
This paper presents the results of the first multi- antenna propagation measurement campaign to be conducted at an operating petroleum refining facility. The measurement equipment transmits pseudo-random noise test signals from two antennas at a 2.47 GHz carrier with a signal bandwidth of approximately 25 MHz. The measurement data is analyzed to extract path loss exponent, shadowing distribution, fading distribution, coherence bandwidth and antenna correlation. The results reveal an environment where large scale attenuation is relatively mild, fading is severe and good performance is expected from both antenna and frequency diversity.
△ Less
Submitted 1 November, 2016;
originally announced November 2016.
-
Vehicle Routing Problems for Drone Delivery
Authors:
Kevin Dorling,
Jordan Heinrichs,
Geoffrey G. Messier,
Sebastian Magierowski
Abstract:
Unmanned aerial vehicles, or drones, have the potential to significantly reduce the cost and time of making last-mile deliveries and responding to emergencies. Despite this potential, little work has gone into develo** vehicle routing problems (VRPs) specifically for drone delivery scenarios. Existing VRPs are insufficient for planning drone deliveries: either multiple trips to the depot are not…
▽ More
Unmanned aerial vehicles, or drones, have the potential to significantly reduce the cost and time of making last-mile deliveries and responding to emergencies. Despite this potential, little work has gone into develo** vehicle routing problems (VRPs) specifically for drone delivery scenarios. Existing VRPs are insufficient for planning drone deliveries: either multiple trips to the depot are not permitted, leading to solutions with excess drones, or the effect of battery and payload weight on energy consumption is not considered, leading to costly or infeasible routes. We propose two multi-trip VRPs for drone delivery that address both issues. One minimizes costs subject to a delivery time limit, while the other minimizes the overall delivery time subject to a budget constraint. We mathematically derive and experimentally validate an energy consumption model for multirotor drones, demonstrating that energy consumption varies approximately linearly with payload and battery weight. We use this approximation to derive mixed integer linear programs for our VRPs. We propose a cost function that considers our energy consumption model and drone reuse, and apply it in a simulated annealing (SA) heuristic for finding sub-optimal solutions to practical scenarios. To assist drone delivery practitioners with balancing cost and delivery time, the SA heuristic is used to show that the minimum cost has an inverse exponential relationship with the delivery time limit, and the minimum overall delivery time has an inverse exponential relationship with the budget. Numerical results confirm the importance of reusing drones and optimizing battery size in drone delivery VRPs.
△ Less
Submitted 7 August, 2016;
originally announced August 2016.
-
Dense Urban Channel Measurements for Utility Pole Fixed Wireless Links
Authors:
Michael W. Wasson,
Geoffrey G. Messier,
Devin P. Smith
Abstract:
This radio channel measurement campaign characterizes the propagation conditions experienced in a dense urban environment over fixed backhaul links between wireless devices that are mounted on utility or traffic light poles. The measurements characterize the 2x1 multiple input single output channel in the 2.45 GHz band for both spatially separated omni antennas and cross polarized directional ante…
▽ More
This radio channel measurement campaign characterizes the propagation conditions experienced in a dense urban environment over fixed backhaul links between wireless devices that are mounted on utility or traffic light poles. The measurements characterize the 2x1 multiple input single output channel in the 2.45 GHz band for both spatially separated omni antennas and cross polarized directional antennas. Results presented include both small and large scale channel statistics, antenna correlation coefficient values and the off-broadside rejection achieved with the directional antennas.
△ Less
Submitted 27 May, 2016;
originally announced May 2016.
-
On the Capacity of Joint Fading and Two-path Shadowing Channels
Authors:
I. Dey,
G. G. Messier,
S. Magierowski
Abstract:
The ergodic and outage channel capacity of different optimal and suboptimal combinations of transmit power and modulation rate adaptation strategies over a joint fading and two-path shadowing (JFTS) fading/shadowing channel is studied in this paper. Analytically tractable expressions for channel capacity are obtained, assuming perfect channel side information (CSI) at the receiver and/or the trans…
▽ More
The ergodic and outage channel capacity of different optimal and suboptimal combinations of transmit power and modulation rate adaptation strategies over a joint fading and two-path shadowing (JFTS) fading/shadowing channel is studied in this paper. Analytically tractable expressions for channel capacity are obtained, assuming perfect channel side information (CSI) at the receiver and/or the transmitter with negligible feedback delay. Furthermore, the impacts of the JFTS parameters on the channel capacity achieved by these adaptive transmission techniques are determined.
△ Less
Submitted 11 May, 2016;
originally announced May 2016.
-
Minimizing the Net Present Cost of Deploying and Operating Wireless Sensor Networks
Authors:
Kevin Dorling,
Geoffrey G. Messier,
Stefan Valentin,
Sebastian Magierowski
Abstract:
Minimizing the cost of deploying and operating a Wireless Sensor Network (WSN) involves deciding how to partition a budget between competing expenses such as node hardware, energy, and labor. Most commercial network operators account for interest rates in their budgeting exercises, providing a financial incentive to defer some costs until a later time. In this paper, we propose a net present cost…
▽ More
Minimizing the cost of deploying and operating a Wireless Sensor Network (WSN) involves deciding how to partition a budget between competing expenses such as node hardware, energy, and labor. Most commercial network operators account for interest rates in their budgeting exercises, providing a financial incentive to defer some costs until a later time. In this paper, we propose a net present cost (NPC) model for WSN capital and operating expenses that accounts for interest rates. Our model optimizes the number, size, and spacing between expenditures in order to minimize the NPC required for the network to achieve a desired operational lifetime. In general this optimization problem is non-convex, but if the spacing between expenditures is linearly proportional to the size of the expenditures, and the number of maintenance cycles is known in advance, the problem becomes convex and can be solved to global optimality. If non-deferrable recurring costs are low, then evenly spacing the expenditures can provide near-optimal results. With the provided models and methods, network operators can now derive a payment schedule to minimize NPC while accounting for various operational parameters. The numerical examples show substantial cost benefits under practical assumptions.
△ Less
Submitted 8 August, 2015;
originally announced August 2015.