Showing 1–10 of 10 results for author: Liu, C H B

  1. arXiv:2405.03579

    stat.AP cs.DB stat.ME

    Some Statistical and Data Challenges When Building Early-Stage Digital Experimentation and Measurement Capabilities

    Authors: C. H. Bryan Liu

    Abstract: Digital experimentation and measurement (DEM) capabilities -- the knowledge and tools necessary to run experiments with digital products, services, or experiences and measure their impact -- are fast becoming part of the standard toolkit of digital/data-driven organisations in guiding business decisions. Many large technology companies report having mature DEM capabilities, and several businesses…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: PhD thesis. Imperial College London. Official library version available on: https://spiral.imperial.ac.uk/handle/10044/1/110307

  2. Measuring e-Commerce Metric Changes in Online Experiments

    Authors: C. H. Bryan Liu, Emma J. McCoy

    Abstract: Digital technology organizations routinely use online experiments (e.g. A/B tests) to guide their product and business decisions. In e-commerce, we often measure changes to transaction- or item-based business metrics such as Average Basket Value (ABV), Average Basket Size (ABS), and Average Selling Price (ASP); yet it remains a common pitfall to ignore the dependency between the value/size of tran…

    Submitted 17 April, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: To appear in WWW '23 Companion. 5 pages, 4 figures, 2 tables. The experiment code and results on the two publicly available datasets are available on GitHub/Zenodo: https://doi.org/10.5281/zenodo.7659092. This version supersedes a previous working paper with a different title

  3. arXiv:2111.10198

    stat.AP cs.DB stat.ME

    Datasets for Online Controlled Experiments

    Authors: C. H. Bryan Liu, Ângelo Cardoso, Paul Couturier, Emma J. McCoy

    Abstract: Online Controlled Experiments (OCE) are the gold standard to measure impact and guide decisions for digital products and services. Despite many methodological advances in this area, the scarcity of public datasets and the lack of a systematic review and categorization hinder its development. We present the first survey and taxonomy for OCE datasets, which highlight the lack of a public dataset to…

    Submitted 14 January, 2022; v1 submitted 19 November, 2021; originally announced November 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks. 17 pages, 2 figures, 2 tables. Dataset available on Open Science Framework: https://osf.io/64jsb/

  4. arXiv:2007.11638

    stat.ME stat.AP

    An Evaluation Framework for Personalization Strategy Experiment Designs

    Authors: C. H. Bryan Liu, Emma J. McCoy

    Abstract: Online Controlled Experiments (OCEs) are the gold standard in evaluating the effectiveness of changes to websites. An important type of OCE evaluates different personalization strategies, which present challenges in low test power and lack of full control in group assignment. We argue that getting the right experiment setup -- the allocation of users to treatment/analysis groups -- should take pre…

    Submitted 9 May, 2023; v1 submitted 22 July, 2020; originally announced July 2020.

    Comments: Presented in the AdKDD 2020 workshop, in conjunction with The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2020. Main paper: 7 pages, 2 figures, 2 tables, Supplementary document: 6 pages. Fixed minor typos in Eqs. (17) and (18), and Expr. (27a)

  5. arXiv:1909.03457

    stat.ME stat.AP

    What is the value of experimentation & measurement?

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: Experimentation and Measurement (E&M) capabilities allow organizations to accurately assess the impact of new propositions and to experiment with many variants of existing products. However, until now, the question of measuring the measurer, or valuing the contribution of an E&M capability to organizational success has not been addressed. We tackle this problem by analyzing how, by decreasing esti…

    Submitted 8 September, 2019; originally announced September 2019.

    Comments: Accepted into IEEE International Conference on Data Mining (ICDM) 2019. Main paper: 6 pages, 3 figures; Supplementary document: 7 pages, 2 figures. Code available on: https://github.com/liuchbryan/value_of_experimentation

  6. arXiv:1807.04098

    cs.LG cs.CY cs.IR cs.NE stat.ML

    A Recurrent Neural Network Survival Model: Predicting Web User Return Time

    Authors: Georg L. Grob, Ângelo Cardoso, C. H. Bryan Liu, Duncan A. Little, Benjamin Paul Chamberlain

    Abstract: The size of a website's active user base directly affects its value. Thus, it is important to monitor and influence a user's likelihood to return to a site. Essential to this is predicting when a user will return. Current state of the art approaches to solve this problem come in two flavors: (1) Recurrent Neural Network (RNN) based solutions and (2) survival analysis methods. We observe that both…

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: Accepted into ECML PKDD 2018; 8 figures and 1 table

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2018. Lecture Notes in Computer Science, vol 11053. pp 152-168

  7. arXiv:1806.02588

    stat.ME cs.DM stat.AP

    Designing Experiments to Measure Incrementality on Facebook

    Authors: C. H. Bryan Liu, Elaine M. Bettaney, Benjamin Paul Chamberlain

    Abstract: The importance of Facebook advertising has risen dramatically in recent years, with the platform accounting for almost 20% of the global online ad spend in 2017. An important consideration in advertising is incrementality: how much of the change in an experimental metric is an advertising campaign responsible for. To measure incrementality, Facebook provide lift studies. As Facebook lift studies d…

    Submitted 11 July, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: Accepted into 2018 AdKDD & TargetAd Workshop in conjunction with KDD 2018; 6 pages, 4 figures, and 2 tables

  8. arXiv:1803.06258

    stat.ME cs.DM stat.AP

    Online Controlled Experiments for Personalised e-Commerce Strategies: Design, Challenges, and Pitfalls

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain

    Abstract: Online controlled experiments are the primary tool for measuring the causal impact of product changes in digital businesses. It is increasingly common for digital products and services to interact with customers in a personalised way. Using online controlled experiments to optimise personalised interaction strategies is challenging because the usual assumption of statistically equivalent user grou…

    Submitted 1 July, 2021; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: Not peer-reviewed but retained for historic interest. Removed an erroneous statement on Welch's t-test assumptions in Section 3.2. 9 pages, 7 figures

  9. arXiv:1706.09865

    stat.ML cs.CY cs.LG

    Generalising Random Forest Parameter Optimisation to Include Stability and Cost

    Authors: C. H. Bryan Liu, Benjamin Paul Chamberlain, Duncan A. Little, Angelo Cardoso

    Abstract: Random forests are among the most popular classification and regression methods used in industrial applications. To be effective, the parameters of random forests must be carefully tuned. This is usually done by choosing values that minimize the prediction error on a held out dataset. We argue that error reduction is only one of several metrics that must be considered when optimizing random forest…

    Submitted 13 July, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: To appear in ECML-PKDD 2017

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. LNCS vol 10536, pp. 102-113 (2017)

  10. arXiv:1703.02596

    cs.LG cs.CY cs.IR cs.NE stat.ML

    Customer Lifetime Value Prediction Using Embeddings

    Authors: Benjamin Paul Chamberlain, Angelo Cardoso, C. H. Bryan Liu, Roberto Pagliari, Marc Peter Deisenroth

    Abstract: We describe the Customer LifeTime Value (CLTV) prediction system deployed at ASOS.com, a global online fashion retailer. CLTV prediction is an important problem in e-commerce where an accurate estimate of future value allows retailers to effectively allocate marketing spend, identify and nurture high value customers and mitigate exposure to losses. The system at ASOS provides daily estimates of th…

    Submitted 6 July, 2017; v1 submitted 7 March, 2017; originally announced March 2017.

    Comments: 10 pages, 11 figures

    Journal ref: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Pages 1753-1762, 2017