Skip to main content

Showing 1–8 of 8 results for author: Boué, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09302  [pdf, other

    cs.LG cs.AI cs.DC

    High Significant Fault Detection in Azure Core Workload Insights

    Authors: Pranay Lohia, Laurent Boue, Sharath Rangappa, Vijay Agneeswaran

    Abstract: Azure Core workload insights have time-series data with different metric units. Faults or Anomalies are observed in these time-series data owing to faults observed with respect to metric name, resources region, dimensions, and its dimension value associated with the data. For Azure Core, an important task is to highlight faults or anomalies to the user on a dashboard that they can perceive easily.… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

  2. arXiv:2401.04732  [pdf, other

    cs.IR cs.AI cs.LG

    A case study of Generative AI in MSX Sales Copilot: Improving seller productivity with a real-time question-answering system for content recommendation

    Authors: Manpreet Singh, Ravdeep Pasricha, Nitish Singh, Ravi Prasad Kondapalli, Manoj R, Kiran R, Laurent Boué

    Abstract: In this paper, we design a real-time question-answering system specifically targeted for hel** sellers get relevant material/documentation they can share live with their customers or refer to during a call. Taking the Seismic content repository as a relatively large scale example of a diverse dataset of sales material, we demonstrate how LLM embeddings of sellers' queries can be matched with the… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

  3. arXiv:2401.00737  [pdf, other

    cs.IR cs.AI cs.LG cs.SE

    Searching, fast and slow, through product catalogs

    Authors: Dayananda Ubrangala, Juhi Sharma, Sharath Kumar Rangappa, Kiran R, Ravi Prasad Kondapalli, Laurent Boué

    Abstract: String matching algorithms in the presence of abbreviations, such as in Stock Kee** Unit (SKU) product catalogs, remains a relatively unexplored topic. In this paper, we present a unified architecture for SKU search that provides both a real-time suggestion system (based on a Trie data structure) as well as a lower latency search system (making use of character level TF-IDF in combination with l… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

  4. arXiv:2312.08021  [pdf, other

    cs.IR cs.AI cs.LG

    Improving search relevance of Azure Cognitive Search by Bayesian optimization

    Authors: Nitin Agarwal, Ashish Kumar, Kiran R, Manish Gupta, Laurent Boué

    Abstract: Azure Cognitive Search (ACS) has emerged as a major contender in "Search as a Service" cloud products in recent years. However, one of the major challenges for ACS users is to improve the relevance of the search results for their specific usecases. In this paper, we propose a novel method to find the optimal ACS configuration that maximizes search relevance for a specific usecase (product search,… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Journal ref: Microsoft Journal of Applied Research, Volume 20, 2024

  5. arXiv:2308.01976  [pdf, other

    cs.LG cs.AI cs.CL cs.IR

    Domain specificity and data efficiency in typo tolerant spell checkers: the case of search in online marketplaces

    Authors: Dayananda Ubrangala, Juhi Sharma, Ravi Prasad Kondapalli, Kiran R, Amit Agarwala, Laurent Boué

    Abstract: Typographical errors are a major source of frustration for visitors of online marketplaces. Because of the domain-specific nature of these marketplaces and the very short queries users tend to search for, traditional spell cheking solutions do not perform well in correcting typos. We present a data augmentation method to address the lack of annotated typo data and train a recurrent neural network… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

    Journal ref: Microsoft Journal of Applied Research, Volume 19, 2023

  6. arXiv:2212.07951  [pdf, other

    cs.SE cs.LG cs.PL

    A Data Source Dependency Analysis Framework for Large Scale Data Science Projects

    Authors: Laurent Boué, Pratap Kunireddy, Pavle Subotić

    Abstract: Dependency hell is a well-known pain point in the development of large software projects and machine learning (ML) code bases are not immune from it. In fact, ML applications suffer from an additional form, namely, "data source dependency hell". This term refers to the central role played by data and its unique quirks that often lead to unexpected failures of ML models which cannot be explained by… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  7. arXiv:1904.12320  [pdf, other

    cs.LG cs.DM cs.IR stat.ML

    Real numbers, data science and chaos: How to fit any dataset with a single parameter

    Authors: Laurent Boué

    Abstract: We show how any dataset of any modality (time-series, images, sound...) can be approximated by a well-behaved (continuous, differentiable...) scalar function with a single real-valued parameter. Building upon elementary concepts from chaos theory, we adopt a pedagogical approach demonstrating how to adjust this parameter in order to achieve arbitrary precision fit to all samples of the data. Targe… ▽ More

    Submitted 28 April, 2019; originally announced April 2019.

  8. arXiv:1811.11987  [pdf

    cs.LG cs.AI cs.CV cs.SC stat.ML

    Deep learning for pedestrians: backpropagation in CNNs

    Authors: Laurent Boué

    Abstract: The goal of this document is to provide a pedagogical introduction to the main concepts underpinning the training of deep neural networks using gradient descent; a process known as backpropagation. Although we focus on a very influential class of architectures called "convolutional neural networks" (CNNs) the approach is generic and useful to the machine learning community as a whole. Motivated by… ▽ More

    Submitted 29 November, 2018; originally announced November 2018.