-
XWalk: Random Walk Based Candidate Retrieval for Product Search
Authors:
Jon Eskreis-Winkler,
Yubin Kim,
Andrew Stanton
Abstract:
In e-commerce, head queries account for the vast majority of gross merchandise sales and improvements to head queries are highly impactful to the business. While most supervised approaches to search perform better in head queries vs. tail queries, we propose a method that further improves head query performance dramatically. We propose XWalk, a random-walk based graph approach to candidate retriev…
▽ More
In e-commerce, head queries account for the vast majority of gross merchandise sales and improvements to head queries are highly impactful to the business. While most supervised approaches to search perform better in head queries vs. tail queries, we propose a method that further improves head query performance dramatically. We propose XWalk, a random-walk based graph approach to candidate retrieval for product search that borrows from recommendation system techniques. XWalk is highly efficient to train and inference in a large-scale high traffic e-commerce setting, and shows substantial improvements in head query performance over state-of-the-art neural retreivers. Ensembling XWalk with a neural and/or lexical retriever combines the best of both worlds and the resulting retrieval system outperforms all other methods in both offline relevance-based evaluation and in online A/B tests.
△ Less
Submitted 22 July, 2023;
originally announced July 2023.
-
Revenue, Relevance, Arbitrage and More: Joint Optimization Framework for Search Experiences in Two-Sided Marketplaces
Authors:
Andrew Stanton,
Akhila Ananthram,
Congzhe Su,
Liangjie Hong
Abstract:
Two-sided marketplaces such as eBay, Etsy and Taobao have two distinct groups of customers: buyers who use the platform to seek the most relevant and interesting item to purchase and sellers who view the same platform as a tool to reach out to their audience and grow their business. Additionally, platforms have their own objectives ranging from growing both buyer and seller user bases to revenue m…
▽ More
Two-sided marketplaces such as eBay, Etsy and Taobao have two distinct groups of customers: buyers who use the platform to seek the most relevant and interesting item to purchase and sellers who view the same platform as a tool to reach out to their audience and grow their business. Additionally, platforms have their own objectives ranging from growing both buyer and seller user bases to revenue maximization. It is not difficult to see that it would be challenging to obtain a globally favorable outcome for all parties. Taking the search experience as an example, any interventions are likely to impact either buyers or sellers unfairly to course correct for a greater perceived need. In this paper, we address how a company-aligned search experience can be provided with competing business metrics that E-commerce companies typically tackle. As far as we know, this is a pioneering work to consider multiple different aspects of business indicators in two-sided marketplaces to optimize a search experience. We demonstrate that many problems are difficult or impossible to decompose down to credit assigned scores on individual documents, rendering traditional methods inadequate. Instead, we express market-level metrics as constraints and discuss to what degree multiple potentially conflicting metrics can be tuned to business needs. We further explore the use of policy learners in the form of Evolutionary Strategies to jointly optimize both group-level and market-level metrics simultaneously, side-step** traditional cascading methods and manual interventions. We empirically evaluate the effectiveness of the proposed method on Etsy data and demonstrate its potential with insights.
△ Less
Submitted 15 May, 2019;
originally announced May 2019.
-
Robust deblending of simultaneous source seismic data
Authors:
Aaron Stanton,
Keith Wilkinson
Abstract:
Simultaneous source seismic acquisition is an efficient method of seismic surveying that can considerably reduce the cost of high density seismic acquisition. The method results in overlap** records, or interference, that must be removed prior to subsequent processing. Deblending methods typically rely on the incoherence of the blending noise relative to the underlying signal. There are many com…
▽ More
Simultaneous source seismic acquisition is an efficient method of seismic surveying that can considerably reduce the cost of high density seismic acquisition. The method results in overlap** records, or interference, that must be removed prior to subsequent processing. Deblending methods typically rely on the incoherence of the blending noise relative to the underlying signal. There are many common situations where these assumptions break down, for instance, when the underlying signal contains noise or erratic amplitudes, or when shooting times are not sufficiently random. We present a robust inversion based deblending algorithm that can overcome these challenges.
△ Less
Submitted 14 December, 2018;
originally announced December 2018.
-
Character sheaves for classical symmetric pairs
Authors:
Kari Vilonen,
Ting Xue,
with an appendix by Dennis Stanton
Abstract:
We establish a Springer theory for classical symmetric pairs. We give an explicit description of character sheaves in this setting. In particular we determine the cuspidal character sheaves.
We establish a Springer theory for classical symmetric pairs. We give an explicit description of character sheaves in this setting. In particular we determine the cuspidal character sheaves.
△ Less
Submitted 7 November, 2021; v1 submitted 7 June, 2018;
originally announced June 2018.
-
Mining for Causal Relationships: A Data-Driven Study of the Islamic State
Authors:
Andrew Stanton,
Amanda Thart,
Ashish Jain,
Priyank Vyas,
Arpan Chatterjee,
Paulo Shakarian
Abstract:
The Islamic State of Iraq and al-Sham (ISIS) is a dominant insurgent group operating in Iraq and Syria that rose to prominence when it took over Mosul in June, 2014. In this paper, we present a data-driven approach to analyzing this group using a dataset consisting of 2200 incidents of military activity surrounding ISIS and the forces that oppose it (including Iraqi, Syrian, and the American-led c…
▽ More
The Islamic State of Iraq and al-Sham (ISIS) is a dominant insurgent group operating in Iraq and Syria that rose to prominence when it took over Mosul in June, 2014. In this paper, we present a data-driven approach to analyzing this group using a dataset consisting of 2200 incidents of military activity surrounding ISIS and the forces that oppose it (including Iraqi, Syrian, and the American-led coalition). We combine ideas from logic programming and causal reasoning to mine for association rules for which we present evidence of causality. We present relationships that link ISIS vehicle-bourne improvised explosive device (VBIED) activity in Syria with military operations in Iraq, coalition air strikes, and ISIS IED activity, as well as rules that may serve as indicators of spikes in indirect fire, suicide attacks, and arrests.
△ Less
Submitted 5 August, 2015;
originally announced August 2015.