Skip to main content

Showing 1–1 of 1 results for author: Sachdeva, B

Searching in archive eess. Search in all archives.
.
  1. arXiv:2008.03964  [pdf, other

    cs.CL cs.CV cs.LG eess.SY

    DQI: A Guide to Benchmark Evaluation

    Authors: Swaroop Mishra, Anjana Arunkumar, Bhavdeep Sachdeva, Chris Bryan, Chitta Baral

    Abstract: A `state of the art' model A surpasses humans in a benchmark B, but fails on similar benchmarks C, D, and E. What does B have that the other benchmarks do not? Recent research provides the answer: spurious bias. However, develo** A to solve benchmarks B through E does not guarantee that it will solve future benchmarks. To progress towards a model that `truly learns' an underlying task, we need t… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

    Comments: ICML UDL 2020