Showing 1–20 of 20 results for author: Lewis, D

Searching in archive cs.
  1. arXiv:2406.18211  [pdf, other]

    cs.CY cs.AI

    AI Cards: Towards an Applied Framework for Machine-Readable AI and Risk Documentation Inspired by the EU AI Act

    Authors: Delaram Golpayegani, Isabelle Hupont, Cecilia Panigutti, Harshvardhan J. Pandit, Sven Schade, Declan O'Sullivan, Dave Lewis

    Abstract: With the upcoming enforcement of the EU AI Act, documentation of high-risk AI systems and their risk management information will become a legal requirement playing a pivotal role in demonstration of compliance. Despite its importance, there is a lack of standards and guidelines to assist with drawing up AI and risk documentation aligned with the AI Act. This paper aims to address this gap by provi…

    Submitted 26 June, 2024; originally announced June 2024.

  2. TARexp: A Python Framework for Technology-Assisted Review Experiments

    Authors: Eugene Yang, David D. Lewis

    Abstract: Technology-assisted review (TAR) is an important industrial application of information retrieval (IR) and machine learning (ML). While a small TAR research community exists, the complexity of TAR software and workflows is a major barrier to entry. Drawing on past open source TAR efforts, as well as design patterns from the IR and ML open source software, we present an open source Python framework…

    Submitted 24 April, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: 6 pages, 4 figures, accepted as a SIGIR 2022 demo paper

  3. arXiv:2202.11149  [pdf, other]

    cs.MA stat.AP

    Incorporating social norms into a configurable agent-based model of the decision to perform commuting behaviour

    Authors: Robert Greener, Daniel Lewis, Jon Reades, Simon Miles, Steven Cummins

    Abstract: Interventions to increase active commuting have been recommended as a method to increase population physical activity, but evidence is mixed. Social norms related to travel behaviour may influence the uptake of active commuting interventions but are rarely considered in their design and evaluation. In this study we develop an agent-based model that incorporates social norms related to travel behav…

    Submitted 10 August, 2022; v1 submitted 22 February, 2022; originally announced February 2022.

    Comments: 18 pages, 2 figures, 4 tables. Published version in ATT'22 Workshop Agents in Traffic and Transportation, July 25, 2022, Vienna, Austria, http://ceur-ws.org/Vol-3173/12.pdf

    ACM Class: J.3; J.4

  4. arXiv:2108.12752  [pdf, other]

    cs.IR

    TAR on Social Media: A Framework for Online Content Moderation

    Authors: Eugene Yang, David D. Lewis, Ophir Frieder

    Abstract: Content moderation (removing or limiting the distribution of posts based on their contents) is one tool social networks use to fight problems such as harassment and disinformation. Manually screening all content is usually impractical given the scale of social media data, and the need for nuanced human interpretations makes fully automated approaches infeasible. We consider content moderation from…

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: 9 pages, 2 figures, accepted at DESIRES 2021

  5. Certifying One-Phase Technology-Assisted Reviews

    Authors: David D. Lewis, Eugene Yang, Ophir Frieder

    Abstract: Technology-assisted review (TAR) workflows based on iterative active learning are widely used in document review applications. Most stopping rules for one-phase TAR workflows lack valid statistical guarantees, which has discouraged their use in some legal contexts. Drawing on the theory of quantile estimation, we provide the first broadly applicable and statistically valid sample-based stopping ru…

    Submitted 29 August, 2021; originally announced August 2021.

    Comments: 10 pages, 4 figures, accepted at CIKM 2021

  6. arXiv:2108.09959  [pdf]

    cs.CY

    Artificial Intelligence Ethics: An Inclusive Global Discourse?

    Authors: Cathy Roche, Dave Lewis, P. J. Wall

    Abstract: It is widely accepted that technology is ubiquitous across the planet and has the potential to solve many of the problems existing in the Global South. Moreover, the rapid advancement of artificial intelligence (AI) brings with it the potential to address many of the challenges outlined in the Sustainable Development Goals (SDGs) in ways which were never before possible. However, there are many qu…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  7. Heuristic Stopping Rules For Technology-Assisted Review

    Authors: Eugene Yang, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to human-in-the-loop active learning workflows for finding relevant documents in large collections. These workflows often must meet a target for the proportion of relevant documents found (i.e. recall) while also holding down costs. A variety of heuristic stopping rules have been suggested for striking this tradeoff in particular settings, but none have been…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 10 pages, 2 figures. Accepted at DocEng 21

  8. On Minimizing Cost in Legal Document Review Workflows

    Authors: Eugene Yang, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to human-in-the-loop machine learning workflows for document review in legal discovery and other high recall review tasks. Attorneys and legal technologists have debated whether review should be a single iterative process (one-phase TAR workflows) or whether model training and review should be separate (two-phase TAR workflows), with implications for the cho…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 10 pages, 3 figures. Accepted at DocEng 21

  9. arXiv:2105.01044  [pdf, other]

    cs.IR cs.CL

    Goldilocks: Just-Right Tuning of BERT for Technology-Assisted Review

    Authors: Eugene Yang, Sean MacAvaney, David D. Lewis, Ophir Frieder

    Abstract: Technology-assisted review (TAR) refers to iterative active learning workflows for document review in high recall retrieval (HRR) tasks. TAR research and most commercial TAR software have applied linear models such as logistic regression to lexical features. Transformer-based models with supervised tuning are known to improve effectiveness on many text classification tasks, suggesting their use in…

    Submitted 19 January, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: 6 pages, 1 figure, accepted at ECIR 2022

  10. arXiv:2007.13417  [pdf, other]

    physics.app-ph cs.CV eess.IV

    Image-driven discriminative and generative machine learning algorithms for establishing microstructure-processing relationships

    Authors: Wufei Ma, Elizabeth Kautz, Arun Baskaran, Aritra Chowdhury, Vineet Joshi, Bülent Yener, Daniel Lewis

    Abstract: We investigate methods of microstructure representation for the purpose of predicting processing condition from microstructure image data. A binary alloy (uranium-molybdenum) that is currently under development as a nuclear fuel was studied for the purpose of developing an improved machine learning approach to image recognition, characterization, and building predictive capabilities linking micros…

    Submitted 27 July, 2020; originally announced July 2020.

    Comments: 14 pages, 15 figures

  11. arXiv:2005.00986  [pdf]

    cs.CV cs.AI

    Using Artificial Intelligence to Analyze Fashion Trends

    Authors: Mengyun Shi, Van Dyk Lewis

    Abstract: Analyzing fashion trends is essential in the fashion industry. Current fashion forecasting firms, such as WGSN, utilize the visual information from around the world to analyze and predict fashion trends. However, analyzing fashion trends is time-consuming and extremely labor intensive, requiring individual employees' manual editing and classification. To improve the efficiency of data analysis of…

    Submitted 3 May, 2020; originally announced May 2020.

  12. arXiv:2001.06367  [pdf, other]

    math.CO cs.DM

    On Covering Numbers, Young Diagrams, and the Local Dimension of Posets

    Authors: Gábor Damásdi, Stefan Felsner, António Girão, Balázs Keszegh, David Lewis, Dániel T. Nagy, Torsten Ueckerdt

    Abstract: We study covering numbers and local covering numbers with respect to difference graphs and complete bipartite graphs. In particular we show that in every cover of a Young diagram with $\binom{2k}{k}$ steps with generalized rectangles there is a row or a column in the diagram that is used by at least $k+1$ rectangles, and prove that this is best-possible. This answers two questions by Kim, Martin,…

    Submitted 17 January, 2020; originally announced January 2020.

    Comments: Parts of this paper have previously been reported in arXiv submission arXiv:1902.08223

  13. arXiv:1909.05660  [pdf, other]

    q-bio.NC cs.LG eess.IV stat.ML

    Predicting intelligence based on cortical WM/GM contrast, cortical thickness and volumetry

    Authors: Juan Miguel Valverde, Vandad Imani, John D. Lewis, Jussi Tohka

    Abstract: We propose a four-layer fully-connected neural network (FNN) for predicting fluid intelligence scores from T1-weighted MR images for the ABCD-challenge. In addition to the volumes of brain structures, the FNN uses cortical WM/GM contrast and cortical thickness at 78 cortical regions. These last two measurements were derived from the T1-weighted MR images using cortical surfaces produced by the CIV…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: Submission to the ABCD Neurocognitive Prediction Challenge at MICCAI 2019

  14. arXiv:1906.05496  [pdf, other]

    physics.app-ph cs.CV cs.LG

    An image-driven machine learning approach to kinetic modeling of a discontinuous precipitation reaction

    Authors: Elizabeth Kautz, Wufei Ma, Saumyadeep Jana, Arun Devaraj, Vineet Joshi, Bülent Yener, Daniel Lewis

    Abstract: Micrograph quantification is an essential component of several materials science studies. Machine learning methods, in particular convolutional neural networks, have previously demonstrated performance in image recognition tasks across several disciplines (e.g. materials science, medical imaging, facial recognition). Here, we apply these well-established methods to develop an approach to microstru…

    Submitted 13 June, 2019; originally announced June 2019.

    Comments: 30 pages, 8 figures

  15. arXiv:1609.04667  [pdf]

    cs.CY

    War-Algorithm Accountability

    Authors: Dustin A. Lewis, Gabriella Blum, Naz K. Modirzadeh

    Abstract: In this briefing report, we introduce a new concept (war algorithms) that elevates algorithmically-derived choices and decisions to a, and perhaps the, central concern regarding technical autonomy in war. We thereby aim to shed light on and recast the discussion regarding autonomous weapon systems. We define war algorithm as any algorithm that is expressed in computer code, that is effectuated thr…

    Submitted 12 September, 2016; originally announced September 2016.

  16. Capturing divergence in dependency trees to improve syntactic projection

    Authors: Ryan Georgi, Fei Xia, William D. Lewis

    Abstract: Obtaining syntactic parses is a crucial part of many NLP pipelines. However, most of the world's languages do not have large amounts of syntactically annotated corpora available for building parsers. Syntactic projection techniques attempt to address this issue by using parallel corpora consisting of resource-poor and resource-rich language pairs, taking advantage of a parser for the resource-rich…

    Submitted 14 May, 2016; originally announced May 2016.

  17. arXiv:1312.1378  [pdf, other]

    cs.NI

    An Analytical Model for Loc/ID Mappings Caches

    Authors: Florin Coras, Jordi Domingo-Pascual, Darrel Lewis, Albert Cabellos-Aparicio

    Abstract: Concerns regarding the scalability of the inter-domain routing have encouraged researchers to start elaborating a more robust Internet architecture. While consensus on the exact form of the solution is yet to be found, the need for a semantic decoupling of a node's location and identity is generally accepted as a promising way forward. However, this typically requires the use of caches that store…

    Submitted 6 December, 2013; v1 submitted 4 December, 2013; originally announced December 2013.

  18. arXiv:0909.2368  [pdf]

    cs.CR

    Web Single Sign-On Authentication using SAML

    Authors: Kelly D. Lewis and James E. Lewis

    Abstract: Companies have increasingly turned to application service providers (ASPs) or Software as a Service (SaaS) vendors to offer specialized web-based services that will cut costs and provide specific and focused applications to users. The complexity of designing, installing, configuring, deploying, and supporting the system with internal resources can be eliminated with this type of methodology, pro…

    Submitted 12 September, 2009; originally announced September 2009.

    Comments: International Journal of Computer Science Issues (IJCSI), Volume 1, pp41-48, August 2009

    Journal ref: K. D. Lewis and J. E. Lewis, "Web Single Sign-On Authentication using SAML", International Journal of Computer Science Issues (IJCSI), Volume 1, pp41-48, August 2009

  19. arXiv:math/0204068  [pdf, ps, other]

    math.AG cs.CC math.OC

    Computational problems for vector-valued quadratic forms

    Authors: Francesco Bullo, Jorge Cortes, Andrew D. Lewis, Sonia Martinez

    Abstract: Given two real vector spaces $U$ and $V$, and a symmetric bilinear map $B: U\times U\to V$, let $Q_B$ be its associated quadratic map $Q_B$. The problems we consider are as follows: (i) are there necessary and sufficient conditions, checkable in polynomial-time, for determining when $Q_B$ is surjective?; (ii) if $Q_B$ is surjective, given $v\in V$ is there a polynomial-time algorithm for finding…

    Submitted 5 April, 2002; originally announced April 2002.

    Comments: 6 pages, no figures, submitted to Workshop on Open Problems in Mathematical Systems and Control Theory

    MSC Class: 11Exx; 14Pxx; 14Q99; 15A63

  20. arXiv:cmp-lg/9407020  [pdf, ps]

    cs.CL

    A Sequential Algorithm for Training Text Classifiers

    Authors: David D. Lewis, William A. Gale

    Abstract: The ability to cheaply train text classifiers is critical to their use in information retrieval, content analysis, natural language processing, and other tasks involving data which is partly or fully textual. An algorithm for sequential sampling during machine learning of statistical classifiers was developed and tested on a newswire text categorization task. This method, which we call uncertain…

    Submitted 24 July, 1994; v1 submitted 24 July, 1994; originally announced July 1994.

    Comments: 10 pages, uuencoded, compressed PostScript; Proc. SIGIR-94 LaTex available from [email protected]