Skip to main content

Showing 1–13 of 13 results for author: Rosenberg, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2307.10430  [pdf, other

    cs.LG cs.CR

    DP-TBART: A Transformer-based Autoregressive Model for Differentially Private Tabular Data Generation

    Authors: Rodrigo Castellon, Achintya Gopal, Brian Bloniarz, David Rosenberg

    Abstract: The generation of synthetic tabular data that preserves differential privacy is a problem of growing importance. While traditional marginal-based methods have achieved impressive results, recent work has shown that deep learning-based approaches tend to lag behind. In this work, we present Differentially-Private TaBular AutoRegressive Transformer (DP-TBART), a transformer-based autoregressive mode… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  2. arXiv:2305.16958  [pdf, other

    cs.CL cs.AI cs.LG

    MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies

    Authors: Shiyue Zhang, Shijie Wu, Ozan Irsoy, Steven Lu, Mohit Bansal, Mark Dredze, David Rosenberg

    Abstract: Autoregressive language models are trained by minimizing the cross-entropy of the model distribution Q relative to the data distribution P -- that is, minimizing the forward cross-entropy, which is equivalent to maximum likelihood estimation (MLE). We have observed that models trained in this way may "over-generalize", in the sense that they produce non-human-like text. Moreover, we believe that r… ▽ More

    Submitted 26 May, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (22 pages)

  3. arXiv:2303.17564  [pdf, other

    cs.LG cs.AI cs.CL q-fin.GN

    BloombergGPT: A Large Language Model for Finance

    Authors: Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, Mark Dredze, Sebastian Gehrmann, Prabhanjan Kambadur, David Rosenberg, Gideon Mann

    Abstract: The use of NLP in the realm of financial technology is broad and complex, with applications ranging from sentiment analysis and named entity recognition to question answering. Large Language Models (LLMs) have been shown to be effective on a variety of tasks; however, no LLM specialized for the financial domain has been reported in literature. In this work, we present BloombergGPT, a 50 billion pa… ▽ More

    Submitted 21 December, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Updated to include Training Chronicles (Appendix C)

  4. arXiv:2207.01809  [pdf, other

    cs.LG stat.AP

    Sedentary Behavior Estimation with Hip-worn Accelerometer Data: Segmentation, Classification and Thresholding

    Authors: Yiren Wang, Fatima Tuz-Zahra, Rong Zablocki, Chongzhi Di, Marta M. Jankowska, John Bellettiere, Jordan A. Carlson, Andrea Z. LaCroix, Sheri J. Hartman, Dori E. Rosenberg, **g**g Zou, Loki Natarajan

    Abstract: Cohort studies are increasingly using accelerometers for physical activity and sedentary behavior estimation. These devices tend to be less error-prone than self-report, can capture activity throughout the day, and are economical. However, previous methods for estimating sedentary behavior based on hip-worn data are often invalid or suboptimal under free-living situations and subject-to-subject va… ▽ More

    Submitted 5 July, 2022; originally announced July 2022.

  5. arXiv:2111.00340  [pdf

    cs.LG cs.CY

    Identifying and mitigating bias in algorithms used to manage patients in a pandemic

    Authors: Yifan Li, Garrett Yoon, Mustafa Nasir-Moin, David Rosenberg, Sean Neifert, Douglas Kondziolka, Eric Karl Oermann

    Abstract: Numerous COVID-19 clinical decision support systems have been developed. However many of these systems do not have the merit for validity due to methodological shortcomings including algorithmic bias. Methods Logistic regression models were created to predict COVID-19 mortality, ventilator status and inpatient status using a real-world dataset consisting of four hospitals in New York City and anal… ▽ More

    Submitted 30 October, 2021; originally announced November 2021.

    Comments: 4 pages, 1 tables

  6. arXiv:2106.04484  [pdf, other

    cs.CV cs.CL cs.LG

    Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions

    Authors: Daniel Rosenberg, Itai Gat, Amir Feder, Roi Reichart

    Abstract: Deep learning algorithms have shown promising results in visual question answering (VQA) tasks, but a more careful look reveals that they often do not understand the rich signal they are being fed with. To understand and better measure the generalization capabilities of VQA systems, we look at their robustness to counterfactually augmented data. Our proposed augmentations are designed to make a fo… ▽ More

    Submitted 17 September, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: ACL 2021. Our code and data are available at https://danrosenberg.github.io/rad-measure/

  7. arXiv:2103.01867  [pdf, other

    cs.CL cs.AI cs.CV

    Dual Reinforcement-Based Specification Generation for Image De-Rendering

    Authors: Ramakanth Pasunuru, David Rosenberg, Gideon Mann, Mohit Bansal

    Abstract: Advances in deep learning have led to promising progress in inferring graphics programs by de-rendering computer-generated images. However, current methods do not explore which decoding methods lead to better inductive bias for inferring graphics programs. In our work, we first explore the effectiveness of LSTM-RNN versus Transformer networks as decoders for order-independent graphics programs. Si… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

    Comments: AAAI 2021 Scientific Document Understanding Workshop (9 pages)

  8. arXiv:1811.08973  [pdf, other

    cs.AI cs.SE

    Improving Grey-Box Fuzzing by Modeling Program Behavior

    Authors: Siddharth Karamcheti, Gideon Mann, David Rosenberg

    Abstract: Grey-box fuzzers such as American Fuzzy Lop (AFL) are popular tools for finding bugs and potential vulnerabilities in programs. While these fuzzers have been able to find vulnerabilities in many widely used programs, they are not efficient; of the millions of inputs executed by AFL in a typical fuzzing run, only a handful discover unseen behavior or trigger a crash. The remaining inputs are redund… ▽ More

    Submitted 21 November, 2018; originally announced November 2018.

    Comments: 5 pages, 3 figures

  9. arXiv:1809.03684  [pdf, other

    cs.CE

    Visual Attention Model for Cross-sectional Stock Return Prediction and End-to-End Multimodal Market Representation Learning

    Authors: Ran Zhao, Yuntian Deng, Mark Dredze, Arun Verma, David Rosenberg, Amanda Stent

    Abstract: Technical and fundamental analysis are traditional tools used to analyze individual stocks; however, the finance literature has shown that the price movement of each individual stock correlates heavily with other stocks, especially those within the same sector. In this paper we propose a general purpose market representation that incorporates fundamental and technical indicators and relationships… ▽ More

    Submitted 8 March, 2019; v1 submitted 11 September, 2018; originally announced September 2018.

    Comments: Accepted as full paper in the 32nd International FLAIRS Conference

  10. Adaptive Grey-Box Fuzz-Testing with Thompson Sampling

    Authors: Siddharth Karamcheti, Gideon Mann, David Rosenberg

    Abstract: Fuzz testing, or "fuzzing," refers to a widely deployed class of techniques for testing programs by generating a set of inputs for the express purpose of finding bugs and identifying security flaws. Grey-box fuzzing, the most popular fuzzing strategy, combines light program instrumentation with a data driven process to generate new program inputs. In this work, we present a machine learning approa… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

    Comments: Published as a workshop paper in the 11th ACM Workshop on Artificial Intelligence and Security (AISec '18) with the 25th ACM Conference on Computer and Communications Security (CCS '18)

  11. arXiv:1704.06687  [pdf, other

    cs.CV cs.IR stat.ML

    Scatteract: Automated extraction of data from scatter plots

    Authors: Mathieu Cliche, David Rosenberg, Dhruv Madeka, Connie Yee

    Abstract: Charts are an excellent way to convey patterns and trends in data, but they do not facilitate further modeling of the data or close inspection of individual data points. We present a fully automated system for extracting the numerical values of data points from images of scatter plots. We use deep learning techniques to identify the key components of the chart, and optical character recognition to… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.

    Comments: Submitted to ECML PKDD 2017 proceedings, 16 pages

    Journal ref: Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2017. Lecture Notes in Computer Science, vol 10534. Springer, Cham

  12. arXiv:1206.5261  [pdf

    cs.LG cs.AI stat.ML

    Mixture-of-Parents Maximum Entropy Markov Models

    Authors: David S. Rosenberg, Dan Klein, Ben Taskar

    Abstract: We present the mixture-of-parents maximum entropy Markov model (MoP-MEMM), a class of directed graphical models extending MEMMs. The MoP-MEMM allows tractable incorporation of long-range dependencies between nodes by restricting the conditional distribution of each node to be a mixture of distributions given the parents. We show how to efficiently compute the exact marginal posterior node distribu… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-318-325

  13. arXiv:1007.4427  [pdf, other

    math.PR cs.GT math.OC

    Strategic Information Exchange

    Authors: Dinah Rosenberg, Eilon Solan, Nicolas Vieille

    Abstract: We study a class of two-player repeated games with incomplete information and informational externalities. In these games, two states are chosen at the outset, and players get private information on the pair, before engaging in repeated play. The payoff of each player only depends on his `own' state and on his own action. We study to what extent, and how, information can be exchanged in equilibriu… ▽ More

    Submitted 26 July, 2010; originally announced July 2010.