Skip to main content

Showing 1–12 of 12 results for author: Dixon, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.01529  [pdf, other

    cs.CR cs.LG

    On the Abuse and Detection of Polyglot Files

    Authors: Luke Koch, Sean Oesch, Amul Chaulagain, Jared Dixon, Matthew Dixon, Mike Huettal, Amir Sadovnik, Cory Watson, Brian Weber, Jacob Hartman, Richard Patulski

    Abstract: A polyglot is a file that is valid in two or more formats. Polyglot files pose a problem for malware detection systems that route files to format-specific detectors/signatures, as well as file upload and sanitization tools. In this work we found that existing file-format and embedded-file detection tools, even those developed specifically for polyglot files, fail to reliably detect polyglot files… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 18 pages, 11 figures

  2. arXiv:2404.14219  [pdf, other

    cs.CL cs.AI

    Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

    Authors: Marah Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Jianmin Bao, Harkirat Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Qin Cai, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Dong Chen, Dongdong Chen, Yen-Chun Chen, Yi-Ling Chen, Parul Chopra , et al. (90 additional authors not shown)

    Abstract: We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset… ▽ More

    Submitted 23 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 19 pages

  3. arXiv:2308.09199  [pdf, other

    cs.LG cs.CR physics.optics

    Polynomial Bounds for Learning Noisy Optical Physical Unclonable Functions and Connections to Learning With Errors

    Authors: Apollo Albright, Boris Gelfand, Michael Dixon

    Abstract: It is shown that a class of optical physical unclonable functions (PUFs) can be learned to arbitrary precision with arbitrarily high probability, even in the presence of noise, given access to polynomially many challenge-response pairs and polynomially bounded computational power, under mild assumptions about the distributions of the noise and challenge vectors. This extends the results of Rhürami… ▽ More

    Submitted 7 September, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: 10 pages, 2 figures, submitted to IEEE Transactions on Information Forensics and Security

    Report number: LA-UR-23-29328

  4. arXiv:2308.03290  [pdf, other

    cs.CV cs.LG

    FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search

    Authors: Jordan Dotzel, Gang Wu, Andrew Li, Muhammad Umar, Yun Ni, Mohamed S. Abdelfattah, Zhiru Zhang, Liqun Cheng, Martin G. Dixon, Norman P. Jouppi, Quoc V. Le, Sheng Li

    Abstract: Quantization has become a mainstream compression technique for reducing model size, computational requirements, and energy consumption for modern deep neural networks (DNNs). With improved numerical support in recent hardware, including multiple variants of integer and floating point, mixed-precision quantization has become necessary to achieve high-quality results with low model cost. Prior mixed… ▽ More

    Submitted 1 May, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Accepted to AutoML 2024

  5. arXiv:2206.10014  [pdf, other

    q-fin.PR cs.LG q-fin.PM q-fin.RM stat.ML

    Deep Partial Least Squares for Empirical Asset Pricing

    Authors: Matthew F. Dixon, Nicholas G. Polson, Kemen Goicoechea

    Abstract: We use deep partial least squares (DPLS) to estimate an asset pricing model for individual stock returns that exploits conditioning information in a flexible and dynamic way while attributing excess returns to a small set of statistical risk factors. The novel contribution is to resolve the non-linear factor structure, thus advancing the current paradigm of deep learning in empirical asset pricing… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  6. Federated Learning Enables Big Data for Rare Cancer Boundary Detection

    Authors: Sarthak Pati, Ujjwal Baid, Brandon Edwards, Micah Sheller, Shih-Han Wang, G Anthony Reina, Patrick Foley, Alexey Gruzdev, Deepthi Karkada, Christos Davatzikos, Chiharu Sako, Satyam Ghodasara, Michel Bilello, Suyash Mohan, Philipp Vollmuth, Gianluca Brugnara, Chandrakanth J Preetha, Felix Sahm, Klaus Maier-Hein, Maximilian Zenk, Martin Bendszus, Wolfgang Wick, Evan Calabrese, Jeffrey Rudie, Javier Villanueva-Meyer , et al. (254 additional authors not shown)

    Abstract: Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train acc… ▽ More

    Submitted 25 April, 2022; v1 submitted 22 April, 2022; originally announced April 2022.

    Comments: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS

  7. arXiv:2111.09954  [pdf, other

    cs.LG physics.ao-ph

    MS-nowcasting: Operational Precipitation Nowcasting with Convolutional LSTMs at Microsoft Weather

    Authors: Sylwester Klocek, Haiyu Dong, Matthew Dixon, Panashe Kanengoni, Najeeb Kazmi, Pete Luferenko, Zhongjian Lv, Shikhar Sharma, Jonathan Weyn, Siqi Xiang

    Abstract: We present the encoder-forecaster convolutional long short-term memory (LSTM) deep-learning model that powers Microsoft Weather's operational precipitation nowcasting product. This model takes as input a sequence of weather radar mosaics and deterministically predicts future radar reflectivity at lead times up to 6 hours. By stacking a large input receptive field along the feature dimension and co… ▽ More

    Submitted 23 May, 2022; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: Minor updates to reflect final submission to NeurIPS workshop

    Journal ref: NeurIPS 2021 Workshop on Tackling Climate Change with Machine Learning, 2021. https://www.climatechange.ai/papers/neurips2021/19

  8. arXiv:2004.04717  [pdf, other

    stat.ML cs.LG

    Industrial Forecasting with Exponentially Smoothed Recurrent Neural Networks

    Authors: Matthew F Dixon

    Abstract: Time series modeling has entered an era of unprecedented growth in the size and complexity of data which require new modeling approaches. While many new general purpose machine learning approaches have emerged, they remain poorly understand and irreconcilable with more traditional statistical modeling approaches. We present a general class of exponential smoothed recurrent neural networks (RNNs) w… ▽ More

    Submitted 30 October, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

  9. arXiv:2002.10990  [pdf, other

    q-fin.PM cs.LG q-fin.CP stat.ML

    G-Learner and GIRL: Goal Based Wealth Management with Reinforcement Learning

    Authors: Matthew Dixon, Igor Halperin

    Abstract: We present a reinforcement learning approach to goal based wealth management problems such as optimization of retirement plans or target dated funds. In such problems, an investor seeks to achieve a financial goal by making periodic investments in the portfolio while being employed, and periodically draws from the account when in retirement, in addition to the ability to re-balance the portfolio b… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

  10. arXiv:1903.07677  [pdf, other

    stat.ML cs.LG stat.ME

    Deep Fundamental Factor Models

    Authors: Matthew F. Dixon, Nicholas G. Polson

    Abstract: Deep fundamental factor models are developed to automatically capture non-linearity and interaction effects in factor modeling. Uncertainty quantification provides interpretability with interval estimation, ranking of factor importances and estimation of interaction effects. With no hidden layers we recover a linear factor model and for one or more hidden layers, uncertainty bands for the sensitiv… ▽ More

    Submitted 27 August, 2020; v1 submitted 18 March, 2019; originally announced March 2019.

    Journal ref: Forthcoming in SIAM J. Financial Mathematics, 2020

  11. arXiv:1903.03019  [pdf

    cs.CY

    Engaging Users with Educational Games: The Case of Phishing

    Authors: Matt Dixon, Nalin Asanka Gamagedara Arachchilage, James Nicholson

    Abstract: Phishing continues to be a difficult problem for individuals and organisations. Educational games and simulations have been increasingly acknowledged as enormous and powerful teaching tools, yet little work has examined how to engage users with these games. We explore this problem by conducting workshops with 9 younger adults and reporting on their expectations for cybersecurity educational games.… ▽ More

    Submitted 7 March, 2019; originally announced March 2019.

    Comments: 4

    Journal ref: CHI '19 Extended Abstracts on Human Factors in Computing Systems Proceedings (CHI 2019), 2019

  12. arXiv:1603.08604  [pdf, other

    cs.LG cs.CE

    Classification-based Financial Markets Prediction using Deep Neural Networks

    Authors: Matthew Dixon, Diego Klabjan, ** Hoon Bang

    Abstract: Deep neural networks (DNNs) are powerful types of artificial neural networks (ANNs) that use several hidden layers. They have recently gained considerable attention in the speech transcription and image recognition community (Krizhevsky et al., 2012) for their superior predictive properties including robustness to overfitting. However their application to algorithmic trading has not been previousl… ▽ More

    Submitted 13 June, 2017; v1 submitted 28 March, 2016; originally announced March 2016.