Showing 1–5 of 5 results for author: Hill, E

Searching in archive cs.
  1. arXiv:2311.00176  [pdf, other]

    cs.CL

    ChipNeMo: Domain-Adapted LLMs for Chip Design

    Authors: Mingjie Liu, Teodor-Dumitru Ene, Robert Kirby, Chris Cheng, Nathaniel Pinckney, Rongjian Liang, Jonah Alben, Himyanshu Anand, Sanmitra Banerjee, Ismet Bayraktaroglu, Bonita Bhaskaran, Bryan Catanzaro, Arjun Chaudhuri, Sharon Clay, Bill Dally, Laura Dang, Parikshit Deshpande, Siddhanth Dhodhi, Sameer Halepete, Eric Hill, Jiashang Hu, Sumit Jain, Ankit Jindal, Brucek Khailany, George Kokai, et al. (17 additional authors not shown)

    Abstract: ChipNeMo aims to explore the applications of large language models (LLMs) for industrial chip design. Instead of directly deploying off-the-shelf commercial or open-source LLMs, we instead adopt the following domain adaptation techniques: domain-adaptive tokenization, domain-adaptive continued pretraining, model alignment with domain-specific instructions, and domain-adapted retrieval models. We e…

    Submitted 4 April, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: Updated results for ChipNeMo-70B model

  2. An Ensemble Approach for Annotating Source Code Identifiers with Part-of-speech Tags

    Authors: Christian D. Newman, Michael J. Decker, Reem S. AlSuhaibani, Anthony Peruma, Satyajit Mohapatra, Tejal Vishnoi, Marcos Zampieri, Mohamed W. Mkaouer, Timothy J. Sheldon, Emily Hill

    Abstract: This paper presents an ensemble part-of-speech tagging approach for source code identifiers. Ensemble tagging is a technique that uses machine-learning and the output from multiple part-of-speech taggers to annotate natural language text at a higher quality than the part-of-speech taggers are able to obtain independently. Our ensemble uses three state-of-the-art part-of-speech taggers: SWUM, POSSE…

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: 18 pages. arXiv admin note: text overlap with arXiv:2007.08033

    Journal ref: in IEEE Transactions on Software Engineering, vol. , no. 01, pp. 1-1, 5555

  3. arXiv:2103.16977  [pdf, other]

    econ.GN cs.LG cs.MA stat.ML

    Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning

    Authors: Edward Hill, Marco Bardoscia, Arthur Turrell

    Abstract: General equilibrium macroeconomic models are a core tool used by policymakers to understand a nation's economy. They represent the economy as a collection of forward-looking actors whose behaviours combine, possibly with stochastic effects, to determine global variables (such as prices) in a dynamic equilibrium. However, standard semi-analytical techniques for solving these models make it difficul…

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: 11 pages, 5 figures

    MSC Class: 91B69

    ACM Class: I.2.8; I.6.5; J.4

  4. On the Generation, Structure, and Semantics of Grammar Patterns in Source Code Identifiers

    Authors: Christian D. Newman, Reem S. AlSuhaibani, Michael J. Decker, Anthony Peruma, Dishant Kaushik, Mohamed Wiem Mkaouer, Emily Hill

    Abstract: Identifiers make up a majority of the text in code. They are one of the most basic mediums through which developers describe the code they create and understand the code that others create. Therefore, understanding the patterns latent in identifier naming practices and how accurately we are able to automatically model these patterns is vital if researchers are to support developers and automated a…

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 69 pages, 3 figures, 16 tables

    Journal ref: Journal of Systems and Software, 2020, 110740, ISSN 0164-1212

  5. arXiv:1111.1224  [pdf, ps, other]

    math.NT cs.CC

    Counting Value Sets: Algorithm and Complexity

    Authors: Qi Cheng, Joshua E. Hill, Daqing Wan

    Abstract: Let $p$ be a prime. Given a polynomial in $\F_{p^m}[x]$ of degree $d$ over the finite field $\F_{p^m}$, one can view it as a map from $\F_{p^m}$ to $\F_{p^m}$, and examine the image of this map, also known as the value set. In this paper, we present the first non-trivial algorithm and the first complexity result on computing the cardinality of this value set. We show an elementary connection betwe…

    Submitted 4 November, 2011; originally announced November 2011.

    MSC Class: 11Yxx