Skip to main content

Showing 1–12 of 12 results for author: MacMillan, K

.
  1. arXiv:2306.04751  [pdf, other

    cs.CL

    How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources

    Authors: Yizhong Wang, Hamish Ivison, Pradeep Dasigi, Jack Hessel, Tushar Khot, Khyathi Raghavi Chandu, David Wadden, Kelsey MacMillan, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

    Abstract: In this work we explore recent advances in instruction-tuning language models on a range of open instruction-following datasets. Despite recent claims that open models can be on par with state-of-the-art proprietary models, these claims are often accompanied by limited evaluation, making it difficult to compare models across the board and determine the utility of various resources. We provide a la… ▽ More

    Submitted 30 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 18 pages, 6 figure, 10 tables. NeurIPS 2023 Datasets and Benchmarks Track Camera Ready

  2. arXiv:2303.14334  [pdf, other

    cs.HC cs.AI cs.CL

    The Semantic Reader Project: Augmenting Scholarly Documents through AI-Powered Interactive Reading Interfaces

    Authors: Kyle Lo, Joseph Chee Chang, Andrew Head, Jonathan Bragg, Amy X. Zhang, Cassidy Trier, Chloe Anastasiades, Tal August, Russell Authur, Danielle Bragg, Erin Bransom, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Yen-Sung Chen, Evie Yu-Yen Cheng, Yvonne Chou, Doug Downey, Rob Evans, Raymond Fok, Fangzhou Hu, Regan Huff, Dongyeop Kang, Tae Soo Kim, Rodney Kinney , et al. (30 additional authors not shown)

    Abstract: Scholarly publications are key to the transfer of knowledge from scholars to others. However, research papers are information-dense, and as the volume of the scientific literature grows, the need for new technology to support the reading process grows. In contrast to the process of finding papers, which has been transformed by Internet technology, the experience of reading research papers has chan… ▽ More

    Submitted 23 April, 2023; v1 submitted 24 March, 2023; originally announced March 2023.

  3. arXiv:2301.10140  [pdf, other

    cs.DL cs.CL

    The Semantic Scholar Open Data Platform

    Authors: Rodney Kinney, Chloe Anastasiades, Russell Authur, Iz Beltagy, Jonathan Bragg, Alexandra Buraczynski, Isabel Cachola, Stefan Candra, Yoganand Chandrasekhar, Arman Cohan, Miles Crawford, Doug Downey, Jason Dunkelberger, Oren Etzioni, Rob Evans, Sergey Feldman, Joseph Gorney, David Graham, Fangzhou Hu, Regan Huff, Daniel King, Sebastian Kohlmeier, Bailey Kuehl, Michael Langan, Daniel Lin , et al. (23 additional authors not shown)

    Abstract: The volume of scientific output is creating an urgent need for automated tools to help scientists keep up with developments in their field. Semantic Scholar (S2) is an open data platform and website aimed at accelerating science by hel** scholars discover and understand scientific literature. We combine public and proprietary data sources using state-of-the-art techniques for scholarly PDF conte… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

    Comments: 8 pages, 6 figures

  4. A Comparative Analysis of Ookla Speedtest and Measurement Labs Network Diagnostic Test (NDT7)

    Authors: Kyle MacMillan, Tarun Mangla, James Saxon, Nicole P. Marwell, Nick Feamster

    Abstract: Consumers, regulators, and ISPs all use client-based "speed tests" to measure network performance, both in single-user settings and in aggregate. Two prevalent speed tests, Ookla's Speedtest and Measurement Lab's Network Diagnostic Test (NDT), are often used for similar purposes, despite having significant differences in both the test design and implementation, and in the infrastructure used to pe… ▽ More

    Submitted 25 January, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

  5. arXiv:2110.15345  [pdf, other

    cs.NI

    Measuring the Consolidation of DNS and Web Hosting Providers

    Authors: Synthia Wang, Kyle MacMillan, Brennan Schaffner, Nick Feamster, Marshini Chetty

    Abstract: Despite the Internet's continued growth, it increasingly depends on a small set of service providers to support Domain Name System (DNS) and web content hosting. This trend poses many potential threats including susceptibility to outages, failures, and potential censorship by providers. This paper aims to quantify consolidation in terms of popular domains' reliance on a small set of organizations… ▽ More

    Submitted 30 January, 2024; v1 submitted 28 October, 2021; originally announced October 2021.

  6. arXiv:2105.13478  [pdf, other

    cs.NI

    Measuring the Performance and Network Utilization of Popular Video Conferencing Applications

    Authors: Kyle MacMillan, Tarun Mangla, James Saxon, Nick Feamster

    Abstract: Video conferencing applications (VCAs) have become a critical Internet application, even more so during the COVID-19 pandemic, as users worldwide now rely on them for work, school, and telehealth. It is thus increasingly important to understand the resource requirements of different VCAs and how they perform under different network conditions, including: how much speed (upstream and downstream thr… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  7. arXiv:2008.03254  [pdf, other

    cs.CR cs.NI

    Evaluating Snowflake as an Indistinguishable Censorship Circumvention Tool

    Authors: Kyle MacMillan, Jordan Holland, Prateek Mittal

    Abstract: Tor is the most well-known tool for circumventing censorship. Unfortunately, Tor traffic has been shown to be detectable using deep-packet inspection. WebRTC is a popular web frame-work that enables browser-to-browser connections. Snowflake is a novel pluggable transport that leverages WebRTC to connect Tor clients to the Tor network. In theory, Snowflake was created to be indistinguishable from o… ▽ More

    Submitted 14 October, 2020; v1 submitted 23 July, 2020; originally announced August 2020.

  8. Primary Pseudoperfect Numbers, Arithmetic Progressions, and the Erdős-Moser Equation

    Authors: Jonathan Sondow, Kieren MacMillan

    Abstract: A primary pseudoperfect number (PPN) is an integer $K > 1$ such that the reciprocals of $K$ and its prime factors sum to 1. PPNs arise in studying perfectly weighted graphs and singularities of algebraic surfaces, and are related to Sylvester's sequence, Giuga numbers, Znám's problem, the inheritance problem, and Curtiss's bound on solutions of a unit fraction equation. Here we show… ▽ More

    Submitted 16 December, 2018; originally announced December 2018.

    Comments: 7 pages, 1 table

    MSC Class: 11D68; 11A41

    Journal ref: Amer. Math. Monthly 124 (2017) 232-240

  9. arXiv:1706.05084  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Topic supervised non-negative matrix factorization

    Authors: Kelsey MacMillan, James D. Wilson

    Abstract: Topic models have been extensively used to organize and interpret the contents of large, unstructured corpora of text documents. Although topic models often perform well on traditional training vs. test set evaluations, it is often the case that the results of a topic model do not align with human interpretation. This interpretability fallacy is largely due to the unsupervised nature of topic mode… ▽ More

    Submitted 2 July, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

  10. arXiv:1011.2154  [pdf, ps, other

    math.NT

    Reducing the Erdos-Moser equation 1^n + 2^n + . . . + k^n = (k+1)^n modulo k and k^2

    Authors: Jonathan Sondow, Kieren MacMillan

    Abstract: An open conjecture of Erdos and Moser is that the only solution of the Diophantine equation in the title is the trivial solution 1+2=3. Reducing the equation modulo k and k^2, we give necessary and sufficient conditions on solutions to the resulting congruence and supercongruence. A corollary is a new proof of Moser's result that the conjecture is true for odd exponents n. We also connect solution… ▽ More

    Submitted 9 November, 2010; originally announced November 2010.

    Comments: 10 pages, 2 tables, submitted for publication

    MSC Class: 11D61 (Primary); 11D79; 11A41 (Secondary)

    Journal ref: Integers 11 (2011), article #A34

  11. arXiv:1011.0076  [pdf, ps, other

    math.NT math.HO

    Proofs of power sum and binomial coefficient congruences via Pascal's identity

    Authors: Kieren MacMillan, Jonathan Sondow

    Abstract: A frequently cited theorem says that for n > 0 and prime p, the sum of the first p n-th powers is congruent to -1 modulo p if p-1 divides n, and to 0 otherwise. We survey the main ingredients in several known proofs. Then we give an elementary proof, using an identity for power sums proven by Pascal in 1654. An application is a simple proof of a congruence for certain sums of binomial coefficients… ▽ More

    Submitted 30 October, 2010; originally announced November 2010.

    Comments: 4 pages, to appear in Amer. Math. Monthly

    MSC Class: 11A07 (Primary); 11B65 (Secondary)

    Journal ref: Amer. Math. Monthly 118 (2011) 549-551

  12. arXiv:1010.2275  [pdf, ps, other

    math.NT

    Divisibility of Power Sums and the Generalized Erdos-Moser Equation

    Authors: Kieren MacMillan, Jonathan Sondow

    Abstract: Using elementary methods, we determine the highest power of 2 dividing a power sum 1^n + 2^n + . . . + m^n, generalizing Lengyel's formula for the case where m is itself a power of 2. An application is a simple proof of Moree's result that, if (a,m,n) is any solution of the generalized Erdos-Moser Diophantine equation 1^n + 2^n + . . . + (m-1)^n = am^n, then m is odd.

    Submitted 19 May, 2011; v1 submitted 11 October, 2010; originally announced October 2010.

    Comments: 4 pages, simplified proof of Proposition 1, added reference [4]

    MSC Class: 11D79 (Primary) 11D61 (Secondary)

    Journal ref: Elemente der Mathematik 67 (2012) 182-186