
Showing 1–31 of 31 results for author: Yan, E

  1. arXiv:2407.00037  [pdf, ps, other]

    econ.TH

    Information About Other Players in Mechanism Design

    Authors: Eric Yan

    Abstract: We show the existence of mechanism design settings where the social planner has an interest in players receiving noisy signals about the types of other agents. When the social planner is interested only in partial implementation, any social choice rule that is incentive compatible after players receive additional information about other agents was originally incentive compatible prior to the chang…

    Submitted 28 May, 2024; originally announced July 2024.

    Comments: undergraduate thesis at Harvard College. comments welcome!

  2. arXiv:2403.15128  [pdf, other]

    cs.MA

    An Agent-Centric Perspective on Norm Enforcement and Sanctions

    Authors: Elena Yan, Luis G. Nardin, Jomi F. Hübner, Olivier Boissier

    Abstract: In increasingly autonomous and highly distributed multi-agent systems, centralized coordination becomes impractical and raises the need for governance and enforcement mechanisms from an agent-centric perspective. In our conceptual view, sanctioning norm enforcement is part of this agent-centric approach and they aim at promoting norm compliance while preserving agents' autonomy. The few works deal…

    Submitted 22 March, 2024; originally announced March 2024.

  3. arXiv:2307.00734  [pdf, other]

    physics.ao-ph cs.LG physics.flu-dyn

    On the choice of training data for machine learning of geostrophic mesoscale turbulence

    Authors: F. E. Yan, J. Mak, Y. Wang

    Abstract: 'Data' plays a central role in data-driven methods, but is not often the subject of focus in investigations of machine learning algorithms as applied to Earth System Modeling related problems. Here we consider the case of eddy-mean interaction in rotating stratified turbulence in the presence of lateral boundaries, a problem of relevance to ocean modeling, where the eddy fluxes contain dynamically…

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: 23 pages, 8 figures

  4. arXiv:2301.09196  [pdf, ps, other]

    math.NT math.PR

    Universality for Cokernels of Dedekind Domain Valued Random Matrices

    Authors: Eric Yan

    Abstract: We use the moment method of Wood to study the distribution of random finite modules over a countable Dedekind domain with finite quotients, generated by taking cokernels of random $n\times n$ matrices with entries valued in the domain. Previously, Wood found that when the entries of a random $n\times n$ integral matrix are not too concentrated modulo a prime, the asymptotic distribution (as…

    Submitted 10 June, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: 14 pages, no figures

  5. arXiv:2208.07634  [pdf, other]

    physics.ao-ph math.OC physics.geo-ph

    On constraining the mesoscale eddy energy dissipation time-scale

    Authors: Julian Mak, Alexandros Avdis, Tomos W. David, Han Seul Lee, Yongsu Na, Yan Wang, Fei Er Yan

    Abstract: A physically plausible lower bound on the spatially varying geostrophic mesoscale eddy energy dissipation time-scale within the ocean, related to the geographical energy transfer rate out of the geostrophic mesoscales, is provided by means of a simple and computationally inexpensive inverse calculation. Data diagnosed from a high resolution global configuration ocean simulation is supplied to a para…

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 27 pages, 8 figures, pre-print version (with minor updates to figures to reduce file size) submitted to J. Adv. Model. Earth Syst

  6. arXiv:2110.14819  [pdf, other]

    cs.CV cs.LG

    Characterizing and Taming Resolution in Convolutional Neural Networks

    Authors: Eddie Yan, Liang Luo, Luis Ceze

    Abstract: Image resolution has a significant effect on the accuracy and computational, storage, and bandwidth costs of computer vision model inference. These costs are exacerbated when scaling out models to large inference serving systems and make image resolution an attractive target for optimization. However, the choice of resolution inherently introduces additional tightly coupled choices, such as image…

    Submitted 27 October, 2021; originally announced October 2021.

  7. UoB at SemEval-2021 Task 5: Extending Pre-Trained Language Models to Include Task and Domain-Specific Information for Toxic Span Prediction

    Authors: Erik Yan, Harish Tayyar Madabushi

    Abstract: Toxicity is pervasive in social media and poses a major threat to the health of online communities. The recent introduction of pre-trained language models, which have achieved state-of-the-art results in many NLP tasks, has transformed the way in which we approach natural language processing. However, the inherent nature of pre-training means that they are unlikely to capture task-specific statist…

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Published in Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021); Code available at: https://github.com/erikdyan/toxic_span_detection

    Journal ref: 2021.semeval-1.28 (2021) 243-248

  8. arXiv:2109.04452  [pdf, other]

    cs.CL

    Analysis of Language Change in Collaborative Instruction Following

    Authors: Anna Effenberger, Eva Yan, Rhia Singh, Alane Suhr, Yoav Artzi

    Abstract: We analyze language change over time in a collaborative, goal-oriented instructional task, where utility-maximizing participants form conventions and increase their expertise. Prior work studied such scenarios mostly in the context of reference games, and consistently found that language complexity is reduced along multiple dimensions, such as utterance length, as conventions are formed. In contra…

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021 Short Paper

  9. arXiv:2101.05420  [pdf, other]

    math.CO

    The Determinant of $\{\pm 1\}$-Matrices and Oriented Hypergraphs

    Authors: Lucas J. Rusnak, Josephine Reynes, Russell Li, Eric Yan, Justin Yu

    Abstract: The determinants of $\{\pm 1\}$-matrices are calculated via the oriented hypergraphic Laplacian and summing over an incidence generalization of vertex cycle-covers. These cycle-covers are signed and partitioned into families based on their hyperedge containment. Every non-edge-monic family is shown to contribute a net value of $0$ to the Laplacian, while each edge-monic family is shown to sum t…

    Submitted 29 June, 2021; v1 submitted 13 January, 2021; originally announced January 2021.

    Comments: 17 pages, 11 figures

    MSC Class: 05C50; 05B20; 05C65; 05C22

  10. arXiv:2005.07722  [pdf, other]

    math.CO

    Oriented Hypergraphs: Balanceability

    Authors: Lucas J. Rusnak, Selena Li, Brian Xu, Eric Yan, Shirley Zhu

    Abstract: An oriented hypergraph is an oriented incidence structure that extends the concepts of signed graphs, balanced hypergraphs, and balanced matrices. We introduce hypergraphic structures and techniques that generalize the circuit classification of the signed graphic frame matroid to any oriented hypergraphic incidence matrix via its locally-signed-graphic substructure. To achieve this, Camion's algor…

    Submitted 15 May, 2020; originally announced May 2020.

    Comments: 19 pages, 9 figures

    MSC Class: 05C75 (Primary) 05C65; 05C22; 05C50; 05B35 (Secondary)

  11. arXiv:2004.12275  [pdf]

    cs.SI cs.DL

    Citation Cascade and the Evolution of Topic Relevance

    Authors: Chao Min, Qingyu Chen, Erjia Yan, Yi Bu, Jianjun Sun

    Abstract: Citation analysis, as a tool for quantitative studies of science, has long emphasized direct citation relations, leaving indirect or high order citations overlooked. However, a series of early and recent studies demonstrate the existence of indirect and continuous citation impact across generations. Adding to the literature on high order citations, we introduce the concept of a citation cascade: t…

    Submitted 25 April, 2020; originally announced April 2020.

  12. arXiv:2003.08773  [pdf, other]

    cs.CV cs.LG eess.IV stat.ML

    Do CNNs Encode Data Augmentations?

    Authors: Eddie Yan, Yanping Huang

    Abstract: Data augmentations are important ingredients in the recipe for training robust neural networks, especially in computer vision. A fundamental question is whether neural network features encode data augmentation transformations. To answer this question, we introduce a systematic approach to investigate which layers of neural networks are the most predictive of augmentation transformations. Our appro…

    Submitted 27 October, 2021; v1 submitted 28 February, 2020; originally announced March 2020.

    MSC Class: 68T45

  13. Nine Million Book Items and Eleven Million Citations: A Study of Book-Based Scholarly Communication Using OpenCitations

    Authors: Yongjun Zhu, Erjia Yan, Silvio Peroni, Chao Che

    Abstract: Books have been widely used to share information and contribute to human knowledge. However, the quantitative use of books as a method of scholarly communication is relatively unexamined compared to journal articles and conference papers. This study uses the COCI dataset (a comprehensive open citation dataset provided by OpenCitations) to explore books' roles in scholarly communication. The COCI d…

    Submitted 6 December, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

  14. arXiv:1901.04993  [pdf]

    cs.IR cs.LG stat.AP stat.ML

    Large-Scale Joint Topic, Sentiment & User Preference Analysis for Online Reviews

    Authors: Xinli Yu, Zheng Chen, Wei-Shih Yang, Xiaohua Hu, Erjia Yan

    Abstract: This paper presents a non-trivial reconstruction of a previous joint topic-sentiment-preference review model TSPRA with stick-breaking representation under the framework of variational inference (VI) and stochastic variational inference (SVI). TSPRA is a Gibbs Sampling based model that solves topics, word sentiments and user preferences altogether and has been shown to achieve good performance, bu…

    Submitted 14 January, 2019; originally announced January 2019.

  15. arXiv:1812.09387  [pdf]

    cs.LG stat.ML

    Correlated Anomaly Detection from Large Streaming Data

    Authors: Zheng Chen, Xinli Yu, Yuan Ling, Bo Song, Wei Quan, Xiaohua Hu, Erjia Yan

    Abstract: Correlated anomaly detection (CAD) from streaming data is a type of group anomaly detection and an essential task in useful real-time data mining applications like botnet detection, financial event detection, industrial process monitor, etc. The primary approach for this type of detection in previous researches is based on principal score (PS) of divided batches or sliding windows by computing top…

    Submitted 14 January, 2019; v1 submitted 19 December, 2018; originally announced December 2018.

  16. arXiv:1812.07810  [pdf]

    cs.LG cs.CR math.NA stat.ML

    Fast Botnet Detection From Streaming Logs Using Online Lanczos Method

    Authors: Zheng Chen, Xinli Yu, Chi Zhang, ** Zhang, Cui Lin, Bo Song, Jianliang Gao, Xiaohua Hu, Wei-Shih Yang, Erjia Yan

    Abstract: Botnet, a group of coordinated bots, is becoming the main platform of malicious Internet activities like DDOS, click fraud, web scraping, spam/rumor distribution, etc. This paper focuses on design and experiment of a new approach for botnet detection from streaming web server logs, motivated by its wide applicability, real-time protection capability, ease of use and better security of sensitive da…

    Submitted 19 December, 2018; originally announced December 2018.

  17. Challenges of measuring the impact of software: an examination of the lme4 R package

    Authors: Kai Li, Pei-Ying Chen, Erjia Yan

    Abstract: The rise of software as a research object is mirrored in the increasing interests towards quantitative studies of scientific software. However, due to the inconsistent practice of citing software, most of the existing studies analyzing the impact of scientific software are based on identification of software name mentions in full-text publications. Despite its limitations, citation data have a muc…

    Submitted 27 November, 2018; originally announced November 2018.

  18. arXiv:1807.04188  [pdf, other]

    cs.LG cs.DC stat.ML

    A Hardware-Software Blueprint for Flexible Deep Learning Specialization

    Authors: Thierry Moreau, Tianqi Chen, Luis Vega, Jared Roesch, Eddie Yan, Lianmin Zheng, Josh Fromm, Ziheng Jiang, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: Specialized Deep Learning (DL) acceleration stacks, designed for a specific set of frameworks, model architectures, operators, and data types, offer the allure of high performance while sacrificing flexibility. Changes in algorithms, models, operators, or numerical systems threaten the viability of specialized hardware accelerators. We propose VTA, a programmable deep learning architecture templat…

    Submitted 22 April, 2019; v1 submitted 11 July, 2018; originally announced July 2018.

    Comments: 6 pages plus references, 8 figures

  19. arXiv:1805.08166  [pdf, other]

    cs.LG stat.ML

    Learning to Optimize Tensor Programs

    Authors: Tianqi Chen, Lianmin Zheng, Eddie Yan, Ziheng Jiang, Thierry Moreau, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: We introduce a learning-based framework to optimize tensor programs for deep learning workloads. Efficient implementations of tensor operators, such as matrix multiplication and high dimensional convolution, are key enablers of effective deep learning systems. However, existing systems rely on manually optimized libraries such as cuDNN where only a narrow range of server class GPUs are well-suppor…

    Submitted 8 January, 2019; v1 submitted 21 May, 2018; originally announced May 2018.

    Comments: NeurIPS 2018

  20. arXiv:1802.04799  [pdf, other]

    cs.LG cs.AI cs.PL

    TVM: An Automated End-to-End Optimizing Compiler for Deep Learning

    Authors: Tianqi Chen, Thierry Moreau, Ziheng Jiang, Lianmin Zheng, Eddie Yan, Meghan Cowan, Haichen Shen, Leyuan Wang, Yuwei Hu, Luis Ceze, Carlos Guestrin, Arvind Krishnamurthy

    Abstract: There is an increasing need to bring machine learning to a wide diversity of hardware devices. Current frameworks rely on vendor-specific operator libraries and optimize for a narrow range of server-class GPUs. Deploying workloads to new platforms -- such as mobile phones, embedded devices, and accelerators (e.g., FPGAs, ASICs) -- requires significant manual effort. We propose TVM, a compiler that…

    Submitted 5 October, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Significantly improved version, add automated optimization

  21. arXiv:1612.03231  [pdf]

    cs.IR cs.CL

    A natural language interface to a graph-based bibliographic information retrieval system

    Authors: Yongjun Zhu, Erjia Yan, Il-Yeol Song

    Abstract: With the ever-increasing scientific literature, there is a need for a natural language interface to bibliographic information retrieval systems to retrieve related information effectively. In this paper, we propose a natural language interface, NLI-GIBIR, to a graph-based bibliographic information retrieval system. In designing NLI-GIBIR, we developed a novel framework that can be applicable to gra…

    Submitted 9 December, 2016; originally announced December 2016.

  22. arXiv:1503.06664  [pdf]

    cond-mat.other cond-mat.mtrl-sci

    Bit Patterned Magnetic Recording: Theory, Media Fabrication, and Recording Performance

    Authors: Thomas R. Albrecht, Hitesh Arora, Vipin Ayanoor-Vitikkate, Jean-Marc Beaujour, Daniel Bedau, David Berman, Alexei L. Bogdanov, Yves-Andre Chapuis, Julia Cushen, Elizabeth E. Dobisz, Gregory Doerk, He Gao, Michael Grobis, Bruce Gurney, Weldon Hanson, Olav Hellwig, Toshiki Hirano, Pierre-Olivier Jubert, Dan Kercher, Jeffrey Lille, Zuwei Liu, C. Mathew Mate, Yuri Obukhov, Kanaiyalal C. Patel, Kurt Rubin, et al. (6 additional authors not shown)

    Abstract: Bit Patterned Media (BPM) for magnetic recording provides a route to densities $>1 Tb/in^2$ and circumvents many of the challenges associated with conventional granular media technology. Instead of recording a bit on an ensemble of random grains, BPM uses an array of lithographically defined isolated magnetic islands, each of which stores one bit. Fabrication of BPM is viewed as the greatest challe…

    Submitted 19 March, 2015; originally announced March 2015.

    Comments: 44 pages

    ACM Class: B.3.2; B.4.2

  23. arXiv:1309.2546  [pdf]

    cs.DL

    Finding knowledge paths among scientific disciplines

    Authors: Erjia Yan

    Abstract: This paper discovers patterns of knowledge dissemination among scientific disciplines. While the transfer of knowledge is largely unobservable, citations from one discipline to another have been proven to be an effective proxy to study disciplinary knowledge flow. This study constructs a knowledge flow network in that a node represents a Journal Citation Report subject category and a link denotes…

    Submitted 10 September, 2013; originally announced September 2013.

    Comments: 31 pages, 12 figures

  24. Entitymetrics: Measuring the Impact of Entities

    Authors: Ying Ding, Min Song, Jia Han, Qi Yu, Erjia Yan, Lili Lin, Tamy Chambers

    Abstract: This paper proposes entitymetrics to measure the impact of knowledge units. Entitymetrics highlight the importance of entities embedded in scientific literature for further knowledge discovery. In this paper, we use Metformin, a drug for diabetes, as an example to form an entity-entity citation network based on literature related to Metformin. We then calculate the network features and compare the…

    Submitted 10 September, 2013; originally announced September 2013.

    Journal ref: PLOS ONE 8(8): e71416, 2013

  25. arXiv:1211.5820  [pdf]

    cs.DL

    A bird's-eye view of scientific trading: Dependency relations among fields of science

    Authors: Erjia Yan, Ying Ding, Blaise Cronin, Loet Leydesdorff

    Abstract: We use a trading metaphor to study knowledge transfer in the sciences as well as the social sciences. The metaphor comprises four dimensions: (a) Discipline Self-dependence, (b) Knowledge Exports/Imports, (c) Scientific Trading Dynamics, and (d) Scientific Trading Impact. This framework is applied to a dataset of 221 Web of Science subject categories. We find that: (i) the Scientific Trading Impac…

    Submitted 25 November, 2012; originally announced November 2012.

  26. arXiv:1105.3212  [pdf]

    cs.DL

    A recursive field-normalized bibliometric performance indicator: An application to the field of library and information science

    Authors: Ludo Waltman, Erjia Yan, Nees Jan van Eck

    Abstract: Two commonly used ideas in the development of citation-based research performance indicators are the idea of normalizing citation counts based on a field classification scheme and the idea of recursive citation weighing (like in PageRank-inspired indicators). We combine these two ideas in a single indicator, referred to as the recursive mean normalized citation score indicator, and we study the va…

    Submitted 16 May, 2011; originally announced May 2011.

  27. arXiv:1012.4876  [pdf]

    cs.DL

    Weighted citation: An indicator of an article's prestige

    Authors: Erjia Yan, Ying Ding

    Abstract: We propose using the technique of weighted citation to measure an article's prestige. The technique allocates a different weight to each reference by taking into account the impact of citing journals and citation time intervals. Weighted citation captures prestige, whereas citation counts capture popularity. We compare the value variances for popularity and prestige for articles published in the J…

    Submitted 21 December, 2010; originally announced December 2010.

    Comments: 17 pages, 6 figures

  28. arXiv:1012.4875  [pdf]

    cs.IR cs.SI

    Upper Tag Ontology (UTO) For Integrating Social Tagging Data

    Authors: Ying Ding, Elin K. Jacob, Michael Fried, Ioan Toma, Erjia Yan, Schubert Foo

    Abstract: Data integration and mediation have become central concerns of information technology over the past few decades. With the advent of the Web and the rapid increases in the amount of data and the number of Web documents and users, researchers have focused on enhancing the interoperability of data through the development of metadata schemes. Other researchers have looked to the wealth of metadata gen…

    Submitted 21 December, 2010; originally announced December 2010.

    Comments: 31 pages, 7 figures

  29. arXiv:1012.4872  [pdf]

    cs.DL

    PageRank for ranking authors in co-citation networks

    Authors: Ying Ding, Erjia Yan, Arthur Frazho, James Caverlee

    Abstract: Google's PageRank has created a new synergy to information retrieval for a better ranking of Web pages. It ranks documents depending on the topology of the graphs and the weights of the nodes. PageRank has significantly advanced the field of information retrieval and keeps Google ahead of competitors in the search engine market. It has been deployed in bibliometrics to evaluate research impact, ye…

    Submitted 21 December, 2010; originally announced December 2010.

    Comments: 19 pages, 7 figures

  30. arXiv:1012.4870  [pdf]

    cs.DL

    Discovering author impact: A PageRank perspective

    Authors: Erjia Yan, Ying Ding

    Abstract: This article provides an alternative perspective for measuring author impact by applying PageRank algorithm to a coauthorship network. A weighted PageRank algorithm considering citation and coauthorship network topology is proposed. We test this algorithm under different damping factors by evaluating author impact in the informetrics research community. In addition, we also compare this weighted P…

    Submitted 21 December, 2010; originally announced December 2010.

    Comments: 17 pages, 5 figures

  31. arXiv:1012.4862  [pdf]

    cs.DL

    Applying centrality measures to impact analysis: A coauthorship network analysis

    Authors: Erjia Yan, Ying Ding

    Abstract: Many studies on coauthorship networks focus on network topology and network statistical mechanics. This article takes a different approach by studying micro-level network properties, with the aim to apply centrality measures to impact analysis. Using coauthorship data from 16 journals in the field of library and information science (LIS) with a time span of twenty years (1988-2007), we construct a…

    Submitted 21 December, 2010; originally announced December 2010.

    Comments: 17 pages, 4 figures