Showing 1–19 of 19 results for author: Mars, J

  1. arXiv:2405.08965 [pdf, other]

    cs.PL cs.AI

    LLMs are Meaning-Typed Code Constructs

    Authors: Jason Mars, Yiping Kang, Jayanaka Dantanarayana, Chandra Irugalbandara, Kugesan Sivasothynathan, Lingjia Tang

    Abstract: Programming with Generative AI (GenAI) models is a type of Neurosymbolic programming and has seen tremendous adoption across many domains. However, leveraging GenAI models in code today can be complex, counter-intuitive and often require specialized frameworks, leading to increased complexity. This is because it is currently unclear as to the right abstractions through which we should marry GenAI…

    Submitted 14 May, 2024; originally announced May 2024.

  2. arXiv:2405.03832 [pdf, other]

    cs.CL cs.AI

    Guylingo: The Republic of Guyana Creole Corpora

    Authors: Christopher Clarke, Roland Daynauth, Charlene Wilkinson, Hubert Devonish, Jason Mars

    Abstract: While major languages often enjoy substantial attention and resources, the linguistic diversity across the globe encompasses a multitude of smaller, indigenous, and regional languages that lack the same level of computational support. One such region is the Caribbean. While commonly labeled as "English speaking", the ex-British Caribbean region consists of a myriad of Creole languages thriving alo…

    Submitted 2 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024 Main Conference Special Theme Track: Languages of Latin America and The Caribbean

  3. arXiv:2401.07123 [pdf, other]

    cs.HC cs.CL

    One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI

    Authors: Christopher Clarke, Karthik Krishnamurthy, Walter Talamonti, Yiping Kang, Lingjia Tang, Jason Mars

    Abstract: Conversational agents have been gaining increasing popularity in recent years. Influenced by the widespread adoption of task-oriented agents such as Apple Siri and Amazon Alexa, these agents are being deployed into various applications to enhance user experience. Although these agents promote "ask me anything" functionality, they are typically built to focus on a single or finite set of expertise.…

    Submitted 13 January, 2024; originally announced January 2024.

  4. arXiv:2312.14972 [pdf, other]

    cs.SE cs.AI cs.LG

    Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production

    Authors: Chandra Irugalbandara, Ashish Mahendra, Roland Daynauth, Tharuka Kasthuri Arachchige, Jayanaka Dantanarayana, Krisztian Flautner, Lingjia Tang, Yiping Kang, Jason Mars

    Abstract: Many companies use large language models (LLMs) offered as a service, like OpenAI's GPT-4, to create AI-enabled product experiences. Along with the benefits of ease-of-use and shortened time-to-solution, this reliance on proprietary services has downsides in model control, performance reliability, uptime predictability, and cost. At the same time, a flurry of open-source small language models (SLM…

    Submitted 16 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: Updated title, Revised content

    Journal ref: ISPASS-2024: 2024 IEEE International Symposium on Performance Analysis of Systems and Software

  5. arXiv:2307.12935 [pdf, other]

    cs.CL cs.AI

    Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

    Authors: Christopher Clarke, Matthew Hall, Gaurav Mittal, Ye Yu, Sandra Sajeev, Jason Mars, Mei Chen

    Abstract: Classic approaches to content moderation typically apply a rule-based heuristic approach to flag content. While rules are easily customizable and intuitive for humans to interpret, they are inherently fragile and lack the flexibility or robustness needed to moderate the vast amount of undesirable content found online today. Recent advances in deep learning have demonstrated the promise of using hi…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: ACL 2023 Main Conference

  6. arXiv:2305.16521 [pdf, other]

    cs.CL cs.LG

    Label Agnostic Pre-training for Zero-shot Text Classification

    Authors: Christopher Clarke, Yuzhao Heng, Yiping Kang, Krisztian Flautner, Lingjia Tang, Jason Mars

    Abstract: Conventional approaches to text classification typically assume the existence of a fixed set of predefined labels to which a given text can be classified. However, in real-world applications, there exists an infinite label space for describing a given text. In addition, depending on the aspect (sentiment, topic, etc.) and domain of the text (finance, legal, etc.), the interpretation of the label c…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  7. arXiv:2305.09864 [pdf, other]

    cs.CL cs.DC cs.PL cs.SE

    The Jaseci Programming Paradigm and Runtime Stack: Building Scale-out Production Applications Easy and Fast

    Authors: Jason Mars, Yiping Kang, Roland Daynauth, Baichuan Li, Ashish Mahendra, Krisztian Flautner, Lingjia Tang

    Abstract: Today's production scale-out applications include many sub-application components, such as storage backends, logging infrastructure and AI models. These components have drastically different characteristics, are required to work in collaboration, and interface with each other as microservices. This leads to increasingly high complexity in developing, optimizing, configuring, and deploying scale-ou…

    Submitted 16 May, 2023; originally announced May 2023.

  8. arXiv:2206.08434 [pdf, other]

    cs.DC cs.AI cs.AR cs.PL

    The Case for a Wholistic Serverless Programming Paradigm and Full Stack Automation for AI and Beyond -- The Philosophy of Jaseci and Jac

    Authors: Jason Mars

    Abstract: In this work, the case is made for a wholistic top-down re-envisioning of the system stack from the programming language level down through the system architecture to bridge this complexity gap. The key goal of our design is to address the critical need for the programmer to articulate solutions with higher level abstractions at the problem level while having the runtime system stack subsume and h…

    Submitted 16 June, 2022; originally announced June 2022.

  9. arXiv:2203.07665 [pdf, other]

    cs.CL cs.AI cs.IR

    One Agent To Rule Them All: Towards Multi-agent Conversational AI

    Authors: Christopher Clarke, Joseph Joshua Peper, Karthik Krishnamurthy, Walter Talamonti, Kevin Leach, Walter Lasecki, Yiping Kang, Lingjia Tang, Jason Mars

    Abstract: The increasing volume of commercially available conversational agents (CAs) on the market has resulted in users being burdened with learning and adopting multiple agents to accomplish their tasks. Though prior work has explored supporting a multitude of domains within the design of a single agent, the interaction experience suffers due to the large action space of desired capabilities. To address…

    Submitted 15 March, 2022; originally announced March 2022.

  10. arXiv:2203.06668 [pdf, other]

    cs.CL cs.AI

    Towards Personalized Intelligence at Scale

    Authors: Yiping Kang, Ashish Mahendra, Christopher Clarke, Lingjia Tang, Jason Mars

    Abstract: Personalized Intelligence (PI) is the problem of providing customized AI experiences tailored to each individual user. In many applications, PI is preferred or even required. Existing personalization approaches involve fine-tuning pre-trained models to create new customized models. However, these approaches require a significant amount of computation to train, scaling with model size and the numbe…

    Submitted 13 March, 2022; originally announced March 2022.

  11. arXiv:2006.13378 [pdf, other]

    cs.DC cs.GR

    A Benchmarking Framework for Interactive 3D Applications in the Cloud

    Authors: Tianyi Liu, Sen He, Sunzhou Huang, Danny Tsang, Lingjia Tang, Jason Mars, Wei Wang

    Abstract: With the growing popularity of cloud gaming and cloud virtual reality (VR), interactive 3D applications have become a major type of workloads for the cloud. However, despite their growing importance, there is limited public research on how to design cloud systems to efficiently support these applications, due to the lack of an open and reliable research infrastructure, including benchmarks and per…

    Submitted 2 August, 2020; v1 submitted 23 June, 2020; originally announced June 2020.

  12. arXiv:1909.02027 [pdf, other]

    cs.CL cs.AI cs.LG

    An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction

    Authors: Stefan Larson, Anish Mahendran, Joseph J. Peper, Christopher Clarke, Andrew Lee, Parker Hill, Jonathan K. Kummerfeld, Kevin Leach, Michael A. Laurenzano, Lingjia Tang, Jason Mars

    Abstract: Task-oriented dialog systems need to know when a query falls outside their range of supported intents, but current text classification corpora only define label sets that cover every example. We introduce a new dataset that includes queries that are out-of-scope---i.e., queries that do not fall into any of the system's supported intents. This poses a new challenge because models cannot assume that…

    Submitted 4 September, 2019; originally announced September 2019.

    Comments: Accepted to EMNLP-IJCNLP 2019

  13. arXiv:1904.03122 [pdf, other]

    cs.CL

    Outlier Detection for Improved Data Quality and Diversity in Dialog Systems

    Authors: Stefan Larson, Anish Mahendran, Andrew Lee, Jonathan K. Kummerfeld, Parker Hill, Michael A. Laurenzano, Johann Hauswald, Lingjia Tang, Jason Mars

    Abstract: In a corpus of data, outliers are either errors: mistakes in the data that are counterproductive, or are unique: informative samples that improve model robustness. Identifying outliers can lead to better datasets by (1) removing noise in datasets and (2) guiding collection of additional data to fill gaps. However, the problem of detecting both outlier types has received relatively little attention…

    Submitted 5 April, 2019; originally announced April 2019.

    Comments: Accepted as long paper to NAACL 2019

  14. arXiv:1808.02513 [pdf, other]

    cs.LG stat.ML

    Rethinking Numerical Representations for Deep Neural Networks

    Authors: Parker Hill, Babak Zamirai, Shengshuo Lu, Yu-Wei Chao, Michael Laurenzano, Mehrzad Samadi, Marios Papaefthymiou, Scott Mahlke, Thomas Wenisch, Jia Deng, Lingjia Tang, Jason Mars

    Abstract: With ever-increasing computational demand for deep learning, it is critical to investigate the implications of the numeric representation and precision of DNN model weights and activations on computational efficiency. In this work, we explore unconventional narrow-precision floating-point representations as it relates to inference accuracy and efficiency to steer the improved design of future DNN…

    Submitted 7 August, 2018; originally announced August 2018.

  15. arXiv:1704.05733 [pdf, ps, other]

    cond-mat.soft cond-mat.mes-hall physics.chem-ph

    Salt-induced microheterogeneities in binary liquid mixtures

    Authors: Markus Bier, Julian Mars, Hailong Li, Markus Mezger

    Abstract: The salt-induced microheterogeneity (MH) formation in binary liquid mixtures is studied by small-angle X-ray scattering (SAXS) and liquid state theory. Previous experiments have shown that this phenomenon occurs for antagonistic salts, whose cations and anions prefer different components of the solvent mixture. However, so far the precise mechanism leading to the characteristic length scale of MHs…

    Submitted 4 August, 2017; v1 submitted 19 April, 2017; originally announced April 2017.

    Journal ref: Phys. Rev. E 96, 022603 (2017)

  16. arXiv:1604.03450 [pdf, ps, other]

    physics.data-an cs.IT

    A Noise-Robust Method with Smoothed \ell_1/\ell_2 Regularization for Sparse Moving-Source Mapping

    Authors: Mai Quyen Pham, Benoit Oudompheng, Jérôme I. Mars, Barbara Nicolas

    Abstract: The method described here performs blind deconvolution of the beamforming output in the frequency domain. To provide accurate blind deconvolution, sparsity priors are introduced with a smooth \ell_1/\ell_2 regularization term. As the mean of the noise in the power spectrum domain is dependent on its variance in the time domain, the proposed method includes a variance estimation step, which allows…

    Submitted 1 April, 2016; originally announced April 2016.

  17. arXiv:1303.0742 [pdf, ps, other]

    cs.LG q-bio.NC stat.ML

    Multivariate Temporal Dictionary Learning for EEG

    Authors: Quentin Barthélemy, Cédric Gouy-Pailler, Yoann Isaac, Antoine Souloumiac, Anthony Larue, Jérôme I. Mars

    Abstract: This article addresses the issue of representing electroencephalographic (EEG) signals in an efficient way. While classical approaches use a fixed Gabor dictionary to analyze EEG signals, this article proposes a data-driven method to obtain an adapted dictionary. To reach an efficient dictionary learning, appropriate spatial and temporal modeling is required. Inter-channels links are taken into ac…

    Submitted 4 March, 2013; originally announced March 2013.

    Journal ref: Published in Journal of Neuroscience Methods, vol. 215, pp. 19-28, 2013

  18. Searching for thermal signatures of persistent currents in normal metal rings

    Authors: Germain Souche, Julien Huillery, H. Pothier, Philippe Gandit, Jerome I. Mars, Sergey Skipetrov, Olivier Bourgeois

    Abstract: We introduce a calorimetric approach to probe persistent currents in normal metal rings. The heat capacity of a large ensemble of silver rings is measured by nanocalorimetry under a varying magnetic field at different temperatures (60 mK, 100 mK and 150 mK). Periodic oscillations versus magnetic field are detected in the phase signal of the temperature oscillations, though not in the amplitude (bo…

    Submitted 18 March, 2013; v1 submitted 18 January, 2013; originally announced January 2013.

    Journal ref: Physical Review B (Condensed Matter) 87 (2013) 115120

  19. Fine frequency shift of sigle vortex entrance and exit in superconducting loops

    Authors: F. R. Ong, Olivier Bourgeois, S. E. Skipetrov, J. Chaussy, Simona Popa, Jérôme Mars, Jean-Louis Lacoume

    Abstract: The heat capacity $C_{p}$ of an array of independent aluminum rings has been measured under an external magnetic field $\vec{H}$ using highly sensitive ac-calorimetry based on a silicon membrane sensor. Each superconducting vortex entrance induces a phase transition and a heat capacity jump and hence $C_{p}$ oscillates with $\vec{H}$. This oscillatory and non-stationary behaviour measured versus…

    Submitted 22 August, 2007; originally announced August 2007.