Skip to main content

Showing 1–8 of 8 results for author: Marginean, A

.
  1. arXiv:2402.09171  [pdf, other

    cs.SE

    Automated Unit Test Improvement using Large Language Models at Meta

    Authors: Nadia Alshahwan, Jubin Chheda, Anastasia Finegenova, Beliz Gokkaya, Mark Harman, Inna Harper, Alexandru Marginean, Shubho Sengupta, Eddy Wang

    Abstract: This paper describes Meta's TestGen-LLM tool, which uses LLMs to automatically improve existing human-written tests. TestGen-LLM verifies that its generated test classes successfully clear a set of filters that assure measurable improvement over the original test suite, thereby eliminating problems due to LLM hallucination. We describe the deployment of TestGen-LLM at Meta test-a-thons for the Ins… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 12 pages, 8 figures, 32nd ACM Symposium on the Foundations of Software Engineering (FSE 24)

  2. arXiv:2402.06111  [pdf, other

    cs.SE

    Observation-based unit test generation at Meta

    Authors: Nadia Alshahwan, Mark Harman, Alexandru Marginean, Rotem Tal, Eddy Wang

    Abstract: TestGen automatically generates unit tests, carved from serialized observations of complex objects, observed during app execution. We describe the development and deployment of TestGen at Meta. In particular, we focus on the scalability challenges overcome during development in order to deploy observation-based test carving at scale in industry. So far, TestGen has landed 518 tests into production… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 12 pages, 8 figures, FSE 2024, Mon 15 - Fri 19 July 2024, Porto de Galinhas, Brazil

  3. arXiv:2402.04380  [pdf, other

    cs.SE

    Assured LLM-Based Software Engineering

    Authors: Nadia Alshahwan, Mark Harman, Inna Harper, Alexandru Marginean, Shubho Sengupta, Eddy Wang

    Abstract: In this paper we address the following question: How can we use Large Language Models (LLMs) to improve code independently of a human, while ensuring that the improved code - does not regress the properties of the original code? - improves the original in a verifiable and measurable way? To address this question, we advocate Assured LLM-Based Software Engineering; a generate-and-test approac… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 6 pages, 1 figure, InteNSE 24: ACM International Workshop on Interpretability, Robustness, and Benchmarking in Neural Software Engineering, April, 2024, Lisbon, Portugal

  4. arXiv:2310.06856  [pdf, other

    cs.CY cs.AI

    Brave new world: Artificial Intelligence in teaching and learning

    Authors: Adrian Groza, Anca Marginean

    Abstract: We exemplify how Large Language Models are used in both teaching and learning. We also discuss the AI incidents that have already occurred in the education domain, and we argue for the urgent need to introduce AI policies in universities and for the ongoing strategies to regulate AI. Regarding policy for AI, our view is that each institution should have a policy for AI in teaching and learning. Th… ▽ More

    Submitted 27 September, 2023; originally announced October 2023.

  5. arXiv:2306.08997   

    cs.CL cs.AI cs.LG

    Exploring the MIT Mathematics and EECS Curriculum Using Large Language Models

    Authors: Sarah J. Zhang, Samuel Florin, Ariel N. Lee, Eamon Niknafs, Andrei Marginean, Annie Wang, Keith Tyser, Zad Chin, Yann Hicke, Nikhil Singh, Madeleine Udell, Yoon Kim, Tonio Buonassisi, Armando Solar-Lezama, Iddo Drori

    Abstract: We curate a comprehensive dataset of 4,550 questions and solutions from problem sets, midterm exams, and final exams across all MIT Mathematics and Electrical Engineering and Computer Science (EECS) courses required for obtaining a degree. We evaluate the ability of large language models to fulfill the graduation requirements for any MIT major in Mathematics and EECS. Our results demonstrate that… ▽ More

    Submitted 24 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Did not receive permission to release the data or model fine-tuned on the data

  6. arXiv:2212.06717  [pdf, other

    astro-ph.SR astro-ph.IM cs.CV cs.LG

    A Machine Learning Enhanced Approach for Automated Sunquake Detection in Acoustic Emission Maps

    Authors: Vanessa Mercea, Alin Razvan Paraschiv, Daniela Adriana Lacatus, Anca Marginean, Diana Besliu-Ionescu

    Abstract: Sunquakes are seismic emissions visible on the solar surface, associated with some solar flares. Although discovered in 1998, they have only recently become a more commonly detected phenomenon. Despite the availability of several manual detection guidelines, to our knowledge, the astrophysical data produced for sunquakes is new to the field of Machine Learning. Detecting sunquakes is a daunting ta… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: Solar Physics accepted for publication, 44 total pages, 9 appendix pages, 21 figures, 6 tables

  7. arXiv:2206.11472  [pdf, other

    astro-ph.SR cs.LG

    Predicting the Geoeffectiveness of CMEs Using Machine Learning

    Authors: Andreea-Clara Pricopi, Alin Razvan Paraschiv, Diana Besliu-Ionescu, Anca-Nicoleta Marginean

    Abstract: Coronal mass ejections (CMEs) are the most geoeffective space weather phenomena, being associated with large geomagnetic storms, having the potential to cause disturbances to telecommunication, satellite network disruptions, power grid damages and failures. Thus, considering these storms' potential effects on human activities, accurate forecasts of the geoeffectiveness of CMEs are paramount. This… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

    Comments: 25 pages, 7 figures, and 10 tables, The Astrophysical Journal, In Press

  8. arXiv:1806.10235  [pdf, other

    cs.SE

    Indexing Operators to Extend the Reach of Symbolic Execution

    Authors: Earl T. Barr, David Clark, Mark Harman, Alexandru Marginean

    Abstract: Traditional program analysis analyses a program language, that is, all programs that can be written in the language. There is a difference, however, between all possible programs that can be written and the corpus of actual programs written in a language. We seek to exploit this difference: for a given program, we apply a bespoke program transformation Indexify to convert expressions that current… ▽ More

    Submitted 26 June, 2018; originally announced June 2018.