Skip to main content

Showing 1–19 of 19 results for author: Santos, C S

.
  1. arXiv:2404.10155  [pdf, other

    cs.SE cs.LG

    Quality Assessment of Prompts Used in Code Generation

    Authors: Mohammed Latif Siddiq, Simantika Dristi, Joy Saha, Joanna C. S. Santos

    Abstract: Large Language Models (LLMs) are gaining popularity among software engineers. A crucial aspect of develo** effective code-generation LLMs is to evaluate these models using a robust benchmark. Evaluation benchmarks with quality issues can provide a false sense of performance. In this work, we conduct the first-of-its-kind study of the quality of prompts within benchmarks used to compare the perfo… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Under review

  2. arXiv:2403.10646  [pdf

    cs.LG cs.CR

    A Survey of Source Code Representations for Machine Learning-Based Cybersecurity Tasks

    Authors: Beatrice Casey, Joanna C. S. Santos, George Perry

    Abstract: Machine learning techniques for cybersecurity-related software engineering tasks are becoming increasingly popular. The representation of source code is a key portion of the technique that can impact the way the model is able to learn the features of the source code. With an increasing number of these techniques being developed, it is valuable to see the current state of the field to better unders… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2401.01200  [pdf, other

    cs.CV cs.AI

    Skin cancer diagnosis using NIR spectroscopy data of skin lesions in vivo using machine learning algorithms

    Authors: Flavio P. Loss, Pedro H. da Cunha, Matheus B. Rocha, Madson Poltronieri Zanoni, Leandro M. de Lima, Isadora Tavares Nascimento, Isabella Rezende, Tania R. P. Canuto, Luciana de Paula Vieira, Renan Rossoni, Maria C. S. Santos, Patricia Lyra Frasson, Wanderson Romão, Paulo R. Filgueiras, Renato A. Krohling

    Abstract: Skin lesions are classified in benign or malignant. Among the malignant, melanoma is a very aggressive cancer and the major cause of deaths. So, early diagnosis of skin cancer is very desired. In the last few years, there is a growing interest in computer aided diagnostic (CAD) using most image and clinical data of the lesion. These sources of information present limitations due to their inability… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  4. arXiv:2312.12598  [pdf, other

    cs.SE cs.AI

    A Case Study on Test Case Construction with Large Language Models: Unveiling Practical Insights and Challenges

    Authors: Roberto Francisco de Lima Junior, Luiz Fernando Paes de Barros Presta, Lucca Santos Borborema, Vanderson Nogueira da Silva, Marcio Leal de Melo Dahia, Anderson Carlos Sousa e Santos

    Abstract: This paper presents a detailed case study examining the application of Large Language Models (LLMs) in the construction of test cases within the context of software engineering. LLMs, characterized by their advanced natural language processing capabilities, are increasingly garnering attention as tools to automate and enhance various aspects of the software development life cycle. Leveraging a cas… ▽ More

    Submitted 21 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

  5. arXiv:2311.00943  [pdf

    cs.SE

    Sound Call Graph Construction for Java Object Deserialization

    Authors: Joanna C. S. Santos, Mehdi Mirakhorli, Ali Shokri

    Abstract: Object serialization and deserialization is widely used for storing and preserving objects in files, memory, or database as well as for transporting them across machines, enabling remote interaction among processes and many more. This mechanism relies on reflection, a dynamic language that introduces serious challenges for static analyses. Current state-of-the-art call graph construction algorithm… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  6. arXiv:2311.00889  [pdf, other

    cs.SE cs.AI

    Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code

    Authors: Mohammed Latif Siddiq, Joanna C. S. Santos, Sajith Devareddy, Anna Muller

    Abstract: With the growing popularity of Large Language Models (LLMs) in software engineers' daily practices, it is important to ensure that the code generated by these tools is not only functionally correct but also free of vulnerabilities. Although LLMs can help developers to be more productive, prior empirical studies have shown that LLMs can generate insecure code. There are two contributing factors to… ▽ More

    Submitted 3 June, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Under review; 12 Pages

  7. arXiv:2307.08220  [pdf, other

    cs.SE cs.LG

    A Lightweight Framework for High-Quality Code Generation

    Authors: Mohammed Latif Siddiq, Beatrice Casey, Joanna C. S. Santos

    Abstract: In recent years, the use of automated source code generation utilizing transformer-based generative models has expanded, and these models can generate functional code according to the requirements of the developers. However, recent research revealed that these automatically generated source codes can contain vulnerabilities and other quality issues. Despite researchers' and practitioners' attempts… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: Under Review

  8. arXiv:2305.00418  [pdf, other

    cs.SE cs.LG

    Using Large Language Models to Generate JUnit Tests: An Empirical Study

    Authors: Mohammed Latif Siddiq, Joanna C. S. Santos, Ridwanul Hasan Tanvir, Noshin Ulfat, Fahmid Al Rifat, Vinicius Carvalho Lopes

    Abstract: A code generation model generates code by taking a prompt from a code comment, existing code, or a combination of both. Although code generation models (e.g., GitHub Copilot) are increasingly being adopted in practice, it is unclear whether they can successfully be used for unit test generation without fine-tuning for a strongly typed language like Java. To fill this gap, we investigated how well… ▽ More

    Submitted 8 March, 2024; v1 submitted 30 April, 2023; originally announced May 2023.

    Comments: Accepted in Research Track of The 28th International Conference on Evaluation and Assessment in Software Engineering (EASE 2024)

  9. A Biomedical Entity Extraction Pipeline for Oncology Health Records in Portuguese

    Authors: Hugo Sousa, Arian Pasquali, Alípio Jorge, Catarina Sousa Santos, Mário Amorim Lopes

    Abstract: Textual health records of cancer patients are usually protracted and highly unstructured, making it very time-consuming for health professionals to get a complete overview of the patient's therapeutic course. As such limitations can lead to suboptimal and/or inefficient treatment procedures, healthcare providers would greatly benefit from a system that effectively summarizes the information of tho… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  10. arXiv:2304.07840  [pdf, other

    cs.LG cs.SE

    Enhancing Automated Program Repair through Fine-tuning and Prompt Engineering

    Authors: Rishov Paul, Md. Mohib Hossain, Mohammed Latif Siddiq, Masum Hasan, Anindya Iqbal, Joanna C. S. Santos

    Abstract: Sequence-to-sequence models have been used to transform erroneous programs into correct ones when trained with a large enough dataset. Some recent studies also demonstrated strong empirical evidence that code review could improve the program repair further. Large language models, trained with Natural Language (NL) and Programming Language (PL), can contain inherent knowledge of both. In this study… ▽ More

    Submitted 21 July, 2023; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: 12 pages, 2 figures, 4 tables

  11. ArCode: Facilitating the Use of Application Frameworks to Implement Tactics and Patterns

    Authors: Ali Shokri, Joanna C. S. Santos, Mehdi Mirakhorli

    Abstract: Software designers and developers are increasingly relying on application frameworks as first-class design concepts. They instantiate the services that frameworks provide to implement various architectural tactics and patterns. One of the challenges in using frameworks for such tasks is the difficulty of learning and correctly using frameworks' APIs. This paper introduces a learning-based approach… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: This paper has been accepted in the main track of 2021 IEEE International Conference on Software Architecture (ICSA 2021) and is going to be published. Please feel free to cite it

  12. arXiv:2007.00478  [pdf, other

    eess.IV q-bio.QM

    PAD-UFES-20: a skin lesion dataset composed of patient data and clinical images collected from smartphones

    Authors: Andre G. C. Pacheco, Gustavo R. Lima, Amanda S. Salomão, Breno A. Krohling, Igor P. Biral, Gabriel G. de Angelo, Fábio C. R. Alves Jr, José G. M. Esgario, Alana C. Simora, Pedro B. C. Castro, Felipe B. Rodrigues, Patricia H. L. Frasson, Renato A. Krohling, Helder Knidel, Maria C. S. Santos, Rachel B. do Espírito Santo, Telma L. S. G. Macedo, Tania R. P. Canuto, Luíz F. S. de Barros

    Abstract: Over the past few years, different computer-aided diagnosis (CAD) systems have been proposed to tackle skin lesion analysis. Most of these systems work only for dermoscopy images since there is a strong lack of public clinical images archive available to design them. To fill this gap, we release a skin lesion benchmark composed of clinical images collected from smartphone devices and a set of pati… ▽ More

    Submitted 9 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  13. arXiv:1710.04132  [pdf

    cs.CY cs.PL

    Aprendendo Programacao Orientada a Objetos com uma Abordagem Ludica Baseada em Greenfoot e Robocode

    Authors: Cleison Simoes Santos, Allen Hichard Marques Santos, Suenny Mascarenhas Souza, Roberto Almeida Bittencourt

    Abstract: One the major challenges in undergraduate computing programs is the learning of object-oriented programming (OOP). This paradigm has a variety of concepts with an abstraction level usually high for most beginners, even the ones who already code in an imperative language. Furthermore, transitioning from imperative programming to OOP is a complex issue, with various inappropriate side effects. A sig… ▽ More

    Submitted 16 October, 2017; v1 submitted 7 October, 2017; originally announced October 2017.

    Comments: 10 pages, 3 figures, 2 tables, COBENGE 2015 - XLIII Congresso Brasileiro de Educação em Engenharia, in Portuguese

  14. A Large-Scale Study on the Usage of Testing Patterns that Address Maintainability Attributes (Patterns for Ease of Modification, Diagnoses, and Comprehension)

    Authors: Danielle Gonzalez, Joanna C. S. Santos, Andrew Popovich, Mehdi Mirakhorli, Mei Nagappan

    Abstract: Test case maintainability is an important concern, especially in open source and distributed development environments where projects typically have high contributor turnover with varying backgrounds and experience, and where code ownership changes often. Similar to design patterns, patterns for unit testing promote maintainability quality attributes such as ease of diagnoses, modifiability, and co… ▽ More

    Submitted 26 April, 2017; originally announced April 2017.

    Comments: Mining Software Repositories (MSR) 2017 Research Track

    Journal ref: 017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), Buenos Aires, 2017, pp. 391-401

  15. arXiv:1606.02212  [pdf, ps, other

    gr-qc astro-ph.CO

    Strong energy condition and the repulsive character of f(R) gravity

    Authors: Crislane S. Santos, Janilo Santos, Salvatore Capozziello, Jailson S. Alcaniz

    Abstract: The Raychaudhuri equation enables to examine the whole spacetime structure without specific solutions of Einstein's equations, playing a central role for the understanding of the gravitational interaction in Cosmology. In General Relativity, without considering a cosmological constant, a non-positive contribution in the Raychaudhuri equation is usually interpreted as the manifestation of the attra… ▽ More

    Submitted 6 March, 2017; v1 submitted 7 June, 2016; originally announced June 2016.

    Comments: 14 pages, 2 figures, text revised, minor modifications added, accepted for publication in General Relativity and Gravitation

    Journal ref: Gen. Relativ. Gravit. (2017) 49:50

  16. Real spectra for non-Hermitian Dirac equation in 1+1 dimensions with a most general coupling

    Authors: V. G. C. S. dos Santos, A. de Souza Dutra, M. B. Hott

    Abstract: The most general combination of couplings of fermions with external potentials in 1+1 dimensions, must include vector, scalar and pseudoscalar potentials. We consider such a mixing of potentials in a PT-symmetric time-independent Dirac equation. The Dirac equation is mapped into an effective PT-symmetric Schrödinger equation. Despite the non-hermiticity of the effective potential, we find real e… ▽ More

    Submitted 23 March, 2009; originally announced March 2009.

    Comments: 11 pages

  17. arXiv:quant-ph/0311044  [pdf, ps, other

    quant-ph

    Non-Hermitian time-dependent quantum systems with real energies

    Authors: A. de Souza Dutra, M. B. Hott, V. G. C. S dos Santos

    Abstract: In this work we intend to study a class of time-dependent quantum systems with non-Hermitian Hamiltonians, particularly those whose Hermitian counterpart are important for the comprehension of posed problems in quantum optics and quantum chemistry, which consists of an oscillator with time-dependent mass and frequency under the action of a time-dependent imaginary potential. The propagator for a… ▽ More

    Submitted 16 September, 2004; v1 submitted 7 November, 2003; originally announced November 2003.

    Comments: 8 pages - title and content changed

  18. arXiv:hep-th/0202046  [pdf, ps, other

    hep-th

    Method of approximation for potentials in impenetrable boxes: Harmonic Oscillator and Coulomb potentials

    Authors: A. de Souza Dutra, V. G. C. S. dos Santos, A. M. Stuchi

    Abstract: In this work we develop an approach to obtain analytical expressions for potentials in an impenetrable box. It is illustrated through the particular cases of the harmonic oscillator and the Coulomb potential. In this kind of system the energy expression respect the correct quantum limits, which is a very important quality. The similarity of this kind of problem with the quasi exactly solvable po… ▽ More

    Submitted 7 February, 2002; originally announced February 2002.

    Comments: Latex, 13 pages

  19. QED vacuum between an unusual pair of plates

    Authors: M. V. Cougo-Pinto, C. Farina, F. C. Santos. A. C. Santos

    Abstract: We consider the photon field between an unusual configuration of infinite parallel plates: a perfectly conducting plate $(ε\to\infty)$ and an infinitely permeable one $μ\to\infty)$. After quantizing the vector potential in the Coulomb gauge, we obtain explicit expressions for the vacuum expectation values of field operators of the form $<{\hat E}_i{\hat E}_j>_0$ and $<{\hat B}_i{\hat B}_j>_0$. T… ▽ More

    Submitted 6 November, 1998; originally announced November 1998.

    Comments: Latex, 16 pages, no figures

    Report number: IF/UFRJ/98/68

    Journal ref: J.Phys.A32:4463-4474,1999