Search | arXiv e-print repository

Call Graph Soundness in Android Static Analysis

Authors: Jordan Samhi, René Just, Tegawendé F. Bissyandé, Michael D. Ernst, Jacques Klein

Abstract: Static analysis is sound in theory, but an implementation may unsoundly fail to analyze all of a program's code. Any such omission is a serious threat to the validity of the tool's output. Our work is the first to measure the prevalence of these omissions. Previously, researchers and analysts did not know what is missed by static analysis, what sort of code is missed, or the reasons behind these o… ▽ More Static analysis is sound in theory, but an implementation may unsoundly fail to analyze all of a program's code. Any such omission is a serious threat to the validity of the tool's output. Our work is the first to measure the prevalence of these omissions. Previously, researchers and analysts did not know what is missed by static analysis, what sort of code is missed, or the reasons behind these omissions. To address this gap, we ran 13 static analysis tools and a dynamic analysis on 1000 Android apps. Any method in the dynamic analysis but not in a static analysis is an unsoundness. Our findings include the following. (1) Apps built around external frameworks challenge static analyzers. On average, the 13 static analysis tools failed to capture 61% of the dynamically-executed methods. (2) A high level of precision in call graph construction is a synonym for a high level of unsoundness; (3) No existing approach significantly improves static analysis soundness. This includes those specifically tailored for a given mechanism, such as DroidRA to address reflection. It also includes systematic approaches, such as EdgeMiner, capturing all callbacks in the Android framework systematically. (4) Modeling entry point methods challenges call graph construction which jeopardizes soundness. △ Less

Submitted 10 July, 2024; originally announced July 2024.

arXiv:2407.06573 [pdf, other]

LLM for Mobile: An Initial Roadmap

Authors: Daihang Chen, Yonghui Liu, Mingyi Zhou, Yanjie Zhao, Haoyu Wang, Shuai Wang, Xiao Chen, Tegawendé F. Bissyandé, Jacques Klein, Li Li

Abstract: When mobile meets LLMs, mobile app users deserve to have more intelligent usage experiences. For this to happen, we argue that there is a strong need to appl LLMs for the mobile ecosystem. We therefore provide a research roadmap for guiding our fellow researchers to achieve that as a whole. In this roadmap, we sum up six directions that we believe are urgently required for research to enable nativ… ▽ More When mobile meets LLMs, mobile app users deserve to have more intelligent usage experiences. For this to happen, we argue that there is a strong need to appl LLMs for the mobile ecosystem. We therefore provide a research roadmap for guiding our fellow researchers to achieve that as a whole. In this roadmap, we sum up six directions that we believe are urgently required for research to enable native intelligence in mobile devices. In each direction, we further summarize the current research progress and the gaps that still need to be filled by our fellow researchers. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.00225 [pdf, other]

Large-scale, Independent and Comprehensive study of the power of LLMs for test case generation

Authors: Wendkûuni C. Ouédraogo, Kader Kaboré, Haoye Tian, Yewei Song, Anil Koyuncu, Jacques Klein, David Lo, Tegawendé F. Bissyandé

Abstract: Unit testing, crucial for identifying bugs in code modules like classes and methods, is often neglected by developers due to time constraints. Automated test generation techniques have emerged to address this, but often lack readability and require developer intervention. Large Language Models (LLMs), like GPT and Mistral, show promise in software engineering, including in test generation. However… ▽ More Unit testing, crucial for identifying bugs in code modules like classes and methods, is often neglected by developers due to time constraints. Automated test generation techniques have emerged to address this, but often lack readability and require developer intervention. Large Language Models (LLMs), like GPT and Mistral, show promise in software engineering, including in test generation. However, their effectiveness remains unclear. This study conducts the first comprehensive investigation of LLMs, evaluating the effectiveness of four LLMs and five prompt engineering techniques, for unit test generation. We analyze 216\,300 tests generated by the selected advanced instruct-tuned LLMs for 690 Java classes collected from diverse datasets. We assess correctness, understandability, coverage, and bug detection capabilities of LLM-generated tests, comparing them to EvoSuite, a popular automated testing tool. While LLMs show potential, improvements in test correctness are necessary. This study reveals the strengths and limitations of LLMs compared to traditional methods, paving the way for further research on LLMs in software engineering. △ Less

Submitted 28 June, 2024; originally announced July 2024.

arXiv:2406.17959 [pdf, other]

Anomalous soft photons: status and perspectives

Authors: R. Bailhache, D. Bonocore, P. Braun-Munzinger, X. Feal, S. Floerchinger, J. Klein, K. Köhler, P. Lebiedowicz, C. M. Peter, R. Rapp, K. Reygers, W. Schäfer, H. S. Scheid, K. Schweda, J. Stachel, H. van Hees, C. A. van Veen, M. Völkl

Abstract: This report summarizes the work of the EMMI Rapid Reaction Task Force on "Real and Virtual Photon Production at Ultra-Low Transverse Momentum and Low Mass at the LHC". We provide an overview of the soft-photon puzzle, i.e., of the long-standing discrepancy between experimental data and predictions based on Low's soft-photon theorem, also referred to as "anomalous" soft photon production, and we re… ▽ More This report summarizes the work of the EMMI Rapid Reaction Task Force on "Real and Virtual Photon Production at Ultra-Low Transverse Momentum and Low Mass at the LHC". We provide an overview of the soft-photon puzzle, i.e., of the long-standing discrepancy between experimental data and predictions based on Low's soft-photon theorem, also referred to as "anomalous" soft photon production, and we review the current theoretical understanding of soft radiation and soft theorems. We also focus on low-mass dileptons as a tool for determining the electrical conductivity of the medium produced in high-energy nucleus-nucleus collisions. We discuss how both topics can be addressed with the planned ALICE 3 detector at the LHC. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 63 pages, 20 figures

Report number: EMMI-RRTF-ER20-01, TUM-HEP-1496-24

arXiv:2406.13972 [pdf, other]

CREF: An LLM-based Conversational Software Repair Framework for Programming Tutors

Authors: Boyang Yang, Haoye Tian, Weiguo Pian, Haoran Yu, Haitao Wang, Jacques Klein, Tegawendé F. Bissyandé, Shunfu **

Abstract: Program repair techniques offer cost-saving benefits for debugging within software development and programming education scenarios. With the proven effectiveness of Large Language Models (LLMs) in code-related tasks, researchers have explored their potential for program repair. However, it is crucial to recognize that existing repair benchmarks may have influenced LLM training data, potentially ca… ▽ More Program repair techniques offer cost-saving benefits for debugging within software development and programming education scenarios. With the proven effectiveness of Large Language Models (LLMs) in code-related tasks, researchers have explored their potential for program repair. However, it is crucial to recognize that existing repair benchmarks may have influenced LLM training data, potentially causing data leakage. To evaluate LLMs' realistic repair capabilities, (1) we introduce an extensive, non-crawled benchmark, referred to as TutorCode, comprising 1,239 C++ defect codes and associated information such as tutor guidance, solution description, failing test cases, and the corrected code. Our work assesses the repair performance of 12 LLMs on TutorCode, measuring repair correctness (TOP-5 and AVG-5) and patch precision (RPSR). (2) We then provide a comprehensive investigation into which types of extra information can help LLMs improve their performance in repairing defects. Among these types, tutor guidance was found to be the most effective information in enhancing LLM repair capabilities. To fully harness LLMs' conversational capabilities and the benefits of augmented information, (3) we introduce a novel conversational semi-automatic repair framework CREF assisting human tutor. It demonstrates a remarkable AVG-5 improvement of 17.2%-24.6% compared to the baseline, achieving an impressive AVG-5 of 76.6% when utilizing GPT-4. These results highlight the potential for enhancing LLMs' repair capabilities through interactions with tutors and historical conversations involving incorrect responses. The successful application of CREF in a real-world educational setting demonstrates its effectiveness in reducing tutors' workload and improving students' learning experience, while also showcasing its promise for facilitating other software engineering tasks, such as code review. △ Less

Submitted 8 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.07514 [pdf, other]

Scintillation Light in SBND: Simulation, Reconstruction, and Expected Performance of the Photon Detection System

Authors: SBND Collaboration, P. Abratenko, R. Acciarri, C. Adams, L. Aliaga-Soplin, O. Alterkait, R. Alvarez-Garrote, C. Andreopoulos, A. Antonakis, L. Arellano, J. Asaadi, W. Badgett, S. Balasubramanian, V. Basque, A. Beever, B. Behera, E. Belchior, M. Betancourt, A. Bhat, M. Bishai, A. Blake, B. Bogart, J. Bogenschuetz, D. Brailsford, A. Brandt , et al. (158 additional authors not shown)

Abstract: SBND is the near detector of the Short-Baseline Neutrino program at Fermilab. Its location near to the Booster Neutrino Beam source and relatively large mass will allow the study of neutrino interactions on argon with unprecedented statistics. This paper describes the expected performance of the SBND photon detection system, using a simulated sample of beam neutrinos and cosmogenic particles. Its… ▽ More SBND is the near detector of the Short-Baseline Neutrino program at Fermilab. Its location near to the Booster Neutrino Beam source and relatively large mass will allow the study of neutrino interactions on argon with unprecedented statistics. This paper describes the expected performance of the SBND photon detection system, using a simulated sample of beam neutrinos and cosmogenic particles. Its design is a dual readout concept combining a system of 120 photomultiplier tubes, used for triggering, with a system of 192 X-ARAPUCA devices, located behind the anode wire planes. Furthermore, covering the cathode plane with highly-reflective panels coated with a wavelength-shifting compound recovers part of the light emitted towards the cathode, where no optical detectors exist. We show how this new design provides a high light yield and a more uniform detection efficiency, an excellent timing resolution and an independent 3D-position reconstruction using only the scintillation light. Finally, the whole reconstruction chain is applied to recover the temporal structure of the beam spill, which is resolved with a resolution on the order of nanoseconds. △ Less

Submitted 11 June, 2024; originally announced June 2024.

Comments: 21 pages, 17 figures

Report number: FERMILAB-PUB-24-0303-PPD

arXiv:2405.19700 [pdf, other]

Initial measurement of reactor antineutrino oscillation at SNO+

Authors: SNO+ Collaboration, :, A. Allega, M. R. Anderson, S. Andringa, M. Askins, D. J. Auty, A. Bacon, J. Baker, F. Barão, N. Barros, R. Bayes, E. W. Beier, T. S. Bezerra, A. Bialek, S. D. Biller, E. Blucher, E. Caden, E. J. Callaghan, M. Chen, S. Cheng, B. Cleveland, D. Cookman, J. Corning, M. A. Cox , et al. (96 additional authors not shown)

Abstract: The SNO+ collaboration reports its first spectral analysis of long-baseline reactor antineutrino oscillation using 114 tonne-years of data. Fitting the neutrino oscillation probability to the observed energy spectrum yields constraints on the neutrino mass-squared difference $Δm^2_{21}$. In the ranges allowed by previous measurements, the best-fit $Δm^2_{21}$ is (8.85$^{+1.10}_{-1.33}$) $\times$ 1… ▽ More The SNO+ collaboration reports its first spectral analysis of long-baseline reactor antineutrino oscillation using 114 tonne-years of data. Fitting the neutrino oscillation probability to the observed energy spectrum yields constraints on the neutrino mass-squared difference $Δm^2_{21}$. In the ranges allowed by previous measurements, the best-fit $Δm^2_{21}$ is (8.85$^{+1.10}_{-1.33}$) $\times$ 10$^{-5}$ eV$^2$. This measurement is continuing in the next phases of SNO+ and is expected to surpass the present global precision on $Δm^2_{21}$ with about three years of data. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.14239 [pdf, other]

Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations

Authors: Mohammed Baharoon, Jonathan Klein, Dominik L. Michels

Abstract: Vision-language contrastive learning frameworks like CLIP enable learning representations from natural language supervision, and provide strong zero-shot classification capabilities. However, due to the nature of the supervisory signal in these paradigms, they lack the ability to learn localized features, leading to degraded performance on dense prediction tasks like segmentation and detection. On… ▽ More Vision-language contrastive learning frameworks like CLIP enable learning representations from natural language supervision, and provide strong zero-shot classification capabilities. However, due to the nature of the supervisory signal in these paradigms, they lack the ability to learn localized features, leading to degraded performance on dense prediction tasks like segmentation and detection. On the other hand, self-supervised learning methods have shown the ability to learn granular representations, complementing the high-level features in vision-language training. In this work, we present Harmony, a framework that combines vision-language training with discriminative and generative self-supervision to learn visual features that can be generalized across vision downstream tasks. Our framework is specifically designed to work on web-scraped data by not relying on negative examples and addressing the one-to-one correspondence issue using soft CLIP targets generated by an EMA model. We comprehensively evaluate Harmony across various vision downstream tasks and find that it significantly outperforms the baseline CLIP and the previously leading joint self and weakly-supervised methods, MaskCLIP and SLIP. Specifically, when comparing against these methods, Harmony shows superior performance in fine-tuning and zero-shot classification on ImageNet-1k, semantic segmentation on ADE20K, and both object detection and instance segmentation on MS-COCO, when pre-training a ViT-S/16 on CC3M. We also show that Harmony outperforms other self-supervised learning methods like iBOT and MAE across all tasks evaluated. On https://github.com/MohammedSB/Harmony our code is publicly available. △ Less

Submitted 23 May, 2024; originally announced May 2024.

Comments: 20 pages, 2 figures

MSC Class: 68T07; 68T45 ACM Class: I.2.10

arXiv:2405.05117 [pdf, other]

(In)Security of Mobile Apps in Develo** Countries: A Systematic Literature Review

Authors: Alioune Diallo, Jordan Samhi, Tegawendé Bissyandé, Jacques Klein

Abstract: In develo** countries, several key sectors, including education, finance, agriculture, and healthcare, mainly deliver their services via mobile app technology on handheld devices. As a result, mobile app security has emerged as a paramount issue in develo** countries. In this paper, we investigate the state of research on mobile app security, focusing on develo** countries. More specifically… ▽ More In develo** countries, several key sectors, including education, finance, agriculture, and healthcare, mainly deliver their services via mobile app technology on handheld devices. As a result, mobile app security has emerged as a paramount issue in develo** countries. In this paper, we investigate the state of research on mobile app security, focusing on develo** countries. More specifically, we performed a systematic literature review exploring the research directions taken by existing works, the different security concerns addressed, and the techniques used by researchers to highlight or address app security issues. Our main findings are: (1) the literature includes only a few studies on mobile app security in the context of develo** countries ; (2) among the different security concerns that researchers study, vulnerability detection appears to be the leading research topic; (3) FinTech apps are revealed as the main target in the relevant literature. Overall, our work highlights that there is largely room for develo** further specialized techniques addressing mobile app security in the context of develo** countries. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: 18 pages, 1 figure, submitted to a well-known journal

arXiv:2404.12636 [pdf, other]

Multi-Objective Fine-Tuning for Enhanced Program Repair with LLMs

Authors: Boyang Yang, Haoye Tian, Jiadong Ren, Hongyu Zhang, Jacques Klein, Tegawendé F. Bissyandé, Claire Le Goues, Shunfu **

Abstract: Large language models (LLMs) have demonstrated remarkable capabilities on a broad spectrum of downstream tasks. Within the realm of software engineering, specialized tasks on code, such as program repair, present unique challenges, necessitating fine-tuning to unlock state-of-the-art performance. Fine-tuning approaches proposed in the literature for LLMs on program repair tasks are however general… ▽ More Large language models (LLMs) have demonstrated remarkable capabilities on a broad spectrum of downstream tasks. Within the realm of software engineering, specialized tasks on code, such as program repair, present unique challenges, necessitating fine-tuning to unlock state-of-the-art performance. Fine-tuning approaches proposed in the literature for LLMs on program repair tasks are however generally overlooking the need to reason about the logic behind code changes, beyond syntactic patterns in the data. High-performing fine-tuning experiments also usually come at very high computational costs. With MORepair, we propose a novel perspective on the learning focus of LLM fine-tuning for program repair: we not only adapt the LLM parameters to the syntactic nuances of the task of code transformation (objective 1), but we also specifically fine-tune the LLM with respect to the logical reason behind the code change in the training data (objective 2). Such a multi-objective fine-tuning will instruct LLMs to generate high-quality patches. We apply MORepair to fine-tune four open-source LLMs with different sizes and architectures. Experimental results on C++ and Java repair benchmarks show that the implemented fine-tuning effectively boosts LLM repair performance by 7.6% to 10% in Top-10 repair suggestions. We further show that our fine-tuning strategy yields superior performance compared to the incumbent state-of-the-art in fine-tuned models for program repair, Fine-tune-CoT and RepairLLaMA. △ Less

Submitted 22 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

arXiv:2404.08817 [pdf, other]

Revisiting Code Similarity Evaluation with Abstract Syntax Tree Edit Distance

Authors: Yewei Song, Cedric Lothritz, Daniel Tang, Tegawendé F. Bissyandé, Jacques Klein

Abstract: This paper revisits recent code similarity evaluation metrics, particularly focusing on the application of Abstract Syntax Tree (AST) editing distance in diverse programming languages. In particular, we explore the usefulness of these metrics and compare them to traditional sequence similarity metrics. Our experiments showcase the effectiveness of AST editing distance in capturing intricate code s… ▽ More This paper revisits recent code similarity evaluation metrics, particularly focusing on the application of Abstract Syntax Tree (AST) editing distance in diverse programming languages. In particular, we explore the usefulness of these metrics and compare them to traditional sequence similarity metrics. Our experiments showcase the effectiveness of AST editing distance in capturing intricate code structures, revealing a high correlation with established metrics. Furthermore, we explore the strengths and weaknesses of AST editing distance and prompt-based GPT similarity scores in comparison to BLEU score, execution match, and Jaccard Similarity. We propose, optimize, and publish an adaptable metric that demonstrates effectiveness across all tested languages, representing an enhanced version of Tree Similarity of Edit Distance (TSED). △ Less

Submitted 3 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

Comments: ACL 2024 Main

arXiv:2404.06201 [pdf, other]

Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning

Authors: Zhihao Lin, Wei Ma, Tao Lin, Yaowen Zheng, **gquan Ge, Jun Wang, Jacques Klein, Tegawende Bissyande, Yang Liu, Li Li

Abstract: Large Language Models (LLMs) have become instrumental in advancing software engineering (SE) tasks, showcasing their efficacy in code understanding and beyond. Like traditional SE tools, open-source collaboration is key in realising the excellent products. However, with AI models, the essential need is in data. The collaboration of these AI-based SE models hinges on maximising the sources of high-… ▽ More Large Language Models (LLMs) have become instrumental in advancing software engineering (SE) tasks, showcasing their efficacy in code understanding and beyond. Like traditional SE tools, open-source collaboration is key in realising the excellent products. However, with AI models, the essential need is in data. The collaboration of these AI-based SE models hinges on maximising the sources of high-quality data. However, data especially of high quality, often holds commercial or sensitive value, making it less accessible for open-source AI-based SE projects. This reality presents a significant barrier to the development and enhancement of AI-based SE tools within the software engineering community. Therefore, researchers need to find solutions for enabling open-source AI-based SE models to tap into resources by different organisations. Addressing this challenge, our position paper investigates one solution to facilitate access to diverse organizational resources for open-source AI models, ensuring privacy and commercial sensitivities are respected. We introduce a governance framework centered on federated learning (FL), designed to foster the joint development and maintenance of open-source AI code models while safeguarding data privacy and security. Additionally, we present guidelines for developers on AI-based SE tool collaboration, covering data requirements, model architecture, updating strategies, and version control. Given the significant influence of data characteristics on FL, our research examines the effect of code data heterogeneity on FL performance. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.06074 [pdf]

doi 10.3390/lubricants12040120

The Effects of Splayed Lipid Molecules on Lubrication by Lipid Bilayers

Authors: Di **, Jacob Klein

Abstract: The outstanding lubrication of articular cartilage in the major synovial joints such as hips and knees, essential for the joint well-being, has been attributed to boundary layers of lipids at the outer cartilage surfaces, which have very low friction mediated by the hydration lubrication mechanism at their highly hydrated exposed headgroups. However, the role of spontaneously present lipid splays,… ▽ More The outstanding lubrication of articular cartilage in the major synovial joints such as hips and knees, essential for the joint well-being, has been attributed to boundary layers of lipids at the outer cartilage surfaces, which have very low friction mediated by the hydration lubrication mechanism at their highly hydrated exposed headgroups. However, the role of spontaneously present lipid splays, lipids with an acyl tail in each of the opposing bilayers, in modulating the frictional force between lipid bilayers has not, to date, been considered. In this study, we perform all-atom molecular dynamics simulations to quantitatively assess the significance of splayed molecules within the framework of lubricating lipid bilayers. We demonstrate that, although transient, splayed molecules significantly increase the inter-membrane friction until their retraction back into the lamellar phase, with this effect more steadily occurring at lower sliding velocities that are comparable to the physiological velocities of sliding articular cartilage. △ Less

Submitted 9 April, 2024; originally announced April 2024.

arXiv:2404.05234 [pdf]

Bioinspired polymer-incorporating self-lubricating and antifouling hydrogels

Authors: Weifeng Lin, Monika Kluzek, Nir kampf, Yifeng Cao, Jacob Klein

Abstract: Healthy articular cartilage has excellent lubricating properties, with friction coefficients reaching extremely low values at physiological pressures. Such high-performing lubricating layer in joints is attributed to the surface hydration arising from the interplay between multiple hydrophilic biopolymers (such as hyaluronic acid, proteoglycans, and lubricin) and phospholipids in the cartilage mat… ▽ More Healthy articular cartilage has excellent lubricating properties, with friction coefficients reaching extremely low values at physiological pressures. Such high-performing lubricating layer in joints is attributed to the surface hydration arising from the interplay between multiple hydrophilic biopolymers (such as hyaluronic acid, proteoglycans, and lubricin) and phospholipids in the cartilage matrix. Mimicking such molecular structure, hydrogels, composed of a hydrophilic polymer network, have the potential to replicate the lubricating feature and possibly replace natural cartilages. In this study, we have synthesized a poly(2-methacryloyloxyethyl phosphorylcholine-co-N-isopropylacrylamide) (PMPC-co-PNIPAM, PMN)random copolymer with highly-hydrated lubricious 2-methacryloyloxyethyl phosphorylcholine moieties and less hydrated N-isopropylacrylamide moieties. Incorporation of PMN copolymers within various hydrogels significantly reduces the gels sliding surface friction, resulting in low friction coefficients against different counter surfaces, including stainless steel (hard metal surface), polyethylene (hydrophobic surface), and polyHEMA (soft hydrogel surface). Additionally, hydrogels containing PMN are shown to be biocompatible and have excellent antifouling properties, making them an ideal coating for commercially available stents. With these qualities, hydrogels containing PMN stand out as a promising new material with numerous possible applications. △ Less

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2404.05035 [pdf]

Pressure-dependent adhesion between solid-supported PC-lipid bilayers and vesicles under electric fields

Authors: Yu Zhang, Di **, Jacob Klein

Abstract: Fusion of lipid bilayers in membranes is important in processes from vesicle-cell interactions (as in drug delivery) to exosome-cell signaling, while transient transmembrane electric fields are known to occur spontaneously. Two contacting phosphatidylcholine (PC) lipid membranes are known to fuse into one under external electric fields, suggesting that the interaction between them is modified by t… ▽ More Fusion of lipid bilayers in membranes is important in processes from vesicle-cell interactions (as in drug delivery) to exosome-cell signaling, while transient transmembrane electric fields are known to occur spontaneously. Two contacting phosphatidylcholine (PC) lipid membranes are known to fuse into one under external electric fields, suggesting that the interaction between them is modified by the field as they approach, prior to the fusion event. Here we measure directly the adhesion energy between dimyristoylphosphatidylcholine (DMPC) and between distearoylphosphatidylcholine (DSPC) surface layers attached to solid substrates both without and with a transmembrane electric field. We find a marked pressure-dependent adhesion behavior in the electric field, which we attribute to fusion intermediates that are formed, shedding new light on membrane electro-fusion. △ Less

Submitted 7 April, 2024; originally announced April 2024.

arXiv:2403.20174 [pdf, other]

Quadratic optical response of CrSBr controlled by spin-selective interlayer coupling

Authors: Marie-Christin Heißenbüttel, Pierre-Maurice Piel, Julian Klein, Thorsten Deilmann, Ursula Wurstbauer, Michael Rohlfing

Abstract: The optical properties of the layered magnet CrSBr are dominated by intralayer excitons: the antiferromagnetic order between the layers makes layer-to-layer charge hop**, and therefore interlayer excitons, spin-forbidden. An external magnetic field, however, continuously drives the magnetic order towards layer-to-layer ferromagnetic, which opens spin-allowed charge-transfer channels between the… ▽ More The optical properties of the layered magnet CrSBr are dominated by intralayer excitons: the antiferromagnetic order between the layers makes layer-to-layer charge hop**, and therefore interlayer excitons, spin-forbidden. An external magnetic field, however, continuously drives the magnetic order towards layer-to-layer ferromagnetic, which opens spin-allowed charge-transfer channels between the layers. Here we elaborate how their admixture changes the composition and nature of the excitons, leading to an extension over many layers, and causes a quadratic red-shift with respect to the external magnetic field. We address these effects by ab-initio $GW$-BSE calculations as a function of magnetic field and cast the data into a minimal four-band model to elucidate the interplay between the various interaction and coupling mechanisms. Our findings should be generally valid for antiferromagnetic layered magnets with and without external magnetic fields, and moreover for any couple of layers with different spin directions. Our insights help to systematically address excitons and predict their optical signatures in such systems. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: 9 pages, 3 figures

arXiv:2403.06346 [pdf, ps, other]

On the rational invariants of quantum systems of $n$-qubits

Authors: Luca Candelori, Vladimir Y. Chernyak, John R. Klein

Abstract: For an $n$-qubit system, a rational function on the space of mixed states which is invariant with respect to the action of the group of local symmetries may be viewed as a detailed measure of entanglement. We show that the field of all such invariant rational functions is purely transcendental over the complex numbers and has transcendence degree $4^n - 2n-1$. An explicit transcendence basis is al… ▽ More For an $n$-qubit system, a rational function on the space of mixed states which is invariant with respect to the action of the group of local symmetries may be viewed as a detailed measure of entanglement. We show that the field of all such invariant rational functions is purely transcendental over the complex numbers and has transcendence degree $4^n - 2n-1$. An explicit transcendence basis is also exhibited. △ Less

Submitted 12 March, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

Comments: greatly simplified the treatment of quotients

MSC Class: 81P40; 81P42; 13A50; 14L24

arXiv:2403.03212 [pdf, other]

Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmi… ▽ More The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmic ray events collected in the spring of 2021. We use this sample to demonstrate the imaging performance of the charge and light readout systems as well as the signal correlations between the two. We also report argon purity and detector uniformity measurements, and provide comparisons to detector simulations. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 47 pages, 41 figures

Report number: FERMILAB-PUB-24-0073-LBNF

arXiv:2402.17181 [pdf, ps, other]

On the variety of X-states

Authors: Luca Candelori, Vladimir Y. Chernyak, John R. Klein

Abstract: We introduce the notion of an X-state on $n$-qubits. After taking the Zariski closure of the set of X-states in the space of all mixed states, we obtain a complex algebraic variety $\scr X$ that is equipped with the action of the Lie group of local symmetries $G$. We show that the field of $G$-invariant rational functions on $\scr X$ is purely transcendental over the complex numbers of degree… ▽ More We introduce the notion of an X-state on $n$-qubits. After taking the Zariski closure of the set of X-states in the space of all mixed states, we obtain a complex algebraic variety $\scr X$ that is equipped with the action of the Lie group of local symmetries $G$. We show that the field of $G$-invariant rational functions on $\scr X$ is purely transcendental over the complex numbers of degree $2^{2n-1}-n-1$. △ Less

Submitted 26 February, 2024; originally announced February 2024.

MSC Class: Primary: 81P40; 81P42; 13A50; Secondary: 14L24

arXiv:2402.03782 [pdf, other]

Soft Prompt Tuning for Cross-Lingual Transfer: When Less is More

Authors: Fred Philippy, Siwen Guo, Shohreh Haddadan, Cedric Lothritz, Jacques Klein, Tegawendé F. Bissyandé

Abstract: Soft Prompt Tuning (SPT) is a parameter-efficient method for adapting pre-trained language models (PLMs) to specific tasks by inserting learnable embeddings, or soft prompts, at the input layer of the PLM, without modifying its parameters. This paper investigates the potential of SPT for cross-lingual transfer. Unlike previous studies on SPT for cross-lingual transfer that often fine-tune both the… ▽ More Soft Prompt Tuning (SPT) is a parameter-efficient method for adapting pre-trained language models (PLMs) to specific tasks by inserting learnable embeddings, or soft prompts, at the input layer of the PLM, without modifying its parameters. This paper investigates the potential of SPT for cross-lingual transfer. Unlike previous studies on SPT for cross-lingual transfer that often fine-tune both the soft prompt and the model parameters, we adhere to the original intent of SPT by kee** the model parameters frozen and only training the soft prompt. This does not only reduce the computational cost and storage overhead of full-model fine-tuning, but we also demonstrate that this very parameter efficiency intrinsic to SPT can enhance cross-lingual transfer performance to linguistically distant languages. Moreover, we explore how different factors related to the prompt, such as the length or its reparameterization, affect cross-lingual transfer performance. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: Accepted at the 1st Workshop on Modular and Open Multilingual NLP (co-located with EACL 2024)

arXiv:2402.03287 [pdf, other]

A Lennard-Jones Layer for Distribution Normalization

Authors: Mulun Na, Jonathan Klein, Biao Zhang, Wojtek Pałubicki, Sören Pirk, Dominik L. Michels

Abstract: We introduce the Lennard-Jones layer (LJL) for the equalization of the density of 2D and 3D point clouds through systematically rearranging points without destroying their overall structure (distribution normalization). LJL simulates a dissipative process of repulsive and weakly attractive interactions between individual points by considering the nearest neighbor of each point at a given moment in… ▽ More We introduce the Lennard-Jones layer (LJL) for the equalization of the density of 2D and 3D point clouds through systematically rearranging points without destroying their overall structure (distribution normalization). LJL simulates a dissipative process of repulsive and weakly attractive interactions between individual points by considering the nearest neighbor of each point at a given moment in time. This pushes the particles into a potential valley, reaching a well-defined stable configuration that approximates an equidistant sampling after the stabilization process. We apply LJLs to redistribute randomly generated point clouds into a randomized uniform distribution. Moreover, LJLs are embedded in the generation process of point cloud networks by adding them at later stages of the inference process. The improvements in 3D point cloud generation utilizing LJLs are evaluated qualitatively and quantitatively. Finally, we apply LJLs to improve the point distribution of a score-based 3D point cloud denoising network. In general, we demonstrate that LJLs are effective for distribution normalization which can be applied at negligible cost without retraining the given neural network. △ Less

Submitted 5 February, 2024; originally announced February 2024.

Comments: Upon request, we are happy to share the source code to generate the results presented in this paper. Please contact the first or the last author of this manuscript

MSC Class: 68T07 ACM Class: I.2; I.3.5

arXiv:2402.02570 [pdf, other]

Gazebo Plants: Simulating Plant-Robot Interaction with Cosserat Rods

Authors: Junchen Deng, Samhita Marri, Jonathan Klein, Wojtek Pałubicki, Sören Pirk, Girish Chowdhary, Dominik L. Michels

Abstract: Robotic harvesting has the potential to positively impact agricultural productivity, reduce costs, improve food quality, enhance sustainability, and to address labor shortage. In the rapidly advancing field of agricultural robotics, the necessity of training robots in a virtual environment has become essential. Generating training data to automatize the underlying computer vision tasks such as ima… ▽ More Robotic harvesting has the potential to positively impact agricultural productivity, reduce costs, improve food quality, enhance sustainability, and to address labor shortage. In the rapidly advancing field of agricultural robotics, the necessity of training robots in a virtual environment has become essential. Generating training data to automatize the underlying computer vision tasks such as image segmentation, object detection and classification, also heavily relies on such virtual environments as synthetic data is often required to overcome the shortage and lack of variety of real data sets. However, physics engines commonly employed within the robotics community, such as ODE, Simbody, Bullet, and DART, primarily support motion and collision interaction of rigid bodies. This inherent limitation hinders experimentation and progress in handling non-rigid objects such as plants and crops. In this contribution, we present a plugin for the Gazebo simulation platform based on Cosserat rods to model plant motion. It enables the simulation of plants and their interaction with the environment. We demonstrate that, using our plugin, users can conduct harvesting simulations in Gazebo by simulating a robotic arm picking fruits and achieve results comparable to real-world experiments. △ Less

Submitted 4 February, 2024; originally announced February 2024.

Comments: Upon request, we are happy to share our GazeboPlants plugin open-source (MPL 2.0)

ACM Class: I.6.3; I.6.m

arXiv:2402.02172 [pdf, other]

CodeAgent: Collaborative Agents for Software Engineering

Authors: Daniel Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawende F. Bissyande

Abstract: Code review, which aims at ensuring the overall quality and reliability of software, is a cornerstone of software development. Unfortunately, while crucial, Code review is a labor-intensive process that the research community is looking to automate. Existing automated methods rely on single input-output generative models and thus generally struggle to emulate the collaborative nature of code revie… ▽ More Code review, which aims at ensuring the overall quality and reliability of software, is a cornerstone of software development. Unfortunately, while crucial, Code review is a labor-intensive process that the research community is looking to automate. Existing automated methods rely on single input-output generative models and thus generally struggle to emulate the collaborative nature of code review. This work introduces CodeAgent, a novel multi-agent Large Language Model (LLM) system for code review automation. CodeAgent incorporates a supervisory agent, QA-Checker, to ensure that all the agents' contributions address the initial review question. We evaluated CodeAgent on critical code review tasks: (1) detect inconsistencies between code changes and commit messages, (2) identify vulnerability introductions, (3) validate code style adherence, and (4) suggest code revisions. The results demonstrate CodeAgent's effectiveness, contributing to a new state-of-the-art in code review automation. Our data and code are publicly available (\url{https://github.com/Code4Agent/codeagent}). △ Less

Submitted 28 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

arXiv:2402.01568 [pdf, other]

Do** Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar Es-sghir, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1300 additional authors not shown)

Abstract: Do** of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first do** test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUN… ▽ More Do** of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first do** test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUNE-SP) at CERN, featuring 770 t of total liquid argon mass with 410 t of fiducial mass. The goal of the run was to measure the light and charge response of the detector to the addition of xenon, up to a concentration of 18.8 ppm. The main purpose was to test the possibility for reduction of non-uniformities in light collection, caused by deployment of photon detectors only within the anode planes. Light collection was analysed as a function of the xenon concentration, by using the pre-existing photon detection system (PDS) of ProtoDUNE-SP and an additional smaller set-up installed specifically for this run. In this paper we first summarize our current understanding of the argon-xenon energy transfer process and the impact of the presence of nitrogen in argon with and without xenon dopant. We then describe the key elements of ProtoDUNE-SP and the injection method deployed. Two dedicated photon detectors were able to collect the light produced by xenon and the total light. The ratio of these components was measured to be about 0.65 as 18.8 ppm of xenon were injected. We performed studies of the collection efficiency as a function of the distance between tracks and light detectors, demonstrating enhanced uniformity of response for the anode-mounted PDS. We also show that xenon do** can substantially recover light losses due to contamination of the liquid argon by nitrogen. △ Less

Submitted 9 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: 35 pages, 20 figures

Report number: CERN-EP-2024-024; FERMILAB-PUB-23-0819-LBNF

arXiv:2401.15013 [pdf]

doi 10.1021/acsami.2c11781

Reactive additive capillary stam** with double network hydrogel-derived aerogel stamps under solvothermal conditions

Authors: Fatih Alarslan, Martin Frosinn, Kevin Ruwisch, Jannis Thien, Tim Jähnichen, Louisa Eckert, Jonas Klein, Markus Haase, Dirk Enke, Joachim Wollschläger, Uwe Beginn, Martin Steinhart

Abstract: Integration of solvothermal reaction products into complex thin-layer architectures is frequently achieved by combinations of layer transfer and subtractive lithography, whereas direct additive substrate patterning with solvothermal reaction products has remained challenging. We report reactive additive capillary stam** under solvothermal conditions as a parallel contact-lithographic access to p… ▽ More Integration of solvothermal reaction products into complex thin-layer architectures is frequently achieved by combinations of layer transfer and subtractive lithography, whereas direct additive substrate patterning with solvothermal reaction products has remained challenging. We report reactive additive capillary stam** under solvothermal conditions as a parallel contact-lithographic access to patterns of solvothermal reaction products in thin-layer configurations. To this end, corresponding precursor inks are infiltrated into mechanically robust mesoporous aerogel stamps derived from double-network hydrogels (DNHGs). The stamp is then brought into contact with a substrate to be patterned under solvothermal reaction conditions inside an autoclave. The precursor ink forms liquid bridges between the topographic surface pattern of the stamp and the substrate. Evaporation-driven enrichment of the precursors in these liquid bridges along with their liquid-bridge-guided conversion into the solvothermal reaction products yields large-area submicron patterns of the solvothermal reaction products replicating the stamp topography. As example, we prepared thin hybrid films, which contained ordered monolayers of superparamagnetic submicron nickel ferrite dots prepared by solvothermal capillary stam** surrounded by nickel electrodeposited in a second, orthogonal substrate functionalization step. The submicron nickel ferrite dots acted as magnetic hardener halving the remanence of the ferromagnetic nickel layer. In this way, thin-layer electromechanical systems, transformers and positioning systems may be customized. △ Less

Submitted 26 January, 2024; originally announced January 2024.

Journal ref: ACS Appl. Mater. Interfaces 2022, 14, 39, 44992-45004

arXiv:2401.14974 [pdf]

doi 10.1021/acsapm.3c01036

Solvent-Free High-Temperature Capillary Stam** of Stimuli-Responsive Polymers: Wettability Management by Orthogonal Substrate Functionalization

Authors: Fatih Alarslan, Hanna Hübner, Jonas Klein, Karsten Küpper, Joachim Wollschläger, Markus Haase, Markus Gallei, Martin Steinhart

Abstract: The wettability of surfaces determines their antifouling, antifogging, anti-icing, and self-cleaning properties as well as their usability for sensing, oil-water separation, water collection, and water purification. Solvent-free high-temperature capillary stam** of stimuli-responsive polymers yielding arrays of stimuli-responsive polymer microdots on differently modified substrates enables the f… ▽ More The wettability of surfaces determines their antifouling, antifogging, anti-icing, and self-cleaning properties as well as their usability for sensing, oil-water separation, water collection, and water purification. Solvent-free high-temperature capillary stam** of stimuli-responsive polymers yielding arrays of stimuli-responsive polymer microdots on differently modified substrates enables the flexible generation of switchable surfaces with different water contact angles (WCAs). Potential problems associated with the deposition of polymer solutions, such as the handling of volatile organic solvents, phase separation induced by solvent evaporation, and capillarity-driven flow processes, are circumvented. We used composite stamps with topographically patterned contact surfaces consisting of metallic nickel cores and porous MnO2 coatings taking up the stimuli-responsive polymers. The short transport paths from the MnO2 contact layers to the counterpart substrates enabled the stam** of polymer melts containing components impeding flow, such as carbon nanotubes (CNTs). Thus-obtained arrays of polymer-CNT hybrid microdots prevent problems associated with continuous coatings including delamination and crack propagation. Moreover, the range within which the properties of the stamped stimuli-responsive polymer microdots are switchable can be tuned by orthogonal substrate modification. As an example, we stamped hybrid microdots consisting of poly(2-(methacryloyloxy)ethyl ferrocenecarboxylate) (PFcMA) and CNTs onto indium tin oxide (ITO) substrates. Coating the ITO substrates with a poly(ethylene oxide)-terminated silane shifted the WCAs obtained by switching the PFcMA between its oxidized and reduced states by nearly 50°. △ Less

Submitted 26 January, 2024; originally announced January 2024.

Journal ref: ACS Appl. Polym. Mater. 2023, 5, 7, 5738-5746

arXiv:2401.09420 [pdf, other]

LionHeart: A Layer-based Map** Framework for Heterogeneous Systems with Analog In-Memory Computing Tiles

Authors: Corey Lammie, Flavio Ponzina, Yuxuan Wang, Joshua Klein, Marina Zapater, Irem Boybat, Abu Sebastian, Giovanni Ansaloni, David Atienza

Abstract: When arranged in a crossbar configuration, resistive memory devices can be used to execute MVM, the most dominant operation of many ML algorithms, in constant time complexity. Nonetheless, when performing computations in the analog domain, novel challenges are introduced in terms of arithmetic precision and stochasticity, due to non-ideal circuit and device behaviour. Moreover, these non-idealitie… ▽ More When arranged in a crossbar configuration, resistive memory devices can be used to execute MVM, the most dominant operation of many ML algorithms, in constant time complexity. Nonetheless, when performing computations in the analog domain, novel challenges are introduced in terms of arithmetic precision and stochasticity, due to non-ideal circuit and device behaviour. Moreover, these non-idealities have a temporal dimension, resulting in a degrading application accuracy over time. Facing these challenges, we propose a novel framework, named LionHeart, to obtain hybrid analog-digital map**s to execute DL inference workloads using heterogeneous accelerators. The accuracy-constrained map**s derived by LionHeart showcase, across different DNNs and datasets, high accuracy and potential for speedup. The results of the full system simulations highlight run-time reductions and energy efficiency gains that exceed 6X, with a user-defined accuracy threshold with respect to a fully digital floating point implementation. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2312.14898 [pdf, other]

Enriching Automatic Test Case Generation by Extracting Relevant Test Inputs from Bug Reports

Authors: Wendkûuni C. Ouédraogo, Laura Plein, Kader Kaboré, Andrew Habib, Jacques Klein, David Lo, Tegawendé F. Bissyandé

Abstract: The quality of a software is highly dependent on the quality of the tests it is submitted to. Writing tests for bug detection is thus essential. However, it is time-consuming when done manually. Automating test cases generation has therefore been an exciting research area in the software engineering community. Most approaches have been focused on generating unit tests. Unfortunately, current effor… ▽ More The quality of a software is highly dependent on the quality of the tests it is submitted to. Writing tests for bug detection is thus essential. However, it is time-consuming when done manually. Automating test cases generation has therefore been an exciting research area in the software engineering community. Most approaches have been focused on generating unit tests. Unfortunately, current efforts often do not lead to the generation of relevant inputs, which limits the efficiency of automatically generated tests. Towards improving the relevance of test inputs, we present \name, a technique for exploring bug reports to identify input values that can be fed to automatic test generation tools. In this work, we investigate the performance of using inputs extracted from bug reports with \name to generate test cases with Evosuite. The evaluation is performed on the Defects4J benchmark. For Defects4J projects, our study has shown that \name successfully extracted 68.68\% of relevant inputs when using regular expression in its approach versus 50.21\% relevant inputs without regular expression. Further, our study has shown the potential to improve the Line and Instruction Coverage across all projects. Overall, we successfully collected relevant inputs that led to the detection of 45 bugs that were previously undetected by the baseline. △ Less

Submitted 22 December, 2023; originally announced December 2023.

arXiv:2312.14725 [pdf, other]

Enhancing Text-to-SQL Translation for Financial System Design

Authors: Yewei Song, Saad Ezzini, Xunzhu Tang, Cedric Lothritz, Jacques Klein, Tegawendé Bissyandé, Andrey Boytsov, Ulrick Ble, Anne Goujon

Abstract: Text-to-SQL, the task of translating natural language questions into SQL queries, is part of various business processes. Its automation, which is an emerging challenge, will empower software practitioners to seamlessly interact with relational databases using natural language, thereby bridging the gap between business needs and software capabilities. In this paper, we consider Large Language Model… ▽ More Text-to-SQL, the task of translating natural language questions into SQL queries, is part of various business processes. Its automation, which is an emerging challenge, will empower software practitioners to seamlessly interact with relational databases using natural language, thereby bridging the gap between business needs and software capabilities. In this paper, we consider Large Language Models (LLMs), which have achieved state of the art for various NLP tasks. Specifically, we benchmark Text-to-SQL performance, the evaluation methodologies, as well as input optimization (e.g., prompting). In light of the empirical observations that we have made, we propose two novel metrics that were designed to adequately measure the similarity between SQL queries. Overall, we share with the community various findings, notably on how to select the right LLM on Text-to-SQL tasks. We further demonstrate that a tree-based edit distance constitutes a reliable metric for assessing the similarity between generated SQL queries and the oracle for benchmarking Text2SQL approaches. This metric is important as it relieves researchers from the need to perform computationally expensive experiments such as executing generated queries as done in prior works. Our work implements financial domain use cases and, therefore contributes to the advancement of Text2SQL systems and their practical adoption in this domain. △ Less

Submitted 8 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

Comments: 10 pages, ICSE-SEIP 2024

arXiv:2312.14053 [pdf, other]

Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation

Authors: Rasha Alshawi, Md Tamjidul Hoque, Md Meftahul Ferdaus, Mahdi Abdelguerfi, Kendall Niles, Ken Prathak, Joe Tom, Jordan Klein, Murtada Mousa, Johny Javier Lopez

Abstract: The proposed architecture, Dual Attentive U-Net with Feature Infusion (DAU-FI Net), addresses challenges in semantic segmentation, particularly on multiclass imbalanced datasets with limited samples. DAU-FI Net integrates multiscale spatial-channel attention mechanisms and feature injection to enhance precision in object localization. The core employs a multiscale depth-separable convolution block… ▽ More The proposed architecture, Dual Attentive U-Net with Feature Infusion (DAU-FI Net), addresses challenges in semantic segmentation, particularly on multiclass imbalanced datasets with limited samples. DAU-FI Net integrates multiscale spatial-channel attention mechanisms and feature injection to enhance precision in object localization. The core employs a multiscale depth-separable convolution block, capturing localized patterns across scales. This block is complemented by a spatial-channel squeeze and excitation (scSE) attention unit, modeling inter-dependencies between channels and spatial regions in feature maps. Additionally, additive attention gates refine segmentation by connecting encoder-decoder pathways. To augment the model, engineered features using Gabor filters for textural analysis, Sobel and Canny filters for edge detection are injected guided by semantic masks to expand the feature space strategically. Comprehensive experiments on a challenging sewer pipe and culvert defect dataset and a benchmark dataset validate DAU-FI Net's capabilities. Ablation studies highlight incremental benefits from attention blocks and feature injection. DAU-FI Net achieves state-of-the-art mean Intersection over Union (IoU) of 95.6% and 98.8% on the defect test set and benchmark respectively, surpassing prior methods by 8.9% and 12.6%, respectively. Ablation studies highlight incremental benefits from attention blocks and feature injection. The proposed architecture provides a robust solution, advancing semantic segmentation for multiclass problems with limited training data. Our sewer-culvert defects dataset, featuring pixel-level annotations, opens avenues for further research in this crucial domain. Overall, this work delivers key innovations in architecture, attention, and feature engineering to elevate semantic segmentation efficacy. △ Less

Submitted 21 December, 2023; originally announced December 2023.

Comments: under review in IEEE Transactions on Artificial Intelligence

arXiv:2312.05811 [pdf]

Liposomic lubricants suppress shear-stress induced inflammatory gene regulation in the joint in vivo

Authors: Linyi Zhu, Weifeng Lin, Monika Kluzek, Jadwiga Miotla-Zarebska, Vicky Batchelor, Matthew Gardiner, Chris Chan, Peter Culmer, Anastasios Chanalaris, Ronit Goldberg, Jacob Klein, Tonia L. Vincent

Abstract: Osteoarthritis (OA) is a widespread, debilitating joint disease associated with articular cartilage degradation. It is driven via mechano-inflammatory catabolic pathways, presumed up-regulated due to increased shear stress on the cartilage-embedded chondrocytes, that lead to tissue degeneration. Here we demonstrate that the up-regulation of the matrix metalloproteinase 3 (Mmp3) and interleukin-1be… ▽ More Osteoarthritis (OA) is a widespread, debilitating joint disease associated with articular cartilage degradation. It is driven via mechano-inflammatory catabolic pathways, presumed up-regulated due to increased shear stress on the cartilage-embedded chondrocytes, that lead to tissue degeneration. Here we demonstrate that the up-regulation of the matrix metalloproteinase 3 (Mmp3) and interleukin-1beta (Il1b) genes upon surgical joint destabilization in a model of murine OA is completely suppressed when lipid-based lubricants are injected into the joints. At the same time, Timp1, a compression but not shear-stress sensitive gene, is unaffected by lubricant. Our results provide direct evidence that biolubrication couples to catabolic gene regulation in OA, shed strong light on the nature of the chondrocytes' response to shear stress, and have clear implications for novel OA treatments. △ Less

Submitted 12 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

arXiv:2312.03130 [pdf, other]

The DUNE Far Detector Vertical Drift Technology, Technical Design Report

Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1304 additional authors not shown)

Abstract: DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precisi… ▽ More DUNE is an international experiment dedicated to addressing some of the questions at the forefront of particle physics and astrophysics, including the mystifying preponderance of matter over antimatter in the early universe. The dual-site experiment will employ an intense neutrino beam focused on a near and a far detector as it aims to determine the neutrino mass hierarchy and to make high-precision measurements of the PMNS matrix parameters, including the CP-violating phase. It will also stand ready to observe supernova neutrino bursts, and seeks to observe nucleon decay as a signature of a grand unified theory underlying the standard model. The DUNE far detector implements liquid argon time-projection chamber (LArTPC) technology, and combines the many tens-of-kiloton fiducial mass necessary for rare event searches with the sub-centimeter spatial resolution required to image those events with high precision. The addition of a photon detection system enhances physics capabilities for all DUNE physics drivers and opens prospects for further physics explorations. Given its size, the far detector will be implemented as a set of modules, with LArTPC designs that differ from one another as newer technologies arise. In the vertical drift LArTPC design, a horizontal cathode bisects the detector, creating two stacked drift volumes in which ionization charges drift towards anodes at either the top or bottom. The anodes are composed of perforated PCB layers with conductive strips, enabling reconstruction in 3D. Light-trap-style photon detection modules are placed both on the cryostat's side walls and on the central cathode where they are optically powered. This Technical Design Report describes in detail the technical implementations of each subsystem of this LArTPC that, together with the other far detector modules and the near detector, will enable DUNE to achieve its physics goals. △ Less

Submitted 5 December, 2023; originally announced December 2023.

Comments: 425 pages; 281 figures Central editing team: A. Heavey, S. Kettell, A. Marchionni, S. Palestini, S. Rajogopalan, R. J. Wilson

Report number: Fermilab Report no: TM-2813-LBNF

arXiv:2312.01241 [pdf, other]

Just-in-Time Security Patch Detection -- LLM At the Rescue for Data Augmentation

Authors: Xunzhu Tang, Zhenghan Chen, Kisub Kim, Haoye Tian, Saad Ezzini, Jacques Klein

Abstract: In the face of growing vulnerabilities found in open-source software, the need to identify {discreet} security patches has become paramount. The lack of consistency in how software providers handle maintenance often leads to the release of security patches without comprehensive advisories, leaving users vulnerable to unaddressed security risks. To address this pressing issue, we introduce a novel… ▽ More In the face of growing vulnerabilities found in open-source software, the need to identify {discreet} security patches has become paramount. The lack of consistency in how software providers handle maintenance often leads to the release of security patches without comprehensive advisories, leaving users vulnerable to unaddressed security risks. To address this pressing issue, we introduce a novel security patch detection system, LLMDA, which capitalizes on Large Language Models (LLMs) and code-text alignment methodologies for patch review, data enhancement, and feature combination. Within LLMDA, we initially utilize LLMs for examining patches and expanding data of PatchDB and SPI-DB, two security patch datasets from recent literature. We then use labeled instructions to direct our LLMDA, differentiating patches based on security relevance. Following this, we apply a PTFormer to merge patches with code, formulating hybrid attributes that encompass both the innate details and the interconnections between the patches and the code. This distinctive combination method allows our system to capture more insights from the combined context of patches and code, hence improving detection precision. Finally, we devise a probabilistic batch contrastive learning mechanism within batches to augment the capability of the our LLMDA in discerning security patches. The results reveal that LLMDA significantly surpasses the start of the art techniques in detecting security patches, underscoring its promise in fortifying software maintenance. △ Less

Submitted 12 December, 2023; v1 submitted 2 December, 2023; originally announced December 2023.

arXiv:2311.02681 [pdf]

Recent progress in cartilage lubrication

Authors: Weifeng Lin, Jacob Klein

Abstract: Healthy articular cartilage, covering the ends of bones in major joints such as hips and knees, presents the most efficiently-lubricated surface known in nature, with friction coefficients as low as 0.001 up to physiologically high pressures. Such low friction is indeed essential for its well being. It minimizes wear-and-tear and hence the cartilage degradation associated with osteoarthritis, the… ▽ More Healthy articular cartilage, covering the ends of bones in major joints such as hips and knees, presents the most efficiently-lubricated surface known in nature, with friction coefficients as low as 0.001 up to physiologically high pressures. Such low friction is indeed essential for its well being. It minimizes wear-and-tear and hence the cartilage degradation associated with osteoarthritis, the most common joint disease, and, by reducing shear stress on the mechanotransductive, cartilage-embedded chondrocytes (the only cell type in the cartilage), it regulates their function to maintain homeostasis. Understanding the origins of such low friction of the articular cartilage, therefore, is of major importance in order to alleviate disease symptoms, and slow or even reverse its breakdown. This progress report considers the relation between frictional behavior and the cellular mechanical environment in the cartilage, then reviews the mechanism of lubrication in the joints, in particular focusing on boundary lubrication. Following recent advances based on hydration lubrication, a proposed synergy between different molecular components of the synovial joints, acting together in enabling the low friction, has been proposed. Additionally, recent development of natural and of bio-inspired lubricants is reviewed. △ Less

Submitted 5 November, 2023; originally announced November 2023.

arXiv:2311.01311 [pdf, other]

Software Engineering for OpenHarmony: A Research Roadmap

Authors: Li Li, Xiang Gao, Hailong Sun, Chunming Hu, Xiaoyu Sun, Haoyu Wang, Haipeng Cai, Ting Su, Xiapu Luo, Tegawendé F. Bissyandé, Jacques Klein, John Grundy, Tao Xie, Haibo Chen, Huaimin Wang

Abstract: Mobile software engineering has been a hot research topic for decades. Our fellow researchers have proposed various approaches (with over 7,000 publications for Android alone) in this field that essentially contributed to the great success of the current mobile ecosystem. Existing research efforts mainly focus on popular mobile platforms, namely Android and iOS. OpenHarmony, a newly open-sourced m… ▽ More Mobile software engineering has been a hot research topic for decades. Our fellow researchers have proposed various approaches (with over 7,000 publications for Android alone) in this field that essentially contributed to the great success of the current mobile ecosystem. Existing research efforts mainly focus on popular mobile platforms, namely Android and iOS. OpenHarmony, a newly open-sourced mobile platform, has rarely been considered, although it is the one requiring the most attention as OpenHarmony is expected to occupy one-third of the market in China (if not in the world). To fill the gap, we present to the mobile software engineering community a research roadmap for encouraging our fellow researchers to contribute promising approaches to OpenHarmony. Specifically, we start by presenting a literature review of mobile software engineering, attempting to understand what problems have been targeted by the mobile community and how they have been resolved. We then summarize the existing (limited) achievements of OpenHarmony and subsequently highlight the research gap between Android/iOS and OpenHarmony. This research gap eventually helps in forming the roadmap for conducting software engineering research for OpenHarmony. △ Less

Submitted 21 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

arXiv:2310.12753

Patch-CLIP: A Patch-Text Pre-Trained Model

Authors: Xunzhu Tang, Zhenghan Chen, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawende F. Bissyande

Abstract: In recent years, patch representation learning has emerged as a necessary research direction for exploiting the capabilities of machine learning in software generation. These representations have driven significant performance enhancements across a variety of tasks involving code changes. While the progress is undeniable, a common limitation among existing models is their specialization: they pred… ▽ More In recent years, patch representation learning has emerged as a necessary research direction for exploiting the capabilities of machine learning in software generation. These representations have driven significant performance enhancements across a variety of tasks involving code changes. While the progress is undeniable, a common limitation among existing models is their specialization: they predominantly excel in either predictive tasks, such as security patch classification, or in generative tasks such as patch description generation. This dichotomy is further exacerbated by a prevalent dependency on potentially noisy data sources. Specifically, many models utilize patches integrated with Abstract Syntax Trees (AST) that, unfortunately, may contain parsing inaccuracies, thus acting as a suboptimal source of supervision. In response to these challenges, we introduce PATCH-CLIP, a novel pre-training framework for patches and natural language text. PATCH-CLIP deploys a triple-loss training strategy for 1) patch-description contrastive learning, which enables to separate patches and descriptions in the embedding space, 2) patch-description matching, which ensures that each patch is associated to its description in the embedding space, and 3) patch-description generation, which ensures that the patch embedding is effective for generation. These losses are implemented for joint learning to achieve good performance in both predictive and generative tasks involving patches. Empirical evaluations focusing on patch description generation, demonstrate that PATCH-CLIP sets new state of the art performance, consistently outperforming the state-of-the-art in metrics like BLEU, ROUGE-L, METEOR, and Recall. △ Less

Submitted 30 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

Comments: The paper is incomplete, causing much confusion for the community

arXiv:2310.07290 [pdf, other]

Revisiting Android App Categorization

Authors: Marco Alecci, Jordan Samhi, Tegawendé F. Bissyandé, Jacques Klein

Abstract: Numerous tools rely on automatic categorization of Android apps as part of their methodology. However, incorrect categorization can lead to inaccurate outcomes, such as a malware detector wrongly flagging a benign app as malicious. One such example is the SlideIT Free Keyboard app, which has over 500000 downloads on Google Play. Despite being a "Keyboard" app, it is often wrongly categorized along… ▽ More Numerous tools rely on automatic categorization of Android apps as part of their methodology. However, incorrect categorization can lead to inaccurate outcomes, such as a malware detector wrongly flagging a benign app as malicious. One such example is the SlideIT Free Keyboard app, which has over 500000 downloads on Google Play. Despite being a "Keyboard" app, it is often wrongly categorized alongside "Language" apps due to the app's description focusing heavily on language support, resulting in incorrect analysis outcomes, including mislabeling it as a potential malware when it is actually a benign app. Hence, there is a need to improve the categorization of Android apps to benefit all the tools relying on it. In this paper, we present a comprehensive evaluation of existing Android app categorization approaches using our new ground-truth dataset. Our evaluation demonstrates the notable superiority of approaches that utilize app descriptions over those solely relying on data extracted from the APK file, while also leaving space for potential improvement in the former category. Thus, we propose two innovative approaches that effectively outperform the performance of existing methods in both description-based and APK-based methodologies. Finally, by employing our novel description-based approach, we have successfully demonstrated that adopting a higher-performing categorization method can significantly benefit tools reliant on app categorization, leading to an improvement in their overall performance. This highlights the significance of develo** advanced and efficient app categorization methodologies for improved results in software engineering tasks. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Comments: Accepted at ICSE2024

arXiv:2310.06644 [pdf, other]

Zero-Level-Set Encoder for Neural Distance Fields

Authors: Stefan Rhys Jeske, Jonathan Klein, Dominik L. Michels, Jan Bender

Abstract: Neural shape representation generally refers to representing 3D geometry using neural networks, e.g., to compute a signed distance or occupancy value at a specific spatial position. In this paper, we present a novel encoder-decoder neural network for embedding 3D shapes in a single forward pass. Our architecture is based on a multi-scale hybrid system incorporating graph-based and voxel-based comp… ▽ More Neural shape representation generally refers to representing 3D geometry using neural networks, e.g., to compute a signed distance or occupancy value at a specific spatial position. In this paper, we present a novel encoder-decoder neural network for embedding 3D shapes in a single forward pass. Our architecture is based on a multi-scale hybrid system incorporating graph-based and voxel-based components, as well as a continuously differentiable decoder. Furthermore, the network is trained to solve the Eikonal equation and only requires knowledge of the zero-level set for training and inference. This means that in contrast to most previous work, our network is able to output valid signed distance fields without explicit prior knowledge of non-zero distance values or shape occupancy. We further propose a modification of the loss function in case that surface normals are not well defined, e.g., in the context of non-watertight surfaces and non-manifold geometry. Overall, this can help reduce the computational overhead of training and evaluating neural distance fields, as well as enabling the application to difficult shapes. We finally demonstrate the efficacy, generalizability and scalability of our method on datasets consisting of deforming shapes, both based on simulated data and raw 3D scans. We further show single-class and multi-class encoding, on both fixed and variable vertex-count inputs, showcasing a wide range of possible applications. △ Less

Submitted 5 February, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

arXiv:2310.06320 [pdf, other]

Automatic Generation of Test Cases based on Bug Reports: a Feasibility Study with Large Language Models

Authors: Laura Plein, Wendkûuni C. Ouédraogo, Jacques Klein, Tegawendé F. Bissyandé

Abstract: Software testing is a core discipline in software engineering where a large array of research results has been produced, notably in the area of automatic test generation. Because existing approaches produce test cases that either can be qualified as simple (e.g. unit tests) or that require precise specifications, most testing procedures still rely on test cases written by humans to form test suite… ▽ More Software testing is a core discipline in software engineering where a large array of research results has been produced, notably in the area of automatic test generation. Because existing approaches produce test cases that either can be qualified as simple (e.g. unit tests) or that require precise specifications, most testing procedures still rely on test cases written by humans to form test suites. Such test suites, however, are incomplete: they only cover parts of the project or they are produced after the bug is fixed. Yet, several research challenges, such as automatic program repair, and practitioner processes, build on the assumption that available test suites are sufficient. There is thus a need to break existing barriers in automatic test case generation. While prior work largely focused on random unit testing inputs, we propose to consider generating test cases that realistically represent complex user execution scenarios, which reveal buggy behaviour. Such scenarios are informally described in bug reports, which should therefore be considered as natural inputs for specifying bug-triggering test cases. In this work, we investigate the feasibility of performing this generation by leveraging large language models (LLMs) and using bug reports as inputs. Our experiments include the use of ChatGPT, as an online service, as well as CodeGPT, a code-related pre-trained LLM that was fine-tuned for our task. Overall, we experimentally show that bug reports associated to up to 50% of Defects4J bugs can prompt ChatGPT to generate an executable test case. We show that even new bug reports can indeed be used as input for generating executable test cases. Finally, we report experimental results which confirm that LLM-generated test cases are immediately useful in software engineering tasks such as fault localization as well as patch validation in automated program repair. △ Less

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2309.06341 [pdf, other]

doi 10.1103/PhysRevD.109.072002

Event-by-Event Direction Reconstruction of Solar Neutrinos in a High Light-Yield Liquid Scintillator

Authors: A. Allega, M. R. Anderson, S. Andringa, J. Antunes, M. Askins, D. J. Auty, A. Bacon, J. Baker, N. Barros, F. Barão, R. Bayes, E. W. Beier, T. S. Bezerra, A. Bialek, S. D. Biller, E. Blucher, E. Caden, E. J. Callaghan, M. Chen, S. Cheng, B. Cleveland, D. Cookman, J. Corning, M. A. Cox, R. Dehghani , et al. (94 additional authors not shown)

Abstract: The direction of individual $^8$B solar neutrinos has been reconstructed using the SNO+ liquid scintillator detector. Prompt, directional Cherenkov light was separated from the slower, isotropic scintillation light using time information, and a maximum likelihood method was used to reconstruct the direction of individual scattered electrons. A clear directional signal was observed, correlated with… ▽ More The direction of individual $^8$B solar neutrinos has been reconstructed using the SNO+ liquid scintillator detector. Prompt, directional Cherenkov light was separated from the slower, isotropic scintillation light using time information, and a maximum likelihood method was used to reconstruct the direction of individual scattered electrons. A clear directional signal was observed, correlated with the solar angle. The observation was aided by a period of low primary fluor concentration that resulted in a slower scintillator decay time. This is the first time that event-by-event direction reconstruction in high light-yield liquid scintillator has been demonstrated in a large-scale detector. △ Less

Submitted 10 April, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

Comments: 6 pages, 6 figures. Accepted manuscript by PRD

arXiv:2309.04778 [pdf]

doi 10.1063/5.0175185

Intrinsic magnetic properties of the layered antiferromagnet CrSBr

Authors: Fangchao Long, Kseniia Mosina, René Hübner, Zdenek Sofer, Julian Klein, Slawomir Prucnal, Manfred Helm, Florian Dirnberger, Shengqiang Zhou

Abstract: Van der Waals magnetic materials are an ideal platform to study low-dimensional magnetism. Opposed to other members of this family, the magnetic semiconductor CrSBr is highly resistant to degradation in air, which, besides its exceptional optical, electronic, and magnetic properties, is the reason the compound is receiving considerable attention at the moment. For many years, its magnetic phase di… ▽ More Van der Waals magnetic materials are an ideal platform to study low-dimensional magnetism. Opposed to other members of this family, the magnetic semiconductor CrSBr is highly resistant to degradation in air, which, besides its exceptional optical, electronic, and magnetic properties, is the reason the compound is receiving considerable attention at the moment. For many years, its magnetic phase diagram seemed to be well-understood. Recently, however, several groups observed a magnetic transition in magnetometry measurements at temperatures of around 40 K that is not expected from theoretical considerations, causing a debate about the intrinsic magnetic properties of the material. In this letter, we report the absence of this particular transition in magnetization measurements conducted on high-quality CrSBr crystals, attesting to the extrinsic nature of the low-temperature magnetic phase observed in other works. Our magnetometry results obtained from large bulk crystals are in very good agreement with the magnetic phase diagram of CrSBr previously predicted by the mean-field theory; A-type antiferromagnetic order is the only phase observed below the Néel temperature at TN = 131 K. Moreover, numerical fits based on the Curie-Weiss law confirm that strong ferromagnetic correlations are present within individual layers even at temperatures much larger than TN. △ Less

Submitted 9 September, 2023; originally announced September 2023.

Comments: 13 pages, submitted to Appl. Phys. Lett

Journal ref: Appl. Phys. Lett. 123, 222401 (2023)

arXiv:2309.00935 [pdf]

Effect of cholesterol on the mechanical stability of gel-phase phospholipid bilayers studied by AFM force spectroscopy

Authors: Salome Mielke, Raya Sorkin, Jacob Klein

Abstract: The low sliding friction of articular cartilage in the major joints , which is crucial for its homeostasis and for joint health, has been attributed to lipid bilayers forming lubricious boundary layers at its surface. The robustness of such layers, and thus their lubrication efficiency at joint pressures, depends on the lipids forming them, including cholesterol, which may act to strengthen of wea… ▽ More The low sliding friction of articular cartilage in the major joints , which is crucial for its homeostasis and for joint health, has been attributed to lipid bilayers forming lubricious boundary layers at its surface. The robustness of such layers, and thus their lubrication efficiency at joint pressures, depends on the lipids forming them, including cholesterol, which may act to strengthen of weaken the bilayer. A systematic study using an Atomic Force Microscope (AFM) was carried out to understand the effect of cholesterol on the nanomechanical stability of two saturated phospholipids, DSPC (1,2-distearoyl-sn-glycero-3-phosphatidlycholine) and DPPC (1,2-dipalmitoyl-sn-glycero-phosphatidylcholine), that differ in acyl chain lengths. Measurements were carried out both in water and in phosphate buffer solution (PBS). The nanomechanical stability of the lipid bilayers was quantitatively evaluated by measuring the breakthrough force needed to puncture the bilayer by the AFM tip. The molar fractions of cholesterol incorporated in the bilayers were 10% and 40%. We found that for both DSPC and DPPC, cholesterol significantly decreases the mechanical stability of the bilayers in solid ordered (SO) phase. In accordance with the literature, the strengthening effect of salt on the lipid bilayers was also observed. For DPPC with 10 mol % cholesterol, the effect of tip properties and the experimental procedure parameters on the breakthrough forces were also studied. Tip radius (2 - 42 nm), material (Si, Si3N4, Au) and loading rate (40 - 1000 nm/s) were varied systematically. The values of the breakthrough forces measured were not significantly affected by any of these parameters, showing that the weakening effect of cholesterol does not result from such changes in experimental conditions. This study helps to shed light on the mechanism of physiological lubrication. △ Less

Submitted 2 September, 2023; originally announced September 2023.

arXiv:2308.16586 [pdf, other]

Learning to Represent Patches

Authors: Xunzhu Tang, Haoye Tian, Zhenghan Chen, Weiguo Pian, Saad Ezzini, Abdoul Kader Kabore, Andrew Habib, Jacques Klein, Tegawende F. Bissyande

Abstract: Patch representation is crucial in automating various software engineering tasks, like determining patch accuracy or summarizing code changes. While recent research has employed deep learning for patch representation, focusing on token sequences or Abstract Syntax Trees (ASTs), they often miss the change's semantic intent and the context of modified lines. To bridge this gap, we introduce a novel… ▽ More Patch representation is crucial in automating various software engineering tasks, like determining patch accuracy or summarizing code changes. While recent research has employed deep learning for patch representation, focusing on token sequences or Abstract Syntax Trees (ASTs), they often miss the change's semantic intent and the context of modified lines. To bridge this gap, we introduce a novel method, Patcherizer. It delves into the intentions of context and structure, merging the surrounding code context with two innovative representations. These capture the intention in code changes and the intention in AST structural modifications pre and post-patch. This holistic representation aptly captures a patch's underlying intentions. Patcherizer employs graph convolutional neural networks for structural intention graph representation and transformers for intention sequence representation. We evaluated Patcherizer's embeddings' versatility in three areas: (1) Patch description generation, (2) Patch accuracy prediction, and (3) Patch intention identification. Our experiments demonstrate the representation's efficacy across all tasks, outperforming state-of-the-art methods. For example, in patch description generation, Patcherizer excels, showing an average boost of 19.39% in BLEU, 8.71% in ROUGE-L, and 34.03% in METEOR scores. △ Less

Submitted 3 October, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

arXiv:2308.15234 [pdf, other]

Hyperbolic Code Retrieval: A Novel Approach for Efficient Code Search Using Hyperbolic Space Embeddings

Authors: Xunzhu Tang, zhenghan Chen, Saad Ezzini, Haoye Tian, Yewei Song, Jacques Klein, Tegawende F. Bissyande

Abstract: Within the realm of advanced code retrieval, existing methods have primarily relied on intricate matching and attention-based mechanisms. However, these methods often lead to computational and memory inefficiencies, posing a significant challenge to their real-world applicability. To tackle this challenge, we propose a novel approach, the Hyperbolic Code QA Matching (HyCoQA). This approach leverag… ▽ More Within the realm of advanced code retrieval, existing methods have primarily relied on intricate matching and attention-based mechanisms. However, these methods often lead to computational and memory inefficiencies, posing a significant challenge to their real-world applicability. To tackle this challenge, we propose a novel approach, the Hyperbolic Code QA Matching (HyCoQA). This approach leverages the unique properties of Hyperbolic space to express connections between code fragments and their corresponding queries, thereby obviating the necessity for intricate interaction layers. The process commences with a reimagining of the code retrieval challenge, framed within a question-answering (QA) matching framework, constructing a dataset with triple matches characterized as \texttt{<}negative code, description, positive code\texttt{>}. These matches are subsequently processed via a static BERT embedding layer, yielding initial embeddings. Thereafter, a hyperbolic embedder transforms these representations into hyperbolic space, calculating distances between the codes and descriptions. The process concludes by implementing a scoring layer on these distances and leveraging hinge loss for model training. Especially, the design of HyCoQA inherently facilitates self-organization, allowing for the automatic detection of embedded hierarchical patterns during the learning phase. Experimentally, HyCoQA showcases remarkable effectiveness in our evaluations: an average performance improvement of 3.5\% to 4\% compared to state-of-the-art code retrieval techniques. △ Less

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.15233 [pdf, other]

Multilevel Semantic Embedding of Software Patches: A Fine-to-Coarse Grained Approach Towards Security Patch Detection

Authors: Xunzhu Tang, zhenghan Chen, Saad Ezzini, Haoye Tian, Yewei Song, Jacques Klein, Tegawende F. Bissyande

Abstract: The growth of open-source software has increased the risk of hidden vulnerabilities that can affect downstream software applications. This concern is further exacerbated by software vendors' practice of silently releasing security patches without explicit warnings or common vulnerability and exposure (CVE) notifications. This lack of transparency leaves users unaware of potential security threats,… ▽ More The growth of open-source software has increased the risk of hidden vulnerabilities that can affect downstream software applications. This concern is further exacerbated by software vendors' practice of silently releasing security patches without explicit warnings or common vulnerability and exposure (CVE) notifications. This lack of transparency leaves users unaware of potential security threats, giving attackers an opportunity to take advantage of these vulnerabilities. In the complex landscape of software patches, gras** the nuanced semantics of a patch is vital for ensuring secure software maintenance. To address this challenge, we introduce a multilevel Semantic Embedder for security patch detection, termed MultiSEM. This model harnesses word-centric vectors at a fine-grained level, emphasizing the significance of individual words, while the coarse-grained layer adopts entire code lines for vector representation, capturing the essence and interrelation of added or removed lines. We further enrich this representation by assimilating patch descriptions to obtain a holistic semantic portrait. This combination of multi-layered embeddings offers a robust representation, balancing word complexity, understanding code-line insights, and patch descriptions. Evaluating MultiSEM for detecting patch security, our results demonstrate its superiority, outperforming state-of-the-art models with promising margins: a 22.46\% improvement on PatchDB and a 9.21\% on SPI-DB in terms of the F1 metric. △ Less

Submitted 29 August, 2023; originally announced August 2023.

arXiv:2308.12701 [pdf, other]

How are We Detecting Inconsistent Method Names? An Empirical Study from Code Review Perspective

Authors: Kisub Kim, Xin Zhou, Dongsun Kim, Julia Lawall, Kui Liu, Tegawendé F. Bissyandé, Jacques Klein, Jaekwon Lee, David Lo

Abstract: Proper naming of methods can make program code easier to understand, and thus enhance software maintainability. Yet, developers may use inconsistent names due to poor communication or a lack of familiarity with conventions within the software development lifecycle. To address this issue, much research effort has been invested into building automatic tools that can check for method name inconsisten… ▽ More Proper naming of methods can make program code easier to understand, and thus enhance software maintainability. Yet, developers may use inconsistent names due to poor communication or a lack of familiarity with conventions within the software development lifecycle. To address this issue, much research effort has been invested into building automatic tools that can check for method name inconsistency and recommend consistent names. However, existing datasets generally do not provide precise details about why a method name was deemed improper and required to be changed. Such information can give useful hints on how to improve the recommendation of adequate method names. Accordingly, we construct a sample method-naming benchmark, ReName4J, by matching name changes with code reviews. We then present an empirical study on how state-of-the-art techniques perform in detecting or recommending consistent and inconsistent method names based on ReName4J. The main purpose of the study is to reveal a different perspective based on reviewed names rather than proposing a complete benchmark. We find that the existing techniques underperform on our review-driven benchmark, both in inconsistent checking and the recommendation. We further identify potential biases in the evaluation of existing techniques, which future research should consider thoroughly. △ Less

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.01413 [pdf, other]

LaFiCMIL: Rethinking Large File Classification from the Perspective of Correlated Multiple Instance Learning

Authors: Tiezhu Sun, Weiguo Pian, Nadia Daoudi, Kevin Allix, Tegawendé F. Bissyandé, Jacques Klein

Abstract: Transfomer-based models have significantly advanced natural language processing, in particular the performance in text classification tasks. Nevertheless, these models face challenges in processing large files, primarily due to their input constraints, which are generally restricted to hundreds or thousands of tokens. Attempts to address this issue in existing models usually consist in extracting… ▽ More Transfomer-based models have significantly advanced natural language processing, in particular the performance in text classification tasks. Nevertheless, these models face challenges in processing large files, primarily due to their input constraints, which are generally restricted to hundreds or thousands of tokens. Attempts to address this issue in existing models usually consist in extracting only a fraction of the essential information from lengthy inputs, while often incurring high computational costs due to their complex architectures. In this work, we address the challenge of classifying large files from the perspective of correlated multiple instance learning. We introduce LaFiCMIL, a method specifically designed for large file classification. LaFiCMIL is optimized for efficient operation on a single GPU, making it a versatile solution for binary, multi-class, and multi-label classification tasks. We conducted extensive experiments using seven diverse and comprehensive benchmark datasets to assess LaFiCMIL's effectiveness. By integrating BERT for feature extraction, LaFiCMIL demonstrates exceptional performance, setting new benchmarks across all datasets. A notable achievement of our approach is its ability to scale BERT to handle nearly 20,000 tokens while operating on a single GPU with 32GB of memory. This efficiency, coupled with its state-of-the-art performance, highlights LaFiCMIL's potential as a groundbreaking approach in the field of large file classification. △ Less

Submitted 23 May, 2024; v1 submitted 30 July, 2023; originally announced August 2023.

Comments: Accepted at NLDB 2024

arXiv:2307.12675 [pdf, other]

doi 10.1103/PhysRevB.108.195410

Charge transfer-induced Lifshitz transition and magnetic symmetry breaking in ultrathin CrSBr crystals

Authors: Marco Bianchi, Kimberly Hsieh, Esben Juel Porat, Florian Dirnberger, Julian Klein, Kseniia Mosina, Zdenek Sofer, Alexander N. Rudenko, Mikhail I. Katsnelson, Yong P. Chen, Malte Rösner, Philip Hofmann

Abstract: Ultrathin CrSBr flakes are exfoliated \emph{in situ} on Au(111) and Ag(111) and their electronic structure is studied by angle-resolved photoemission spectroscopy. The thin flakes' electronic properties are drastically different from those of the bulk material and also substrate-dependent. For both substrates, a strong charge transfer to the flakes is observed, partly populating the conduction ban… ▽ More Ultrathin CrSBr flakes are exfoliated \emph{in situ} on Au(111) and Ag(111) and their electronic structure is studied by angle-resolved photoemission spectroscopy. The thin flakes' electronic properties are drastically different from those of the bulk material and also substrate-dependent. For both substrates, a strong charge transfer to the flakes is observed, partly populating the conduction band and giving rise to a highly anisotropic Fermi contour with an Ohmic contact to the substrate. The fundamental CrSBr band gap is strongly renormalized compared to the bulk. The charge transfer to the CrSBr flake is substantially larger for Ag(111) than for Au(111), but a rigid energy shift of the chemical potential is insufficient to describe the observed band structure modifications. In particular, the Fermi contour shows a Lifshitz transition, the fundamental band gap undergoes a transition from direct on Au(111) to indirect on Ag(111) and a do**-induced symmetry breaking between the intra-layer Cr magnetic moments further modifies the band structure. Electronic structure calculations can account for non-rigid Lifshitz-type band structure changes in thin CrSBr as a function of do** and strain. In contrast to undoped bulk band structure calculations that require self-consistent $GW$ theory, the doped thin film properties are well-approximated by density functional theory if local Coulomb interactions are taken into account on the mean-field level and the charge transfer is considered. △ Less

Submitted 24 July, 2023; originally announced July 2023.

arXiv:2307.12609 [pdf, other]

doi 10.1145/3643991.3644866

AndroLibZoo: A Reliable Dataset of Libraries Based on Software Dependency Analysis

Authors: Jordan Samhi, Tegawendé F. Bissyandé, Jacques Klein

Abstract: Android app developers extensively employ code reuse, integrating many third-party libraries into their apps. While such integration is practical for developers, it can be challenging for static analyzers to achieve scalability and precision when libraries account for a large part of the code. As a direct consequence, it is common practice in the literature to consider developer code only during s… ▽ More Android app developers extensively employ code reuse, integrating many third-party libraries into their apps. While such integration is practical for developers, it can be challenging for static analyzers to achieve scalability and precision when libraries account for a large part of the code. As a direct consequence, it is common practice in the literature to consider developer code only during static analysis --with the assumption that the sought issues are in developer code rather than the libraries. However, analysts need to distinguish between library and developer code. Currently, many static analyses rely on white lists of libraries. However, these white lists are unreliable, inaccurate, and largely non-comprehensive. In this paper, we propose a new approach to address the lack of comprehensive and automated solutions for the production of accurate and ``always up to date" sets of libraries. First, we demonstrate the continued need for a white list of libraries. Second, we propose an automated approach to produce an accurate and up-to-date set of third-party libraries in the form of a dataset called AndroLibZoo. Our dataset, which we make available to the community, contains to date 34 813 libraries and is meant to evolve. △ Less

Submitted 8 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

arXiv:2305.19178 [pdf]

Cell-inspired, massive electromodulation of interfacial energy dissipation

Authors: Yu Zhang, Di **, Ran Tivony, Nir Kampf, Jacob Klein

Abstract: Transient electric fields across cell bilayer membranes can lead to electroporation, as well as to cell fusion, and have been extensively studied. We find that transmembrane electric fields similar to those in cells can lead to a massive, reversible modulation--by up to 200-fold--of the interfacial energy dissipation between surfaces sliding across the lipid bilayer membranes. Atomistic simulation… ▽ More Transient electric fields across cell bilayer membranes can lead to electroporation, as well as to cell fusion, and have been extensively studied. We find that transmembrane electric fields similar to those in cells can lead to a massive, reversible modulation--by up to 200-fold--of the interfacial energy dissipation between surfaces sliding across the lipid bilayer membranes. Atomistic simulations reveal that this arises from (fully reversible) electroporation of the interfacially-confined bilayers, and formation of bilayer bridges analogous to stalks preceding intermembrane fusion. These cell-membrane-mimicking effects topologically-force the slip to partially-revert from the low-dissipation, hydrated lipid-headgroups plane to the intra-bilayer, high-dissipation acyl tail interface. Our results demonstrate that lipid bilayers under transmembrane electric fields can have striking materials-modification properties, and shed new light on membrane hemifusion. △ Less

Submitted 26 June, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

Comments: Yu Zhang and Di ** contributes equally to this work

Showing 1–50 of 528 results for author: Klein, J