Do Large Language Models Generate Similar Codes from Mutated Prompts?: A Case Study of Gemini Pro
In this work, we delve into the domain of source code similarity detection using Large Language Models (LLMs). Our investigation is motivated by the necessity to identify similarities among different pieces of source code, a critical aspect for tasks such as plagiarism detection and code reuse. We specifically focus on exploring the effectiveness of leveraging LLMs for this purpose. To achieve this, we utilized the LLMSecEval dataset, comprising 150 NL prompts for code generation across two languages - C and Python, and employed radamsa, a mutation-based input generator, to create 27 different mutations per NL prompt. Subsequently, using Gemini Pro, we generated code for the original and mutated NL prompts to study code similarity using CodeBERT. Our experiment aims to uncover the extent to which LLMs can consistently generate similar code despite mutations in the input NL prompts, providing insights into the robustness and generalization capabilities of LLMs in understanding and comparing programming code structures and semantics.
Poster (FSE_2024_LLM_Sim.pdf) | 632KiB |
Wed 17 JulDisplayed time zone: Brasilia, Distrito Federal, Brazil change
10:30 - 11:00 | |||
10:30 30mPoster | MicroSensor: Towards an Extensible Tool for the Static Analysis of Microservices Systems in Continuous Integration Posters Edson Soares Instituto Atlantico & State University of Ceara (UECE), Matheus Paixao State University of Ceará, Allysson Allex Araújo Federal University of Cariri | ||
10:30 30mPoster | SORBET: A Framework to Evaluate the Robustness of LiDAR 3D Object Detection and Its Impacts on Autonomous Driving Posters | ||
10:30 30mPoster | An Analysis of the Costs and Benefits of Autocomplete in IDEs Posters Shaokang Jiang University of California, San Diego, Michael Coblenz University of California, San Diego | ||
10:30 30mPoster | Go the Extra Mile: Fixing Propagated Error-Handling Bugs Posters Haoran Liu National University of Defense Technology, Zhouyang Jia National University of Defense Technology, Huiping Zhou National University of Defense Technology, Haifang Zhou National University of Defense Technology, Shanshan Li National University of Defense Technology | ||
10:30 30mPoster | Hybrid Regression Test Selection by Synergizing File and Method Call Dependences Posters Luyao Liu College of Computer, National University of Defense Technology, Guofeng Zhang College of Computer, National University of Defense Technology, Zhenbang Chen College of Computer, National University of Defense Technology, Ji Wang School of Computer, National University of Defense Technology, China | ||
10:30 30mPoster | Do Large Language Models Generate Similar Codes from Mutated Prompts?: A Case Study of Gemini Pro Posters DOI Pre-print Media Attached File Attached | ||
10:30 30mPoster | Towards Realistic SATD Identification Through Machine Learning Models: Ongoing Research and Preliminary Results Posters Eliakim Gama State University of Ceará, Matheus Paixao State University of Ceará, Mariela I. Cortés State University of Ceará, Lucas Monteiro State University of Ceará DOI Pre-print | ||
10:30 30mPoster | Building Software Engineering Capacity through a University Open Source Program Office Posters | ||
10:30 30mPoster | Inferring Natural Preconditions via Program Transformation Posters | ||
10:30 30mPoster | RFNIT: Robotic Framework for Non-Invasive Testing Posters Davi Simoes Freitas Centro de Informática at Universidade Federal de Pernambuco, Breno Miranda Centro de Informática at Universidade Federal de Pernambuco, Juliano Iyoda Centro de Informática at Universidade Federal de Pernambuco |
This room is conjoined with the Foyer to provide additional space for the coffee break, and hold poster presentations throughout the event.