Enhancing LLM code generation using natural language processing in the contexts of machine learning

Research output: Chapter in Book/Conference proceeding with ISSN or ISBNConference contribution with ISSN or ISBNpeer-review

Abstract

The rapid rise in popularity of Generative AI and Large Language Models (LLMs) has brought both innovation and controversy, particularly regarding plagiarism and IP law infringements. However, one underexplored concern is the generation of code by these models, which, despite their potential, often includes errors and promotes poor programming practices. This paper explores new methods to address these issues by integrating LLMs with Automated Machine Learning (AutoML). By leveraging AutoML’s capabilities in hyperparameter tuning and model selection, we propose a novel approach for generating robust machine learning algorithms. This integration aims to enhance the accuracy and reliability of code generation while mitigating legal risks. Our findings include the application of Natural Language Processing (NLP) and Natural Language Understanding (NLU) techniques to interpret chatbot prompts, thereby improving the generation and customization of machine learning models. The proposed methodology demonstrates practical implementation and high prediction accuracy, offering a promising solution to the current challenges faced by LLM-based code generation. In summary the findings of the paper are as follows: A new implementation of natural language processing for natural language understanding in the context of chatbot prompts aims to serve as an initial step for feature extraction, which will be utilised by an AutoML system to generate machine learning algorithms.
Original languageEnglish
Title of host publicationDatabase Engineered Applications - 28th International Symposium, IDEAS 2024, Proceedings
Subtitle of host publication28th International Symposium, IDEAS 2024, Bayonne, France, August 26–29, 2024, Proceedings
EditorsRichard Chbeir, Sergio Ilarri, Yannis Manolopoulos, Peter Z. Revesz, Jorge Bernardino, Carson K. Leung
PublisherSpringer
Pages92-105
Number of pages14
Edition1
ISBN (Electronic)9783031834721
ISBN (Print)9783031834721, 9783031834721, 9783031834721, 9783031834721, 9783031834714
DOIs
Publication statusPublished - 16 Mar 2025
EventInternational Database Engineered Applications Symposium - IUT de Bayonne, Bayonne, France
Duration: 28 Aug 202431 Aug 2024
Conference number: 28
https://conferences.sigappfr.org/ideas2024/

Publication series

NameLecture Notes in Computer Science
Volume15511 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceInternational Database Engineered Applications Symposium
Abbreviated titleIDEAS2024
Country/TerritoryFrance
CityBayonne
Period28/08/2431/08/24
Internet address

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords

  • LLM
  • NLP
  • AutoML
  • Chatbot
  • Machine Learning

Fingerprint

Dive into the research topics of 'Enhancing LLM code generation using natural language processing in the contexts of machine learning'. Together they form a unique fingerprint.

Cite this