Abstract
This study explores the explainability capabilities of large language models (LLMs), when employed to autonomously generate machine learning (ML) solutions. We examine two classification tasks: (i) a binary classification problem focused on predicting driver alertness states, and (ii) a multilabel classification problem based on the yeast dataset. Three state-of-the-art LLMs (i.e. OpenAI GPT, Anthropic Claude, and DeepSeek) are prompted to design training pipelines for four common classifiers: Random Forest, XGBoost, Multilayer Perceptron, and Long Short-Term Memory networks. The generated models are evaluated in terms of predictive performance (recall, precision, and F1-score)and explainability using SHAP (SHapley Additive exPlanations). Specifically, we measure Average SHAP Fidelity (Mean Squared Error between SHAP approximations and model outputs) and Average SHAP Sparsity (number of features deemed influential). The results reveal that LLMs
are capable of producing effective and interpretable models, achieving high fidelity and consistent sparsity, highlighting their potential as automated tools for interpretable ML pipeline generation. Importantly, our findings indicate that the quality of LLM-generated pipelines closely approximates that of manually engineered solutions both in predictive accuracy and explainability, suggesting that LLMs can reliably support or even partially automate explainable ML workflows with minimal loss in interpretive depth.
are capable of producing effective and interpretable models, achieving high fidelity and consistent sparsity, highlighting their potential as automated tools for interpretable ML pipeline generation. Importantly, our findings indicate that the quality of LLM-generated pipelines closely approximates that of manually engineered solutions both in predictive accuracy and explainability, suggesting that LLMs can reliably support or even partially automate explainable ML workflows with minimal loss in interpretive depth.
| Original language | English |
|---|---|
| Title of host publication | THE 17TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF DIGITAL ECOSYSTEMS |
| Subtitle of host publication | MEDES 2025 |
| Publisher | Springer |
| Number of pages | 14 |
| Publication status | Accepted/In press - 26 Sept 2025 |
| Event | The 17th International Conference on Management of Digital Ecosystems - Ho Chi Minh City, Viet Nam Duration: 24 Nov 2025 → 26 Nov 2025 https://conferences.sigappfr.org/medes2025/ |
Publication series
| Name | Communications in Computer and Information Science |
|---|---|
| Publisher | Springer |
| ISSN (Print) | 1865-0929 |
| ISSN (Electronic) | 1865-0937 |
Conference
| Conference | The 17th International Conference on Management of Digital Ecosystems |
|---|---|
| Abbreviated title | MEDES 2025 |
| Country/Territory | Viet Nam |
| City | Ho Chi Minh City |
| Period | 24/11/25 → 26/11/25 |
| Internet address |