"Tree AI"
Investigating Language Models for Functional Programming Code Completion

In a recent study, researchers evaluated how well the language models CodeGPT and UniXcoder complete code in the functional programming language Haskell. The evaluation used Haskell functions from a publicly available Haskell dataset on HuggingFace, alongside a novel HumanEval dataset. The findings show that the knowledge of imperative languages that Large Language Models (LLMs) acquire during pre-training does not transfer well to functional languages. Code completion for such languages is nonetheless feasible, pointing to a need for high-quality Haskell datasets for model training.
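To make the task concrete, here is a minimal sketch of what line-level completion of a Haskell function looks like: given the type signature and the preceding lines, the model must predict the next line of the body. The `safeDiv` function below is a hypothetical illustration, not an example drawn from the paper's dataset.

```haskell
-- Hypothetical line-completion task: the model sees the signature
-- and the first equation, and must predict the second equation.
safeDiv :: Int -> Int -> Maybe Int
safeDiv _ 0 = Nothing                -- context shown to the model
safeDiv x y = Just (x `div` y)       -- target line to be completed

main :: IO ()
main = print (safeDiv 10 2)
```

Idioms like pattern matching and `Maybe` have no direct imperative analogue, which is one reason knowledge pre-trained on imperative languages may transfer poorly here.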

  • Functional programming languages have been underrepresented in code completion research.
  • Language models such as CodeGPT and UniXcoder were evaluated on Haskell functions.
  • Automatic evaluation indicates imperative knowledge does not transfer well.
  • A manual evaluation shows CodeGPT often generates empty predictions; UniXcoder produces incomplete predictions.
  • The study highlights the necessity for more high-quality Haskell datasets.

This paper emphasizes the importance of incorporating diverse programming languages into AI models to enhance developer tools across various coding paradigms. Such research can lead to more inclusive and effective coding assistance for functional programming languages, fostering a more versatile software development environment.
