# Transformer-based Code Completion

A proof-of-concept source code completion model based on the Universal Transformer architecture, trained on the FunCom dataset of 2.1 million Java methods.

Check out the live demo!

## Technical Details

In contrast to the originally proposed Transformer, which is made up of a stack of layers, each with its own parameters, the Universal Transformer applies the same layer repeatedly. This improves performance across many tasks, particularly those of an algorithmic nature (such as processing source code, as opposed to natural language).
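A minimal sketch of that difference in PyTorch (the width `d_model=256` and head count `nhead=8` are illustrative, not the project's actual hyperparameters, and the per-step timestep embeddings of the full Universal Transformer are omitted):

```python
import torch
import torch.nn as nn

# Standard Transformer encoder: four distinct layers, each with its own parameters.
stacked_layers = nn.ModuleList(
    [nn.TransformerEncoderLayer(d_model=256, nhead=8) for _ in range(4)]
)

# Universal Transformer style: a single shared layer applied four times.
shared_layer = nn.TransformerEncoderLayer(d_model=256, nhead=8)

def universal_encode(x: torch.Tensor, steps: int = 4) -> torch.Tensor:
    # The same weights are reused at every step.
    for _ in range(steps):
        x = shared_layer(x)
    return x
```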

In tasks such as completion, where we are not translating between languages, only the "encoder" half of the Transformer is used.

The steps to predict the next token, given an input prompt (a sketch follows the list):

  1. The prompt is tokenized using a SentencePiece model
  2. The input tokens are processed by an embedding layer, which turns them into vectors
  3. A Transformer Encoder Layer is applied n=4 times
  4. Finally, a Dense layer is used to produce next-token probabilities
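A minimal sketch of these four steps, assuming illustrative hyperparameters and a trained SentencePiece model at a hypothetical path (`tokenizer.model`); positional encodings and other details are omitted for brevity, and the repository's actual classes and names will differ:

```python
import torch
import torch.nn as nn
import sentencepiece as spm

class CompletionModel(nn.Module):
    def __init__(self, vocab_size: int, d_model: int = 256, nhead: int = 8, steps: int = 4):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, d_model)               # step 2: tokens -> vectors
        self.encoder_layer = nn.TransformerEncoderLayer(d_model, nhead)  # one shared encoder layer
        self.steps = steps                                               # step 3: applied n=4 times
        self.to_logits = nn.Linear(d_model, vocab_size)                  # step 4: dense output layer

    def forward(self, token_ids: torch.Tensor) -> torch.Tensor:
        # token_ids: (seq_len,) -> (seq_len, 1, d_model), as the encoder layer expects (seq, batch, dim)
        x = self.embedding(token_ids).unsqueeze(1)
        for _ in range(self.steps):
            x = self.encoder_layer(x)
        # Predict the next token from the representation of the last position.
        return self.to_logits(x[-1, 0])

# Step 1: tokenize the prompt with a SentencePiece model (the path is hypothetical).
sp = spm.SentencePieceProcessor(model_file="tokenizer.model")
token_ids = torch.tensor(sp.encode("public int add(int a, int b) {"))

model = CompletionModel(vocab_size=sp.get_piece_size())
next_token_probs = torch.softmax(model(token_ids), dim=-1)  # next-token probabilities
```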

This process is repeated until the end-of-sentence token is produced, or a specified maximum length is exceeded.
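A sketch of that loop, reusing the hypothetical `model` and `sp` objects from the previous example and using greedy decoding for simplicity:

```python
def complete(prompt: str, max_length: int = 64) -> str:
    eos_id = sp.eos_id()              # end-of-sentence id defined by the SentencePiece model
    token_ids = sp.encode(prompt)
    with torch.no_grad():
        while len(token_ids) < max_length:
            logits = model(torch.tensor(token_ids))
            next_id = int(logits.argmax())   # greedy choice of the next token
            if next_id == eos_id:
                break
            token_ids.append(next_id)
    return sp.decode(token_ids)

print(complete("public int add(int a, int b) {"))
```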

In contrast to my Code Summarization Transformer, I implemented this project using PyTorch.

## Running Locally

You'll need Python 3 with `torch`, `sentencepiece`, and `Keras-Preprocessing` installed. Run `run.py` to start an interactive demo.

You can train a new model by editing the parameters in `train.py`.