pytorch-xla-transformer-language-model

This repository is an open source test case for pytorch/xla that runs a minimal training loop for a Transformer language model on a single TPU device.

This code is intended as a reference for testing how XLA compiles the model; it is not intended for training a useful language model. During initial runs it triggered recompilation far too often, but those issues have since been resolved.
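A quick way to check for this kind of recompilation is the pytorch/xla metrics report, which counts graph compilations and device executions. A minimal sketch follows (the torch_xla.debug.metrics module path matches current pytorch/xla releases; the r0.1 image may expose it under an older name):

import torch_xla.debug.metrics as met

# Print counters such as CompileTime and ExecuteTime after a few training
# steps; a CompileTime count that keeps climbing from step to step is the
# usual symptom of the recompilation problem described above.
print(met.metrics_report())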

Depends on the Docker image gcr.io/tpu-pytorch/xla:r0.1.
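The image can be entered with a standard Docker invocation, for example (flags are illustrative; add volume mounts or network options as your setup requires):

docker run -it --rm gcr.io/tpu-pytorch/xla:r0.1 /bin/bash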

# Set this to your TPU worker's IP address.
export TPU_IP=<your TPU IP>
# 8470 is the default XRT TPU port.
export XRT_TPU_CONFIG="tpu_worker;0;$TPU_IP:8470"
# Map PyTorch's 64-bit integer types to 32-bit on the TPU.
export XLA_USE_32BIT_LONG=1
# Record Python source information in the XLA IR and HLO for debugging.
export XLA_IR_DEBUG=1
export XLA_HLO_DEBUG=1
python3 train.py
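For reference, the single-device pytorch/xla pattern that a loop like train.py follows looks roughly like the sketch below. This is a minimal illustration, not the repository's code: the toy model, shapes, and hyperparameters are placeholders, and the xm module path matches current torch_xla releases rather than r0.1.

import torch
import torch.nn as nn
import torch_xla.core.xla_model as xm

device = xm.xla_device()  # the single TPU device

# Placeholder model: a toy embedding + LM head, not the repo's Transformer.
model = nn.Sequential(nn.Embedding(1000, 64), nn.Linear(64, 1000)).to(device)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
loss_fn = nn.CrossEntropyLoss()

for step in range(10):
    # Fixed-shape random batches; constant shapes are what keep XLA from
    # recompiling the graph on every step.
    tokens = torch.randint(0, 1000, (8, 32), device=device)
    targets = torch.randint(0, 1000, (8, 32), device=device)
    optimizer.zero_grad()
    logits = model(tokens)
    loss = loss_fn(logits.view(-1, 1000), targets.view(-1))
    loss.backward()
    optimizer.step()
    # mark_step() cuts the lazily traced graph and triggers its
    # compilation and execution on the TPU.
    xm.mark_step()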

Output is in run.log.
