MNIST Trainer From Scratch on GPU

About

This is a CNN trainer for MNIST handwritten digit dataset written in C++ and CUDA from scratch.

Tools for GPU implementation

CUDA (https://developer.nvidia.com/cuda-toolkit)
OpenACC (https://www.openacc.org/)

Requirements

nvcc
pgcc

Feature

This program supports NVIDIA Tensor Core. Tensor Core is an arithmetic circuit specialized for matrix multiplication operations.

CUDA can access Tensor Cores through WMMA API like below.

wmma::fragment<wmma::matrix_a, TILESIZE, TILESIZE, TILESIZE, __half, wmma::row_major> a_frag;
wmma::fragment<wmma::matrix_b, TILESIZE, TILESIZE, TILESIZE, __half, wmma::row_major> b_frag;
wmma::fragment<wmma::accumulator, TILESIZE, TILESIZE, TILESIZE, __half> c_frag;
wmma::fill_fragment(c_frag, __float2half(0.f));

wmma::load_matrix_sync(a_frag, &a_half[wid*ELEMS_TILE], 16);
wmma::load_matrix_sync(b_frag, &b_half[wid*ELEMS_TILE], 16);
wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);
wmma::store_matrix_sync(&c_half[wid*ELEMS_TILE], c_frag, 16, wmma::mem_row_major);

Tensor core performs a certain size of matrix multiplications. We can make large matrix multiplications by splitting each matrix into small 'tiles' and throw it into Tensor Cores.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
mnist		mnist
README.md		README.md
cnn.cpp		cnn.cpp
cnn.hpp		cnn.hpp
dnn.cpp		dnn.cpp
dnn.hpp		dnn.hpp
dot.cu		dot.cu
main.cpp		main.cpp
tensor.cpp		tensor.cpp
tensor.hpp		tensor.hpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MNIST Trainer From Scratch on GPU

About

Tools for GPU implementation

Requirements

Feature

About

Releases

Packages

Contributors 2

Languages

TsutsuiMasayoshi/CNN-from-scratch

Folders and files

Latest commit

History

Repository files navigation

MNIST Trainer From Scratch on GPU

About

Tools for GPU implementation

Requirements

Feature

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages