Popular repositories Loading
-
-
lp-sparsemap
lp-sparsemap PublicLP-SparseMAP: Differentiable sparse structured prediction in coarse factor graphs
-
Repositories
Showing 10 of 62 repositories
- tower-eval Public
deep-spin/tower-eval’s past year of commit activity - nanotron Public Forked from huggingface/nanotron
Minimalistic large language model 3D-parallelism training
deep-spin/nanotron’s past year of commit activity - infinite-former Public
deep-spin/infinite-former’s past year of commit activity - Megatron-LM-pretrain Public Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
deep-spin/Megatron-LM-pretrain’s past year of commit activity - Megatron-DeepSpeed Public Forked from microsoft/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
deep-spin/Megatron-DeepSpeed’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…