Skip to content
View zhuzilin's full-sized avatar
🤔
llm...
🤔
llm...

Block or report zhuzilin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zhuzilin/README.md

Hoping to be what Paul Graham called hacker.

Hey, I'm zhuzilin, an engineer driven by curiosity.

My main focus is on MLSys.

  • You can ask me about deep learning frameworks. I am contributor to many tools like pytorch, tensorflow and horovod.
  • I am a LLM believer and am really lucky to get hands dirty on training them @WeChat, from pretraining from scratch to sft and rlhf, along with writing training frameworks for those.
  • Recently, I wrote ring-flash-attention and am working on improving OpenRLHF/OpenRLHF.

I'm also interested in JavaScript engine. I've read the es5 spec to write es and helped fixed bugs in the early stage of oven-sh/bun.

Avatar is Shoyo Hinata, from Haikyu!!.


我是 zhuzilin,一个由兴趣驱动的工程师~

我的主要精力放在 MLSys 领域。

  • 我对深度学习训练框架比较了解,是 pytorch, tensorflow, horovod 等工具的 contributor。
  • LLM 信徒,在微信大模型团队打工中。有幸深入接触过 LLM 训练的各个环节,不管是从零预训练,还是 sft 与 rlhf,以及写用来做这些事的训练框架。
  • 最近写了 ring-flash-attention,并且在尝试优化 OpenRLHF/OpenRLHF 中。

我对 JavaScript 引擎也比较感兴趣。读过 spec,写过解释器(es),还给早期的 oven-sh/bun 提过一些 bugfix。

头像是日向翔阳,《排球少年》。

Pinned Loading

  1. ring-flash-attention ring-flash-attention Public

    Ring attention implementation with flash attention

    Python 554 43

  2. OpenRLHF/OpenRLHF OpenRLHF/OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

    Python 2.2k 221

  3. pytorch/pytorch pytorch/pytorch Public

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python 82.9k 22.4k

  4. tensorflow/tensorflow tensorflow/tensorflow Public

    An Open Source Machine Learning Framework for Everyone

    C++ 186k 74.3k

  5. es es Public archive

    A JavaScript interpreter from scratch, supporting ES5 syntax.

    C++ 25 6

  6. oven-sh/bun oven-sh/bun Public

    Incredibly fast JavaScript runtime, bundler, test runner, and package manager – all in one

    Zig 73.7k 2.7k