FEAT: Support OpenHermes 2.5 #776

Merged (3 commits) on Dec 18, 2023

@Bojun-Feng (Contributor) commented Dec 18, 2023

Fix #775

  • Update Model Family JSON
  • Update README
  • Update Docs

I tested the GGUF model locally with llama.cpp, but I did not test the PyTorch models because I don't have CUDA support locally.

I played around with OpenHermes 2.5 on my laptop, and I believe it is the best local 7B model we have so far. If anyone is interested, here are some outputs from the Q2_K quantization, run at temperature 0 for deterministic output (I expect the other quantizations to perform even better):

[Screenshots: four captures of model output, taken 2023-12-17 between 6:29 PM and 10:23 PM]
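
For anyone wondering why temperature 0 gives repeatable output: as the temperature approaches 0, softmax sampling collapses onto the single highest-scoring token, so the sampler always makes the same choice. Below is a minimal toy sketch of that idea in plain Python. The `sample_token` helper and the logits are made up for illustration; this is not how llama.cpp or Xinference implement sampling.

```python
import math
import random


def sample_token(logits, temperature, rng=None):
    """Pick a token index from raw logits at the given temperature.

    Hypothetical helper for illustration only.
    """
    if temperature <= 0:
        # Greedy decoding: always return the highest-scoring index.
        return max(range(len(logits)), key=lambda i: logits[i])
    rng = rng or random.Random()
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    # Sample an index in proportion to the softmax weights.
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


logits = [1.0, 3.5, 0.2, 3.4]  # made-up scores for a 4-token vocabulary

# At temperature 0 every call picks the same index.
greedy = [sample_token(logits, 0) for _ in range(5)]

# At temperature 1.0 the near-tied tokens can both show up across calls.
sampled = [sample_token(logits, 1.0) for _ in range(5)]
```

With a positive temperature the two near-tied tokens can alternate between runs, which is why comparing outputs across quantizations is easier with greedy decoding.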

@XprobeBot XprobeBot added this to the v0.7.3 milestone Dec 18, 2023
@aresnow1 aresnow1 merged commit 5a77ff9 into xorbitsai:main Dec 18, 2023
12 checks passed

Successfully merging this pull request may close these issues.

FEAT: support openhermes 2.5