Hermes 3 Technical Report

Ryan Teknium, Jeffrey Quesnelle, Chen Guang. No Venue 2024

Instruct-tuned (or “chat”) models have become the primary way most people interact with large language models. Unlike “base” or “foundation” models, instruct-tuned models are optimized to respond to imperative statements. We present Hermes 3, a neutrally aligned generalist instruct and tool-use model with strong reasoning and creative abilities. Its largest version, Hermes 3 405B, achieves state-of-the-art performance among open-weight models on several public benchmarks.

https://huggingface.co/discussions/paper/66c7e0af737ba92ae3f2d507
