Weaver: Foundation Models for Creative Writing

The article introduces Weaver, a family of large language models (LLMs) specifically designed for content creation. The models are pre-trained on a selected corpus to enhance their writing capabilities and are fine-tuned for creative and professional writing. The Weaver family includes models of varying sizes, from Weaver Mini (1.8B) to Weaver Ultra (34B), which can be chosen based on the application and dispatched by a routing agent to balance response quality and computation cost. The models have been shown to outperform larger generalist LLMs in writing capabilities, with the Weaver Ultra model surpassing GPT-4, a state-of-the-art generalist LLM.

Weaver also supports retrieval-augmented generation (RAG) and function calling, which can be used to improve AI-assisted writing systems. This includes integrating external knowledge bases, tools, or APIs, and providing personalized writing assistance. The article also discusses guidelines and best practices for pre-training and fine-tuning domain-specific LLMs.

Key takeaways:

This work introduces Weaver, a family of large language models (LLMs) dedicated to content creation, pre-trained on a carefully selected corpus to improve writing capabilities.
Weaver models are fine-tuned for creative and professional writing purposes and aligned to the preference of professional writers using novel methods for instruction data synthesis and LLM alignment.
The Weaver family consists of models of different sizes, suitable for various applications and can be dynamically dispatched by a routing agent according to query complexity to balance response quality and computation cost.
Weaver natively supports retrieval-augmented generation (RAG) and function calling (tool usage), with various use cases for improving AI-assisted writing systems, including integration of external knowledge bases, tools, or APIs, and providing personalized writing assistance.

Weaver: Foundation Models for Creative Writing

Key takeaways:

Comments (0)

Newsletter