Stop Thinking, Just Do!

Sungsoo Kim's Blog

How to Build LLMs on Your Company’s Data While on a Budget

tagsTags

27 January 2024


Article Source


How to Build LLMs on Your Company’s Data While on a Budget

Abstract

Large Language Models (LLMs) are taking AI mainstream across companies and individuals. However, public LLMs are trained on general-purpose data. They do not include your own corporate data and they are black boxes on how they are trained. Because terminology is different for healthcare, financial, retail, digital-native and other industries, companies today are looking for industry-specific LLMs to better understand the terminology, context and knowledge that better suits their needs. In contrast to closed LLMs, open source-based models can be used for commercial usage or customized to suit an enterprise’s needs on their own data. Learn how Databricks makes it easy for you to build, tune and use custom models, including a deep dive into Dolly, the first open source, instruction-following LLM, fine-tuned on a human-generated instruction dataset licensed for research and commercial use.

In this session, you will:

  • See a real-life demo of creating your own LLMs specific to your industry
  • Learn how to securely train on your own documents if needed
  • Learn how Databricks makes it quick, scalable and inexpensive
  • Deep dive into Dolly and its applications

Talk by: Sean Owen


comments powered by Disqus