Mitigating LLM Hallucinations with a Metrics-First Evaluation Framework

Abstract

Join this workshop, where we will showcase powerful metrics for evaluating the quality of LLM inputs and outputs, with a focus on both RAG and fine-tuning use cases. In the context of LLMs, “hallucination” refers to a phenomenon where the model generates text that is incorrect, nonsensical, or not real. Because LLMs are not databases or search engines, they do not cite the sources their responses are based on. Instead, they generate text as an extrapolation from the prompt you provide.

What attendees can expect to take away from the workshop:

A deep dive into research-backed metrics for evaluating the quality of inputs (data quality, RAG context quality, etc.) and outputs (hallucinations) when building LLM-powered applications.
An evaluation and experimentation framework for prompt engineering with RAG and for fine-tuning with your own data.
A demo-led, practical guide to building guardrails and mitigating hallucinations in LLM-powered applications.

This event is inspired by DeepLearning.AI’s GenAI short courses, created in collaboration with AI companies across the globe. Our courses help you learn new skills, tools, and concepts efficiently within 1 hour.

https://www.deeplearning.ai/short-courses/

About Galileo

At Galileo, we are building the first algorithm-powered LLMOps platform for the enterprise. Galileo provides ML teams with an intelligent ML data bench to collaboratively improve data quality across their model workflows, from pre-training to post-production. Galileo currently powers ML teams across the Fortune 500 as well as startups in multiple industries.

Speakers:

Vikram Chatterji, Co-founder and CEO at Galileo / vikram-chatterji

Atindriyo Sanyal, Co-founder and CTO at Galileo / atinsanyal

Stop Thinking, Just Do!