An LLM Evaluation Framework for High-Stakes AI

June 16, 2026 • Podcast

By
Violet Turri and Katherine-Marie Robinson

This podcast presents the Evaluation Large Language Models library, which turns evaluation from an ad-hoc process into a repeatable, extensible framework.

Publisher

Software Engineering Institute

DOI (Digital Object Identifier)

10.58012/ks2t-3g75

Listen

Watch

Abstract

Experimentation and validation of LLM performance is critical when building LLM-driven systems that must reliably deliver a service, from customer service chat bots to intelligence analysis tools. To help teams meet the need for rigorous evaluation methods, a research team in the SEI’s AI Division led by Violet Turri has developed the Evaluating Large Language Models (ELM) library, which is built on best practices for LLM evaluation and benchmarking. In the latest episode from the Carnegie Mellon University Software Engineering Institute Podcast Series, Turri sits down with Katie Robinson, a design researcher also in the SEI’s AI division, to discuss the ELM library, which turns evaluation from an ad-hoc process into a repeatable, extensible framework.

About the Speaker

Violet Turri

Violet Turri is an assistant software developer in the SEI AI Division where she works on multiple machine-learning engineering projects with an emphasis on explainability, test and evaluation strategies, and computer vision. Turri holds a bachelor’s degree in computer science from Cornell University and has a research background in human-computer …

Katherine-Marie Robinson

Katherine-Marie Robinson is an assistant design researcher in the SEI’s AI Division. Since joining the SEI in September 2022, Robinson has worked on a wide variety of projects where she aims to bring a responsible AI (RAI) lens to the work at hand including researching and developing tools, curriculums, and …

An LLM Evaluation Framework for High-Stakes AI

June 16, 2026 • Podcast

By Violet Turri and Katherine-Marie Robinson

Publisher

DOI (Digital Object Identifier)

Listen

Watch

Abstract

About the Speaker

Violet Turri

Katherine-Marie Robinson

SHARE

Subscribe

Supplemental Materials

Related Links

By
Violet Turri and Katherine-Marie Robinson