Augmenting Causal Discovery with LLMs

Abstract

A talk by Ayodeji Ijishakin and Ahmed Abdulaal, Computer Science PhD Student’s at University College London.

Scientific discovery hinges on the effective integration of metadata, which refers to a set of ‘cognitive’ operations such as determining what information is relevant for inquiry, and data, which encompasses physical operations such as observation and experimentation. This talk introduces the Causal Modelling Agent (CMA), a novel framework that synergizes the metadata-based reasoning capabilities of Large Language Models (LLMs) with the data-driven modelling of Deep Structural Causal Models (DSCMs) for the task of causal discovery. We evaluate the CMA’s performance on a number of benchmarks, as well as on the real-world task of modelling the clinical and radiological phenotype of Alzheimer’s Disease (AD). Our experimental results indicate that the CMA can outperform previous data-driven or metadata-driven approaches to causal discovery. In our real-world application, we use the CMA to derive new insights into the causal relationships among biomarkers of AD.

Stop Thinking, Just Do!