Stop Thinking, Just Do!

Sungsoo Kim's Blog

Large Concept Models (LCMs)

tagsTags

27 January 2025


Article Source


Large Concept Models (LCMs)

Abstract

In this video, we dive into Large Concept Models (LCMs), an innovative architecture from a recent Meta paper titled: “Large Concept Models: Language Modeling in a Sentence Representation Space”.

Unlike Large Language Models (LLMs), LCMs operate on higher-level semantics, processing concepts instead of tokens, making them more akin to human reasoning, and possibly positioning them as a future rival to the current token-based LLM architecture.

Throughout the video, we explain what reasoning in the abstract concepts space means, and explore the Large Concept Models architecture, diving into few architecture options.

Specifically, two LCMs are based on Diffusion Models, so we provide a brief reminder about Diffusion Models and then explain the Diffusion-based LCMs.


comments powered by Disqus