IAS Physics Group Meeting

The Harmonic Oscillator of Large Language Models

Abstract: Recently, I have been trying to understand how large language models (LLMs) work, what their basic building blocks are and how they are trained. In this talk I want to share some of my understanding with you by discussing the so-called ‘Transformer’ architecture. This architecture is the driving force behind most of the current AI revolution, and therefore quintessential for understanding the LLMs around today. 

 

Date & Time

November 08, 2023 | 11:00am – 12:15pm

Location

Bloomberg Lecture Hall (IAS)

Event Series

Categories

Tags