
LLMs: From Transformers to GPT | Winter 2024


Professional Masters Program | University of Washington, Seattle


Instructor - Dr. Karthik Mohan



Pre-requisites

The course assumes you know the basics of machine learning and are comfortable coding in Python. If this is not the case, please reach out to the TAs or the instructor as soon as possible. Setup is also key to success in the course; Assignment 1 partly takes care of it (coding environment, APIs, etc.).
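
For instance, a quick sanity check of your coding environment might look like the sketch below. The package (openai) and model name are illustrative assumptions, not the official Assignment 1 setup.

```python
# Minimal environment sanity check (a sketch; package and model names are
# assumptions, not the official Assignment 1 setup).
import os
import sys

assert sys.version_info >= (3, 9), "A recent Python 3 is recommended"
assert os.environ.get("OPENAI_API_KEY"), "Set OPENAI_API_KEY before using the API"

from openai import OpenAI  # pip install openai

client = OpenAI()  # reads OPENAI_API_KEY from the environment
reply = client.chat.completions.create(
    model="gpt-3.5-turbo",  # assumed model; use whatever the assignment specifies
    messages=[{"role": "user", "content": "Reply with 'setup OK' if you can read this."}],
)
print(reply.choices[0].message.content)
```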

Flavor of the Course

The course will be a modern introduction to deep learning, transformers, GPT and beyond. We will have a mix of concepts, examples, industry applications, theory, algorithms, code and demos. Expect a lot of hands-on coding assignments and mini-projects. Towards the latter half of the course, we will touch on the latest trends in Generative AI.

Lecture Dates

  1. Tuesdays, 4-6 pm, in person
  2. Thursdays, 4-6 pm, in person

Course Syllabus (we may leave out some topics depending on available time)


  1. Introduction and Motivation for LLMs
  2. When even DL became a thing of the past! Deep Learning and its evolution
  3. What started the AI transformation? Transformer Architecture
  4. How do you concisely express data? Embeddings
  5. How does one make search smarter? Similarity Search with Transformers and Embeddings (a minimal code sketch follows this list)
  6. Discriminative vs Generative Transformers
  7. It’s just the stream of consciousness! Purely Generative Transformers: GPT, GPT-2, GPT-3
  8. What made ChatGPT so popular? Fine-tuning and RLHF: GPT-3.5 and GPT-4
  9. LLMs vs APIs
  10. If only you had prompted me! Prompt Engineering
  11. Use of APIs
  12. To open or not to open? Closed vs Open-source LLMs
  13. Privacy, cost and other issues? Open-source LLMs: Llama, Mixtral, Phi-1.5 and Phi-2
  14. Fine-tuning LLMs
  15. Can you make my fine-tuning easy for me? Tricks to fine-tune LLMs
  16. I can only handle smaller models!! Distillation and its use-cases
  17. What do I do if I don’t have enough data? :-/ LLMs for Data Augmentation
  18. When the LLM becomes your annotator! Using LLMs to label data and train smaller models
  19. How can you trust an LLM? Evaluating LLMs
  20. Show me the cool stuff! Question-Answering, Sentiment Analysis and more
  21. Showcasing LLMs over web demos: Use of Streamlit to build web-apps with innovative uses of LLMs and smaller models (a bare-bones Streamlit sketch also follows this list)
  22. Is my data safe? Privacy and building in-house LLMs with privacy constraints
  23. LLMs -> LVMs: Moving from Language to Images and Videos
  24. How the heck can you generate an image from just text? Stable Diffusion
  25. Can you remove the photo-bomber from the pic? In-painting using image segmentation and Stable Diffusion
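
As a taste of topics 4 and 5, here is a minimal sketch of similarity search with embeddings. The library (sentence-transformers) and model name are illustrative assumptions, not a course requirement.

```python
# Embeddings + similarity search (topics 4-5); library and model name are
# illustrative choices, not course requirements.
import numpy as np
from sentence_transformers import SentenceTransformer  # pip install sentence-transformers

model = SentenceTransformer("all-MiniLM-L6-v2")  # small, fast embedding model

corpus = [
    "Transformers use self-attention to process sequences.",
    "Seattle gets a lot of rain in the winter.",
    "GPT models are decoder-only transformers.",
]
query = "How does the transformer architecture work?"

# Encode into dense vectors, normalized so a dot product equals cosine similarity.
corpus_emb = model.encode(corpus, normalize_embeddings=True)
query_emb = model.encode(query, normalize_embeddings=True)

scores = corpus_emb @ query_emb  # cosine similarity of the query against each sentence
best = int(np.argmax(scores))
print(f"Best match (score {scores[best]:.2f}): {corpus[best]}")
```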
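
And for topic 21, a bare-bones Streamlit sketch. The placeholder "model" below just echoes the prompt reversed; in the course you would swap in a real LLM call.

```python
# Bare-bones Streamlit app (illustrative only). Run with: streamlit run app.py
import streamlit as st

st.title("Tiny LLM demo")
prompt = st.text_input("Enter a prompt:")

if prompt:
    # Placeholder "model": reverses the prompt. Swap in a real LLM call
    # (API or local model) for an actual demo.
    st.write(f"Model output (placeholder): {prompt[::-1]}")
```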