Retrieval Augmented Generation (RAG)

Instructor: Zain Hasan

5 modules

Gain insight into a topic and learn the fundamentals.

Intermediate level

Recommended experience

31 hours to complete

3 weeks at 10 hours a week

Flexible schedule

Learn at your own pace

What you'll learn

How to design and build RAG systems tailored to real-world needs
How to weigh tradeoffs between cost, speed, and quality to choose the right techniques for each component of a RAG system
A foundational framework to adapt RAG systems as new tools and methods emerge

Details to know

Shareable certificate

There are 5 modules in this course

Retrieval Augmented Generation (RAG) improves large language model (LLM) responses by retrieving relevant data from knowledge bases—often private, recent, or domain-specific—and using it to generate more accurate, grounded answers.

In this course, you’ll learn how to build RAG systems that connect LLMs to external data sources. You’ll explore core components like retrievers, vector databases, and language models, and apply key techniques at both the component and system level. Through hands-on work with real production tools, you’ll gain the skills to design, refine, and evaluate reliable RAG pipelines, and to adapt to new methods as the field advances.

Across five modules, you’ll complete hands-on programming assignments that guide you through building each core part of a RAG system, from simple prototypes to production-ready components. Through hands-on labs, you’ll:

- Build your first RAG system by writing retrieval and prompt augmentation functions and passing structured input into an LLM.
- Implement and compare retrieval methods like semantic search, BM25, and Reciprocal Rank Fusion to see how each impacts LLM responses.
- Scale your RAG system using Weaviate and a real news dataset by chunking, indexing, and retrieving documents with a vector database.
- Develop a domain-specific chatbot for a fictional clothing store that answers FAQs and provides product suggestions based on a custom dataset.
- Improve chatbot reliability by handling real-world challenges like dynamic pricing and logging user interactions for monitoring and debugging.

You’ll apply your skills using real-world data from domains like media, healthcare, and e-commerce. By the end of the course, you’ll combine everything you’ve learned to implement a fully functional, more advanced RAG system tailored to your project’s needs.
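To make the "retrieval and prompt augmentation functions" mentioned above more concrete, here is a minimal sketch in Python. It is not the course's assignment code. The function names, the tiny knowledge base, and the word-overlap scoring are all assumptions chosen for illustration, and the final LLM call is left out because the course page does not say which client is used.

```python
import re

# Hypothetical three-document knowledge base, invented for this sketch.
KNOWLEDGE_BASE = [
    "Returns are accepted within 30 days of purchase.",
    "Standard shipping takes 3 to 5 business days.",
    "Gift cards never expire.",
]


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query, documents, top_k=2):
    """Rank documents by how many query tokens they share (a toy retriever)."""
    query_tokens = tokenize(query)
    scored = []
    for index, doc in enumerate(documents):
        overlap = len(query_tokens & tokenize(doc))
        if overlap > 0:
            scored.append((overlap, index))
    # Highest overlap first; ties keep the original document order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [documents[index] for _, index in scored[:top_k]]


def build_augmented_prompt(query, retrieved):
    """Place the retrieved passages ahead of the question in one prompt string."""
    context = "\n".join(f"- {doc}" for doc in retrieved)
    return (
        "Answer the question using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}\n"
        "Answer:"
    )


if __name__ == "__main__":
    question = "How long does shipping take?"
    passages = retrieve(question, KNOWLEDGE_BASE, top_k=1)
    # The resulting string is what would be sent to an LLM of your choice.
    print(build_augmented_prompt(question, passages))
```

A production retriever would replace the word-overlap score with keyword or semantic search, but the shape stays the same: retrieve, augment the prompt, then generate.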

Learn foundational RAG concepts, get familiar with the main components of a RAG system, including the LLM, knowledge base, and retriever, and start building your first functional RAG system.

What's included

8 videos, 1 reading, 1 assignment, 1 programming assignment, 2 ungraded labs

8 videos (total 41 minutes)

Module 1 introduction (1 minute)
A conversation with Andrew Ng (8 minutes)
Introduction to RAG (5 minutes)
Applications of RAG (4 minutes)
RAG architecture overview (5 minutes)
Introduction to LLMs (9 minutes)
Introduction to information retrieval (5 minutes)
Module 1 conclusion (1 minute)

1 reading (total 5 minutes)

Module 1 slides (5 minutes)

1 assignment (total 20 minutes)

Module 1 Quiz (20 minutes)

1 programming assignment (total 180 minutes)

Introduction to RAG systems (180 minutes)

2 ungraded labs (total 120 minutes)

Brief introduction to Python (60 minutes)
LLM calls and crafting simple augmented prompts (60 minutes)

Learn foundational information retrieval techniques, including keyword search, semantic search, and metadata filtering. Then build and evaluate a hybrid search pipeline that combines all three techniques.
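One common way to combine a keyword ranking and a semantic ranking is Reciprocal Rank Fusion, which the course description names among its retrieval methods. The sketch below is an illustration under assumptions: the function names, the document ids, and the use of a plain set of allowed ids as the metadata filter are hypothetical, and whether the module's pipeline fuses results exactly this way is not stated on this page. The constant `k=60` is a conventional default, not a value taken from the course.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists of document ids: each list adds 1 / (k + rank) per id."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by document id for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


def hybrid_search(keyword_ranking, semantic_ranking, allowed_ids, k=60):
    """Apply a metadata filter to both rankings, then fuse what remains."""
    filtered = [
        [doc_id for doc_id in ranking if doc_id in allowed_ids]
        for ranking in (keyword_ranking, semantic_ranking)
    ]
    return reciprocal_rank_fusion(filtered, k=k)


if __name__ == "__main__":
    keyword = ["a", "b", "c"]   # hypothetical output of a keyword search (e.g. BM25)
    semantic = ["b", "c", "a"]  # hypothetical output of a semantic search
    print(reciprocal_rank_fusion([keyword, semantic]))
    print(hybrid_search(keyword, semantic, allowed_ids={"a", "c"}))
```

Because the fusion uses only rank positions, it avoids having to normalize keyword scores and embedding similarities onto a shared scale.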

What's included

10 videos, 1 reading, 1 assignment, 1 programming assignment, 2 ungraded labs

10 videos (total 53 minutes)

Module 2 introduction (1 minute)
Retriever architecture overview (3 minutes)
Metadata filtering (4 minutes)
Keyword search - TF-IDF (7 minutes)
Keyword search - BM25 (4 minutes)
Semantic search - introduction (7 minutes)
Semantic search - embedding model deepdive (6 minutes)
Hybrid search (7 minutes)
Evaluating retrieval (8 minutes)
Module 2 conclusion (2 minutes)

1 reading (total 5 minutes)

Module 2 - Slides (5 minutes)

1 assignment (total 30 minutes)

Module 2 Quiz (30 minutes)

1 programming assignment (total 180 minutes)

Implementing retriever functions in a RAG system (180 minutes)

2 ungraded labs (total 120 minutes)

Vector embeddings in RAG (60 minutes)
Retrieval metrics (60 minutes)

Learn how vector databases scale up search, and learn techniques that improve retrieval, such as chunking, query parsing, and reranking.
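As a rough illustration of chunking, the sketch below splits a document into fixed-size word windows that overlap, so that a sentence cut at one boundary still appears whole in a neighboring chunk. The function name, the word-based splitting, and the default sizes are assumptions made for this example. The module may use other strategies, and it lists advanced chunking techniques as a separate topic.

```python
def chunk_words(text, chunk_size=50, overlap=10):
    """Split text into chunks of up to chunk_size words, sharing overlap words."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in the range [0, chunk_size)")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current window has reached the end of the text,
        # so no trailing chunk made only of overlap words is emitted.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(f"w{i}" for i in range(10))  # placeholder text: w0 ... w9
    for chunk in chunk_words(sample, chunk_size=4, overlap=1):
        print(chunk)
```

Each chunk would then be embedded and indexed separately. Smaller chunks give more precise matches, while larger ones keep more surrounding context.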

What's included

9 videos, 1 reading, 1 assignment, 1 programming assignment, 2 ungraded labs

9 videos (total 47 minutes)

Module 3 introduction (1 minute)
Approximate nearest neighbors algorithms (ANN) (7 minutes)
Vector databases (5 minutes)
Chunking (6 minutes)
Advanced chunking techniques (5 minutes)
Query parsing (5 minutes)
Cross-encoders and ColBERT (8 minutes)
Reranking (4 minutes)
Module 3 conclusion (1 minute)

1 reading (total 5 minutes)

Module 3 Slides (5 minutes)

1 assignment (total 30 minutes)

Module 3 Quiz (30 minutes)

1 programming assignment (total 180 minutes)

Building RAG Systems with a Vector Database (180 minutes)

2 ungraded labs (total 120 minutes)

Introduction to the Weaviate API (60 minutes)
Chunking (60 minutes)

Learn how large language models work, along with techniques such as prompt engineering, hallucination detection, agentic system design, and fine-tuning that further improve their performance in a RAG system.

What's included

11 videos, 1 assignment, 1 programming assignment, 2 ungraded labs

11 videos (total 67 minutes)

Module 4 introduction (1 minute)
Transformer architecture (8 minutes)
LLM sampling strategies (8 minutes)
Choosing your LLM (8 minutes)
Prompt engineering: building your augmented prompt (5 minutes)
Prompt engineering: advanced techniques (8 minutes)
Handling hallucinations (7 minutes)
Evaluating your LLM's performance (5 minutes)
Agentic RAG (6 minutes)
RAG vs. Fine-Tuning (6 minutes)
Module 4 conclusion (1 minute)

1 assignment (total 30 minutes)

Module 4 Quiz (30 minutes)

1 programming assignment (total 180 minutes)

Developing a RAG-based Chatbot (180 minutes)

2 ungraded labs (total 120 minutes)

Exploring LLM capabilities (60 minutes)
Prompt engineering (60 minutes)

Learn how to monitor and evaluate a RAG system, both at the component level and end to end, and consider the tradeoffs in performance, cost, capability, and security that production RAG systems face.

What's included

11 videos, 1 reading, 1 assignment, 1 programming assignment, 1 ungraded lab

11 videos (total 53 minutes)

Module 5 introduction (1 minute)
What makes production challenging (3 minutes)
Implementing RAG evaluation strategies (7 minutes)
Logging, monitoring, and observability (4 minutes)
Customized evaluation (5 minutes)
Quantization (7 minutes)
Cost vs Response Quality (5 minutes)
Latency vs Response Quality (4 minutes)
Security (5 minutes)
Multimodal RAG (6 minutes)
Module 5 conclusion (1 minute)

1 reading (total 5 minutes)

(Optional) Opportunity to mentor other learners (5 minutes)

1 assignment (total 30 minutes)

Module 5 Quiz (30 minutes)

1 programming assignment (total 180 minutes)

Improving the ChatBot (180 minutes)

1 ungraded lab (total 60 minutes)

Tracing a RAG system (60 minutes)

Instructor

Zain Hasan

DeepLearning.AI

Offered by

DeepLearning.AI
