Phil Nash
Build RAG from Scratch
#1 about 3 minutes
Why large language models need retrieval augmented generation
Large language models have knowledge cutoffs and lack access to private data, a problem solved by providing relevant context at query time using RAG.
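As a rough illustration of "providing relevant context at query time", the sketch below assembles a prompt from a question and some already-retrieved text chunks. The function name `build_prompt`, the template wording, and the sample chunks are all assumptions made for illustration; the talk summary does not specify a prompt format, and the retrieval step itself is assumed to have happened elsewhere.

```python
def build_prompt(question, context_chunks):
    """Combine retrieved context and the user's question into one prompt string.

    Hypothetical helper: the template wording is an assumption, not taken from the talk.
    """
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


# Example chunks standing in for private or recent data the model was never trained on.
chunks = [
    "The office is closed on Fridays.",
    "Support tickets are answered within two working days.",
]
prompt = build_prompt("When is the office closed?", chunks)
print(prompt)
```

The resulting string would then be sent to a language model of your choice; that call is left out here because it depends on the provider.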
#2 about 1 minute
How similarity search and vector embeddings power RAG
RAG relies on similarity search, not keyword search, which captures meaning by converting text into numerical representations called vector embeddings.
#3 about 6 minutes
Building a simple bag-of-words vectorizer from scratch
A basic vector embedding can be created by tokenizing text, building a vocabulary of unique words, and representing each document as a vector of word counts.
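The three steps named above (tokenize, build a vocabulary, count words) can be sketched in a few lines of Python. This is a minimal sketch, not the code from the talk: the function names, the regex-based tokenizer, and the two sample sentences are assumptions chosen for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_vocabulary(documents):
    """Return a sorted list of the unique tokens across all documents."""
    return sorted({token for doc in documents for token in tokenize(doc)})


def vectorize(text, vocabulary):
    """Represent text as a list of word counts, one entry per vocabulary word."""
    counts = Counter(tokenize(text))
    return [counts[word] for word in vocabulary]


documents = ["the cat sat on the mat", "the dog sat on the log"]
vocabulary = build_vocabulary(documents)
vectors = [vectorize(doc, vocabulary) for doc in documents]

print(vocabulary)  # ['cat', 'dog', 'log', 'mat', 'on', 'sat', 'the']
print(vectors[0])  # [1, 0, 0, 1, 1, 1, 2]
print(vectors[1])  # [0, 1, 1, 0, 1, 1, 2]
```

Every document gets a vector of the same length (the vocabulary size), which is what makes the vectors comparable to one another.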
#4 about 8 minutes
Comparing document vectors using cosine similarity
Cosine similarity is the cosine of the angle between two vectors. It estimates how close two documents are in meaning by comparing the direction of their vectors rather than their magnitude.
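For two vectors a and b, cosine similarity is the dot product divided by the product of their lengths: (a · b) / (|a| |b|). A minimal Python sketch follows; the function name and the choice to return 0.0 for an all-zero vector are assumptions for illustration, not details from the talk.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Assumption: treat a vector with no direction as similar to nothing.
        return 0.0
    return dot / (norm_a * norm_b)


# Same direction, different magnitude: score is 1 (up to floating-point rounding).
same_direction = cosine_similarity([1, 2], [2, 4])

# Perpendicular vectors share nothing: score is 0.
perpendicular = cosine_similarity([1, 0], [0, 1])

print(same_direction, perpendicular)
```

Because the score ignores magnitude, a long document and a short one that use words in the same proportions score as near-identical.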
#5 about 3 minutes
Understanding the limitations of a bag-of-words model
The simple bag-of-words model is sensitive to vocabulary, slow to scale, and fails to capture meaning that depends on word order or synonyms.
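Two of these limitations are easy to reproduce. The sketch below uses a compact word-count representation and cosine similarity (both written here as assumptions for illustration, with made-up sample phrases) to show that synonyms score as unrelated while reordered sentences score as identical.

```python
import math
import re
from collections import Counter


def bow(text):
    """Return a word-count mapping for the text (a sparse bag-of-words vector)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two word-count mappings."""
    dot = sum(a[word] * b[word] for word in a)  # Counter yields 0 for missing words
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Synonyms: similar meaning, no shared words, so the score is zero.
synonyms = cosine(bow("big dog"), bow("large hound"))

# Word order: opposite meaning, same words, so the score is (approximately) one.
reordered = cosine(bow("dog bites man"), bow("man bites dog"))

print(synonyms, reordered)
```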
#6 about 4 minutes
Using professional embedding models and vector databases
Production RAG systems use sophisticated embedding models and specialized vector databases for efficient, accurate, and scalable similarity search.
#7 about 2 minutes
Exploring advanced RAG techniques and other applications
Beyond basic similarity search, techniques like ColBERT and knowledge graphs can improve retrieval accuracy, and vector search can power features like related content recommendations.