Alex Soto & Markus Eisele

Aug 20, 2025 • World Congress 2025

RAG like a hero with Docling

Your RAG pipeline has security holes you haven't considered. Learn to defend against data poisoning and a new class of vector store attacks.

#1 about 3 minutes

Using RAG to enrich LLMs with proprietary data

Retrieval-augmented generation (RAG) is the key to making large language models useful for enterprises by providing them with up-to-date, proprietary information.
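As a rough illustration of the retrieval step, the sketch below ranks stored chunks by cosine similarity to a query vector and puts the best match into the prompt. It is a minimal sketch, not the code from the talk: the class `RagSketch`, the hand-written three-dimensional vectors, and the prompt wording are all assumptions made for this example, and a real system would obtain vectors from an embedding model.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical example class; the vectors are hand-written stand-ins for real embeddings.
class RagSketch {

    // Cosine similarity between two vectors of equal length.
    static double cosine(double[] a, double[] b) {
        double dot = 0, normA = 0, normB = 0;
        for (int i = 0; i < a.length; i++) {
            dot += a[i] * b[i];
            normA += a[i] * a[i];
            normB += b[i] * b[i];
        }
        return dot / (Math.sqrt(normA) * Math.sqrt(normB));
    }

    // Returns the k chunk texts whose vectors are most similar to the query vector.
    static List<String> topK(Map<String, double[]> store, double[] query, int k) {
        List<Map.Entry<String, double[]>> entries = new ArrayList<>(store.entrySet());
        entries.sort(Comparator.comparingDouble(
                (Map.Entry<String, double[]> e) -> -cosine(e.getValue(), query)));
        List<String> result = new ArrayList<>();
        for (int i = 0; i < Math.min(k, entries.size()); i++) {
            result.add(entries.get(i).getKey());
        }
        return result;
    }

    // Builds the augmented prompt: retrieved context first, then the user question.
    static String buildPrompt(List<String> context, String question) {
        return "Answer using only this context:\n- "
                + String.join("\n- ", context)
                + "\nQuestion: " + question;
    }

    public static void main(String[] args) {
        Map<String, double[]> store = new LinkedHashMap<>();
        store.put("Refunds are processed within 14 days.", new double[] {0.9, 0.1, 0.0});
        store.put("The office is closed on public holidays.", new double[] {0.1, 0.9, 0.0});
        store.put("Premium support is available 24/7.", new double[] {0.0, 0.2, 0.9});

        double[] queryVector = {0.8, 0.2, 0.1}; // pretend embedding of the question
        List<String> context = topK(store, queryVector, 1);
        System.out.println(buildPrompt(context, "How long do refunds take?"));
    }
}
```

The language model never sees the whole corpus, only the retrieved chunks, which is why the quality of what was ingested matters so much.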

#2 about 4 minutes

The challenge of parsing complex document structures

Simple document parsers can misinterpret layouts like multi-column text, leading to corrupted data and incorrect outputs from the language model.
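The multi-column problem can be reproduced with a toy page. In the sketch below, each row is one physical line and each cell is the text fragment that falls into a column on that line. The class `ColumnOrderSketch` and the sample sentences are invented for illustration; real parsers work on coordinates rather than a ready-made grid.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical example class showing why reading order matters on a two-column page.
class ColumnOrderSketch {

    // Naive extraction: read each physical line left to right, ignoring columns.
    static String naiveRead(String[][] page) {
        List<String> parts = new ArrayList<>();
        for (String[] row : page) {
            for (String cell : row) {
                parts.add(cell);
            }
        }
        return String.join(" ", parts);
    }

    // Layout-aware extraction: finish one column before starting the next.
    static String columnAwareRead(String[][] page) {
        List<String> parts = new ArrayList<>();
        int columns = page[0].length;
        for (int col = 0; col < columns; col++) {
            for (String[] row : page) {
                parts.add(row[col]);
            }
        }
        return String.join(" ", parts);
    }

    public static void main(String[] args) {
        String[][] page = {
            {"Refunds are issued", "Support is open"},
            {"within 14 days.", "on weekdays only."}
        };
        System.out.println(naiveRead(page));
        // Refunds are issued Support is open within 14 days. on weekdays only.
        System.out.println(columnAwareRead(page));
        // Refunds are issued within 14 days. Support is open on weekdays only.
    }
}
```

The naive output splices two unrelated sentences together, and that spliced text is what would be embedded and later handed to the model as context.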

#3 about 3 minutes

Using Docling to convert documents into structured formats

Docling is an open-source tool that acts like an advanced OCR service, converting various binary document formats into a structured, parsable tree.
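To show why a tree is easier to work with than flat text, the sketch below walks a small document tree and emits one chunk per paragraph, prefixed with the headings above it. The `Node` type and its labels are hypothetical and are not Docling's actual document model; they only stand in for the general idea of a structured, parsable result.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical example class; Node is an invented stand-in for a structured document tree.
class DocTreeSketch {

    // A node has a label such as "document", "section" or "paragraph", some text, and children.
    static final class Node {
        final String label;
        final String text;
        final List<Node> children = new ArrayList<>();

        Node(String label, String text) {
            this.label = label;
            this.text = text;
        }

        Node add(Node child) {
            children.add(child);
            return this;
        }
    }

    // Walks the tree depth-first and emits one chunk per paragraph,
    // prefixed with the section headings that lead to it.
    static void collect(Node node, String path, List<String> out) {
        String here = path;
        if (node.label.equals("section")) {
            here = path.isEmpty() ? node.text : path + " > " + node.text;
        } else if (node.label.equals("paragraph")) {
            out.add("[" + path + "] " + node.text);
        }
        for (Node child : node.children) {
            collect(child, here, out);
        }
    }

    static List<String> chunks(Node root) {
        List<String> out = new ArrayList<>();
        collect(root, "", out);
        return out;
    }

    public static void main(String[] args) {
        Node root = new Node("document", "")
            .add(new Node("section", "Pricing")
                .add(new Node("paragraph", "Plans are billed monthly."))
                .add(new Node("section", "Discounts")
                    .add(new Node("paragraph", "Annual billing is cheaper."))));
        chunks(root).forEach(System.out::println);
        // [Pricing] Plans are billed monthly.
        // [Pricing > Discounts] Annual billing is cheaper.
    }
}
```

Keeping the heading path with each chunk is one possible design choice: it gives the retriever and the model some context that a flat text dump would have lost.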

#4 about 7 minutes

Demo of a basic RAG ingestion pipeline

A live demonstration shows how a Quarkus application uses Docling to ingest a PDF, generate embeddings, and store the resulting chunks and vectors in Redis.
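The shape of such an ingestion pipeline (split, embed, store) can be sketched without any framework. The code below is not the demo's Quarkus code: `IngestionSketch`, the word-count "embedding", the `docId:index` key format, and the in-memory map standing in for Redis are all assumptions chosen to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical example class; a map stands in for Redis and a hash trick for the embedding model.
class IngestionSketch {

    static final int DIMENSIONS = 8;

    // One stored record: the chunk text plus its vector.
    static final class StoredChunk {
        final String text;
        final float[] vector;

        StoredChunk(String text, float[] vector) {
            this.text = text;
            this.vector = vector;
        }
    }

    // Splits text into chunks of at most maxWords words.
    static List<String> chunk(String text, int maxWords) {
        List<String> chunks = new ArrayList<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return chunks;
        }
        String[] words = trimmed.split("\\s+");
        for (int start = 0; start < words.length; start += maxWords) {
            int end = Math.min(start + maxWords, words.length);
            chunks.add(String.join(" ", Arrays.copyOfRange(words, start, end)));
        }
        return chunks;
    }

    // Toy embedding: counts words per hash bucket. A real pipeline would call an embedding model.
    static float[] embed(String chunk) {
        float[] vector = new float[DIMENSIONS];
        for (String word : chunk.toLowerCase().split("\\s+")) {
            vector[Math.floorMod(word.hashCode(), DIMENSIONS)] += 1f;
        }
        return vector;
    }

    // Stores each chunk with its vector under a key such as "manual:0".
    static Map<String, StoredChunk> ingest(String docId, String text, int maxWords) {
        Map<String, StoredChunk> store = new LinkedHashMap<>();
        List<String> chunks = chunk(text, maxWords);
        for (int i = 0; i < chunks.size(); i++) {
            store.put(docId + ":" + i, new StoredChunk(chunks.get(i), embed(chunks.get(i))));
        }
        return store;
    }

    public static void main(String[] args) {
        Map<String, StoredChunk> store =
                ingest("manual", "Refunds are issued within 14 days of purchase", 4);
        store.forEach((key, value) ->
                System.out.println(key + " -> " + value.text + " " + Arrays.toString(value.vector)));
    }
}
```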

#5 about 3 minutes

Securing RAG against data poisoning and leaks

To prevent data poisoning and sensitive data leaks, it is crucial to sanitize documents, verify their signatures, and use tools for PII masking.
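Two of these checks can be sketched with the Java standard library alone: rejecting a document whose signature does not verify, and masking an obvious PII pattern before the text is embedded. This is a minimal sketch under stated assumptions, not the tooling used in the talk: the class `IngestionGuardSketch`, the choice of ECDSA, and the single email regex are illustrative, and real PII masking needs far broader coverage than one pattern.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.security.KeyPair;
import java.security.KeyPairGenerator;
import java.security.PrivateKey;
import java.security.PublicKey;
import java.security.Signature;
import java.util.regex.Pattern;

// Hypothetical example class combining a signature check with a simple PII mask.
class IngestionGuardSketch {

    // Deliberately simple email pattern; an assumption for this example only.
    static final Pattern EMAIL =
            Pattern.compile("[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\\.[A-Za-z]{2,}");

    // Generates a throwaway EC key pair; in practice the publisher's key would be provisioned.
    static KeyPair newKeyPair() {
        try {
            KeyPairGenerator generator = KeyPairGenerator.getInstance("EC");
            generator.initialize(256);
            return generator.generateKeyPair();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException(e);
        }
    }

    // Signs the document bytes (done by the document's publisher).
    static byte[] sign(byte[] document, PrivateKey key) {
        try {
            Signature signer = Signature.getInstance("SHA256withECDSA");
            signer.initSign(key);
            signer.update(document);
            return signer.sign();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException(e);
        }
    }

    // Returns true only if the signature matches these exact bytes and this public key.
    static boolean isAuthentic(byte[] document, byte[] signature, PublicKey key) {
        try {
            Signature verifier = Signature.getInstance("SHA256withECDSA");
            verifier.initVerify(key);
            verifier.update(document);
            return verifier.verify(signature);
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException(e);
        }
    }

    // Replaces every email address with a placeholder before the text is embedded.
    static String maskEmails(String text) {
        return EMAIL.matcher(text).replaceAll("[EMAIL]");
    }

    public static void main(String[] args) {
        KeyPair publisher = newKeyPair();
        byte[] original = "Contact jane.doe@example.com for refunds.".getBytes(StandardCharsets.UTF_8);
        byte[] signature = sign(original, publisher.getPrivate());
        byte[] tampered = "Contact attacker@example.org for refunds.".getBytes(StandardCharsets.UTF_8);

        System.out.println(isAuthentic(original, signature, publisher.getPublic())); // true
        System.out.println(isAuthentic(tampered, signature, publisher.getPublic())); // false
        System.out.println(maskEmails(new String(original, StandardCharsets.UTF_8)));
        // Contact [EMAIL] for refunds.
    }
}
```

Running the signature check before parsing means a poisoned document is dropped before any of its text can reach the vector store.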

#6 about 4 minutes

Mitigating vector store attacks and encryption challenges

Vector stores are vulnerable to attacks such as close-vector modification and vector reversal. Standard encryption does not preserve the distance between vectors, so similarity search stops working on ciphertext and specialized solutions are required.
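The distance problem is easier to see next to a transform that does keep distances intact. The sketch below applies a secret permutation and secret sign flips to each vector. This is an orthogonal transform, so Euclidean distances between protected vectors equal the distances between the originals, whereas the output bytes of a conventional cipher such as AES have no such geometric relationship. The class `DistancePreservingSketch` is a toy written for this example: it is not the scheme shown in the talk, it is not secure on its own (it leaks vector norms, for instance), and `java.util.Random` with a seed is used only to make the demo reproducible.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Hypothetical toy: a secret permutation plus sign flips keeps distances but hides positions.
class DistancePreservingSketch {

    final int[] permutation; // secret shuffle of vector positions
    final int[] signs;       // secret +1 or -1 per position

    DistancePreservingSketch(int dimensions, long secretSeed) {
        // A real system would derive the key from a secure source, not a seeded Random.
        Random random = new Random(secretSeed);
        List<Integer> order = new ArrayList<>();
        for (int i = 0; i < dimensions; i++) {
            order.add(i);
        }
        Collections.shuffle(order, random);
        permutation = new int[dimensions];
        signs = new int[dimensions];
        for (int i = 0; i < dimensions; i++) {
            permutation[i] = order.get(i);
            signs[i] = random.nextBoolean() ? 1 : -1;
        }
    }

    // Applies the secret transform to one vector.
    double[] protect(double[] vector) {
        double[] out = new double[vector.length];
        for (int i = 0; i < vector.length; i++) {
            out[i] = signs[i] * vector[permutation[i]];
        }
        return out;
    }

    // Euclidean distance between two vectors of equal length.
    static double distance(double[] a, double[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            sum += diff * diff;
        }
        return Math.sqrt(sum);
    }

    public static void main(String[] args) {
        DistancePreservingSketch key = new DistancePreservingSketch(4, 42L);
        double[] a = {1.0, 2.0, 3.0, 4.0};
        double[] b = {0.5, 2.5, 2.0, 5.0};
        System.out.println("plain distance:     " + distance(a, b));
        System.out.println("protected distance: " + distance(key.protect(a), key.protect(b)));
    }
}
```

Both printed distances agree up to floating-point rounding, so nearest-neighbour search still works on the protected vectors as long as the query is transformed with the same key.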

#7 about 5 minutes

Demo of a secure ingestion pipeline in action

A final demonstration showcases a secure pipeline that verifies document signatures, anonymizes sensitive data, and encrypts vectors before storing them.
