Member-only story

RAG (Retrieval-augmented generation) on Wikipedia Page

2 min readOct 20, 2024

Reading Country wiki to Get Independence Day & Capital using RAG

Photo by Kostiantyn Li on Unsplash

In this tutorial I am going to use large language model to study wikipedia article on countries and extract their independence day and capital information using RAG which is retrival-augmented generation. RAG is a popular technique used for applying LLM on custom documents which you don’t want to share remotely with large language models.I am going to use Meta llama as LLM and mxbai as embedding. I ‘ll be using LangChain to put different parts of the chain together.

There is explanation to the tutorial on following youtube link:

https://www.youtube.com/watch?v=Fg6NWNkCLLA

Step 1

Load libraries

from langchain_community.document_loaders import WikipediaLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_postgres.vectorstores import PGVector
from langchain_huggingface.embeddings import HuggingFaceEmbeddings
from langchain_core.runnables import RunnablePassthrough
from langchain_ollama import ChatOllama
from langchain_core.output_parsers import StrOutputParser
from langchain import hub 
from IPython.display import Markdown
import langchain

Create an account to read the full story.

The author made this story available to Medium members only.
If you’re new to Medium, create a new account to read this story on us.

Continue in app

Or, continue in mobile web

Sign up with Google

Sign up with Facebook

Already have an account? Sign in

Written by Pankaj Kumar

MS Data Science SMU TX, MSc Financial Engg WQU. Interest in Algos, Discovering Trends in data. Worked 4 Hedge Funds n Inv banks https://vitvinyas.onrender.com/

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

Recommended from Medium

How to build chatbot with a local LLM in 5 minutes

Anthony Sun

How to build chatbot with a local LLM in 5 minutes

Generated by ChatGPT

Oct 31, 2024

You’re Doing RAG Wrong: How to Fix Retrieval-Augmented Generation for Local LLMs

In

Towards AI

by

DarkBones

You’re Doing RAG Wrong: How to Fix Retrieval-Augmented Generation for Local LLMs

How To Set Up RAG Locally, Avoid Common Issues, and Improve RAG Retrieval Accuracy.

4d ago

Lists

Coding & Development

11 stories1033 saves

Predictive Modeling w/ Python

20 stories1856 saves

Natural Language Processing

1977 stories1620 saves

Practical Guides to Machine Learning

10 stories2225 saves

Create a RAG Agent with LangGraph to Extract the information from a PDF File

Ferry Djaja

Create a RAG Agent with LangGraph to Extract the information from a PDF File

In this blog, we will build a simple agent to extract the information from a PDF file with LangGraph. We will be using GPT-4o to extract…

Sep 23, 2024

Retrieval-Augmented Generation (RAG) with ChromaDB and Ollama

Arun Patidar

How to Implement RAG with ChromaDB and Ollama: A Python Guide for Beginners

Overview of Retrieval-Augmented Generation (RAG)

Dec 10, 2024

Document Loaders in LangChain: A Component of RAG System

Mdabdullahalhasib

Document Loaders in LangChain: A Component of RAG System

Explore how to load different types of data and convert them into Documents to process and store in a Vector Database.

Oct 8, 2024

How to Build a RAG Application?

Dhruv Yadav

How to Build a RAG Application?

AI advancements are happening at a rapid pace, and one fascinating concept that’s gaining traction is the Retrieval-Augmented Generation…

Dec 2, 2024

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams