A Retrieval-Augmented Generation (RAG) pipeline designed to answer queries using course notes and academic materials.
This project integrates dense retrieval (FAISS) with sparse retrieval (BM25) for hybrid document search, optimized to run locally with Ollama + LLaMA 3.2 embeddings.
- Hybrid retrieval: Combines FAISS (dense embeddings) with BM25 (sparse keyword search); see the sketch after this list.
- Latency optimized: Serialized caching cuts query response time from 10 minutes → 30 seconds (a 95% reduction).
- High factual grounding: Achieves 80%+ grounding accuracy by injecting the top-ranked retrieved chunks into dynamic prompts.
- Local-first: Runs entirely on your machine with Ollama and Python (no external APIs required).
- Utilities included:
  - `benchmark_retriever.py` → evaluate retrieval performance.
  - `visualize_chunks.py` → inspect document splits and chunk relevance.
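
The hybrid retriever can be assembled with LangChain's `EnsembleRetriever`. The sketch below is illustrative rather than the repo's actual code: `INDEX_DIR` and `build_hybrid_retriever` are made-up names, import paths vary across LangChain versions, and `docs` is assumed to be the list of `Document` chunks produced by `data_prep.py`.

```python
# Illustrative sketch: hybrid FAISS + BM25 retrieval with a serialized FAISS
# cache. Assumes `docs` is a list of LangChain Document chunks and that the
# rank_bm25 package is installed for BM25Retriever.
from pathlib import Path

from langchain.retrievers import EnsembleRetriever
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.retrievers import BM25Retriever
from langchain_community.vectorstores import FAISS

INDEX_DIR = "faiss_index"  # hypothetical cache location

def build_hybrid_retriever(docs):
    embeddings = OllamaEmbeddings(model="llama3.2")
    if Path(INDEX_DIR).exists():
        # Reload the serialized index instead of re-embedding every chunk;
        # this caching step is what cuts startup time from minutes to seconds.
        store = FAISS.load_local(
            INDEX_DIR, embeddings, allow_dangerous_deserialization=True
        )
    else:
        store = FAISS.from_documents(docs, embeddings)
        store.save_local(INDEX_DIR)

    dense = store.as_retriever(search_kwargs={"k": 5})
    sparse = BM25Retriever.from_documents(docs)
    sparse.k = 5
    # Blend sparse and dense rankings; equal weights are a tunable assumption.
    return EnsembleRetriever(retrievers=[sparse, dense], weights=[0.5, 0.5])
```

Calling `build_hybrid_retriever(docs).invoke("breadth-first search")` would then return the blended top chunks from both retrievers.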
- Python 3.10+
- LLaMA 3.2 via Ollama
- LangChain
- FAISS + BM25
- PyTorch / TensorFlow (for embeddings)
```
LLM-RAG/
├── main.py                  # Entry point for the RAG pipeline
├── rag.py                   # Core RAG logic (retrieval + generation)
├── data_prep.py             # Preprocess and split documents
├── benchmark_retriever.py   # Evaluate retriever performance
├── visualize_chunks.py      # Visualize text chunks for debugging
├── requirements.txt         # Python dependencies
└── data/                    # Source course notes (PDF + processed markdown)
```
Clone the repository:

```bash
git clone https://github.com/yourusername/LLM-RAG.git
cd LLM-RAG
```
Install dependencies:

```bash
pip install -r requirements.txt
```
Install Ollama and pull the LLaMA 3.2 model:

```bash
ollama pull llama3.2
```
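
Optionally, confirm the model is reachable from Python before running the pipeline. This check is not part of the repo; it assumes the Ollama server is running locally:

```python
# Hypothetical sanity check (not part of this repo): verify that Ollama is
# serving llama3.2 before running the pipeline.
from langchain_community.llms import Ollama

llm = Ollama(model="llama3.2")
print(llm.invoke("Reply with the single word: ready"))
```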
Run the pipeline with:

```bash
python main.py
```

Enter a query (e.g., "Explain A* search").
The system retrieves relevant course notes, injects them into a dynamic prompt, and generates an answer using LLaMA 3.2.
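
Conceptually, that query loop looks like the sketch below. It is a simplified stand-in for `rag.py`, not the actual implementation: the prompt wording is illustrative, and `retriever` is assumed to be the hybrid retriever from the earlier sketch.

```python
# Simplified sketch of the retrieve -> inject -> generate loop; the real
# logic lives in rag.py.
from langchain_community.chat_models import ChatOllama

def answer(query: str, retriever) -> str:
    # Pull the top-ranked chunks for this query.
    docs = retriever.invoke(query)
    context = "\n\n".join(doc.page_content for doc in docs)
    # Inject the retrieved notes into a dynamic prompt so the answer stays
    # grounded in the course material.
    prompt = (
        "Answer the question using only the course notes below.\n\n"
        f"Notes:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )
    llm = ChatOllama(model="llama3.2")
    return llm.invoke(prompt).content
```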