Systems

Each system is engineered with a focus on:

retrieval precision
architectural clarity
real-world failure modes
production constraints

Classical Information Retrieval

Systems in this category focus on lexical retrieval, indexing theory, and ranking mechanics — the foundations behind modern search engines.

🔍 DevShelf

Search Engine from First Principles

A distributed vertical search engine for Computer Science literature, built without Lucene or ElasticSearch.

What it demonstrates:

Positional inverted indices
TF-IDF–based ranking
Offline indexing vs online query execution
Deterministic, explainable retrieval

View DevShelf →

Retrieval-Augmented Generation (RAG)

These systems extend retrieval pipelines with embeddings, reranking, and large language models, while maintaining strict control over precision and data flow.

🧠 MQNotebook

Enterprise-Grade RAG System

A local-first RAG engine designed to ingest and retrieve information from messy, real-world enterprise documents.

What it demonstrates:

OCR-first ingestion for scanned PDFs
Structured parsing of spreadsheets and slide decks
Cross-encoder reranking for precision
Secure, BYOK deployment model

View MQNotebook →

How These Systems Connect

DevShelf establishes a strong foundation in classical information retrieval.

MQNotebook builds on those principles, addressing the limitations of lexical search by introducing semantic retrieval and LLM-based reasoning, while preserving control over relevance and hallucinations.

Together, they represent a complete spectrum of retrieval system design — from first principles to modern AI infrastructure.

DevShelf — Search Engine from First Principles (Java)

A first-principles search engine demonstrating offline indexing, positional inverted indices, and hybrid ranking.

BabyGPT: Generative Text Engine

A character-level Generative AI model built from scratch using Dual-Stack LSTMs and custom temperature sampling.

MQNotebook — Enterprise-Grade RAG System

A local-first RAG system engineered to handle scanned PDFs, complex spreadsheets, and slide decks with precision retrieval and reranking.

Classical Information Retrieval#

🔍 DevShelf#

Retrieval-Augmented Generation (RAG)#

🧠 MQNotebook#

How These Systems Connect#

Classical Information Retrieval

🔍 DevShelf

Retrieval-Augmented Generation (RAG)

🧠 MQNotebook

How These Systems Connect