Blog

My thoughts on technology, business, and building products.

External Posts

External Post

The Hidden Attack Surfaces of Retrieval-Augmented Generation

Explore how Retrieval-Augmented Generation systems are vulnerable to Prompt Injection and Corpus Poisoning, and why deployed pipelines face persistent risks.

Retrieval-Augmented Generation
RAG Security
Prompt Injection
Corpus Poisoning
Adversarial NLP
LLM Security
Knowledge Base Vulnerabilities
AI Risk
Machine Learning Security
Enterprise AI

December 17, 2025

External Post

Exploring Information Retrieval from BoW to BM25

Explore traditional information retrieval models like BoW, TF-IDF, and BM25, their key strengths and limitations, and why they remain central to search systems today. We also touch on the rise of semantic search and how it complements these methods for more accurate results.

Bag of Words
BM-25
TF-IDF
Document Ranking
Information Retrieval
Query Relevance
Retrieval Models
Search Engines

February 8, 2025

External Post

Metrics for Evaluation of Retrieval in Retrieval-Augmented Generation (RAG) Systems

Learn about evaluation metrics for the retrieval component of a RAG system. Discover how metrics like Precision@k, Recall@k, F1@k, MRR, and MAP can optimize your system's performance. Ensure your RAG pipeline delivers high-quality context for better-generated responses.

NLP
Retrieval Augmented Generation
Evaluation
Precision@k
Recall@k
F1@k
MAP
MRR
DCG@k
NDCG@k

June 19, 2024

External Post

Building Baseline RAG Pipelines with OpenAI and LLAMA 3 8B from Scratch

Learn about the fundamentals of RAG pipelines and its key components. Design OpenAI and LLAMA 3-based pipelines with a shared ingestor and Chroma vector database to run experiments on advanced RAG methods and failure modes.

NLP
Retrieval Augmented Generation
OpenAI
LLAMA3
Chroma Database

June 8, 2024

External Post

Evaluating Machine Summarization with ROUGE

NLP
Model Evaluation
Seq2Seq
Summarization
ROUGE

June 1, 2024

External Post

Understanding the BLEU Score for Translation Model Evaluation

Discover how to evaluate machine translation models with the BLEU score. Understand its precision-based principles, calculations, and practical implementation for accurate sequence-to-sequence model assessment and implement it from scratch.

NLP
Model Evaluation
Seq2Seq
Machine Translation
BLEU

May 27, 2024

External Post

NLP Model Evaluation - Metrics, Benchmarks, and Beyond

Discover how models for language tasks such as text classification, generation, or machine translation can be evaluated. In-depth exploration of essential classification metrics like precision, recall, and F1 score, and introductions to further metrics and benchmarks.

NLP
Model Evaluation
Accuracy
Precision
Recall
F1 Score

May 25, 2024

External Post

Deconvolutions And What To Do About Artifacts

Deconvolutions, transposed convolutions, fractionally strided convolutions, and what to do about artifacts generated by them.

Convnets
Deconvolutions
Object Detection

December 29, 2021