Mixtral 8x7B — A Deep Dive
A detailed comparison of Mixtral 8x7B with LLaMA 2, and an implementation of an optimized Mixture of Experts (MoE) layer in PyTorch.
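The MoE layer is the core of what distinguishes Mixtral from LLaMA 2: each token is routed to a small subset of expert feed-forward networks rather than through one dense block. As a taste of the idea before opening the post, here is a minimal, unoptimized sketch of a sparse MoE layer with top-2 routing in PyTorch; the class name `SimpleMoELayer`, the hidden sizes, and the per-expert Python loop are illustrative assumptions, not the optimized implementation described in the post.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SimpleMoELayer(nn.Module):
    """Minimal sparse MoE layer: a router sends each token to its top-2 experts.

    Illustrative sketch only; sizes and structure are assumptions, not the
    optimized implementation from the post.
    """

    def __init__(self, d_model: int = 512, d_ff: int = 2048,
                 num_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts, bias=False)
        # Each expert is a small feed-forward network (a plain MLP here for brevity).
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        ])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model) -> flatten tokens for per-token routing.
        batch, seq_len, d_model = x.shape
        tokens = x.reshape(-1, d_model)

        # Score every token against every expert and keep only the top-k experts.
        logits = self.router(tokens)                      # (num_tokens, num_experts)
        weights, indices = logits.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)              # renormalize over chosen experts

        out = torch.zeros_like(tokens)
        for expert_id, expert in enumerate(self.experts):
            # Which tokens routed to this expert, and at which of their k slots.
            token_idx, slot_idx = (indices == expert_id).nonzero(as_tuple=True)
            if token_idx.numel() == 0:
                continue
            out[token_idx] += weights[token_idx, slot_idx, None] * expert(tokens[token_idx])

        return out.reshape(batch, seq_len, d_model)
```

A call like `SimpleMoELayer()(torch.randn(2, 16, 512))` returns a tensor of the same shape. The Python loop over experts keeps the sketch readable; it is exactly the kind of thing an optimized implementation would replace with batched gather/scatter operations.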
More deep dives into machine learning concepts with practical code implementations, all available to read on Substack:
- Step-by-step guide to building the LLaMA model from scratch in PyTorch, with in-depth explanations of each essential component.
- Understanding the evolution from Multi-Head Attention to modern inference optimizations.
- A comprehensive exploration of RoPE, with theoretical derivations from first principles and a PyTorch implementation (a minimal sketch of the rotation follows this list).
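As referenced in the last item above, here is a minimal sketch of the rotary embedding itself in PyTorch; the function name `rotary_embed`, the assumed tensor layout `(batch, seq_len, num_heads, head_dim)`, and the conventional base of 10000 are illustrative choices, not taken from the post, which derives the formulation from first principles.

```python
import torch


def rotary_embed(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Apply Rotary Position Embeddings (RoPE) to a tensor of shape
    (batch, seq_len, num_heads, head_dim). Illustrative sketch only."""
    batch, seq_len, num_heads, head_dim = x.shape
    assert head_dim % 2 == 0, "RoPE rotates pairs of dimensions, so head_dim must be even"

    # Per-pair rotation frequencies: theta_i = base^(-2i / head_dim).
    inv_freq = base ** (-torch.arange(0, head_dim, 2, dtype=torch.float32) / head_dim)
    positions = torch.arange(seq_len, dtype=torch.float32)
    angles = torch.outer(positions, inv_freq)              # (seq_len, head_dim / 2)
    cos = angles.cos()[None, :, None, :]                   # broadcast over batch and heads
    sin = angles.sin()[None, :, None, :]

    # Split channels into even/odd pairs and rotate each pair by its position-dependent angle.
    x1, x2 = x[..., 0::2], x[..., 1::2]
    rotated = torch.empty_like(x)
    rotated[..., 0::2] = x1 * cos - x2 * sin
    rotated[..., 1::2] = x1 * sin + x2 * cos
    return rotated
```

In a transformer this would typically be applied to the query and key tensors just before the attention scores are computed, e.g. `q, k = rotary_embed(q), rotary_embed(k)`.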