All Projects

Open-source projects spanning LLMs, ML, and data engineering

Featured
LLM

AI Research Agent

Autonomous research agent that decomposes queries, runs multi-turn tool calls against web search and arXiv, and checks its answers for completeness through self-reflection. Includes a real-time Streamlit UI.

AI Agents, RAG
Python, Google Gemini, Streamlit, Tavily API
View on GitHub
Featured
Vector DB

Weaviate MCP Inspector

Natural language interface to Weaviate vector databases, exposed to Claude through the Model Context Protocol (MCP). Provides 9 inspection tools for exploring a database through conversation.

MCP, Tools
Python, Weaviate, FastMCP, Claude
View on GitHub
Featured
Computer Vision

Handwritten Text Recognizer

Deep learning OCR system with a ResNet encoder and a Transformer decoder (14M parameters). Data augmentation reduces error by 70%. Deployed as a FastAPI microservice on GCP with monitoring.

Deep Learning, MLOps
PyTorch, FastAPI, Docker, GCP
View on GitHub
ML

Movie Recommendation System

Netflix-inspired collaborative filtering system using matrix factorization (SVD, SVD++) on 480K+ users and 17K+ movies. Also implements KNN-based recommendations, with extensive feature engineering.

Recommender Systems, Collaborative Filtering
Python, Scikit-learn, Surprise, NetworkX
View on GitHub
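
For readers unfamiliar with the matrix factorization mentioned above, here is a minimal sketch of the idea in plain Python. It is not the project's code and not Surprise's SVD implementation: it drops the user and item bias terms, and the function names, hyperparameters, and toy ratings are all hypothetical.

```python
import math
import random

def train_mf(ratings, n_factors=2, n_epochs=500, lr=0.01, reg=0.02, seed=0):
    """Fit rating ~ mu + p_u . q_i with plain SGD (no bias terms; illustrative only)."""
    rng = random.Random(seed)
    mu = sum(r for _, _, r in ratings) / len(ratings)  # global mean rating
    P, Q = {}, {}  # latent factor vectors per user and per item
    for u, i, _ in ratings:
        if u not in P:
            P[u] = [rng.gauss(0, 0.1) for _ in range(n_factors)]
        if i not in Q:
            Q[i] = [rng.gauss(0, 0.1) for _ in range(n_factors)]
    for _ in range(n_epochs):
        for u, i, r in ratings:
            err = r - (mu + sum(pf * qf for pf, qf in zip(P[u], Q[i])))
            for f in range(n_factors):
                pu, qi = P[u][f], Q[i][f]
                # Gradient step on squared error with L2 regularization
                P[u][f] += lr * (err * qi - reg * pu)
                Q[i][f] += lr * (err * pu - reg * qi)
    return mu, P, Q

def predict(model, user, item):
    """Predict a rating; fall back to the global mean for unseen users or items."""
    mu, P, Q = model
    if user not in P or item not in Q:
        return mu
    return mu + sum(pf * qf for pf, qf in zip(P[user], Q[item]))

def rmse(model, ratings):
    """Root mean squared error of the model on (user, item, rating) triples."""
    total = sum((r - predict(model, u, i)) ** 2 for u, i, r in ratings)
    return math.sqrt(total / len(ratings))

# Hypothetical toy data: (user, movie, rating)
toy_ratings = [
    ("u1", "m1", 5), ("u1", "m2", 3),
    ("u2", "m1", 4), ("u2", "m3", 1),
    ("u3", "m2", 2), ("u3", "m3", 1),
]
model = train_mf(toy_ratings)
```

On a dataset of the size quoted above, a library implementation such as the one in Surprise would be the practical choice; the sketch only shows the shape of the update rule.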
Graph ML

Social Network Link Prediction

Graph ML solution for predicting missing links in a Facebook social network (1.86M nodes, 9.4M edges). Engineers 30+ graph features and reaches a 0.92 F1 score with a Random Forest classifier.

Social Networks, Feature Engineering
Python, NetworkX, Scikit-learn
View on GitHub
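
As an illustration of what graph features for link prediction can look like, the sketch below computes four common neighborhood-based scores for a candidate node pair. The listing does not say which 30+ features the project uses, so this selection, the function names, and the toy graph are assumptions made for the example.

```python
import math

def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from edge pairs."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj

def pair_features(adj, u, v):
    """Compute neighborhood-overlap features for the candidate edge (u, v)."""
    nu = adj.get(u, set())
    nv = adj.get(v, set())
    common = nu & nv
    union = nu | nv
    return {
        "common_neighbors": len(common),
        # Jaccard: shared neighbors as a fraction of all neighbors
        "jaccard": len(common) / len(union) if union else 0.0,
        # Adamic-Adar: shared neighbors weighted by inverse log degree
        "adamic_adar": sum(
            1.0 / math.log(len(adj[w])) for w in common if len(adj[w]) > 1
        ),
        # Preferential attachment: product of the two degrees
        "preferential_attachment": len(nu) * len(nv),
    }

# Hypothetical toy graph
toy_edges = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d")]
adj = build_adjacency(toy_edges)
features = pair_features(adj, "a", "d")
```

Feature dictionaries like this one, computed for known edges and for sampled non-edges, could then be stacked into a table and passed to a classifier such as a Random Forest.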
NLP

Tweet Sentiment Extractor

Transformer-based NLP model that extracts sentiment-bearing phrases from tweets. Frames the task as question answering and fine-tunes accordingly; built for a Kaggle competition and accompanied by a detailed error analysis.

Transformers, Sentiment Analysis
Python, Transformers, TensorFlow
View on GitHub
Data Engineering

ETL Pipeline

End-to-end data pipeline that takes NYC taxi data from extraction through to loading into BigQuery. Runs Apache Spark on Dataproc, with Prefect for orchestration and Terraform for infrastructure as code.

ETL, Cloud
Python, Apache Spark, GCP, Prefect, Terraform
View on GitHub