Demystifying Attention: Building Core Mechanisms of Transformers in PyTorch
A from-scratch PyTorch implementation of core transformer components, including self-attention, masked attention, and multi-head attention, following the seminal "Attention Is All You Need" paper.
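As a taste of what the implementation covers, here is a minimal sketch of scaled dot-product attention with an optional causal mask; the function and variable names are illustrative, not the repository's actual API.

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, seq_len, d_k); mask: (seq_len, seq_len) bool,
    # True where attention is blocked (hypothetical convention).
    d_k = q.size(-1)
    # Similarity scores, scaled by sqrt(d_k) to keep softmax gradients stable
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5
    if mask is not None:
        scores = scores.masked_fill(mask, float("-inf"))
    weights = F.softmax(scores, dim=-1)  # attention weights over the keys
    return weights @ v                   # weighted sum of the values

# Usage: causal (masked) self-attention over a toy sequence
x = torch.randn(1, 4, 8)  # (batch=1, seq_len=4, d_k=8)
causal_mask = torch.triu(torch.ones(4, 4, dtype=torch.bool), diagonal=1)
out = scaled_dot_product_attention(x, x, x, causal_mask)
print(out.shape)  # torch.Size([1, 4, 8])
```

Multi-head attention then runs several such attention operations in parallel over learned projections of the input and concatenates the results.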