Projects & Publications

AI engineering, data pipelines, predictive modeling, and peer-reviewed research

10+ Projects 4 Publications IEEE 路 Springer 路 AIP

AI & ML Engineering

AI Engineering

"Owlie" GPT Chatbot

Production RAG pipeline powering a university-facing chatbot. Engineered with Sentence Transformers, LLAMA3-70B (Groq API), and FAISS vector indexing over 5,000+ JSOM webpages via custom web scraping. Improved response accuracy by ~85% and cut latency by ~95% to under 3 seconds per query. Deployed and validated with UT Dallas JSOM leadership.

RAG LLAMA3-70B Groq API FAISS Sentence Transformers Python
ML Engineering

Credit Risk Evaluation Model

End-to-end ML pipeline that reduced loan default rates from 25.8% to 3%, an 88.4% improvement. Built with XGBoost and Neural Networks with SQL-validated data pipelines for feature engineering and model deployment. Comprehensive evaluation across precision, recall, and AUC metrics.

Python XGBoost Neural Networks SQL Feature Engineering 88.4% Improvement
AI Engineering

Renewable Energy Investment Dashboard

U.S. renewable energy investment analysis platform with real-time EIA & FRED data feeds, IRR/NPV/LCOE financial modelling, and an integrated Claude AI research assistant for natural language queries. Interactive charts, state-level breakdowns, and investment scenario comparisons.

Claude AI Python EIA API FRED API Financial Modelling Vercel
AI Engineering

Travel AI, LLM Itinerary Planner

AI-powered travel planning assistant that generates personalized itineraries using LLM inference. Clean web frontend built with React, backed by a FastAPI service handling itinerary generation, routing logic, and user preference management. Modular backend designed for scalability.

LLM FastAPI Python REST API Itinerary Generation
ML Engineering

Disease Prediction via Ensemble Learning

Ensemble model combining SVC, Naive Bayes, and Decision Trees to predict multiple diseases, raising accuracy from 95% to 99%. Reduced data processing time 42% via SQL optimization. Research published in IJRTE demonstrating a 25% increase in predictive accuracy for multi-disease scenarios.

Ensemble Learning SVC Naive Bayes Decision Trees SQL 99% Accuracy
NLP Research

ASR for Indian Regional Languages (Wav2Vec2)

Automatic Speech Recognition system for low-resource Tamil language using fine-tuned Wav2Vec2 with a custom linear layer and tokenizer. Achieved 61.3% WER on Mozilla Common Voice dataset, outperforming the prior state-of-the-art 69.76% WER. Published in Springer's Advances in Data Science and Computing Technologies.

Wav2Vec2 Fine-tuning NLP PyTorch Low-Resource ASR Springer Publication
NLP Research

ASR with NVIDIA NeMo Framework

Explored NVIDIA's NeMo framework for Automatic Speech Recognition on Indian regional languages. Investigated pre-trained model effectiveness and fine-tuning strategies for low-resource language processing. Research published in AIP Conference Proceedings.

NVIDIA NeMo Speech Recognition Fine-tuning Low-Resource NLP AIP Publication
ML Research

Healthcare Analytics & ML Applications

Comprehensive study on ML applications in healthcare, predictive modeling, patient outcome prediction, and medical data analysis using ensemble methods and modern AI/ML techniques. Published in IEEE International Conference proceedings.

Machine Learning Healthcare AI Predictive Modeling Ensemble Methods IEEE Publication

Data Analytics & Business Intelligence

Data Analytics

Olist E-Commerce Churn Analysis

Customer churn analysis on the Olist Brazilian e-commerce dataset. Applied RFM segmentation and cohort analysis to identify at-risk customer segments, then built predictive models to classify churn likelihood. Delivered actionable retention insights from real transaction-level data.

Python RFM Segmentation Cohort Analysis Predictive Modelling Pandas Scikit-learn
AI Tool

JobTrack, Chrome Extension

Chrome extension that lets you track job applications directly from any job board, save listings, update application status, and add notes without leaving the page. Streamlines the job hunt workflow with a clean, minimal UI and persistent local storage.

Chrome Extension JavaScript HTML/CSS Local Storage Productivity

Research Publications

Springer 路 2023

ASR for Indian Regional Languages Using Fine-Tuned Wav2Vec2 Model

Advances in Data Science and Computing Technologies (ADSC 2022), Springer Nature Singapore

Developed an ASR system for low-resource Tamil using fine-tuned Wav2Vec2 with a custom tokenizer. Achieved 61.3% WER on Mozilla Common Voice, outperforming prior 69.76% WER benchmark.

Wav2Vec2 NLP Speech Recognition PyTorch Low-Resource Languages
IEEE 路 2023

Predictive Analysis of Multiple Diseases Using Ensemble Learning

IEEE International Conference on Intelligent Systems and Emerging Technologies

Ensemble of SVC, Naive Bayes, and Decision Trees raised disease prediction accuracy from 95% to 99%. SQL optimization cut data processing time 42%. Demonstrated 25% improvement in multi-disease prediction accuracy.

Ensemble Learning SVC Naive Bayes Healthcare AI 99% Accuracy
AIP 路 2023

ASR for Indian Regional Language Using NVIDIA's NeMo Framework

AIP Conference Proceedings

Investigated NVIDIA NeMo's pre-trained models and fine-tuning strategies for ASR on Indian regional languages, advancing low-resource speech processing research.

NVIDIA NeMo Speech Recognition Fine-tuning Low-Resource NLP
IEEE 路 2023

Advanced Machine Learning Applications in Healthcare

IEEE International Conference on Emerging Technologies

Comprehensive study of ML applications in healthcare analytics, predictive modeling, patient outcome prediction, and medical data analysis using modern AI/ML and ensemble methods.

Healthcare AI Predictive Modeling Ensemble Methods Medical Data Analysis
View Full ResearchGate Profile

Interested in collaborating?

I'm open to AI engineering, data engineering, and data science opportunities. Let's build something impactful together.