$ whoami

Manjunath Popuri AI/ML Engineer & Data Scientist

LLMs & GenAINLPPython AWS & AzureTensorFlow Power BISQLBERT

MS CS (AI Specialization) @ Binghamton University. Building AI-powered systems that bridge research and real-world impact - from NLP pipelines to LLM deployments.

3+
Years in AI/ML
10+
AI Projects
5+
Technologies
scroll to explore
LLMs โ—†
NLP Pipelines โ—†
BERT / Transformers โ—†
AWS & Azure โ—†
TensorFlow โ—†
Python โ—†
Power BI โ—†
GenAI โ—†
Mistral-7B โ—†
CI/CD Pipelines โ—†
LLMs โ—†
NLP Pipelines โ—†
BERT / Transformers โ—†
AWS & Azure โ—†
TensorFlow โ—†
Python โ—†
Power BI โ—†
GenAI โ—†
Manjunath Popuri
Manjunath Popuri
Data Scientist ยท AI/ML Engineer
New Hyde Park, New York
Machine Learning / DL95%
NLP & Transformers92%
Cloud (AWS / Azure)91%
Data Analytics & BI97%
Python & SQL99%
const manjunath = {
university: "Binghamton University",
degree: "MS Computer Science (AI)",
focus: "LLMs, GenAI, NLP, ML",
prevExp: "Cognizant - Machine Learning Engineer",
openTo: "Full-time Data Scientist & AI/ML Roles"
};

Building the Future
with AI

I'm an AI/ML Engineer and Data Scientist with hands-on experience across the full model lifecycle - from feature engineering and deep learning to LLM fine-tuning, RAG pipelines, and cloud deployment on AWS, Azure, and GCP. Currently advancing my MS in Computer Science (AI Specialization) at Binghamton University while building production-grade AI systems in both financial intelligence and clinical healthcare.

Previously at Cognizant, I engineered credit risk models for 10M+ customers and automated ML pipelines that cut processing time by 20%. Today I build things like HIPAA-compliant GenAI summarization engines, fraud detection platforms with Responsible AI layers, and reinforcement learning trading agents - work that sits at the intersection of rigorous data science and real-world deployment at scale.

I'm particularly passionate about translating complex AI research into scalable, business-ready solutions that deliver measurable impact. I thrive in fast-paced environments where innovation, experimentation, and continuous learning drive meaningful outcomes.

Beyond technical work, I enjoy collaborating across teams, mentoring peers, and staying at the forefront of emerging trends in generative AI, responsible AI, and intelligent automation.

Career

Work Experience

Jul 2025 - Present
Current ยท
AI/ML Engineer
Binghamton University
  • Developed a RAG-based QA system using Phi-3 Mini and FLAN-T5; achieved 88% accuracy by optimizing retrieval chains for medical and financial datasets.
  • Fine-tuned LLMs with QLoRA and PEFT, reducing GPU memory overhead by 20% while maintaining peak predictive performance and model stability.
  • Built an interactive Streamlit suite and Power BI dashboards via FastAPI to transform complex model outputs into real-time insights and stakeholder reporting.
  • Applied SHAP and MLflow for model explainability and tracking; implemented automated validation pipelines ensuring reproducibility and 99.9% reliability.
Aug 2024 - May 2025
10 mo
Graduate Research Assistant
Binghamton University
  • Developed instruction-tuned transformer pipelines for semantic analysis of MIMIC-III EHR data, reducing manual labeling and text-extraction time by 25%.
  • Managed HIPAA-compliant ETL pipelines for 10M+ records of structured and unstructured claims data using PySpark and SQL.
  • Trained XGBoost and Quantized Phi-3 models to classify complex health insurance claims, optimizing for clinical relevance and HIPAA regulatory standards.
Jul 2021 - Jun 2023
2 yrs
Machine Learning Engineer
Cognizant
  • Deployed XGBoost and LSTM models to predict credit default risk for 10M+ customers, improving precision (AUC-ROC) by 8% while maintaining Basel III compliance.
  • Architected automated feature engineering pipelines using PySpark and SQL, reducing data processing runtime by 20% and enabling near real-time risk scoring.
  • Enhanced deployed risk models with automated monitoring and Power BI dashboards, reducing manual audits and improving portfolio risk management efficiency by 10%.
  • Conducted A/B testing on credit policies, identifying drivers of customer churn and informing a new risk strategy that supported a 5% increase in acquisition.
Mar - May 2022
Internship ยท 3 mo
AWS Virtual Intern
All India Council for Technical Education (AICTE)
  • Built cloud-based solutions using AWS services including EC2, S3, Lambda, and RDS for scalable application deployment.
  • Designed and implemented relational databases with Amazon RDS to power dynamic, data-driven applications.
  • Deployed serverless architectures leveraging AWS Lambda and S3 for efficient event-driven processing pipelines.
Portfolio

Featured Projects

Spanning two high-stakes domains - financial intelligence and clinical AI - each project reflects an end-to-end ML mindset: from raw data and model design to deployed, measurable real-world impact.

[ 01 ] Finance
๐Ÿ›ก๏ธ
Financial Risk & Fraud Detection System
End-to-end fraud intelligence platform trained on 10M+ transactions - combining supervised classification, anomaly detection, and a RAG-enabled GenAI layer for real-time financial analysis. Achieved 92% accuracy, +12% fraud recall, and a 15% reduction in loan-approval bias through Responsible AI techniques.
PyTorchXGBoostAzure MLRAGLLMsDockerSQLTableau
โ†—
[ 02 ] Healthcare
๐Ÿฅ
Clinical Chart Review Assistant (RAG + LLM)
Production RAG pipeline using FLAN-T5, Phi-3, and ClinicalBERT (PEFT/LoRA) to auto-generate SOAP note summaries and flag medication safety issues. Reduced hallucination rate by 40% through retrieval grounding - delivering a reliable, audit-ready clinical intelligence layer at scale.
FLAN-T5ClinicalBERTLangChainChromaDBSageMakerFastAPIStreamlit
โ†—
[ 03 ] Finance
๐Ÿ“‰
Customer Churn Prediction & Personalization Engine
Churn-prediction system built on 500K+ customer records using advanced feature engineering and ensemble methods. Delivered personalization-driven behavioral insights that reduced modeled churn risk by 12%, paired with interactive Power BI dashboards for real-time stakeholder decision-making.
Scikit-learnXGBoostPandasSQLPower BITableau
โ†—
[ 04 ] Healthcare
๐Ÿ“‹
GenAI Clinical Summarization & Compliance Engine
PPO-based RL pipeline with toxicity-aware rewards enforcing HIPAA compliance in LLM-generated clinical summaries - achieving zero policy violations across 10K test documents. Directly applicable to RADV audit documentation and regulated healthcare reporting workflows.
PyTorchHuggingFaceLangChainSageMakerChromaDBRL / PPO
โ†—
[ 05 ] Finance
๐Ÿค–
Autonomous Trading Agent - Reinforcement Learning
DQN-based RL agent achieving +15% return over baselines in rigorous backtesting. Deployed as a FastAPI REST API on AWS with Docker containerization - demonstrating reinforcement learning applied to sequential decision-making and real-time agentic AI architectures.
PyTorchTensorFlowOpenAI GymFastAPIDockerAWS
โ†—
[ 06 ] Healthcare
๐Ÿ“ˆ
Patient Outcome Prediction & Cohort Segmentation
LSTM/GRU discharge readiness models combined with K-Means cohort stratification across 6 patient segments - improving hospital resource planning accuracy by 18%. Full MLflow experiment tracking and Tableau reporting deliver end-to-end observability from model training to clinical insight.
PyTorchTensorFlowLSTM / GRUSageMakerMLflowTableau
โ†—
[ 07 ] Healthcare
๐Ÿ”ฌ
Medical Imaging Classification - Computer Vision
ResNet/MobileNetV3 models classifying X-ray and MRI abnormalities at 92% accuracy, deployed via MLflow and Docker for real-time clinical inference. PySpark preprocessing at scale ensures the pipeline handles production-grade imaging volumes without bottlenecks - bridging research-quality CV with clinical deployment demands.
PyTorchTensorFlowOpenCVResNetMobileNetV3PySparkMLflowDocker
โ†—
Tech Stack

Skills & Tools

Programming & Querying
Python
R
SQL
Java
C / C++
Data Science & Analytics
Predictive Analytics
Statistical Modeling
Hypothesis Testing
A/B Testing
Regression & Classification
Machine Learning & AI
TensorFlow / PyTorch
Hugging Face / Transformers
Generative AI & LLM Fine-tuning
NLP (FLAN-T5, Phi-3)
Deep Learning (CNN, RNN, LSTM, GRU)
OpenCV
Data Tools & Visualization
Pandas / NumPy
Matplotlib / Seaborn / Plotly
Power BI / Tableau
Streamlit
Cloud & MLOps
AWS (SageMaker, EC2, S3, Lambda)
Azure ML
GCP (BigQuery, Dataflow)
Docker / Kubernetes
CI/CD / MLflow
ETL, Databases & Big Data
PySpark / Apache Spark
Snowflake / dbt / Airflow
MySQL / PostgreSQL / MongoDB
MS SQL Server / ChromaDB
Academic

Education

๐ŸŽ“
Master of Science - Computer Science
Binghamton University, SUNY ยท 2023 - 2025
Specialization: Artificial Intelligence
Thomas J. Watson College of Engineering and Applied Science
Binghamton, New York
๐Ÿ›๏ธ
Bachelor of Technology - Computer Science & Engineering
Vasireddy Venkatadri Institute of Technology, India ยท 2019 - 2023
Strong foundation in data structures, algorithms, and software engineering
Research

Publications

IJFANS - International Journal of Food and Nutritional Sciences December 2022
Opinion Mining on Twitter Data Using Machine Learning
Sentiment analysis of tweets related to NASA's Artemis I mission. Applied NLP techniques - BERT, Word2Vec, VADER - alongside classification models (SVM, Decision Tree, Random Forest) to achieve 97.4% accuracy. Developed a Flask-based dashboard for real-time sentiment visualization.
BERTWord2VecVADER SVMRandom ForestFlask NLPSentiment Analysis
Read Paper โ†—
Credentials

Certifications

๐Ÿค–
Generative AI with Large Language Models
Amazon Web Services (AWS)
Issued May 2025Credential ID: U34TIU0OIOUL
See Credential โ†—
โ˜๏ธ
AWS Certifications
All India Council for Technical Education (AICTE)
Issued May 2022Credential ID: 662eafd1c8ba132fceaa7d40946aa011
See Credential โ†—
๐Ÿง 
Machine Learning with Python
Coursera
Issued Oct 2021
See Credential โ†—
// let's build something great

Let's Connect

I'm actively looking for full-time AI/ML and Data Science roles.
Let's talk about how I can bring value to your team.