Initializing AI Systems...

Arsalan Shaikh

Building production AI systems at scale. Specialized in deep learning architectures, NLP pipelines, and intelligent automation. From research to deployment, I engineer solutions that drive measurable business impact through advanced machine learning and data-driven insights.

ML Engineer at Adastraa AI
Arsalan Shaikh Profile

Current Role

Machine Learning Engineer

Adastraa AI

Dec 2025 – Present

Building AI-powered advertising intelligence platform for enterprise-scale campaign optimization

  • Designed and deployed Amazon Marketing Stream pipelines using AWS SQS for real-time data ingestion
  • Architected AI-powered Ads Copilot delivering automated campaign insights and optimization recommendations
  • Deployed and maintained production systems on AWS EC2 & AWS Amplify with 99.9% uptime
Python AWS SQS EC2 Amplify FastAPI ML Pipelines

Professional Experience

Oct 2025 – Dec 2025
Gen AI Engineer (Remote) | Journalyst

Engineered real-time voice-driven trading psychology coach using cutting-edge Gen AI technologies
• Integrated Groq Llama 3.3, Whisper ASR, and PlayAI TTS for conversational AI pipeline
• Implemented Pinecone-backed RAG system for context-aware psychological insights
• Developed custom VAD (Voice Activity Detection) and adaptive intent-prompting algorithms
• Delivered psychological insights from speech patterns and behavioral analysis in real-time

Sep 2025 - Oct 2025
Software Engineer Intern (Remote) | Dictation Daddy

• Fine-tuned OpenAI Whisper Large for domain-specific transcription, improving accuracy in production use cases.
• Optimized inference pipeline, reducing latency by 40% (1s → 600ms) for near real-time response.
• Deployed scalable APIs with monitoring, logging, and fault-tolerance for reliable integration.
• Integrated Gemini 1.5 LLM to deliver adaptive text formatting (formal/informal) based on context.

2023
Infosys Springboard Intern

Developed a CNN model for Iris Tumor Detection with 92% accuracy, implemented image augmentation pipeline.

Key Achievements

Mumbai Hacks 2025

Finalist

15,000+ Participants

Selected from a pool of over 15,000 participants across India to compete in the finals

Led team as Team Lead & Decision Maker through all competition phases

Directed problem-solving strategy and technical architecture decisions

Delivered final pitch presentation showcasing innovative AI solution

Academic Background

B.Tech in AI & Data Science

MIT Aurangabad

2021 - 2025
CGPA: 7.5

Technical Expertise

Machine Learning & Deep Learning

Regression Classification Clustering Ensemble Methods Feature Engineering PCA Neural Networks CNN RNN/LSTM/GRU Real-Time ML Systems

Generative AI & LLMs

OpenAI GPT APIs Gemini 1.5 Groq Llama (3.1 / 3.3) Whisper ASR PlayAI TTS LangChain RAG Pipelines Prompt Engineering

NLP

Tokenization Text Preprocessing Word2Vec BERT Text Classification NLTK spaCy

Cloud (AWS)

EC2 Amplify S3 SQS Amazon Marketing Stream

MLOps & DevOps

Docker CI/CD Pipelines Model Deployment Logging Monitoring

Data & Analytics

EDA SQL (MySQL) NoSQL (MongoDB) Power BI Matplotlib Seaborn

Programming & Frameworks

Python NumPy Pandas Scikit-Learn TensorFlow PyTorch FastAPI Flask React

Problem Solving Practice

Consistent problem solving & algorithmic practice

Less
More

AI Projects

Hybrid Chatbot Project Gen AI
AI Hybrid Chatbot

A hybrid chatbot system combining knowledge base with LLM capabilities using Groq. Features React frontend, Flask backend, providing both fast responses from local knowledge base and AI-powered responses using llama-3.1-8b-instant.

React Flask Groq LLM Python
AI Interview Platform NLP
Aimers - AI Interview Platform

Advanced BERT-based evaluation system with 90%+ accuracy in assessing candidate responses. Features sentiment analysis, answer relevance scoring, and comprehensive feedback generation.

Python BERT NLP FastAPI TF-IDF
Loan Risk Prediction ML
Loan Risk Prediction System

XGBoost model with 91% AUC score for credit risk assessment. Includes automated feature engineering pipeline and interactive Power BI dashboards for risk visualization.

XGBoost Power BI Scikit-learn FastAPI
Tumor Detection CV
Iris Tumor Detection CNN

92% accurate CNN model developed during Infosys internship. Implements advanced image augmentation and hyperparameter optimization for medical image classification.

CNN TensorFlow OpenCV Medical AI
Movie Recommendation NLP + ML
Movie Recommendation System

Movie Recommendation system built using Natural Language Processing & Cosine Similarity for intelligent content-based filtering.

NLP Cosine Similarity IMDB API Streamlit

My Resume

Download My Professional Journey

Explore my comprehensive background in AI engineering, including academic achievements, technical skills, project experiences, and professional accomplishments in detail.

Get In Touch

Let's Build The Future With AI

I'm actively exploring new opportunities and collaborations. Whether you have a project in mind, want to discuss AI, or just say hello, I'd love to hear from you.