Arsalan Shaikh
Entry-level Data Scientist with hands-on experience building ML and AI solutions for medical imaging, financial risk analysis, and an AI interview platform. Results include a CNN tumor-detection model with 92% accuracy and an inference latency reduction of about 40% (1s to 600ms) in a speech transcription pipeline. Skilled in end-to-end ML workflows and in translating complex data into actionable insights.

About Me
AI Engineer & Data Scientist
As a dedicated AI Engineer, I architect and build intelligent systems from the ground up. My expertise lies in transforming complex datasets into actionable insights using advanced deep learning and NLP models. I thrive on solving real-world problems by engineering scalable and efficient AI solutions.
My journey in data science complements my engineering skills, allowing me to not only implement models but also to deeply understand the narratives within the data. From feature engineering to deploying robust MLOps pipelines, I manage the full lifecycle of an AI project, ensuring impactful and data-driven results.
Gen AI Engineer (Remote) | Journalyst
Leading Gen AI initiatives and developing cutting-edge AI solutions.
Software Engineer Intern (Remote) | Dictation Daddy
• Fine-tuned OpenAI Whisper Large for domain-specific transcription, improving accuracy in production use cases.
• Optimized inference pipeline, reducing latency by 40%+ (1s → 600ms) for near real-time response.
• Deployed scalable APIs with monitoring, logging, and fault-tolerance for reliable integration.
• Integrated Gemini 1.5 LLM to deliver adaptive text formatting (formal/informal) based on context.
Infosys Springboard Intern
Developed a CNN model for Iris Tumor Detection with 92% accuracy and implemented an image augmentation pipeline.
B.Tech in AI & Data Science
MIT Aurangabad | CGPA: 7.5 | Specialized in Deep Learning and NLP
Quasar Hackathon Finalist
Led a team to the finals with an AI-powered interview platform, recognized for its innovative NLP implementation.
My Expertise
Machine Learning & Deep Learning
- Regression, Classification, Clustering
- Ensemble Methods (Voting, Stacking, Bagging, Boosting)
- PCA, Feature Engineering
- Model Tuning & Optimization
Natural Language Processing
- Text Preprocessing, Tokenization
- Word2Vec, BERT
- NLTK, spaCy
- Text Classification, Similarity Analysis
Data Analysis & Visualization
- EDA, Data Cleaning
- SQL (MySQL), MongoDB
- Power BI, Tableau
- Matplotlib, Seaborn
Programming & MLOps
- Python (Pandas, NumPy, Scikit-learn, TensorFlow, Keras, PyTorch, FastAPI)
- R, C++
- Git, GitHub (CI/CD)
- Docker, MLflow, DVC
- API Deployment
AI Projects
Iris Tumor Detection CNN
CNN model with 92% accuracy, developed during the Infosys Springboard internship. Uses image augmentation and hyperparameter optimization for medical image classification.
My Resume
Download My Professional Journey
Explore my comprehensive background in AI engineering, including academic achievements, technical skills, project experiences, and professional accomplishments in detail.
Get In Touch
Let's Build The Future With AI
I'm actively exploring new opportunities and collaborations. Whether you have a project in mind, want to discuss AI, or just say hello, I'd love to hear from you.