Data Engineer · Cloud & Analytics

Aniket Andhale

I build modern data solutions using distributed systems and cloud computing to turn raw data into real business impact.

Skills
Programming & Data Processing
Python
Pandas
NumPy
SQL
PySpark
Cloud & Data Platforms
Google Cloud
BigQuery
Cloud Storage
Cloud Functions
Pub/Sub
Dataflow
Vertex AI
Microsoft Azure
Databases & Tools
MySQL
PostgreSQL
MongoDB
SQL Server
Git
GitHub
Looker Studio
MS Excel
Power BI
Experience & Education
Baker Hughes
Data Analyst Intern
Past
Jan 2025 – Nov 2025 · Pune, India
  • Automated internal publication workflows using Python (Pandas), reducing manual processing by 95% and generating monthly award reports for monetary distribution.
  • Processed and re-engineered 500+ legacy PDFs using Python (NLP, image processing, data scraping), performing data cleaning, validation, and structured extraction to improve reporting efficiency.
  • Built a Power BI dashboard from log data with KPIs and DAX measures, enabling real-time license monitoring and reducing analysis time by 80% and operational costs by 20%.
  • Designed and deployed an end-to-end ML-driven data extraction pipeline (Microsoft AI Hub + Power Automate + SharePoint + MS Teams integration), improving extraction accuracy by 30% and reducing SME review effort.
Python · SQL · Power BI · Excel · Microsoft AI Hub · PowerApps · Image Processing
NorthStar Impact Solutions
Tech & Research Intern
Past
July 2024 – Sept 2024 · Remote, India
  • Implemented data scraping with Python and NLP, processing 50+ PDFs/hr; optimized regex patterns to reach 98% extraction accuracy on complex tables across 10+ financial report formats.
  • Designed a PostgreSQL schema for storing and retrieving the extracted data, significantly reducing manual entry time.
  • Documented methodologies, processes and flowcharts; presented findings and analysis results to the team.
Python · NLP · PostgreSQL
Marathwada Institute of Technology, Aurangabad
B.Tech — Artificial Intelligence and Data Science
Education
2021 – 2025 · Aurangabad, India
  • Built a strong foundation in data analytics and cloud technologies through hands-on academic and personal projects.
  • Held multiple positions of responsibility, won the Google Cloud Vertex AI Hackathon, and received the TATA Scholarship for academic merit.
DBMS · Cloud Computing · ML · AI
Featured Projects
Credit Card Fraud Detection with Data Engineering on Google Cloud
Built a real-time fraud detection pipeline processing 300k+ transactions using Pub/Sub and Dataflow, achieving 97% precision with BigQuery ML while reducing fraud response time by 30%.
Python · SQL · BigQuery · Dataflow · Pub/Sub · Firestore · Looker Studio · Cloud Functions · Secret Manager
Book Recommendation System Using Collaborative Filtering
Built a collaborative filtering recommendation system using KMeans + KNN on 270k+ user ratings, achieving RMSE 0.9 and MAE 0.8. Improved performance through EDA and feature tuning. Deployed a Dockerized Streamlit app on Azure App Services for a scalable solution.
Python · Machine Learning · EDA · Streamlit · Docker · GitHub Actions · Microsoft Azure
Uber Data Analytics | Data Engineering with GCP
Designed a fact-dimension model for Uber trip data, improving query performance by 30%. Built an ETL pipeline with Pandas and Mage for scalable data processing. Used BigQuery and Looker Studio on GCP for analytics and visualization.
Python · Pandas · Mage · Cloud Storage · Compute Engine · BigQuery · Looker Studio · Fact-Dimension Modeling · ETL Pipeline
Bank Loan Analysis Power BI Report
Analyzed 38K+ loan records using SQL Server and Power BI with a structured data model for MTD/MoM tracking. Built a 3-page dashboard with 20+ DAX measures covering loan status and risk segments.
Power BI · SQL · SQL Server
Certifications
Associate Cloud Engineer
Google Cloud
Data Fundamentals (DP-900)
Microsoft Azure
AI Fundamentals (AI-900)
Microsoft Azure
Featured Blogs
Artificial Intelligence
Tabu Search | Artificial Intelligence
Tabu Search algorithm with its optimization strategy and Python implementation.
Cloud
Overview of Microsoft Azure Database Services
Overview of Microsoft Azure database services, including relational, NoSQL, storage, and analytics solutions.
April 2024
Cloud Community
Google Cloud Study Jams: Navigating Cloud Horizons with GDSC MITA
Shares my experience leading Google Cloud Study Jams at GDSC MITA.
About Me
Aniket Andhale
// Data Engineer · Pune, India

For the past 2+ years, I’ve been exploring Data Science and Cloud Computing, trying to understand how data systems truly work behind the scenes. Over time, I realized that data engineering sits at the intersection of my two interests: working with data and building on the cloud.

I enjoy designing end-to-end pipelines, from ingestion to analytics, and creating systems that make a real business impact. What excites me most is building systems that can reliably serve millions of users. Outside of tech, I’m discovering a new side of myself by learning the flute.

"I can accept failure, but I can't accept not trying."

— Michael Jordan
