Ritesh Ojha

About Me
I design, build, and optimize modern data platforms that transform raw data into reliable, actionable business intelligence. Currently working as an Associate Data Analytics Engineer at Air Arabia, I handle end-to-end data engineering workflows — from data ingestion, transformation, and orchestration to warehouse optimization, reporting, and delivery. My focus is on creating scalable pipelines, improving data reliability, and enabling faster decision-making through efficient cloud-based analytics systems. My core stack includes Microsoft Fabric, Snowflake, Power BI, PySpark, Apache Airflow, Python, SQL, and REST API integrations.
With 2+ years of experience in data engineering and analytics, I specialize in building end-to-end data products — from automated API-based data ingestion to delivering high-performance Power BI dashboards for executive decision-making. I'm passionate about bridging the gap between raw technical data and actionable business insights through streamlined automation and robust data governance.
My Expertise
ETL/ELT Pipelines, Data Warehousing, BI Reporting, Cloud Data Architecture
My Goals
Building scalable data platforms that drive data-driven decision making and business growth.
Technical Toolkit
A comprehensive overview of the technologies and tools I use to bring ideas to life.
Cloud & Data Platforms
Data Engineering
Languages & Query
BI & Visualization
Featured Projects
Selection of some of my most challenging and impactful work, spanning from AI research to production-ready enterprise systems.
Spotify Data Engineering Project
Data engineering pipeline on AWS for processing Spotify data. Loaded CSV files into S3, performed ETL using AWS Glue, stored processed data as Parquet files, and queried/visualized using Amazon Athena and Power BI.
Data Engineering End to End Podcast episodes Project
I created a data pipeline using Airflow. The pipeline will download podcast episodes. I stored our results in a Postgres database that we can easily query.
Career Journey
Associate Data Engineering Analyst
- •Designing and maintaining data pipelines using Microsoft Fabric for seamless data integration and processing.
- •Utilizing Snowflake as the primary data storage layer, implementing Bronze (raw ingestion) and Silver (cleansed/structured) layers.
- •Executing Gold layer transformations using PySpark for advanced analytics and business-ready datasets.
- •Developing interactive Power BI dashboards for executive insights and Paginated Reports for structured data exports.
- •Automating data ingestion from APIs and performing full ETL processes to transform raw data into meaningful insights.
- •Collaborating with cross-functional teams to ensure data accuracy, quality, and performance across the analytics lifecycle.
Data Analyst
- •Specialized in leveraging Google Sheets to develop and manage data-driven products.
- •Used the platform's capabilities to analyze data, create detailed models, and design solutions that support product development and business decisions.
Data Analyst
- •Developed and implemented a data management system using Google Sheets.
- •Built 5+ interactive dashboard reports in Power BI to visualize data.
- •Automated data storage processes using JavaScript, improving efficiency by 80%.
- •Developed 8+ task automation using Google App Script, streamlining communication and data delivery.
Data Science & Machine Learning Intern
- •Gained hands-on experience in data science and machine learning methodologies.
- •Worked on real-world data projects involving data preprocessing, analysis, and model development.
Recognition
Microsoft Certified: Power BI Data Analyst Associate
Snowflake SnowPro Core Certification
Let's Build Data Solutions
I'm always open to discussing data engineering projects, BI solutions, or opportunities to help transform your data into actionable insights.