Name: Ovaiz Ali

Job Role: Data Engineer

Experience: 3+ Years

Address: Calgary, AB, Canada


Technical Skills

Core Languages

PYTHON 95%
SQL 95%

Data & Cloud

ETL and Data Warehouses 90%
Data Modelling 85%

AI & Analytics

AI & Machine Learning 80%

About

About Me

A professional data engineer with over three years of experience, proficient in designing, building, and maintaining optimal data pipeline architectures. Skilled in implementing complex, scalable big data projects, with a deep understanding of database structures, principles, and best practices. Well-versed in multi-cloud platforms.

  • Profile: Data Engineering & Analytics
  • Domain: Retail, Banking, Ecommerce, Finance & Digital Marketing
  • Programming: Python, SQL, R, JavaScript & DS(NumPy, Pandas, Scikit-Learn)
  • Databases: MySQL, Azure SQL Database, BigQuery & AWS Redshift, S3
  • BI Tools: Power BI,Tableau, Looker, AlteryX & KNIME
  • Other Skills: GCP, AZURE, PySpark, Excel, Git, Google Analytics, NLP & LLM
  • Language: English, Hindi, Gujarati
  • Interest: Traveling, Singing & Community Services

0 +   Projects completed

LinkedIn Profile

Resume

Professional Journey

Seasoned Data Engineer with 3+ years of experience building robust data pipelines and infrastructure to empower data-driven decision making. Proven expertise in data extraction, transformation, and loading (ETL), cloud platforms, and big data processing.

Experience


May 2025 – Present

Consultant, Data and Analytics

BDO Canada LLP

  • Designed scalable ETL pipelines using medallion architecture on Microsoft Fabric with PySpark and config-driven automation.
  • Deployed Fabric items and resolved production defects, achieving less than 1% failure rate for Credit Union and Public Sector clients.

Jan 2025 – May 2025

Intern, Data and Analytics

BDO Canada LLP

  • Built end-to-end data engineering project using Microsoft Fabric with PySpark, Data Factory, and Power BI.
  • Prototyped ML recommendation engine with 8 million synthetic records and migrated 130+ on-prem databases with 87% faster consolidation.

June 2024 – Present

AI/ML Engineer - Part-time

Crony Software

  • Developed NLP chatbot using NLTK and FastAPI with CI/CD deployment, improving customer experience by 20%.
  • Prototyped RAG solutions using LangChain and OpenAI, serving 300+ weekly customers for retail analytics.

Aug 2022 – Dec 2023

Junior Consultant, Data Engineer

Systems Limited

  • Improved SSRS reporting by 30% using SQL optimization and managed TB-scale BFSI data in Vertica/Cloudera.
  • Enhanced Hadoop extraction by 15% with Apache Spark and reduced downtime by 30% on Azure Synapse pipelines.



Education


Jan 2024 – May 2025

Master of Applied Computer Science

Dalhousie University, Canada

CGPA: 4.23 / 4.30

  • Runner-up in GenAI Hackathon
  • Tutorial TA for Advanced Cloud Computing
  • Graduate Mentor at T@DGE
  • IT Student Navigator

Aug 2018 – June 2022

BSc in Computer Science

National University of Computer and Emerging Sciences, Pakistan

CGPA: 3.64 / 4.00 (Cum Laude)

  • Technical Lead at Google Developers
  • Runner-up in Procom'22 (Data Science)
  • Runner-up in Developer's Day (Data Science)

Projects

Projects

Below are the sample Data Engineering projects on Python, SQL and Cloud Technologies.

Distributed Database for Healthcare

Java GCP Cloud SQL

Implemented distributed database system for healthcare industry with ERD design, database fragmentation logic, Global Data Catalog (GDC), and Java-based data parsing for optimal multi-region data management.

Serverless Image Captioning

Python AWS Lambda S3 DynamoDB Streamlit

Serverless application generating captions for images using AWS Lambda, S3 for storage, DynamoDB for metadata management, and Streamlit for an interactive frontend interface.

CanBuddy - AI News & Chat Assistant

Python AWS GCP LLM Streamlit

Multi-cloud AI-powered web application transforming Canadian Reddit posts into news articles. Features AI-generated news summaries, conversational AI assistant, and interactive dashboard with real-time data visualization using AWS Lambda, Step Functions, GCP, and HuggingFace LLMs.

DAL Vacation Home - Multi-Cloud

Python React AWS GCP

Comprehensive vacation home management system integrating AWS Lex chatbots, Lambda functions, SNS notifications, and GCP services. Features virtual assistant, message passing, notification system, and BigQuery-powered data analytics with Looker Studio visualizations.

Telco Churn Prediction

Python Machine Learning LIME Streamlit

Telecommunications customer churn prediction system using ensemble learning (Random Forest, AdaBoost, Stacking Classifier) with explainability features. Implements cost-sensitive learning, LIME explanations, and counterfactual analysis for personalized retention strategies, achieving 88.83% accuracy.

Olympics Data Analytics

Python Azure Data Factory Databricks Synapse

End-to-end Azure data analytics pipeline processing Tokyo 2021 Olympics dataset. Implements data ingestion via Azure Data Factory, transformation in Azure Databricks, and analytics with Azure Synapse Analytics on Data Lake Gen 2 for comprehensive Olympic athlete and medal insights.

AWS Data Engineering Pipeline

Python AWS GCP Glue BigQuery

End-to-end multi-cloud data processing pipeline for NYC taxi data. Combines AWS services (Lambda, Glue, S3, Step Functions, EventBridge, SNS) with Google Cloud (BigQuery, Looker Studio). Implements Bronze-Silver-Gold ETL architecture with real-time failure monitoring and CloudWatch alarms.

Canada Housing Market Analysis

Python Machine Learning Data Analytics LLM Plotly

Comprehensive analysis of temporary residents' impact on Canadian housing market. Combines PCA dimensionality reduction, k-Means clustering, ARIMA time-series forecasting, and LLM-generated insights. Interactive Plotly dashboard with scatter plots, radar charts, and predictive visualizations for housing affordability trends.


0 Certifications
0 Projects
0 Skill Badges

More projects on Github

I love to solve business problems & uncover hidden data stories


GitHub Profile

Contact

Contact Me

Below are the details to reach out to me!

Address

Calgary, AB, Canada

Contact Number

+1(782) 234-1914

Email Address

ovaizali123@gmail.com

Download Resume

Resume Link