Loading
Patryk Sitarek

Data Engineer

Machine Learning

Data Analysis

Data QA

  • Bio
  • Resume
  • Skills
  • About Me
  • Contact
Patryk Sitarek

Data Engineer

Machine Learning

Data Analysis

Data QA

  • Bio
  • Resume
  • Skills
  • About Me
  • Contact
Menu
Hello, I’m Patryk Data Engineer

I will shape your data into insights.

Located in Poland, work for clients from all over the world.

With over 3 years of experience in data engineering and working with data. Skilled in data extraction, transformation and analysis, as well as building data processing workflows. Additional expertise in Machine Learning and Data QA.

Successfully contributed to projects for both local and Global 500 companies. This diverse experience has honed the ability to adapt to different industries and deliver tailored and scalable solutions that drive data-driven decision-making at every level.

4

Years of
Experience

9

Completed
Projects

5

Global
Clients

Resume

Experience
2024 - Present
Data Engineer
Warsaw, Accenture

Design and implement end-to-end ETL processes for BI reporting using Microsoft Fabric and Azure cloud. Enabling Fabric capabilities to users. Developing GenAI applications.

2022 - 2023
Junior QA Data Engineer
Warsaw, BitPeak

Planning and executing data validation processes. Building ETL processes for BI reporting using Azure cloud.

2016
IT Internships
Leipzig, Vitalis GmbH

Internship, during which I was responsible for administering operating systems, database maintenance and managing the internet network.

key Projects
2025
Fabric Costs & Utilization Reporting
Fast-Moving Consumer Goods
I worked on a BI reporting product on the use of Power BI/Fabric platform resources and the resulting costs generated by individual business units.
In addition, we provided formal recommendations and tools to support effective platform management, and introduced new capabilities for the Fabric platform.

The benefits of the project are:
  • Tools providing information on resources used and costs generated
  • New Fabric capabilities enablement
  • Platform management support
2024
Migration to Microsoft Fabric and further development
Fast-Moving Consumer Goods

I worked on migration of a data product for international leader in its industry. The product is a model for tracking user adoption and engagement with Power BI initiatives and products across the organization. On further stages the model has been enriched with data related to the use of Power BI resources.


The benefits of the project are:

  • Increased performance:
    • Power BI reports responsiveness with Direct Lake
    • Processing time reduced from 8 h to 1 h
    • Data availability improved from T–2 days to T–1 day
  • Improved error and fault tolerance
  • Significantly increased data reliability
  • Model enriched with detailed insights about resources used and costs

The solution is fully based on Microsoft Fabric platform, using data pipelines, Python notebooks, data warehouse and direct lake semantic model with Power BI reports.

2024
Recommendation algorithm for supply chain management
Fast-Moving Consumer Goods

The project involved implementing a recommendation algorithm for order splitting (knapsack problem) to optimally divide the order based on product parameters and available transport fleet. The algorithm is used on the wholesale platform of a leading FMCG company worldwide.


The benefits of the project are:

  • Automatic recommendations for freight forwarders that replaced their manual work
  • Optimized transportation resources, reducing shipping costs

The solution is based on knapsack problem algorithm implemented in Python and deployed using Azure Machine Learning.

2024
GenAI-based application for customer service support and automatic review

Preparing a template for an application based on GenAI services, which, based on a chat or telephone conversation, prepares a transcription, a summary, verifies and evaluates the content of the conversation based on guidelines and proposes further actions.

The solution is based on Python with microservices architecture and Azure Cognitive Services.

2023
Building a data warehouse and BI reporting solution for sales data
Payments

I started the project as a Data QA Specialist, but from the very beginning I was fascinated by the role of a Data Engineer. In addition to QA tasks, I built data pipelines and helped with data modeling, eventually becoming the lead Data Engineer.
The product was a data warehouse built in a medallion architecture, designed to enable efficient processing of sales data from sources with limited availability while maintaining high scalability.


The benefits of the project are:

  • Unified data warehouse with aggregated data related to the company's core business. Power BI reports that analyze sales of key products.
  • Instant analysis of sales data that allows to respond to key products and customers.
  • Enabling the building of custom reports based on the data model.

The solution was based on Azure Data Factory and Synapse Analytics services, and serverless computing.

2022
Building a data warehouse for SAF-T reporting
Healthcare

It was my first professional commercial project and first experience with cloud computing. Worked as Data QA, responsible for validating data in the model, ETL processes, and automated validation rules within daily processing.


The benefits of the project are:

  • Centralized data warehouse for Standard Audit File for Tax (SAF-T) reporting
  • Reduced hundreds of hours of manual work by people to a few hours of result verification
  • Minimized the number of errors that required subsequent correction

The technology stack was Azure Cloud: Data Factory, Synapse Analytics with Delta Lake, and SQL Server. Most of the code is Python notebooks and stored procedures.

Certifications
Jan 2026
Microsoft DP-700: Fabric Data Engineer Associate
Microsoft Fabric

This certificate is a good summary of my nearly two years of practical experience with the Fabric platform. The scope covers knowledge of services, solutions, and data engineering skills in Fabric. It includes not only batch processing, but also streaming and platform management.

Certificate
Dec 2024
Microsoft DP-600: Fabric Analytics Engineer Associate
Microsoft Fabric

The certification validates my skills to design, build, deploy, and maintain end-to-end enterprise analytics solutions using Microsoft Fabric and Power BI, including data preparation, semantic modelling, and securing analytics assets.

Certificate
Sep 2023
Microsoft AI-900: Azure AI Fundamentals
Microsoft Azure

The certification demonstrates foundational knowledge of artificial intelligence and machine learning concepts, and how to apply them using Microsoft Azure services.

Certificate
Jan 2023
ISTQB® Certified Tester Foundation Level
Testing

The certification validates that an individual has a solid understanding of core software testing principles, terminology, lifecycle models, test techniques & tools, and test management.

Education
Master of Micro- and Nanotechnology
2021 - 2023
Master of Micro- and Nanotechnology
University of Silesia

Master of Science in Micro- and Nanotechnology. Minors in machine learning and electronic.

Bechelor of Computer Science
2017 - 2021
Bechelor of Computer Science
University of Silesia

Bechelor of Engineering in Computer Science. Minors in algorithms and data structures, database management and machine learning.

2013 - 2017
Technician of Computer Science
ZSOT in Lubliniec

My education in the industry began at a technical school. Today, I don't use any of the tools I was taught there, but I appreciate the foundational knowledge and paradigms I learned there.

Skills

Python 3
Python 3
90%
T-SQL, SQL Server
T-SQL, SQL Server
80%
Data Transformation
Data Transformation
75%
Data Pipelines
Data Pipelines
90%
Data Analysis
Data Analysis
75%
Machine Learning
Machine Learning
75%
Cloud Computing
Cloud Computing
75%
Data QA
Data QA
75%
Tools & Platforms
  • Microsoft Azure Cloud
    80%
  • Microsoft Fabric
    90%
  • Azure Data Factory
    90%
  • Azure Synapse Analytics
    75%
  • Azure Data Lake Storage Gen2
    80%
  • Delta Lake
    90%
  • Spark
    80%
  • Azure Machine Learning
    25%
  • Power BI
    25%
  • Pandas
    75%
  • NumPy
    75%
  • Keras
    50%
  • SciKit-Learn
    50%
  • OpenAI
    75%
  • GitHub
    75%
  • Azure DevOps
    75%
  • Jira
    60%
  • Confluence
    60%
  • Visual Studio Code
    80%
  • SQL Server Management Studio
    50%
Languages
  • English
  • German
  • Polish
Soft skills
  • Scrum & Agile
  • CI/CD
  • Cross-Functional Collaboration
  • Critical thinking
  • Communication
  • Business analysis
  • Team work organization

About

This section does not contain specific information, but I encourage you to read it if you want to get to know me better. I try to explain how I got to where I am.

Childhood and Secondary School

I am 28 years old and grew up in Dobrodzień, a small town in southern Poland. As a child, I used to service computers and boost their performance to be able to play my favorite games. That is how I became interested in IT and chose a technical secondary school with an IT specialization. That's where I started programming and creating websites. That's where I started programming and creating websites. I obtained a degree in IT, but it wasn't my passion yet.

Studies

Despite my doubts, I decided to study computer science, which turned out to be a good decision. Looking back, I realise that I was naturally drawn to data-related specialisations. I put a lot of effort into learning Python, applying it in algorithms and data structures, and developed solid skills in databases and SQL. During my master's studies, I specialised in machine learning.

As part of my thesis, I built custom smog measurement stations and trained models to programmatically replace heaters, increasing the accuracy of smog measurements – a key environmental problem in Poland.

First Job
During my master's degree studies, I started my first professional job as a Data QA. I quickly realised that working with data is very interesting, but on the other side – as a developer.  While still working as a Data QA, I began to get involved in development tasks. That is how I became a data engineer. After a few months, I became the lead data engineer on a small project. Initially, my tools were SQL and Python with Pandas and NumPy. That's also where I had my first practical experience with Azure and AWS cloud computing. As a data engineer, I focused on the Azure ecosystem with Apache Spark and T-SQL.
Data Engineer Role

At my next company, I was a data engineer specialising in Azure from the outset. Initially, I used my background in machine learning and joined a team developing an application using GenAI for the finance. I also worked on machine learning and model management in the Azure Machine Learning environment.

Then I joined a project where our goal was to migrate the existing solution to Microsoft Fabric. After a successful migration and unique hands-on experience, more and more demands related to Fabric emerged.



Today, I am a data engineer with unique hands-on experience in the Fabric stack, in which I specialise. The most common demand concerns the development of BI products, but I also help with the effective management of the Fabric platform, including monitoring its usage and costs.

CONTACT

Let’s make your project brilliant!

If you would like to get in touch, the easiest way to contact me is on LinkedIn:

LinkedIn
/in/sitarek/
My location
Katowice, Poland

© 2025 RyanCV