About Me
Data Processing Engineer, Certified Data Engineer.

Data Processing Engineer, Certified Data Engineer.
In this project, I designed and implemented a robust data pipeline using Apache Airflow, an open-source workflow automation tool. The goal was to automate and streamline the process of collecting, processing, and visualizing data from multiple sources, providing timely insights for decision-makers.
In this project, I conducted sentiment analysis on Twitter data using Apache Spark, a powerful big data processing framework. I analyzed tweets to determine public sentiment about a particular topic or event, providing valuable insights for decision-making and brand management.
In this project, I write a set of SQL queries to analyze COVID-19 data from the PortfolioProject database, focusing on cases, vaccinations, and population statistics. With these queries, users can extract and manipulate data to understand infection rates, mortality percentages, vaccination coverage, and more.
In this project, I focuses on preparing the dataset for analysis by addressing issues such as missing data, date standardization, and column parsing. By using these SQL queries, users can efficiently preprocess and clean the data for further analysis or visualization.