OLUMIDE OLAOYE
PORTFOLIO

Microsoft Certified Azure Data Scientist, Google Certified Data Analyst skilled in SQL, Excel, Power BI, Tableau, Python, and R.
@olu_olaoye
Proficient in predictive modeling, data processing, data mining algorithms, and scripting languages.
Excellent understanding and proficiency of platforms for effective data analyses, with an in-depth understanding of the entire scope of the data analysis process.

Predicting Climate Change and Impacts in Africa

Here's the scenario >>> As a Data Scientist in a non-governmental organization, I was tasked with predicting climate change and impacts in Africa and report the state of climate change at the upcoming African Union Summit. On this task, I utilized some machine learning algorithms to predict the climate changes in the year 2025 across the regions discovered in the dataset.

People Analytics - HR Employee Attrition and Workforce Dynamics

The data was gotten from kaggle. The combined data source integrates information from four distinct tables, providing a comprehensive overview of various aspects related to the organization's workforce. The combined data source thus offers a holistic view of the organization's workforce dynamics, encompassing employee feedback, job structure, office locations, and attrition information. This integrated dataset enables a more comprehensive analysis of the relationships and trends within the organization over time.

Football Analytics - 2018 World Cup Squad

The year was 2018 and the FIFA World Cup that year was hosted by Russia. The World Cup 2018 squad dataset was handed over to explore. As a Data Analyst in the Analytics team of a Football Academy company, my task was to uncover some hidden insights to help optimize the business's scouting decisions. The Director of Football Analytics would like to know the extent of insights that can be uncovered from the dataset and make recommendations to improve and optimize the scouting strategies of the firm.

Impact Evaluation Analysis - Kenya Bridge Education Program

As a Data Analyst in a not-for-profit international education organization, I utilized historical data of the Bridge Kenya Programme to assess the impact of the program in 111 schools, in 7 provinces across 31 regions in Kenya. The data was for over 13,000 pupils from grades 1-5 from the end of an undisclosed school term in the past five years.

Exploratory Data Analysis on IMDB Movie Data

As a Data Analyst in a movie entertainment firm, I was tasked to scrape data for the top 250 movies from the IMDb website and perform an Exploratory Data Analysis (EDA) on the scraped data in order to answer some business questions.

Data Cleaning in SQL

In this project, I gathered data on housing project in Nashville, USA and cleaned the dataset in SQL Server to answer some questions on housing problems in the area.

Wrangling and Analyze Data - WeRateDogs Datasets

Real-world data rarely comes clean. Using Python and its libraries, I gathered data from a variety of sources and in a variety of formats, assessed its quality and tidiness, then cleaned it. I documented my wrangling efforts in a Jupyter Notebook, plus showcased them through analyses and visualizations using Python (and its libraries). The dataset that I cleaned, analyzed and visualized, is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog. These ratings almost always have a denominator of 10. The numerators, though? Almost always greater than 10. 11/10, 12/10, 13/10, etc. Why? Because "they're good dogs Brent." WeRateDogs has over 4 million followers and has received international media coverage. WeRateDogs downloaded their Twitter archive and sent it to Udacity via email exclusively for me to use in this project. This archive contains basic tweet data (tweet ID, timestamp, text, etc.) for all 5000+ of their tweets as they stood on August 1, 2017

Communicate Data Findings - Prosper Loan Data

In this project, I wanted to look at the features of loans that could be used to predict their Borrower APR. The main emphasis was on the Loan Original Amount, borrower's Prosper Rating (Alpha), loan term (Term), and borrower's Stated Monthly Income. The dataset consisted of Borrower APRs and attributes of 113,937 loans. The attributes included Loan Original Amount, borrower's Prosper Rating (Alpha), loan term (Term), borrower's Stated Monthly Income, as well as many other features such as borrower's Employment Status, Debt To Income Ratio, Current Loan Status, etc. 352 data points were removed from the analysis due to very large stated monthly income that seemed as outliers and missing borrower APR information. I utilized Python libary visualization packages - matplotlib and seaborn to provide interesting insights and relationships among the loan features and their effects on the Borrower APR.

Investigate TMDb Movie Data

The primary goal of the project is to go through the general data analysis process — using basic data analysis technique with NumPy, Pandas, and Matplotlib. The movie dataset, which is originally from Kaggle, was cleaned and provided by Udacity. According Kaggle introduction page, the data contains information that are provided from The Movie Database (TMDb). It collects 5000+ movies basic move information and movie matrices, including user ratings, popularity and revenue data. These metrics can be seen as how successful these movies are. The movie basic information contained like cast, director, keywords, runtime, genres, etc.

Automobile Analytics in Python

As a Data Analyst in an Automotive firm in Calgary, AB, Canada, my Product Manager just approached me and wanted to know how efficient the makes of vehicles the company sells have performed over a period of time. I was given access to the automobile dataset consisting of key variables needed to answer the questions asked by my Product Manager. I used my expertise in NumPy, Pandas, matplotlib, seaborn to wrangle the data, analyzed it, performed some visualizations, in order to unravel trends to answer the questions asked by the business.

Data Exploration in SQL

Performed Data Exploration of Covid-19 Dataset in SQL Server. The dataset was gotten from ourworldindata.org/covid-deaths. I changed the dataset from csv file to excel, then imported into SQL server for exploration to uncover some interesting insights.

Movie Correlation Project in Python

I performed analysis of the movie industry dataset to ascertain the level of correlation among some of the movie features in the dataset using some Python libary packages like NumPy, Pandas, matplotlib, seaborn. The dataset was gotten from Kaggle.

Hotel Data Analysis Project in SQL Server and Power BI

As a Data Analyst in the hospitality industry, I builded a hotel database in SQL Server and connected it to Power BI. Basically, I performed some data manipulation and transfoormation on the hotel dataset from the database in SQL Server, then imported it into Power BI for further analyses and visualizations to uncover further insights in answering some business questions.

Tableau Projects

This contains all my Tableau Dashboards and Projects to showcase my documentation, visualization, reporting, communication and presentation skills.

Sales Forecasting Analysis of a Global Retail Store

I performed Time Series Analysis and forecasting on the sales of Product Categories across the various Market Segments for the next six months. I went ahead to provide analytics that will aid proper estimation and effective planning of inventory and business processes.

Boston Housing Prices Project

I performed descriptive analytics and visualizations on the dataset for Boston Housing Prices derived from the United States Census Service. Python library packages were utilized for this task.

Social Sector Analytics

I performed analysis to unravel the causes and impacts of fire incidents in some Nigerian markets in 2020. This analysis helped to derive actionable insights about the markets in which these recurring fires have occurred and made recommendations on what could be done to reduce the causes of these incidents in the future.

Bellabeat Analytics in R

I performed analysis and visualizations in R using Bellabeat (a Wellness Technology Company), as a case study. I utilized differerent packages in R to draw different insights and made recommendations on how Bellabeat can play it smart in the fitness industry. The dataset for this analysis came from FitBit Fitness Tracker Data on Kaggle.

Mobile OS Usage Project

Scenario: I performed analysis on products dataset as a Data Analyst in an IT firm in Ontario, Canada. The company just developed a mobile game application targeted at young people between the ages of 18 and 45. This mobile application has recorded tremendous success since the launch with over 100 million downloads recorded on various OS platforms. I used Python libary packages to unravel the mobile OS usage on different platforms in order to optimize the business process.