Snow nothing,
reap nothing.
Hey,my name is Shuai Gao (Leo).
I am a formal mapping engineer, current Master of Data Science graduate from RMIT.
I am familiar with Python, Excel, R, Java and SQL, skilful in time series analysis,
data mining, machine learning and statistics. I am also acclimated to big data and cloud computing technologies.
e.g. Google cloud, MapReduce, Hadoop.
Download my CV here
Download Chinese CV
Education
SKILLS AND CERTIFICATE
•
Certified machine learning analyst by ITman
•
Language: English (Sophisticated), Mandarin (MT)
•
Software:
Proficient in
Excel and
Python; familiar with
SQL,
JAVA and
R; familiar with
SAS
•
Keywords:
Supervised | Unsupervised | Semisupervised | Reinforcement Learning | MapReduce | Time Series Analysis |Regression | SVM | Random Forest | Feature Selection | Dimensionality reduction | Data Collection | Scrappy |
Artificial Neural Network | Lasso Regression | Ridge Regression | Data Pre-processing | Cloud computing |
Logistic Regression | K Means | DB SCAN | Crawl |
Working Experience
Coles
Coles Supermarkets Australia Pty Ltd, trading as Coles, is an Australian supermarket,
retail and consumer services chain,
headquartered in Melbourne as part of the Coles Group.
Advanced Analytics Internship
• Used unsupervised machine learning algorithms for user information grouping based on the purchase behavior and personal information statistics of users in Australia;
• Responsible for data processing, including extracting data from SQL, cleaning data, feature selection and statistical analysis as well as data visualization;
• Results: Scored the user value according to the user's RFM (recent purchase, purchase frequency, purchase expenses), which was recognized by the team. Grouped the information such as discount tendency, quality requirement,
purchase quantity and prices, etc., laying the basis for advertisement placement;
Melbourne, Australia
MAR 2019 – Present
IT MAN
Leading IT Consulting Agency in Melbourne. Affordable IT professionals for crafting amazing websites,
mobile apps. We are also experts in digital marketing and providing excellent IT solutions
to help the business grow faster
Data Scientist Intern
• Independently researched the Beijing job market and used machine learning algorithms for salary forecasting;
• Grabbed job related data from 51Job, performing statistical analysis and feature selection for data cleaning and processing;
• Used deep learning methods (Random Forest, SVM, etc.) to build a machine learning model to predict job salary levels;
• Results: The salary level can be basically predicted given a description of the position information, and the prediction accuracy reached over 80%;
Melbourne, Australia
JUN 2018 – FEB 2019
Project
•
End to end data analysis of regional job market:
Data collection, data pre-processing, words cloud, modelling and association finding. Analyze the information about the job market in Beijing, China.
•
Scrapy project:
Crawling data from popular websites.
•
Wechat Robot:
By using Mini Batch K-Means algorithm to covert picture received from WeChat to reduce the size of the picture without distortion, and send back picture to the
friend.
•
Web analysis based on Request package:
Using Request package to get the information from HTML sit.
•
Australian Workforce (Full-time/Part-time) Male-Female Ratio Shinny App Data Visualization :
• Description: Visualized the historical trends of the number of employees in various industries/positions in
Australia and analyzed trends and proportions of male and female practitioners;
• Completed data selection, cleaning, grouping, and created interactive charts independently;
• Found out through the interactive chart that women still occupied a small proportion in most industries.
Although the total number of male and female employees in some industries was almost the same, men were
more likely to hold full-time jobs; developed interactive light applications through Shinny app to share
visualization results through url;
• ranked top 5% in the class;
•
Interactive visualization:
Interactive visualization of the relationship between population life expectancy and the gross domestic product has changed over time.
•
Cloud-based application:
Web application for analysis health indicator of every country in serval years. Using serval web tech to collect, analysis data and track users' action.