Hi, my name is Yaohong Liang
I'm a Data Science and Analytics professional!

Know more

About me

Profile Image

As a Data Analyst in the RevOps team at GoHealth, I manage a diverse range of responsibilities. These include automating data processing workflows, enhancing the customer lifetime value (LTV) model, and creating comprehensive reports and dashboards by integrating data from multiple sources. My work generates analytical insights that drive business performance improvements. Additionally, I have developed a thorough understanding of the customer lifecycle journey, from enrollment to renewal. I am proficient in analyzing and interpreting customer success metrics, such as churn rates, new business rates, renewal rates, and other key performance indicators.

In my free time, I love playing sports, trying out new recipes, and getting lost in a good book. Weekends are all about hanging out with family and friends. Lately, I've been really into visiting museums and playing board games with friends. Basketball is a big passion of mine, and I’m always up for a game. I'd love to meet other basketball fans in the city to play together.

Projects

Financial Payment Analysis

Fraudulent transactions in financial payments represent a significant challenge for the global financial industry. In this project, I performed exploratory analysis to understand the fraud transactions on a bank simulated dataset, and I also applied machine learning methods to detect fraudulent activities.

See Live Source Code

Electric Vehicle Market Analysis

Electric Vechicle has been very popular in the automative market. It is an observable trend that the global electric vehicle market has been expanding quickly in recent years. To understand the development of the electric vehicle market, this project studies the data of electric vehicles registered in WA from 1997 to 2023.

See Live Source Code

Document Clustering and Topic Modeling

Customer reviews are valuable assets for most companies. In this project, I applied Natural Language Processing technique to cluster customers' reviews on an E-commerce site, and identify the latent topics of these review texts.

See Live Source Code

Customer Churn Prediction

I developed supervised learning algorithms for customer churn prediction in this project. The labelled data in this data set is imbalanced, so I applied SMOTE for oversampling. Besides, I applied encoding, standardization technique to transform the features. Logistic Regressions, KNN, Random Forest algorithms are used for modeling. Model evaluation involves metircs like f1-score, ROC and AUC scores.

See Live Source Code

Credit Card Fraud Detection

In this project, I applied my ML skills to construct models that can detect fraud credit card transactions on a highly imbalanced dataset, in which only less than 1% transactions are considered fraud. Random downsampling technique is used to handle the imbalance data.

See Live Source Code

Flight Paths Visualization

This project aims at developing an interactive map to visualize the flight paths in the United States. You can choose any airport in the U.S. and the map will show all the destinations that you can go from the selected airport.

See Live Source Code

Heart Disease Detection

This analysis aims for developing a machine learning model to classify heart disease using data collected through non-invasive procedure. The final model achieves 84% accuracy and has a false positive rate of 18%.

See Live Source Code

Contact

[yaohong010@gmail.com]

Send Email