During my experience as a data analyst, I managed a diverse range of responsibilities. These include automating data processing workflows, creating centralized dataset from multiple sources, enhancing the customer lifetime value (LTV) model, performing retention analysis and creating dashboards for cross-functional teams. My work generates analytical insights that drive business performance improvements. Additionally, I have developed a thorough understanding of subscription-based customer lifecycle journey, from enrollment to renewal. I am proficient in analyzing and interpreting growth metrics, such as activation rates, churn rates, retention rates and other key performance indicators.
In my free time, I love playing basketball and tennis, trying out new recipes, and getting lost in a good book. Lately, I've been really into pickle ball, and I'm always up for a game in the bay area.
The dashboard summarizes unicorns across industries, highlights the investors backing them, and presents their valuations, providing a clear overview of the broader unicorn ecosystem.
Tools:
Skills:
The analysis will cover crucial areas such as revenue and retention trends, cohort behavior, the impact of discounts on pricing, and customer segmentation. Additionally, more advanced techniques, including lifetime value (LTV) forecasting, cross-selling analysis will be employed to provide deeper insights into customer behavior and business performance.
Tools:
Skills:
In this project, I performed exploratory analysis to understand the fraud transactions on a bank simulated dataset, and I also applied statistical methods to detect fraudulent activities.
Tools:
Skills:
To understand the development of the electric vehicle market, this project studies the data of electric vehicles registered in WA from 1997 to 2023.
Tools:
Skills:
In this project, I performed analysis on customers' reviews on an E-commerce site to identify their areas of interest/concern.
Tools:
Skills:
I developed supervised learning algorithms for customer churn prediction in this project. The labelled data in this data set is imbalanced, so I applied SMOTE for oversampling. Besides, I applied encoding, standardization technique to transform the features. Logistic Regressions, KNN, Random Forest algorithms are used for modeling. Model evaluation involves metircs like f1-score, ROC and AUC scores.
Tools:
Skills:
In this project, I used machine learning techniques to build models that can detect fraud credit card transactions on a highly imbalanced dataset, in which only less than 1% transactions are considered fraud. Random downsampling method is used to handle the imbalance data.
Tools:
Skills:
This analysis aims for developing a statistical model to classify heart disease using data collected through non-invasive procedure. The final model achieves 84% accuracy and has a false positive rate of 18%.
Tools:
Skills:
[yaohong010@gmail.com]
Send Email