Exploratory data analysis
Understand Customer Page path and Map User Journey
Data dimensional modelling (split)
Define Null (Control/A) and Alternate (Challenger/B) Hypothesis
Choose a statistical test
Perform a one-sided, paired, t-test
Summary Statistics
Develop visualisation of data (raw and standardised)
Interpret test result
By designing a dimensional data model and utilising statistics, we can see the distribution of customer-journeys as each customer attempt to complete their applications on the website.
Data distribution is split into two groups;
'before content changes were made to the website' and 'after content changes were made to the website'
Numerical data type
Sample size is (according to central limit theorem) assumed to be normally distributed
Samples are paired
Direction os test is two-sided
Test statistic = -0.2070
P-value = 0.8367
Significance level (𝜶) = 0.05 (5%)
We cannot conclude that changing the content on the “Before You Start Page” affected the proportion of users completing an application first time.
The test result is not statistically significant enough to reject the null hypothesis.
This does not mean we have proved the null hypothesis.