Examples 2 and 3 as presented in the article couldn't be more horrendous examples of invalid inferences from test results if the author had tried.
It's the cargo cult approach to testing: "let's change everything and if the revamp increases conversions we'll guess what factor had the most effect".
It's the cargo cult approach to testing: "let's change everything and if the revamp increases conversions we'll guess what factor had the most effect".