Archive

Posts Tagged ‘online testing’

E-commerce: When should you reveal the price in your shopping carts?

May 11th, 2011 4 comments

E-commerce is a lot like magic. According to my in-depth research into the art of illusion (hey, I watched the movie “The Prestige”), there are three basic elements to the classic magic trick:

  • The Pledge (set-up)
  • The Turn (the trick itself)
  • The Prestige (the reveal)

Much like a really good magic trick, you must also have your audience join you on a convincing journey, and deliver real value to them in the end without them feeling, well, tricked. For David Copperfield, that value is entertainment without feeling lied to by the illusion itself. For the classic e-commerce purchase funnel, that value is a quality product without being shocked by an unfair price at the end of the journey.

To that end, you might say e-commerce offer presentation has three stages (which may, in fact, overlap):

  • The Interrupt (gains attention)
  • The Value (establishes appeal)
  • The Price (the reveal)

But when do you reveal price in the cart process? That was a question we received in our recent Web clinic, Shopping Carts Optimized: How a few tweaks led to 12% more revenue across an entire ecommerce website. I discussed this topic with our Director of Sciences, Bob Kemper, and here’s what he had to say… Read more…

Marketing Optimization: How to determine the proper sample size

As I delve deeper into the abyss of numbers while preparing for my interactive panel on validity at Optimization Summit, I’m coming across more fallacies about validity in marketing tests. Here’s one I recently heard…

“We were told that if we send each treatment to 4,000 people on our list we would have a valid test.”

I changed the number to protect the innocent, but this is a common misconception.

That’s why they play the game

Now I’ll give them credit. The misconception above is thinking along the correct lines. A large enough sample size is necessary to ensure you have validity.

However, while you can take an educated guess, it is impossible to know the minimum sample size before the test is actually run. Just ask a Las Vegas bookie.

Because, an important factor in sample size determination is the difference in results between the treatments. If the treatments return very different results, it’s much easier to confidently say that you really do have two (or however many) emails that will perform differently. You don’t need as many samples to do that.

However, if the treatments have very similar results, you want many more observations to see if there really is a difference.

Think of it this way. I recently went to Disney World, and while waiting in line for a ride the line split.I was  curious to see if more people would go to the left or the right.

If I saw nine people go to the left, and one go to the right, I’d feel pretty confident that people tend to favor the left.

But what if the split was six to the left and four to the right? I would want way more observations to feel confident that there is a real difference, whether people really do favor one side over the other, or if what I’m seeing is just random chance. Maybe for the next ten people, six will go to the right and four to the left.

And that’s why it’s impossible to determine the exact sample size you need for every test. You would, essentially, need to know the response you would get for each treatment before you tested. And, after all, that’s why we run the tests. Because it’s nearly impossible to guess on an outcome. Again, just ask a Las Vegas bookie. The house often wins. But not always.

I asked Phillip Porter, a data analyst here at MECLABS, for a more official sounding explanation than my Mickey Mouse example above; an explanation that you can use verbatim to sound smart and win any internal debate. Here’s what he said…

“Significance is based on sample size and effect size. The larger the sample you have, the smaller effect size you can find to be significant. The larger the effect size present, the smaller sample size you need to find significance. Larger sample sizes are generally better, however any difference, no matter how small, can be found to be significant with a large enough sample size.”

“Imagine the thrill of getting your weight guessed by a professional.”

However, much like Navin R. Johnson’s carnival barker in “The Jerk,” it doesn’t hurt to take a guess as long as you know what you’re doing. And if you truly are a professional, you don’t have anything to lose by guessing (except maybe some Chiclets).

“You can guesstimate before you run the test, but the actual numbers may not match what you were expecting. If you could guesstimate correctly every time before running the test, why would you need to run the test?” Phillip said. Proving not only that great minds think alike, but that it doesn’t hurt to try to get a sense for how much traffic or email recipients you need to get a valid test.

In fact, we include a pre-test estimation tool in our Fundamentals of Online Testing course. There’s one in the MECLABS Test Protocol as well, which our researchers use for all of their experimentation.

But a true professional will still run the final numbers…

How to determine if you have a big enough sample size

I don’t want to get all Matt Damon writing on a chalkboard in “Good Will Hunting” on you, but (and read this in your best Boston accent) it comes down to the math. I had a tough time on how I could truly serve you with this blog post. The last thing I wanted to do was raise a problem and not give you a solution (or only provide a paid solution).

I think, in the end, the best thing to do is just provide you with the equation. The sample size calculation that we use internally can be found in Cochran, W. G. (1977). Sampling Techniques, 3rd ed., Wiley, New York.):

Phillip explained the formula…

“This formula provides us with the minimum sample size needed to detect significant differences when Z is determined by the acceptable likelihood of error (the abscissa of the normal curve). The value of Z is generally set to 1.96, representing a level (likelihood) of error of 5%.  We want the highest accuracy possible, with the smallest sample size.  This level of error, 5%, gives us the best tradeoff between these two goals.

p is the conversion rate we expect to see (estimate of the true conversion rate in the population), and d is the minimum absolute size difference we wish to detect (margin of error, half of the confidence interval).”

We are working on a dead simple validity tool (the iPod of validity tools, if you will) to pass out at Optimization Summit. But for now, you can try putting the above formula in your own Excel spreadsheet.

Even Phillip admitted, “If you are trying to calculate this by hand it can look intimidating, but if you build the formula in Excel it is pretty simple.”

Related Resources

Optimization Summit 2011 – June 1 -3

Email Marketing Tests: What to do when a radical change produces negligible results

Online Testing and Optimization: ROI your test results by considering effect size

Online Marketing Tests: How could you be so sure?

Testing Madness: What the odds of picking a perfect NCAA Tournament bracket can teach us about running valid tests

Photo attribution: Rainer Ebert

Email Marketing Tests: What to do when a radical change produces negligible results

April 25th, 2011 1 comment

We usually share tests on this blog that our optimization research analysts conduct in our labs with our Research Partners. Sometimes we share tests from our audience is well, but rarely do we share our own tests.

In today’s blog post I wanted to share a recent email test from our own marketing team. Not because the results were impressive. If you’ve followed MarketingExperiments for any time, you’ve certainly seen us share results from 162% lifts and 59% gains. Today, we’re going to discuss what to do when you get something entirely different. Read more…

Homepage Optimization: Radical redesign ideas for multivariable testing

April 11th, 2011 3 comments

Class was in session again at MECLABS, and this eager beaver right here decided to join in on the learning once more. Of course, I had the MarketingExperiments audience in mind, so I took my handy dandy notebook and jotted down what I saw and heard.

Once again, Senior Optimization Manager, Adam Lapp, was front and center teaching our Research Analysts about, you guessed it, optimization. But, this class he focused on radical redesigns and the importance of multivariable testing when transforming a page in such a drastic way. After a lesson, it was time to put the learning into practice. And once again, being the cool teacher that he is, Adam set up a fun optimization contest. With bragging rights, being the main prize.

This time it wasn’t a battle of the sexes, but I can say it had a few surprising twists at the end. On one team you had MECLABS Research Analysts Aimee Bolton and Spencer Whiting and on the other you had Dustin Eichholt and Zuzia Soldenhoff-Thorpe. Their mission was to create the most effective radical redesign for a homepage. I will show you their designs as well as some advice from the teacher, pupils and myself. Hopefully you can apply it on your next testing adventure, especially in the case your design is a bit…um, far out. Read more…

Value Proposition: Our research team answers your questions

April 1st, 2011 No comments

Value proposition is one of the most popular topics we teach about. Getting your value proposition right is maddeningly difficult but highly rewarding. As they say in “The Town” before robbing Fenway Park, “If it was easy, everyone would do it.”

So, it wasn’t surprising that we received an inordinate amount of questions during our recent Web clinic – Do You Have the Right Value Proposition? How to test, measure, and integrate your Value Proposition online.

Unfortunately, we can’t answer every single question we receive. But I passed several of the questions around the labs, and here is what our research team had to say… Read more…

Landing Page Design: Eye path vs. Thought sequence

March 23rd, 2011 14 comments

I was impressed by the clarity of the headline and imaging of RealGoodsSolar’s landing page and later provided suggestions for testing their various sources of traffic to build dedicated landing pages for distinct levels of motivation.

Today, I wanted to add one last piece of analysis: the thought sequence effected by the page. “Eye path” is a concept often invoked by usability and user interface (UI) specialists. However, “eye path” is passive. What we want to optimize is the thought sequences in the mind of page visitors to lead them to the desired action, and to do that we must deliberately position content in the clearest and easiest-to-absorb sequence.

The rest of this post is presented in the general order of the expected thought sequence of a visitor to RealGoodsSolar’s webpage. Read more…