Category Archives: DataBase Marketing

Waiting for Pareto

A Warped View of Visitor / Customer Analysis?

Many folks look at the world through the lens of the Gaussian Bell Curve when the real model they should be looking at is Pareto Power Law, as elegantly explained by John Hagel here (you really should read that post when you have a chance, another link provided at the end of this post).  For the math inclined, as quoted from another source in John’s post:

Gaussian and Paretian distributions differ radically.  The main feature of the Gaussian distribution . . . can be entirely characterized by its mean and variance . . . A Paretian distribution does not show a well-behaved mean or variance.  A power law, therefore, has no average that can be assumed to represent the typical features of the distribution and no finite standard deviations upon which to base confidence intervals . . . 

Yea, like he said.  In English, that means when you are “optimizing”, you could be driving towards a suboptimal result if you’re not paying attention to what you should be paying attention to. Witness all the discussion of sample size and standard deviation in web analytics lately - are we talking Bell Curve here, folks?  If you walk the Long Tail walk, you should talk the Pareto talk. Here’s a pic from the Hagel blog if you’re not following this idea:

Gauss vs Pareto

(Image originally by Albert-Laszlo Barabasi, “Linked: The New Science of Networks”)

The bell curve average can kill you.  The real results of your tests are masked because you are looking at the wrong outcome variables.  Sure, go ahead and segment source; that’s where the idea of “there is no such thing as an average visitor” came from back in 2001, right?  So fine; you have the initial segmentation correct, and if you are just optimizing for conversion, you have it right.  If you are optimizing for profit, you’re not even close yet.  I’ve referred to this situation as the “Reporting versus Analysis” problem; here’s a math-ier view from John’s post, quoting McKelvey and Boisot:

Processing dots is appropriate to what we label the routinizing strategy.  Processing patterns, on the other hand, better serves what we call the Pareto-adaptive strategy.  Processing dots means processing data, a low-level cognitive activity.  By contrast, processing patterns – pattern recognition – is a high-level cognitive activity, one that involves selecting relevant patterns from among myriad possibilities. . .

John offers his own “In other words” on the paragraph above:

In a Paretian world, surface events can become a distraction, diverting attention from the deep structures molding these surface events.  Surfaces are extraordinarily complex and rapidly evolving while the deep structures display more simplicity and stability. These deep structures are profoundly historical in nature – they evolve through positive feedback loops and path dependence.  Snapshots become misleading and understanding requires a dynamic view of the landscape.

I first saw the effect John describes above at HSN, where the surface (the TV show itself) appeared to be complete chaos but the deep inner structure (the customer) was smooth and completely predictable.  Another way to say this is any one ”snapshot” of time seems quite chaotic, but if you watch the whole “movie” over time, you see the stable inner core.

The reverse is also true: as you try to artificially “stabilize” the surface – in HSN’s case, by moving to a rigid or supply-driven programming format rather than a demand or customer-driven format – the core starts to destabilize.  Marketing programs stop working as well as they did and the LifeTime Value of the customer erodes.  Darwinian, sort of.

What all this means to you: Optimizing for conversion is fine, as far as it goes.  But how do you know ”the converted” are the best customers you can get?  How do you know you’re not wasting resources optimizing the conversion of worst customers, for example?  How do you know that buried in your conversion optimization – no matter how much segmentation you do on the front end – is a Pareto distribution that is skewing the actual results?

Example:

One of my earliest clients was a vintage 1998 dot-com that had a massive business in lead generation.  They basically generated e-mail names through various niche interest content sites and then rented those names.  Over time, they also started creating their own products and selling to the names they generated.  Great business, killer margins.  They were a massive buyer of all types of online media.

But then that media started to get much more expensive, and they had to start looking at Campaign ROMI as opposed to just sales, and that is why I was brought in.  We were in the middle of a system integration that would provide a “ROMI Dashboard” to the media buyers when the word came down to kill off about 1/3 of the campaigns they were running.  I begged them to wait for the data (we’re almost there!) but the “shoot from the hip” culture prevailed.

Pareto had his day.  The campaigns killed were the ones with the lowest initial response rate – and as it turns out, these same campaigns also happened to generate about 95% of their most profitable customers.  You just had to wait about 3 months after initial conversion to see the profit. 

The customer value death spiral (Kevin uses the term file momentum to describe the same idea) ensued and they were never able to recover.  Once this kind of best customer value erosion has completed, it is very difficult to survive long enough to rebuild the power of the database.  Media costs rose as customer productivity fell and when these two lines crossed, the company collapsed.

This death spiral / file momentum idea is incredibly important in the Marketing Productivity area and somewhat difficult to get your arms around, so it’s worth helping you to visualize it (if you’d like audio with your visualization, try this). 

Here we go:

Say that 20% of the customers you acquire are “best customers” and generate 80% of your profits.  In profit terms, say this means $200 in net profit per customer over 6 months.  Now, let’s say you change your customer acquisition methods and your new “best customers” generate $100 in net profits over 6 months.

The day before you initiate this change in campaigns, you still are generating best customers with a value of $200 over 6 months.  So you have about 6 months before the profit effect of these customers disappears.  And remember, these folks are responsible for the majority of your profits.  Few in number, powerful impact.

So, you start generating customers only worth $100 over 6 months.  You don’t even notice anything wrong, since the value plays out over time.  Conversions are fine, and sales are fine – because you are still living off the $200 customers while you are bringing in the $100 customers.  And the rest of your operations remain the same, so unit sales and so forth seem fine.  We’re only talking about a change to a small group of customers, right?

About month 3, sales start to slip, out of nowhere.  You’ve entered the wide mouth of the death spiral (cue the audio).  You can’t explain it, you haven’t “changed” anything recently. Conversions of those new $100 customers are still coming in just fine, thank you.  But after month 6, you start living only on the $100 customers and sales are now 3/4 what they used to be.  You’re in the throat of the death spiral now.

Let’s say you do some analysis and “discover” your mistake.  You kill the campaigns that generate the $100 best customers and start generating $200 best customers again.  How long will it take to get back to where you were?

Right.  6 months – at a minimum.  Problem is, the campaigns that generate $200 customers are twice as expensive as the campaigns that generate $100 customers, and you’re now in the vortex of the death spiral.  You spend like crazy but can’t make it up in time and at month 4, media costs outstrip cash flow and the company goes belly up.

Pareto plays a very tough game, eh?  While you’re thinking about it, here again is a link to the Hagel post.

6033% ROI, Defining Churn

Jim answers questions from fellow Drillers
(More questions with answers here, Work Overview here, Index of concepts here)


6033% ROI
—————-

Q: We have exchanged email a few times, and I don’t recall if I ever said thank you for your book.  While I had been experimenting with many CRM programs in my little dry cleaning shop, your book gave my thoughts order and clarity to refine what I had started.  Today, I see the world differently.

A: Well, thanks for the thanks!

Q: You may or may not remember me. Just after I sold my dry cleaning shop, I had bought your Drilling Down book. I was the dry cleaner who had been doing rudimentary data mining and CRM with a point of sale system I had developed in Regina, Saskatchewan.

A: I do remember. Internally, I was thinking, “Wow, this is going to be a real test of the Drilling Down concept”? I mean, I have seen it work in many small businesses, but dry clean (seems to me) is a very tough, tough business. Too many players, a lot of competing on price, etc. A great environment for underground customer marketing in terms of beating the other guy – they will never know what happened to them. But still, tough for small owner / operator to have the “will” and time to really make it happen. So yea, I remember…

Q: Well, I’ve continued working within the dry cleaning as a marketing consultant. The programs I had developed in my shop have now been transplanted into a few of my client’s shops, and are bearing fruit.

Tonight one of my clients reported a ROI of 6033% doing direct mail to certain customers in his market in California. Another client of mine reported his fourth year of steady growth. One of my first clients has been showing a 7 percent annual compound growth, and he is in a flat or declining market. What began in my shop has been proven across North America, into Europe and Australia by my clients.

A: I can’t express how exciting that is. Congratulations!

Q: Jim, data mining dry cleaner’s data is a blast. You would be stunned at the quantity, and quality of data a dry cleaner gathers today. Would you ever have thought data mining could be applied to suits and shirts? Well yes, it can.

A: I am stunned, and I bow to your most excellent Drilling!

Q: Once again, thank you.

A: And thank you for sharing this, it’s very, very exciting to hear. Like you said, no other word for it than “stunning”. I remain most stunned!  Keep me informed. Perhaps you should write a book?

Jim

Defining Churn
———————

Q: I work for an economics consulting firm based in Washington DC. I am researching customer churn and customer displacement statistics across a variety of industries to try to establish a benchmark of what is considered high and low customer displacement.

A: Nice to meet you, and a noble task!

Q: Do you happen to have any such churn statistics, or know if a place you could recommend?  I found plenty of statistics regarding churn rates within the telecom industry, but am most interested in companies that are involved in business-to-business relationships with their customers (relationship between a customer and a supplier).

In addition, I would also like to find churn statistics for customers who use multiple suppliers. For example, a customer may go to several grocery stores rather than sticking with one dedicated store.  I would be interested in learning more about the statistics companies in these types of industries use to track customer displacement.

A: The reason you find a lot of churn info in telco / cable is the end of the customer life is easily defined by the disconnect, and these numbers are reported publicly as part of annual reports and so forth. In many other businesses like the ones you describe, typically the companies have failed to define customer defection and so in their minds, there is no churn because there is no defection.

A “customer”, even though they have not contacted the company for 3, 5 or 10 years, is always still a customer. If the company thinks like this there is no churn rate to be measured, by the definition the company has chosen for itself.

At the same time, defining defection is pretty easy to do by looking at the transactional data and defining the patterns of defection, for example “if a customer has not ordered from us in 3 years they are highly unlikely to order again”. That’s defection defined; you just put a line in the sand and say “3 years no contact is a defection”. The company then should declare customers in this status “defected” and then a churn rate could be found. This is pretty easy to do, so if not executed, one of two situations exist: either the company does not have the data or they don’t have the “will” to discuss, internally or externally, the concept of customer defection.

A third possibility exists: the company in fact has the data and has defined defection, but would never, ever speak to churn or customer defection in any kind of public forum because this information is so critically important from a competitive and strategy perspective. To discuss these numbers or the implications in public could have dramatic consequences for company positioning in the market or stock price. So if they have the numbers, they’re locked in a safe.

As a result, I’m sorry to say, I do not have any broad-based “sources” for you, save one possibility: a book called The Loyalty Effect by Frederick F. Reichheld (1996). In this book, Reichheld goes through the business models of 25 different companies that excel at retaining customers in different industries , and proves out the financial model of customer retention using real data. This is the book where the quote, “It costs 5x more to acquire a new customer than retain a current customer” (or the various bastardizations) came from. So it might help you out.

The only other thing I can suggest is that “churn” is not always the word used to describe these stats but is most often used when the disconnect is easily defined, as in telco / cable; “displacement” is a rare use for this idea as far as I can tell..

“Customer Turnover” is a popular phrase in Europe and is used by some in the US; also “defection rate” is used quite a bit. So if you’re pounding on Google to try to find these numbers, try those phrases and others you may find when doing these searches. Banking / finance / insurance is another area where the “disconnect” is often easily defined, so you will find various defection rates in some of their case studies on the web.

Jim

Get the book at Booklocker.com

Find Out Specifically What is in the Book

Learn Customer Marketing Concepts and Metrics (site article list)

Download the first 9 chapters of the Drilling Down book: PDF 

Book: Managing Customers as Investments

Are You Spending More on Your Customers than They are Really Worth? authors Gupta and Lehmann ask.

Based on my experience, Why Should We Care? is the return question.  Lots of companies apparently do not care, at least not yet.  But someday these companies will “hit the wall” with the traditional focus on acquiring new customers, and then they will care.  Just a Case of History Repeating, don’t ya know.

When I published the 3rd edition of my book, I was pretty sure the business world had finally made it past “Why Should We Care?” and would be on to “How Do We Do This?”.  Wrong, as Ron constantly remind me.  Before the book reviewed below was published, I was using what I called the “portfolio approach to managing customers” idea to set up the Current Value / Potential Value model.  Spent an entire chapter on the idea.  Apparently, that was not quite enough to answer the “Why Should We Care” question.  My bad.  Turns out the same idea is worthy of an entire book.  Sigh…

So for those of you who are more interested in the “Why Should We Care?” (90% of You?) as opposed to the tactical “How Do We Do This?” presented in the Measuring Engagement Series, I give you the following book:

This is a 6 Chapter, no nonsense, 165 page book that is heavily annotated with the kinds of “proof” you need to potentially get your Boss to care about this topic.  I mean, the boss-person can just hit the EndNotes (another 33 pages with the Appendix) and find out how the theories, formulas, and examples in this book have all been well documented by a slew of hard core academics and Consultainers alike.  These references to various studies, books, papers, and so on probably have more impact than say, sending an e-mail to the boss titled “Interesting Stuff” with a link to my blog…

The book is easy to read and shoves all the “Math” into the Appendix so whether you’re using the right or left brain you can follow right along – ignore the math and just Grok the pictures, or get right down into full-blown proofs with your Calculus shoes on.  Have it your way, as they say.

Just look at these Chapter Titles:

1.  Customers are Assets – no argument there from me.  Want definitive proof?  Here it is.

2.  The Value of A Customer – do you know the value of yours?  Here is an easy – and I mean easy – way to estimate this value.  Call it Lifetime Value “Light” –  it’s way better than what you have now, I bet.

3.  Customer-Based Strategy – Oh, to develop Strategy and actually do something based on the Value of the Customer, as opposed to whining about how they are “in control”.  This is the best chapter in the book.  Customers can only take control if you give it to them, you know.  And that’s a Strategy problem.

4.  Customer-Based Valuation – they’re talking firm valuation here, for the purposes of acquiring companies or selling them.  As I said before, Wall Street uses the Current Value / Potential Value Model.

5.  Customer-Based Planning – as in building Customer Value right into the Business Plan, so the execution is rock solid.

6.  Customer-Based Organization – sure, the tough one.  How you make it all work in the org.  A bit of the Analytical Culture thing.

The Gupta and Lehmann book is great because it takes what I’ve learned through 20 years of “exposure” (Why You Should Care) and explains it in corporate speak, creating links to stock prices and all kind of other good stuff the CFO would really like to hear about, like projecting future sales, estimating the buyout value of the firm, evaluating acquisitions, and so forth.  All from the same kind of customer analysis we just worked through in the Measuring Engagement series.  Really.  Except they hide all the numbers from you – unless you go looking for them.

Oh, and did I mention this info is all well documented by a slew of hard core academics and Consultainers alike?  Seriously though, there are over 120 footnotes that at the very least provide you a library of solid references and case studies on the topic of using customer value data to drive increased profits.

So, if you’re a Marketer trying to create a bridge to Finance, get a copy for your friend over there.  If you’re an analyst trying to get a deeper understanding of why customer analysis matters to the business side, get a copy for yourself.  If all of a sudden you are in charge of “CRM” or “Customer Experience” or whatever they are calling running a business that doesn’t shred its own customers these days, get this book for the sake of your company.  The book really is a great read and might help you make sense of all the disparate and seemingly conflicting marketing and service ideas you read about today.

And while you’re at Amazon, get a guide for the people who will have to turn all this Customer Value Data into Profits for you – mine.