Customer Relationship Manageme

CUSTOMER RELATIONSHIP MANAGEMENT: FROM CONVERSION TO
CHURN TO WINBACK
A Dissertation
Submitted
to the Temple University Graduate Board
In Partial Fulfillment
of the Requirements for the Degree of
Doctor of Philosophy
By
Ke Li
May, 2013
Examining Committee Members:

Anthony Di Benedetto, Advisory Co-Chair, Department of Marketing and Supply Chain
Management
Eric M. Eisenstein, Advisory Co-Chair, TU Department of Marketing and Supply Chain
Management
Eric T. Bradlow, University of Pennsylvania
Ji Zhu, University of Michigan
Sanat K. Sarkar, External Member, TU Department of Statistics
ii
UMI Number: 3564826
All rights reserved
INFORMATION TO ALL USERS

The quality of this reproduction is dependent upon the quality of the copy submitted.
In the unlikely event that the author did not send a complete manuscript
and there are missing pages, these will be noted. Also, if material had to be removed,
a note will indicate the deletion.
UMI 3564826
Published by ProQuest LLC (2013). Copyright in the Dissertation held by the Author.
Microform Edition © ProQuest LLC.
All rights reserved. This work is protected against
unauthorized copying under Title 17, United States Code
ProQuest LLC.
789 East Eisenhower Parkway
P.O. Box 1346
Ann Arbor, MI 48106 - 1346
©
Copyright
2005
by
KeLi
All Rights Reserved
iii
ABSTRACT
With the grant of a big CRM dataset from a large media company, this
dissertation examines four different categories of factors that could impact three stages of
customer relationship management, namely customer acquisition, retention, and winback
of lost customers. Specifically, with the aid of machine learning method of random
forests and text mining technique, this study identify among the factors of customer
heterogeneity (e.g. in usage of self-care service channels, duration of service,
responsiveness to marketing actions), firm’s marketing initiatives (e.g. the volume of the
marketing communications, the depth of the promotion, the different communication
channels they use, and the marketing penetration in different geographical areas),
customer self-reported deactivation reasons, as well as the call centers notes in text form,
which factors play bigger roles than others during each of the three stages of CRM.
Furthermore, the authors also examine how these factors evolve throughout these three
stages of CRM in terms of their effects on shaping customers’ decision making of
whether to convert to paid customer, to churn, or to reactivate their service with the
company. The findings help managers better allocate their resources in the processes of
acquiring, retaining and winning back customers.
iv
ACKNOWLEDGMENTS
I would like to express my sincere gratitude to my “Fantanstic Four” dissertation
committee members who have guided me and assisted me throughout the completion of
this dissertation. Dr. Anthony Di Benedetto, it is your book New Product Management
that convinced me that Marketing is the most interesting research area that I should spend
the rest of my life time with. Dr. Eric Eisentein, it is your consistent belief in me and
persistent pushing me into this quantitative world that enables me to handle the analysis
of the big data from SiriusXM. Dr. Ji Zhu, we only met once when you came to Temple
University to give presentation but you quickly impressed me by already coming up with
the machine learning methods to deal with the classification problem in Sirius data after
we only chatted for 5 minutes. Afterwards, both of your expertise and spiritual support
carry me through this process of completing the dissertation. Finally Dr. Eric Bradlow,
even though I was deeply impressed by your presentation at O-Bayes conference at
Wharton in June 2009 but it was not until several years later that you finally become my
dissertation committee member and you have absolutely become the game-changer in my
career ever since. Your guidance to my dissertation direction, your prompt replies to my
every weekly report with your insights and encouragement, as well as your word-by-
word review and revision of my dissertation not only turns the dissertation into an
extremely interesting project to me but also paved way for me to become a serious
researcher. With all of your deep intelligence and great patience this experience of
working with all of you will surely benefit my whole career.
v
Dr. Sanat Sarkar, you are not only my external reader of my committee but also
one of my mentors in this school who supported me in all kinds of ways.
Dr. Aaronson, when you interviewed me for the PhD program at Fox School of
Business you told me Fox School needs students like me. For this very line of words I
have strived not to let you down. In addition, throughout all these years at Temple, you
have always given me a hand whenever it is the most dire situation for me.
I would also thank CIBER Temple, particularly Ms. Kim Cahill and Mr. Arvind
Phatak, without your supporting me financially I won’t be able to accomplish this task.
When other people doubted me, it is you, Kim, who always hold strong confidence in me
which means everything to me.
I would acknowledge WCAI and SiriusXM, this dissertation is based on your
grant of data. Without it, it is a mission impossible.
I would also acknowledge Dr. Bei Kang, who assisted me in many ways to bring
this effort into conclusion.
Finally I would like to thank my family. My mother Xi Liu, my father Faliang Li,
and my sister Hao Li, without their unconditional love and support I would never be able
to make it. Particularly my mom, even though I did not become a scientist as she wished
me to be, this is at least the first step to become a scholar if she could compromise her
dream. However, her training and education of me to be an independent woman has

become an invaluable asset to me that will benefit me throughout my whole life time.
Thank you, mom.
vi
This dissertation is dedicated to my grandparents Cuihua Zeng, Mengqiu Zhang,
Yingquan Li, Yingzhou Liu, and my parents Xi Liu and Faliang Li.
vii
TABLE OF CONTENTS
PAGE
ABSTRACT....................................................................................................................... iv
ACKNOWLEDGMENTS ...................................................................................................v
DEDICATION .................................................................................................................. vii
LIST OF TABLES ............................................................................................................. xi
LIST OF FIGURES .......................................................................................................... xii
CHAPTER
1. INTRODUCTION ...................................................................................................1
2. LITERATURE REVIEW ........................................................................................8
2.1 CRM (CUSTOMER RELATIONSHIP MANAGEMENT) ............................8
2.1.1 ACQUISITION AND CONVERSION…...……...............……………8
2.1.2 RETENTION AND CHURN………………...………………………11
2.1.3WINBACK…………..……………………..…………………………16
2.1.4 ACQUISITION, RETENTION AND WINBACK……….…….……18
2.2 TEXT MINING ..............................................................................................19
2.2.1 TRANSFORMATION OF TEXT ………………...............................20
2.2.2 TEXT CLASSIFICATION ……………………..……………….…..26
3. HYPOTHESES ........................................................................................................30
3.1 SELF-CARE ...................................................................................................30
3.2 MARKETING INITIATIVES ........................................................................33
viii
3.2.1 MARKETING COMMUNICATION CHANNELS AND
CUSTOMER RESPONSIVENESS ………………………………...34
3.2.2 RADIO DENSITY…………………………………………………..39
3.2.3 PROMOTIONAL DEPTH…………………………………………..39
3.2.4 PRICING AND CHURN AND WINBACK………………………...40
3.3 DEACTIVATION REASONS .......................................................................42
3.4 DURATION ON WINBACK.........................................................................43
4. METHODOLOGY ..................................................................................................44
4.1 CLASSIFICATION METHODS....................................................................44
4.1.1 BAYESIAN LOGIT/PROBIT MODEL…………………………......44
4.1.2 ADABOOST…….……………………………………….…………..47
4.1.3 RANDOM FOREST……….………………………………………...50
4.2 CLASSIFIER PERFORMANCE MEASURE ...............................................55
5. DATA AND VARIABLES .....................................................................................59
5.1 DATA .............................................................................................................59
5.2 VARIABLES ..................................................................................................61
5.3 TIME FRAME OF VARIABLES ..................................................................63
6. RESULTS AND DISCUSSION ..............................................................................69
6.1 VARIABLE IMPORTANCE IN FOUR EVENTS ........................................69
6.2 HYPOTHESIS TESTING ..............................................................................76
3. LIMITATIONS, FUTURE DIRECTIONS AND CONCLUSIONS ...........................85
REFERENCES ..................................................................................................................89
APPENDICES
A. LIWC2007 OUTPUT VARIABLE INFORMATION ........................................101
B. SUMMARY STATISTICS OF CONVERSION TABLE... ................................103
C. SUMMARY STATISTICS OF CHURN TABLE...............................................107

ix
D. SUMMARY STATISTICS OF NONCONVERTED WINBACK TABLE ........112
E. SUMMARY STATISTICS OF CHURNED WINBACK TABLE .....................116
F. CONVERSION VARIABLE IMPORTANCE TABLE ......................................121
G. CHURN VARIABLE IMPORTANCE TABLE .................................................126
H. NONCONVERTED WINBACK VARIABLE IMPORTANCE TABLE ..........131
I. CHURNED WINBACK VARIABLE IMPORTANCE TABLE ........................136
x
LIST OF TABLES
Table Page
1. Factors Affecting Aacquisition and Conversion .......................................................50
2. Factors Affecting Retention and Churn ..................................................................57
3. Factors Affecting Winback. ......................................................................................67
4. Data for 4 Events ......................................................................................................75
5. LIWC Output Snapshot ..........................................................................................100
6. Selected Variables and Their Definitions ............................................................100
7. Self-care Channel Activity in 4 Events .................................................................67
8. Marketing Variable Estimates ...............................................................................67
9. Variables Predicting all 4 Events ............................................................................67
Appendix A LIWC2007 output variable information ......................................................101
Appendix B Summary Statistics of Conversion table......................................................103
Appendix C Summary Statistics of Churn table ..............................................................107
Appendix D Summary Statistics of NonConverted Winback table .................................112
Appendix E Summary Statistics of Churned winback table ............................................116
Appendix F Conversion Variable Importance table ........................................................121
Appendix G Churn Variable Importance table ................................................................126
Appendix H NonConverted Winback Variable Importance table ...................................131
Appendix I Churned Winback Variable Importance table ..............................................136
xi
LIST OF FIGURES
Figure Page
1. Conversion Variable Importance Plot.......................................................................50
2. Churn Variable Importance Plot ...............................................................................57
3. Unconverted Winback Variable Importance Plot. ....................................................67
4. Churned Winback Variable Importance Plot ............................................................75
xii
1
CHAPTER 1
INTRODUCTION
Customer Relationship Management (CRM) has become increasingly prominent
in marketing since firms realized that, in order to maximize their profit, they should
carefully manage their relationship with their customers, often attempting to anticipate
consumer behavior. This is particularly the case in the service industry. Even though
CRM is a relatively recent term, the emphasis of marketing strategies on customers has
had a long history. From the very beginning of marketing campaigns, entrepreneurs and
firms have relied on all kinds of means to attract potential customers and to keep them
loyal for repeated purchases while providing them with the “right” products. Practitioners
have always emphasized both products and customers simultaneously, but the shift of
focus of marketing research from product to customers became practicable only after the
birth of database marketing in the 1980s.
Regarded as the predecessor of CRM, early database marketing merely consisted
of customers’ demographics and their transactions with firms, but these early efforts held
the promise of enabling targeted direct marketing to individual customers. Over time,
increased processing power and advances in methodology raised the competitive bar so
that, in order to be competitive, firms must have some form of CRM strategy in place.
Today, many companies apply RFM (Recency, Frequency, and Monetary Value)
methods as their CRM models of choice (Reinartz & Kumar, 2003; Fader, Hardie, & Lee,
2005; Cui et al., 2006). These models use secondary data as an input, and segment
customers so that they can target the right customers for promotion, and to profit from the
2
relationship (Berger & Magliozzi 1992). The most valuable customers identified by RFM
segmentation are the ones with the highest recency, frequency, and monetary value.
Criticism for this segmentation approach has pointed out that it assumes that customers’
behavior patterns are static throughout their whole life cycle. If incorrect, this assumption
results in companies overmarketing to the most attractive RFM segments while
neglecting other segments that could be profitable if properly developed. Another
application that database marketing has facilitated is the customer life cycle or customer
lifetime value (CLV) analysis. As early as 1940s, companies had already begun to
approximate the average value of their customers (Petrison et al., 1997), but academic
research on CLV flourished only after the introduction of database marketing. Most of
the CLV literature has been focused on constructing different probabilistic models of the
underlying consumer behavior. A smaller set of papers explore the relationship between
firm profitability, marketing actions and CLV (Hogan et al., 2002; Reinartz & Kumar,
2003; Rust, Zeithaml, & Lemon, 2004). Studies show that CLV is an effective metric to
segment customers and allocate marketing resources (Haenlein, Kaplan, & Schoder,
2006). A recent study by Fader, Hardie and Lee (2005) links both RFM and CLV by iso-
value curves in customer base analysis showing statistical sufficiency of RFM under a
restricted class of buy-till-you-die models.
Database marketing evolved into relationship marketing during the 1990s due to
four different drivers according to Fletcher, Wheeler and Wright (1991): the changing
role of direct marketing, changing cost structures, technology and economic conditions.
With increasing marketing costs, companies realize that to gain competitive advantage
3
and higher profitability, it is important to develop individualized relationships with
customers based on their specific needs so as to increase the length of customer life cycle
as well as the value of the transactions between the firm and the customer during each
stage of that life cycle. As a result, marketing activities are increasingly being organized
around relationships with customers rather than products and services. One example of
this has been the increased use of loyalty programs (Tellis & Zufryden 1995; Kim, et al.
2001; Kivetz & Simonson 2002; Roehm, et al. 2002; Kivetz & Simonson 2003; Lewis
2004). In addition, the change of database constructs with the advances in technology
from buyer database to the customer database integrated with marketing activities (Shaw
and Stone 1988) at inexpensive costs, as well as the separation of consumer and business
markets also smoothed the progress of the evolution from database marketing to
relationship marketing.
Nowadays, a fully-developed CRM system provides a 360 degree view of
customer data. It involves using technology to organize, automate, and synchronize sales,
marketing, customer service, and technical support so that firms can track not only the
behavior and spending history of their customers but also the marketing activities,
customer services as well as the communications that companies have had with each
individual customer. Furthermore, the development of methods for differentiating
customer communications brings about the analysis of communication data from social
media sites such as Twitter, LinkedIn, and Facebook. Thus companies can track and
communicate with customers who share opinions and experiences about their company,
products, or services, even when the sharing has taken place in a non-company forum.
4
The challenge facing both marketing practitioners and academics is no longer the scarcity
of data but the tremendous abundance and variety of data that CRM systems generate.
The explosion of data has forced market researchers to seek new techniques, which have
been borrowed from machine learning, computer science, and database management.
The goal of CRM remains to achieve customer loyalty and to maximize customer
lifetime value. Modern CRM systems attempt to realize these goals by approaching each
stage of the customer’s relationship with the company as separate “stage”: customer
acquisition, customer retention, and winning back lost customers. All activities
performed during each of these three stages are of great importance since customer
lifetime value can be maximized only when the allocation of resources are balanced in
these activities (Reinartz, Thomas, & Kumar, 2005). Although researchers have
recognized the importance of each stage, the limited availability of CRM data in the past,
particularly the focus of most company databases solely on existing customers, has led
the majority of academic research in marketing to address customer retention being
disproportionately focused on. Customer winback (or recapturing of lost customers) is
extremely under-researched but can provide huge return on investment to companies
(Thomas, Blattberg, & Fox, 2004). Even though there is some existing research done on
examining customer acquisition and retention simultaneously, none of the existing
research has investigated all three stages of CRM activities together. With gratitude to
The Wharton Customer Analytics Initiatives (WCAI), this dissertation is endowed with
the access to the CRM database of a large North American media company that covers all
three stages of CRM activities, as well as the text data from the call center notes.
5
Equipped with this rich, multi-stage subscriber interactions data, I apply classification
methods from machine learning to predict whether customers will convert, churn or
reactivate their service based on their past usage patterns, their text communications with
customer services, the firm’s marketing actions, as well as customers’ self-reported
deactivation reasons. With the aid of the variable importance from random forests and
gradient boosting, I am able to identify the most important factors that impact each of the
three stages of customer relationship management, namely customer acquisition,
retention, and winback of lost customers, and I am also able to explore how each factor
evolves throughout these stages with regard to their effects on shaping customers’
decision making processes of whether to convert to paid customer, to churn, or to
reactivate their service with the company.
Achieving these research goals makes both theoretical and managerial
contributions. Theoretically, this is by far the most comprehensive analysis of customer
relationship management in the academic realm in terms of data range and variety, as
well as the possible factors that impact customer acquisition, retention and winback. For
the first time in marketing literature, different factors such as customer duration,
marketing communication channels, customer responsiveness to the marketing efforts,
customer satisfaction, customer self-reported deactivation reasons, pricing, service
quality, customers’ usage of self-service, as well as text mining of call center notes are
empirically examined together throughout the three stages of CRM. Moreover, given that
current research on winback is scant, this study enriches the literature by exploring how
factors affecting customer conversion and retention might also impact customer winback.
6
Managerially, understanding how different factors affect different stages of
customer relationship management, as well as which factors are more influential than
others, is critical to firms because companies are frequently faced with the problem of
limited resources and how to optimally allocate them. If it is marketing activities that
play a bigger role in acquiring, retaining and winning back lost customers, firms should
maintain or increase their budget to marketing initiatives. If customer heterogeneity in
their responsiveness to marketing communications, in their usage of self service channels,
or in their sensitivity to price can better forecast customers’ future behavior, this
dissertation can identify those customer characteristics and thus facilitate companies to
target the right customers at the right time. If the service dissatisfaction, product
dissatisfaction, or competition revealed from the lost customers’ self-reported
deactivation reasons turn out to be the major factors that drive customers to discontinue
their services with the company, managers may find it profitable to improve their service
or product quality. If the text mining of the customer call center notes is more predictive
of customers’ decision making in terms of whether to continue or terminate their
relationship with the company, firms should invest more in understanding their customers’
direct feedback. Furthermore, this dissertation assists managers to find answers to the
following relevant questions: 1) among all the marketing communication channels that
they apply to reach out customers (e.g. email, direct mail, or phone calls), which ones are
the most effective? 2) Which customers’ own activities such as contacting the customer
service center or employing online self service are more indicative of their future
behavior? 3) Should they apply the same strategy to recapture the customers who never
7
converted to a paid customer and the customers who churned after being a paid one? This
dissertation sheds light on all of these important issues, and it should aid firms in
deciding whether they should attach more importance to their own marketing initiatives,
service quality, or customers’ activities and characteristics.
The subsequent chapters of this dissertation are organized in the following
manner: chapter 2 reviews the literature in customer relationship management and text
mining; chapter 3 lays out the conceptual hypothesis; chapter 4 discusses the
methodology that will be applied; chapter 5 describes the data and preprocessing steps;
chapter 6 discusses the results and implications; and finally chapter 7 points out the
limitations of the research and provides future direction and conclusions.

8
CHAPTER 2
LITERATURE REVIEW
2.1 CRM (Customer Relationship Management)
Drawing from the CRM literature and based on my research objectives, this
review focuses on the contributing factors that extant research has revealed in influencing
the three stages of a company’s CRM activities: 1) customer acquisition, 2) retention and
3) winback of lost customers. I also discuss how prior researchers treat the relationship
among customer acquisition, retention and winback from an analytical point of view.
2.1.1 Customer Acquisition and Conversion
The customer acquisition literature is closely related to that of new product
adoption and diffusion of innovation (Bass 1969, Mahajan, Muller, & Kerin, 1984;
Norton & Bass 1987; Davis 1989; Davis et al., 1989; Rogers, 1995; Zhu et al., 2003; Zhu
& Kraemer, 2005; Campbell & Frei, 2010). Early research investigates what factors are
related to aggregate product adoption; for example, in the Bass model, Bass (1969) uses
the product characteristics and the number of previous adopters to predict product
adoption. Mahajan et al. (1982) also discover that the pricing package for new products
has a major impact on their adoption and retention. Subsequently, using individual data,
researchers find that individual characteristics such as demographics play as important a
role in product adoption as product characteristics and marketing efforts (Chatterjee &
Eliashberg, 1990; Sinha & Chandrashekaran, 1992; Goolsbee & Klenow, 2002). Lewis
(2006) in particular reveals that acquisition discount depth is negatively related to repeat-
9
buying rates and customer asset value using customer-level data from a newspaper and an
online grocery store.
In investigating what factors influence customer service adoption, Villanueva, et
al. (2008) study the impact of marketing-induced versus word-of-mouth customer
acquisition on customer equity growth and find out that customers acquired by marketing
actions are of less value to the firm compared to customers persuaded by word-of-mouth;
this is because the former only adds more short-term value, but the latter adds more long-
term value to the firm. Musalem and Joshi (2009) identify the relationship between
customer responsiveness to a firm’s CRM efforts and customer acquisition and retention.
They recommend that firms invest more in moderately responsive customers instead of
the high responsive customers since the latter are also pursued by competing companies
which would lead to the erosion of the effects of an individual firm’s CRM efforts. Using
a nine-year period of U.S. airlines industry data, Grewal et al. (2010) discover that
heterogenity in customer satisfaction impacts both acquisition and retention sales. Nam et
al. (2010), on the other hand, explore the role of word-of-mouth resulting from service
signal quality as well as advertising and the retail environment on service adoption.
Finally Xue et al. (2011) relate customer demand for banking services, the availability of
alternative channels, customers' efficiency in service coproduction, and local Internet
banking penetration to Internet banking adoption.
The study of free trial and subsequent conversion can be dated back to 1974 when
Scott (1974) examines whether offering a two-week free newspaper trial can induce a
higher probability of conversion to a six-month newspaper subscription compared with

10
the control group. Research on conversion explores how free samples can help induce
paid purchases (Gedenk and Neslin 1999; Bawa and Shoemaker 2004). Contrary to
Scott’s negative results, Gedenk and Neslin show positive short-term and long-term
effects of free samples on acquiring customers. In exploring what factors have
contributed to customer conversion from free to fee, Pauwels and Weiss (2008) find that
adjusting price promotion, email and search-engine referrals can all facilitate customer
conversion.
Table 1 Factors affecting Acquisition and Conversion
Factors Articles
Acquisition
product characteristics, pricing Bass (1969), Mahajan et al. (1982)
individual characteristics Chatterjee and Eliashberg 1990, Sinha and
Chandrashekaran 1992, Goolsbee and Klenow
2002, Xue et al. 2011
marketing efforts Lewis (2006)
word-of-mouth Villanueva, et al. (2008), Nam et al. (2010)
customer responsiveness to a Musalem and Joshi (2009)
firm’s CRM
customer satisfaction heterogenity Grewal et al. (2010)
customers' efficiency in using Xue et al. 2011
self-service channels, local
Internet banking penetration
Conversion
free trial, free samples Scott (1974), Grewal et al. (2010)
price promotion, email and Pauwels and Weiss (2008)
search-engine referrals
11
In summary, prior research has uncovered a wide range of factors that could affect
customer acquisition or conversion: from product characteristics, marketing efforts (such
as pricing strategy, advertising and promotion), customer heterogeneity in demographics,
in responsiveness to CRM efforts, in satisfaction level, in usage of self-service channels,
to word-of-mouth as signal of service quality (See Table 1 above). In this dissertation, I
examine these factors all together and rank their importance so that I know, at least which
customer acquisition factors are more important than others.
2.1.2 Retention and Churn
Customer retention and churn has long been a concern for the service industry,
and marketers and marketing researchers have tried to understand the underlying
mechanisms. Two research streams can be identified in the customer retention literature
and they are closely related to one another: 1) research exploring how some factors can
increase or decrease the customer retention rates and 2) research studying why customers
churn, or discontinue their relationship/service with the company. In essence, these two
streams aim for the same goal: to retain customers as long as possible so that companies
can maximize customer lifetime value. Identifying the churn factors and finding out
strategies to offset it enables better retention.
In examining the potential factors and their relationship with customer retention,
McGahan and Ghemawat (1994) analyze life insurance data and find that in very
competitive environments, large firms achieve greater customer retention than their
smaller rivals. Ganesh, et al. ’s (2000) paper also discloses that customer heterogenity can
12
result in different retention rates. Rust and Zahorik (1993), Bolton (1998) and Gustafsson
et al. (2005) discover that customer satisfaction levels are positively related to the
duration of customer retention. In the meantime, Gustafsson et al’s (2005) examination of
the impact of affective commitment (a emotional factor that indicates the degree of
reciprocity or personal involvement in the relationship between a customer and a
company), calculative commitment (which signifies that the customer-firm relationship is
based on economic dependence on the product benefits), as well as prior churn on
retention reveal consistent effects of calculative commitment, and prior churn on
retention. While both studying the effects of dual pricing, namely access and usage
prices for subscription services on customer retention, Danaha (2002) finds that access
rates have much stronger effect on retention than usage price. Lambrecht (2006)
discovers that flat rate does not increase churn rate, but pay-per-use increases churn rate.
Some researchers have examined the role of the channel of acquisition in CRM.
For example, online versus traditional channels could have different impacts on customer
retention, and Hitt and Frei (2002) show that retention is marginally higher for customers
using online channels. In exploring how service quality can affect retention, Boulding et
al. (1993) link perceptions of the dimensions of service quality to a person's overall
quality perception to predict customer retention; the researchers find that service quality
perceptions positively affect intended behaviors. Zeithaml, Berry, and Parasuraman (1996)
show that service quality affects subscriber behavioral intentions with respect to the
service, which in turn affects their retention decisions.

13
Using a sample of retail banking customers observed over a 30-month period at a
large U.S. bank, Campbell and Frei (2010) discover that customers’ usage of self-service
channels are positively associated with customer retention. With the subscription
databases of seven services offered by a telecommunications provider, Schweidel, Fader,
and Bradlow (2008) apply five factors to forecast customer retention behavior: (1)
duration dependence, (2) promotional effects, (3) subscriber heterogeneity, (4) cross-
cohort effects, and (5) calendar-time effects (e.g., seasonality). The results show
promotional effects significantly improve the forecast accuracy of retention behavior
across all seven services. Customer heterogeneity, calendar-time effects, and duration
dependence only improve in five services, while cross-cohort effects are insignificant in
all services.
In exploring what factors have contributed to customer churn or switching service,
Zeithaml, Berry, and Parasuraman (1996) look at whether service quality on particular
behaviors signals customer switching and they find that customers’ behavioral intentions
are strongly influenced by service quality. Bolton (1998), Verhoef (2003), and
Gustafsson et al. (2005) examine whether customers’ dissatisfaction with service and
commitment was related to customer churn.
Keaveney (1995) conducts a field study which identifies more than 800 critical
behaviors of service companies that could cause customers to churn. The author groups
these 800 critical behaviors into 8 general categories: 1) pricing (which includes the
subcategories of high prices, price increases, unfair pricing practices, and deceptive
pricing practices); 2) inconvenience (such as inconvenient service provider location,

14
hours of operation, waiting time for service or to for an appointment, etc.); 3) core service
failures caused by mistakes, billing errors or technical problems; 4) service encounter
failures (which is the interaction failure between customers and customer service
personnel at company caused by employee attitude or inability to provide service that
customers expect); 5) employee responses to service failures; 6) competition; 7) ethical
problems (such as dishonest or intimidating behavior, unsafe or unhealthy practices, or
conflicts of interest) ; and 8) involuntary switching. Most of these factors were echoed or
further studied by other researchers (e.g., Shaffer and Zhang (2002) also find that
customers churn due to incentives promoted by competitors).
Braun and Schweidel (2011) classified the causes of churn into three categories: 1)
controllable factors such as price, dissatisfaction with service, or with the product; 2)
uncontrollable reasons which include moving from the service area and
death/divorce/family issues; and 3) firm-initiated churn which is due to customer
nonpayment or abuse of service. The researchers use the data from a land-based
telecommunications services organization to test their competing-risk model. They
conclude that the ability of a firm to reduce customer churn is diminished by the
pervasiveness of uncontrollable influences which have a "dampening effect" on the return
of a firm's retention marketing efforts.
Other factors that are found to cause churn include customer heterogeneity
(Morrison and Schmittlein 1980), payment equity (Bolton, Kannan, and Bramlett 2000;
Bolton and Lemon 1999), loyalty programs (Bolton, Kannan, and Bramlett 2000;
15
Verhoef 2003), marketing activity (Lewis 2004, 2005), retention duration (Hughes 2006;
Reichheld, 1996), and multichannel communication (Godfrey, et al. 2011), to name a few.
In summary, prior research shows that the factors that influence a consumer’s
decision to remain a customer with a company or to churn are similar to the ones that
have impact on customer acquisition. Customer heterogeneity, marketing initiatives, as
well as customer satisfaction and perceived service quality all impact both acquisition
and retention. However, no prior studies have examined all of these factors concurrently
in both of these stages or in the winback stage. This study fills this gap by exploring how
these same factors behave in different stages of customer relationship management.
Table 2 Factors affecting Retention and Churn
Factors Articles
Retention
Firm size in competitive environments: Mcgahan and Ghemawat (1994)
large vs. small
Customer heterogenity Ganesh, et al. (2000), Schweidel, Fader,
Bradlow (2008)
Customer satisfaction levels Rust and Zahorik (1993), Bolton (1998) and
Gustafsson et al. (2005)
Affective commitment, calculative Gustafsson et al. (2005)
commiment, prior churn on retention
Access rate vs. usage price Danaha (2002)
Online vs. traditional channels Hitt and Frei (2002)
Service quality Boulding et al (1993); Zeithaml, Berry, and
Parasuraman (1996)
Self-service channels Campbell and Frei (2010)
Duration dependence, promotional Schweidel, Fader, and Bradlow (2008)
effects, cross-cohort effects, calendar-
time effects (e.g., seasonality)
Churn
Flat rate vs. pay-per-use Lambrecht (2006)
16
(Table 2, continued)
Pricing , inconvenience, core service Keaveney (1995)
failures caused by mistakes, billing
errors or technical problems, service
encounter failures, employee responses
to service failures, ethical problems,
involuntary switching
Competition Keaveney (1995), Shaffer and Zhang (2002)
Controllable factors, uncontrollable Braun and Schweidel (2011)
reasons, firm-initiated churn
Customer heterogeneity Morrison and Schmittlein 1980
Payment equity Bolton, Kannan, and Bramlett 2000; Bolton
and Lemon 1999
Loyalty programs Bolton, Kannan, and Bramlett 2000; Verhoef
2003
Marketing activity Lewis 2004, 2005
Retention duration Hughes 2006; Reichheld 1996
Multichannel communication Godfrey, et al 2011
2.1.3 Winback
Even though I believe that each of the CRM stages from acquisition to retention
to winback are all important to any firm that intends to maximize their profitability with
their customers, winback or the reacquisition of lost customers is by far the least
researched area among the three. As Griffin and Lowenstein (2001) and Thomas, et al.
(2004) point out, the importance of allocating more resources to winning back customers
is apparent: the probability of a firm winning back a churned customer and the net return
on investment can be eight times as high as winning a new customer. However, very
limited research has explored which factors can facilitate firms winning back lost
customer except Stauss and Friege (1999) and Thomas, et al. (2004). Stauss and Friege
(1999) develop the second lifetime value (SLTV) metric to facilitate evaluating the NPV
17
generated after a customer has been won back and point out that not every customer
should be won back. Thomas, et al. (2004) focus on the reacquisition pricing strategy and
find out the optimal pricing strategy for a firm to recapture their churned customers
involves a low reacquisition price and higher prices when customers have been
reacquired.
Even though both customer heterogeneity and pricing strategy are important
factors in winning back lost customers, I assert that other factors that have be influential
in shaping customer acquisition and retention should also be examined for their effect on
winback. Therefore, in this dissertation, I explore how prior marketing activities, prior
customer usage of self service channels, prior deactivation reasons, customer
responsiveness to marketing initiatives as well as the prior call center notes by customers
affect the odds of winning back customers. Particularly in this study, through the rich
data, I classify the lost customer into two categories: those who tried the product for some
trial period but decided not to convert being a paid customer, and those who have become
a paid customer for some time and then decided to churn. Making this distinction in the
lost customer base is of great importance to both academics and managers because it
allows us to assume that there are different underlying factors that influence recapturing
these two different kinds of lost customers. Hence this research can assist companies in
allocating their resources so that they can achieve a greater return on investment.
Table 3 Factors Affecting Winback
Factors Articles
Winback
18
Customer heterogeneity Stauss and Friege (1999)
Optimal pricing strategy for a firm to Thomas, et al. (2004)
recapture their churned customers
involves a low reacquisition price and
higher prices when customers have
been reacquired
2.1.4 Acquisition, Retention and Winback
In addition to exploring possible factors and their relationship with customer
acquisition, retention and winback, prior researchers also have had extensive discussions
about how these three events are related, especially between customer acquisition and
retention. Two absolutely opposite assumptions about their relationships have been made.
One assumption is that customer acquisition and retention are two independent events
(Blattberg and Deighton 1996; Gupta, Lehmann et al. 2004). The other assumes that the
customer acquisition process affects the customer retention process (Hansotia and Wang
1997; Thomas 2001; Reinartz, Thomas et al. 2005; Schweidel, Fader, Bradlow 2008).
Blattberg and Deighton (1996) find that independently developed acquisition
response and retention curves can appropriately relate marketing expenditures to
customer acquisition and retention rates. Thomas (2001), on the other hand, develops a
model for estimating the length of a customer's lifetime linking customer acquisition to
customer retention and shows the financial impact of not accounting for the effect of
acquisition on customer retention. Gupta, et al. (2004) also study the customer value by
treating customer acquisition and retention as two independent events, but Schweidel,
19
Fader, and Bradlow (2008) explore the duration dependency between acquisition and
retention.
No matter whether we believe these events are independent or not, the consensus
is that we should truncate and censor the data (Bolton's 1998, Schmittlein and Helsen
1993) in order to separate acquisition data from retention data for further statistical
analysis. In this study, I hold that all CRM events are related, particular acquisition
activities affect customer retention, both acquisition and retention efforts have an impact
on winning back lost customers.
In the following section, I briefly review the text mining and text classification
methods that will be used.
2.2 Text Mining
Today about 80% of the data held within an organization is in the form of text
documents —for example, reports, Web forms, open-ended survey responses, news feeds,
e-mails, and call center notes, etc. Texts are essential for an organization to gain a better
understanding of their customers’ behavior and the abundance of the text data forced
organizations to seek ways to explore and leverage this information. Text mining is a
technology that aims to capture key concepts and themes and to uncover hidden
relationships and trends from collections of textual materials without requiring
knowledge of the precise words or terms that authors have used to express those concepts.
Text mining is one branch of data mining, or the analysis steps of Knowledge Discovery
in Databases (KDD) process that results in the discovery of new patterns in large data
sets. The data that text mining deals with are usually referred to as unstructured data
20
since the data cannot be stored in a relational database and thus cannot be structured by
categorical, ordinal, and continuous variables. Data consisting of both structured and
unstructured data are called semistructured data.
Text mining usually involves two steps: the first step transforms the textual data
into structured data with specialized text analytic techniques; and the second step applies
learning algorithms such as clustering, classification or predictive modeling to explore
relationships within the data. Sometimes the transformed data can be combined with
other structured data to make predictions of the future behavior. In this dissertation, I use
semi-structured data to predict customer acquisition, retention and winback.
2.2.1 Transformation of Text
The primary problem with the management of all of the unstructured text data is
that there are no standard rules for writing text so that a computer can understand it. The
language, and consequently the meaning, varies for every document and every piece of
text. The only way to accurately retrieve and organize such unstructured data is to
analyze the language and thus uncover its meaning. There are several different automated
approaches to the extraction of concepts from unstructured information. These
approaches can be broken down into two kinds, linguistic and nonlinguistic.
2.2.1.1 Nonlinguistic Approach
Nonlinguistic solutions are based on statistics and computer technology. They
treat each document d j as an array of words and apply computer technology to quickly
scan and categorize key concepts within the text. These key concepts are called
21
vocabulary and they are used as the feature set to train the classifier. After key concepts
are identified in each document, one counts the number of times each feature word occurs
and calculates their statistical proximity to related concepts. From raw text documents to
the data matrix that is needed for further analysis, there are several preprocessing steps
that need to be taken:
1) The removal of function words: since not all words can be used to train the
classifier, some words or phrases have to be removed. The first of these words are
function words. Function words are also called stop words, which include: auxiliary verbs,
conjunctions, articles, prepositions, etc. These words appear frequently in most text
documents but they do not contribute to the training of the classifier, therefore they need
to be removed before the next preprocessing step.
2) Stemming: the process of grouping the words which share the same
morphological root and misspelled words. For example, words “precise”, “precision”,
“preciseness” should be grouped as one word “precise”. Even though Baker and
McCallum (1998) argue that stemming can sometimes hurt effectiveness, the common
practice is still to adopt it since it can reduce both the dimensionality of term space and
the stochastic dependence between terms (Sebastiani 2002).
3) Representation of feature value: After feature words or phrases are determined,
I need to decide how to represent the feature values. One way is to use a binary vector,
assigning 1 to the feature value if the document contains the feature word and 0 otherwise.
Another way is to count the frequency of the feature words that appear in each document.
This, however, leads to problem of how to weight the feature words.

22
According to Salton and McGill (1983), there are four kinds of automatic
weighting systems to assign weights to feature terms extracted from the documents.
a) tf (term frequency) weighting system, which sets the weight of term k in
document i to be the frequency of occurrence of word construct k in document i.
Therefore, terms occurring in every document of a collection are treated equally with
those that occur in only a few documents.
WEIGHTik = FREQik
b) idf (inverse document frequency) weighting system, which takes into
account the number of documents that each term is assigned and assumes that the content
representation by the frequency of the term in the document is decreased by the
frequency of the number of documents DOCFREQk that each term is assigned to.
FREQik
WEIGHTik 
DOCFREQk
c) Term discrimination weighting system, which depends on the degree to
which the assignment of a term to the documents is capable of decreasing the average
distance between the documents. DISCVALUEk, the discrimination value of term k, is
calculated as the difference between two measurements of document space density,
corresponding to the densities before and after assigning term k.
WEIGHTik = FREQik .DISCVALUEk
d) Term relevance weighting system, which is based on probability indexing
theory stating that the best index terms tend to occur in the relevant documents with
respect to some query. If we define TERMRELk as the ratio of the proportion of relevant
23
items in which term k occurs to the proportion of nonrelevant items in which the term
occurs, then the term relevance weighting system can be expressed as:
WEIGHTik = FREQik .TERMRELk
The most commonly used weighting systems are actually the product of the
term frequency and the inverse document frequency ( tf  idf , or tfidf) and normalized
tf  idf , which incorporates the length of document vectors into the weighting system
(Salton and Buckley 1988).
4) Feature selection: since the number of extracted feature words for most
documents of a collection is several thousands, it can induce high dimensional term space
and overfitting problems. Feature selection is the name given to a broad set of methods
used to reduce dimensionality. It is a necessity in text classification. Feature selection is
an attempt to select, from the original set T , the set T ' terms (with | T ' | | T | ) that yields
the highest effectiveness for document indexing.
The most commonly used feature selection method applies an evaluation function
that to a single word (Soucy and Mineau, 2003). First, all words are evaluated and sorted
independently according to the assigned criterion. Then a predefined number of best
features are used to form the best feature subset. Individual words are scored using
measures such as mutual information, information gain, odds ratio,  2 statistics, term
strength and so on (Brank et al. 2002; Torkkola 2002; Forman 2003; Sousa et al. 2003).
These metrics have their origin in machine learning or text retrieval but there are also
new related stastistical regularization and variable selection methods such as LASSO
(least absolute shrinkage and selection operator, Tibshirani 1996) and elastic net (Zou &
24
Hastie, 2005). These techniques are similar to ridge regression. The lasso minimizes the
residual sum of squares, subject to the sum of the absolute values of the coefficients
being less than a constant. Thus it restricts some coefficients to be 0, which has the effect
of reducing the dimensionality of the data. An elastic net is the combination of the ridge
and lasso since it applies both L2 and L1 penalties for regularization. Elastic net is
particulary applicable for large datasets when the number of predictor variables is much
greater than that of the obervations.
However, even after all these procedures are performed, the text classification
completed by a nonlinguistic approach still suffers from limited accuracy. Due to the
large presence of polysemy (e.g. a word or phrase that has multiple related meanings),
homonymy (e.g. a group of words that share the same spelling and pronunciation but
different meanings), and synonymy (e.g. different words with same or similar meanings),
the original terms may not be the optimal dimensions for document content
representation. To compensate for this, linguistic approaches have been applied to help
reduce the dimensionality even further and hence substantially enhance the prediction
accuracy.
2.2.1.2 Linguistic Approach
Linguistics-based text mining, applies the principles of natural language
processing (NLP)—the computer-assisted analysis of human languages—to the analysis
of words, phrases, and syntax, or structure, of text. A system that incorporates NLP can
intelligently extract concepts, including compound phrases. Moreover, knowledge of the
underlying language allows classification of concepts into related groups, such as

25
products, organizations, or people, using meaning and context. This approach offers the
speed and cost-effectiveness of statistics-based systems, but it offers a far higher degree
of accuracy while requiring far less human intervention.
Two commonly used linguistic approaches to further reduce the dimension of the
text transformed data matrix are term clustering and latent semantic indexing. Term
clustering groups words with a high degree of pairwise semantic relatedness, so that the
groups of their centroids (or representative of them) may be used instead of the terms as
dimensions of the vector space. Term clustering addresses synonymy and it can be done
either by unsupervised clustering or supervised clustering. Unsupervised clustering
means clustering is not affected by the category labels attached to the documents (Lewis
1992; Li and Jain 1998). Supervised clustering gathers those terms that tend to indicate
the presence of the same category, or group of categories (Baker and McCallum 1998;
Tishby 2001).
Latent Semantic Indexing (LSI, Deerwester et al. 1990) is a technique developed
in Information Retrieval to address the dimensional problem deriving from the usage of
synonyms and polysemes. It infers the dependence among the original terms from a
corpus and transfers this dependence into the newly obtained, independent dimensions.
LSI applies singular value decomposition to the matrix formed by the original document
vectors to map it to the new vectors. Therefore, unlike term clustering, the newly
obtained dimensions by LSI are not intuitively interpretable. However, they work well in
bringing out the “latent” semantic structure of the vocabulary used in the corpus. The
drawback is that, if some original terms are particularly good for discriminating a
26
category, that discrimination power may be lost in the new vector space (Sebastiani,
2002).
After both non-linguistic and linguistic approaches are performed to transform the
original text data, the newly generated data matrix is ready for further text classification.
2.2.2 Text Classification
Text classification is also called text categorization, document classification, or
document categorization. According to Sebastiani (2002), there are two approaches to
text classification: knowledge engineering techniques and machine learning approaches.
Knowledge engineering techniques relies on expert system which typically consists of a
set of manually defined logical rules, one per category of type
If < DNF formula > then <category>.
A DNF (“disjunctive normal form”) formula is a disjunction of conjunctive clauses; the
document is classified under <category> iif (if and only if) it satisfies at least one of the
clauses. The most famous example of this approach is the CONSTRUE system (Hayes et
al. 1990) constructed by Carnegie Group for the Reuters news agency. DNF systems
suffer from the same problems as other expert systems, such as the well-studied
“knowledge acquisition bottleneck”, which refers to that knowledge acquisition is the
major impediment and most time-consuming part of development of an expert system.
Machine learning methods can be used to overcome the bottleneck problem, because
machine learning methods do not require antecedent expert knowledge, but rather rely
solely on the data itself to build the classifiers. For this reason, machine learning methods
have progressively increased in popularity and dominance in most text classification

27
applications. In Chapter 4 I will give a detailed description of the two machine learning
methods used in this study. Below I talk about two other issues in text classification:
binary verses multiclass text classification and single-label vs. multi-label text
classification.
2.2.2.1 Binary vs. Multiclass
Most text classification research today is about binary coding of content since the
most important text classification applications are binary classifications. For example,
filtering is used to decide whether document I is about one particular category or not
(Sebastiani 2002). In addition, binary classification algorithms serve as the basis for
multiclass classification. Most multi-class classifiers are built as extensions of their
binary counterparts. They decompose the multi-class classifier into binary classification
using either a one-against-all or all-against-all strategies. One-against-all method uses the
training set to build one classifier per class and to distinguish the samples in a single class
from the samples in all remaining classes. All-against-all method builds classifier for
each pair of classes. There are also direct approaches to build multi-class classifications,
such as nearest neighbor, the generative approach and naïve Bayes.
2.2.2.2 Single-label vs. Multilabel Text Classification
Another important dimension of text classification is the distinction between
single-label and multilabel classification. In single-label cases, one document is assigned
to only one category, therefore it is also called nonoverlaping category classification. In
multi-labeling cases, any number of categories from 0 to C can assigned to the same
document hence it is also known as overlapping category classification. An example of

28
multi-label classification is the categorization of movies. For example, the best motion
picture award winner at the year 2012 Academy Awards, the Artist, can be classified into
three categories simultaneously: comedy, romance and drama. Multi-label classification
methods have been increasingly called for by the modern applications such as music
categorization (Li & Ogihara, 2003), semantic scene classification (Boutell et al., 2004)
and protein function classification (Luo & Zincir-Hyewood, 2005).
There are several approaches to multi-label classification (Tsoumakas 2007): 1)
ranking, to order a set of labels L, so that the topmost labels are more related to the new
instance; 2) hierarchical classification, to label a data set which has a hierarchical
structure; if each document is labeled with more than one node of the hierarchical
structure, then it is called hierarchical multilabel classification; 3) multiple-instance
learning, to learn a concept given a set of positive and negative of instances
simultaneously (Maron & Lozano-Perez, 1997).
The extant multi-label classification methods can be grouped into two categories:
problem transformation methods and algorithm adaptation methods. Problem
transformation methods transform the multi-label classification problem either into one or
more single-label classification (Boutell et al., 2004) or regression problems. Algorithm
adaptation methods adapt machine learning algorithm to multi-label cases. For example,
AdaBoost.MH and AdaBoost.MR (Schapire & Singer, 2000) are two extensions of
AdaBoost (Freund & Schapire, 1997) for multi-label classification. Other algorithm
adaptation methods include ML-kNN (Zhang & Zhou, 2005) which is an adaptation of
the kNN lazy learning algorithm for multi-label data, and improved SVMs using ranking
29
(Elisseeff & Weston, 2002) and stacking (Godbole & Sarawagi, 2004). In this
dissertation, I apply text mining techniques to examine the text notes from the call center
of a large media and entertainment company to explore if the text data can help predict
whether customers will convert from free to fee, to churn or be won back.
30
CHAPTER 3
HYPOTHESES
3.1 Self-care
Self-care, or self-service is one of the customer contact modes that customers use
in order to interface with service providers. Kellogg and Chase (1995) define customer
contact as the function of the interaction between a customer and a service provider.
Based on whether customer contact is technologically involved, Froehle and Roth (2004)
classify the context under which customers and their service providers interact with each
other into five categories: 1) technology-free customer contact, in which a customer has
face-to-face service interactions (Chase 1978) with a human service provider or customer
representative. 2) technology assisted customer contact, where the customer
representative employs technology as an aid to improve the face-to-face contact, but the
customer does not have access to the technology. 3) technology-facilitated customer
contact, in which both customer representative and customers have access to the
technology during their face-to-face service encounter. 4) technology mediated customer
contact, where the customer and customer representative are not physically present with
each other so they communicate via a voice telephone call or online instant messaging, or
email; 5) technology-generated customer contact, such as bank’s ATMs, self-service
kiosks in the airport, automated car washes, and so on.
To encourage customer participation, to enhance operations efficiency and cost-
effectiveness, as well as to be more competitive (Fitzsimmons and Fitzsimmons, 2004;
Huete and Roth, 1988; Haynes and Thies, 1991; Hill et al., 2002; Oliveira et al., 2002;
31
Roth, 2000; Boyer et al., 2002; Menon, 2003; Karmarkar and Pitbladdo, 1995; Heskett et
al. 1997), more and more service companies are encouraging customers to adopt self-care
or self-service. This trend has been called the “self-service revolution”, which provides a
multichannel service delivery system with a collection of employee and self-service
channels in both physical and virtual environments (Xue, et al. 2007).
Different theories help to explain customers’ decision making process of whether
to adopt self-care or not, and of which ones to choose. Customers’ technology readiness
(Parasuraman 2000) states that there are four characteristics that indicate whether
customers are ready for technology or not: optimism, innovativeness, discomfort, and
insecurity about technology. Optimism means customers think technology is a good thing;
Innovativeness refers to “a tendency to be a technology pioneer and thought leader”;
Discomfort is “a perceived lack of control over technology and a feeling of being
overwhelmed by it,” and insecurity is defined as the “distrust of technology and
skepticism about its ability to work properly.” Based on this concept, if a customer is
optimistic and innovative, he or she will adopt the newer technology such as using online
self-care while if a customer is feeling discomfort and insecure, he or she may opt to pick
up a phone and talk to a customer representative. Moreover, the Theory of Reasoned
Action (Fishbein and Azjen, 1975) and Theory of Planned Behavior (Ajzen, 1985, 1991)
from psychology hold that an individual’s intention, their perceived control over their
behavior, their attitude towards the behavior, as well as the rational cognitive assessments
of the behavior are essential in their decision whether to take action or not. So when an
individual feels he or she has the control over whether to perform or not to perform the
32
behavior, the stronger the intention to engage in a behavior, the more likely the action is
taken. In the meantime, attitude, which is “a person’s general feelings of favorableness or
unfavorableness towards some stimulus object” (Fishbein and Azjen 1975), drives an
individual’s intentions for future behavior. Hence a onetime unhappy communication
experience with the service provider could result in customers’ attitude formation
towards the company and affect their future behavior and decision on whether they would
continue or terminate their service. Thus when customers first started to apply new
technology such as instant messaging or email to communicate with their service
providers a decade ago, companies regarded it as a challenge (Zeithaml et al. 2002).
In addition to customers’ heterogeneous behavioral characteristics, other factors
such as the availability of different channels, the cost and benefits of using different
channels, as well as the different activities that they perform through these channels also
impact their choice of self-care channel. If the activity they need to perform is to make
payment and it can be accomplished by simply logging onto their online account,
customers are not likely to bother to call a live agent.
Even though customers’ choice of whether to utilize self-care and which channel
to use is not intended to maximize service provider’s profits, prior studies have found a
causal link between utilization of self-service channels and the relationship between
customers and the service provider. For example, Hitt and Frei, (2002), Campbell and
Frei (2010), Xue et al. (2007, 2011) all find that employing a self-care channel can
increase customer retention rates. Based on these past findings and theories related, I
assume, given the different channels that customers can employ (e.g online help (CUWI),
33
interactive voice response with call center (IVR), making calls to a live agent in a call
center, and different activities they perform, such as changing account information,
making payment, or changing service (e.g. upgrading or downgrading service plans), that
not only the employment of self-service but also the interactions between self-service
channels and activities are significant in predicting customers’ decision making of
conversion, churn and reactivation with their service provider. Furthermore, if a customer
utilized self-care channel during the trial period, and he or she converted to a paid
customer, it indicates a positive attitude being formed and a positive impact on his or her
future behavior. Thus, I expect that the customer will be less likely to churn and more
likely to be won back if for any reason they discontinued their service with the service
provider:
H1a: The employment of self-care channel in the conversion period and churn period
increases the likelihood of customer conversion, non-churn and subsequent reactivation
with the company.
H1b: The interaction of self-care channels and customers’ activities in both trial period
and churn period are highly predictive of customer retention and winback.
3.2 Marketing Initiatives
Customers choosing different self-care channels to communicate with their
service-providers are just one direction of the interactions between customers and firms.
To acquire customers more effectively, to maintain a long term relationship with them
afterwards, as well as to regain lost customers so that customers’ lifetime value with the
company can be maximized, firms also reach out to customers using various marketing
34
initiatives. Past literature has shown that marketing actions are typically effective in
customer acquisition, retention, and winback (Bass 1969; Mahajan et al. 1982; Berger et
al. 2002; Danaher 2002; Lewis 2004, 2005, 2006) and such actions include pricing,
promotions and marketing outbound communications, and so on. In this dissertation, I
look into the following marketing initiatives and their respective impact on acquisition,
retention and winback: marketing communication efforts and customers’ responsiveness,
promotion depth and pricing.
3.2.1 Marketing Communication Channels and Customer Responsiveness
Marketing communication through different channels has become an
indispensible part of marketing strategy. With more availability of communication
channels to contact customers, the problem that firms are faced with is which channel
they should utilize to interact with which customers. One consideration is the cost-benefit
analysis of each communication channel and the other is the effectiveness of each
communication channels or the mix of them. Traditional communications media either
cannot be customized to meet each customer’s demand (e.g. television) or is expensive to
implement (e.g. sales forces, direct mail). With the availability of electronic media,
companies tend to send e-mails to customers to welcome them to join the club, to instruct
them how to start their service, to remind them of the service or payment due, or to send
promotional offers at certain occasions, based on the customer information stored in their
database. However, do emails work better than direct mail or telemarketing? What is the
effect when a firm applies all three communication channels to reach the same customer?
35
Prior researchers have explored the volume and mixed effect of communication channel
in non-service settings.
Two different findings regarding the effects of marketing volume in the
relationship between customer and firms have been disclosed. Venkatesan and Kumar
(2004) discover a positive linear relationship between the level of bidirectional
communication between customers and firms and purchase frequencies. Godfrey et al.
(2011) find that an inverted-U shape between customers repurchase and the volume of
three individual communication channels: telephone, e-mail, and direct mail. Godfrey et
al (2011) further explain these two different phenomena by two social norm theories:
reciprocal action theory and reactance theory. Reciprocal action theory holds that people
feel obliged to proportionally respond to goodness by goodness (Bagozzi 1995; Becker
1990). Therefore when firms invest more on customers by increasing marketing volume,
customers would reciprocate by repeated purchase. Reactance theory on the other hand
explains why excessive marketing communication volume would diminish the effect of
marketing efforts: when the volume of the marketing communication increases to a
certain extent, customers would perceive the incoming marketing communication efforts
as persuasive attempts to force them to make purchase (Clee & Wicklund 1980) thus
respond negatively to all kinds of marketing efforts such as personal selling (e.g.,
Wicklund, Slattum, & Solomon 1970), advertising (e.g., Robertson & Rossiter 1974),
direct marketing (e.g., Morimoto & Chang 2006), and rewards programs (e.g., Kivetz
2005).
36
However, what Venkatesan and Kumar (2004) and Godfrey et al. (2011) study as
the volume of communication is distinct from the total volume of marketing
communications defined in this dissertation. Venkatesan and Kumar (2004) count both
inbound and outbound communication by customers and the firm and Godfrey et al
(2011)’s volume is for each individual communication channel. What I refer to as the
total volume is the additive effect of multiple communication channels that occur in both
the conversion and retention period. Nonetheless, the reciprocal action theory and
reactance theory can be applied to explain the different effects of marketing
communications in different stages of CRM. In the trial period, when customers initially
receive the welcome or promotional communications from a firm, it helps them to learn
about the company and their products so the higher the communication volume, the
stronger the bond and thus higher probability for them to convert to a paid customer.
However, after customers have converted and started a steady relationship with the
company, the cumulative effect of the marketing efforts in both trial and paid period
makes the communication happening in the retention period easy to trigger the reactance
behavior and thus result in churn. As to winning back lost customers, since I use trial
period marketing volume to predict the probability of winning back unconverted
customers and both trial period and retention period marketing volume to forecast the
likelihood of recapturing churned customers, I would expect the same effects of
marketing communication volume on the winback period. Hence, I postulate the
following:
37
H2a: The total volume of marketing outbound in the trial period is positively associated
customers’ acquisition, retention and winback probability respectively.
H2b: The volume of marketing communication in the paid period, particularly the ones
closer to the churn date or right censored date, is positively associated customer churn.
Customer responsiveness to firm’s marketing efforts is an indicator of customers’
interaction level with their service provider. The more responsive the customers are, the
stronger is the relationship between the customer and the firm, thus the more likely they
would convert from a free to fee customer, to be won back, and less probable to churn.
Accordingly I expect the following:
H3: The response rate of a customer during the both trial period and paid period is
positively associated with customer conversion and winback but negatively associated
with customer churn.
In addition to the total volume of contact channels, each individual channel (e.g.,
telephone, direct mail, e-mail) has been characterized as more or less interpersonal.
Email is viewed as the least interpersonal communication channel to reach customers
since most of them do not require any customers’ involvement but to open and read it. In
addition, due to the cheap cost associated with sending out emails, customers have been
bombarded with all kinds of commercial emails and would simply discard them as junk
mail and never care to open. Prior research shows that a more involving and
interpersonal contact channel have a much higher conversion rate on average than a less
involving contact channel (Anderson and Narus 1999, p. 302). Moreover, compared with
inexpensive and prevalent email communications, customers feel that they are more
38
valued or cared about by the company when they receive direct mail since it shows that
the company is willing to invest in their relationship. As the direct mails are more and
more replaced by emails, the utilization of direct mail may actually enhances
relationships and thus have a better chance to retain them. Finally, regarded as the most
interpersonal channel among the three, the telephone channel requires both a service
company’s personnel and customers’ involvement. Customer representatives’ attitude as
well as their ability to solve customers’ issues varies among different representatives,
which could create uncertainty and negative impact on customer perception in firm’s
service quality if an encounter did not have a satisfactory outcome. Since each
communication channel has its own advantages and disadvantages, I expect different
marginal effects from each of them.
Adding to the complexity of the different impact of each individual
communication channel is the synergy effect that previous researchers have studied
(Reinartz, Thomas, and Kumar 2005, Godfrey et al. 2011). The common understanding is
that the interaction effect is greater than the sum of individual channel effect and previous
researchers again use reciprocal action theory and reactance theory to explicate the two
possible outcomes of the interaction effects of multichannel communication: the initial
combination of multichannel communication could elicit customers’ good feelings of
being invested but excessive communication coming in all kinds of forms can drive
customers away from rather than closer to a company. Therefore I postulate the following:
H4a: Different communication channels have different effects on customer conversion,
retention and winback.

39
H4b: The interaction effect of multichannel communication is strongly significant in
predicting customer conversion, churn and winback.
3.2.2 Radio Density
Radio density is defined as the total number of radio stations in each zip code
which captures the regional differences in terms of product availability and product
variety in the telecommunication company studied. Product variety provided to
customers during their trial period largely impacts customers’ experience with the firm’s
products and thus affects their decision whether to convert from a free trial customer to a
paid customer. However, if a customer has opted to terminate their service with the firm,
the factor of radio density should be not significant in affecting their decision to
reactivate service with the company or not since the radio density is static for long period
of times.
H5a: The radio density is positively related with the probability of customer conversion,
and negatively related to customer churn.
H5b: The radio density is not significant in predicting winning back both of unconverted
and churned customers.
3.2.3 Promotional Depth
Promotional depth is one of the factors that early researchers have explored in its
association with customer lifetime value. Two contradictory findings concerning its
effect on customer lifetime value have been revealed. Analyzing the long-term behavior
of some newspaper and online grocery customers and their acquisition discount,
Anderson, et al. (2004) find a negative relationship between acquisition discounts and
40
customer value; in the mean time, Anderson and Simester (2004) discover that customers
acquired through catalogs with more discounted items have higher long-term value.
Despite the disparity of the two outcomes, prior researchers are in agreement that deep
price cuts can increase the temporary sales for the first-time customers but decrease
future purchases by established customers. Applying this concept to the three stages of
CRM, one can reasonably expect that the promotion given out in the trial period would
encourage customers to convert, however, the discounts that feed the established paid
customers signal the segment of customer base that are price-sensitive and they are at
greater chance to churn once the promotional discount come to a halt. Furthermore, once
the customers who have enjoyed too many promotions churned, they would be hard to
gain back. In this dissertation, the promotional depth is denoted by two terms: the number
of promotional discounts which shows how many times a customer was offered the
promotional price, and the price range which indicate how deep the price cut a customer
received during the retention period. Hence I hypothesize:
H6a: Promotional discounts are positively associated with customer churn but negatively
associated with winback;
H6b: There is a positive relationship between price range and customer churn but a
negative relationship between price range and customer winback.
3.3.4 Pricing and Churn and Winback
Pricing is another “old” topic that researchers first started with to explore its
relationship with acquisition (Bass 1969; Mahajan, Muller et al. 1984; Norton and Bass
1987; Davis 1989; Davis, et al. 1989; Rogers 1995) and it is closely related to promotions.
41
Most extant literature focuses on what is the optimal pricing strategy to acquire new
customers and even win back lost customers (Thomas, et al. 2004). This dissertation
examines how the average price that a customer pays for his or her overall services with
the service provider each month helps to predict their probability of churn or future
winback. The average price that a customer pays is connected to classifying customer
base that a company has and company usually would regard the higher price customers as
high value, while the ones who pay low average prices as low value customers. Therefore
studying how much the price a customer pays on average and its relationship with their
CRM targeting strategy is important in targeting at the right customer at right time.
Therefore one can expect that the higher price a customer pays each month, the happier
he or she is with the service that the company provides, and the lower probability he or
she would churn. However, once a customer churns, firm’s strategy to win back lost
customers is usually to offer deeper discounts (Thomas, et al. 2004) than to existing
customers. The customers who used to enjoy the discounts, or are of lower customer
value to the company, maybe enticed to accept the new discount and reactivate their
service with the company. As to the high value customers, since they are mostly loyal,
companies tend not to offer discounts to them until they decide to terminate the service
one day. The deep discount that the company offers to win them back could be tempting
since now they discover they could actually enjoy the same service with much lower
price than they usually paid. Hence I postulate the following:
H7a: The average price a customer pays to his or her service provider is negatively related
to their churn;
42
H7b: The average price a customer pays before he or she churned has an U-shape
relationship with their probability of winback.
3.3 Deactivation Reasons
One very revealing information source is customers’ self-reported deactivation
reasons for relationship termination. Prior researchers have explored the various causes
that lead to customer churn (Keaveney 1995), and Braun and Schweidel (2011) even use
the multiple causes of churn to model customer lifetime. Combining both Keaveney
(1995) and Braun and Schweidel (2011)’s categorization methods, we group our
customer self-reported deactivation reasons into ten subcategories: "Service
Dissatisfaction", “Uncontrollable Reason”, "Product Dissatisfaction", "Cost/price/fee",
"Non Usage", "Prepaid period ended" (this one is grouped into “Free ended” for trial
deactivation reason), "Free ended", "Competition", "Unknown reason", and “NonPay”.
This grouping of the deactivation reasons covers from customer satisfaction for both
service and product, to competition, to customers’ usage of service, as well as to financial
reasons. They reveal whether these factors play role in influencing customers’ decision
making of whether to convert, churn or reactivate service with their company during each
stage of CRM. Particularly for conversion from free to fee, the reason of “Free ended”
should be the highly predictive of whether a customer converts or not. Therefore I
hypothesize the following:
H8a: Deactivation reasons should be among the top important factors that are predictive of
customer behavior of conversion, churn and being won back.

43
H8b: Among all deactivation reasons, “Free ended” plays a major role in why a customer
would not convert.
3.4 Duration on Winback
Prior researchers have examined the effect of retention duration on customer
churn (Hughes 2006; Reichheld 1996), and the impact of the acquisition duration on
retention duration (Schweidel, et al. 2008). They find that while acquisition duration has
a positive relationship with retention, retention has a negative effect on churn. Therefore,
the longer the trial period, the longer the subsequent retention duration. On the other
hand, the longer a customer stays with the company, the less likely he or she will churn.
Building on these research findings, I speculate that on the one hand, the longer the
acquisition duration and retention duration, the stronger the relationship between the
customer and company it indicates, therefore the more likely that a customer could be
won back once they discontinued their service for any reason. However, if a customer has
had too long history with the company, he or she tends to have experienced the product
of the company so well that once he or she decides to churn, it could imply that he or she
has already had it all so it will be hard to win them back. Therefore, I postulate the
following:
H9: There is an inverted U-shape relationship between the customer acquisition and
retention duration and the probability of winning them back.

44
CHAPTER 4
METHODOLOGY
4.1 Classification Methods
Based on the research objectives of this dissertation, as well as the four binary
outcome events that this study examines, namely, conversion, churn, winning back
unconverted customers and winning back churned customers, in this section, we discuss
three classification methods that are used in this study: Bayesian logit/probit, boosting,
and random forest. I focus on the binary class version of these methods.
4.1.1 Bayesian Logit/Probit Model for Classification
The standard approach in marketing for binary and multiclass classification are
generalized linear models. The commonly used generalized linear models include the
probit, logit, log-log and complementary log-log models. These models are all
exponential-family models and differ according to the choice of link function used to
transform the DV that will be expressed as a linear function of the Xs and they also may
differ on the assumptions underlying the regression structure on latent continuous data.
The most commonly used generalized linear models are probit and logit models, and
there is actually not much difference between them. The logistic distribution is similar to
the normal distribution except in the tails, which are considerably fatter (It more closely
resembles a t distribution with 7 degrees of freedom). Therefore, the two distributions
tend to give similar probabilities for the intermediate values of x '  such as between -1.2
and +1.2, where x is the vector of input variable and  are the parameters. When x ' 
gets extremely small or extremely large, the logistic distribution tends to give larger
45
probabilities to y  0 than normal distribution. However, unless the data is extremely
unbalanced, such as too few responses with y  1 or y  0 , I expect little differences in
predictions from the probit or logit models.
The Bayesian versions of generalized linear models for classification usually
employ Bayesian logit or probit models for binary outcomes, and Bayesian multinomial
logit or probit for polychotomous response data. There are several possible prior
assumptions for Bayesian probit or logistic models. Readers who wish to pursue a full
discussion of the intricacies of choosing a prior can consult Gelman, Carlin, Stern, and
Rubin (2004). However, rather than provide a comprehensive discussion of how to
choose priors in a Bayesian context, here I provide a brief justification of my chosen prior.
Binary classification - hierarchical logit model
Given a set of training data D  {( x1 , y1 ),( x2 , y2 ),...( xn , yn )} , let yi {0,1} denote
the class label for each binary outcome with 1 being the code for membership and 0 for
the nonmembership, and define x as the n  p matrix of standardized input variables, then
the logistic classifier takes the form:
exp(x'i β)
Pr( yi  1| xi )  Z ( x'i β)  (1)
1  exp(x'i β)
where β is the vector for parameters, and Z  Z1 ,...Z n is the latent variable. Each Z i is
independent and follows a normal distribution: N (xi'  ,1) , and
yi  1 if Zi  0
yi  0 if Zi  0
46
Therefore the decision boundary for classification is the set of points for which z(s)
are zero, or a hyperplane defined by {x | 0   T x  0} , and the conditional probability of
each outcome is ½.
The Bayesian approach to the logistic classifier assumes that each parameter  j
follows a prior distribution and this prior distribution is usually assumed to be normal
with mean  j and variance V :
1 (  j   j ) 2
p(  j |  j ,V ) ~ N ( j ,V )  exp( ), j  1,...d (2)
2 V 2V
Here I assume that each parameter  j is independent of all other parameters
conditional on hyperparameters. In this case, they are  j and V .
A full Bayesian approach specifies a second-stage prior on the hyperparameters to
make a hierarchical model. For the hierarchical logistic model with normal prior, I could
further assume that the hyperparameters  comes from a normal distribution and V from
inverse Wishart distribution of:
 ~ N (,V  A1 ),
(3)
V ~ IW ( ,V )
where A1  100I or larger to set a diffuse prior for the variance.
Since I assume that the  j are conditionally independent, then
p(  )   p(  j )
If I multiply it by the likelihood of the data, I get the posterior distribution of  :

47
I ni
y 1 yij
p(  | D)  p( D |  ) p(  )  {  pij ij (1  pij ) } p(  ) (4)
i 1 j 1
p( ) ~ N (  , A1 )
4.1.2 Boosting
Boosting is one of the most powerful learning ideas introduced in the last twenty
years and it was specifically designed for classification problems. It is designed to
combine the outputs of many “weak” classifiers in order to generate a powerful
“committee.” From this perspective boosting is similar to bagging and other committee-
based approaches which take a simple unweighted average of the predictions from each
model, essentially giving equal probability to each model. However this resemblance is
only on the surface and boosting is fundamentally different from the committee-based
approaches. For example, compared with bagging which uses bootstrap to sample the
training set, boosting has no randomness at all.
There are a number of boosting algorithms but the most popular one is called
Adaptive Boosting (AdaBoost) introduced by Freund and Schapire (1996). Later on,
researchers also explored some algorithms to modify the original AdaBoost algorithm
such as Gentle AdaBoost, Logit AdaBoost, and Real AdaBoost (Friedman et al. 2000).
Recent developments in this area include the regularization of the boosting algorithm
(Friedman 2001; Rosset, Zhu, and Hastie 2004), such as utilizing a learning rate
parameter to regularize the boosting when classification trees constitutes the base
classifiers. Another proposed algorithm for regularization, which is gaining popularity, is
called stochastic gradient boosting (Friedman 2002, Ridgeway 2006). It employs a

48
random permutation sampling strategy at each iteration to obtain a refined training set
and achieves significant computational savings.
Binary AdaBoost
The original AdaBoost algorithm by Freund and Schapire (1996) has two versions,
denoted as “AdaBoost.M1” and “AdaBoost.M2” respectively. These two versions are
equivalent in dealing with binary classification problems but differ in handling problems
with more than two classes. Here I only introduce “AdaBoost.M1” since Freund and
Schapire (1996) show that AdaBoost.M1 fits a forward stepwise additive logistic
regression model that minimizes the expectation of the exponential loss function, ,
with F(x) denoting the boosted classifier. Consider a two-class problem, with the output
variable coded as Y ∈ {−1, 1}. Given a vector of predictor variables X, a classifier C ( X )
produces a prediction taking one of the two values {−1, 1}. The error rate on the training
sample is
1 N
err   I ( yi  C ( xi )) ,
N i 1
(4)
and the expected error rate on future predictions is EXY I (Y  C ( X )) .
A weak classifier is defined as the one whose error rate is only a little better than
random guessing. The boosting algorithm has access to the weak learning algorithm and
sequentially applies the weak classification algorithm to repeatedly modified versions of
the data, thus producing a series of weak classifiers Cm ( x) , m  1, 2,..., M . The
predictions from all of the weak classifiers are then pooled through a weighted majority
vote to generate the final prediction:

49
M
C ( x)  sign(  mCm ( x)) (5)
m 1
Here 1 ,  2 ,...,  M denote the contribution (or weights) of each weaker learner Cm ( x) and
are computed by the boosting algorithm. The more accurate classifier receives higher
weights.
The data are also modified at each boosting step. Initially all training observations
( xi , yi ) , i  1, 2,..., N , were given the same weights, that is wi  1/ N , i  1, 2,...N . For
each successive iteration m  2,3,..., M , the observation weights are individually
modified so that the observations that were misclassified at the previous step are given
more weights, whereas the weights are decreased for those that were classified correctly.
As a result as iterations proceed, observations that are hard to classify correctly receive
ever-increasing influence. Each successive classifier is thus forced to focus on those
training observations that are misclassified in the previous iteration in the sequence.
The power of AdaBoost is to considerably increase the performance of even a
very weak classifier. This AdaBoost.M1 algorithm is also called “Discrete AdaBoost”
since the base classifier Cm ( x) returns a discrete class label. Friedman et al. (2000)
modified the algorithm to enable the base classifier to return real-valued predictions thus
called “Real AdaBoost”. Other modifications such as Gentle AdaBoost requires fitting a
regressor at each iteration and results in the original GentleBoost algorithm whenever  m
=1.
As to the “base learner” Cm ( x) for AdaBoost algorithm, the most commonly used
one is classification trees, where improvements are often most dramatic. In fact, Breiman
50
(1996, 1998) referred to AdaBoost with classification trees as the “best off-the-shelf
classifier in the world”.
4.1.3 Random Forest
Another ensemble classification method which aims at enhancing the
classification accuracy are random forests. The idea of a random forest is to combine tree
predictors so that each tree depends on the values of a random vector sample
independently and with the same distribution for all trees in the forest.
Random forests were developed by Breiman (2001), the same author who
introduced bagging. Compared with Adaboost, random forests are more robust with
respect to noise, which occurs because Adaboost has no random elements and grows an
ensemble of trees by successive reweightings of the training set where the current
weights depend on the past history of the ensemble formation whereas random forests do
not depend on the past history of the ensemble. Unlike SVM, random forests require little
tuning and is very easy to implement.
Like bagging, random forests also use the bootstrap to generate samples and to
grow a committee of classification trees. To classify a new object from an input vector,
one puts the input vector down each of the trees in the forest and each tree gives a
classification. Then the new object is classified into the class which “won the most votes”
from all the trees in the forest. Unlike bagging, the trees in random forests are de-
correlated and they allow random selection of features to split each node. Bagging can
actually be regarded as a special case of random forests, and random forests are the
51
combination of bagging and random selection of subsets of features (Ho 1998; Amit &
Geman 1997).
4.1.3.1Construction of Random Forests
A random forest is constructed as follows. For each tree in the forests Tb (b = 1 to
B),
1) Draw a bootstrap sample Z* of size N from the original data to be the training set.
2) Grow a random forests tree Tb with the sample Z*. If there are p input variables, a
number m<<p is specified such that at each node, m variables are selected at
random out of p variables and the best split on these m is used to split the node
into daughter nodes. The value of m is held constant during the forest growing.
3) Each tree Tb is grown to the largest extent possible and there is no pruning.
Then output the ensemble of trees {Tb }1B . For the classification prediction at a new point x,
let Cˆb ( x) be the class prediction of the bth random forest tree, then
Cˆ rfB ( x) = majority vote of {Cˆb ( x)}1B . (6)
4.1.3.2 Strength and Correlation
In the original paper on random forests, Breiman (2001) shows that the forest
error rate relies on two parameters: the strength of each individual tree in the forest and
the correlation between any two trees in the forest, defined as follows.
Consider a random forest classifier consisting of a collection of tree-structured
classifiers {h(x, k ), k  1,...} where the {k } that are independently and identically
52
distributed (i.i.d) random vectors and each tree Tb casts a unit vote for the most popular
class at input x. Define the margin function for a random forest as:
{mr ( X , Y )  P (h( X , )  Y )  max P (h( X , )  j) (7)
and the strength of the set of classifiers {h(x, k )} is
s  EX ,Y mr ( X , Y ) . (8)
Now define the raw margin function as
{rmg (, X , Y )  I (h( X , )  Y )  I (h( X , )  ˆj( X , Y )) (9)
where I is the indicator function. Hence mr ( X , Y )  Ermg (, X , Y ) . Since any two
values of (, ) are i.i.d, the variance of mr is:
var(mr )  E, (cov X ,Y rmg (, X , Y )rmg (, X , Y )) (10)
In brief, the higher the correlation, the greater the forest error rate and increasing
the strength of the individual trees decreases the forest error rate. Reducing the predictor
number m at each node reduces both the correlation and the strength and increasing it
increases both. For classification, the default value for m is and the minimum node
size is one (Hastie, et al. 2009). However, the "optimal" value of m can be found quickly
by using the oob (out of bag) error rate and this is the only adjustable parameter to which
random forests are somewhat sensitive.
In the following sections, I introduce several unique features or output that
random forests possess.
4.1.3.3 Use of Out of Bag Data

53
Random forests use out of bag (oob) data as the test set; therefore cross-validation
is not needed to estimate the test set error. When the training set for the current tree is
drawn by sampling with replacement, about one-third of the cases are left out of the
sample. This oob (out-of-bag) data is used to get a running unbiased estimate of the
classification error as trees are added to the forest. It is also used to get estimates of
variable importance, which is a prominent feature of random forests.
During the run, each tree is constructed using a different bootstrap sample from
the original data but the oob data are not used in the construction of the kth tree. Put each
case left out in the construction of the kth tree down the kth tree to get a classification.
Thus, a test set classification is obtained for each case in about one-third of the trees. At
the end of the run, take j to be the class that got most of the votes every time case n was
oob. The proportion of times that j is not equal to the true class of n averaged over all
cases is the oob error estimate.
4.1.3.4 Variable Importance
Variable importance is a very important feature of random forests and it is used in
this dissertation to rank the factors that could influence customers’ decision making
during the three stages of CRM. The variable importance in random forest is calculated in
the following way: first, input the oob cases in every tree grown in the forest and count
the number of votes cast for the correct class; second, permute the values of variable m
randomly in the oob cases and enter these cases into the tree again; third, subtract the
number of votes for the correct class in the variable-m-permuted oob data from the
54
number of votes for the correct class in the intact oob data; finally, average this number
over all trees in the forest to obtain the raw importance score for variable m.
If the values of this score from tree to tree are independent, one can calculate the
standard error by a standard computation. The correlations of these scores between trees
have been calculated for numerous data sets and proved to be quite low, therefore
standard errors is computed in the classical way: one can divide the raw score by its
standard error to get a z-score, and assign a significance level to the z-score assuming
normality. When the number of variables is very large, forests can be run with all the
variables once, and then be run again using only the most important variables from the
first run. For each case, a local importance score for variable m for this case is generated
by subtracting the percentage of votes for the correct class in the variable-m-permuted
oob data from the percentage of votes for the correct class in the untouched oob data.
Gini importance is another measure for variable importance and it is generated by
random forests in the following way: every time a split of a node is made on variable m
the gini impurity criterion decreases. Adding up the gini value decreases for each
individual variable over all trees in the forest, gives a Gini importance value.
4.1.3.5 Advantages of Random Forests
Even though Breiman (2001)’s claim that random forests is unexcelled in
accuracy among current algorithms, many later empirical studies showed that this is not
the case. However, random forests do possess the following advantages: 1) they run
efficiently on large data sets; 2) they can handle thousands of input variables without
variable deletion; 3) they give estimates of what variables are important in the
55
classification; 4) they generate an internal unbiased estimate of the generalization error as
the forest building progresses; 5) they incorporate methods for balancing error in class
population unbalanced data sets; 6) the generated forests can be saved for future use on
other data; 7) they compute proximities between pairs of cases and hence offers an
experimental method for detecting variable interactions; 8) random forests do not overfit
(Breiman, 2001).
In this dissertation, I employ random forests, gradient boosting to perform
classification for the following three events: 1) conversion (or trial conversion), the event
when a subscriber who is not paying for a subscription out of his own pocket (but
provided free by the company) decides to become a paying subscriber and purchases a
subscription after his trial period ends; 2) churn, the event when a revenue-generating
subscriber who pays out-of-pocket decides to terminate service, for some reason; and 3)
winback, the event when a deactivated subscriber who either discontinued his service at
the end of trial-period or churned as a paid-subscriber reactivates his service with the
company.
4.2 Measuring Classifier Performance
There are quite a few measurements to evaluate performance of classification
methods. The commonly used classification performance measurements include accuracy
rate (1- error rate), precision and recall, sensitivity and specificity, F-measure, Youden’s
index (arithmetic mean between sensitivity and specificity), Matthews correlation
(correlation between the actual and predicted), as well as graphical tools such as ROC
curves and Cumulative Lift charts. However all of these performance check methods are
56
based on a confusion matrix which consists of four dimensions: true positive (TP), true
negative (TN), false positive (FP), and false negative (FN). Each measurement has
strength and weaknesses, which I outline in the discussion of commonly used measures
of text classification accuracy below.
1) Accuracy rate = (TP + TN) / T (11)
where T=TP + TN + FP + FN, or the total size of the test set. This is most straight-
forward measurement, however, when a certain category comprises the majority of the
test set, the accuracy can easily reach the high percentage. Sometimes it is worth trying to
maximize the accuracy score, but accuracy (and its counterpart error) are considered
fairly crude scores that don't give much information about the performance of a classifier.
2) Precision (PPV) = TP / (TP + FP) (12)
is also called Positive predictive value (PPV), which measures the proportion of predicted
positives which are actually positive. The issue with precision is that when a
classification method outputs only confident categories (the categories that take the
majority), the precision rate will increase to a high percentage.
3) Recall (Sensitivity) = TP/(TP+FN) (13)
measures the proportion of actual positives which are predicted positive. The problem
with the measurement is that when the classifier outputs loosely, the recall rate can get
really high.
4) F measure= the harmonic mean between precision and recall
The F-scores are in the interval [0, 1]. The higher the score, the higher the
classification quality with 1 indicating perfect classification. Since there is a trade-off

57
between recall and precision, the F-measure is usually used as a simple measure to
evaluate the classifier. Considering that you can get a perfect precision score by always
assigning zero categories, or a perfect recall score by always assigning every category.
However, to maximize the F-score, the classifier has to assign the correct categories and
only the correct categories, which maximizes both precision and recall at the same time.
There are two types of F-measures, resulting from two different ways of
averaging: micro-average F-measure and macro-average F-measure. The Micro-
averaging F-measure is computed globally over all category decisions. So if I have
multiclass classification, for each class i , I define TPi as the number of documents
assigned correctly to class i , FPi as the number of documents that do not belong to class
i but are assigned to class i , FNi as the number of documents that are not assigned to
class i by classifier but which actually belong to class i , and  as the precision, and  as
the recall, then
M M
 TPi  TP i
= M
i 1
,  = M
i 1
 (TPi  FPi )  (TP  FN )

i i
i 1 i 1 (14)
2
F (micro-averaging) =
  (15)
where M is the total number of categories. In contrast, the micro-averaging F-value gives
equal weight to each document, therefore it is the same as accuracy rate when you are
dealing with single-label classification. It tends to be dominated by the classifier’s
performance on common categories.

58
The macro averaging F-measure is calculated locally over each category first and
then averaged over all categories.
2 i i F i
Fi  , and F (macro-averaging) = i 1
 i  i M (16)
where M is the number of categories.

59
CHAPTER 5
DATA AND VARIABLES
5.1 Data
The data utilized in this dissertation comes from Sirius XM, the largest worldwide
satellite company, and was granted through the Wharton Customer Analysis Initiative
(WCAI). This data set provides a “360 view” of SiriusXM’s 1,000,000 customers who
first subscribed or tried in late 2009, and are tracked through roughly 2.5 years up to
February 2012. It includes information on subscribers, subscribers’ vehicles, subscribers’
radios, product usage, billing & payments, outbound direct marketing, and customer
service interactions (online and voice). While customers may access the service via the
Internet and mobile phones, the majority of subscribers listen to SiriusXM in their
vehicles.
The dataset I received includes 9 tables from Sirius XM: Services Table, MDB
(Outbound Marketing) Table, Billing Table, Collections Table, Notes Table, Self Care
Table, Listening Log Table, ESN Table (Device Information), VLC Table (Vehicle
Information), and one table created by WCAI: Radio Locator Table.
Drawing from this data, I first focus on customers who own one vehicle since the
sole vehicle owners’ activation, deactivation, or reactivation activities can provide clean
estimates of the effects of various predictors on conversion, retention, and winback.
Moreover, many of the multi-vehicle subscribers are business customers who are likely to
be qualitatively different from non-business subscribers. This reduces the sample size
from 1 million to less than 700,000 customers. Second I determine which customers are
60
paid customers since only these customers should be examined in the retention period.
The number of paid customer with only one vehicle is 464,810. Third, I separate the data
into three parts: trial period, paid period and winback period by combining the
information from each customer’s trial period end date, first paid service start date,
deactivation date and reactivation date. For paid customers, their trial period records
consist of all interactions that occur before the first paid service start date and their
retention period starts from their first paid service start date; for non-converted customers
the trial period records consist of interactions either before their deactivation date or the
end date in the data. Using these definitions results in 392,532 trial customers, of whom
221,333 converted into paid customer. The reported conversion rate of the company for
2010 was 46.2%, which is similar to my data sample.
To decide which paid customers churned, I used their deactivation date, if it was
before the end of the data in the sample (March 1, 2012). I treat customers who have not
churned by this data as right-censored, meaning they are classified as non-churned
customers. Thus I obtain 228,597 churned customers and 236,213 non-churned customers.
Finally the winback customer base consists of customers who have a clear deactivation
date. Among them, customers who have a specific reactivation date are defined as the
ones that have been successfully won back while others are not. For my research
purposes, I further divided the winback customers into categories: the ones who
deactivated during or at the end of their trial period and the ones who churned after
conversion. There are 1,539 non-converted customers who were reacquired but 157,158
were not. Among the churned customers, 67,124 were recaptured and 148,047 were not.
61
Table 4 below shows the final datasets that I used to predict conversion, churn and
winback both unconverted customers and churned customer. Table 4 below shows the
data for each of the three periods as well as the rate of conversion, churn and winback.
Table 4 Data for 4 events
Trial Period Paid Period Unconverted Churned

Winback Winback
Non-
Converted 221,333 236,213 Won back 1,539 67,124
churned
Not won
Unconverted 171,199 Churned 228,597 157,158 148,047
back
Total 392,532 Total 464,810 Total 158,697 215,171
Conversion Churn Winback
56.38% 49.18% 1% 31.2%
rate rate rate
5.2 Variables
In order to explore my research questions and hypotheses, I created four
categories of variables from the data: 1) Customer characteristics variables, which
include both a linear and quadratic form for mean-centered acquisition/conversion
duration, retention/paid/churn period duration, the dummy variable for usage of self-care
in both conversion and churn period, customer response rate to firm’s marketing actions,
three self-care activity variables (e.g "Change_AcctInfo", “Make_Payment”,
“Change_Service”), three self-care channel variables ( “Live_agent” for speaking to
customer representatives, “IVR” for interactive voice response, and “CUWI” for internet
62
help), as well as the interactions between self-care channels and activities which count
how many time each customer use each self-care channel to do each activity; 2)
Marketing initiative variables, which consist of total number of marketing out-bound
contacts, the counts of each communication channel that each customer receives from the
firm, number of promotions that each customer gets, mean-centered average price in both
linear and quadratic forms, and total-radio number for each zip code which I treat as radio
density variable (Xue, et al. 2011); 3) Deactivation reasons, which have been grouped
into 10 subcategories: "Service Dissatisfaction", “Uncontrollable reason”, "Product
Dissatisfaction", "Cost/price/fee", "Non Usage", "Prepaid period ended" (this one is
grouped into “Free ended” for trial deactivation reason), "Free ended", "Competition",
"Unknown reason", and “NonPay”; 4) Text mining variables. I analyzed the call center
notes text using LIWC2007 (Linguistic Inquiry and Word Count). LIWC2007 is text
analysis software which is able to calculate the degree to which people use different
categories of words across text with 64 language dimensions. The variables generated by
LIWC include 22 linguistic dimensions such as percentage of words in the text that are
pronouns, articles, auxiliary verbs etc, 32 psychological process word categories
including social, affective, cognitive, perceptual, biological, and relativity processes, 7
personal concern categories (e.g., work, home, leisure activities), and 3 paralinguistic
dimensions (assents, fillers, nonfluencies). Therefore, these variables are clusters from
the results of a word extraction process, which groups the words and phrases that express
the same meaning or have the same linguistic function together. For a complete list of the
LIWC output variables and their examples, please see the Appendix 1.
63
The variable values generated by LIWC are the percentage of total words for each
entry. For example, an incomplete snapshot of the LIWC output in Table 5 shows that
11.59% of note #1 is composed of pronouns compared with 15% of note #2. Also 5.92%
of note #5 consists of positive emotional words compared with 12.5% of those for note
#6. Since these values are the percentage of frequency in each note, the total word count
for each note has been taken into consideration.
Table 5 LIWC Output Snapshot

Notes funct pronoun they posemo negemo anger sad cogmech
1 54.32 11.59 0.45 4.32 1.59 0 0.23 12.05
2 57.5 15 0 5 0 0 0 15
3 50 10.94 0 6.25 0 0 0 14.06
4 48.94 19.15 0 4.26 2.13 0 0 17.02
5 55.62 10.06 0 5.92 0.59 0 0 13.61
6 53.12 15.62 3.12 12.5 0 0 0 15.62
7 60.4 21.78 0 1.98 4.95 0.99 0.99 17.82
8 64.85 18.18 1.21 3.64 1.82 1.82 0 18.79
9 48.94 19.15 0 4.26 2.13 0 0 17.02
10 60 14.29 0 2.86 5.71 2.86 0 20
11 75 30 5 0 0 0 0 30
12 54.76 7.14 0 0 0 0 0 19.05
13 59.33 20.1 3.35 1.91 3.35 0.96 0 17.22
14 46.43 10.71 0 4.46 0.89 0.89 0 12.5
15 53.33 13.33 0 6.67 0 0 0 20
16 48.94 19.15 0 4.26 2.13 0 0 17.02
5.3 Time Frame of Variables
All time-varying variables, including all self-care activities that each customer
performed, the self-care channels that each customer used, all marketing variables such as
64
the total marketing outbound communication, customer responsiveness to marketing
communications, each marketing outbound channel), as well as the call center notes
during both conversion and retention period, were separated into two time frames: 60
days within the conversion and retention period end dates and 60 days beyond the end
date of conversion and retention period. Therefore, for these variables the number of
variables is doubled. The reason why I used 60 days instead of other time frame is
because that SiriusXM usually communicates with its customers 30 days before their
contract ends to ask them whether they want to renew their services or not and if not,
SiriusXM would take some action (such as offering a discount) to retain customers. In
addition, based on prior research results, I know that the recency effect plays a role in
customers’ decision making, so dividing the data into these time-based categories, I can
also test whether activities that happened closer to the conversion or churn dates are more
predictive of these events. It seems reasonable that activities that occurred further in the
past should exhibit decaying effects.
In order to model the winback event, I used the conversion period variables to
predict winning back non-converted customers and the retention period variables to
predict recapturing churned customers.
Table 6 shows some selected variables and their definition. The summary
statistics of the four tables are attached as Appendix 2-5.

65
Table 6 Selected variables and their definitions
Variable Definition
RADIO_TOTAL The total number of radio in a given zip code
Acquisition_duration Mean centered trial period length in months
Acquisition_duration2 Quadratic form of Acquisition_duration

Retention_duration Mean centered paid period length in months
Retention_duration2 Quadratic form of Retention_duration
Unknown_reason 1 if customer deactivated for 'Unknown_reason', 0
otherwise
Free_ended 1 if customer deactivated for 'Free_ended', 0 otherwise
Uncontrollable_reason 1 if customer deactivated for 'Uncontrollable_reason', 0
otherwise
Service_Dissatisfaction 1 if customer deactivated for 'Service_Dissatisfaction', 0
otherwise
NonPay 1 if customer deactivated for 'NonPay', 0 otherwise
Non_Usage 1 if customer deactivated for 'Non_Usage', 0 otherwise
Product_Dissatisfaction 1 if customer deactivated for 'Product_Dissatisfaction', 0
otherwise
Cost_price_fee 1 if customer deactivated due to Cost or price or fee, 0
otherwise
Competition 1 if customer deactivated due to competition, 0 otherwise
Acquis_Live_Agent60 Count of customer calling customer representatives within
60 days of their trial period end date.
Acquis_CUWI60 Count of customer using Internet Help within 60 days of
their trial period end date.
Acquis_IVR60 Count of customer using Interactive Voice Response
within 60 days of trial period end date
Acquis_Live_Agent_gt60 Count of customer calling customer representatives
beyond 60 days of their trial period end date.
Acquis_CUWI_gt60 Count of customer called call center within 60 days of
their trial period end date.
Acquis_IVR_gt60 Count of customer using Interactive Voice Response
beyond 60 days of trial period end date
Acquis_ChangeAcctInfo6 Count of customer changing account info through all self-
0 care channels within 60 days of their trial period end date.
Acquis_Change_Service6 Count of customer upgrading or downgrading service
0 through all self-care channels within 60 days of their trial
period end date.
Acquis_Make_Payment60 Count of customer using self-care channels to make
payment within 60 days of their trial period end date.
Acquis_Make_Payment_g Count of customer using self-care channels to make
66
t60 payment beyond 60 days of their trial period end date.
Acquis_ChangeAcctInfo_ Count of customer changing account info through all self-
gt60 care channels beyond 60 days of their trial period end
date.
Acquis_Change_Service_ Count of customer upgrading or downgrading service
gt60 through all self-care channels beyond 60 days of their trial
period end date.
Acq_LiveAg_ChangeInfo Count of customer calling customer rep to change account
60 info within 60 days of their trial period end date.
Acq_LiveAg_ChangeSer6 Count of customer calling customer rep to change service
0 plan within 60 days of their trial period end date.
Acq_LiveAg_MakePaymt Count of customer calling customer rep to make payment
60 within 60 days of their trial period end date.
Acq_CUWI_MakePaymt6 Count of customer using internet help channel to make
0 payment within 60 days of their trial period end date.
Acq_CUWI_ChangeInfo6 Count of customer using internet help channel to make
0 payment within 60 days of their trial period end date.
Acq_IVR_MakePaymt60 Count of customer using interactive voice message to
make payment within 60 days of their trial period end
date.
Acq_CUWI_ChangeSer60 Count of customer using internet help channel to change
service plans within 60 days of their trial period end date.
Acq_IVR_ChangeSer60 Count of customer using interactive voice message to
change service plan within 60 days of their trial period
end date.
Acq_IVR_ChangeInfo60 Count of customer using interactive voice message to
change account info within 60 days of their trial period
end date.
Acq_LiveAg_MakePaymt Count of customer calling customer rep to make payment
_gt60 beyond 60 days of their trial period end date.
Acq_CUWI_ChangeInfo_ Count of customer using internet help channel to make
gt60 payment beyond 60 days of their trial period end date.
Acq_CUWI_ChangeSer_g Count of customer using internet help channel to change
t60 service plans beyond 60 days of their trial period end date.
Acq_LiveAg_ChangeSer_ Count of customer calling customer rep to change service
gt60 plan beyond 60 days of their trial period end date.
Acq_CUWI_MakePaymt_ Count of customer using interactive voice message to
gt60 make payment beyond 60 days of their trial period end
date.
Acq_LiveAg_ChangeInfo Count of customer calling customer rep to change account
_gt60 info beyond 60 days of their trial period end date.
Acq_IVR_MakePaymt_gt Count of customer calling customer rep to change service
60 plan beyond 60 days of their trial period end date.
Acq_IVR_ChangeInfo_gt Count of customer using internet help channel to make
67
60 payment beyond 60 days of their trial period end date.
Acq_IVR_ChangeSer_gt6 Count of customer using interactive voice message to
0 change service plan within 60 days of their trial period
end date.
Acquis_MDBcount60 Total count of marketing outbound initiatives within 60
days of customer's trial period end date
Acquis_MDBcount_gt60 Total count of marketing outbound initiatives beyond 60
ACQUSI_EM60 Number of email sent to customers within 60 days of
customer's trial period end date
ACQUSI_DM60 Number of direct mail sent to customers within 60 days of
ACQUSI_TELEM60 Number of telemarketing reaching customers within 60
ACQUSI_EM_gt60 Number of email sent to customers beyond 60 days of
ACQUSI_TELEM_gt60 Number of telemarketing reaching customers beyond 60
ACQUSI_DM_gt60 number of direct mail sent to customers within 60 days of
Trial60Response_Rate Customer's response rate to firm's marketing outbound
initiatives that was sent within 60 days of trial period end
date
Trial_gt60Response_Rate Customer's response rate to firm's marketing outbound
initiatives that was sent beyond 60 days of trial period end
date
ACQUSI_EMDM60 Interaction of email and direct mail sent to customer
ACQUSI_EMTELEM60 Interaction of email and telemarketing sent to customer
ACQUSI_DMTELEM60 Interaction of direct mail and telemarketing sent to
customer within 60 days of trial period end date
ACQUSI_EMDMTELEM Interaction of email, direct mail and telemarketing sent to
60 customer within 60 days of trial period end date
ACQUSI_EMDM_gt60 Interaction of email and direct mail sent to customer
beyond 60 days of trial period end date
ACQUSI_EMTELEM_gt Interaction of email, direct mail and telemarketing sent to
60 customer beyond 60 days of trial period end date
ACQUSI_DMTELEM_gt Interaction of email, direct mail and telemarketing sent to
60 customer beyond 60 days of trial period end date
ACQUSI_EMDMTELEM Interaction of email, direct mail and telemarketing sent to
_gt60 customer beyond 60 days of trial period end date
AcqSelfcare_dummy 1 if customer uses any self-care channel, 0 otherwise
Variables unique to retention/churn period
68
Service_count Number of services that each customer had during paid
period
MC_Ave_Price Mean-centered average price a customer paid during
MC2_Ave_Price Quadratic form of MC_Ave_Price
Price_range The difference between the maxi and mini price a
customer paid
Promo_count Number of promotions a customer gets during paid period
69
CHAPTER 6
RESULTS AND DISCUSSIONS
6.1 Variable Importance in Four Events
Four variable importance graphs were produced by random forests to exhibit the
30 most important variables for each of the four events: conversion, churn, winning back
of non-converted customers, and wining back of churned customers. Studying each of
them, I find:
1) For conversion period (see Figure 1 below): a) the 30 most predictive variables
are dominated by text mining variables from LIWC of both linguistic words and
psychological vocabulary regardless of whether they are within or beyond 60 days of trial
period end date. This result indicate that analyzing what and why customers made phone
calls to customer representatives is of critical importance in predicting conversion. b)
Other factors that play roles in converting customers include: marketing actions such as
the total number of outbound communications that company sent to customer beyond 60
days of trial period end date and customers’ responsiveness to them, the number of direct
mail that the company sent to customer within 60 days of the trial period end date, as well
as the number of phone calls that the company made to customers beyond the 60 days of
the trial period end date. This result shows that during the early stage of the trial period
when customers were trying out the company’s product, the additive effect of company’s
communications as well as whether customers are responsive to company’s marketing

70
efforts are important in affecting customers’ conversion. In addition, marketing
communications by phone play more important role during this period than other
communication methods. However, when it is close to the date that trial period ends,
direct mail are more effective in converting customers. c) The duration of customers’ trial
period, the radio density in each zip code, whether customers apply self-care channel and
how many times they called customer care center are all also highly indicative of whether
customers would convert to paid customer or not. This result is consistent with the prior
research findings. d) Among all the deactivation reasons that customers report, only
“Free_ended” play an important role in predicting whether customers would convert or
not. This result supports Hypothesis 8b and shows that the major reason that customers
would not convert was due to the end of free trial period.
71
Figure 1 Conversion Variable Importance Plot

2) For retention period: a) Firm’s marketing initiative variables play dominant
roles in shaping customers decision making on whether to churn or stay with the
company. The total number of outbound communications that company sent to customer
during the paid period regardless of the time frame, the total marketing communications
occurring during the trial period, the number of times the company reach customers by
email, direct mail, or phone call both within and beyond the paid period end date, as well
as the number of promotions each customers gets are all highly predicative of customers’
churn or no-churn behavior. b) From customer side, whether customers use self-care
services, their responsiveness to firm’s marketing actions throughout the whole retention
72
period, the number of services they had with the company as well as the average price
and price range associated with them, both the trial period duration and paid period
duration, are all differentiating characteristics that reveal customers’ future behavior
direction. c) A few psycholinguistic variables such as the total number of function words,
the number of verbs, prepositions as well as the number of relativity words (e.g. area,
bend, exit, stop) that appear in the call center notes are also good predictor of whether
customer will churn or not.

73
Figure 2 Churn Variable Importance Plot
3) To win back the customers who fail to become paid customers, the factors that
used to influence their initial decisions of conversion or non-conversion still play a large
role. Even though the number of text mining variables decrease by 2 and their importance
ranking also declined compared with marketing variables, they still constitute nearly half
of the top 30 important variable list which shows the importance of investing in studying
the content of calls that customers made during their trial period. Radio density becomes
the top one factor influencing whether company can win back customer or not, which
74
indicates the regional difference in availability and variety of the product is the major
aspect that firms should consider when they allocate the marketing resources regarding
who should be won back. The total number of marketing initiatives and customers’
responsiveness to it, the trial period duration, as well as the communication channels that
the firm uses to reach out customer are all effective in winning back the non-converted
customers.
Figure 3 Non-Converted Winback Variable Importance Plot

75
4) For winning back churned customers, whether customers used self-care during
both trial and paid periods, the number of promotions that customers used to get during
the paid period, customers’ responsiveness to marketing communications throughout the
retention period, the average price and price range that customers used to get during the
retention period, customers’ trial period duration, customers’ total service number with
the company, as well as firm’s marketing communication variables are all highly
predictive of whether customers could be won back or not. Particularly the number of
promotions that customers get during the paid period became the top predictor of
customer churn or not, which could imply that the customers who churned were mostly
price sensitive. The larger number of promotions each customer gets, the more price-
sensitive they are, and the more difficult for the firm to reacquire them. A few
psycholinguistic variables such as “cogmech”, “social”, functional words and verbs are
also among the top 30 predictors in winning back churned customers.

76
Figure 4 Churned Winback Variable Importance Plot
5) Examining all four variable importance graphs together, I discovered that: all
marketing action variables, whether customers use self-care or not, customer duration in
both conversion and retention periods are strong predictors in classifying converted and
unconverted customer, churned and non-churned customer, as well as the customers that
could be won back or not. Finally, nearly all deactivation variables except ‘Free_ended”
are missing from the top 30 list. Some of them are significant in predicting one or two
events (see Table 9 below) but compared with other variables, they are not that
77
‘important’ in terms of predicting customer conversion, churn and reacquisition.
Therefore H8a is not supported.
6.2 Hypothesis Testing
Random forests only give us the overall importance ranking of all variables in
terms of increasing classification accuracy. To examine individual variables’ marginal
effect and some interaction effects among certain variables on the probability of four
events, namely conversion, churn, winning back unconverted customers and winning
back churned customer, I rely on logistic regression to provide the parameter estimates
and their statistical significance.
6.2.1 Self-care
The major results for the analysis of self-care channels, and related activities, are
shown in Tables 7 and 8. Studying these two tables I find:
1) Usage of self-care: Whether customers employing self-care channel or not
during their trial period and subsequent paid period are strongly significant (p-value <
0.01) in predicting all four events: customer conversion, customer churn, customer non-
converted winback and customer churn winback. However the only positive effect of
whether customers adopting self-care is for winning back churned customers (β = 0.3862,
p-value < 0.01), which barely supports H1a. Customers using self-care during their trial
period actually decreases their likelihood of converting from free to fee and subsequent
reactivation of service with the company (β = -0.5640 and -0.7260 respectively, p-value <
0.01). In addition, the employment of self-care during the retention period increases
customer churn probability. All these negative effects of using self-care could suggest
78
customers mainly voluntarily contact the service-provider when they encounter problems
and they use self-care to seek help from the company. To further investigate if this
assumption is true or not, I proceed to examine the interaction and marginal effects of
customer’s usage of each three self-care channel (e.g. call live agent, use online help
channel, or use interactive voice message) and the activity they performed (eg. Change
account information, change service plan, or make payment).
2) Interaction and marginal effects of self-care channel usage and activities: The
interactions between self-care channels and self-care activities here are not the
multiplication of their marginal effects but the number of times two events happen
concurrently in a given time period. It captures what activity a customer performed
through which channel at a given point of time. For example,
‘Retent_LiveAg_MakePa60’ refers to the count of times that a customer called a live
agent to make payment within 60 days of their churn or the right-censored date of March
1st, 2012. For Table 7, I discover that the estimates of all marginal effects and the
interaction of self-care channel and self-care activities are insignificant except the
‘Retent_LiveAg_MakePa60’. Furthermore, the parameter estimate shows a negative
value with regard to churn probability (β = -0.1722, p-value < 0.05). So statistically the
more a customer makes phone calls to a customer representative to make payment within
60 days of their churn period end date, the lower the probability he or she will churn. The
result of overall performance of the interaction and marginal effects of self-care channel
usage and activities fail to support H1b.

79
Table 7 Self-care Channel Activity in 4 events

Variables Conver- NonCov Variables Churn Churn
sion erted WB
WB
AcqSelfcare_dummy - - Retent_Selfcare_dummy 0.4007* 0.3862*
0.5640* 0.7260* * *
* *
Acquis_Live_Agent60 -0.0655 -0.0453 Retent_Live_Agent60 0.0132 -0.0233
Acquis_CUWI60 0.7442 -6.5272 Retent_CUWI60 -0.1436 -0.0527
Acquis_IVR60 0.1764 -5.5450 Retent_IVR60 -0.1299 -0.0340
Acquis_Live_Agent_gt60 0.3278 3.1967 Retent_CUWI_gt60 0.0143 -0.0148
Acquis_CUWI_gt60 -0.1110 -1.1663 Retent_Live_Agent_gt60 0.0445 -0.1021
Acquis_IVR_gt60 -0.0105 -0.5491 Retent_IVR_gt60 0.0205 0.0349
Acquis_ChangeAcctInf60 -0.3589 -1.1416 Retent_Change_Servic60 0.1265 0.1619
Acquis_Change_Service60 -0.7268 5.5601 Retent_Make_Payment60 0.1602 0.0615
Acquis_Make_Payment60 0 0 Retent_ChangeAcctInf60 0 0
Acquis_Make_Payment_gt 0.0649 -3.2603 Retent_ChangeAcctInf_gt60 -0.0955 -0.0271
60
Acquis_ChangeAcctInfo_g -0.3300 -3.0921 Retent_Change_Servic_gt60 -0.0475 0.0909
t60
Acquis_Change_Service_g 0 0 Retent_Make_Payment_gt60 0 0
t60
Acq_LiveAg_ChangeInf60 0.1333 1.1226 Retent_LiveAg_Change60 -0.1378 -0.1446
Acq_LiveAg_ChangeSer6 0.5929 -5.6631 Retent_LiveAg_MakePa60 - -0.0335
0 0.1722*
Acq_LiveAg_MakePaymt6 0 0 Retent_CUWI_ChangeIn60 0.1437 0.0377
0
Acq_CUWI_MakePaymt6 -0.6080 7.2858 Retent_CUWI_ChangeSe60 0.00349 -0.0979
0
Acq_CUWI_ChangeInfo60 -0.5649 7.5668 Retent_CUWI_MakePaym60 0 0
Acq_IVR_MakePaymt60 0 0 Retent_LiveAg_Change60 0 0
Acq_CUWI_ChangeSer60 0 0 Retent_IVR_ChangeSer60 0 0
Acq_IVR_ChangeSer60 0 0 Retent_IVR_MakePaymt60 0 0
Acq_IVR_ChangeInfo60 0 0 Retent_IVR_ChangeInf60 0 0
Acq_LiveAg_MakePaymt_ -0.3953 0.0372 Retent_CUWI_ChangeIn_gt6 0.0713 0.0220
gt60 0
Acq_CUWI_ChangeInfo_g 0.4800 4.4124 Retent_CUWI_ChangeSe_gt6 0.0378 -0.0457
t60 0
Acq_CUWI_ChangeSer_gt 0.0470 1.0053 Retent_LiveAg_MakePa_gt60 -0.0464 0.0983
60
Acq_LiveAg_ChangeSer_ -0.2183 -2.9024 Retent_CUWI_MakePaym_gt 0 0
gt60 60
Acq_CUWI_MakePaymt_ 0 0 Retent_LiveAg_Change_gt60 0.0522 0.0840
gt60
Acq_LiveAg_ChangeInf_g 0 0 Retent_LiveAg_Change_gt60 0 0
t60
Acq_IVR_MakePaymt_gt6 0 0 Retent_IVR_ChangeSer_gt60 0 0
0
Acq_IVR_ChangeInfo_gt6 0 -3.0754 Retent_IVR_ChangeInf_gt60 0 0
0
Acq_IVR_ChangeSer_gt60 0 0 Retent_IVR_MakePaymt_gt6 0 0
80
0
*p<.05
** p<.01
6.2.2 Marketing Initiatives
The results of analyzing marketing initiative variables in Table 8 suggest the
following:
1) Total marketing communication outbound: the parameter estimate of trial
period marketing volumes on conversion and churn (β = 0.00566, and -0.0169
respectively, p <0.001) indicates that there is a positive association between trial period
marketing volumes and conversion and retention. Also the marketing volume beyond the
60 days increase the probability of winning back both unconverted and churned
customers; hence the H2a is largely supported. For the marketing communication volume
in the churn period, the more outbound marketing, the higher the probability that the
customer will churn. This supports H2b and is also consistent with the reciprocal action
theory and reactance theory explanations.
Table 8 Marketing variable estimates
Variables Convers NonCon- Variables Churn Churned

-ion version Winbac
Winback k
Acquis_MDBcount60 0.00566** 0.000845 Retention_MDBcount60 0.0052** 0.00045
Acquis_MDBcount_gt6 0.0393** 0.0584** Retention_MDBcount_gt - 0.00149**
0 60 0.0463**
Trial60Response_Rate 0.0332 0.00964 Churn60Response_Rate 1.0879** -0.246**
Trial_gt60Response_Ra 0.3758** 1.7549** Churn_gt60Response_R 5.4881** -0.674**
te ate
ACQUSI_EM60 -0.0103* 0.00502 RETENTION_DM60 -0.016** -0.015**
81
ACQUSI_DM60 -0.0130* -0.0274 RETENTION_EM60 -0.002** -0.003**
ACQUSI_TELEM60 -0.0797** -0.0996 RETENTION_TELEM6 - -0.020**
0 0.0157**
ACQUSI_EMDM60 0.00438 0.00925 RETENTION_EMDM6 0.00077* 0.000247
0 *
ACQUSI_EMTELEM6 0.0156* -0.00343 RETENTION_EMTELE 0.00051* 0.000182
0 M60
ACQUSI_DMTELEM6 0.0335** 0.0383 RETENTION_DMTELE 0.00363* 0.0061**
0 M60
ACQUSI_EMDMTEL -0.00263 -0.00395 RETENTION_EMDMT -0.00006 -0.00011
EM60 ELEM60
ACQUSI_EM_gt60 -0.00096 0.0124 RETENTION_DM_gt60 - -0.03**
0.0271**
ACQUSI_TELEM_gt6 -0.1833** 0.0239 RETENTION_EM_gt60 - -0.0026**
0 0.0026**
ACQUSI_DM_gt60 -0.0137** 0.0399 RETENTION_TELEM_ - -0.0687**
gt60 0.0296**
ACQUSI_EMDM_gt60 0.00130 -0.0150 RETENTION_EMDM_ 0.000425 0.00073
gt60
ACQUSI_EMTELEM_ 0.00676** 0.00295 RETENTION_EMTELE 0.00109* 0.00109*
gt60 M_gt60 *
ACQUSI_DMTELEM_ 0.0142** -0.0152 RETENTION_DMTELE 0.0116** 0.0325**
gt60 M_gt60
ACQUSI_EMDMTEL - 0.00207 RETENTION_EMDMT 2.44E-6 -0.00049
EM_gt60 0.00067** ELEM_gt60
Acq_MDBcount - -0.01**
0.0169**
Acq_MDB_ResRate - -0.164**
0.2058**
Price_range 0.0223** 0.0156**
Promo_count 0.4544** 0.5644**
MC_Ave_Price - -0.018**
0.0084**
MC2_Ave_Price 0.00017* 0.0004**
*
*p<.05
** p<.01
2) The response rate of customers to marketing efforts: all response rates except
customers’ response rate within 60 days of trial end period are highly significant in
predicting either conversion, or churn or winning back both unconverted customers and
churned customers. Moreover high response rate during the trial period beyond the 60
days of trial end date increases the chance of conversion but the response rate within 60
82
days of trial end date is insignificant in predicting conversion. This could be explained by
the firm’s specific marketing action that they always send mail to customers 30 days
before their trial end date to remind them to convert to paid customer. Thus the increase
in marketing volume and in the response rate within the 60 days of the end of conversion
period fail to be effective to influence customers’ conversion or winback since customers
could depict the increase in marketing volume near their contract end date not real
investment in the relationship but compelling message to persuade them to become paid
customers. Customers’ response rate could also be forced to be high since they have to
communicate whether to terminate the service or convert to paid customer. Therefore H3
is partially supported.
3) The interaction and marginal effects of communication channels: From Table 8
we see that all communications channels, no matter whether looking at interaction or
marginal effects are not significant in predicting winning back unconverted customers.
This could suggest that the unconverted customers were the ones who were not really
interested in the product that the company provides so they could resist the company’s
marketing effort. The interaction effects of all three channels both within and beyond 60
days of both trial end date and churn end date are not significant in predicting conversion,
churn and winback, therefore, the synergy or multiplicative effect that prior researchers
((Naik and Raman 2003, Godfrey et al 2011) found in their studies is not supported by
this study. On the contrary, the additive effect of multichannel communication (denoted
as the marketing volume in this dissertation) is highly significant in influencing
customer’s decision on conversion, churn and winback. This result is consistent with the
83
random forests importance ranking. As to the marginal effects of communication
channels, there is difference in each individual channel’s effect on customer conversion
and churn with the telemarketing being the most effective one. Therefore H4a is supported
by the evidence from this study but H4a is not.
4) Promotional depth: the parameter estimates for both Promo_count and
Price_range are highly significant in predicting customer churn and winback. Particularly,
the positive values for all four coeffients indicate that the more discounts that a customer
gets the more likely he is going churn and be won back. Therefore both H6a and H6b are
partially supported.
5) Average price a customer pays per month: A negative main effect and a
positive quadratic coefficient for the average price in predicting both churn and winback
shows that there is indeed a U-shape relationship between the average price that a
customer pays and the probability that he or she could be reacquired. Therefore H7b is
fully supported by the result. In the mean time, the average price that a customer pays per
month does have a negative impact on their churn. Therefore, the higher the price a
customer pays each month, the less likely he or she will churn. However, when this
diminishing effect reaches a certain point, the higher price will result in higher customer
churn. So H7a is partially support. Both results imply that: in order to win back lost
customers, company should target at the both low-value and high-value customers since
they are the ones who are most likely to be recapture. To reduce customer churn,
company has to control price increases so that they will not drive their most valuable
customers away.
84
6.2.3 Radio Density
The results of the analysis of the effect of radio density on customer conversion,
churn and winback are displayed in Table 9. The negative coefficient of the Radio_Total
on predicting conversion shows that the availability and variety of products are not the
reason that customers convert from free to fee. In addition, the insignificant coefficient of
Radio_Total on predicting churned customer partially supports the H5b. Finally the
negative effect of radio density on customer churn partially supports H5a. In short, radio
density can indeed decrease customer churn and does not show effect in winning back
churned customers.
6.2.4 Duration on Winback
The positive main effects and negative quadratic coefficients of acquisition
duration on predicting winning back both unconverted customers and churned customers
indicate there is indeed an inverted U-shape relationship between prior duration and the
customer winback. This result indicates that companies should not target at the customers
who have the extremely short and long duration with them but focus on the customer
with medium duration.
Table 9 Variables predicting all 4 events
Variables Converted NonChurn NonConversio Churned

n Winback Winback
AcqSelfcare_dummy -0.5640* 0.8468* -0.7260* 0.679*
Acquisition_duration 0.00593* 0.00386* 0.0112* 0.00241
Acquisition_duration2 -0.00088* 0.00226* -0.00261* -0.00075*
85
RADIO_TOTAL -0.00546* 0.000638* 0.0031* -.00046
Competition -0.5024 -0.0489 -8.2520 -0.2573
Cost_price_fee 0.3325 -0.0988 -7.7211 -0.0824
Free_ended -0.0684* -0.1387* -0.5755* -0.0233
NonPay -0.2629* -0.1011* -0.2417 -0.0426
Non_Usage -0.3807* -0.1042* -7.5816 -0.0903
Product_Dissatisfactio -0.2973 -0.0774 -7.9545 0.0275
n
Service_Dissatisfaction 0.0439 -0.1337* -1.2322 -.00275
Uncontrollable_reason -0.2351* -0.1283* -0.2361 -0.0621
Unknown_reason -0.1432* -0.1233* -0.5982* -0.1003
*p<.05
86
CHAPTER 7
CONCLUSIONS, LIMITATIONS AND FUTURE DIRECTIONS
This study aims at exploring how different factors play roles during the three
stages of CRM by identifying the most influential factors for each stage, namely
acquisition, retention and winback of lost customers, and examining how each factor’s
marginal effect evolve in shaping customers’ decision making of conversion, churn and
reactivation of service with the company. With the grant of a big CRM data from a large
multi-media company in the country through WCAI, which covers all three stage of
CRM activities as well as the text data from call center notes, this dissertation examines
how customer characteristics variable, marketing action variables, psycholinguistic
variables from the text mining of call center notes, as well as the customer self-reported
deactivation reasons impact customers’ decision making through three stages of CRM of
whether to convert to a paid customer, to churn, or to reactivate their service with the
firm. In the mean time, applying a new machine learning method – random forest with its
variable importance ranking, this dissertation also shed light on which factors impact
customer conversion, churn and winback most among all the factors from both customer
side and firm side. In summary, this study obtained the following findings.
Findings and Implications
This study reveals that: 1) during the trial period of CRM, the content of the
customer call center notes are the top indicator of whether customers would convert or
not; 2) after customer have become paid customers, the marketing actions that the
company takes, which include the volume they communicate with the customer, the
87
depth of the promotion they offer to each customer, the communication channels they
apply to reach out customers, as well as the radio density in different areas, all play major
roles in retaining customers and preventing them from churn; 3) the factors that influence
the most whether the company can win back the lost customers who only tried their
products but never converted to paid customers are the same ones that affect conversion
most; and the factors that are most predictive of churn or no-churn are also highly
foretelling of company’s chance of recapturing the churned paid customers; 4) customer
self-reported deactivation reasons are not top predictors compared with other factors
throughout all three stages of CRM. Since all deactivation reasons bear the negative
impact that could affect company’s CRM and it is company-specific, so the no effect of
these factors could be explained by the lack of service dissatisfaction or product
dissatisfaction or competition issue for the specific company I study with. 5) Customer
usage of self-care service increases the likelihood of customer retention and winning back
churned customers but decreases the likelihood of customer conversion and winning back
unconverted customers. 6) both acquisition and retention durations affect all three events
of conversion, churn and winback and they both have nonlinear relationship with the
probability of these three events. 7) The multiplicative effect of marketing
communications that were discovered by prior researchers is not significant in affecting
customer conversion, churn or winback. On the contrary, the additive effect of
communication channels plays a major role in shaping customers’ decision making of
conversion, churn and winback.

88
As companies strive to cut costs and to maximize their profit by acquiring as
many customers as possible and retain them as long as they can, this study helps them to
identify which factors are more important in shaping customers’ decision making of
whether to stay with the company or terminate the service can help company better
allocate their resources. In addition, conducting a comprehensive analysis of CRM in
terms of data range and variety, as well as the possible factors that impact customer
acquisition, retention and winback, largely enriched the CRM literature and especially the
understudied winback literature. However, this study bears some limitations.
Limitations and Future Directions
There are several limitations that this study bears: first, this study only focuses on
predicting whether customer would convert, churn or be won back, but did not have
apply the same factors to calculate customer’s lifetime value which is what the CRM in
each company is most concerned with. Therefore, one future direction is to extend the
current research to customer lifetime value analysis. Second, due to time limitation, this
study could not conduct real time series study with machine learning methods but merely
divided the data into two time frames, namely within and beyond 60 days of trial and
paid period. Hence the second possible direction is to explore machine learning methods
that could deal with the real time series data. Third, the psycholinguistic analysis of the
customer’s conversion, churn and winback is limited and I included all 64 LIWC
variables into the prediction of conversion, churn and winback. A possible extension is to
only select certain variables from the 64 LIWC variables and examine how different
psycholinguistic factors evolve throughout the three stages of CRM and their impact on
89
customers’ decision making. Finally, cross-selling or cross-buying of customers is
another important aspect in CRM, but I only used the data for customers who owns one
vehicle. Therefore, including the customers with more than one vehicle into analysis of
cross-buying can be another direction for my future research.
Summary
Even though CRM research has flourished in marketing with the availability of
CRM database, none of the existing research has examined all three stages of CRM
together and how factors’ effect evolve throughout customer acquisition, retention and
winback. Extremely under researched is the customer winback but as studies have shown
winback can greatly cut company’s cost and achieve greater return. This dissertation
enriched the literature in CRM and makes managerial contribution to companies’
customer relationship management.

90
REFERENCES
Aitchison, J., and S. D. Silvey, (1957). “The Generalization of Probit Analysis to the
Case of Multiple Responses,” Biometrika. 57, 253–262.
Albert, J.H, and S. Chib, (1993). “Bayesian Analysis of Binary and Polychotomous
Response Data,” Journal of the American Statistical Association. 88(422):669–679.
Allenby, Greg M., Robert P. Leone, et al. (1999). “A Dynamic Model of Purchase
Timing with Application to Direct Marketing,” Journal of American Statistical
Association. 94 (June). 365–74.
Allwein, E., R. Schapire, et al. (2000). “Reducing multiclass to binary: a unifying

approach for margin classifiers,” Journal of Machine Learning Research. 1, 113–141.
Amit, Y. and Geman, D. (1997). “Shape quantization and recognition with randomized
trees,” Neural Computation. 9, 1545–1588.
Anderson, Eric and Duncan Simester (2004), "Long-Run Effects of Promotional Depth
on New Versus Established Customers: Three Field Studies," Marketing Science, 23 (1),
4-21.
Andrews, R. L., A. Ainslie, et al. (2002). “An empirical comparison of logit choice
models with discrete versus continuous representations of heterogeneity,” Journal of
Marketing Research. 39(4):479–487.
Bake, L. D. and A. K. McCallum, (1998). “Distribustional clustering of words for text

classification,” In Proceedings of SIGIR–98, 21ST ACM International Conference on
Research and Development in Information Retrieval. 96–103.
Bass, F. M. (1969). “A new product growth for model consumer durables,” Management
Science. 15(5):215–227.
Bass, F. M., N. Bruce, et al. (2007). “Wearout effects of different advertising themes: A
dynamic Bayesian model of the advertising–sales relationship,” Marketing Science.
26(2):179–195.
Berger, P., T. Magliozzi. (1992). “The effect of sample size and proportion of buyers in
the sample on the performance of list segmentation equations generated by regression
analysis,” Journal of Direct Marketing. 6(1):13–22.
Blattberg, Robert C. and John Deighton (1996). “Manage Marketing by the Customer
Equity Test,” Harvard Business Review. 74 (July/August). 13 6–44.
91
Bock, H–H., (2002). The Goal of Classification. Handbook of Data Mining and
Knowledge Discovery. 254–258, Oxford University Press.
Bolton, R. N. (1998). “A dynamic model of the duration of the customer's relationship

with a continuous service provider: The role of satisfaction,” Marketing Science.
17(1):45–65.
Bottou, L. C. Cortes, J. Denker, et al. (1994). “Comparison of classifier methods: A case

study in handwriting digit recognition,” in Proc. Int. Conf. Pattern Recognition. 77–87.
Boutell, M. R., J. Luo, et al. (2004). “Learning multi–label scene classification,” Pattern
Recognition. 37(9):1757–1771.
Boulding, W., A. Kalra, et al. (1993). “A Dynamic Process Model of Service Quality –
from Expectations to Behavioral Intentions,” Journal of Marketing Research 30(1):7–27.
Boyer, K.K., Hallowell, R., et al. (2002). “E–Services: operating strategy—a case study
and a method for analyzing operational benefits, “ Journal of Operations Management.
20 (2):175–188.
Brank J., Grobelnik M., et al. (2002). “Interaction of Feature Selection Methods and
Linear Classification Models,” Proc. of the 19th International Conference on Machine
Learning, Australia, 2002.
Braun, M. and D. A. Schweidel (2011). “Modeling Customer Lifetimes with Multiple

Causes of Churn,” Marketing Science 30(5):881–902.
Breiman, Leo (1996). “Bagging predictors,” Machine Learning. 24(2):123–140.
Breiman, Leo (1998). “Arcing Classifiers,” The Annals of Statistics. 26(3):801–849.
Breiman, Leo (2001). “Random Forests,” Machine Learning. 45(1):5–32.
Berger, Paul, Ruth Bolton, Douglas Bowman, Elten Briggs, V. Kumer, A. Parasuraman,
and Creed Terry (2002), "Marketing Actions and the Value of Customer Assets: A
Framework for Customer Asset Management," Journal of Service Research, 5 (1), 39-55
Cecchini, M., H. Aytug, et al. (2010). “Detecting Management Fraud in Public

Companies,” Management Science. 56(7):1146–1160.
Campbell, D. and F. Frei (2010). “Cost Structure, Customer Profitability, and Retention
Implications of Self–Service Distribution Channels: Evidence from Customer Behavior
in an Online Banking Channel,” Management Science 56(1):4–24.
92
Crammer, K. and Y. Singer (2000). “On the learnability and design of output codes for
multiclass problems,” Comput. Learing Theory. 35–46
Cui, D. P. and D. Curry (2005). “Prediction in marketing using the support vector
machine,” Marketing Science, 24(4):595–615.
Cui, G., M. L. Wong, et al. (2006). “Machine learning for direct marketing response
models: Bayesian networks with evolutionary programming,” Management Science,
52(4):597–612.
Danaher, P. J. (2002). “Optimal pricing of new subscription services: Analysis of a

market experiment,” Marketing Science. 21(2):119–138.
Darden, W. R. and W. D. Perreault, (1977). “Classification for Market–Segmentation –

Improved Linear–Model for Solving Problems of Arbitrary Origin,” Management
Science. 24(3):259–271.
Davis, D. 1989. Perceived usefulness, perceived ease of use and user acceptance of
information technology. MIS Quart. 13(3).318–339.
Davis, D., R. P. Bagozzi, P. R. Warshaw (1989). User acceptance of computer

technology: A comparison of two theoretical models. Management Science, 35(8).982–
1003.
Deerwester, S., S. T. Dumais, et al. (1990). “Indexing by latent semantic indexing,”

Journal of the American Society for Information Science. 41(6):391–407.
Dillon, W. R. (1979). “Performance of the Linear Discriminant Function in Non–Optimal

Situations and the Estimation of Classification Error Rates – Review of Recent Findings,”
Journal of Marketing Research. 16(3):370–381.
Dillon, W. R., M. Goldstein, et al. (1978). “Appropriateness of Linear Discriminant and

Multinomial Classification Analysis in Marketing–Research,” Journal of Marketing
Research. 15(1):103–112.
Elisseeff, A., and J. Weston, (2002). “A kernel method for multi–labelled classification,”
Advances in Neural Information Processing Systems. 14.
Fader, P. S., B. G. S. Hardie, K. L. Lee, (2005). “RFM and CLV: Using iso–value curves
for customer base analysis,” Journal of Marketing Research, 42(4):415–430.
Fitzsimmons, J.A. and M.J. Fitzsimmons. (2004). Service Management: Operations,

Strategy, and Information Technology. Irwin–McGraw Hill, New York, NY.
93
Fletcher, Keith, and Colin Wheeler, and Julia Wright (1992). “Success in Database
Marketing: Some Critical Factors,” Marketing Intelligence & Planning. 10, 18–23.
Forman, G., (2003). “An Experimental Study of Feature Selection Metrics for Text
Categorization,” Journal of Machine Learning Research. 3:1289–1305.
Friedman J. H. (1996). “Another Approach to Polychotomous Classification”. Dept.

Statist., Stanford University., Stanford, CA. [Online]. Available: http://www–
stat.stanford.edu/reports/friedman/poly.ps.Z.
Friedman J. H. (1997). “Data Mining and Statistics: What’s the connection?” Available at:
http://www.stat.standford.edu/~jhf/.
Friedman J, Hastie T, Tibshirani R (2000). “Additive Logistic Regression: A Statistical
View of Boosting,” The Annals of Statistics. 28(2):337–407.
Friedman J. H. (2001). “Greedy Function Approximation: A Gradient Boosting Machine,”
The Annals of Statistics. 29(5):1189–1232.
Friedman J (2002). “Stochastic Gradient Boosting,” Computational Statistics & Data
Analysis, 38(4):367–378. doi:10.1016/S0167–9473(01)00065–2.
Freund Y, and R. Schapire. (1996). “Experiments with a New Boosting Algorithm,” In
International Conference on Machine Learning. 148–156.
Freund, Yoav and Robert E. Schapire (1997); “A Decision–Theoretic Generalization of
On–Line Learning and an Application to Boosting,” Journal of Computer and System
Sciences. 55(1):119–139
Froehle C., A. Roth. 2004. New measurement scales for evaluating perceptions of the
technology-mediated customer service experience. Journal of Operations. Management
22(1) 1–21.
Ganesh, J., M. J. Arnold, et al. (2000). “Understanding the customer base of service
providers: An examination of the differences between switchers and stayers,” Journal of
Marketing. 64(3):65–87.
Gelman, Andrew, J. B. Carlin, H.S. Stern, and D. B. Rubin (2004) Bayesian Data
Analysis, Second Edition, Chapman & Hall/CRC.
Ghose, A. and P. G. Ipeirotis (2011). “Estimating the Helpfulness and Economic Impact
of Product Reviews: Mining Text and Reviewer Characteristics,” IEEE Transactions on
Knowledge and Data Engineering. 23(10):1498–1512.
94
Godbole, S., and Sarawagi, S. (2004). “Dis¬criminative methods for multi–labeled

classification,” Proceedings of the 8th Pacific–Asia Conference on Knowledge Discovery
and Data Mining.
Godfrey, A., K. Seiders, et al. (2011). “Enough Is Enough! The Fine Line in Executing
Multichannel Relational Communication,” Journal of Marketing. 75(4):94–109.
Goolsbee, A., P. J. Klenow. (2002). “Evidence on learning and network externalities in

the diffusion of home computers,” Journal of Law Economics. 45(2):317–343.
Grewal, R., M. Chandrashekaran, et al. (2010). “Customer Satisfaction Heterogeneity and

Shareholder Value,” Journal of Marketing Research. 47(4):612–626.
Griffin, Jill and Michael W. Lowenstein( 2001). Customer Win–back: How to Recapture
Lost Customers–And Keep Them Loyal. San Francisco: Jossey–Bass.
Guadagni, P. M., J. D. C. Little. (1983). “A logit model of brand choice calibrated on

scanner data,” Marketing Science. 2(3).203–238.
Gupta, S., D. R. Lehmann, et al. (2004). “Valuing customers,” Journal of Marketing

Research. 41(1):7–18.
Gurland, J., I. Lee, et al. (1960). “Polychotomous Quantal Response in Biological Assay,”
Biometrics. 16, 382–398.
Gustafsson, A., M. D. Johnson, et al. (2005). “The effects of customer satisfaction,

relationship commitment dimensions, and triggers on customer retention,” Journal of
Marketing. 69(4):210–218.
Haenlein, M., A. M. Kaplan, D. Schoder, (2006). “Valuing the real option of abandoning
unprofitable customers when calculating customer lifetime value,” Journal of Marketing.
70(3):5–20.
Hansotia, Behram and Paul Wang( 1997). “Analytical Challenges in Customer

Acquisition,” Journal of Interactive Marketing. 11 (Spring). 7–19.
Hastie, Trevor, Robert Tibshirani, Jerome Friedman (2009). The Elements of Statistical
Learning, Second Edition, Springer.
Hayes, P. J., P. M. Andersen, et al. (1990). “Tcs: a shell for content–based text
categorization,” In Proceedings of CAIA–90, 6th IEEE Conference on Artificial
Intelligence Applications. 320–326.
95
Hill, T., M. OConnor, et al. (1996). “Neural network models for time series forecasts,”
Management Science. 42(7):1082–1092.
Hill, A.V., Collier, D.A., Froehle, et al. (2002),”Research opportunities in service process
design,” Journal of Operations Management. 20(2):189–202.
Hitt, L. M. and F. X. Frei (2002). “Do better customers utilize electronic distribution
channels? The case of PC banking,” Management Science. 48(6):732–748.
Ho, T. K. (1998). “The random subspace method for constructing decision forests,” IEEE
Trans. on Pattern Analysis and Machine Intelligence. 20(8). 832–844.
Hogan, John E., Donald R. Lehmann, Mario Merino, Rajendra K. Srivastava, Jacquelyn
S . Thomas, and Peter C. Verhoef (2002). “Linking Customer Assets to Financial
Performance,” Journal of Service Research. 5(August). 26–38.
Hornik, K., M. B. Stinchcombe, et al. (1989). “Multilayer feedforward networks are

universal approximators,” Neural Networks. 2: 359–366.
Jim Rosenfield (2002). “Customer relationship management: a brief history, and a big
mystery” Australasian Business Intelligence. Mar 31, 2002.
Kantor, J. R. (1953). The Logic of Modern Science. Bloomington IN: Principle Press.
Keaveney, S. M. (1995). “Customer Switching Behavior in–Service Industries – an

Exploratory Study,” Journal of Marketing. 59(2):71–82.
Kemeny, J. G. (1959). A Philosopher Looks at Science. New York: Van Norstand.
Kim, B. D., M. Shi, et al. (2001). “Reward programs and tacit collusion,” Marketing
Science. 20(2):99–120.
Kim, S. P., T. Pantel, et al. “Automatically Assessing Review Helpfulness,” Proceedings

of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP
2006). 423–430,
Kim, Y., W. N. Street, et al. (2005). “Customer targeting: A neural network approach
guided by genetic algorithms,” Management Science. 51(2):264–276.
Kinnear, T. C. and J. R. Taylor, (1971). “Multivariate Methods in Marketing Research –

Further Attempt at Classification,” Journal of Marketing. 35(4):56–59.
Kivetz, R. and I. Simonson (2002). “Earning the right to indulge: Effort as a determinant
of customer preferences toward frequency program rewards,” Journal of Marketing
Research. 39(2):155–170.
96
Kivetz, R. and I. Simonson (2003). “The idiosyncratic fit heuristic: Effort advantage as a
determinant of consumer response to loyalty programs,” Journal of Marketing Research.
40(4):454–467.
Knerr, S., L. Personnaz, et al. (1990). Single–layer learning revisited: A stepwise

procedure for building and training a neural network, Neurocomputing: Algorithm,
Architectures and Applications, J. Fogelman, Ed. New York: Springer–Verlag, 1990.
Kreßel, U., (1999). Pairwise Classification and Support Vector Machines, in Advances in
Kernel Methods–Support Vector Learning, B. Schölkopf, C. J. C. Burges, and A. J.
Smola, Eds. Cambridge, MA: MIT Press, 1999, 255–268.
Lemmens, A. and C. Croux (2006). “Bagging and boosting classification trees to predict
churn,” Journal of Marketing Research. 43(2):276–286.
Lewis, D. D. (1992). “An evaluation of phrasal and clustered representations on a text

categorization task,” In Proceedings of SIGIR–92, 15th ACM International Conference
on Research and Development in Information Retrieval (Kobenhavn, DK, 1992). 37–50.
Lewis, M. (2004). “The influence of loyalty programs and short–term promotions on
customer retention,” Journal of Marketing Research. 41(3):281–292
Lewis, M. (2006). “Customer acquisition promotions and customer asset value,” Journal
of Marketing Research. 43(2):195–203.
Li, T., & Ogihara, M. (2003). Detecting emotion in music. Proceedings of the
International Symposium on Music Information Retrieval.
Li, Y. H. and A. K. Jain, (1998). “Classification of text documents,” The Computer

Journal. 41(8):537–546.
Liittschwager, J. M. and C. Wang, (1978). “Integer Programming Solution of a

Classification Problem,” Management Science. 24(14):1515–1525.
Luo, X., & Zincir–Heywood, A. N. (2005). “Evaluation of two systems on multi–class

multi–label document classifica¬tion,” Proceedings of the 15th Interna¬tional
Symposium on Methodologies for Intelligent Systems.
Mahajan, Vijay, Paul E. Green, S. M. Goldberg (1982). “A conjoint model for measuring
self– and cross–price/demand relationships,” Journal of Marketing Research. 19(3). 334–
342.
Mahajan, V., E. Muller, R. A. Kerin. (1984). “Introduction Strategy for New Products
with Positive and Negative Word–of–Mouth,” Management Science. 30(12):1389–1404.
97
Malakooti, B. and Y. Q. Zhou, (1994). “Feedforward Artificial Neural Networks for

Solving Discrete Multiple Criteria Decision–Making Problems,” Management Science
40(11):1542–1561.
Mangasarian, O. L., and D. R. Musicant. (1999). “Successive overrelaxation for support
vector machines,” IEEE Transactions on Neural Networks. 10(5):1032–1037.
Maron, O., and Lozano–Perez, T. (1997). “A framework for multiple–instance learning,”

Proceedings of Neural In¬formation Processing Systems. (NIPS 1997).
McCullagh, P. (1980). “Regression Models for Ordinal Data,” Journal of the Royal
Statistical Society. Ser. B, 109–142.
McCullagh, P., and J. Nelder. (1983). Generalized Linear Models. Chapman and Hall,
London.
McGahan, A. M. and P. Ghemawat (1994). “Competition to Retain Customers,”

Marketing Science. 13(2):165–176.
McKelvey, R., and Zavoina, W. (1975). “A Statistical Model for the Analysis of Ordingal
Level Dependent Variables,” Journal of Mathematical Sociology. 4:103–120.
Mitchell, T.M. (1996). Machine Learning. McGraw Hill, New York, NY.
Musalem, A. and Y. V. Joshi (2009). “How Much Should You Invest in Each Customer
Relationship? A Competitive Strategic Approach,” Marketing Science. 28(3):555–565.
Nahshon Wingard, “CRM History: The Evolution Of Better Customer Service,”
http://www.streetdirectory.com/.
Nam, S., P. Manchanda, et al. (2010). “The Effect of Signal Quality and Contiguous
Word of Mouth on Customer Acquisition for a Video–on–Demand Service,” Marketing
Science. 29(4):690–700.
Norton, J. A. and F. M. Bass (1987). “A Diffusion–Theory Model of Adoption and

Substitution for Successive Generations of High–Technology Products,” Management
Science 33(9):1069–1086.
Oliveira, P., A.V. Roth, et al. (2002). “Achieving competitive capabilities in E–services,”
Technological Forecasting and Social Change. 69(7):721–731.
Petrison, Lisa A., Robert C. Blattberg, et al. (1997). “Database Marketing Past, Present,
and Future,” Journal of Direct Marketing. 7(3):27–43.
98
Piramuthu, S., H. Ragavan, et al. (1998). “Using feature construction to improve the
performance of neural networks,” Management Science. 44(3):416–430.
Reinartz, W., J. S. Thomas, V. Kumar. (2005). “Balancing acquisition and retention

resources to maximize customer profitability,” Journal of Marketing. 69(1):63–79.
Reinartz, W. J. and V. Kumar (2003). “The impact of customer relationship

characteristics on profitable lifetime duration,” Journal of Marketing. 67(1):77–99.
Ridgeway G (2006). gbm: Generalized Boosted Regression Models. R package version

1.5–7, URL http://www.i–pensieri.com/gregr/gbm.shtml.
Riedmiller, M., and H. Braun. (1993). “A direct method for faster backpropagation
learning: the rprop algorithm,” Proceedings of the IEEE International Conference on
Neural Networks (ICNN). 1:586–591.
Ripley, B. D. (1996). Pattern recognition and neural networks. Cambridge University

Press.
Roehm, M. L., E. B. Pullins, et al. (2002). “Designing loyalty–building programs for

packaged goods brands,” Journal of Marketing Research. 39(2):202–213.
Rogers, E. M. (1995). Diffusion of Innovations, 4th ed. Free Press, New York.
Rosenblatt, F. (1958). “The Perceptron: A Probabilistic Model for Information Storage

and Organization in the Brain,” Psychological Review. 65: 386–408.
Roth, A.V. (2000). Service Strategy and the Technological Revolution: The 7 Myths of
E–Services. In: Machuca, J.A.D., Mandakovic, T. (Eds.). POM Facing the New
Millennium:
Evaluating the Past, Leading with the Present and Planning the Future of Operations.
POM Sevilla, 159–168.
Rust, RolandT ., ValarieA Zeithaml, and KatherineN Lemon (2004). “Return on

Marketing: Using Customer Equity to Focus Marketing Strategy,” Journal of Marketing.
68 (January). 23–53.
Rosset, S., Ji Zhu et al. (2004). “Boosting as a regularized path to a maximum margin
classifier”. Journal of Machine Learning Research. 5:941–973.
Rossi, P. E. and G. M. Allenby (1993). “A Bayesian–Approach to Estimating Household
Parameters,” Journal of Marketing Research. 30(2):171–182.
Rossi, P. E., G. M. Allenby, et al. (2005). Bayesian Statistics and Marketing. John Wiley
& Sons Ltd.
99
Salton, G. and Buckley, C. (1988). “Term–weighting approaches in automatic text

retrieval”. Inform. Process. Man. 24(5):513–523. Also reprinted in Sparck Jones and
Willett [1997], 323–328.
Salton, G. and M. J. McGill, (1983). Introduction toModern Information Retrieval.

McGraw Hill.
Schapire, R. (1997). “Using output codes to boost multiclass learning problems,” in

Proceedings of the Fourteenth International Conference on Machine Learning. Morgan
Kauffman.
Schapire, R. and Singer, Y. (1999). “Improved boosting algorithms using confidence–

rated prediction,” Machine Learning 37: 297–336.
Schweidel, D. A., Peter. S. Fader, et al. (2008). “A Bivariate Timing Model of Customer
Acquisition and Retention,” Marketing Science. 27(5):829–843.
Sebastiani F. (2002). “Machine Learning in Automated Text Categorization,” ACM

Computing Surveys. 34 (1):1–47.
Shaw, R. and Stone, M. Database Marketing. New York: John Wiley & Sons, 1988.
Slonim, N. and N. Tishby, (2001). “The power of word clusters for text classification,” In
Proceedings of ECIR–01, 23rd European Colloquium on Information Retrieval Research
(Darmstadt, DE, 2001).
Soucy P. and Mineau G., (2003). “Feature Selection Strategies for Text Categorization,”
AI 2003, LNAI 2671, 505–509.
Sousa P., Pimentao J. P., et al. (2003). “Feature Selection Algorithms to Improve
Documents Classification Performance,” LNAI 2663, 288–296.
Stauss, Bernd and Christian Friege (1999). “Regaining Service Customers,” Journal of
Service Research. 1(4). 347–61.
Sun, M. H., A. Stam, et al. (1996). “Solving multiple objective programming problems
using feed–forward artificial neural networks: The Interactive FFANN Procedure,”
Management Science. 42(6):835–849.
Tellis, G. J. and F. S. Zufryden (1995). “Tackling the Retailer Decision Maze – Which
Brands to Discount, How Much, When and Why,” Marketing Science. 14(3):271–299.
100
Thieme, R. J., M. Song, et al. (2000). “Artificial neural network decision support systems
for new product development project selection,” Journal of Marketing Research.
37(4):499–507.
Thomas, J. S. (2001). “A methodology for linking customer acquisition to customer

retention,” Journal of Marketing Research. 38(2):262–268.
Thomas, J. S., R. C. Blattberg, E. J. Fox, (2004). “Recapturing lost customers,” Journal of

Marketing Research. 41(1):31–45.
Tibshirani, R. (1996). “Regression Shrinkage and Selection via the Lasso,” Journal of the
Royal Statistical Society. Series B (Methodological). 58(1):267–288.
Torkkola K. (2002). “Discriminative Features for Text Document Classification,” Proc.

International Conference on Pattern Recognition, Canada.
Tsoumakas, G. (2007). “Multi–label classification: an overview,” International Joural of

Data Warehousing & Mining. 3(3):1–13.
Vapnik, V. (1998).Statistical Learning Theory. New York: Wiley, 1998.
Venkatesan, R. and V. Kumar (2004). “A Customer Lifetime Value Framework for

Customer Selection and Resource Allocation Strategy,” Journal of Marketing. 68
(October). 106–125.
Villanueva, J., S. Yoo, et al. (2008). “The impact of marketing–induced versus word–of–
mouth customer acquisition on customer equity growth,” Journal of Marketing Research.
45(1):48–59.
Walf, A. (1926). Essentials of Scientific Method. New York. Macmillan Company.
Wang, S. H. (1995). “The Unpredictability of Standard Back–Propagation Neural

Networks in Classification Applications,” Management Science. 41(3):555–559.
Wedel, M., W. A. Kamakura, et al. (2000). “Marketing data, models and decisions,”
International Journal of Research in Marketing. 17(2–3). 203–208.
West, P. M., P. L. Brockett, et al. (1997). “A comparative analysis of neural networks and
statistical methods for predicting consumer choice,” Marketing Science. 16(4):370–391.
Weston, J. and C. Watkins, (1999). “Multi–class support vector machines,” the Proc.
ESANN99. M. Verleysen, Ed., Brussels, Belgium, 1999.
Xue, M., L. M. Hitt, et al. (2011). “Determinants and Outcomes of Internet Banking
Adoption,” Management Science. 57(2):291–307.
101
Yang, S., G. M. Allenby, et al. (2002). “Modeling variation in brand preference: The
roles of objective environment and motivating conditions,” Marketing Science. 21(1):14–
31.
Zeithaml, Valarie A., Berry, Leonard L. & Parasuraman, A. (1996). “The Behavioral
Consequences of Service Quality,” Journal of Marketing, 60, 31-46.
Zeithaml, V.A., A. Parasuraman, et al. (2002). “Service quality delivery through web
sites: a critical review of extant knowledge,” Journal of the Academy of Marketing
Science. 30(4):362–375.
Zhang, M.–L., and Z.–H. Zhou, (2005). “A k–nearest neighbor based algorithm for
multi–label classification,” Proceedings of the 1st IEEEInternational Confer¬ence on
Granular Computing.
Zhu, J., H. Zou, S. Rosset, et al. (2009). “Multi–class AdaBoost,” Statistics and Its
Interface. 2:349–360.
Zhu, K. and K. L. Kraemer (2005). “Post–adoption variations in usage and value of e–

business by organizations: Cross–country evidence from the retail industry,” Information
Systems Research. 16(1):61–84.
Zhu, K., K. L. Kraemer, S. Xu (2003). “Electronic business adoption by European firms:

A cross–country assessment of the facilitators and inhibitors,” European Journal of
Information Systems. 12(4):251–268.
Zou, H., and T. Hastie, (2005). “Regularization and variable selection via the elastic net,”
J.R. Statist. Soc B. 67(2):301–320.
Zou, H., J. Zhu, and T. Hastie, (2008). “New Multicategory Boosting Algorithms Based
on Multicategory Fisher–Consistent Losses,” The Annals of Applied Statistics.
2(4):1290–1306.
102
Appendices:
Appendix A: LIWC2007 Output Variable Information
Category Abbrev Examples Words in category
Linguistic Processes
Total function words funct 464
Total pronouns pronoun I, them, itself 116
Personal pronouns ppron I, them, her 70
Impersonal pronouns ipron It, it’s, those 46
Articles article A, an, the 3
Common verbs verb Walk, went, see 383
Auxiliary verbs auxverb Am, will, have 144
Past tense past Went, ran, had 145
Present tense present Is, does, hear 169
Future tense future Will, gonna 48
Adverbs adverb Very, really, quickly 69
Prepositions prep To, with, above 60
Conjunctions conj And, but, whereas 28
Negations negate No, not, never 57
Quantifiers quant Few, many, much 89
Numbers number Second, thousand 34
Swear words swear Damn, piss, fuck 53
Psychological Processes
Social processes social Mate, talk, they, child 455
Family family Daughter, husband, 64
aunt
Friends friend Buddy, friend, 37
neighbor
Humans human Adult, baby, boy 61
Affective processes affect Happy, cried, abandon 915
Positive emotion posemo Love, nice, sweet 406
Negative emotion negemo Hurt, ugly, nasty 499
Anxiety anx Worried, fearful, 91
nervous
Anger anger Hate, kill, annoyed 184
Sadness sad Crying, grief, sad 101
Cognitive processes cogmech cause, know, ought 730
Insight insight think, know, consider 195
Causation cause because, effect, hence 108
Discrepancy discrep should, would, could 76
Tentative tentat maybe, perhaps, guess 155
Certainty certain always, never 83
Inhibition inhib block, constrain, stop 111
Inclusive incl And, with, include 18
103
(Appendix A, continued)
Exclusive excl But, without, exclude 17
Perceptual processes percept Observing, heard, 273
feeling
See see View, saw, seen 72
Hear hear Listen, hearing 51
Feel feel Feels, touch 75
Biological processes bio Eat, blood, pain 567
Body body Cheek, hands, spit 180
Health health Clinic, flu, pill 236
Sexual sexual Horny, love, incest 96
Ingestion ingest Dish, eat, pizza 111
Relativity relativ Area, bend, exit, stop 638
Motion motion Arrive, car, go 168
Space space Down, in, thin 220
Time time End, until, season 239
Personal Concerns
Work work Job, majors, xerox 327
Achievement achieve Earn, hero, win 186
Leisure leisure Cook, chat, movie 229
Home home Apartment, kitchen, 93
family
Money money Audit, cash, owe 173
Religion relig Altar, church, mosque 159
Death death Bury, coffin, kill 62
Spoken categories
Assent assent Agree, OK, yes 30
Nonfluencies nonflu Er, hm, umm 8
Fillers filler Blah, Imean, 9
youknow
104
Appendix B: Summary Statistics of Conversion table
Variable Mean Std Dev Minimum Maximum

RADIO_TOTAL 55.57648 20.28646 0 106
Acquisition_duration -0.00010 5.14175 -10.703 25.297
Acquisition_duration2 26.43733 37.17775 0 639.94
Unknown_reason 0.25696 0.43696 0 1
Free_ended 0.50762 0.49994 0 1
Uncontrollable_reason 0.00334 0.05767 0 1
Service_Dissatisfaction 0.00181 0.04249 0 1
NonPay 0.00259 0.05078 0 1
Non_Usage 0.00051 0.02262 0 1
Product_Dissatisfaction 0.00005 0.00714 0 1
Cost_price_fee 0.00005 0.00714 0 1
Competition 0.00001 0.00276 0 1
Acquis_Live_Agent60 0.04815 0.26629 0 10
Acquis_CUWI60 0.03363 0.26182 0 7
Acquis_IVR60 0.00038 0.02147 0 3
Acquis_Live_Agent_gt60 0.59940 0.82938 0 44
Acquis_CUWI_gt60 0.02305 0.22012 0 28
Acquis_IVR_gt60 0.00213 0.05469 0 4
Acquis_ChangeAcctInfo60 0.04900 0.28635 0 6
Acquis_Change_Service60 0.02609 0.18034 0 6
Acquis_Make_Payment60 0.00707 0.09589 0 7
Acquis_Make_Payment_gt60 0.02121 0.43458 0 44
Acquis_ChangeAcctInfo_gt60 0.02419 0.20928 0 8
Acquis_Change_Service_gt60 0.57919 0.65766 0 14
Acq_LiveAg_ChangeInfo60 0.01712 0.13579 0 5
Acq_LiveAg_ChangeSer60 0.02433 0.17185 0 6
Acq_LiveAg_MakePaymt60 0.00671 0.09186 0 6
Acq_CUWI_MakePaymt60 0.00031 0.01915 0 4
Acq_CUWI_ChangeInfo60 0.03183 0.24772 0 6
Acq_IVR_MakePaymt60 0.00006 0.01022 0 2
Acq_CUWI_ChangeSer60 0.00149 0.04110 0 4
Acq_IVR_ChangeSer60 0.00028 0.01719 0 2
Acq_IVR_ChangeInfo60 0.00005 0.00714 0 1
Acq_LiveAg_MakePaymt_gt60 0.02040 0.41930 0 44
Acq_CUWI_ChangeInfo_gt60 0.01553 0.17493 0 7
Acq_CUWI_ChangeSer_gt60 0.00690 0.09229 0 7
Acq_LiveAg_ChangeSer_gt60 0.57043 0.64800 0 14
Acq_CUWI_MakePaymt_gt60 0.00061 0.06012 0 28
Acq_LiveAg_ChangeInfo_gt60 0.00858 0.10710 0 7
Acq_IVR_MakePaymt_gt60 0.00020 0.02069 0 4
Acq_IVR_ChangeInfo_gt60 0.00008 0.00889 0 1
Acq_IVR_ChangeSer_gt60 0.00186 0.04777 0 4
Acquis_MDBcount60 3.18858 2.72539 0 27
Acquis_MDBcount_gt60 2.78640 3.37969 1 112
ACQUSI_EM60 0.86992 1.59616 0 15
105
(Appendix B, continued)
ACQUSI_DM60 1.22405 1.43420 0 19
ACQUSI_TELEM60 0.32842 0.57997 0 9
ACQUSI_EM_gt60 1.24364 2.66804 0 107
ACQUSI_TELEM_gt60 0.25627 0.49826 0 17
ACQUSI_DM_gt60 0.47473 1.06432 0 27
Trial60Response_Rate 0.11573 0.19670 0 1
Trial_gt60Response_Rate 0.04397 0.15938 0 1
ACQUSI_EMDM60 1.38454 3.86736 0 133
ACQUSI_EMTELEM60 0.45155 1.52871 0 36
ACQUSI_DMTELEM60 0.79814 1.72681 0 54
ACQUSI_EMDMTELEM60 1.14175 4.44625 0 192
ACQUSI_EMDM_gt60 0.83004 8.42157 0 1475
ACQUSI_EMTELEM_gt60 0.43117 4.28087 0 528
ACQUSI_DMTELEM_gt60 0.30342 1.53058 0 200
ACQUSI_EMDMTELEM_gt60 1.00711 29.45525 0 11800
funct_trial60 10.34647 15.64231 0 100
pronoun_trial60 1.26627 3.36484 0 66.67
ppron_trial60 0.71227 2.36294 0 50
i_trial60 0.21798 1.06868 0 33.33
we_trial60 0.03136 0.36947 0 22.22
you_trial60 0.00138 0.09301 0 20
shehe_trial60 0.44192 1.77311 0 50
they_trial60 0.01962 0.30250 0 18.18
ipron_trial60 0.55400 2.02742 0 66.67
article_trial60 0.97883 2.92367 0 100
verb_trial60 2.92114 5.94516 0 100
auxverb_trial60 0.94131 3.08434 0 100
past_trial60 1.77291 4.20530 0 100
present_trial60 0.92016 3.01119 0 80
future_trial60 0.09920 0.71509 0 50
adverb_trial60 0.32238 1.56397 0 50
preps_trial60 5.34505 8.87729 0 100
conj_trial60 0.82022 2.37829 0 50
negate_trial60 0.51022 2.30106 0 100
quant_trial60 0.17527 1.22178 0 100
number_trial60 0.37952 2.63339 0 100
swear_trial60 0.00034 0.04512 0 11.11
social_trial60 3.29225 6.50796 0 100
family_trial60 0.03959 0.55702 0 100
friend_trial60 0.01013 0.31052 0 33.33
humans_trial60 0.06555 0.68509 0 33.33
affect_trial60 0.79390 2.64456 0 100
posemo_trial60 0.62632 2.30134 0 100
negemo_trial60 0.16694 1.20066 0 100
anx_trial60 0.00520 0.15261 0 20
anger_trial60 0.01729 0.38307 0 50
sad_trial60 0.05432 0.68632 0 50
cogmech_trial60 3.93714 7.26039 0 100
106
insight_trial60 0.28747 1.53418 0 100
cause_trial60 1.29756 3.66394 0 100
discrep_trial60 0.28806 1.41740 0 50
tentat_trial60 0.15004 1.28118 0 66.67
certain_trial60 0.43543 1.89064 0 100
inhib_trial60 0.06256 0.76647 0 100
incl_trial60 0.88547 2.67024 0 66.67
excl_trial60 0.44044 2.13987 0 50
percept_trial60 0.16330 1.30880 0 66.67
see_trial60 0.01837 0.29805 0 33.33
hear_trial60 0.13837 1.24583 0 66.67
feel_trial60 0.00610 0.21409 0 33.33
bio_trial60 0.07071 0.66603 0 50
body_trial60 0.00259 0.14251 0 25
health_trial60 0.06586 0.63788 0 50
sexual_trial60 0.00202 0.10782 0 16.67
ingest_trial60 0.00074 0.08105 0 33.33
relativ_trial60 5.36336 9.63670 0 100
motion_trial60 1.09291 3.02618 0 100
space_trial60 1.72588 4.37158 0 100
time_trial60 2.37067 6.30618 0 100
work_trial60 2.07597 5.09205 0 100
achieve_trial60 0.95585 3.10516 0 66.67
leisure_trial60 0.94802 2.94667 0 100
home_trial60 0.04528 0.57335 0 100
money_trial60 2.00394 5.01842 0 100
relig_trial60 0.07981 1.09529 0 66.67
death_trial60 0.00136 0.09012 0 18.18
assent_trial60 0.04508 0.70340 0 66.67
nonfl_trial60 0.31737 1.41531 0 50
filler_trial60 0.00278 0.11551 0 20
funct_trial_gt60 3.45811 10.28507 0 100
pronoun_trial_gt60 0.43576 2.04467 0 50
ppron_trial_gt60 0.31826 1.60634 0 40
i_trial_gt60 0.08976 0.68009 0 25
we_trial_gt60 0.01311 0.23094 0 25
you_trial_gt60 0.00058 0.05437 0 11.11
shehe_trial_gt60 0.20678 1.23483 0 36.36
they_trial_gt60 0.00803 0.19307 0 28.57
ipron_trial_gt60 0.11750 0.82112 0 33.33
article_trial_gt60 0.36313 1.75678 0 40
verb_trial_gt60 1.25592 4.27054 0 100
auxverb_trial_gt60 0.36100 1.78616 0 100
past_trial_gt60 0.83085 3.18807 0 100
present_trial_gt60 0.33123 1.70857 0 80
future_trial_gt60 0.04611 0.48900 0 33.33
adverb_trial_gt60 0.11898 0.91611 0 66.67
preps_trial_gt60 1.70195 5.22045 0 66.67
107
conj_trial_gt60 0.34520 1.59682 0 28.57
negate_trial_gt60 0.18870 1.24766 0 50
quant_trial_gt60 0.04961 0.52623 0 50
number_trial_gt60 0.03358 0.50699 0 66.67
swear_trial_gt60 0.00033 0.05031 0 14.29
social_trial_gt60 1.37350 4.61078 0 100
family_trial_gt60 0.01416 0.33459 0 50
friend_trial_gt60 0.00135 0.08762 0 14.29
humans_trial_gt60 0.02353 0.36546 0 25
affect_trial_gt60 0.26965 1.66342 0 100
posemo_trial_gt60 0.20756 1.27674 0 100
negemo_trial_gt60 0.06155 1.00593 0 100
anx_trial_gt60 0.00150 0.07966 0 20
anger_trial_gt60 0.00991 0.26877 0 50
sad_trial_gt60 0.00978 0.27431 0 50
cogmech_trial_gt60 1.34394 4.56618 0 100
insight_trial_gt60 0.14073 1.12705 0 100
cause_trial_gt60 0.43551 2.01468 0 100
discrep_trial_gt60 0.10915 0.86927 0 50
tentat_trial_gt60 0.04354 0.64453 0 50
certain_trial_gt60 0.04762 0.52297 0 40
inhib_trial_gt60 0.02604 0.54513 0 100
incl_trial_gt60 0.35199 1.67204 0 50
excl_trial_gt60 0.16595 1.16322 0 50
percept_trial_gt60 0.05079 0.54957 0 50
see_trial_gt60 0.01308 0.27467 0 50
hear_trial_gt60 0.03486 0.44624 0 33.33
feel_trial_gt60 0.00273 0.11291 0 25
bio_trial_gt60 0.01933 0.34940 0 33.33
body_trial_gt60 0.00178 0.10239 0 14.29
health_trial_gt60 0.01661 0.32395 0 33.33
sexual_trial_gt60 0.00147 0.09458 0 20
ingest_trial_gt60 0.00029 0.05108 0 25
relativ_trial_gt60 1.81440 5.83534 0 100
motion_trial_gt60 0.46561 2.07815 0 75
space_trial_gt60 0.71179 2.65124 0 66.67
time_trial_gt60 0.56307 2.43138 0 100
work_trial_gt60 0.44059 2.25539 0 100
achieve_trial_gt60 0.18811 1.17087 0 66.67
leisure_trial_gt60 0.35511 1.67137 0 50
home_trial_gt60 0.04566 0.70325 0 50
money_trial_gt60 0.51242 2.51850 0 100
relig_trial_gt60 0.00542 0.21446 0 33.33
death_trial_gt60 0.00087 0.06666 0 11.11
assent_trial_gt60 0.01640 0.33489 0 25
nonfl_trial_gt60 0.12170 0.86152 0 50
filler_trial_gt60 0.00081 0.05155 0 12.5
AcqSelfcare_dummy 0.62240 0.48479 0 1
108
Appendix C: Summary Statistics of Churn table

RADIO_TOTAL 54.47975 20.28804 0 106
Acquisition_duration 0.02522 4.68418 -10.703 25.297
Retention_duration 0.02041 8.17695 -18.207 102.793
Retention_duration2 66.86294 207.95353 0 10566.4
Non_Usage 0.04623 0.20998 0 1
NonPay 0.09900 0.29866 0 1
Unknown_reason 0.04385 0.20476 0 1
Free_ended 0.00529 0.07251 0 1
Prepaid_period_ended 0.00813 0.08979 0 1
Cost_price_fee 0.00406 0.06362 0 1
Competition 0.00042 0.02058 0 1
Service_count 1.79962 1.00699 1 15
MC_Ave_Price 0.00215 3.37437 -11.67 88.32
MC2_Ave_Price 11.38510 61.20399 0 7800.42
Retent_Live_Agent60 0.80906 2.44931 0 79
Retent_CUWI60 0.09264 0.55942 0 26
Retent_IVR60 0.02102 0.22750 0 14
Retent_CUWI_gt60 0.10369 0.63672 0 63
Retent_Live_Agent_gt60 1.35447 3.91411 0 121
Retent_IVR_gt60 0.03178 0.32865 0 34
Retent_Change_Service60 0.32264 0.82794 0 20
Retent_Make_Payment60 0.46393 1.93774 0 76
Retent_ChangeAcctInfo60 0.13616 0.53019 0 18
Retent_ChangeAcctInfo_gt60 0.15419 0.58628 0 16
Retent_Change_Service_gt60 0.31917 0.79760 0 36
Retent_Make_Payment_gt60 1.01658 3.71397 0 121
Retent_LiveAg_ChangeSer60 0.28982 0.75787 0 20
Retent_LiveAg_MakePaymt60 0.43935 1.88183 0 76
Retent_CUWI_ChangeInfo60 0.05320 0.33804 0 17
Retent_CUWI_ChangeSer60 0.02375 0.20576 0 8
Retent_CUWI_MakePaymt60 0.01569 0.16185 0 21
Retent_LiveAg_ChangeInfo60 0.07988 0.35446 0 15
Retent_IVR_ChangeSer60 0.00907 0.10342 0 8
Retent_IVR_MakePaymt60 0.00888 0.15604 0 14
Retent_IVR_ChangeInfo60 0.00307 0.05839 0 4
Retent_CUWI_ChangeInfo_gt60 0.06175 0.38242 0 11
Retent_CUWI_ChangeSer_gt60 0.01886 0.19527 0 14
Retent_LiveAg_MakePaymt_gt60 0.97436 3.62871 0 121
Retent_CUWI_MakePaymt_gt60 0.02308 0.28635 0 62
Retent_LiveAg_ChangeInfo_gt60 0.08689 0.38124 0 12
Retent_LiveAg_ChangeSer_gt60 0.29322 0.73391 0 21
109
(Appendix C, continued)
Retent_IVR_ChangeSer_gt60 0.00709 0.09940 0 16
Retent_IVR_ChangeInfo_gt60 0.00554 0.07993 0 4
Retent_IVR_MakePaymt_gt60 0.01915 0.25915 0 34
funct_Churn60 12.66208 17.77446 0 100
pronoun_Churn60 1.75367 3.71579 0 50
ppron_Churn60 1.22364 2.76165 0 50
i_Churn60 0.32003 1.07588 0 26.32
we_Churn60 0.04972 0.39317 0 20
you_Churn60 0.00492 0.11816 0 15.38
shehe_Churn60 0.80135 2.12758 0 50
they_Churn60 0.04760 0.41779 0 38.46
ipron_Churn60 0.53003 1.55965 0 33.33
article_Churn60 1.45681 3.13394 0 42.86
verb_Churn60 4.19514 7.18431 0 100
auxverb_Churn60 1.75192 3.65209 0 100
past_Churn60 2.05181 4.39141 0 100
present_Churn60 1.69964 4.27259 0 100
future_Churn60 0.17304 0.79639 0 50
adverb_Churn60 0.48902 1.51933 0 50
preps_Churn60 5.28383 7.86707 0 66.67
conj_Churn60 1.02851 2.30112 0 40
negate_Churn60 1.08353 3.07817 0 100
quant_Churn60 0.42479 1.68596 0 66.67
number_Churn60 0.25901 1.09144 0 33.33
swear_Churn60 0.00062 0.05593 0 20
social_Churn60 4.19741 6.88559 0 100
family_Churn60 0.09647 0.68192 0 50
friend_Churn60 0.00609 0.17199 0 28.57
humans_Churn60 0.09992 0.71050 0 40
affect_Churn60 1.09466 2.85295 0 100
posemo_Churn60 0.85627 2.33540 0 100
negemo_Churn60 0.23618 1.47360 0 100
anx_Churn60 0.01309 0.18583 0 18.18
anger_Churn60 0.02775 0.55166 0 50
sad_Churn60 0.07512 0.75796 0 50
cogmech_Churn60 4.17810 6.97204 0 100
insight_Churn60 0.61368 2.07765 0 100
cause_Churn60 0.88602 2.46660 0 100
discrep_Churn60 0.46660 1.50688 0 50
tentat_Churn60 0.16579 0.96991 0 100
certain_Churn60 0.11737 0.73077 0 50
inhib_Churn60 0.17956 1.05385 0 100
incl_Churn60 0.88447 2.24823 0 66.67
excl_Churn60 0.75803 2.38607 0 50
percept_Churn60 0.41213 1.80707 0 66.67
see_Churn60 0.02724 0.39790 0 40
hear_Churn60 0.37665 1.74456 0 66.67
feel_Churn60 0.00761 0.16061 0 23.08
110
bio_Churn60 0.06941 0.66257 0 66.67
body_Churn60 0.00237 0.10139 0 22.22
health_Churn60 0.06420 0.64310 0 66.67
sexual_Churn60 0.00238 0.09951 0 20
ingest_Churn60 0.00090 0.05088 0 8.33
relativ_Churn60 4.50198 7.29048 0 100
motion_Churn60 0.72729 2.31177 0 100
space_Churn60 1.99210 3.74511 0 66.67
time_Churn60 1.69655 3.66511 0 100
work_Churn60 2.14742 4.60124 0 100
achieve_Churn60 0.32572 1.32823 0 100
leisure_Churn60 0.82532 2.13761 0 100
home_Churn60 0.06669 0.70229 0 50
money_Churn60 2.80244 5.76180 0 100
relig_Churn60 0.01998 0.35626 0 66.67
death_Churn60 0.01562 0.53583 0 80
assent_Churn60 0.02379 0.31848 0 50
nonfl_Churn60 0.38113 1.21258 0 50
filler_Churn60 0.00912 0.21116 0 40
funct_Churn_gt60 9.91040 16.38913 0 100
pronoun_Churn_gt60 1.42978 3.45653 0 60
ppron_Churn_gt60 1.03692 2.65430 0 60
i_Churn_gt60 0.25475 0.99569 0 50
we_Churn_gt60 0.05583 0.42662 0 22.22
you_Churn_gt60 0.00314 0.09919 0 16.33
shehe_Churn_gt60 0.69288 2.04906 0 50
they_Churn_gt60 0.03030 0.33728 0 18.18
ipron_Churn_gt60 0.39285 1.30591 0 33.33
article_Churn_gt60 1.29419 3.06111 0 66.67
verb_Churn_gt60 3.15886 6.08431 0 100
auxverb_Churn_gt60 1.20702 2.91155 0 100
past_Churn_gt60 1.77943 4.14342 0 100
present_Churn_gt60 1.05655 2.64037 0 80
future_Churn_gt60 0.14479 0.70681 0 33.33
adverb_Churn_gt60 0.44109 1.47309 0 50
preps_Churn_gt60 4.39429 7.65871 0 80
conj_Churn_gt60 0.87860 2.23931 0 33.33
negate_Churn_gt60 0.44647 1.46363 0 50
quant_Churn_gt60 0.22057 0.99770 0 50
number_Churn_gt60 0.18851 0.99394 0 66.67
swear_Churn_gt60 0.00037 0.03468 0 9.09
social_Churn_gt60 3.34020 6.34982 0 100
family_Churn_gt60 0.05746 0.49100 0 25
friend_Churn_gt60 0.00635 0.20772 0 66.67
humans_Churn_gt60 0.06811 0.63101 0 100
affect_Churn_gt60 1.04663 2.99821 0 100
posemo_Churn_gt60 0.87345 2.67134 0 100
negemo_Churn_gt60 0.17187 1.16691 0 100
111
anx_Churn_gt60 0.00954 0.15264 0 16.67
anger_Churn_gt60 0.02395 0.45496 0 50
sad_Churn_gt60 0.03000 0.41618 0 60
cogmech_Churn_gt60 3.37272 6.44217 0 100
insight_Churn_gt60 0.66727 2.22587 0 100
cause_Churn_gt60 0.78076 2.47541 0 100
discrep_Churn_gt60 0.33629 1.27911 0 50
tentat_Churn_gt60 0.15554 1.00690 0 100
certain_Churn_gt60 0.10715 0.68747 0 50
inhib_Churn_gt60 0.18815 1.21173 0 100
incl_Churn_gt60 0.74504 2.09983 0 66.67
excl_Churn_gt60 0.34085 1.23972 0 50
percept_Churn_gt60 0.14176 0.80032 0 50
see_Churn_gt60 0.02737 0.39130 0 50
hear_Churn_gt60 0.10760 0.66453 0 50
feel_Churn_gt60 0.00629 0.13340 0 14.29
bio_Churn_gt60 0.05736 0.53219 0 50
body_Churn_gt60 0.00200 0.09068 0 22.22
health_Churn_gt60 0.05252 0.51365 0 50
sexual_Churn_gt60 0.00219 0.10205 0 33.33
ingest_Churn_gt60 0.00108 0.05427 0 10
relativ_Churn_gt60 3.60496 6.87296 0 100
motion_Churn_gt60 0.58544 2.07920 0 100
space_Churn_gt60 1.53675 3.33913 0 66.67
time_Churn_gt60 1.42508 3.81987 0 100
work_Churn_gt60 1.71918 4.39740 0 100
achieve_Churn_gt60 0.32263 1.35865 0 100
leisure_Churn_gt60 0.53329 1.71723 0 100
home_Churn_gt60 0.06137 0.64298 0 50
money_Churn_gt60 2.18155 5.19021 0 100
relig_Churn_gt60 0.01918 0.32686 0 50
death_Churn_gt60 0.00231 0.16634 0 50
assent_Churn_gt60 0.03301 0.38858 0 25
nonfl_Churn_gt60 0.36579 1.28654 0 100
filler_Churn_gt60 0.00499 0.15611 0 33.33
Retention_MDBcount60 10.62872 13.97699 0 200
Retention_MDBcount_gt60 14.45388 17.35395 0 166
RETENTION_DM60 1.02347 2.02116 0 24
RETENTION_EM60 8.11318 12.64675 0 149
RETENTION_TELEM60 0.64619 1.29260 0 23
RETENTION_DM_gt60 0.45168 1.10973 0 57
RETENTION_EM_gt60 12.76222 16.46438 0 132
RETENTION_TELEM_gt60 0.20555 0.72194 0 34
Churn60Response_Rate 0.03134 0.11103 0 1
Churn_gt60Response_Rate 0.03859 0.12786 0 1
RETENTION_EMDM60 11.29186 42.22174 0 2793
RETENTION_EMTELEM60 6.53043 25.79364 0 2277
RETENTION_DMTELEM60 1.61831 5.34861 0 240
112
RETENTION_EMDMTELEM60 18.73759 148.54372 0 20700
RETENTION_EMDM_gt60 4.39596 19.65413 0 2580
RETENTION_EMTELEM_gt60 3.78496 21.60338 0 1802
RETENTION_DMTELEM_gt60 0.20184 1.33223 0 96
RETENTION_EMDMTELEM_gt60 2.50482 30.14699 0 5720
Selfcare_dummy 0.57855 0.49379 0 1
Acq_MDBcount 4.52879 4.68952 0 118
Acq_MDB_ResRate 0.07273 0.12752 0 1
Price_range 2.47240 3.84743 0 93.51
Promo_count 0.52966 0.75499 0 7
113
Appendix D: Summary Statistics of Unconverted Winback table

RADIO_TOTAL 56.93436 20.14063 0 106
Unknown_reason 0.19763 0.39822 0 1
Free_ended 0.48543 0.49979 0 1
NonPay 0.00295 0.05422 0 1
Non_Usage 0.00055 0.02354 0 1
Cost_price_fee 0.00004 0.00615 0 1
Competition 0.00001 0.00251 0 1
Acquis_Live_Agent60 0.04245 0.25017 0 10
Acquis_CUWI60 0.02926 0.24368 0 5
Acquis_IVR60 0.00038 0.02174 0 3
Acquis_Live_Agent_gt60 0.38112 0.76000 0 42
Acquis_CUWI_gt60 0.01520 0.19113 0 28
Acquis_IVR_gt60 0.00144 0.04488 0 4
Acquis_ChangeAcctInfo60 0.04287 0.26821 0 6
Acquis_Change_Service60 0.02303 0.16886 0 6
Acquis_Make_Payment60 0.00618 0.08956 0 6
Acquis_Make_Payment_gt60 0.01582 0.39815 0 42
Acquis_ChangeAcctInfo_gt60 0.01511 0.16495 0 6
Acquis_Change_Service_gt60 0.36682 0.61600 0 14
Acq_LiveAg_ChangeInfo60 0.01493 0.12673 0 5
Acq_LiveAg_ChangeSer60 0.02161 0.16151 0 5
Acq_LiveAg_MakePaymt60 0.00590 0.08698 0 6
Acq_CUWI_MakePaymt60 0.00022 0.01485 0 1
Acq_CUWI_ChangeInfo60 0.02788 0.23219 0 5
Acq_IVR_MakePaymt60 0.00006 0.01004 0 2
Acq_CUWI_ChangeSer60 0.00115 0.03485 0 3
Acq_IVR_ChangeSer60 0.00026 0.01702 0 2
Acq_IVR_ChangeInfo60 0.00006 0.00753 0 1
Acq_LiveAg_MakePaymt_gt60 0.01501 0.37600 0 42
Acq_CUWI_ChangeInfo_gt60 0.00958 0.13649 0 6
Acq_CUWI_ChangeSer_gt60 0.00497 0.08068 0 7
Acq_LiveAg_ChangeSer_gt60 0.36064 0.60626 0 14
Acq_CUWI_MakePaymt_gt60 0.00064 0.08505 0 28
Acq_LiveAg_ChangeInfo_gt60 0.00547 0.08443 0 6
Acq_IVR_MakePaymt_gt60 0.00017 0.01992 0 4
Acq_IVR_ChangeInfo_gt60 0.00006 0.00753 0 1
Acq_IVR_ChangeSer_gt60 0.00121 0.03772 0 2
Acquis_MDBcount60 2.85318 2.75432 0 21
Acquis_MDBcount_gt60 2.60986 2.62832 1 99
ACQUSI_EM60 0.69466 1.51378 0 15
114
(Appendix D, continued)
ACQUSI_DM60 0.98578 1.32000 0 18
ACQUSI_TELEM60 0.27657 0.54776 0 5
ACQUSI_EM_gt60 1.11811 2.71587 0 86
ACQUSI_TELEM_gt60 0.30209 0.51520 0 9
ACQUSI_DM_gt60 0.47374 1.01486 0 25
Trial60Response_Rate 0.11261 0.19737 0 1
Trial_gt60Response_Rate 0.05017 0.17689 0 1
ACQUSI_EMDM60 1.07270 3.45291 0 70
ACQUSI_EMTELEM60 0.37446 1.46548 0 36
ACQUSI_DMTELEM60 0.63463 1.55295 0 27
ACQUSI_EMDMTELEM60 0.90031 4.06694 0 192
ACQUSI_EMDM_gt60 0.83986 8.86491 0 1475
ACQUSI_EMTELEM_gt60 0.46697 4.16481 0 472
ACQUSI_DMTELEM_gt60 0.38653 1.75627 0 200
ACQUSI_EMDMTELEM_gt60 1.21315 35.30163 0 11800
funct_trial60 4.56711 11.85362 0 100
pronoun_trial60 0.51657 2.18941 0 42.86
ppron_trial60 0.27892 1.50891 0 33.33
i_trial60 0.06364 0.54170 0 25
we_trial60 0.01650 0.28036 0 20
you_trial60 0.00061 0.06976 0 20
shehe_trial60 0.18654 1.18202 0 33.33
they_trial60 0.01162 0.25691 0 18.18
ipron_trial60 0.23766 1.20464 0 42.86
article_trial60 0.37851 1.81012 0 100
verb_trial60 1.57350 4.89603 0 80
auxverb_trial60 0.63537 2.94733 0 50
past_trial60 0.74965 2.80629 0 66.67
present_trial60 0.67798 3.05342 0 80
future_trial60 0.05628 0.61743 0 50
adverb_trial60 0.14674 1.03739 0 50
preps_trial60 2.18097 5.90824 0 100
conj_trial60 0.32326 1.50243 0 28.57
negate_trial60 0.40833 2.48477 0 100
quant_trial60 0.08055 0.84965 0 100
number_trial60 0.09478 0.95040 0 66.67
swear_trial60 0.00018 0.03323 0 9.09
social_trial60 1.58697 5.21088 0 100
family_trial60 0.02080 0.41449 0 33.33
friend_trial60 0.00318 0.12903 0 12.5
humans_trial60 0.02226 0.35887 0 25
affect_trial60 0.34573 1.91471 0 100
posemo_trial60 0.25636 1.51209 0 66.67
negemo_trial60 0.08912 1.08978 0 100
anx_trial60 0.00229 0.09620 0 16.67
anger_trial60 0.00706 0.25498 0 50
sad_trial60 0.01750 0.36468 0 25
cogmech_trial60 1.83387 5.58648 0 100
115
insight_trial60 0.13704 1.08812 0 50
cause_trial60 0.52982 2.63200 0 100
discrep_trial60 0.17398 1.19407 0 50
tentat_trial60 0.06983 0.93603 0 66.67
certain_trial60 0.16560 0.98826 0 66.67
inhib_trial60 0.03527 0.74586 0 100
incl_trial60 0.32683 1.65959 0 66.67
excl_trial60 0.34685 2.29082 0 50
percept_trial60 0.07187 0.78570 0 50
see_trial60 0.01035 0.23860 0 33.33
hear_trial60 0.05854 0.72050 0 50
feel_trial60 0.00260 0.13887 0 25
bio_trial60 0.04185 0.58422 0 50
body_trial60 0.00105 0.10032 0 25
health_trial60 0.03996 0.56611 0 50
sexual_trial60 0.00028 0.02772 0 5
ingest_trial60 0.00064 0.09602 0 33.33
relativ_trial60 2.20419 6.49653 0 100
motion_trial60 0.58603 2.39381 0 50
space_trial60 0.75236 3.01510 0 100
time_trial60 0.80521 3.54030 0 100
work_trial60 1.00355 3.70819 0 100
achieve_trial60 0.35318 1.68476 0 66.67
leisure_trial60 0.30312 1.67941 0 50
home_trial60 0.02017 0.32862 0 25
money_trial60 1.03904 3.75244 0 66.67
relig_trial60 0.01839 0.42174 0 50
death_trial60 0.00044 0.04145 0 7.69
assent_trial60 0.01577 0.41792 0 66.67
nonfl_trial60 0.11350 0.83477 0 50
filler_trial60 0.00154 0.08401 0 11.11
funct_trial_gt60 2.99081 9.53919 0 75
pronoun_trial_gt60 0.37963 1.94168 0 40
ppron_trial_gt60 0.27696 1.52030 0 40
i_trial_gt60 0.07677 0.64322 0 25
we_trial_gt60 0.01069 0.20995 0 16.67
you_trial_gt60 0.00073 0.06852 0 11.11
shehe_trial_gt60 0.18141 1.17301 0 28.57
they_trial_gt60 0.00737 0.18153 0 18.18
ipron_trial_gt60 0.10266 0.78359 0 33.33
article_trial_gt60 0.31433 1.66462 0 40
verb_trial_gt60 1.15136 4.15946 0 80
auxverb_trial_gt60 0.30418 1.64537 0 50
past_trial_gt60 0.77170 3.17747 0 75
present_trial_gt60 0.30498 1.65138 0 80
future_trial_gt60 0.04021 0.46897 0 33.33
adverb_trial_gt60 0.09438 0.79895 0 66.67
preps_trial_gt60 1.47799 4.82178 0 66.67
116
conj_trial_gt60 0.29691 1.50553 0 28.57
negate_trial_gt60 0.17561 1.17119 0 40
quant_trial_gt60 0.03551 0.45242 0 33.33
number_trial_gt60 0.02099 0.34346 0 30.77
swear_trial_gt60 0.00012 0.02645 0 8.33
social_trial_gt60 1.21889 4.34485 0 100
family_trial_gt60 0.01006 0.27282 0 25
friend_trial_gt60 0.00107 0.07516 0 9.09
humans_trial_gt60 0.01803 0.31925 0 20
affect_trial_gt60 0.21857 1.58888 0 100
posemo_trial_gt60 0.15744 1.10906 0 50
negemo_trial_gt60 0.06077 1.09265 0 100
anx_trial_gt60 0.00120 0.07654 0 20
anger_trial_gt60 0.01089 0.28021 0 33.33
sad_trial_gt60 0.00900 0.27519 0 50
cogmech_trial_gt60 1.19075 4.34785 0 100
insight_trial_gt60 0.10669 0.98146 0 66.67
cause_trial_gt60 0.42225 2.00048 0 75
discrep_trial_gt60 0.09429 0.82056 0 33.33
tentat_trial_gt60 0.03912 0.63904 0 50
certain_trial_gt60 0.04036 0.46159 0 16.67
inhib_trial_gt60 0.02004 0.47893 0 66.67
incl_trial_gt60 0.29854 1.55727 0 33.33
excl_trial_gt60 0.15350 1.08238 0 40
percept_trial_gt60 0.04856 0.52653 0 25
see_trial_gt60 0.01201 0.26050 0 25
hear_trial_gt60 0.03387 0.43108 0 25
feel_trial_gt60 0.00247 0.09867 0 14.29
bio_trial_gt60 0.01516 0.30223 0 33.33
body_trial_gt60 0.00147 0.08913 0 12.5
health_trial_gt60 0.01272 0.27873 0 33.33
sexual_trial_gt60 0.00130 0.08284 0 9.09
ingest_trial_gt60 0.00028 0.03990 0 9.09
relativ_trial_gt60 1.68313 5.74917 0 100
motion_trial_gt60 0.50349 2.29602 0 75
space_trial_gt60 0.64523 2.51705 0 66.67
time_trial_gt60 0.46999 2.13504 0 100
work_trial_gt60 0.32330 1.85462 0 100
achieve_trial_gt60 0.17987 1.15478 0 66.67
leisure_trial_gt60 0.33131 1.64208 0 50
home_trial_gt60 0.08257 0.99113 0 50
money_trial_gt60 0.41276 2.30718 0 100
relig_trial_gt60 0.00290 0.13790 0 16.67
death_trial_gt60 0.00094 0.06925 0 9.09
assent_trial_gt60 0.01486 0.32424 0 25
nonfl_trial_gt60 0.11300 0.86455 0 50
filler_trial_gt60 0.00068 0.04786 0 7.14
117
Appendix E: Summary Statistics of Churned Winback table

RADIO_TOTAL 54.02857 20.42604 0 106
Retention_duration 0.05629 8.11540 -18.207 102.793
Retention_duration2 65.86273 209.98816 0 10566.4
Non_Usage 0.03912 0.19389 0 1
NonPay 0.08403 0.27743 0 1
Unknown_reason 0.03728 0.18944 0 1
Free_ended 0.00484 0.06942 0 1
Prepaid_period_ended 0.00690 0.08279 0 1
Cost_price_fee 0.00336 0.05783 0 1
Competition 0.00033 0.01829 0 1
Service_count 1.82796 1.03039 1 15
MC_Ave_Price -0.32500 3.46000 -11.67 88.32
MC2_Ave_Price 12.07603 60.59619 0 7800.42
Retent_Live_Agent60 0.70786 2.33191 0 46
Retent_CUWI60 0.08103 0.52297 0 22
Retent_IVR60 0.01837 0.21123 0 14
Retent_CUWI_gt60 0.09320 0.60673 0 42
Retent_Live_Agent_gt60 1.19269 3.74798 0 121
Retent_IVR_gt60 0.02783 0.31078 0 34
Retent_Change_Service60 0.27565 0.77956 0 20
Retent_Make_Payment60 0.41256 1.84239 0 44
Retent_ChangeAcctInfo60 0.11907 0.50027 0 14
Retent_ChangeAcctInfo_gt60 0.13328 0.55059 0 16
Retent_Change_Service_gt60 0.27103 0.74619 0 20
Retent_Make_Payment_gt60 0.90943 3.55355 0 121
Retent_LiveAg_ChangeSer60 0.24746 0.71457 0 20
Retent_LiveAg_MakePaymt60 0.39066 1.78778 0 44
Retent_CUWI_ChangeInfo60 0.04675 0.31648 0 9
Retent_CUWI_ChangeSer60 0.02030 0.19017 0 8
Retent_CUWI_MakePaymt60 0.01398 0.15973 0 21
Retent_LiveAg_ChangeInfo60 0.06974 0.33572 0 13
Retent_IVR_ChangeSer60 0.00788 0.09560 0 7
Retent_IVR_MakePaymt60 0.00791 0.14656 0 14
Retent_IVR_ChangeInfo60 0.00257 0.05318 0 3
Retent_CUWI_ChangeInfo_gt60 0.05470 0.35919 0 11
Retent_CUWI_ChangeSer_gt60 0.01677 0.18264 0 12
Retent_LiveAg_MakePaymt_gt60 0.87066 3.47081 0 121
Retent_CUWI_MakePaymt_gt60 0.02174 0.28156 0 42
Retent_LiveAg_ChangeInfo_gt60 0.07389 0.35511 0 12
Retent_LiveAg_ChangeSer_gt60 0.24815 0.68750 0 19
118
(Appendix E, continued)
Retent_IVR_ChangeSer_gt60 0.00611 0.09146 0 13
Retent_IVR_ChangeInfo_gt60 0.00469 0.07312 0 4
Retent_IVR_MakePaymt_gt60 0.01704 0.24962 0 34
funct_Churn60 10.72665 16.99245 0 100
pronoun_Churn60 1.49490 3.48090 0 42.86
ppron_Churn60 1.04274 2.58215 0 38.46
i_Churn60 0.27474 1.00434 0 25
we_Churn60 0.04242 0.35957 0 18.75
you_Churn60 0.00437 0.11128 0 15.38
shehe_Churn60 0.67984 1.97274 0 33.33
they_Churn60 0.04135 0.39193 0 38.46
ipron_Churn60 0.45216 1.44040 0 33.33
article_Churn60 1.24230 2.93237 0 42.86
verb_Churn60 3.52393 6.70964 0 100
auxverb_Churn60 1.48561 3.43225 0 100
past_Churn60 1.72273 4.07754 0 100
present_Churn60 1.42562 3.88863 0 100
future_Churn60 0.14607 0.71567 0 33.33
adverb_Churn60 0.41835 1.41500 0 50
preps_Churn60 4.46227 7.44710 0 66.67
conj_Churn60 0.87713 2.14798 0 37.5
negate_Churn60 0.90914 2.83215 0 50
quant_Churn60 0.35261 1.51971 0 66.67
number_Churn60 0.21532 0.98975 0 33.33
swear_Churn60 0.00042 0.03510 0 8.33
social_Churn60 3.54851 6.49367 0 66.67
family_Churn60 0.08000 0.60570 0 33.33
friend_Churn60 0.00451 0.13030 0 20
humans_Churn60 0.08182 0.63600 0 40
affect_Churn60 0.92167 2.60694 0 100
posemo_Churn60 0.72166 2.12829 0 100
negemo_Churn60 0.19816 1.33526 0 100
anx_Churn60 0.01080 0.16635 0 18.18
anger_Churn60 0.02512 0.54719 0 50
sad_Churn60 0.06178 0.66022 0 44.44
cogmech_Churn60 3.52407 6.53965 0 100
insight_Churn60 0.51787 1.84925 0 100
cause_Churn60 0.74540 2.26924 0 100
discrep_Churn60 0.39021 1.37432 0 50
tentat_Churn60 0.14099 0.87947 0 50
certain_Churn60 0.09940 0.66454 0 50
inhib_Churn60 0.14999 0.91904 0 100
incl_Churn60 0.75068 2.09473 0 66.67
excl_Churn60 0.64710 2.24934 0 50
percept_Churn60 0.33597 1.60150 0 50
see_Churn60 0.02253 0.36208 0 40
hear_Churn60 0.30623 1.54310 0 50
feel_Churn60 0.00660 0.14593 0 14.29
119
bio_Churn60 0.05772 0.59161 0 50
body_Churn60 0.00170 0.07502 0 11.11
health_Churn60 0.05356 0.57754 0 50
sexual_Churn60 0.00185 0.07983 0 9.52
ingest_Churn60 0.00095 0.05284 0 8.33
relativ_Churn60 3.80843 6.89520 0 100
motion_Churn60 0.61419 2.10344 0 50
space_Churn60 1.68765 3.51674 0 66.67
time_Churn60 1.43370 3.42329 0 100
work_Churn60 1.82464 4.29391 0 100
achieve_Churn60 0.27703 1.22665 0 100
leisure_Churn60 0.69477 1.95730 0 100
home_Churn60 0.05633 0.64256 0 50
money_Churn60 2.36575 5.37446 0 100
relig_Churn60 0.01619 0.29595 0 27.27
death_Churn60 0.01222 0.48101 0 80
assent_Churn60 0.01953 0.27790 0 25
nonfl_Churn60 0.31868 1.10667 0 50
filler_Churn60 0.00790 0.19974 0 40
funct_Churn_gt60 8.52940 15.59441 0 100
pronoun_Churn_gt60 1.22941 3.23210 0 60
ppron_Churn_gt60 0.89356 2.48283 0 60
i_Churn_gt60 0.22083 0.93129 0 50
we_Churn_gt60 0.04755 0.39153 0 22.22
you_Churn_gt60 0.00283 0.08698 0 10.42
shehe_Churn_gt60 0.59737 1.90947 0 40
they_Churn_gt60 0.02497 0.29916 0 16.67
ipron_Churn_gt60 0.33584 1.20862 0 33.33
article_Churn_gt60 1.11644 2.86742 0 50
verb_Churn_gt60 2.70549 5.72786 0 100
auxverb_Churn_gt60 1.04153 2.75298 0 100
past_Churn_gt60 1.51827 3.87816 0 100
present_Churn_gt60 0.90768 2.45582 0 66.67
future_Churn_gt60 0.12687 0.66376 0 33.33
adverb_Churn_gt60 0.37875 1.36759 0 50
preps_Churn_gt60 3.78384 7.25993 0 80
conj_Churn_gt60 0.75436 2.08902 0 33.33
negate_Churn_gt60 0.38287 1.35001 0 50
quant_Churn_gt60 0.19015 0.93559 0 42.86
number_Churn_gt60 0.16047 0.90056 0 40
swear_Churn_gt60 0.00024 0.02468 0 5.88
social_Churn_gt60 2.85670 5.98159 0 100
family_Churn_gt60 0.04961 0.45798 0 25
friend_Churn_gt60 0.00585 0.24071 0 66.67
humans_Churn_gt60 0.05833 0.56900 0 50
affect_Churn_gt60 0.88918 2.77697 0 100
posemo_Churn_gt60 0.74040 2.46604 0 100
negemo_Churn_gt60 0.14769 1.07692 0 100
120
anx_Churn_gt60 0.00822 0.13473 0 12.5
anger_Churn_gt60 0.01973 0.39942 0 50
sad_Churn_gt60 0.02528 0.36532 0 50
cogmech_Churn_gt60 2.89314 6.07282 0 100
insight_Churn_gt60 0.57526 2.08068 0 100
cause_Churn_gt60 0.66001 2.27693 0 100
discrep_Churn_gt60 0.28989 1.17915 0 50
tentat_Churn_gt60 0.13534 0.92790 0 100
certain_Churn_gt60 0.09251 0.64695 0 33.33
inhib_Churn_gt60 0.15994 1.08389 0 100
incl_Churn_gt60 0.64156 1.95751 0 40
excl_Churn_gt60 0.29521 1.15265 0 50
percept_Churn_gt60 0.12240 0.74470 0 50
see_Churn_gt60 0.02405 0.37960 0 50
hear_Churn_gt60 0.09281 0.61010 0 33.33
feel_Churn_gt60 0.00515 0.11984 0 11.11
bio_Churn_gt60 0.04936 0.50788 0 50
body_Churn_gt60 0.00176 0.09455 0 22.22
health_Churn_gt60 0.04518 0.49037 0 50
sexual_Churn_gt60 0.00188 0.08184 0 9.09
ingest_Churn_gt60 0.00090 0.04544 0 5.26
relativ_Churn_gt60 3.09286 6.49983 0 100
motion_Churn_gt60 0.50382 1.94456 0 100
space_Churn_gt60 1.32306 3.15720 0 66.67
time_Churn_gt60 1.21546 3.57158 0 100
work_Churn_gt60 1.46355 4.09769 0 100
achieve_Churn_gt60 0.27851 1.29773 0 100
leisure_Churn_gt60 0.45890 1.58367 0 50
home_Churn_gt60 0.05383 0.60281 0 50
money_Churn_gt60 1.84912 4.80259 0 100
relig_Churn_gt60 0.01647 0.31080 0 50
death_Churn_gt60 0.00205 0.12022 0 25
assent_Churn_gt60 0.02868 0.36688 0 25
nonfl_Churn_gt60 0.31426 1.17800 0 50
filler_Churn_gt60 0.00421 0.15326 0 33.33
Retention_MDBcount60 10.74277 14.63668 0 200
Retention_MDBcount_gt60 8.72867 12.55055 0 144
RETENTION_DM60 0.95491 1.99016 0 22
RETENTION_EM60 7.66921 12.64005 0 149
RETENTION_TELEM60 0.59634 1.28366 0 23
RETENTION_DM_gt60 0.40304 1.05420 0 57
RETENTION_EM_gt60 11.41462 16.25480 0 132
RETENTION_TELEM_gt60 0.19522 0.71442 0 34
Churn60Response_Rate 0.04033 0.12548 0 1
Churn_gt60Response_Rate 0.06494 0.17062 0 1
RETENTION_EMDM60 11.38902 44.98458 0 2793
RETENTION_EMTELEM60 6.36603 27.40730 0 2277
RETENTION_DMTELEM60 1.49181 5.39043 0 240
121
RETENTION_EMDMTELEM60 19.28264 173.58672 0 20700
RETENTION_EMDM_gt60 3.92684 17.86916 0 1104
RETENTION_EMTELEM_gt60 3.71668 21.97903 0 1802
RETENTION_DMTELEM_gt60 0.19123 1.30181 0 96
RETENTION_EMDMTELEM_gt60 2.49735 31.27355 0 3696
Selfcare_dummy 0.68393 0.46494 0 1
Acq_MDBcount 3.93977 4.67568 0 118
Acq_MDB_ResRate 0.06584 0.12501 0 1
Price_range 2.71272 3.94645 0 85
Promo_count 0.65806 0.79883 0 7
122
Appendix F: Conversion Variable Importance sorted by Mean Decrease Accuracy
Variable Converte Unconverte MeanDecreaseAccur MeanDecrease

d d acy Gini
Acquis_MDBcount_gt60 28.418 16.316 46.452 273.206
AcqSelfcare_dummy 30.086 21.682 43.778 150.018
Trial_gt60Response_Rate 27.612 -15.926 31.24 112.126
funct_trial60 -18.892 25.258 29.84 523.264
relativ_trial60 -15.608 20.192 24.71 392.866
funct_trial_gt60 16.372 7.464 24.3 151.034
cogmech_trial60 -13.856 18 22.016 224.416
preps_trial60 -15.118 18.482 21.87 435.95
time_trial60 -11.11 16.024 21.476 249.506
relativ_trial_gt60 13.49 8.488 20.998 137.426
preps_trial_gt60 14.964 6.872 20.992 140.456
Acquisition_duration 4.254 9.812 20.888 115.518
RADIO_TOTAL 12.246 17.984 19.68 274.308
ACQUSI_DM60 1.942 8.216 19.138 110.436
cogmech_trial_gt60 13.506 3.262 19.052 99.53
social_trial_gt60 18.896 -3.988 19.006 102.894
Acquisition_duration2 2.146 9.842 18.672 113.072
work_trial60 -15.136 16.802 18.496 133.812
verb_trial_gt60 13.484 3.644 17.118 106.276
social_trial60 -9.84 13.448 16.902 190.348
money_trial60 -12.868 14.554 16.218 115.526
Free_ended 4.088 4.976 16.166 39.446
money_trial_gt60 10.802 7.692 15.864 67.964
achieve_trial60 -12.5 13.938 15.576 87.412
negate_trial60 -5.794 12.73 15.382 70.68
verb_trial60 -8.34 11.612 15.25 151.358
ACQUSI_TELEM_gt60 2.592 7.584 15.108 56.13
motion_trial_gt60 5.446 12.132 14.96 77.872
home_trial_gt60 4.34 15.262 14.93 21.528
cause_trial60 -9.234 11.75 14.426 106.92
ACQUSI_EM60 -1.876 8.264 14.418 95.054
leisure_trial60 -10.258 12.206 14.188 96.378
work_trial_gt60 7.942 8.206 13.732 59.576
past_trial_gt60 10.404 0.912 13.666 73.82
past_trial60 -7.422 10.608 13.658 118.112
ACQUSI_DM_gt60 3.204 5.318 13.486 75.174
space_trial_gt60 7.906 6.032 13.332 87.074
Acquis_Change_Service_ 1.512 12.04 13.074 180.4
gt60
excl_trial60 -5.038 11.988 12.856 53.136
auxverb_trial60 -3.378 8.61 12.662 84.372
Acquis_Live_Agent_gt60 -0.582 10.816 12.63 160.962
Unknown_reason 1.198 5.616 12.596 38.452
123
(Appendix F, continued)
Acq_LiveAg_ChangeSer_ 1.884 10.86 12.54 158.714
gt60
ACQUSI_EM_gt60 3.408 3.74 12.518 127.186
affect_trial60 -9.334 11.264 12.478 64.612
cause_trial_gt60 4.802 6.706 12.438 55.14
ACQUSI_DMTELEM60 0.012 6.354 12.176 71.788
time_trial_gt60 8.878 3.35 12.082 65.22
pronoun_trial60 -6.396 8.944 11.612 83.914
Acquis_ChangeAcctInfo6 8.124 4.406 11.534 17.352
0
number_trial60 -7.768 9.982 11.466 35.85
ACQUSI_DMTELEM_gt 2.838 5 11.442 46.162
60
ACQUSI_EMDM60 -2.94 8.204 11.172 69.732
ACQUSI_TELEM60 -0.41 5.454 11.15 53.134
achieve_trial_gt60 2.658 8.31 10.848 32.266
Acquis_Live_Agent60 7.992 5.08 10.808 21.124
posemo_trial60 -8.162 9.498 10.706 48.284
present_trial_gt60 3.756 8.702 10.69 44.374
motion_trial60 -3.824 7.006 10.556 83.534
certain_trial60 -0.526 5.532 10.438 32.572
Acquis_MDBcount60 3.692 4.73 10.242 161.888
present_trial60 -5.232 8.182 10.052 82.718
space_trial60 -7.328 8.59 9.954 125.952
Trial60Response_Rate 2.728 3.29 9.914 117.33
leisure_trial_gt60 3.106 5.416 9.76 44.224
ACQUSI_EMTELEM60 -1.86 6.604 9.518 47.656
ACQUSI_EMDMTELE -2.372 7.266 9.448 47.27
M60
affect_trial_gt60 3.714 4.708 9.224 33.41
ACQUSI_EMDM_gt60 -2.504 7.52 9.092 54.594
incl_trial60 -5.664 7.228 9.03 67.822
article_trial_gt60 4.336 4.682 8.954 38.67
ipron_trial60 -3.738 6.244 8.95 44.598
pronoun_trial_gt60 3.762 4.264 8.72 34.698
conj_trial_gt60 5.656 2.07 8.686 37.96
incl_trial_gt60 4.424 2.134 8.444 36.238
ppron_trial60 -2.768 5.368 7.912 44.744
Acq_LiveAg_ChangeInfo 5.008 3.708 7.738 7
60
conj_trial60 -4.22 5.804 7.628 60.554
auxverb_trial_gt60 1.958 5.282 7.616 34.398
Acquis_CUWI60 4.61 3.518 7.6 8.964
ACQUSI_EMTELEM_gt -1.094 5.842 7.496 49.548
60
Acq_CUWI_ChangeInfo6 4.28 3.714 7.432 8.512
0
Acq_LiveAg_ChangeSer6 5.9 2.314 7.334 9.266
124
0
Acquis_Change_Service6 5.652 2.6 7.274 9.874
0
ppron_trial_gt60 3.426 2.572 7.266 28.212
posemo_trial_gt60 2.096 3.478 7.204 25.57
article_trial60 -7.006 7.24 7.11 60.012
negate_trial_gt60 0.054 4.962 6.906 23.76
nonfl_trial_gt60 1.548 4.358 6.744 21.718
excl_trial_gt60 -1.638 5.472 5.988 20.48
nonfl_trial60 -8.504 7.882 5.954 33.356
insight_trial_gt60 2.18 2.808 5.822 20.068
ACQUSI_EMDMTELE -1.71 5.224 5.742 29.82
M_gt60
percept_trial60 -1.212 4.59 5.638 16.722
shehe_trial_gt60 2.12 2.152 5.436 19.562
relig_trial60 1.172 3.426 5.328 5.564
tentat_trial60 -1.568 4.526 4.944 18.298
Acquis_CUWI_gt60 -4.218 7.174 4.928 10.824
Acquis_ChangeAcctInfo_ -5.714 6.922 4.926 10.632
gt60
i_trial_gt60 -1.316 4.956 4.734 15.262
hear_trial60 -1.892 4.134 4.654 14.716
shehe_trial60 -1.258 3.206 4.624 27.078
ipron_trial_gt60 -2.534 5.586 4.584 17.716
adverb_trial_gt60 2.42 1.572 4.538 15.882
adverb_trial60 -4.394 4.686 4.384 28.772
negemo_trial_gt60 -1.338 5.558 4.278 12.232
i_trial60 -3.55 4.052 3.936 20.444
discrep_trial_gt60 -1.012 3.66 3.896 15.814
discrep_trial60 -7.008 9.328 3.892 38.386
Acquis_Make_Payment60 3.008 0.8 3.288 3.406
percept_trial_gt60 -1.5 4.518 3.246 9.864
quant_trial_gt60 2.158 0.364 3.06 10.124
Acq_CUWI_ChangeInfo_ -5.758 6.132 3.022 6.874
gt60
sad_trial60 -4.74 5.252 2.922 7.206
Acquis_Make_Payment_g -2.042 4.23 2.884 6.422
t60
Acq_LiveAg_MakePaymt 1.674 1.714 2.732 3.068
60
tentat_trial_gt60 -0.67 3.148 2.724 8.438
hear_trial_gt60 -2.342 4.746 2.66 7.604
humans_trial_gt60 4.322 -2.878 2.606 5.896
future_trial_gt60 -1.584 2.956 2.45 9.298
Acq_LiveAg_MakePaymt -2.024 3.734 2.282 6.428
_gt60
negemo_trial60 -5.108 4.326 1.836 19.358
certain_trial_gt60 -0.486 2.278 1.806 9.626
125
Acq_CUWI_ChangeSer_ -2.07 3.642 1.57 5.394
gt60
inhib_trial_gt60 0.304 1.238 1.462 5.286
insight_trial60 -6.478 4.876 1.336 28.998
Service_Dissatisfaction -2.272 4.062 1.332 3.806
Acq_LiveAg_ChangeInfo -3.392 3.418 1.326 5.408
_gt60
friend_trial_gt60 0.564 0.806 1.202 0.38
number_trial_gt60 -2.206 2.786 0.996 7.036
relig_trial_gt60 1.362 -0.5 0.866 0.908
you_trial_gt60 0.002 0.606 0.624 0.12
death_trial_gt60 -0.052 0.602 0.548 0.146
assent_trial_gt60 -0.338 0.632 0.41 3.19
Acq_CUWI_MakePaymt 0.17 0.224 0.394 0.358
_gt60
anger_trial_gt60 -2.86 3.038 0.348 2.274
family_trial_gt60 -0.994 1.56 0.284 2.92
Acq_CUWI_ChangeSer6 -0.404 0.696 0.274 0.838
0
Acquis_IVR_gt60 -2.998 3.032 0.266 2.026
Acq_CUWI_MakePaymt -0.17 0.284 0.228 0.152
60
Acq_IVR_ChangeSer60 0 0.22 0.22 0.152
sad_trial_gt60 -0.782 0.964 0.15 2.444
Acq_IVR_MakePaymt_gt 0.116 0 0.116 0.058
60
swear_trial60 -0.054 0.348 0.104 0.34
Acquis_IVR60 -0.348 0.252 0.01 0.194
Competition 0 0 0 0.002
Acq_IVR_MakePaymt60 0 0 0 0.016
Acq_IVR_ChangeInfo60 0 0 0 0.038
Acq_IVR_ChangeInfo_gt 0 0 0 0.02
60
swear_trial_gt60 0 0 0 0.044
ingest_trial_gt60 0 0 0 0.032
sexual_trial_gt60 -0.372 0.266 -0.012 0.358
bio_trial_gt60 -3.45 3.238 -0.11 4.472
ingest_trial60 -0.43 -0.002 -0.112 0.368
health_trial_gt60 -4.046 3.944 -0.158 4.05
feel_trial_gt60 -1.164 0.712 -0.222 0.884
body_trial_gt60 -0.676 0.242 -0.226 0.33
Cost_price_fee -0.084 -0.316 -0.248 0.238
we_trial_gt60 -0.752 0.388 -0.25 3.56
Product_Dissatisfaction -0.27 0 -0.27 0.118
filler_trial_gt60 -0.186 -0.228 -0.338 0.28
NonPay -1.33 0.668 -0.374 3.482
Acq_IVR_ChangeSer_gt6 -3.098 2.598 -0.416 1.836
0
126
see_trial_gt60 -2.47 1.864 -0.44 3.258
anx_trial_gt60 -0.912 0.088 -0.722 0.612
Non_Usage -0.936 -0.14 -0.748 0.868
Uncontrollable_reason -3.318 1.936 -0.764 4.758
they_trial_gt60 -3.794 2.736 -0.798 2.56
sexual_trial60 -1.098 -0.084 -0.99 0.144
death_trial60 -1.196 0.116 -1.034 0.406
health_trial60 -2.968 1.858 -1.248 10.398
bio_trial60 -2.17 1.308 -1.296 10.698
body_trial60 -2.008 0.892 -1.656 0.516
you_trial60 -1.938 -0.284 -1.832 0.272
future_trial60 -5.172 3.518 -2.184 16.904
quant_trial60 -7.35 3.74 -2.534 19.58
feel_trial60 -3.31 1.17 -2.79 1.314
home_trial60 -5.31 2.16 -2.868 6.682
see_trial60 -4.82 3.072 -2.942 4.43
they_trial60 -5.96 5.116 -3.116 6.438
inhib_trial60 -11.148 10.88 -3.442 12.69
filler_trial60 -4.318 1.542 -3.704 1.094
friend_trial60 -3.912 0.432 -3.72 1.62
assent_trial60 -7.334 3.058 -4.452 5.434
anx_trial60 -5.456 2.09 -4.604 1.932
humans_trial60 -10.654 5.426 -4.706 10.33
we_trial60 -6.59 2.014 -4.858 6.63
family_trial60 -7.68 3.91 -5.04 8.084
anger_trial60 -5.674 1.194 -5.05 2.814
127
Appendix G: Churn Variable Importance sorted by Mean Decrease Accuracy
Variable Churn noChurn MeanDecreaseAccuracy MeanDecr

easeGini
Retention_MDBcount_gt60 37.1267 63.56 71.19 1951.73
Churn_gt60Response_Rate 55.2233 39.5033 70.7267 710.733
Promo_count 29.85 32.0233 46.2667 488.943
Selfcare_dummy -9.51667 29.6567 34.06 311.1
AcqSelfcare_dummy -11.1233 26.1833 31.1233 245.307
Churn60Response_Rate 13.11 17.05 22.8267 196.187
MC2_Ave_Price -2.42 17.7067 21.7567 317.117
Retention_MDBcount60 7.40667 13.3367 21.1333 426.227
Acq_MDBcount 4.73 8.17 18.5633 339.493
MC_Ave_Price 10.87 9.78 18.42 299.207
Retention_duration2 10.03 2.91333 16.5767 368.273
Price_range 8.1 14.3367 15.6167 212.71
RETENTION_EM_gt60 11.34 -2.03333 14.63 361.027
Acquisition_duration 10.8233 -0.76333 13.4733 191.623
RETENTION_EM60 9.39 -1.95 12.7067 309.43
Service_count 6.04667 9.53667 12.2667 159.703
funct_Churn60 6.53 3.44333 12.1167 130.507
Retention_duration 7.83667 0.12 11.6233 316.927
RETENTION_EMDM60 6.17333 3.15667 10.71 154.107
RETENTION_EMDMTELE 4.80333 4.02 10.3233 93.5567
M60
RETENTION_DM60 6.33667 0.92 9.91 154.983
Acq_MDB_ResRate 6.93 -0.04667 9.8 210.89
RETENTION_DMTELEM6 5.23333 2.52 9.66 115.063
0
verb_Churn60 4.94 3.54667 9.62 116.923
RETENTION_DM_gt60 7.82 -1.7 9.51333 117.893
funct_Churn_gt60 3.02333 4.86 9.42667 77.5367
RETENTION_EMTELEM6 5.36333 1.77333 9.33 136.297
0
preps_Churn60 5.69667 2.90333 9.27333 125.097
relativ_Churn60 5.29333 2.76333 9.17 122.473
cogmech_Churn60 4.93667 4.16333 9.01667 119.68
RETENTION_TELEM60 6.58667 -0.95 8.85333 126.473
social_Churn60 4.85667 2.03667 8.16667 116.863
preps_Churn_gt60 4.36333 1.92667 7.77667 79.26
verb_Churn_gt60 1.32667 5.34 7.34667 71.26
Retent_Make_Payment_gt60 2.94667 3.50333 7.04 43.5333
relativ_Churn_gt60 2.33333 4.43 7.00667 76.3667
cogmech_Churn_gt60 2.15333 4.33667 6.85333 73.6
pronoun_Churn_gt60 1.69333 3.53667 6.82 52.5233
Retent_LiveAg_MakePaymt 2.14667 4.20667 6.80667 42.97
_gt60
128
(Appendix G, continued)
Retent_Live_Agent_gt60 2.95667 3.26 6.73667 54.8733
work_Churn60 5.55333 -0.93 6.54 104.437
past_Churn60 4.52333 1 6.44333 99.0867
social_Churn_gt60 1.72333 3.71667 6.41667 74.55
pronoun_Churn60 3.17333 2.53333 6.37667 75
space_Churn60 3.99333 1.37667 6.36667 100.47
auxverb_Churn_gt60 1.53 3.04667 6.31333 57.6533
money_Churn60 5.19333 -0.51667 6.23 111.27
RETENTION_TELEM_gt6 5.01 -0.97333 6.13667 76.4467
0
ppron_Churn_gt60 1.82333 3.75333 6.12333 49.54
present_Churn60 2.45333 3.14667 5.87667 84.74
past_Churn_gt60 0.66 4.02667 5.81 65.61
RETENTION_EMDM_gt60 5.12667 -1.22 5.63667 125.353
Retent_Make_Payment60 0.913333 3.49333 5.51 29.8433
Retent_LiveAg_MakePaymt 0.666667 3.47333 5.49333 29.67
60
auxverb_Churn60 3.48333 0.82 5.46 85.8667
conj_Churn60 2.64333 0.59 5.32333 75.9533
Retent_Live_Agent60 1.39 2.72667 5.25333 40.5267
affect_Churn60 2.91 2.60667 5.23667 77.8367
RETENTION_EMTELEM_ 4.1 0.07 5.16 93.2667
gt60
time_Churn60 3.56667 1.11 5.04667 93.18
ppron_Churn60 1.96333 1.43667 4.92667 69.6633
space_Churn_gt60 2.09667 2.1 4.92667 67.3133
work_Churn_gt60 1.21333 2.99 4.88333 65.3967
conj_Churn_gt60 1.43333 2.23667 4.71667 53.0967
Retent_Change_Service60 0.52 2.97 4.59 27.2833
money_Churn_gt60 2.35 1.19667 4.56333 67.4033
time_Churn_gt60 1.33 2.19333 4.53667 63.0567
shehe_Churn_gt60 2.62333 1.45333 4.44667 42.9667
incl_Churn_gt60 1.45 1.83 4.44 49.8067
negate_Churn60 2.37667 1.47 4.38333 72.8867
present_Churn_gt60 2.51667 0.873333 4.34333 55.8333
cause_Churn_gt60 0.236667 2.72 4.31333 52.12
affect_Churn_gt60 1.57667 1.40333 4.21333 53.3033
negate_Churn_gt60 1.38667 1.78 4.04667 43.0033
shehe_Churn60 2.31667 1.34 3.94667 56.5433
RETENTION_EMDMTELE 2.61 1.90667 3.85 41.36
M_gt60
posemo_Churn60 2.30333 0.853333 3.82 70.0433
Retent_LiveAg_ChangeSer6 0.363333 2.96 3.77667 25.7933
0
leisure_Churn60 3.22333 -0.46 3.74333 71.7533
cause_Churn60 3.30667 -0.64 3.58333 71.3367
article_Churn_gt60 1.52333 0.986667 3.58333 61.0433
RETENTION_DMTELEM_ 3.24667 -0.32333 3.54 52.1433
129
gt60
Retent_Change_Service_gt6 2.15667 1.14 3.42333 29.2467
0
discrep_Churn60 0.843333 2.58333 3.41333 55.5767
excl_Churn60 2.22 0.313333 3.34 64.4767
ipron_Churn60 2.11 0.573333 3.29667 53.4467
NonPay 1.02667 1.73667 3.25 13.86
article_Churn60 3.38667 -0.37 3.22333 82.93
incl_Churn60 1.21 1.52333 3.22333 70.4433
insight_Churn_gt60 2.82 -0.09667 3.18333 51.1867
RADIO_TOTAL 1.97667 2.46 3.13667 369.73
excl_Churn_gt60 1.45667 1.48333 3.11667 37.7533
Retent_LiveAg_ChangeSer_ 1.29 0.93 3.05333 28.1633
gt60
posemo_Churn_gt60 0.35 2.01667 3.02667 50.7567
ipron_Churn_gt60 -0.83 2.99333 2.89667 37.93
leisure_Churn_gt60 1.04 1.15 2.83333 45.6067
discrep_Churn_gt60 1.68667 0.276667 2.79 38.3033
percept_Churn60 0.336667 2.06 2.63333 46.44
adverb_Churn_gt60 2.54667 -0.12333 2.63333 42.9933
motion_Churn_gt60 0.73 1.12333 2.51 47.0233
motion_Churn60 1.80333 -0.51 2.31333 66.2533
hear_Churn60 0.363333 1.84333 2.20333 42.6967
nonfl_Churn_gt60 1.98333 -0.81333 1.96333 44.6333
Uncontrollable_reason 0.403333 0.5 1.86333 13.07
achieve_Churn_gt60 0.0266667 1.65333 1.83 38.39
Retent_ChangeAcctInfo_gt6 0.0433333 1.33333 1.72667 19.6567
0
Retent_LiveAg_ChangeInfo 1.26333 0.276667 1.68 13.8467
60
percept_Churn_gt60 -0.523333 2.00667 1.57667 23.2467
Unknown_reason 1.33333 0.326667 1.49 10.6867
insight_Churn60 1.79333 -0.56 1.32667 62.1967
negemo_Churn_gt60 0.84 0.456667 1.28333 24.91
nonfl_Churn60 1.63333 -0.91333 1.24667 54.81
adverb_Churn60 0.606667 0.32 1.23333 55.4667
Retent_ChangeAcctInfo60 1.47 -0.48667 1.10333 17.9567
i_Churn_gt60 0.47 0.513333 1.03333 33.6967
quant_Churn60 3.77667 -3.11 0.836667 52.6667
future_Churn_gt60 0.943333 -0.54 0.746667 26.82
Retent_LiveAg_ChangeInfo 1.65 -1.03 0.693333 15.82
_gt60
hear_Churn_gt60 -0.84 1.31333 0.656667 20.4667
Retent_CUWI60 -0.243333 0.676667 0.603333 13.8833
tentat_Churn_gt60 1.22667 -0.65333 0.566667 25.32
achieve_Churn60 1.46333 -1.27333 0.433333 47.2
negemo_Churn60 1.04667 -0.54333 0.416667 36.22
Retent_IVR_ChangeInfo60 -0.25 0.483333 0.3 1.33667
130
Retent_CUWI_gt60 -0.393333 0.456667 0.283333 14.1733
Non_Usage 0.73 -0.59667 0.16 10.2267
quant_Churn_gt60 1.33333 -1.47333 0.103333 33.1233
Retent_CUWI_MakePaymt6 0.39 -0.43 -0.0533333 5.04667
0
Retent_IVR_gt60 0.54 -0.52 -0.176667 7.24667
number_Churn_gt60 -0.786667 0.48 -0.206667 29.3867
future_Churn60 3.34333 -3.52 -0.216667 35.32
ingest_Churn_gt60 0.6 -0.84 -0.296667 0.586667
Service_Dissatisfaction 0.556667 -0.90667 -0.313333 7.73
certain_Churn_gt60 -0.496667 -0.06 -0.386667 22.3267
Retent_CUWI_ChangeInfo_ -0.19 - -0.43 11.0367
gt60 0.216667
Retent_CUWI_ChangeSer60 -0.306667 -0.16667 -0.436667 7.16667
Free_ended -1.58333 0.563333 -0.483333 2.95667
we_Churn_gt60 2.41333 -2.85667 -0.486667 15.64
bio_Churn_gt60 0.57 -1.1 -0.486667 12.7867
Retent_IVR_ChangeSer60 -0.35 -0.49 -0.513333 3.29333
Retent_IVR_MakePaymt_gt 0.343333 -1.1 -0.52 3.76
60
swear_Churn_gt60 -0.473333 -0.33333 -0.58 0.17
sad_Churn60 0.546667 -1.27667 -0.61 16.5667
Retent_IVR_ChangeSer_gt6 0.173333 -0.66667 -0.646667 3.17667
0
swear_Churn60 -0.333333 -0.33333 -0.666667 0.19
Prepaid_period_ended 1.09333 -1.67667 -0.683333 4.74667
inhib_Churn_gt60 1.65 -1.91333 -0.686667 28.82
Retent_CUWI_ChangeInfo6 -0.01 -0.81667 -0.69 10.3367
0
death_Churn_gt60 -0.096667 -0.68 -0.696667 0.69
Competition -0.333333 -0.6 -0.756667 0.31
i_Churn60 1.65333 -1.99 -0.783333 47.5967
Retent_IVR_ChangeInfo_gt -0.19 -0.77667 -0.793333 2.45667
60
number_Churn60 0.843333 -2.12 -0.866667 44.2733
you_Churn_gt60 -0.9 -0.11 -0.88 1.24333
Retent_CUWI_MakePaymt_ 0.476667 -1.59667 -0.913333 5.86
gt60
Cost_price_fee 0.466667 -1.33 -0.916667 2.67333
sexual_Churn60 -0.016667 -0.96667 -0.98 1.04333
health_Churn60 -1.05667 -0.22 -1.02667 14.44
Retent_IVR60 -0.413333 -0.86 -1.04667 5.11333
sexual_Churn_gt60 -0.303333 -1.05 -1.09333 0.73
see_Churn_gt60 0.97 -2.00333 -1.1 9.34
tentat_Churn60 1.68667 -2.87 -1.13333 31.9367
bio_Churn60 -1.59333 0.063333 -1.15667 16.1067
family_Churn60 -0.65 -0.72333 -1.18333 21.05
feel_Churn_gt60 -0.903333 -0.68 -1.18667 2.77
131
body_Churn60 -0.596667 -0.85667 -1.25667 0.736667
health_Churn_gt60 1.22 -2.35667 -1.25667 11.79
relig_Churn60 -1.06 -0.72333 -1.26667 5.51667
relig_Churn_gt60 0.783333 -2.15333 -1.28 5.61
ingest_Churn60 0.106667 -1.78 -1.28333 0.52
friend_Churn_gt60 -0.2 -1.41333 -1.28333 2.08
body_Churn_gt60 -1.42333 -0.73667 -1.34 0.716667
inhib_Churn60 -0.063333 -1.4 -1.36667 33.79
we_Churn60 -1.09333 -0.66 -1.44333 16.49
certain_Churn60 0.0433333 -1.72667 -1.58333 26.4467
death_Churn60 -0.17 -1.81667 -1.63667 2.58667
Product_Dissatisfaction -0.046667 -1.94667 -1.64333 2.83333
they_Churn_gt60 -0.996667 -1.14333 -1.79333 9.63333
you_Churn60 -0.533333 -1.64 -1.84333 2.39667
filler_Churn_gt60 0.126667 -1.90333 -1.87667 2.47
home_Churn_gt60 -0.586667 -1.64333 -1.91333 12.3633
home_Churn60 0.03 -2.50333 -2.13 17.8333
assent_Churn_gt60 1.44 -3.98667 -2.13 9.5
Retent_IVR_MakePaymt60 0.426667 -2.80667 -2.14333 1.64667
humans_Churn60 0.643333 -3.35 -2.2 20.63
Retent_CUWI_ChangeSer_g -2.44333 -0.65 -2.25333 6.16333
t60
anx_Churn_gt60 -1.35333 -1.67 -2.27 4.68667
humans_Churn_gt60 -0.466667 -2.21 -2.36667 14.7667
sad_Churn_gt60 -1.03667 -1.91 -2.52 7.79333
family_Churn_gt60 -0.143333 -2.77 -2.54 13.3533
friend_Churn60 -1.47 -2.01667 -2.66 2.02333
feel_Churn60 0.636667 -3.49667 -2.66333 3.80667
anger_Churn60 0.93 -4.11667 -2.73 8
anger_Churn_gt60 -1.63 -2.10333 -2.99333 7.12333
filler_Churn60 1.1 -3.83 -3.01 4.23667
see_Churn60 -0.97 -2.65333 -3.09 9.6
assent_Churn60 -0.486667 -2.97667 -3.13667 9.48
they_Churn60 -1.08333 -2.75667 -3.14 13.86
anx_Churn60 -1.36 -3.15 -3.58667 6.48
132
Appendix H: Unconverted Winback Variable Importance sorted by Mean Decrease

Accuracy
Variable NonWinba Winbac MeanDecreaseAccura MeanDecrease

ck k cy Gini
Trial_gt60Response_Rate 24.9 13.38 26.74 37.13
Acquis_MDBcount_gt60 18.72 15.12 19.79 46.12
ACQUSI_DM60 15.55 1.89 15.85 22.74
ACQUSI_EMDM60 15.19 -7.97 15.05 14.51
ACQUSI_EM60 15.01 -4.77 14.9 21.55
AcqSelfcare_dummy 14.56 8.23 14.7 13.84
Free_ended 12.46 1.01 12.65 7.85
Trial60Response_Rate 12.64 -1.8 12.65 27.34
ACQUSI_EMTELEM60 12.79 -4.85 12.65 9.42
ACQUSI_DMTELEM60 11.29 -2.37 11.3 15.15
cogmech_trial60 11.24 -6.45 11.18 17.07
ACQUSI_EM_gt60 11.06 -0.67 11.03 26.06
Unknown_reason 10.68 -3.34 10.66 7.67
funct_trial60 10.62 -9.03 10.63 21.75
ACQUSI_DMTELEM_gt 9.76 -1.47 9.73 9.41
60
relativ_trial60 9.01 -5.03 9.06 19.23
ACQUSI_EMDMTELEM 8.9 -4.17 8.87 9.12
60
cogmech_trial_gt60 8.53 -8.95 8.46 14.41
ACQUSI_EMDMTELEM 8.45 -2.83 8.34 5.9
_gt60
auxverb_trial60 7.73 -2.47 7.72 9.43
social_trial60 7.77 -5.99 7.71 15.97
funct_trial_gt60 7.5 -7.33 7.49 17.71
pronoun_trial60 7.37 -5.7 7.35 10.09
verb_trial60 6.78 -5.31 6.79 16.62
ACQUSI_TELEM60 6.44 2.4 6.73 10.8
Acquis_Change_Service_ 6.57 -4.1 6.59 9.46
gt60
pronoun_trial_gt60 6.1 -4.97 6.11 9.89
social_trial_gt60 6.14 -7.47 6.1 16.42
Acquis_Live_Agent_gt60 5.6 -4.14 5.61 9.56
home_trial_gt60 5.55 -0.59 5.56 1.77
Acq_LiveAg_ChangeInfo 5.26 1.03 5.3 2.03
_gt60
affect_trial60 5.23 -3.48 5.22 8.85
verb_trial_gt60 5.19 -5.34 5.17 13.37
money_trial60 4.93 1.38 5.07 15.33
ACQUSI_EMDM_gt60 4.94 -3.25 4.84 10.62
Acquis_Make_Payment_g 4.74 1.5 4.81 2.13
t60
133
(Appendix H, continued)
Acq_LiveAg_ChangeSer_ 4.73 -2.14 4.74 10
gt60
ACQUSI_EMTELEM_gt 4.71 -0.46 4.72 11.51
60
relativ_trial_gt60 4.66 -4.21 4.64 17.96
preps_trial60 4.52 -3.98 4.51 20.28
present_trial60 4.46 -4.53 4.39 11.04
NonPay 4.53 -1.4 4.38 1.78
Acq_CUWI_ChangeInfo_ 4.39 -2.05 4.34 1.92
gt60
past_trial60 4.22 1.78 4.3 14.94
ppron_trial_gt60 4.29 -4.5 4.26 8.09
conj_trial_gt60 4.24 -2.35 4.23 9.26
negate_trial60 4.13 -1.42 4.13 6.92
preps_trial_gt60 3.99 -5.24 3.98 16.4
ACQUSI_DM_gt60 3.79 0.07 3.89 15.99
Acq_LiveAg_MakePaymt 3.76 -0.39 3.7 1.71
_gt60
percept_trial60 3.67 0.33 3.7 4.06
hear_trial60 3.64 2.95 3.7 3.31
ipron_trial60 3.35 -1.1 3.33 7.34
motion_trial_gt60 3.28 -0.57 3.29 11.65
Acquis_ChangeAcctInfo_ 3.17 -0.56 3.18 3.13
gt60
past_trial_gt60 3.1 -5.98 3.06 12.67
ACQUSI_TELEM_gt60 2.93 -0.42 2.98 11.09
excl_trial60 2.92 -1 2.89 6.51
affect_trial_gt60 2.78 0.17 2.8 7.58
Acquis_CUWI_gt60 2.81 -0.89 2.78 2.42
time_trial60 2.63 0.53 2.65 13.74
percept_trial_gt60 2.51 0.11 2.55 3.32
Acquis_MDBcount60 2.45 1.3 2.49 36.87
cause_trial_gt60 2.45 1.45 2.47 14.15
leisure_trial_gt60 2.38 4.08 2.44 13.12
ppron_trial60 2.39 -3.11 2.38 8.17
Acq_LiveAg_ChangeSer6 2.3 1.32 2.33 2.81
0
shehe_trial60 2.28 -1.03 2.28 6.34
work_trial60 2.23 -0.41 2.25 14.22
space_trial60 2.22 -1.14 2.22 14.96
Acquis_Live_Agent60 2.15 1.97 2.2 3.32
space_trial_gt60 2.15 -1.04 2.15 13.69
incl_trial_gt60 2.13 -1.7 2.12 9.51
shehe_trial_gt60 2.01 -0.63 2.02 6.84
money_trial_gt60 1.96 -1.12 1.95 9.29
article_trial_gt60 1.86 0.11 1.87 9.83
134
discrep_trial_gt60 1.86 -2.46 1.81 4.33
negate_trial_gt60 1.79 -0.77 1.79 7.09
RADIO_TOTAL 1.45 1.99 1.77 63.06
Acquis_IVR_gt60 1.85 -1.74 1.65 0.55
present_trial_gt60 1.46 0.51 1.47 8.99
posemo_trial_gt60 1.36 1.1 1.43 6.69
auxverb_trial_gt60 1.41 -0.48 1.42 9.21
incl_trial60 1.38 -0.3 1.39 10.4
negemo_trial60 1.35 -0.66 1.35 4.79
Acq_IVR_ChangeSer_gt6 1.19 0 1.18 0.62
0
i_trial60 1.04 1.77 1.09 6.81
posemo_trial60 1.03 -0.06 1.04 7.25
sexual_trial_gt60 1 0 1 0.01
article_trial60 0.64 2.8 0.71 10.4
tentat_trial60 0.54 -0.04 0.54 4.06
health_trial60 0.46 1.47 0.5 2.36
family_trial60 0.45 1.29 0.48 2.21
future_trial_gt60 0.46 1.76 0.48 3.21
bio_trial_gt60 0.4 -1.04 0.38 2.02
cause_trial60 0.2 3.22 0.33 12.93
motion_trial60 0.22 3.57 0.3 13.2
they_trial60 0.11 1.57 0.16 2.28
Acquis_ChangeAcctInfo6 -0.03 2.12 0.03 3.82
0
Non_Usage 0 0 0 0.02
Product_Dissatisfaction 0 0 0 0
Cost_price_fee 0 0 0 0
Competition 0 0 0 0
Acq_IVR_MakePaymt60 0 0 0 0
Acq_IVR_ChangeInfo60 0 0 0 0
Acq_CUWI_MakePaymt_ 0 0 0 0.02
gt60
Acq_IVR_MakePaymt_gt 0 0 0 0.01
60
Acq_IVR_ChangeInfo_gt 0 0 0 0.01
60
swear_trial60 0 0 0 0
sexual_trial60 0 0 0 0
you_trial_gt60 0 0 0 0
swear_trial_gt60 0 0 0 0
friend_trial_gt60 0 0 0 0.01
ingest_trial_gt60 0 0 0 0
conj_trial60 -0.04 -0.19 -0.05 10.46
i_trial_gt60 -0.14 2.18 -0.07 5.82
excl_trial_gt60 -0.26 1.92 -0.12 6.29
Uncontrollable_reason -0.75 3.55 -0.18 1.79
insight_trial_gt60 -0.38 0.12 -0.38 4.57
135
Service_Dissatisfaction -0.45 0 -0.45 0.33
Acquis_CUWI60 -0.49 1.39 -0.47 2.99
family_trial_gt60 -0.48 0.4 -0.47 2.25
time_trial_gt60 -0.84 2.54 -0.75 13.55
anx_trial_gt60 -0.82 1.42 -0.76 0.53
discrep_trial60 -0.78 0.11 -0.78 6.76
achieve_trial_gt60 -1.01 2.86 -0.83 7.88
adverb_trial_gt60 -0.9 2.53 -0.84 4.94
Acq_CUWI_MakePaymt6 -0.91 -1 -1.03 0.44
0
Acquis_Make_Payment60 -1.1 0.28 -1.09 1.83
filler_trial_gt60 -1.13 1 -1.09 0.91
Acq_CUWI_ChangeSer_g -1.06 -1.05 -1.14 1.01
t60
bio_trial60 -1.27 1.51 -1.22 2.45
ipron_trial_gt60 -1.34 2.16 -1.26 6.14
number_trial_gt60 -1.34 2.84 -1.26 4.21
inhib_trial_gt60 -1.46 2.3 -1.37 4.18
filler_trial60 -1.53 0.54 -1.49 0.74
see_trial60 -1.74 0.69 -1.73 1.2
Acquis_IVR60 -1.98 0 -1.98 0.23
health_trial_gt60 -1.95 -0.68 -2 1.94
Acq_CUWI_ChangeInfo6 -2.28 1.54 -2.21 2.64
0
ingest_trial60 -2.44 0 -2.44 0.52
future_trial60 -2.52 1.34 -2.49 4.2
body_trial60 -2.81 0 -2.81 0.82
Acq_IVR_ChangeSer60 -2.85 0 -2.85 0.29
Acq_LiveAg_MakePaymt -3.13 2.14 -2.92 1.62
60
feel_trial60 -2.94 0 -2.94 1.02
see_trial_gt60 -2.98 0.69 -2.95 1.24
Acquis_Change_Service6 -3.38 3.16 -3.04 2.67
0
nonfl_trial_gt60 -3.27 2.72 -3.13 6.76
Acq_LiveAg_ChangeInfo -3.36 2.7 -3.18 2.35
60
tentat_trial_gt60 -3.55 2.46 -3.38 4.51
feel_trial_gt60 -3.21 0.01 -3.4 1.19
hear_trial_gt60 -3.63 2.84 -3.41 3.09
Acq_CUWI_ChangeSer60 -3.52 0 -3.52 0.27
adverb_trial60 -3.72 2.97 -3.57 6.24
negemo_trial_gt60 -3.84 1.91 -3.76 4.74
relig_trial60 -3.96 1.01 -3.89 1.31
you_trial60 -4.15 0 -4.15 0.38
friend_trial60 -4.78 1.07 -4.61 0.79
relig_trial_gt60 -4.71 0 -4.68 0.95
assent_trial_gt60 -5.01 2.25 -4.78 1.07
136
they_trial_gt60 -5.16 4.18 -4.87 2.59
anger_trial_gt60 -5.19 0.78 -5.07 1.1
death_trial60 -5.35 0 -5.35 0.81
achieve_trial60 -5.59 3.59 -5.44 11.52
death_trial_gt60 -6.09 2.73 -5.48 1.25
body_trial_gt60 -6.05 0 -6.05 0.55
we_trial_gt60 -6.27 1.02 -6.19 1.3
sad_trial60 -6.6 1.26 -6.38 2.57
work_trial_gt60 -6.77 5.15 -6.4 9.58
anx_trial60 -6.57 0.56 -6.49 1.64
leisure_trial60 -6.79 2.99 -6.58 9.88
insight_trial60 -8.01 4.52 -7.62 6.04
anger_trial60 -8.29 2.15 -7.99 1.68
nonfl_trial60 -8.33 1.71 -8.15 7.33
assent_trial60 -8.44 1.41 -8.26 1.43
sad_trial_gt60 -8.55 1.67 -8.33 1.77
quant_trial_gt60 -9.11 3.95 -8.72 3.38
humans_trial60 -9 1.98 -8.96 3.03
inhib_trial60 -9.66 2.33 -9.46 2.73
humans_trial_gt60 -10.1 2.69 -9.85 1.88
certain_trial_gt60 -10.36 4.04 -10.04 3.65
we_trial60 -10.5 0.22 -10.52 3.28
certain_trial60 -11.35 4.34 -10.94 7.45
home_trial60 -12.39 5.02 -11.45 4.18
number_trial60 -12.05 4.26 -11.66 6.58
quant_trial60 -14.26 2.09 -14.11 5.84
137
Appendix I: Churned Winback Variable Importance sorted by Mean Decrease Accuracy
Variable NonWinback Winback MeanDecrease MeanDecrease

Accuracy Gini
Selfcare_dummy 58 1.53 71.87 914.32
Promo_count 33.18 33.92 42.89 2832.13
Churn_gt60Response_Rate 17.65 10.42 30.44 732.97
AcqSelfcare_dummy -3.49 22.76 28.63 1440.8
RETENTION_EMDMTEL 17.33 -3.87 21.89 293.91
EM60
Churn60Response_Rate 9.31 8.25 20.49 438.33
MC_Ave_Price 4.81 12 20.4 1005.14
RETENTION_EMDM60 14.48 -0.72 19.33 467.44
RETENTION_EMTELEM6 13.1 0.02 19.1 417.78
0
MC2_Ave_Price 10.29 8.61 19.08 950.37
funct_Churn60 15.72 -4.02 18.92 437.79
RETENTION_DMTELEM6 13.4 -2.03 17.83 356.39
0
cogmech_Churn60 12.96 -1.21 17.14 373.01
Price_range 10.09 9.71 16.2 588.96
RETENTION_EM60 4.07 4.15 15.41 907.53
Retention_duration 3.81 4.16 15.05 906.37
verb_Churn60 12.51 -4.37 14.57 374.66
preps_Churn60 10.8 -1.07 14.21 404.64
Retention_MDBcount_gt60 0.55 9.3 14 1446.5
RETENTION_DM60 5.79 2.22 13.94 461.77
relativ_Churn60 11.75 -3.97 13.8 408.43
social_Churn60 11.48 -3.22 13.78 383.44
relativ_Churn_gt60 10.42 -1.03 13.48 253.67
Service_count 5.42 11.11 13.14 488.73
verb_Churn_gt60 11.8 -4.03 13.08 220.93
RETENTION_EM_gt60 1.12 6.25 12.36 1024.47
work_Churn60 9.09 -0.72 12.19 324.31
cogmech_Churn_gt60 10.4 -2.55 11.8 237.74
funct_Churn_gt60 10.31 -5.17 11.74 268.25
RETENTION_EMTELEM_ 8.9 -0.28 11.29 285.4
gt60
RETENTION_EMDMTEL 11.21 -4.4 10.37 143.36
EM_gt60
affect_Churn60 9.2 -3.24 10.29 228.01
Retent_Live_Agent_gt60 8.02 -1.2 9.97 186.93
social_Churn_gt60 8.57 -2.45 9.93 238.33
Retent_Make_Payment_gt6 8.88 -3.26 9.59 134.67
0
preps_Churn_gt60 6.54 -1.23 9.39 252.18
138
(Appendix I, continued)
past_Churn_gt60 6.63 -0.2 9.1 203.18
RETENTION_TELEM60 3.89 1.98 9.08 382.32
conj_Churn60 8.16 -3.32 9.05 231.02
auxverb_Churn_gt60 7.3 -0.94 8.93 174.53
pronoun_Churn60 8.32 -5.28 8.85 231.92
pronoun_Churn_gt60 9.24 -2.9 8.85 161.95
money_Churn60 6.55 -0.41 8.69 354.82
Retention_duration2 0.6 5 8.57 1056.17
auxverb_Churn60 7.04 -2.52 8.52 266.36
present_Churn60 6.87 -2.47 8.37 251.3
affect_Churn_gt60 7.68 -1.13 8.29 167.35
Acq_MDB_ResRate 3.2 1.49 8.13 605
ppron_Churn60 7.67 -4.22 7.84 209.55
RETENTION_DM_gt60 0.4 4.07 7.82 360.39
Retent_Live_Agent60 4.48 0.63 7.73 142.71
space_Churn60 6.6 -2.65 7.65 310.23
past_Churn60 5.72 -0.92 7.5 299.28
ppron_Churn_gt60 6.86 -3.95 7.5 148.68
RETENTION_TELEM_gt6 2.6 2.57 7.19 231.97
0
shehe_Churn60 6.93 -5.3 7.04 172.2
negate_Churn60 7.17 -4.57 6.85 218.76
space_Churn_gt60 5.62 -1.72 6.59 208.03
time_Churn_gt60 3.81 0.68 6.56 191.2
Retent_LiveAg_MakePaymt 5.65 -2.71 6.4 131.92
_gt60
Retention_MDBcount60 -1.63 5.34 6.19 1190.73
Acquisition_duration2 -2.06 6.1 6.14 717.81
RETENTION_EMDM_gt60 3.8 0.62 5.89 388.27
percept_Churn60 6.31 -4.83 5.82 140.75
Retent_Change_Service_gt6 5.14 -0.76 5.6 89.93
0
present_Churn_gt60 3.7 -0.97 5.25 166.19
RETENTION_DMTELEM_ 4.8 0.02 5.16 169.17
gt60
incl_Churn60 4.22 -0.27 4.96 216.76
Retent_LiveAg_MakePaymt 4.04 -1.37 4.94 91.7
60
excl_Churn60 4.55 -3.54 4.93 193.51
conj_Churn_gt60 4.68 -2.25 4.92 159.95
NonPay 4.09 -1.8 4.79 42.49
Retent_LiveAg_ChangeSer6 3.81 -2.27 4.78 78.9
0
Retent_ChangeAcctInfo_gt6 3.77 -0.41 4.75 61.32
0
Retent_LiveAg_ChangeSer_ 4.29 -2.83 4.56 85.04
gt60
incl_Churn_gt60 3.16 0.15 4.5 153.08
139
article_Churn_gt60 3.99 -2.33 4.47 186.08
Retent_Change_Service60 4.3 -2.54 4.37 84.19
Retent_Make_Payment60 4 -2.66 4.24 94.76
shehe_Churn_gt60 3.53 -0.79 4.24 129.65
Non_Usage 3.14 0.16 4.11 31.49
article_Churn60 3.63 -1.75 4.08 254.65
time_Churn60 2.76 -0.57 4.01 286.53
leisure_Churn_gt60 4.01 -2.32 3.97 140.91
money_Churn_gt60 2.34 -0.44 3.75 214.13
hear_Churn60 4.29 -3.18 3.74 133.15
posemo_Churn60 3.7 -3.16 3.73 210.71
work_Churn_gt60 1.24 1.68 3.73 198.62
cause_Churn60 2.8 -1.23 3.63 214.21
Uncontrollable_reason 3.11 -0.7 3.58 40.01
nonfl_Churn60 4.87 -4.92 3.41 166.48
posemo_Churn_gt60 2.41 0.77 3.38 155.36
ipron_Churn_gt60 2.51 -0.25 3.36 121.55
Retent_LiveAg_ChangeInfo 3.1 -2.5 3.01 45.31
_gt60
Unknown_reason 2.14 -0.05 2.59 29.62
cause_Churn_gt60 1.66 0.35 2.59 159.63
Retent_ChangeAcctInfo60 2.44 -1.61 2.36 58.13
RADIO_TOTAL 1 2.54 2.33 1096.98
negate_Churn_gt60 2.26 -1.4 2.31 131.63
adverb_Churn_gt60 1.72 -0.33 2.17 131.58
insight_Churn_gt60 1.58 0.09 2.13 150.98
quant_Churn60 1.91 -1.24 2.05 157.3
discrep_Churn_gt60 1.8 -1.3 2.01 116.15
leisure_Churn60 2.05 -1.32 1.98 222.12
motion_Churn60 1.09 0.56 1.96 205.38
ipron_Churn60 1.87 -1.5 1.63 165.76
motion_Churn_gt60 1.31 -1.05 1.4 144.04
Retent_CUWI_gt60 1.59 -1.55 1.28 48.76
discrep_Churn60 0.58 0.46 1.27 172.73
percept_Churn_gt60 0.58 0.51 0.97 74.27
Retent_LiveAg_ChangeInfo -0.26 2.27 0.89 45.17
60
insight_Churn60 0.77 -0.38 0.85 192.34
Acq_MDBcount -1.7 2.16 0.08 989.92
Retent_CUWI60 0.04 -0.22 -0.05 45.01
Free_ended -0.12 0 -0.13 14.04
tentat_Churn60 0.54 -1.46 -0.18 97.75
excl_Churn_gt60 0.48 -1.9 -0.22 119.04
i_Churn_gt60 -0.42 0.26 -0.45 112.04
i_Churn60 -0.55 0.45 -0.51 142.91
Retent_CUWI_ChangeInfo_ -1.03 0.78 -0.92 36.82
gt60
Retent_CUWI_MakePaymt -0.67 -0.35 -0.93 22.4
140
_gt60
adverb_Churn60 -1.18 0.86 -0.96 164.51
Retent_CUWI_ChangeSer_ -1.27 0.95 -1.22 25.98
gt60
Retent_CUWI_MakePaymt -1.89 2.2 -1.32 18.21
60
negemo_Churn_gt60 -2.03 1.28 -1.7 79.88
hear_Churn_gt60 -3.15 4.12 -1.85 67.21
Retent_IVR_ChangeInfo60 -3.59 4.24 -2.12 6.59
Service_Dissatisfaction -2.52 1.75 -2.2 25.69
Retent_CUWI_ChangeSer6 -1.85 0.46 -2.24 25.85
0
inhib_Churn_gt60 -2.54 1.15 -2.69 88.69
Retent_IVR_ChangeSer_gt6 -4.26 4.2 -2.78 14.51
0
Retent_IVR_MakePaymt_gt -4.13 3.71 -2.88 15.87
60
achieve_Churn_gt60 -2.95 1.91 -2.9 117.79
Competition -3.27 1.74 -2.92 1.62
Retent_IVR_ChangeSer60 -3.16 1.78 -3.01 16.26
quant_Churn_gt60 -2.32 0.53 -3.06 99.97
Retent_IVR_gt60 -4.35 4.16 -3.07 29.04
Retent_CUWI_ChangeInfo6 -2.51 -0.21 -3.28 33.57
0
swear_Churn_gt60 -4.88 3.01 -3.29 1.97
swear_Churn60 -4 1.14 -3.34 1.42
tentat_Churn_gt60 -4.14 3.58 -3.44 80.68
future_Churn_gt60 -3.79 2.76 -3.46 87.2
Retent_IVR_ChangeInfo_gt -4.44 1.7 -3.67 9.89
60
negemo_Churn60 -3.81 1.32 -3.77 113.31
nonfl_Churn_gt60 -4.23 2.68 -3.9 129.55
body_Churn60 -4.81 2 -4.01 4.32
future_Churn60 -4.2 3.24 -4.04 108.48
Retent_IVR60 -5.63 4 -4.07 21.98
Product_Dissatisfaction -5.21 2.73 -4.46 13.85
Cost_price_fee -4.58 2.33 -4.49 9.86
bio_Churn_gt60 -4.17 1.02 -4.49 39.57
humans_Churn_gt60 -5.33 3.64 -4.56 49.39
filler_Churn60 -6.64 4.66 -4.71 15.39
Retent_IVR_MakePaymt60 -6.31 3.75 -4.94 9.49
ingest_Churn60 -7.44 3.91 -5.32 5.51
health_Churn_gt60 -6.35 4.18 -5.36 36.75
family_Churn_gt60 -4.1 0.43 -5.37 46.57
sexual_Churn60 -7.1 3.14 -5.4 6.05
death_Churn_gt60 -7.83 4.75 -5.51 5.86
Prepaid_period_ended -7.15 4.86 -5.61 14.58
ingest_Churn_gt60 -7.47 3.58 -5.79 4.77
141
achieve_Churn60 -7.85 5.66 -5.94 151.96
certain_Churn_gt60 -7.65 3.26 -6.27 72.57
number_Churn_gt60 -7.12 3.94 -6.49 90.39
anger_Churn_gt60 -7.84 5.15 -6.9 28.59
body_Churn_gt60 -8 2.35 -6.99 3.67
sexual_Churn_gt60 -9.35 4.71 -7.51 5.53
health_Churn60 -8.68 4.66 -7.68 51.6
friend_Churn_gt60 -10.03 5.48 -7.85 12.76
relig_Churn_gt60 -10.37 6.47 -8.22 21.19
bio_Churn60 -10.82 6.67 -8.26 55.06
home_Churn_gt60 -9.92 5.44 -8.66 46.47
death_Churn60 -12.17 5.05 -8.8 15.73
you_Churn60 -11.54 5.83 -8.98 13.19
sad_Churn_gt60 -13.31 11.3 -9.05 35.83
assent_Churn_gt60 -9.88 6.14 -9.1 41.78
inhib_Churn60 -10.55 7.36 -9.18 108.43
we_Churn_gt60 -11.47 8.19 -9.18 51.9
feel_Churn60 -9.6 3.22 -9.39 15.32
sad_Churn60 -11.54 7.47 -9.4 61.62
you_Churn_gt60 -10.29 2.53 -9.4 9.3
humans_Churn60 -9.77 4.56 -9.45 70.83
friend_Churn60 -12.44 5.88 -9.54 13.4
number_Churn60 -9.72 5.45 -9.57 135.24
they_Churn_gt60 -11.08 4.05 -10 31.06
see_Churn_gt60 -11.75 6.96 -10 34.75
filler_Churn_gt60 -14.5 9.32 -10.14 14.79
we_Churn60 -10.71 3.9 -10.31 55.15
certain_Churn60 -11.28 4.91 -10.76 88.38
anx_Churn_gt60 -14.82 9.47 -11.47 25.6
feel_Churn_gt60 -14 4.26 -11.8 12.55
anx_Churn60 -15.51 7.49 -12.08 26.9
see_Churn60 -13.92 7.48 -12.3 39.22
relig_Churn60 -15.56 6.53 -12.42 25.15
family_Churn60 -15.17 9.13 -12.59 71.36
home_Churn60 -17.84 11.48 -12.68 67.72
assent_Churn60 -18.84 12.23 -12.72 39.67
they_Churn60 -15.94 6.88 -13.85 51.06
anger_Churn60 -16.7 4.72 -14.62 29.5

Customer Relationship Manageme

Uploaded by

Copyright:

Available Formats

Customer Relationship Manageme

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Customer Relationship Manageme

Uploaded by

Copyright:

Available Formats

CUSTOMER RELATIONSHIP MANAGEMENT: FROM CONVERSION TO

Examining Committee Members:

All rights reserved

INFORMATION TO ALL USERS

customer relationship management, namely customer acquisition, retention, and winback

heterogeneity (e.g. in usage of self-care service channels, duration of service,

marketing communications, the depth of the promotion, the different communication

stages of CRM in terms of their effects on shaping customers’ decision making of

acquiring, retaining and winning back customers.

I would like to express my sincere gratitude to my “Fantanstic Four” dissertation

even though I was deeply impressed by your presentation at O-Bayes conference at

working with all of you will surely benefit my whole career.

one of my mentors in this school who supported me in all kinds of ways.

which means everything to me.

I would acknowledge WCAI and SiriusXM, this dissertation is based on your

grant of data. Without it, it is a mission impossible.

this effort into conclusion.

dream. However, her training and education of me to be an independent woman has

Thank you, mom.

DEDICATION .................................................................................................................. vii

LIST OF TABLES ............................................................................................................. xi

LIST OF FIGURES .......................................................................................................... xii

2. LITERATURE REVIEW ........................................................................................8

2.1 CRM (CUSTOMER RELATIONSHIP MANAGEMENT) ............................8

2.1.1 ACQUISITION AND CONVERSION…...……...............……………8

2.1.2 RETENTION AND CHURN………………...………………………11

2.1.4 ACQUISITION, RETENTION AND WINBACK……….…….……18

2.2 TEXT MINING ..............................................................................................19

2.2.1 TRANSFORMATION OF TEXT ………………...............................20

2.2.2 TEXT CLASSIFICATION ……………………..……………….…..26

3.1 SELF-CARE ...................................................................................................30

3.2 MARKETING INITIATIVES ........................................................................33

3.2.2 RADIO DENSITY…………………………………………………..39

3.2.3 PROMOTIONAL DEPTH…………………………………………..39

3.2.4 PRICING AND CHURN AND WINBACK………………………...40

3.3 DEACTIVATION REASONS .......................................................................42

3.4 DURATION ON WINBACK.........................................................................43

4.1 CLASSIFICATION METHODS....................................................................44

4.1.1 BAYESIAN LOGIT/PROBIT MODEL…………………………......44

4.1.3 RANDOM FOREST……….………………………………………...50

4.2 CLASSIFIER PERFORMANCE MEASURE ...............................................55

5. DATA AND VARIABLES .....................................................................................59

5.1 DATA .............................................................................................................59

5.2 VARIABLES ..................................................................................................61

5.3 TIME FRAME OF VARIABLES ..................................................................63

6. RESULTS AND DISCUSSION ..............................................................................69

6.1 VARIABLE IMPORTANCE IN FOUR EVENTS ........................................69

6.2 HYPOTHESIS TESTING ..............................................................................76

3. LIMITATIONS, FUTURE DIRECTIONS AND CONCLUSIONS ...........................85

A. LIWC2007 OUTPUT VARIABLE INFORMATION ........................................101

B. SUMMARY STATISTICS OF CONVERSION TABLE... ................................103

C. SUMMARY STATISTICS OF CHURN TABLE...............................................107

E. SUMMARY STATISTICS OF CHURNED WINBACK TABLE .....................116

F. CONVERSION VARIABLE IMPORTANCE TABLE ......................................121

G. CHURN VARIABLE IMPORTANCE TABLE .................................................126

H. NONCONVERTED WINBACK VARIABLE IMPORTANCE TABLE ..........131

I. CHURNED WINBACK VARIABLE IMPORTANCE TABLE ........................136

1. Factors Affecting Aacquisition and Conversion .......................................................50

2. Factors Affecting Retention and Churn ..................................................................57

3. Factors Affecting Winback. ......................................................................................67