Building Status in an Online Community - Gwern.net

Page created by Jim Hill
 
CONTINUE READING
Building Status in an Online Community - Gwern.net
ORGANIZATION SCIENCE
                                                                                                                                        Articles in Advance, pp. 1–22
http://pubsonline.informs.org/journal/orsc                                                                             ISSN 1047-7039 (print), ISSN 1526-5455 (online)

Building Status in an Online Community
Inna Smirnova,a,* Markus Reitzig,b Olav Sorensonc
a
  School of Information, University of Michigan, Ann Arbor, Michigan 48109; b Department of Accounting, Innovation, and Strategy, University
of Vienna, 1090 Vienna, Austria; c Anderson School of Management, University of California, Los Angeles, Los Angeles, California 90095
*Corresponding author
Contact: [email protected],     https://orcid.org/0000-0003-2275-1166 (IS); [email protected],
    https://orcid.org/0000-0002-8562-3754 (MR); [email protected],        https://orcid.org/0000-0002-0599-6738 (OS)

Received: August 11, 2020                           Abstract. We argue that the actions for which actors receive recognition vary as they
Revised: May 7, 2021; September 23, 2021            move up the hierarchy. When actors first enter a community, the community rewards
Accepted: October 27, 2021                          them for their easier-to-evaluate contributions to the community. Eventually, however, as
Published Online in Articles in Advance:            these actors rise in status, further increases in stature come increasingly from engaging in
February 11, 2022                                   actions that are more difficult to evaluate or even impossible to judge. These dynamics pro-
https://doi.org/10.1287/orsc.2021.1559              duce a positive feedback loop, in which those who have already been accorded some stat-
                                                    ure garner even greater status through quality-ambiguous actions. We present evidence
Copyright: © The Author(s) 2022                     from Stack Overflow, an online community, and from two online experiments consistent
                                                    with these expected patterns.

                                                    Open Access Statement: This work is licensed under a Creative Commons Attribution 4.0 International Li-
                                                      cense. You are free to copy, distribute, transmit and adapt this work, but you must attribute this
                                                      work as “Organization Science. Copyright © 2022 The Author(s). https://doi.org/10.1287/orsc.2021.
                                                      1559, used under a Creative Commons Attribution License: https://creativecommons.org/licenses/
                                                      by/4.0/.”
                                                    Funding: All authors would like to acknowledge funding from the Austrian Science Fund [Grant P
                                                      25768-G16].
                                                    Supplemental Material: The online appendix is available at https://doi.org/10.1287/orsc.2021.1559.

Keywords: status attainment              •   action ambiguity   •   online communities   •   stack overflow   •   experiment

Introduction                                                                                 Lynn et al. 2009), a process that Correll et al. (2017)
Those held in high esteem enjoy numerous advan-                                              have labelled as socially endogenous inference. Scien-
tages. High-status individuals attract more attention                                        tists, for example, assess contributions not only by
(Simcoe and Waguespack 2011, Bowers and Prato                                                reading articles and attending seminars but also by
2018, Reschke et al. 2018), receive outsized credit for                                      paying attention to who else has cited researchers,
their contributions (Kim and King 2014, Waguespack                                           giving deference to them. In evaluating the quality of
and Salomon 2015), and can more readily access a va-                                         a wine, consumers incorporate both their own opin-
riety of resources (Merton 1968, Bol et al. 2018). High-                                     ions about whether the wine tasted good and their be-
status firms can negotiate better terms from buyers                                           liefs about what others thought (Roberts et al. 2011).
and suppliers (Benjamin and Podolny 1999, Hsu 2004,                                             But this explanation for status attainment also poses
Nanda et al. 2020), receive favorable treatment from                                         a puzzle. Status hierarchies often appear steepest and
authorities (McDonnell and King 2018), and can hire                                          the benefits of status most pronounced in settings in
more able employees without offering higher salaries                                         which consumers cannot even determine ex post—
(Bidwell et al. 2015, Tan and Rider 2017).                                                   after they have consumed the goods—whether what
   How do firms and individuals come to hold high                                             they received had been of high quality (Sauder et al.
status? The most common claim has been that com-                                             2012, Sorenson 2014, Ertug et al. 2016). Consider man-
munities award status to those who have provided                                             agement consulting or investment banking. Even after
the most value to them, both through commitment to                                           receiving a recommendation, clients have little basis
the community and through high-quality contribu-                                             for assessing whether BCG or Goldman Sachs provid-
tions (e.g., Ridgeway 1981, Podolny and Phillips 1996,                                       ed better advice than they might have received from
Sauder et al. 2012, Hahl and Zuckerman 2014). Differ-                                        some less-celebrated firm. Given that these settings of-
ences in the value of these contributions nevertheless                                       fer little in the way of actual, verifiable information on
become amplified because, in assessing quality, audi-                                         past performance, it would seem that socially endoge-
ences rely not just on their own prior experiences but                                       nous inference has nothing to amplify. How then do
also on the judgments of others (e.g., Gould 2002,                                           status differences arise in these settings?

                                                                                     1
Building Status in an Online Community - Gwern.net
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
2                                                                  Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)

   One possibility is that initial differences in quality     such as artistic expression (McCall 1975, Sgourev and
or perceived quality emerge entirely by chance (Gould         Althuizen 2014). These dynamics produce a positive
2002, Lynn et al. 2009). In venture capital, for example,     feedback loop, in which those who have already re-
Nanda et al. (2020) demonstrate that early perfor-            ceived some recognition become further distanced
mance differences emerge from investing in the right          from the rest.
place at the right time, something that appears almost           We explore this question empirically using data
entirely random. These early successes nevertheless           from Stack Overflow (SO), an online community for
allow these investors to become central players in the        seeking and providing coding advice. Online commu-
community, as entrepreneurs and other investors in-           nities have become increasingly important settings
terpret these random successes as signals of quality.         for the exchange of information (Hwang et al. 2015,
   Another possibility, on which we elaborate here, is        Botelho 2018)—book reviews on Goodreads, travel
that the actions for which actors receive status shift as     advice on TripAdvisor, and product reviews on
they move up the hierarchy. Both individual and or-           Amazon, to name a few. These communities, more-
ganizational actors engage in a range of activities that      over, usually incorporate an evaluation system—
vary in the ease with which others can assess their val-      upvoting, likes, useful votes—as a means of motivating
ue. Some actions are objectively better or worse.             people to provide reviews and of allowing users to sort
Others involve a mix of objective elements and those          through the information (Constant et al. 1996, Lakhani
open to debate. At the extreme, ambiguous actions             and von Hippel 2003, Wasko and Faraj 2005). These sys-
elude any objective evaluation. A management con-             tems create status hierarchies, helping to determine
sultant, for example, could provide benchmarking in-          who becomes most influential to a wide variety of pur-
formation or he or she might proscribe a particular           chasing and consumption activities (Bianchi et al. 2012).
strategy. With a little research, clients could verify the    Understanding the dynamics of status attainment on
former. But, for the latter, they have little hope of de-     these systems represents an important question in its
termining whether another course of action would              own right.
have been better.                                                But SO also offers some notable advantages for un-
   People pay attention to different types of actions         derstanding the origins of status more broadly: We
and evaluate those actions differently depending on           can observe community members from the day that
the status of the actor performing them. When actors          they enter the community, before they have been ac-
first enter a community, we argue that community               corded any status. By contrast, interactions in person
members attend primarily to easier-to-assess actions,         almost always occur under the shadow of preexisting
awarding status to those who exhibit commitment to            status. Even when actors first enter communities, they
the community and competence and quality on rela-             usually arrive with signals of status from their affilia-
tively objective criteria (Ridgeway 1981, Hahl and            tions, their ascriptive characteristics, or their strategic
Zuckerman 2014). However, for actors who have al-             choices (e.g., Ridgeway 1991, Stuart et al. 1999, Phil-
ready attained some status, people increasingly pay           lips et al. 2013, Askin and Bothner 2016).
attention to their harder-to-assess actions, where val-          Community members engage in three main activi-
ue judgments also become more subjective. Because             ties on SO: asking questions, answering them, and
members of the community perceive these middle- to            commenting on questions and answers. We find that
high-status actors as being competent and producing           when individuals first enter, asking questions most
high-quality outputs, they interpret these quality-           strongly predicts their initial movement up the status
ambiguous actions as being valuable. These harder-to-         hierarchy. As they gain stature, however, further
assess actions therefore contribute increasingly to the       movement up the ladder depends primarily on an-
attainment of further status as actors move up the            swering questions and on commenting. Much of the
hierarchy.                                                    rise from the top 10% to the elite of the elite, the top
   A statistician beginning his or her career might first      5% and higher, appears to depend on commenting. To
gain status by providing accurate answers to objective        the extent that these activities range from the value of
questions; but as he or she gained standing, audiences        questions being easier to evaluate to that of answers
would increasingly pay attention to and accord fur-           and comments being harder to evaluate, these results
ther status to him or her for weighing in on matters          are consistent with our expectations.
open to debate, such as the right approach to research           Although our use of individual-level fixed effects in
or the significance of open problems. Early on, judges         our analysis of the SO data allows us to reject many al-
and audiences similarly accord status to artists and          ternative interpretations for these patterns, we cannot
musicians in terms of their technical abilities (McCall       rule out within-person increases in objective quality
1975). For those performers who have already reached          over time, learning, as an alternative explanation. Al-
a moderate level of status, however, the receipt of ad-       though the observational data do not allow for an
ditional status depends on more subjective criteria,          easy resolution to this potential confound, we also ran
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)                                                           3

two online experiments in which we exogenously as-                          2001, Bendersky and Shah 2012). Although these as-
signed the status of a contributor to each action (a                        sessments seem unsurprising in settings where people
question, an answer, or a comment). Those experi-                           can easily assess the value added by community
ments produced qualitatively consistent results: the                        members, status orderings curiously emerge even in
status of the questioner did not influence the status                        settings—such as management consulting and invest-
gains associated with questions, but higher status did                      ment banking—where it would seem that people have
lead to more positive perceptions of answers and                            little or no objective basis for evaluating quality (e.g.,
comments. Status also appeared somewhat more im-                            Podolny 1993, Ertug et al. 2016). In fact, these settings
portant to the evaluation of comments than it did to                        appear to produce some of the steepest and most resil-
that of answers. We discuss the implications of our re-                     ient status hierarchies (Sorenson 2014).
sults both for online communities, such as SO, and for                         We can reconcile this apparent puzzle and more
the emergence and consequences of status hierarchies                        broadly understand status attainment processes by
in offline communities.                                                      recognizing that actors engage in a variety of actions,
                                                                            some of which allow for relatively easy and objective
Status Attainment                                                           valuation, others of which do not. Some actions are
Actors do not claim status. People bestow status on in-                     easy to evaluate as objectively useful or not. Others
                                                                            mix elements that are easy to evaluate with others
dividuals and on organizations. They have been
                                                                            that are open to interpretation. Yet others, ambiguous
thought to do so on the basis of the value that
                                                                            actions, may elude any objective evaluation. Academ-
they perceive that a particular actor has provided to
                                                                            ics, for example, inform each other on a range of is-
the community (e.g., Ridgeway 1981, Podolny and
                                                                            sues, from the factual to the speculative. Investment
Phillips 1996, Hahl and Zuckerman 2014). But actors
                                                                            bankers similarly advise their clients on many
can influence these conferrals of status through their
                                                                            decisions, from the quickly verified pricing of public
actions. These perceptions of value presumably come
                                                                            securities to the harder-to-assess identification and
from a combination of the effort or commitment that
                                                                            valuation of private firms to acquire.
the actor has demonstrated to the community and the
                                                                               In updating their beliefs about the competence or
competence or quality of their actions. Because status
                                                                            quality of actors, we expect that people will attend to
stems in part from these quality perceptions, it simul-                     different types of actions, or to different dimensions
taneously serves as a signal of quality (Berger et al.                      of those actions, depending on the status of the actors
1972, Podolny 1993, Podolny and Phillips 1996, Cao                          performing them.
and Smith 2021).
   When actors first enter relationships, groups, and                        Easy-to-Assess Actions
communities, they begin those interactions without                          Easy-to-assess actions, by definition, do not require
status. Being without status does not mean being low                        much time, effort, or expertise to evaluate. This fact
status. Low status would imply that others believed                         also means that they should generally involve evalua-
the actor incompetent or of poor quality. Being with-                       tion on objective criteria.
out status instead means that alters simply do not                             Even complex and seemingly ambiguous actions of-
have any beliefs about what value the actor might                           ten have such easy-to-evaluate components. In many
provide.                                                                    settings, for example, simply expending time or effort
   People attend to a wide variety of queues as they at-                    on a community may serve as one of the easiest actions
tempt to situate people in the status hierarchy. Many                       to evaluate. Time and effort signal commitment to the
of these signals provide only diffuse information.                          community. Numerous experiments have therefore
They may, for example, infer the status of an individu-                     found that groups and communities bestow status on
al based on the average status of others with the same                      those who expend effort on their behalf, particularly
ascriptive characteristics, such as gender (e.g., Ridge-                    when that effort appears altruistic (e.g., Ridgeway
way 1991). Or they might observe which other actors                         1981, Willer 2009, Hahl and Zuckerman 2014).
in the community interact and affiliate with the en-                            Beyond simply the time involved, many actions
trant, updating their beliefs based on the status of                        have other easy-to-evaluate components. Clients of
those alters (e.g., Podolny 1993, Stuart et al. 1999, Jen-                  management consultants and investment bankers, for
sen 2006). Symbolic actions might also provide signals                      example, can assess the accuracy of the factual infor-
of status (e.g., Askin and Bothner 2016).                                   mation and calculations reported in a proposal or pre-
   But community members also begin to form direct                          sentation. They can also easily evaluate their quality
judgments of commitment, competence, and quality                            on more superficial features, such as the absence of
and to accord status to entrants to the community on                        misspellings and grammatical errors.
the basis of the actions of those newcomers (Berger                            When actors enter a community, community mem-
et al. 1972, Ridgeway 1981, Henrich and Gil-White                           bers first attend to these easy-to-evaluate actions and
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
4                                                                    Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)

components of actions when assessing and conferring                As actors climb the status hierarchy, those interact-
status on the actors. They reward those who spend               ing with them then have higher expectations about
time in and on the community. They also hold in                 their commitment, competence, and the quality of all
higher regard those who perform well on easy-to-                of their actions. These expectations rise regardless of
evaluate objective criteria, such as being accurate or          whether alters have observed the actors themselves or
technically able.                                               whether they have simply inferred the status of those
   Even if the perceived quality of these easy-to-              actors based on affiliations or patterns of deference.
evaluate elements does not depend on the status of                 As the expectations of community members rise, it
the actor producing them, the extent to which they              becomes increasingly difficult for actors to exceed
contribute to conferrals of status will. Before people          these expectations on easy-to-evaluate actions. They
have strong priors about an actor, observations of ef-          have already demonstrated commitment. Their accu-
fort and high quality on these easy-to-evaluate dimen-          racy cannot exceed 100%. Easy-to-evaluate actions
sions will lead alters to update their beliefs about the        therefore eventually become self-limiting in terms of
competence or quality of the actor to regard them as            their further contributions to standing in the commu-
higher status.                                                  nity. At some point, the expectations of others for
   Given this line of reasoning, we expect the following:       commitment and competence, based on the actor’s
Hypothesis 1. At low levels of status, actions that are eas-    status, match the actor’s observed easy-to-evaluate
ier to assess contribute to increases in status.                performances.
                                                                   We therefore expect the following:
   Even though these easy-to-evaluate elements often
represent but some of the actions or some of the com-           Hypothesis 2. As status increases, actions that are easier
ponents of the actions in which actors engage, the              to assess contribute less and less to further increases in
evaluation of them influences beliefs about the general          status.
quality of the actor for at least two reasons. On the
one hand, much as people use “test” features when               Difficult-to-Assess Actions
assigning category membership (Hannan et al. 2007),             As actors rise in the status hierarchy, community
people may infer that quality on one type of action             members increasingly pay attention to more-difficult-
should correlate positively with quality on other sorts         to-assess actions or components of actions. Similar to
of actions. If the analysis in a research paper appears         Phillips and Zuckerman’s (2001) argument that actors
solid in technical terms, the reader might place more           often require sufficient status to enter the consider-
faith even in the paper’s review of the literature. If a        ation set, community members will only exert the ef-
lawyer’s brief gets all of its facts right, then readers        fort necessary to assess the quality of these actions if
might give greater credence to any leaps of legal               they believe the actor performing them of sufficient
argumentation.                                                  ability or quality to justify their time. People therefore
   On the other hand, such spillovers in beliefs also           allocate more attention to the ideas and outputs of
stem from automatic psychological processes. Peo-               higher-status actors (Merton 1968, Simcoe and
ple encode their perceptions of quality as moods or             Waguespack 2011). Consistent with the idea that this
emotions (e.g., Swinyard 1993, Danner et al. 2016).             stems from expectations regarding the quality of their
But once encoded as a feeling, people can no longer             outputs, Cao and Smith (2021) demonstrate that peo-
connect that feeling to a specific component of                  ple only differentially attend to those of higher status
the product or the service or the producer. The pos-            when they believe status serves as a meaningful signal
itive effect created by these perceptions therefore             of quality.
creates a general mood that leads alters both to re-               At the extreme, the hardest-to-assess actions, am-
call their past experiences with actors more posi-              biguous actions or components of actions, defy any
tively and to overestimate the probability of having            objective evaluation. Ambiguity does not imply that
good experiences with them in the future (Bower                 people vary in their preferences, in what they would
1981, Johnson and Tversky 1983, Wright and Bower                regard as high value. Almost everyone would agree
1992).                                                          that high-quality management consulting should im-
   The reverse also holds true. The negative effect as-         prove the performance of the firm receiving the ad-
sociated with undesirable experiences can create a              vice. Similarly, most would concede that a highly
pall over the actors responsible and everything that            competent investment banker should accurately pre-
they do (Johnson and Tversky 1983, Wright and Bow-              dict the price that investors would pay for a company
er 1992). When flights have been delayed, for exam-              in an initial public offering or acquisition (Podolny
ple, passengers perceive the plane as less clean, the           1993).
seats as less comfortable, and the food as lower quali-            The ambiguity rather resides in the near impossibil-
ty (Anderson et al. 2009).                                      ity of assessing whether these objectives have been
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)                                                            5

met. Consider, for example, career advice. Although                         people tend to pay attention to the information that
the person receiving any such advice might perceive                         would support their existing opinions and to ignore
it as useful at the time, any objective evaluation of its                   that which would contradict them (Wason 1960, Klay-
quality can only be made far in the future, after the re-                   man and Ha 1987). For firms and individuals held in
cipient has had the opportunity to act on it. Even                          high regard then, audiences may selectively attend to
then, evaluation would prove difficult. To assess its                        information, even noise, that affirms their opinions.
quality, two types of counterfactuals are needed. First,                       But this effect probably also stems, in part, from so-
what would have happened to the individual in the                           cially endogenous inference. When faced with uncer-
absence of the advice, if they had followed a different                     tainty about how to evaluate an action, people rely on
path? Second, what career advice might another per-                         the choices and opinions of others—assuming that
son have given at the time? Without solid evidence of                       those individuals have information or insight that
both counterfactuals, any evaluation of the quality of                      they do not—as a means of resolving their uncertainty
the advice becomes largely subjective.                                      (Ridgeway and Erickson 2000, Lynn et al. 2009, Cor-
   Such fundamental ambiguity in evaluation exists for                      rell et al. 2017). Scientists, for example, assess contri-
actions in many settings. Consider a management con-                        butions not only by reading articles and attending
sulting firm giving strategy advice. What recommen-                          seminars but also by paying attention to who else has
dations would another consulting firm have given?                            cited these researchers. In evaluating the quality of a
For an investment bank underwriting a public offer-                         song, listeners incorporate both their own opinions
ing, would another bank have proposed a more accu-                          and their beliefs about what others thought (Salganik
rate initial price? With sufficient time and information,                    et al. 2006).
some of these actions might be open to objective evalu-                        However, whereas the existing literature on socially
ation. Someone could, for example, examine the aver-                        endogenous inference has generally treated the pro-
age level of underpricing across many public offerings                      cess as a property of the setting (e.g., Podolny 1993,
or the average performance of client firms many years                        Lynn et al. 2009), we would argue that it only occurs
down the road. But, in any individual instance and at                       for certain types of actions and, more crucially, that it
the time the actions have been performed, the quality                       can only begin after actors have been accorded some
of these actions remains ambiguous.                                         stature on the basis of easy-to-evaluate actions. The
   In the face of this ambiguity, we argue that the per-                    extent to which it operates therefore varies across ac-
ceived quality of these actions will depend on the sta-                     tors within settings and also over time for any given
tus of the actor performing them. People will find it                        actor
near impossible to judge the ambiguous actions of                              Although the perceived quality of these ambiguous
those without status, those about whom they have no                         actions stems from the status of the actors performing
priors of competence or quality. People may even                            them, we believe that they will nevertheless contrib-
treat ambiguous actions from low-status actors as con-                      ute to further gains in the perceived competence or
firming evidence of incompetence or of low-quality                           quality of these actors. If people understood that their
performance (Riecken 1958).                                                 favorable perceptions of these difficult-to-evaluate ac-
   But as status rises, the fact that the actor has status                  tivities reflected the status of the actors, then that un-
positively influences the audience’s interpretation of                       derstanding might inoculate them from using these
the ambiguous action (Merton 1968, Correll et al.                           biased opinions to update their beliefs. However,
2017). Sgourev and Althuizen (2014), for example, viv-                      whether due to confirmation biases or socially endog-
idly recount how the same style inconsistency that                          enous inference, we suspect that people are either not
critics denigrated early in Picasso’s career (before he                     aware of these biases or underestimate the extent to
had status) became seen as evidence of his genius af-                       which they operate. Status then increases the per-
ter he had attained prominence.                                             ceived quality of difficult-to-evaluate actions, which
   Importantly, in contrast to easy-to-evaluate dimen-                      leads to further increases in status, creating a virtuous
sions on which expectations of quality based on status                      cycle of positive feedback.
will eventually match observed quality, in the absence                         This line of reasoning leads us to propose the
of objective evaluation, perceptions rule uncon-                            following:
strained. Advice from a Goldman Sachs or a Nobel
Prize winner almost automatically becomes seen as                           Hypothesis 3. As status increases, difficult-to-evaluate ac-
important and insightful, as high quality. When a                           tions contribute more and more to further increases in
high-status actor weighs in on some topic, audiences                        status.
perceive those opinions as further evidence of the
individual’s brilliance.                                                    Stack Overflow
   In part, this effect probably stems from confirma-                        We investigate the dynamics of status formation on
tion bias. When presented with conflicting evidence,                         Stack Overflow. SO provides a forum in which people
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
6                                                                  Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)

can find solutions to their programming problems,              best answer given, submitted by “Mark Byers” (again,
can help solve others’ problems, and can discuss a            see the area just below the box). If one scrolled down
range of topics related to computer programming and           the screen, one would see the other answers, se-
software engineering.                                         quenced in terms of the number of “useful” votes that
   SO provides an amazing resource. It is the most ac-        they had received. The smallest box, box 3, mean-
tive online exchange for programming-related infor-           while, surrounds one of the comments, offered by
mation, with almost 20 times as many questions and            “Ankur-m” nearly two years after the question had
answers as the next most active exchange. Since its in-       originally been posed.
ception in 2008, users have posted more than 20 mil-             As in many online communities, status plays an im-
lion questions and have received more than 30 million         portant role here. The platform does not moderate
answers to those questions.1 Most questions receive           participation and questions, answers, and comments
answers in a matter of minutes (Mamykina et al.               vary tremendously in their quality. As in other com-
2011). Every month, 50 million unique visitors search         munities, the solution to this problem involves a type
the site for programming-related information.                 of crowdsourced quality evaluation. Members of the
   SO also offers an excellent setting for examining the      community can upvote (or downvote) questions, eval-
dynamics of status formation. First and foremost,             uating them as clear and useful (or not). In Figure 1,
most members of the community interact only                   for example, one can see that the question received 75
through the platform and the platform documents               more upvotes than downvotes (see the number be-
nearly all of their activity. We therefore have a com-        tween triangles to the left of box 1). Members can also
plete archival record of the actions that contribute to       evaluate answers and comments as useful (or not).
status attainment. Second, the fact that few of these         The top answer to this question also happened to
individuals have prior experience with each other out-        have received 75 more upvotes than downvotes (see
side the platform means that members join the com-            the number between triangles to the left of box 2).
munity without any preexisting status.                           The platform uses community members’ reactions
   Anyone can join SO. Joining allows a user not just         to questions and answers to award points and badges
to read the existing discussions but also to contribute       to members who provide content. These points and
content. Members undoubtedly participate for a varie-         badges both provide rewards for contributing and sig-
ty of reasons. Some may derive satisfaction from              nals to those consuming the content. SO displays
contributing to the public good; others may enjoy the         them prominently. Consider Figure 1 again. Look at
social exchange or the recognition garnered from their        the line just below the user names for the people ask-
contributions; yet others may see providing advice as         ing and answering questions. The first number reports
a form of generalized reciprocity for the benefits that        the points that the individual has received; the num-
they themselves have received (Constant et al. 1996,          bers to the left of the colored dots detail the number of
Lakhani and von Hippel 2003, Penoyer et al. 2018,             badges that the user has received. “Amit Patil,” for ex-
Chen et al. 2019).                                            ample, has 708 points and has earned 3 gold badges,
   We downloaded our data from the archive (https://          11 silver badges, and 22 bronze badges.2
archive.org/details/stackexchange) that Stack Over-              Users can also find out more about any particular
view released to the public on March 13, 2017. Our            member, in their profile, by clicking on the person’s
full data set includes information on all activities on       username. Figure 2 provides an example of a profile:
the SO platform from its inception, on July 31, 2008, to      “nc3b” has been a member for more than nine years
our download date, of March 13, 2017. Because of the          (though one can also see that the user has not been ac-
large size of the archive, our analyses focus on a 1%         tive since 2013), asking 19 questions and posting more
simple random sample of members (30,418 accounts).            than 176 answers. Immediately below the avatar on
Each member who had registered on SO before March             the left-hand side of the screen, you can see the points
13, 2017, had an equal probability of being included in       (10,879) and the number of badges that nc3b has been
the sample, regardless of their status, their year of reg-    awarded. Based on the tag information reported in
istration, and their level of activity. We did, however,      the middle of the page, this member appears to have
exclude elected moderators from this sample as they           expertise primarily in the C programming language.
often appear as outliers in their degree of activity on       But we have no information about the individual
the platform.                                                 beyond his or her activity on SO. The line—
   Figure 1 depicts an example of what one would see          “Apparently, this user prefers to keep an air of mys-
as a set of actions related to a particular question. Box     tery about them.”—represents generic text that SO
1 surrounds the initial question. It includes an expla-       displays for all members who have not provided self-
nation of the problem and a snippet of code reporting         descriptions in their profiles.
what the person, “Amit Patil” (see the shaded area               Although this example does not include any identi-
just below the box), had tried. Box 2 highlights the          fying information, some users do provide personal
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)                                                            7

Figure 1. (Color online) Example of a Stack Overflow Question Page

Source. https://stackoverflow.com/questions/3361768/copy-data-from-one-column-to-other-column-which-is-in-a-different-table/
13454906#13454906 (accessed April 30, 2019).

information. SO does not maintain official statistics on                     top scorers). For most individuals, therefore, SO users
the proportion of members who have completed their                          would have little if any information on which to base
profiles. We therefore examined two subsets of 100                           prior beliefs about their status (cf. Bianchi et al. 2012).
members—one selected at random from our sample                              High-status members also do not appear to differ
and a second set of the 100 users who had accumulat-                        from the average community member on this outside
ed the most points—and hand coded their profiles.                            information.
Although nothing requires SO members to choose
user names that identify them, 46 of 100 individuals in
the random sample and 41 of the top 100 scorers chose                       Measures
user names that resembled a combination of a fore-                          Dependent Variables. Status has usually been mea-
name and a surname. Of those potentially using real                         sured in one of two ways. The first involves selecting
names, fewer than half provided any additional iden-                        some award, such as the Nobel Prize or an endowed
tifying information, such as an employer, in their pro-                     chair (e.g., Merton 1968, Reschke et al. 2018). This ap-
file (19 of 46 of the random sample and 20 of the 41                         proach has the advantage of having a high degree of
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
8                                                                           Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)

Figure 2. (Color online) Example of a Member Profile Page

Source. https://stackoverflow.com/users/226266/nc3b (accessed April 30, 2019).

face validity. Few would argue that the Nobel Prize                    award or penalize another user in six main ways: (1)
does not confer prestige on its recipient. But these                   Upvoting (downvoting) someone else’s question adds
prizes and positions also reflect status. People win                    5 points to (subtracts 2 points from) that person’s
Nobel Prizes and receive chairs because they are al-                   score. (2) Upvoting another member’s answer adds 10
ready held in high regard. Studies based on this ap-                   points to (subtracts 2 points from) that member’s
proach therefore compare the pinnacle of the prestige                  score. (3) When the question asker selects an answer
hierarchy to the merely elite (Reschke et al. 2018).                   as the best one offered, the person providing that an-
   A second approach collects information on patterns                  swer receives an additional 15 points. (4) A member
of deference (e.g., Podolny 1993). Highly cited scien-                 can also offer a “bounty” on a question. If they choose
tists, for example, have higher status on average than                 to award the bounty to a particular answer, the person
those receiving less attention. Our own measures fol-                  awarding the bounty effectively transfers those points
low the logic of this second approach.                                 from their own score to the person receiving the boun-
   We examine two outcomes. Our first measure                           ty. (5) If a user proposes an edit to a question, answer,
builds off of the score that SO uses to summarize how                  or comment and the original poster accepts that edit,
other members have evaluated the person’s contribu-                    the person proposing the edit receives 2 points. (6) If a
tions. The second measure captures attention, some-                    person’s post receives six flags identifying it as spam
thing strongly correlated with status (Merton 1968,                    or as being offensive, the person loses 100 points.
Simcoe and Waguespack 2011).                                           These reactions account for nearly all points awarded
   SO scores its members based on how other                            to SO members.3
members respond to their contributions. Much as                           Returning to the example in Figure 1, the asker here
publishing a paper does not ensure that anyone cites                   received 375 points for this question based on the re-
it, posting a question, an answer, or a comment does                   actions of other users ( 5 × 75). The person who pro-
not guarantee the poster any points. During the peri-                  vided the answer meanwhile received 765 points for
od covered by our data, community members could                        the combination of the upvotes plus being accepted as
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)                                                          9

the best answer ( 10 × 75 + 15). But these numbers                         we scraped the website for profile views every other
represent outliers. The modal question and answer re-                       week from November 30, 2016, to March 18, 2017 for
ceive no reactions—no upvotes or downvotes—and                              all SO members, generating eight observations for
therefore do not result in any points being rewarded                        each member (242,872 user-period records).
to the posters.                                                                Members also vary a great deal in their visibility.
   The archival data from SO only provided this                             The average individual had received 24.7 profile
score at the time of downloading, but SO reports the                        views by March 2017. But the cumulative number of
algorithm that it uses to calculate these scores and                        profile views to that point ranged from just 1 to more
our data include nearly all of the relevant informa-                        than 9,000.
tion for this calculation. We therefore used the al-                           Our dependent variables correlate with each other
gorithm together with the activity information to                           at 0.75. But each measure has its strengths and weak-
create time-varying imputed evaluation scores. We                           nesses. The primary weakness of the evaluation score
computed this variable at the user-month level for                          is that SO has defined the weights for how particular
the period from SO’s inception up until March 2017,                         reactions contribute to status. Visibility, meanwhile,
giving us a total of 1,420,359 distinct user-period ob-                     has the advantage of not assuming any weights but
servations. Our manually reconstructed scores for                           has the disadvantage that members may view profiles
March 2017 correlate to those available from SO at a                        for reasons not connected to status, introducing noise
level of 0.98.4                                                             into that measure. To the extent that both measures re-
   Figure 3 depicts the distribution of these (logged)                      veal similar patterns, however, it should increase our
evaluation scores in our data in a violin plot. The                         confidence that the results reflect actual status attain-
width of the violin at each point depicts the propor-                       ment processes.
tion of the mass of the distribution at that point; the
dot and boxplot down the center represent the mean                          Independent Variables. Our theory argues that differ-
and interquartile range, respectively. One can clearly                      ent types of activities contribute to status formation at
see that a large proportion of members register but                         different points in the process. SO members engage in
then never receive any attention for their activity on                      three main activities: posting questions, answering
the platform. The average individual received 160.6                         them, and commenting.5 Questions seem easiest to
points over our observation window. But this score                          evaluate. They demonstrate engagement with the
has a very long tail, with one person being awarded                         community. Users can understand most aspects of
more than 116,000 points.                                                   their value simply from reading them.
   Our second dependent variable stems from the fact                           Consider an example. One new user posted a ques-
that with status comes attention. This measure, which                       tion, “Combining two vectors element-by-element,”
we label as visibility, counts the cumulative profile                        with the following text: “I have 2 vectors [examples]. I
views—the page depicted in Figure 2—that a member                           would like to combine them so that the resulting vec-
has received up to a given point of time. SO, again,                        tor is [example]. I can easily do this with a loop but it
only provides cross-sectional information on this mea-                      is very slow so can anyone provide a fast way to do
sure. Because we could not reconstruct it retroactively,                    this?” The title clearly and succinctly describes the is-
                                                                            sue. Readers can readily assess whether they would
                                                                            value a resolution to it. To date, it has received eight
Figure 3. (Color online) Distribution of the Logged                         upvotes, adding 40 points to the evaluation score of
Evaluation Scores                                                           the asker.
                                                                               Evaluating answers, by comparison, requires more
                                                                            effort. Simply providing an answer demonstrates
                                                                            commitment to the community. Members devote time
                                                                            to writing them. Answers to questions often run to
                                                                            multiple paragraphs and include lines and lines of
                                                                            code.
                                                                               In our hand-coded sample, however, upvotes and
                                                                            downvotes came not from length but from whether
                                                                            the solution worked. Determining that demands more
                                                                            effort or expertise. Either the evaluator must have
                                                                            sufficient experience that they had tried the solution
                                                                            before or the person must attempt to implement the
                                                                            advice. Many problems also have multiple solutions.
                                                                            Determining the best approach might require a great
                                                                            deal of expertise and may depend on the situation.
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
10                                                                Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)

   Comments, meanwhile, often seem at least as diffi-         unfold over time, we want to account for any matura-
cult as answers to evaluate in terms of their value.6        tion effects, such as learning at the level of the individ-
Understanding the value of these comments often re-          ual. A user tenure variable, therefore, captures the
quires reading and understanding the entire thread,          logged number of days since the user joined the plat-
not just the original question but also the proposed         form. All of the models also include a count of the
solutions. Consider some examples. As a response to          logged number of “favorite” tags that a user has giv-
a proposed answer, the original asker commented: “I          en, for the logged number of times that a user answers
don’t want R to store 50,000 zeros. Rather, I want           his or her own question, and for the logged number of
some type of sparse storage within each loop.” Anoth-        times a user accepts his or her own answer as being
er user responded to this comment with another com-          the best one, even though these represent rather rare
ment, “plenty of results here on sparse matrices,”           events and even though they have no mechanical rela-
with links to two additional SO threads. In another          tionship to either of our dependent variables. We add
thread, as a comment on a question, one user re-             one to all of these counts prior to logging to avoid the
sponded “just count?? the order of the variables is the      generation of missing values.
same as the order of the columns.” Effectively, the             Our models also include a variety of measures of
user offered a solution to the question without posting      activity and objective quality at the question-answer
an official answer.                                           level. The models include the ratio of questions and
   To the extent that these contributions range from         answers posted by the individual that include snip-
questions being easier to evaluate to answers and            pets of code. The models also control for the number
comments being more difficult to evaluate, we there-          of bounty points received.
fore expect that the SO community will pay more                 We also included variables to capture the propor-
attention to questions for posters at lower levels of        tion of the users’ questions and answers that had been
status but that they will increasingly attend to answers     in popular categories. Because these categories have
and comments as posters rise in the status ranks.            more people posting and reading questions, answers,
   Our independent variables measure each of these           and comments, activity in these domains may attract
activities:                                                  more votes and attention. Table 1 reports descriptive
    Questioning activity counts the (logged) cumulative      statistics for the variables used in our analyses (sum-
number of questions (plus one) an individual has             mary statistics for the control variables appear in Ta-
posted on the platform up to a given point of time.7 In
                                                             ble A1 in the online appendix).
our full sample, SO members post three to four ques-
tions, on average; but the range runs from 0 to 752.
                                                             Estimation Strategy. Our theory argues that the ac-
    Answering activity counts the (logged) cumulative
                                                             tions that the community values and for which it
number of answers (plus one) an individual has posted
                                                             awards status vary as a function of the actor’s current
on the platform up to a given point of time. In our sam-
                                                             status. One obvious approach to exploring this idea
ple, the median member posted five answers; but the
                                                             would involve regressing the evaluation score in one pe-
range in answering activity runs from 0 to 1,966 posts.
                                                             riod on a set of interaction effects between the various
   Commenting activity, meanwhile, counts the (logged)
cumulative number of comments (plus one) that a user
                                                             Figure 4. (Color online) Logged Average Number of Posts
has submitted.8 The average member in our sample
                                                             on Stack Overflow per Quarter per User by Status Level for
posted about 13 comments, but some have posted
                                                             All Users Who Reached the 95th Percentile of the Evaluation
more than 5,000.                                             Score Distribution
   Figure 4 depicts the natural logs of the quarterly
distributions of activities by status level for all users
who eventually reached the 95th percentile of the
evaluation score distribution. It therefore provides
some sense of how activity on the platform evolves
with status, within user. As users rise through the
ranks, they become more active on the system. But
even those at the lowest levels post comments, and
even those at the highest levels still pose questions.
The most notable shift in behavior appears to be that
users flatten out in their rate of asking questions after
reaching the 75th percentile of the evaluation score
distribution.
   We also included control variables to adjust for a
number of user attributes. Because these processes
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)                                                              11

Table 1. Descriptive Statistics for Key Variables                               various quantile ranges, with the lowest status level
                                                                                appearing at the top of each grouping and with status
Variables                     N        Mean      Std. dev.    Min       Max
                                                                                increasing as one moves down.
Evaluation scores        1,420,359     160.6      1,203.25      1     116,114      Consider first the effects of asking questions. At the
Visibility                242,872       24.74      145.05       1      9,037    lowest levels of status, nearly all gains in status ap-
Questioning activity     1,420,359       3.6        12.95       0       752
                                                                                pear associated with asking questions (consistent with
Answering activity       1,420,359       5.03       34.08       0      1,966
Commenting activity      1,420,359      12.85       75.57       0      5,027    Hypothesis 1). The coefficient implies that a one-unit
                                                                                increase in questioning activity predicts a 0.57% in-
Note. Std. dev., standard deviation.
                                                                                crease in a person’s evaluation score (p < 0.001). Ask-
actions and the evaluation score in the prior period. But                       ing questions continues to predict gains in status all
                                                                                the way up to the 95th percentile of the evaluation
that approach has the disadvantage of imposing a func-
                                                                                score distribution. The apparent value of asking ques-
tional form on how attention to types of action change
                                                                                tions in terms of additional status gains, however, de-
with status.
                                                                                clines rapidly as users move up the distribution of
   To allow the relationship between the reactions to
                                                                                evaluation scores. Based on only questioning activity,
actions and status to vary flexibly, we used a modified
                                                                                an individual entering the platform could move into
version of quantile regression. We began by specify-
                                                                                the top half of the evaluation score distribution by
ing five quantile intervals—0%–50%, 50%–75%,
                                                                                posting 21 questions with average reactions. Moving
75%–90%, 90%–95%, and above 95%—for each of our
                                                                                from the 50th percentile to the 75th percentile would
dependent variables. For the evaluation score, the cut
                                                                                require another 35 average-reaction questions. At the
points fall at 8, 37, 187, and 506 points; for visibility,
                                                                                very highest levels, in the top 5%, posting questions
the boundaries between the quantile intervals come at
                                                                                actually has a negative association with the evaluation
4, 10, 35, and 79 page-views. Although our definition
                                                                                score. At that level, questions disappoint. Consistent
of these boundaries stem from the distributions of                              with Hypothesis 2, then, the value of easy-to-evaluate
these variables at the end of our period, the inclusion                         activities for status gains exhibits diminishing margin-
of period fixed effects should account for the fact that                         al returns.
the underlying distribution evolves over time.9                                    Answering activity, by contrast, does little for status
   We then estimated coefficients for our independent                            at the very lowest levels (we cannot even reject the
variables within each of these quantile intervals using                         null hypothesis that it has no effect). At middle and
a series of models with user-level fixed effects.10 These                        high levels of status, however, answering questions
fixed effects should capture time-invariant unob-                                begins to predict increases in status, consistent with
served differences across community members, such                               Hypothesis 3.
as gender, native language, and formal education (as                               Commenting activity, perhaps the most difficult to
well as ancillary information available on profiles).                            evaluate of the three actions, also has no significant ef-
Our estimates therefore reflect the changing reactions                           fect on status gains for those in the bottom half of the
of the SO community to various types of actions with-                           status distribution. At moderate to high levels of sta-
in a specific individual within a particular range of                            tus, posting comments has a more pronounced associ-
the status distribution.                                                        ation with status attainment (β3  0.16 and 0.20 at the
   SO user-period provides the unit of analysis in                              75th and 90th percentiles, p < 0.001). Consistent with
these regressions. Standard errors have been clustered                          Hypothesis 3, the relationship between commenting
at the user level.11 We estimate our regression models                          and status gains becomes ever stronger as individuals
according to:                                                                   climb the status hierarchy.
 Evaluation scorei,t or Visibilityi,t  α + β1                                     Figure 6 again depicts the main results, this time us-
                                                                                ing visibility as the measure of member status (for the
         × questioning activityi,t + β2 × answering activityi,t
                                                                                corresponding table, see Table 3). The results largely
         + β3 × commenting activityi,t + γ1 × user controlsi,t                  mirror those in Figure 5. At low levels of status, the
         + time dummiest + εi,t ,                         (1)                   community-wide interest correlates primarily with
                                                                                questioning activity, consistent with Hypothesis 1.
where i refers to the individual user and t to the peri-
                                                                                Over this range of status, a one-unit increase in the
od (either a month or two-week interval).
                                                                                number of questions predicts a roughly 0.25% in-
                                                                                crease in profile views (p < 0.001). At these modest
Results                                                                         levels of status, however, providing answers and
Figure 5 plots the coefficient estimates for the relation-                       comments adds little to visibility within the SO
ship between actions and evaluation scores. (Table 2                            community.
reports the results in table form). The plots group the                            As status increases (75%–90% and 90%–95% quan-
coefficients for a particular type of activity across the                        tile intervals), however, more difficult-to-evaluate
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
12                                                                   Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)

Figure 5. (Color online) Within-Quantile Coefficient Estimates for the Relationship Between Various User Actions on Stack
Overflow and Their Evaluation Scores

actions become the more powerful predictors of fur-             be elite may have changed over time. Although this
ther increases in community-wide attention. Asking              smaller subsample produces noisier estimates (see Ta-
questions becomes less important and does not even              ble A2 in the online appendix), the point estimates fol-
differ significantly from zero once users pass the 90th          low a similar pattern to that found in the full sample.
percentile threshold (p > 0.2). Answering questions
also adds little to further increases in visibility at the
highest levels of the status distribution (β2  0.04 at         Experiments
the 95th percentile, p  0.027). Only posting comments          The individual-level fixed effects allow us to rule out a
continues to correspond to increasing attention at the          wide range of alternative interpretations. For example,
highest levels. Moving from the 90th percentile to the          if members revealed their gender or nationality
95th percentile of visibility would require millions of         through their user names or on their profiles and those
questions generating average reactions, hundreds of             characteristics led to differences in status, the fixed ef-
average-reaction answers, or roughly five average-               fects would absorb those effects. But one important po-
reaction comments.                                              tential confound remains. Members might get better at
   We estimated a number of additional models to as-            these activities over time, meaning that the quality of
sess the extent to which our results might reflect some          their answers and comments might rise in tandem
sample selection or estimation choice. We first restrict-        with their score and their visibility. Solving this simul-
ed the analysis to those who eventually achieved high           taneity problem would either require exogenous varia-
status (the 95th percentile). In other words, this re-          tion in status or accurate measures of the objective
gression estimates what accounted for the status gains          components of question, answer, and comment quali-
of the elite users as they moved from having no status          ty. Because neither of these solutions seemed feasible
to being in the top status category. Figure 7 depicts           in the archival data, we developed a pair of online ex-
the coefficient estimates for the same models within             periments as a second test of our predictions.
this subset of users (Table 4 reports the results in table         We had a panel of Python experts create realistic
form). As one can see, the patterns appear the same             threads of questions, answers, and comments. We as-
even among this set of elite users.                             sembled these threads using the same formatting as
   We next restricted the analysis to those who joined          an SO thread, so that they would appear almost as
the platform during its first full year of operations            screen shots from the SO website (see Figures A3 and
(July 31, 2008 to July 31, 2009). This subset addresses         A4 in the online appendix). However, as opposed to
two potential issues. First, it accounts for the fact that      an actual thread, the experiment allowed us to assign
the platform and the nature of contributions to it              randomly the status of the users associated with the
might have evolved over time. Second, it addresses              question, answers, and comments in each thread. We
the possibility that the definition of what it means to          ran two online experiments.12 The first tested our
Smirnova, Reitzig, and Sorenson: Building Status in an Online Community
Organization Science, Articles in Advance, pp. 1–22, © 2022 The Author(s)                                                              13

Table 2. Within-Quantile OLS User Fixed-Effects Regressions for the Relationship Between Various User Actions on Stack
Overflow and Their Evaluation Scores

Variables                                                          (1)         (2)            (3)             (4)                (5)

                                                                0%–50%      50%–75%        75%–90%         90%–95%          Above 95%
log(Questioning activity + 1)                                     0.57***      0.36***        0.16***        0.05*           −0.06*
                                                                 (0.024)      (0.015)       (0.017)         (0.022)           (0.024)
log(Answering activity + 1)                                     −0.00          0.18***        0.21***        0.17***           0.24***
                                                                 (0.018)      (0.009)       (0.008)         (0.019)           (0.032)
log(Commenting activity + 1)                                    −0.00          0.08***        0.16***        0.20***           0.31***
                                                                 (0.020)      (0.014)       (0.012)         (0.020)           (0.049)
log(User tenure + 1)                                              0.07***      0.25***        0.43***        0.29***           0.59***
                                                                 (0.011)      (0.016)       (0.040)         (0.080)           (0.122)
log(Answering activity to own questions + 1)                    −0.03**      −0.01            0.02***        0.01*             0.02***
                                                                 (0.011)      (0.004)       (0.004)         (0.004)           (0.005)
log(Number of “favorite” votes given + 1)                         0.00         0.01**         0.01***        0.01***           0.01***
                                                                 (0.003)      (0.003)       (0.002)         (0.002)           (0.003)
log(Number of “favorite” votes given to self + 1)               −0.00        −0.00          −0.00            0.00**            0.00*
                                                                 (0.003)      (0.002)       (0.001)         (0.001)           (0.002)
log(Accepting own answers as best + 1)                            0.00         0.00*        −0.00           −0.01**          −0.01**
                                                                 (0.005)      (0.002)       (0.002)         (0.003)           (0.003)
Ratio of posed questions with snippets of code                    0.08***    −0.02*         −0.04***        −0.01              0.00
                                                                 (0.012)      (0.009)       (0.010)         (0.014)           (0.018)
Ratio of given answers with snippets of code                    −0.00          0.01**         0.01           0.00              0.02
                                                                 (0.006)      (0.004)       (0.005)         (0.011)           (0.032)
Ratio of posed popular questions                                  0.00       −0.01†         −0.01           −0.02*             0.01
                                                                 (0.009)      (0.007)       (0.009)         (0.011)           (0.014)
Ratio of given answers to popular questions                     −0.00        −0.01          −0.01**          0.00            −0.06*
                                                                 (0.006)      (0.004)       (0.005)         (0.009)           (0.030)
log(Number of bounty points received + 1)                         0.01***      0.00**         0.00***        0.00†             0.00**
                                                                 (0.000)      (0.001)        (0.000)        (0.000)           (0.000)
Time dummies (month)                                              YES          YES            YES             YES              YES
N                                                               689,765      373,569        214,686          71,306           71,033
Notes. OLS user fixed-effects panel regressions where robust standard errors clustered at the user level are reported in parentheses. All
independent variables are normalized. OLS, ordinary least squares.
  †p < 0.10; *p < 0.05; **p < 0.01; ***p < 0.001 (two-tailed tests).

arguments for questions and the second our predic-                          question asker to one of three levels: low status (a score
tions related to answers and comments.                                      of 6 points), medium status (158 points), or high status
                                                                            (1,714 points). We selected these values based on the
Experiment 1: Methods                                                       distribution of the SO scores in our field data, with low
Participants. We recruited 90 English-speaking partic-                      status being a below-median value, medium status fall-
ipants, who had prior knowledge of Python, through                          ing in the 50th-to-75th percentile interval, and high sta-
Amazon’s Mechanical Turk (MTurk) online plat-                               tus being in the top 5% of the SO score distribution.
form.13 MTurk provides a diverse participant pool for                          Each participant read six fictitious threads (presented
academic research, one demographically similar to the                       in random order), so each of the 18 experimental condi-
general population (Buhrmester et al. 2011, Chandler                        tions of our 6 (six question threads) × 3 (status of the
and Shapiro 2016). To ensure that our participants                          question asker: low, medium, or high) design appears 30
had the relevant expertise to evaluate the questions,                       times in our data. Two threads involved simple
answers, and comments, we screened potential partic-                        beginner-level Python questions; two touched on more
ipants for their prior knowledge of the Python pro-                         intermediate issues; and two concerned advanced topics.
gramming language. We embedded this screening                               Out of each pair, one question included a snippet of
question in a set of questions about their experience                       code and the other did not. Participants then evaluated
with several programming languages to reduce the                            each of the six questions by giving them a “downvote,”
likelihood that participants might not answer truthful-                     a “no vote,” or an “upvote,” mirroring the SO setting.
ly (see Figure A1 in the online appendix).14
                                                                            Experiment 2: Methods
Design. We used a between-subjects design with                              Participants. For the second experiment, we recruited
three different conditions per thread, in which we ex-                      a second set of English-speaking participants with pri-
perimentally manipulated the status level of the                            or knowledge of Python on the MTurk platform (270
You can also read