1. POISSON REGRESSION
We model counts of quite rare events.
Data
Y has a Poisson distribution. We expect the mean of Y to be of similar magnitude to the variance
of Y. Possible values of Y are 0, 1, 2, 3, ...
Regressors can be interval or categorical random variables.
Main steps
1) Check if Y has a Poisson distribution.
2) Check if the normed deviance is close to 1.
3) Check if the maximum likelihood statistic is statistically significant. If its p-value > 0,05, the model is
unacceptable.
4) Check if all regressors are significant (Wald test, p < 0,05). If not, drop them from the
model. We do not pay attention to the p-value of the model constant (intercept).
1. Data
We will model the number of household members other than the respondent by agea and cldcrsv. We will
investigate respondents for whom imptrad <= 2 and eduyrs <= 10.
Use Data -> Select Cases -> If condition is satisfied -> If and write imptrad <= 2 & eduyrs <= 10.
Then Continue -> OK.
The dependent variable (we call it numbhh) can be created with the help of Transform ->
Compute Variable.
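For readers working outside SPSS, the same preparation step can be done in Python; this is a minimal pandas sketch, in which the file name ess.csv and the source column hhmmb (total household size, respondent included) are assumptions, not taken from the instructions above.

    import pandas as pd

    # Assumed input: an ESS-style extract "ess.csv" with columns
    # imptrad, eduyrs and hhmmb (household size, respondent included).
    df = pd.read_csv("ess.csv")

    # Select Cases: imptrad <= 2 & eduyrs <= 10
    sub = df[(df["imptrad"] <= 2) & (df["eduyrs"] <= 10)].copy()

    # Compute Variable: numbhh = household members other than the respondent
    sub["numbhh"] = sub["hhmmb"] - 1

    print(sub["numbhh"].describe())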
2. Preliminary analysis
First we check whether numbhh is similar to a Poisson variable: Analyze -> Descriptive Statistics ->
Frequencies.
Statistics: numbhh
N   Valid     281
    Missing   0
Mean          1.0036
Variance      1.482
The mean and the variance are of similar magnitude, as the Poisson model requires.
It is also possible to check whether a random variable has a Poisson distribution with the help of the
Kolmogorov-Smirnov test:
Analyze -> Nonparametric Tests -> Legacy Dialogs -> 1-Sample K-S.
We see that we can assume that numbhh has a Poisson distribution (p = 0,169), but it is not
normal (p = 0,000).
One-Sample Kolmogorov-Smirnov Test: numbhh

Test distribution: Normal
N                                      281
Normal Parameters(a,b)  Mean           1.0036
                        Std. Deviation 1.21743
Most Extreme            Absolute       .302
Differences             Positive       .302
                        Negative       -.205
Kolmogorov-Smirnov Z                   5.060
Asymp. Sig. (2-tailed)                 .000

Test distribution: Poisson
N                                      281
Poisson Parameter(a,b)  Mean           1.0036
Most Extreme            Absolute       .066
Differences             Positive       .066
                        Negative       -.026
Kolmogorov-Smirnov Z                   1.111
Asymp. Sig. (2-tailed)                 .169
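Outside SPSS the Poisson check can be approximated, for instance, by a chi-square goodness-of-fit test against Poisson expected counts. The sketch below uses simulated data in place of the real numbhh column; it illustrates the idea only and is not SPSS's K-S procedure.

    import numpy as np
    from scipy import stats

    rng = np.random.default_rng(0)
    numbhh = rng.poisson(1.0, size=281)    # stand-in for the real numbhh data

    lam = numbhh.mean()
    print("mean:", lam, "variance:", numbhh.var(ddof=1))  # should be comparable

    # Observed vs Poisson(lam) expected counts for the values 0, 1, 2, ...
    k = np.arange(numbhh.max() + 1)
    observed = np.bincount(numbhh).astype(float)
    expected = stats.poisson.pmf(k, lam) * numbhh.size
    expected[-1] += numbhh.size - expected.sum()   # fold the upper tail in

    # One parameter (lam) was estimated from the data, hence ddof=1;
    # in practice cells with small expected counts should be pooled first.
    chi2, p = stats.chisquare(observed, expected, ddof=1)
    print("chi-square GOF p-value:", p)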
3. SPSS options
Choose Analyze -> Generalized Linear Models -> Generalized Linear Models and in Type of Model select
Poisson loglinear. Click on Predictors and move both regressors agea and cldcrsv into Covariates.
(We do not have categorical variables, which would be moved into Factors.)
In Statistics check in addition Include exponential parameter estimates. Then -> OK.
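As a cross-check outside SPSS, the same model can be fitted with Python's statsmodels. A minimal sketch, assuming the filtered data frame sub from the preparation step above:

    import numpy as np
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    fit = smf.glm("numbhh ~ agea + cldcrsv", data=sub,
                  family=sm.families.Poisson()).fit()

    print(fit.summary())                                    # B, Wald tests
    print("normed deviance:", fit.deviance / fit.df_resid)  # compare with 1
    print("Exp(B):", np.exp(fit.params))                    # as in SPSS output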
4. Results
In the Goodness of Fit table we can find the normed deviance (Value/df for Deviance). We see that it is
close to 1 (0,919). Thus, the Poisson regression model fits our data. It remains to decide which
regressors are statistically significant.
Goodness of Fit(b)
                                        Value      df    Value/df
Deviance                                230.635    251   .919
Scaled Deviance                         230.635    251
Pearson Chi-Square                      188.314    251   .750
Scaled Pearson Chi-Square               188.314    251
Log Likelihood(a)                       -301.040
Akaike's Information Criterion (AIC)    608.080
Finite Sample Corrected AIC (AICC)      608.176
Bayesian Information Criterion (BIC)    618.692
Consistent AIC (CAIC)                   621.692
In the Omnibus Test table we find the p-value of the maximum likelihood (likelihood ratio) statistic. Since
p < 0,05, we conclude that at least one regressor is statistically significant.
Omnibus Test(a)
Likelihood Ratio Chi-Square   df   Sig.
112.919                       2    .000
In the table Tests of Model Effects we see the Wald test p-values for all regressors. We do not
check the p-value for the intercept. Both p < 0,05. Therefore, we conclude that both regressors (agea and
cldcrsv) are statistically significant and should remain in the model.
Tests of Model Effects
              Type III
Source        Wald Chi-Square   df   Sig.
(Intercept)   41.188            1    .000
agea          105.703           1    .000
cldcrsv       14.395            1    .000
In the table Parameter Estimates the information about Wald p-values is repeated. Moreover, the
table contains estimates of the model's coefficients (column B).
Parameter Estimates
                                 95% Wald                                         95% Wald CI
                     Std.        Confidence Interval   Wald                       for Exp(B)
Parameter    B       Error       Lower      Upper      Chi-Square  df  Sig.   Exp(B)  Lower   Upper
(Intercept)  1.535   .2392       1.066      2.004      41.188      1   .000   4.642   2.905   7.419
agea         -.035   .0034       -.042      -.028      105.703     1   .000   .966    .959    .972
cldcrsv      .099    .0261       .048       .150       14.395      1   .000   1.104   1.049   1.162
(Scale)      1(a)
Since the coefficient of agea is negative, as the respondent's age increases, the number of other
household members decreases. The mathematical expression of the model is

μ = exp{1,535 − 0,035·agea + 0,099·cldcrsv}.

Here μ is the mean value of the number of other household members.
Forecasting means that we insert given values of agea and cldcrsv into the above formula.
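For example, a forecast can be computed directly from the estimated coefficients; the agea and cldcrsv values below are illustrative only.

    import numpy as np

    def mean_numbhh(agea, cldcrsv):
        # coefficients from the Parameter Estimates table
        return np.exp(1.535 - 0.035 * agea + 0.099 * cldcrsv)

    # e.g. a 40-year-old respondent with cldcrsv = 2 (made-up values)
    print(mean_numbhh(40, 2))   # forecasted mean number of other household members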
5. Categorical regressor
Categorical regressors are included via Generalized Linear Models -> Predictors -> Factors.
Do not forget to add ctzcntr in the Model window. Then the table Parameter Estimates takes the form:
                                  95% Wald Confidence Interval
Parameter     B       Std. Error  Lower     Upper
(Intercept)   1.239   .3115       .629      1.850
agea          -.036   .0034       -.043     -.029
[ctzcntr=1]   .352    .2319       -.103     .806
[ctzcntr=2]   0(a)    .           .         .
cldcrsv       .104    .0263       .053      .156
We get additional information about both ctzcntr categories. The model can then be written as

ln μ = 1,239 − 0,036·agea + 0,104·cldcrsv + { 0,352, if ctzcntr = 1,
                                              0,     if ctzcntr = 2.
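The bracketed term only shifts the intercept, so for fixed agea and cldcrsv the two citizenship groups differ by the constant factor exp(0,352) ≈ 1,42. A small sketch with illustrative regressor values:

    import numpy as np

    def mean_numbhh(agea, cldcrsv, ctzcntr):
        shift = 0.352 if ctzcntr == 1 else 0.0   # [ctzcntr=2] is the reference
        return np.exp(1.239 - 0.036 * agea + 0.104 * cldcrsv + shift)

    # the ratio does not depend on agea or cldcrsv (illustrative values)
    print(mean_numbhh(40, 2, 1) / mean_numbhh(40, 2, 2))   # exp(0.352), about 1.42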
Estimates of the coefficients β₀, β₁, β₂, β₃ are calculated from the data.

2. NEGATIVE BINOMIAL (NB) REGRESSION
NB regression is an alternative to the Poisson regression. The main difference is that the variance of Y is larger than the mean of Y.
Data
Y has a negative binomial distribution. We expect the mean of Y to be smaller than the variance of Y.
Possible values of Y are 0, 1, 2, 3, ...
Regressors can be interval or categorical random variables.
Main steps
1) Check if the variance of Y is greater than the mean of Y. Otherwise, the NB regression is
not applicable.
2) Check if the normed deviance is close to 1.
3) Check if the maximum likelihood statistic is statistically significant. If its p-value > 0,05, the model is
unacceptable.
4) Check if all regressors are significant (Wald test, p < 0,05). If not, drop them from the
model. We do not pay attention to the p-value of the model constant (intercept).
1. Data
eduyrs - years of full-time education completed,
brwmny - borrow money for living (1 very difficult, ..., 5 very easy),
emplno - number of employees the respondent has/had.
The variable emplno has only one observation greater than 26. Therefore, with Recode we create a new
dichotomous variable emplnof2 (0 if no employees, 1 if at least one employee).
2. SPSS options for the negative binomial regression
Analyze -> Generalized Linear Models -> Generalized Linear Models.
Click on Type of Model. Do not choose Negative binomial with log link (it fixes the NB parameter at 1).
Instead check Custom: Distribution -> Negative binomial, Link function -> Log, Parameter ->
Estimate value.
In Predictors put both variables eduyrs and brwmny into Covariates. Put the categorical variable
emplnof2 into Factors.
3. Results
At the beginning of the output we see descriptive statistics. Observe that the standard deviation of emplno
(and hence its variance) is greater than its mean.
Categorical Variable Information
                          N     Percent
Factor emplnof2   .00     33    50.0%
                  1.00    33    50.0%
                  Total   66    100.0%

Continuous Variable Information
                                                     N    Maximum  Mean    Std. Deviation
Dependent   emplno Number of employees
Variable    respondent has/had                       66   763      14.73   93.831
Covariate   eduyrs Years of full-time
            education completed                      66   23       11.71   3.732
Covariate   brwmny Borrow money to make
            ends meet, difficult or easy             66            3.68    1.069
In the Goodness of Fit table we see that the normed deviance is 0,901, that is, the overall fit of the
model to the data is quite good.
Goodness of Fit(b)
                  Value     df   Value/df
Deviance          54.989    61   .901
Scaled Deviance   54.989    61
The Omnibus Test table contains the maximum likelihood statistic and its p-value. Since p < 0,05,
we conclude that at least one regressor is statistically significant.
Tests of Model Effects contains Wald tests for each regressor. All regressors are statistically
significant (we do not check the p-value for the intercept).
Omnibus Test(a)
Likelihood Ratio Chi-Square   df   Sig.
23.777                        3    .000

Tests of Model Effects
              Type III
Source        Wald Chi-Square   df   Sig.
(Intercept)   .151              1    .698
emplnof2      6.298             1    .012
eduyrs        4.959             1    .026
brwmny        7.399             1    .007
Parameter Estimates
                                       95% Wald                                          95% Wald CI
                          Std.         Confidence Interval   Wald                        for Exp(B)
Parameter          B      Error        Lower      Upper      Chi-Square  df  Sig.   Exp(B)  Lower   Upper
(Intercept)        1.590  2.1316       -2.588     5.768      .556        1   .456   4.904   .075    319.831
[emplnof2=.00]     -1.629 .6493        -2.902     -.357      6.298       1   .012   .196    .055    .700
[emplnof2=1.00]    0(a)   .            .          .          .           .   .      1       .       .
eduyrs             .286   .1286        .034       .539       4.959       1   .026   1.332   1.035   1.714
brwmny             -.753  .2768        -1.295     -.210      7.399       1   .007   .471    .274    .810
(Scale)            1(b)
(Negative
binomial)          5.327  1.2084       3.415      8.310
a. Set to zero because this parameter is redundant.
b. Fixed at the displayed value.
Estimated model:

ln μ = 1,590 + 0,286·eduyrs − 0,753·brwmny + { 0,      if emplnof2 = 1,
                                               −1,629, if emplnof2 = 0.
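For a cross-check outside SPSS, here is a minimal statsmodels sketch; the data frame df and the response column y are assumptions (the text does not give the dependent count variable a name), and statsmodels' dispersion parameter alpha corresponds to SPSS's negative binomial parameter.

    import statsmodels.formula.api as smf

    # Assumed: data frame df with the count response in column "y" and the
    # regressors emplnof2, eduyrs, brwmny.
    print(df["y"].mean(), df["y"].var())   # NB needs variance > mean

    nb = smf.negativebinomial("y ~ C(emplnof2) + eduyrs + brwmny", data=df).fit()
    print(nb.summary())   # coefficients, Wald tests and the dispersion alpha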
3. PROBIT REGRESSION
Model
We model a two-valued (dichotomous) variable. Probit regression can be used whenever logistic regression
applies, and vice versa. The model has the expression

Φ⁻¹(P(Y = 0)) = β₀ + β₁X₁ + β₂X₂ + β₃X₃.

Here Φ⁻¹ is the inverse of the standard normal distribution function Φ, also known as the probit function.
If β₁ > 0, then as X₁ grows, P(Y = 0) also grows.
If β₁ < 0, then as X₁ grows, P(Y = 1) grows.
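A two-line illustration of the probit function: scipy names Φ norm.cdf and its inverse Φ⁻¹ norm.ppf.

    from scipy.stats import norm

    p = 0.31                 # some probability P(Y = 0)
    z = norm.ppf(p)          # probit(p) = inverse of the standard normal CDF
    print(z, norm.cdf(z))    # applying the CDF recovers p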
Data
a) Variable Y is dichotomous. The data for Y contain at least 20% of zeros and at least 20%
of ones.
b) If the model contains many categorical variables, for each combination of categories the data
should contain at least 5 observations.
c) There is no strong correlation between regressors.
Model fit
Variables used in the example:
K2 - university,
K36_1 - I use professional skills obtained during studies (1 never, ..., 5 very frequently).
2. SPSS options
Analyze -> Generalized Linear Models -> Generalized Linear Models. In Type of Model choose Binary probit.
Open Predictors and move K37_1 and K33_2 into Covariates (with some reservation we treat
these variables as interval ones). The regressor K35_1 takes only 4 values and is therefore treated as a
categorical variable. Move K35_1 into Factors.
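A hedged statsmodels sketch of the same probit model (the data frame df is an assumption). Note two differences from the SPSS output below: statsmodels models P(Y = 1) rather than P(Y = 0), so the coefficient signs come out reversed, and C() takes the first category as the reference, whereas SPSS GENLIN takes the last.

    import statsmodels.formula.api as smf

    # Assumed: data frame df with columns Y (0/1), K35_1, K37_1, K33_2
    probit = smf.probit("Y ~ C(K35_1) + K37_1 + K33_2", data=df).fit()
    print(probit.summary())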
3. Results
The model is constructed for P(Y = 0). In Categorical Variable Information we check that there is
a sufficient number of respondents for each value of the categorical variables (Y included).
Categorical Variable Information
                                                  N     Percent
Dependent Variable  Y        .00                  100   31.1%
                             1.00                 222   68.9%
                             Total                322   100.0%
Factor  K35_1 (correspondence
        of bachelor studies)  1 Definitely yes    153   47.5%
                              2 Rather yes        95    29.5%
                              3 Rather no         36    11.2%
                              4 Definitely no     38    11.8%
                              Total               322   100.0%
In the Omnibus Test table we check that the p-value of the maximum likelihood test is sufficiently
small: p = 0,000 < 0,05.
Omnibus Test(a)
Likelihood Ratio Chi-Square   df   Sig.
163.847                       5    .000
a. Dependent Variable: Y; Model: (Intercept), K35_1, K37_1, K33_2
The Parameter Estimates table contains parameter estimates and Wald tests for the significance
of each regressor. We do not check the significance of the intercept. The categorical variable K35_1 was
replaced by 4 dummy variables, one of which is not statistically significant. However, because of one such
insignificant result it is not rational to drop K35_1 from the model.
Parameter Estimates
                                  95% Wald Confidence Interval   Hypothesis Test
Parameter     B       Std. Error  Lower      Upper      Wald Chi-Square  df  Sig.
(Intercept)   4.853   .7092       3.463      6.243      46.832           1   .000
[K35_1=1]     -1.577  .3272       -2.218     -.936      23.229           1   .000
[K35_1=2]     -1.018  .3226       -1.650     -.385      9.953            1   .002
[K35_1=3]     -.261   .3722       -.991      .468       .493             1   .482
[K35_1=4]     0(a)    .           .          .          .                .   .
K37_1         -.273   .1141       -.496      -.049      5.720            1   .017
K33_2         -.780   .1151       -1.005     -.554      45.859           1   .000
(Scale)       1(b)
Dependent Variable: Y
Model: (Intercept), K35_1, K37_1, K33_2
a. Set to zero because this parameter is redundant.
b. Fixed at the displayed value.
We obtained four models, which differ by the constant only. They can be written in the following
way:

P(Y = 0) = P(rarely applies knowledge in his work) = Φ(z),

z = 4,85 − 0,273·K37_1 − 0,78·K33_2 + { −1,57, if K35_1 = 1,
                                        −1,02, if K35_1 = 2,
                                        −0,26, if K35_1 = 3,
                                        0,     if K35_1 = 4.
The signs of the coefficients agree with the general logic of the model. The coefficient of K37_1 is
negative: the larger the value of K37_1 (the happier the respondent is with his work), the less probable it is
that knowledge is rarely used. The other signs of the coefficients can be explained similarly.
We treated probit regression as a particular case of the generalized linear model. Therefore, one can
check the magnitude of the normed deviance in the table Goodness of Fit. We see that it is close to
unity (1,156), which demonstrates a good fit of the model. Note that for probit regression the small
p-value of the maximum likelihood test (it can be found in the Omnibus Test table) is more important. If all
model characteristics except the deviance show good model fit, we assume that the model is
acceptable.
Goodness of Fit(b)
                                        Value     df   Value/df
Deviance                                49.722    43   1.156
Scaled Deviance                         49.722    43
Pearson Chi-Square                      48.218    43   1.121
Scaled Pearson Chi-Square               48.218    43
Log Likelihood(a)                       -47.932
Akaike's Information Criterion (AIC)    107.865
Finite Sample Corrected AIC (AICC)      108.131
Bayesian Information Criterion (BIC)    130.512
Consistent AIC (CAIC)                   136.512
Descriptive Statistics
                     N     Minimum   Maximum   Mean     Std. Deviation
Cook's Distance      322   .000      .039      .00324   .006749
Valid N (listwise)   322
The maximal Cook's distance value is 0,039 < 1. Therefore, there are no outliers in our data.
To obtain the classification table we choose Analyze -> Descriptive Statistics -> Crosstabs.
Move Y into Row(s) and PredictedValue into Column(s). Choose Cells and check Row. Then
Continue and OK.
Y * PredictedValue Predicted Category Value Crosstabulation
                          Predicted Category Value
                          .00       1.00      Total
Y  .00    Count           66        34        100
          % within Y      66.0%     34.0%     100.0%
   1.00   Count           17        205       222
          % within Y      7.7%      92.3%     100.0%
Total     Count           83        239       322
          % within Y      25.8%     74.2%     100.0%
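The same classification table can be produced outside SPSS with pandas (assuming a data frame df with columns Y and PredictedValue):

    import pandas as pd

    print(pd.crosstab(df["Y"], df["PredictedValue"], margins=True))
    print(pd.crosstab(df["Y"], df["PredictedValue"],
                      normalize="index"))   # row shares, as in % within Y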
Of the 100 respondents who rarely use professional skills obtained during studies, 66 are
correctly classified (66 %). Of the 222 respondents who frequently use professional skills obtained
during studies, 205 are correctly classified (92,3 %). Recalling the table Categorical Variable
Information and its percentages (31,1 % and 68,9 %, respectively), we see that the probit model ensures
much better forecasting than a random guess. Final conclusion: the probit regression model fits the data
sufficiently well.
4. Forecasting
One value can be forecasted in the following way. Let us assume that the previous model is applied to a
respondent for whom K33_2 = 4, K35_1 = 1, K37_1 = 4. We add an additional row to the data, writing 4
in the column K33_2, 1 in the column K35_1, 4 in the column K37_1, and 1 in the column
filter_$. The remaining columns are left empty.
We repeat the probit analysis but check Predicted value of mean of response in the Save window.
A new column MeanPredicted appears in the data, containing the probabilities P(Y = 0). We got
the probability 0,175 for our respondent. Therefore, it is unlikely that this respondent rarely applies skills
from studies in his professional work.
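The forecast is easy to verify by hand: insert the respondent's values into the estimated probit model and apply the standard normal distribution function.

    from scipy.stats import norm

    # K35_1 = 1, K37_1 = 4, K33_2 = 4; coefficients from Parameter Estimates
    z = 4.853 - 1.577 - 0.273 * 4 - 0.780 * 4
    print(norm.cdf(z))   # P(Y = 0), approximately 0.175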