Outliers

The document describes methods for detecting outliers in data sets, including calculating the interquartile range and using the Tukey, Grubbs, and Dixon's Q tests. For the Tukey test, any values below Q1 - 1.5*IQR or above Q3 + 1.5*IQR would be considered suspected outliers. The Grubbs and Dixon's Q tests calculate a test statistic and compare it to a critical value to determine if a value should be rejected as an outlier. Examples are provided to demonstrate applying these tests.

Uploaded by cormac

DETECT suspects 1: Visual inspection

2.24 2.43 2.36 2.83 2.30

No   Titre (ml)
1    11.4
2    11.1
3    11.5
4    11.9
5    11.3
6    11.2
DETECT suspects 2: Calculation: Tukey k-Test

The interquartile range (IQR) is the distance between the first and third quartiles (the length of the box in the boxplot):
IQR = Q3 - Q1

An outlier is an individual value that falls outside the overall pattern.

How far outside the overall pattern does a value have to fall to be
considered a suspected outlier?

Suspected low outlier: any value < Q1 - 1.5 * IQR

Suspected high outlier: any value > Q3 + 1.5 * IQR

Rank  Value
25    7.9
24    5.6
23    5.3
22    4.9
21    4.7
20    4.5
      Q3 = 4.35
19    4.2
18    4.1
17    3.9
16    3.8
15    3.7
14    3.6
13    3.4
12    3.3
11    2.9
10    2.8
9     2.5
8     2.3
7     2.3
      Q1 = 2.2
6     2.1
5     1.5
4     1.9
3     1.6
2     1.2
1     0.6
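
The fence calculation for the 25 values above can be sketched in Python. This is a minimal illustration, not part of the original slides: the function names `tukey_fences` and `tukey_suspects` are hypothetical, and the quartiles are assumed to be the medians of the lower and upper halves of the sorted data (overall median excluded), which is the convention that reproduces the Q1 and Q3 shown in the table. Other quartile conventions give slightly different fences.

```python
from statistics import median

def tukey_fences(values, k=1.5):
    """Return (q1, q3, low_fence, high_fence) using median-of-halves quartiles."""
    data = sorted(values)
    n = len(data)
    half = n // 2
    # For odd n the overall median is left out of both halves.
    lower, upper = data[:half], data[half + n % 2:]
    q1, q3 = median(lower), median(upper)
    iqr = q3 - q1
    return q1, q3, q1 - k * iqr, q3 + k * iqr

def tukey_suspects(values, k=1.5):
    """Return the values lying outside the Tukey fences."""
    _, _, low, high = tukey_fences(values, k)
    return [v for v in values if v < low or v > high]

# The 25 values from the table above (rank 25 down to rank 1).
values = [7.9, 5.6, 5.3, 4.9, 4.7, 4.5, 4.2, 4.1, 3.9, 3.8, 3.7, 3.6, 3.4,
          3.3, 2.9, 2.8, 2.5, 2.3, 2.3, 2.1, 1.5, 1.9, 1.6, 1.2, 0.6]

q1, q3, low, high = tukey_fences(values)
print("Q1, Q3:", q1, q3)
print("fences:", low, high)
print("suspects:", tukey_suspects(values))
```

Under these assumptions the sketch gives Q1 = 2.2 and Q3 = 4.35, so IQR = 2.15 and the fences fall at about -1.025 and 7.575. Only 7.9 lies outside them and is flagged as a suspected high outlier.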
DETECT suspects: Calculation: Grubbs' Test
ISO test for point outliers
The suspect value is the value that is furthest from the mean
Assumes a Normal population
Use the entire dataset (suspect included) to calculate the statistics
G_critical depends on n
If G_exp > G_critical, then REJECT the suspect

G_exp = |suspect value - mean| / s

where s is the sample standard deviation.
Example

The following values were obtained for the nitrate concentration (mg/L) in a sample of river water:

0.403 0.410 0.401 0.380

Ideally, take more measurements when a suspect value occurs, especially if only a few were made.
More values may make it clearer whether the suspect should be rejected.
If the suspect is kept, the extra values also reduce its effect.
With 3 further measurements:

0.403 0.410 0.401 0.380 0.400 0.413 0.408
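
The Grubbs statistic for both versions of the nitrate data can be sketched as follows. This is an illustrative sketch only: the function name `grubbs_statistic` is hypothetical, and no critical value is hard-coded, because G_critical has to be looked up in a table for the chosen n and significance level.

```python
from statistics import mean, stdev

def grubbs_statistic(values):
    """Return (suspect, G) where the suspect is the value furthest from the mean."""
    m = mean(values)
    s = stdev(values)  # sample standard deviation (n - 1), suspect included
    suspect = max(values, key=lambda v: abs(v - m))
    return suspect, abs(suspect - m) / s

four = [0.403, 0.410, 0.401, 0.380]
seven = four + [0.400, 0.413, 0.408]

for data in (four, seven):
    suspect, g = grubbs_statistic(data)
    print(f"n = {len(data)}: suspect = {suspect}, G_exp = {g:.3f}")
```

In both cases the suspect is 0.380. G_exp is about 1.43 for the four original values and about 2.03 once the three further measurements are included; each result is then compared with the tabulated G_critical for n = 4 or n = 7.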

You try: a set of mass spectrometer measurements on a uranium isotope:

199.31 199.53 200.19 200.82 201.92 201.95 202.18 206.32

DETECT suspects 2: Calculation: Dixon's Q-Test

popular
for small samples (n = 3 to 10)
assumes a Normal population
if Q > critical value, then REJECT the suspect
Dixon's Q-Test
The following values were obtained for the nitrate concentration (mg/L) in a sample of river water:

0.403 0.410 0.401 0.380 0.400 0.413 0.408

Q = |suspect value - nearest value| / range
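
As a rough sketch, Q for the seven nitrate values can be computed like this. The function name `dixon_q` is hypothetical, and the suspect is assumed to be whichever extreme value (lowest or highest) has the larger gap to its nearest neighbour. Only the simple gap/range form from the slide is implemented; the critical value still comes from a table for the given n.

```python
def dixon_q(values):
    """Return (suspect, Q) with Q = |suspect - nearest value| / range."""
    data = sorted(values)
    if len(data) < 3:
        raise ValueError("Dixon's Q needs at least 3 values")
    spread = data[-1] - data[0]
    if spread == 0:
        raise ValueError("all values are identical")
    low_gap = data[1] - data[0]      # gap below the second-lowest value
    high_gap = data[-1] - data[-2]   # gap above the second-highest value
    if low_gap >= high_gap:
        return data[0], low_gap / spread
    return data[-1], high_gap / spread

nitrate = [0.403, 0.410, 0.401, 0.380, 0.400, 0.413, 0.408]
suspect, q = dixon_q(nitrate)
print(f"suspect = {suspect}, Q = {q:.3f}")
```

Here the suspect is 0.380, its nearest value is 0.400 and the range is 0.413 - 0.380 = 0.033, so Q = 0.020 / 0.033, or about 0.606.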
You try:
0.189 0.167 0.187 0.183 0.186 0.182

0.181 0.184 0.181 0.177

Q = |suspect value - nearest value| / range
DECIDE
Correct obvious errors for which data exists
Exclude obvious errors for which no data exists
Ignore? Run with/without to see if influential
  trimmed mean
Retain?
  outliers are expected for large sample sizes
  some methods are robust
Replace
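
Two of the options above, running the analysis with and without the suspect and using a trimmed mean, can be sketched on the nitrate data. The helper name `trimmed_mean` and the choice of dropping one value from each end are assumptions made for illustration; the slides do not specify how much to trim.

```python
from statistics import mean

def trimmed_mean(values, cut=1):
    """Mean after dropping the `cut` smallest and `cut` largest values."""
    data = sorted(values)
    if 2 * cut >= len(data):
        raise ValueError("cut removes every value")
    return mean(data[cut:len(data) - cut])

nitrate = [0.403, 0.410, 0.401, 0.380, 0.400, 0.413, 0.408]
without_suspect = [v for v in nitrate if v != 0.380]

print(f"mean with suspect:    {mean(nitrate):.4f}")
print(f"mean without suspect: {mean(without_suspect):.4f}")
print(f"trimmed mean (cut=1): {trimmed_mean(nitrate):.4f}")
```

The mean is about 0.4021 with the suspect and about 0.4058 without it, while the trimmed mean is about 0.4044. Comparing the three shows how much influence the single low value has on the result.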

DISCLOSE
