Stats: Data and Analytics

(c) 2013 Peter C. Bruce

Advisory Board

Jeff Witmer

William Peterson

Chris Malone

This text is based on earlier material developed for statistics.com by Dr. Robert Hayden

Table of Contents

Advisory Board




If You Can't Measure It, You Can't Manage It

Phantom Protection From Vitamin E

Statistician, Heal Thyself

Identifying Terrorists in Airports

Looking Ahead in the Course

1 Designing and Carrying Out a Statistical Study

1.1 A Small Example

1.2 Is Chance Responsible? The Foundation of Hypothesis Testing

Interpreting This Result

Increasing the Sample Size

1.3 A Major Example

1.4 Designing an Experiment




Before-After Pairing

1.5 What to Measure—Central Location




Expected Value


Proportions for Binary Data

1.6 What to Measure—Variability



Interquartile Range

Deviations and Residuals

Mean Absolute Deviation

Variance and Standard Deviation

Variance and Standard Deviation for a Sample

1.7 What to Measure—Distance (Nearness)

1.8 Test Statistic

Test Statistic for This Study:

1.9 The Data

Database Format

1.10 Variables and Their Flavors

Table Formats

1.11 Examining and Displaying the Data

Errors and Outliers Are Not the Same Thing!

Frequency Tables


Stem and Leaf Plots

Box Plots

Tails and Skew

1.12 Are We Sure We Made a Difference?

Appendix: Historical Note

2 Statistical Inference

The Null Hypothesis

2.1 Repeating the Experiment

Shuffling and Picking Numbers From a Hat or Box

2.2 How Many Reshuffles?



The Normal Distribution

The Exact Test

2.3 How Odd is Odd?

2.4 Statistical and Practical Significance

2.5 When to use Hypothesis Tests

3 Categorical Data

3.1 Other Kinds of Studies

3.2 A Single Categorical Variable

3.3 Exploring Data Graphically

Choice of Baseline and Time Period


Per Capita Adjustment

3.4 Mendel's Peas

3.5 Simple Probability

Venn Diagrams

3.6 Random Variables and Their Probability Distributions

Weighted Mean

Expected Value

3.7 The Normal Distribution

Standardization (Normalization)

Standard Normal Distribution


The 95 Percent Rule

4 Relationship Between Two Categorical Variables

4.1 Two-Way Tables

Could Chance be Responsible?

A More Complex Example

4.2 More Probability

Conditional Probability

From Numbers to Percentages to Conditional Probabilities

4.3 From Conditional Probabilities to Bayesian Estimates

Let's Review the Different Probabilities

Bayesian Calculations

4.4 Independence

Multiplication Rules

Simpson's Paradox

4.5 Exploratory Data Analysis (EDA)

5 Surveys and Sampling

5.1 Simple Random Samples

5.2 Margin of Error: Sampling Distribution for a Proportion

The Uncertainty Interval

Summing Up

5.3 Sampling Distribution for a Mean

Simulating the Behavior of Samples from a Hypothetical Population

5.4 A Shortcut—the Bootstrap

Let's Recap

A Bit of History—1906 at Guinness Brewery

6 Confidence intervals

6.1 Point Estimates

6.2 Confidence Intervals as Resample Results

Confidence Interval vs. Margin of Error

Resampling Procedure (Bootstrap):

6.3 Formula-Based Counterparts to the Bootstrap

Normal Distribution

Central Limit Theorem

FORMULA: Confidence Intervals for a Mean—Z-Interval


For a Mean: T-Interval

Example—Manual Calculations


6.4 Standard Error

Standard Error via Formula

6.5 Beyond Simple Random Sampling

Stratified Sampling

Cluster Sampling

Systematic Sampling

Multistage Sampling

Convenience Sampling

Self Selection

Nonresponse Bias

6.6 Absolute vs. Relative Sample Size

6.7 Appendix A—Alternative Populations

6.8 Appendix B—The Parametric Bootstrap (OPTIONAL)

Resampling Procedure—Parametric Bootstrap:

Formulas and the Parametric Bootstrap

7 Concepts in Inference

Confidence Intervals and Hypothesis Tests

7.1 Confidence Intervals for a Single Proportion

Resampling Steps

Binomial Distribution

Multiplication Rule (An Aside)

Normal Approximation

These Are Alternate Approaches

7.2 Confidence Interval for a Single Mean

7.3 Confidence Interval for a Difference in Means

Resampling Procedure—Bootstrap Percentile Interval

FORMULA–Confidence Interval for a Difference in Means

7.4 Confidence Interval for a Difference in Proportions

Resampling Procedure

Appendix A: Formula Procedure

The Binomial Formula (For Those Interested)

Binomial Formula Example

Normal Approximation to the Binomial

7.5 Appendix B: Resampling Procedure - Parametric Bootstrap (OPTIONAL)

7.6 Review

8 Hypothesis Tests—Introduction


Significance or Alpha Level

Critical Value

8.1 Confidence Intervals vs. Hypothesis Tests

Confidence Interval

Relationship Between the Hypothesis Test and the Confidence Interval

Comment

8.2 Review

9 Hypothesis Testing—Two Sample Comparison

Review—Basic Two-Sample Hypothesis Test Concept

Review—Basic Two-Sample Hypothesis Test Details

Formula-Based Approaches


9.1 Comparing Two Means

Resampling Procedure

9.2 Comparing Two Proportions

Resampling Procedure

9.3 Formula-Based Alternative—T-Test for Means

9.4 The Null and Alternative Hypotheses

Formulating the Null Hypothesis

Corresponding Alternative Hypotheses

One-Way or Two-Way Hypothesis Tests

The Rule

The Why


9.5 Paired Comparisons

Paired Comparisons: Resampling

Paired Comparisons: T-Test

9.6 Appendix

Formula-Based Variations of Two-Sample Tests

Z-Test With Known Population Variance

Pooled vs. Separate Variances

Formula-Based Alternative: Z-Test for Proportions

9.7 Review

10 Additional Inference Procedures

10.1 A Single Sample Against a Benchmark

Resampling Procedure

Formula Procedure

10.2 A Single Mean

Resampling Procedure for the Confidence Interval

Formula Approach for the Confidence Interval

10.3 More than Two Samples

Count Data

The Key Question


Chi-Square Test

10.4 Continuous Data

Resampling Procedure

10.5 Appendix

Normal Approximation; Hypothesis Test of a Single Proportion

Confidence Interval for a Mean

11 Correlation

11.1 Example: Delta Wire

11.2 Example: Cotton Dust and Lung Disease

11.3 The Vector Product and Sum Test

Example: Baseball Payroll

11.4 Correlation Coefficient

Inference for the Correlation Coefficient—Resampling

Inference for the Correlation Coefficient: Formulas

11.5 Other Forms of Association

11.6 Correlation is not Causation

A Lurking External Cause


12 Regression

12.1 Finding the regression line by eye

Making predictions based on the regression line

12.2 Finding the regression line by minimizing residuals

12.3 Linear Relationships

Example: Workplace Exposure and PEFR

Residual Plots

12.4 Inference for Regression

Resampling Procedure for a Confidence Interval (the pulmonary data)

Using Resampling Stats with Excel (the pulmonary data, cont.)

Formula-based inference

Interpreting Software Output

13 Analysis of Variance—ANOVA

13.1 Comparing more than two groups: ANOVA

13.2 The Problem of Multiple Inference

13.3 A Single Test

13.4 Components of Variance

Decomposition: The Factor Diagram

Constructing the ANOVA Table

Resampling Procedure

Inference Using the ANOVA Table

The F-Distribution

Different Sized Groups

Caveats and Assumptions

13.5 Two-Way ANOVA

Resampling Approach

Formula Approach

13.6 Factorial Design



13.7 Interaction


Checking for Interaction

14 Multiple Regression

14.1 Regression as Explanation

14.2 Simple Linear Regression -- Explore the Data First

Antimony is negatively correlated with strength

Is there a linear relationship?

14.3 More Independent Variables

Multiple Linear Regression

14.4 Model Assessment and Inference


Inference for Regression—Holdout Sample

Confidence Intervals for Regression Coefficients

Bootstrapping a Regression

Inference for Regression—Hypothesis Tests

14.5 Assumptions

Violation of Assumptions—Is the Model Useless?

14.6 Interaction, Again

Original Regression With No Interaction Term

The Regression With an Interaction Term

Does Crime Pay?

14.7 Regression for Prediction


Binary And Categorical Variables in Regression


Tayko—Building the Model

Reviewing the output

Predicting New Data



Preface

This text was developed by Statistics.com to meet the needs of its introductory students, based on experience in teaching introductory statistics online since 2003. The field of statistics education has been in ferment for several decades. With this text, which continues to evolve, we attempt to capture two important strands of recent thinking:

1. Guidelines for the introductory statistics course, developed in 2005 by a group of noted statistics educators, with funding from the American Statistical Association. These Guidelines for Assessment and Instruction in Statistics Education (GAISE) call for the use of real data with active learning, stress statistical literacy and understanding over memorization of formulas, and encourage the use of software to develop concepts and analyze data.

2. The use of resampling/simulation methods to develop the underpinnings of statistical inference (the most difficult topic in an introductory course) in a transparent and understandable fashion.

We start off with some examples of statistics in action (including two of statistics gone wrong), then dive right in to look at the proper design of studies and how to account for the possible role of chance. All the standard topics of introductory statistics are here (probability, descriptive statistics, inference, sampling, correlation, etc.), but sometimes they are introduced not as standalone topics but in the context of the situation in which they are needed.
