Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Empty file.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file added 06_StatisticalInference/02_03_tCIs/index.pdf
Binary file not shown.
Binary file not shown.
Binary file added 06_StatisticalInference/02_05_Bayes/index.pdf
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file added 06_StatisticalInference/03_03_pValues/index.pdf
Binary file not shown.
Binary file added 06_StatisticalInference/03_04_Power/index.pdf
Binary file not shown.
Binary file not shown.
Binary file not shown.
123 changes: 123 additions & 0 deletions 06_StatisticalInference/announcements.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,123 @@
## First Announcement

Welcome to Week 1 of the Statistical Inference, part of the Coursera
Data Science specialization from Johns Hopkins Biostatistics! This
course presents the fundamentals of statistical inference that you
will need throughout the rest of the Data Science track.

We believe that the key word in Data Science is "science". Our course
track is focused on providing you with three things: (1) an
introduction to the key ideas behind working with data in a scientific
way that will produce new and reproducible insight, (2) an
introduction to the tools that will allow you to execute on a data
analytic strategy, from raw data in a database to a completed report
with interactive graphics, and (3) on giving you plenty of hands on
practice so you can learn the techniques for yourself.

This course represents the most fundamental and foundational
component of the series. Using only a bare minimum of mathematics,
we attempt to give students the fundamentals of using statistics
to draw inferences about populations.

We are excited about the opportunity to attempt to scale Data Science
education. We intend for the courses to be self contained, fast paced,
and interactive. We intend to run them frequently to give people with
busy schedules the opportunity to work on material at their own pace.

Please see the course syllabus for information about the quizes, the
project, due dates, and grading. Don't forget to say hi on the message
boards. The community developed around these courses is one of the
best places to learn and the best things about taking a MOOC!

Brian Caffo and the Data Science Track Team

---

## Week 1 Announcement

Hi all, welcome to Week 1 of the Statistical Inference class.

Please get the materials off of github. Also, make sure that you're
keeping up with the videos and plan on taking the week 1 quiz.

Get those forums going; we're looking forward to seeing some
really active posting!

Good luck and have a great week!

Brian Caffo and the Data Science Track Team

---

## Week 2 Announcement

Welcome to Week 2 of Statistical Inference!

Make sure that you're keeping up with the videos and planning
on taking the second quiz.

Keep those forums rocking.

Good luck and have a great week!

Brian Caffo and the Data Science Track Team


---


## Week 3 Announcement

Welcome to Week 3 of Statistical Inference!

Make sure that you're keeping up with the videos and planning
on taking the third quiz.

Keep up with the forums and if you get a chance, send
us pull requests with changes for the notes.

Good luck and have a great week!

Brian Caffo and the Data Science Track Team



---


## Week 4 Announcement

Welcome to Week 4 of Obtaining Data!

Make sure that you're keeping up with the videos and planning
on taking the fourth quiz.

Keep up with the forums and if you get a chance, send
us pull requests with changes for the notes.

Good luck and have a great week!

Brian Caffo and the Data Science Track Team

---

## Course wrap-up

Congratulations on finishing the Statistical Inference!

We have set the grading and released the Statements of Accomplishment
for the Course. It might take a few hours/days for the statements to
be disbursed to accounts.

A couple of other notes:

* The course will begin again immediately starting in a couple of
days. If you are still interested in keeping in touch with your
fellow learners, please enroll in the new course and keep the conversation going. You may also be an invaluable resource for
new course takers!
* Keep your eye on Hopkins offerings from Coursera. All announcements about future offerings will be posted at: https://twitter.com/jhubiostat and http://simplystatistics.org/, http://twitter.com/simplystats.
* If you liked this course, please consider taking some of the other course offerings through the Data Science Track. If you have completed all the course work in this track you now have the tools you will need to take on the challenges in the rest of our courses or in other Statistics, Data Science, or Machine Learning courses you may encounter.

Thanks again for all of your efforts during the course of the class and best of luck in your career!

Brian Caffo and the Data Science Track Team
17 changes: 17 additions & 0 deletions 06_StatisticalInference/grading.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
## Grading and logistics

The grading in this class is very straightforward.

1. There are four quizzes, each containing in the neighborhood of 10 questions.
2. Each question is equally weighted as 1 point.
3. Some require two answers, each giving half of a point (for a maximum total of 1 point for those questions).
4. Your total points is the sum of the points questions across all quizzes that you answered correctly (using all of your quiz attempts).
5. 70% or more of the total points is a pass for the class.
6. 80% or more of the total points is a pass with distinction.







194 changes: 194 additions & 0 deletions 06_StatisticalInference/syllabus.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,194 @@
## Course Title

### Statistical Inference

---

## Course Instructor(s)

The primary instructor of this class is
[Brian Caffo](http://www.bcaffo.com)

Brian is a professor at Johns Hopkins Biostatistics and
co-directs the [SMART working group](http://www.smart-stats.og)

This class is co-taught by Roger Peng and Jeff Leek. In addition,
Sean Kross and Nick Carchedi have been helping greatly.

---

## Course Description

In this class students will learn the fundamentals of statistical
inference. Students will receive a broad overview of the goals,
assumptions and modes of performing statistical inference. Students
will be able to perform inferential tasks in highly targeted settings
and will be able to use the skills developed as a roadmap for more
complex inferential challenges.

---

## Course Content

This class is taught in three modules
1. Probability and probability distributions
2. Basics of inference
3. More advanced inference techniques

Each module has sub modules, labeled such as 01_03. Videos within submodules are
broken up so that 01_03_a is the first video in sub-module 3 in module 1
while 01_03_b is the second video.

For convenience we post the broken up videos, and then also the full videos
for each sub-module on the site.

The full list of topics are as follows

Module 1, probability and probability distributions
* 01_01 Introduction
* 01_02 Probability
* 01_03 Expectations
* 01_04 Independence
* 01_05 Conditional probability

Module 2, basics of inference
* 02_01 Common Distributions
* 02_02 Asymptopia
* 02_03 t confidence intervals
* 02_04 Likelihood
* 02_05 Beginning Bayes Inference

Module 3, more advanced inference
* 03_01 Independent group intervals
* 03_02 Hypothesis testing
* 03_03 P-values
* 03_04 Power
* 03_05 Multiple Testing
* 03_06 resampled inference


---
Github repository

The most up to date information on the course lecture notes will always be in the Github repository

[https://github.com/DataScienceSpecialization/courses](https://github.com/DataScienceSpecialization/courses)

Please issue pull requests so that we may improve the materials.

---

## Lecture Materials

Lecture videos will be released weekly and will be available for the
week and thereafter. You are welcome to view them at your
convenience. Accompanying each video lecture will be a PDF copy of the
slides and a link to an HTML5 version of the slides.

The lecture videos are released in a weekly fashion. They do not
correspond to the modules (as there's three modules and four weeks).

---

## Weekly quizzes

The weekly quizzes will cover the material from that week.

### Quiz 1

Assigned: Class open (1st of Month)
Due: 7th of the Month 12:00 AM UTC


### Quiz 2

Assigned: 8th of the Month 12:01 AM UTC
Due: 14th of the Month 12:00 AM UTC


### Quiz 3

Assigned: 15th of the Month 12:01 AM UTC
Due: 21st of the Month 12:00 AM UTC


### Quiz 4

Assigned: 22nd of the Month 12:01 AM UTC
Due: 28th of the Month 12:00 AM UTC

---

## Quiz Scoring

You may attempt each quiz up to 2 times. Only the score from your final attempt will count toward your grade.

---

## Hard deadlines and soft deadlines

The reported due date is the soft deadline for each quiz. You may turn
in quizzes up to two days after the soft deadline. The hard deadline
is the Tuesday after the Quiz is due at 23:30 UTC-5:00. Each day late
will incur a 10% penalty, but if you use a late day, the penalty will
not be applied to that day.

---

## Late Days for Quizzes

You are permitted 5 late days for quizzes in the course. If you use a late day, your quiz grade will not be affected.

---

## Dates for the project

This class has no project unlike the other classes in the Data Science Series. (The content doesn't lend itself well to a project.)
So be warned that there are more quiz questions here than in the other classes in the Data Science series.

---

## Typos

* We are prone to a typo or two - please report them and we will try
* to update the notes accordingly. In some cases, the videos may
* still contain typos that have been fixed in the lecture notes. The
* lecture notes represent the most up-to-date version of the course
* material.


---

## Differences of opinion

Keep in mind that currently data analysis is as much art as it is
science - so we may have a difference of opinion - and that is ok!
Please refrain from angry, sarcastic, or abusive comments on the
message boards. Our goal is to create a supportive community that
helps the learning of all students, from the most advanced to those
who are just seeing this material for the first time.

---

## Technical Information

Regardless of your platform (Windows or Mac) you will need a
high-speed Internet connection in order to watch the videos on the
Coursera web site. It is possible to download the video files and
watch them on your computer rather than stream them from Coursera and
this may be preferable for some of you.

### Here is some platform-specific information:

_Windows_

The Coursera web site seems to work best with either the Chrome or the
Firefox web browsers. In particular, you may run into trouble if you
use Internet Explorer. The Chrome and Firefox browsers can be
downloaded from: _Chrome:
[http://www.google.com/chrome](http://www.google.com/chrome) _
Firefox: [http://www.mozilla.org](http://www.mozilla.org)

_Mac_

The Coursera site appears to work well with Safari, Chrome, or Firefox, so any of these browsers should be fine.