Last Updated: 2019-04-24
Show your work. The response to each question should include all of the commands necessary to reproduce your results, and should be commented to give a naive reader an idea of the intent behind each section of code. Answers to questions should be clearly shown in the code output or spelled out in text or comments.
Before submitting your assignment, ensure that your code for each question works correctly when the lines are run in sequence, starting with an empty environment. You may submit either this R Markdown document with your code entered into the appropriate code blocks, or a regular R script with the question numbers as comments and the answers following. If you choose to Knit the R Markdown document to HTML or PDF, include those files with your submission (but it is not required that it Knit successfully).
Choose one of the following types of data:
Find an appropriate data set of that type and load it into R.
If you are having trouble finding data, look through the example data sets given in the Course Project description or data sets attached to published papers in your field.
Important: Include the data file in your submission and include a link to its source here.
Perform any data cleaning and wrangling necessary to get your data into the format(s) required for the analyses below.
Conduct exploratory analyses of the data, consisting of at least one descriptive statistic and at least one rough exploratory plot. Briefly describe your findings.
Conduct at least one hypothesis test of your data. You do not need to describe the results, but it should be clear from comments and context what test is being performed and why.
Create one linear model (regression, ANOVA, etc.), correlation matrix, distance matrix, or adjacency matrix of your data (whichever is most appropriate). Show the result.
Conduct one of the following sequences of analyses described in Week 5 or Week 6 of the course material, depending on your data type:
Notes: Analyses should be fully commented, with results shown. If the analyses include a plot, the plot should be as polished and publication-ready as possible. Different analyses are permitted if they are comparable in effort and scope to those given above.