Last Updated: 2019-04-24

Instructions

Show your work. The response to each question should include all of the commands necessary to reproduce your results, and should be commented to give a naive reader an idea of the intent behind each section of code. Answers to questions should be clearly shown in the code output or spelled out in text or comments.

Before submitting your assignment, ensure that your code for each question works correctly when the lines are run in sequence, starting with an empty environment. You may submit either this R Markdown document with your code entered into the appropriate code blocks, or a regular R script with the question numbers as comments and the answers following. If you choose to Knit the R Markdown document to HTML or PDF, include those files with your submission (but it is not required that it Knit successfully).

Question 1

Choose one of the following types of data:

  1. Ecological community data
    1. Note: Must have at least ten “sites” (can be locations, years, etc.)
  2. Likert survey data
    1. Note: Must have at least three Likert-type variables and at least one nominal demographic variable
  3. Network data
    1. Note: Must have at least one additional variable (numeric and/or nominal) aside from vertices, edges, and labels. Can be data from a social network, interaction network, bibliometric network, etc.
  4. Geospatial data
    1. Note: Must have at least two additional variables (numeric and/or nominal) aside from coordinates and labels

Find an appropriate data set of that type and load it into R.

If you are having trouble finding data, look through the example data sets given in the Course Project description or data sets attached to published papers in your field.

Important: Include the data file in your submission and include a link to its source here.

Perform any data cleaning and wrangling necessary to get your data into the format(s) required for the analyses below.

Question 2

Conduct exploratory analyses of the data, consisting of at least one descriptive statistic and at least one rough exploratory plot. Briefly describe your findings.

Question 3

Conduct at least one hypothesis test of your data. You do not need to describe the results, but it should be clear from comments and context what test is being performed and why.

Question 4

Create one linear model (regression, ANOVA, etc.), correlation matrix, distance matrix, or adjacency matrix of your data (whichever is most appropriate). Show the result.

Question 5

Conduct one of the following sequences of analyses described in Week 5 or Week 6 of the course material, depending on your data type:

  1. Ecological community data
    1. Diversity, Richness, Evenness, and Species accumulation curve
    2. Diversity, Richness, Evenness, and Multidimensional scaling plot (classical or nonmetric)
    3. Diversity, Richness, Evenness, and Geospatial plot (if spatial data are included)
    4. Principal component analysis, Principal component plot, and Regression
    5. Cluster analysis, Silhouette plot and/or Cluster plot
  2. Likert survey data
    1. Exploratory factor analysis and Confirmatory factor analysis
      1. Note: Randomly subset your data set into two halves: one for EFA and the other for CFA
    2. Principal component analysis, Principal component plot, and Regression
    3. Cluster analysis, Silhouette plot and/or Cluster plot
  3. Network data
    1. Two descriptive statistics (e.g. centrality, diameter, distance, edge density), Network visualization, Community detection
  4. Geospatial data
    1. Geospatial plot
      1. Note: Must include raster layer and at least one layer of shapefiles of your choice
    2. Geographic distance matrix, Distance matrix of additional variable for the same points, Mantel test, Hierarchical clustering plots of each distance matrix

Notes: Analyses should be fully commented, with results shown. If the analyses include a plot, the plot should be as polished and publication-ready as possible. Different analyses are permitted if they are comparable in effort and scope to those given above.



(pdf / Rmd)