4. Using Structured Query Language
- Due Oct 8, 2015 by 1:30pm
- Points 100
- Submitting a file upload
Learning Objectives
- Store, retrieve, and use data from important data models used in Hydrology such as ArcHydro, NetCDF, and the Observations Data Model (ODM)
- Design and use relational databases to organize, store, and manipulate data
- Query, aggregate, and pivot data using Structured Query Language (SQL), Excel, R, and other software systems
Computer and Data Requirements
- You will need MySQL and MySQL Workbench installed to complete this assignment.
- The data needed for this assignment have been provided in a zip file: Hydroinformatics_Assignment-4_Data.zip Download Hydroinformatics_Assignment-4_Data.zip
- You need to download the data and follow the instructions for setting up your Logan River ODM database before starting the assignment: Hydroinformatics_Assignment-4_Instructions.pdf
Download Hydroinformatics_Assignment-4_Instructions.pdf
The Problem
You are working on a water quality study in the Logan River. One aspect of this study is to assess water temperature for potential impacts on cold-water fish species. The water quality standards for the State of Utah list the Logan River as Class 3A - protected for cold water species of game fish and other cold water aquatic life, including the necessary aquatic organisms in their food chain (see http://www.rules.utah.gov/publicat/code/r317/r317-002.htm Links to an external site.). Given this, state water quality standards require that water temperatures in the river not exceed 20 degrees C to protect aquatic organisms. Your job is to perform some exploratory data analysis using the water temperature datasets in the Logan River ODM database and perform analyses that may identify potential water temperature impairment. In your analysis, you should write SQL queries on the database to assemble the following:
- A table listing the period of record for water temperature measurements (e.g., begin and end date), the number of observations, and the overall minimum, maximum, and average values for each site at which quality controlled (QualityControlLevelID = 1) water temperature (VariableID = 57) data have been collected.
- A table listing the total number of temperature observations, the number of observations greater than the water quality criterion value (i.e., 20 degrees C), and the overall percent exceedence of the water quality criterion value for each site at which quality controlled water temperature data have been collected.
- A table for the Logan River at Mendon Road (SiteID = 2) listing the percent exceedence of the water quality standard for each month of the year.
- A table listing the percent exceedence of the water quality standard for each site at which quality controlled data are available during the month of July, which is generally a critical period with low flows and elevated temperatures.
At which site(s) was the overall percent exceedence the greatest? Given that the state water quality standards are set to protect the health of aquatic organisms, at which site(s) is there most likely to be temperature related effects on fish populations? How does the percent exceedence change from month to month at the Mendon Road site? How does the percent exceedence in July compare to the overall percent exceedence?
Deliverable
Submit a one-page report that introduces the problem and addresses the questions above. Provide your tables as appendices to your one page write up. Additionally, provide an appendix that lists the code of the SQL queries that you used to generate your tables (indicate which query(ies) go with which tables). In your write up, please discuss what you learned.
A PDF version of this assignment for printing purposes is here: Hydroinformatics_Assignment-4.pdf
Download Hydroinformatics_Assignment-4.pdf
An editable version of the Grading Rubric for this assignment is here: Hydroinformatics_Assignment-4_Rubric.docx
Download Hydroinformatics_Assignment-4_Rubric.docx