5. Using Python to Retrieve and Visualize Data
- Due Oct 29, 2015 by 1:30pm
- Points 100
- Submitting a file upload
Learning Objectives
-
Create reproducible data visualizations
-
Write and execute computer code to automate difficult and repetitive data related
tasks
-
Manipulate data and transform it across file systems, flat files, databases,
programming languages, etc.
Computer and Data Requirements
-
Data required for this assignment are the Logan River ODM database in MySQL that you created in Assignment 4.
-
You may use any Python IDE or execution environment. However, PyCharm Community Edition with Python 2.7 is recommended. You will need at least Python along with the PyMySQL and matplotlib packages. PyCharm may be obtained from the JetBrains web site: https://www.jetbrains.com/pycharm/download/ Links to an external site..
Task
Explore the data stored in the Logan River ODM using SQL statements executed through Python. Refer back to Assignment 4 and solve problems 1-4 again, this time using Python. Create two graphs using matplotlib to represent the results from Assignment 4, problems 3 and 4. You can also run additional queries on your data and make additional graphs! Use the advanced layout and graphing functions of the matplotlib library to prepare these graphs. You can refer to gallery here for ideas on how to create your graphs: http://matplotlib.org/gallery.html Links to an external site.. This assignment should be completed using a single Python script file that is well documented internally and structured following Python style guides (in general). The figures should be suitable for publication in a publication, meaning they are well-designed, clear, legible, and have appropriate axis labels.
Deliverable
1) A one-page write up that includes your two figures with captions along with the following.
-
a) A statement of the value of each figure. What can we learn about the hydrology of the Logan River from these figures?
-
b) A statement of your methodology for creating the figures.
-
c) A statement of what you found and why it is important from a hydrologic perspective.
2) Your Python code (.py file) as an appendix to your main report so that the instructors can run your code. Your Python code should include code for querying your ODM database, code for calculating percent exceedances, and code for generating the graphs you included in your report.
Note that you may need to use the text wrapping around figures functionality in Word to make everything fit onto on page.