Creative Commons Attribution Share Alike 4.0 International (CC BY-SA 4.0) All course material is provided under the CC Attribution Share Alike license.

Course Description

This course places an emphasis on working with data, databases, and performing and interpreting descriptive analytics in the context of contemporary, data-rich decision making environments including various engineering and management applications. The course will introduce the use of databases (database management systems and SQL) and discuss the functions of practical (e.g., corporate or public) data warehouses while placing an emphasis on formal methods of descriptive analytics including data preparation, visualization, and interpretation.

Prerequisites: CS 1341, OREM 3340/CS/STAT 4340

Details and the syllabus can be found on Canvas.

Course Outline

Introduction

Spreadsheets

SQL

The Entity-Relationship Model

Data Handling with R

Introduction

Programming Basics

Tidyverse and ggplot2

Exploring Data and Reporting

Modeling in R

Textbooks

The textbooks are not required.

  • A First Course in Database Systems (3rd Edition), Jeffrey D. Ullman, Jennifer Widom, Pearson (2007). ISBN-10: 013600637X [Companion page]

  • R for Data Science: Import, Tidy, Transform, Visualize, and Model Data, by Hadley Wickham andGarrett Grolemund, O’Reilly Media; 1st edition (January 17, 2017), ISBN-10: 1491910399. Free Web book

Code and Data Used in Class

Learning Resources