CSE, Lyle School of Eng., SMU

CSE 8091 - Fall 2011
Advanced Scientific Computing with R

> Instructor: Dr. Michael Hahsler
> Time: M 3:00-3:50pm, Caruth 383
> Course Syllabus

Introduction

Scientific computing applies computational methods to scientific and engineering problems. This course will help you exploit the power of R, a freely available language and environment for statistical computing and graphics, to boost your research with state of the art data analysis and visualization. R is currently the 2nd most widely used environment for data analysis/mining beating the well-known commercial tools IBM/SPSS, SAS and Matlab (2011 KDDnuggets Survey) and the most used programing language for data mining and analytics (another 2011 KDDnuggets Survey).

We will cover R basics and programming; simulating, analyzing and visualizing data; vectorization; interfacing algorithms in C/C++/Java; easy creation of scientific documents with Sweave; packaging R software for reuse; and advanced topics as needed.

Part of the course is a project focusing on each participant's research interest/need.

Presentations

  1. Introduction
  2. Objects, arrays and lists
  3. Loops, apply and functions
  4. Plots and Visualization
  5. Creating simulated data
  6. Objects (S3 object system, S4 object system)
  7. Anatomy of packages (package TSP)
  8. Basic regression and classification models (code example)
  9. Computation using multiple-cores or a cluster (multi-core example, cluster example)
  10. Graphs (igraph example, map.R)

Projects

Reading

Tools and Software

Linux/Ubuntu

This is the preferred installation! You can do one of the following:

In Ubuntu you can use the "Synaptics Package Manager" (under "System Settings") to install "R-base", "g++" and any other software you need.

Windows

You can download R for Windows from CRAN.

Useful Links


Michael Hahsler
Last modified: