CSE 8091 - Fall 2011
Advanced Scientific Computing with R
Introduction
Scientific computing applies computational methods to scientific and
engineering problems. This course will help you exploit the power of R, a
freely available language and environment for statistical computing and
graphics, to boost your research with state of the art data analysis and
visualization. R is currently the 2nd most widely used environment for data
analysis/mining beating the well-known commercial tools IBM/SPSS, SAS and
Matlab (2011 KDDnuggets Survey) and the most used programing language for
data mining and analytics (another 2011 KDDnuggets Survey).
We will cover R basics and programming; simulating, analyzing and visualizing
data; vectorization; interfacing algorithms in C/C++/Java; easy creation of
scientific documents with Sweave; packaging R software for reuse; and advanced
topics as needed.
Part of the course is a project focusing on each participant's research
interest/need.
Presentations
- Introduction
- Objects, arrays and lists
- Loops, apply and functions
- Plots and Visualization
- Creating simulated data
- Objects
(S3 object system,
S4 object system)
-
Anatomy of packages (package TSP)
- Basic regression and classification models (code example)
- Computation using multiple-cores or a cluster (multi-core example, cluster example)
- Graphs (igraph example, map.R)
Projects
- Andy: Infrastructure for Genetic Sequence Classification
of 16S sequences (Nov 14)
- Akshaya: Recommender Systems - Mahout and rJava (Nov 21)
- Hadil: Genetic Sequence Classification (Nov 21)
- Mak: Reengineering a Ranking (Oct 31)
- Matt: Interaction between R and PHP with rServe (Oct 31)
- Ruili: Software Reliability Growth Analysis with R (Oct 31)
- Sudheer: Border security application using spatial data mining (Nov 7)
- Xiaodian: Forecasting for Financial Data (Nov 21)
Reading
Tools and Software
Linux/Ubuntu
This is the preferred installation! You can
do one of the following:
- Install Ubuntu on a spare partition to create a dual boot.
- Install Virtual Box and download my Ubuntu image. See
here.
In Ubuntu you can use the "Synaptics Package Manager"
(under "System Settings") to install "R-base", "g++" and any other software you
need.
Windows
You can download R for Windows
from CRAN.
Useful Links
Michael Hahsler
Last modified: