Autumn 2020 STAT 37820.

Course: STAT 37820

Title: Statistical Computing B

Instructor(s): Mei Wang

Teaching Assistant(s): TBA

Class Schedule: Sec 01: TR 9:40 AM–11:00 AM (second half of quarter), location TBA

Description: Statistical Computing B focuses on common data technologies used in statistical computing and in data science more broadly. The course takes place in the second half of the autumn quarter, after STAT 37810 (Statistical Computing A). Topics include storing and accessing large data sets; a basic working knowledge of relational databases and their query language, SQL; an introduction to distributed file systems, with example usage of Hadoop; Python and its applications in text analysis; access to and use of high-performance computing clusters; rudimentary parallel computing; and web data access. XML and JavaScript may be used occasionally. A short introduction to SAS will be given if time permits. The main computing software will be Python, with some R.

Prerequisite(s): Instructor Consent. STAT 37810 recommended.