CS 350/550 is an introduction to the representation, visualization, and modeling of large data sets using standard, high-level software tools. The course is designed to expose students to tools and methods useful to conduct analysis of large data sets often encountered in science and engineering pursuits. The goal of this course is to help students understand how they might summarize and interpret data, identify non-trivial facts and patterns in that data, and how to make predictions based on that data. Topics include summarizing data, making predictions from data, and finding hidden relationships in data. Familiarity with spreadsheet software is assumed and students should be able to construct simple programs in C like languages (C, C++, Java, etc.). Knowledge of basic statistics and either Matlab or Octave is desirable, but not required.
College of Engineering and Computer Science