Department: Data Science School
Office location and address
31 Bonnycastle Dr, Charlottesville, Virginia 22903
The course will expose students to three programming languages that are core to the field of data science. SQL will be covered first, including a discussion of its mathematical foundations and its use as a declarative language; this will likely take up half of the course. The in-demand programming languages Python and R will be covered in the second half of the class, with a focus on popular data frame packages.
An introduction to essential programming concepts, structures, and techniques. Students will gain confidence not only in reading code but also in learning what it means to write good-quality code. Essential and complementary topics are also taught, such as testing and debugging, exception handling, and an introduction to visualization. This course is project based, consisting of a semester project and final project presentations.
This course will focus on Spark, an open-source, general-purpose computing framework that is scalable and fast. Fundamental data types and concepts are covered. You will learn how to use Spark for large-scale analytics and machine learning, among other topics. Tools for data storage and retrieval are covered, including AWS.
This course covers selected special topics in data science for graduate and undergraduate students.