Data Engineering: Infrastructure and Applications

Overview

Subject area

DSE

Catalog Number

I2400

Course Title

Data Engineering: Infrastructure and Applications

Department(s)

Description

This course will train students in the handling of big data sources derived from various environments, including traditional business activities, web-based transactions, and social media. The course will also discuss the range of data formats, application types, and emerging approaches in data integration. As part of this, it will introduce the range of research topics and mentors participating in the Data Science and Engineering Program and offering capstone project opportunities. The course will begin with a discussion of high-end traditional database systems, focusing on query processing, crash recovery, and transaction and concurrency control. This will be followed by a detailed look at object-relational databases, distributed and federated databases, and cloud-based data warehousing. NoSQL databases (e.g., Cassandra and Neo4j) and parallel data analysis tools (e.g., Hadoop, Spark) will be introduced. The main emphasis of the course is hands-on training in state-of-the-art software development environments. Project-based system development work will be an essential component of the course.

Academic Career

Graduate

Liberal Arts

No

Credits

Minimum Units

3

Maximum Units

3

Academic Progress Units

3

Repeat For Credit

No

Components

Name

Lecture

Hours

3

Requisites

031841
