Data analysis for semi-structured data
Unit Code: ITO5212
Duration: 6 weeks
Contact Hours: 20-24 hours of study per week
Credit Points: 6
Description:
Semi-structured data is one of the fastest growing kinds of data in both the public and private sector. Email collections with sender-recipient graphs, metadata and text content is one example. ‘Data analysis for semi-structured data’ will explore basic forms of semi-structured data: text, time-sequence data, graphs and multiple relations in a database. You will learn to apply basic machine learning algorithms – and methodologies such as cohort analysis and market-based analysis – to solve industry problems for the application of semi-structured data.