Big Data Management course study summary
a. Summarize and reflect what you have learned from this course about data parallelism and task parallelism
b. Design a full example
Map Reduce 5-stage solutions) illustrating your understanding of data parallelism. This example’s solution format should be similar to class exercises 3 or 4 but the scenario and raw data must be totally different;
3) Tools and techniques for big data
a. Using a table, compare and contrast between MPP and SMP in terms of their strengths,
weaknesses and application areas;
b. Find another big data platform similar to Hadoop. Compare (using a table) this platform with Hadoop in terms of features, strengths and weaknesses.
4) Data governance for big data management [15 marks]
a. Summarize and reflect what you have learned from this course about the five key concepts for big data oversight;
b. Illustrate your understanding of dimensions for measuring the quality of information used for BDA with one example for each dimension;
5) NoSQL Database [20 marks]
a. Using a table, compare and contrast
b. Find one workable column-oriented NoSQL database product in the IT vendors’ market, and describe it including features and price. Critically analyze why and how this No SQL database can be used inside the case of “What does Big Data have to do with an owl?