KEMBAR78
MapReduce Big Data Assignment | PDF | Map Reduce | Computer Science
0% found this document useful (0 votes)
164 views2 pages

MapReduce Big Data Assignment

This document contains an assignment for a course on managing and modeling big data. It includes two questions to write MapReduce programs for. The first question involves performing inner and full outer joins on two tables. The second question involves finding the friends of friends for a given person using friendship data. Students must complete the assignment in a group of 4 members and submit a single zip file containing their MapReduce code by the specified deadline.

Uploaded by

Alaa Mdina
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
164 views2 pages

MapReduce Big Data Assignment

This document contains an assignment for a course on managing and modeling big data. It includes two questions to write MapReduce programs for. The first question involves performing inner and full outer joins on two tables. The second question involves finding the friends of friends for a given person using friendship data. Students must complete the assignment in a group of 4 members and submit a single zip file containing their MapReduce code by the specified deadline.

Uploaded by

Alaa Mdina
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Cairo University

Faculty of Computers and Artificial Intelligence


Managing and Modelling Big Data (2022/2023)

Assignment 1

Question 1:

Consider the following data items where each item represents a record on a table; each record is
have 3 values (Table name, PK, AttributeValue). The database schema of the tables can be described
as follow:

T1(A1,A2)

T2(A3,A1)

The data items are as follow:

(T1, 1, x)
(T1, 2, x)
(T1, 3, x)
(T1, 4, y)
(T1, 5, y)
(T1, 6, y)
(T1, 7, z)
(T1, 8, z)
(T2, A, 1)
(T2, B, 2)
(T2, C, 3)

1- Write a map/reduce program to do an inner join between T1 and T2 where A1 in T1 is a


foreign key in T2.
The query results should be:
(A, 1,X)
(B, 2,X)
(C, 3,X)

2- Write a map/reduce program to do a full outer join between T1 and T2 where A1 in T1 is a


foreign key in T2.
The query results should be:

(A, 1,X)
(B, 2,X)
(C, 3,X)
(null, 4, y)
(null, 5, y)
(null, 6, y)
(null, 7, z)
(null, 8, z)

3- Write a map/reduce program to find out the difference between two attributes. For
example : A1[T1] – A1[T2] , The result would [4,5,6,7,8]

Question 2:

Consider the following data items where each item represents a friendship relationship between
persons:

(P1,P2)
(P1,P3)
(P3,P4)
(P2,P4)
(P2,P5)
Write a map/reduce program to find the friends of friends for a given person

For example: find the friends of P1 friends

P1 friends would be: P2, P3


Friends of P2 and P3 are (P4, P5)

Important notes:
• This is a group assignment of 4 members (at most) and the members should be from the
same group/lab.
• All team members should work and fully understand everything in the assignment even if
you distributed the questions, you should understand your colleague’s questions.
• The due date is on Saturday, 25th of March until 11:55 pm. No late submission is allowed. No
submission through e-mails.
• Your submission should include a four .jar files for the 4 problems in the assignment.
• Add all jar files to a folder and compress it to a .zip. Rename the .zip file to be
GroupNum_firstStudID_SecondStudID_ThirdStudID_FourthStudID. The compressed file
would be the file to be delivered.
• Do not share your code with anyone, so that no other student would take your files and
submit it under their names.
• Any cheating will be graded ZERO for both teams.
• Each team should discuss the assignment with his/her lab TA. Any team member who misses
attending the discussion will take zero.

You might also like