ITECH1103- Big Data and Analytics Group
Assignment – Semester 3, 2018
Worth – 30%
ANALYTIC REPORT (20% – Due Week 11, Sunday 11:55pm) and
PRESENTATION (10% – Due Week 10 in Tutorial Time)
Analytic Report:
Learning Outcomes Assessed: A3, K3, K6, and S2
Purpose: The purpose of this task is to give students practical experience in
working in teams to write a data analytics report that provides useful insights, patterns and
trends from the chosen/given dataset. This activity will give students the opportunity to
show innovation and creativity in applying Watson Analytics and in designing useful
visualization and predictive solutions for various analytics problems.
Group Presentation: Week 10 (Scheduled Laboratory)
Learning Outcomes Assessed: K4, A1, A2, V1, V2
Purpose: The purpose of the oral presentation is to give students an opportunity
to present the results of their data analysis and to share this knowledge while practising
their verbal communication skills.
Project Details: Assume you are working as a Content Analyst at ABC, an online
multimedia company, and your task for this analytical project is to use an analytical tool (i.e.
IBM Watson Analytics) to explore, analyse and visualize the given dataset. This dataset
contains details about different videos uploaded during the period from 2006 to 2018.
The original dataset was extracted from Kaggle.com, then modified and uploaded
to https://data.world/iamdilan/youtube-dataset. Your primary goal is to
download the modified dataset and provide varied and interesting insights in
light of the 20 guided questions listed below, along with advanced insights. The dataset can
be downloaded from the following link.
Dataset source: https://data.world/iamdilan/youtube-dataset
Data Dictionary:

Video_id – Unique identity of the video
Trending_date – Trending date of the video
Title – Name of the video
Channel_title – Name of the channel
Category_id – See the category list below (table)
Publish_date – The date on which the video was published
Time_frame – The time at which the video was uploaded/published
Publish_day_of_week – Day of the week on which the video was published
Publish_country – Country in which the video was published
Tags – Tags
Views – Number of views of the video
Likes – Number of likes of the video
Dislikes – Number of dislikes of the video
Comments_count – Number of comments for the video
Comments_disable – Whether comments are disabled or not
Ratings_disabled – Whether ratings are disabled or not
Video_error_or_removed – Whether the video has an error or has been removed
YouTube Video Category Id list:

1 – Film & Animation
2 – Autos & Vehicles
10 – Music
15 – Pets & Animals
17 – Sports
18 – Short Movies
19 – Travel & Events
20 – Gaming
21 – Videoblogging
22 – People & Blogs
23 – Comedy
24 – Entertainment
25 – News & Politics
26 – How to & Style
27 – Education
28 – Science & Technology
29 – Nonprofits & Activism
30 – Movies
31 – Anime/Animation
32 – Action/Adventure
33 – Classics
34 – Comedy
35 – Documentary
36 – Drama
37 – Family
38 – Foreign
39 – Horror
40 – Sci-Fi/Fantasy
41 – Thriller
42 – Shorts
43 – Shows
44 – Trailers
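
The assignment itself is to be completed in IBM Watson Analytics. As an optional cross-check outside that tool, the category list can be turned into a lookup so that Category_id values display as names. The following is a minimal Python sketch; the names `CATEGORY_NAMES` and `category_name` are hypothetical helpers introduced here, and it assumes the IDs in the downloaded file match the list above.

```python
# Category names copied from the YouTube Video Category Id list in this brief.
CATEGORY_NAMES = {
    1: "Film & Animation", 2: "Autos & Vehicles", 10: "Music",
    15: "Pets & Animals", 17: "Sports", 18: "Short Movies",
    19: "Travel & Events", 20: "Gaming", 21: "Videoblogging",
    22: "People & Blogs", 23: "Comedy", 24: "Entertainment",
    25: "News & Politics", 26: "How to & Style", 27: "Education",
    28: "Science & Technology", 29: "Nonprofits & Activism", 30: "Movies",
    31: "Anime/Animation", 32: "Action/Adventure", 33: "Classics",
    34: "Comedy", 35: "Documentary", 36: "Drama", 37: "Family",
    38: "Foreign", 39: "Horror", 40: "Sci-Fi/Fantasy", 41: "Thriller",
    42: "Shorts", 43: "Shows", 44: "Trailers",
}


def category_name(raw_id):
    """Return the category name for an ID read from the file, or 'Unknown'."""
    try:
        return CATEGORY_NAMES.get(int(raw_id), "Unknown")
    except (TypeError, ValueError):
        # Blank or non-numeric cells fall through to a visible placeholder.
        return "Unknown"


print(category_name("10"))
print(category_name("not a number"))
```

Note that IDs 23 and 34 are both named Comedy, so grouping by name rather than by ID would merge two categories; state whichever choice you make as an assumption in the report.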

You are expected to present the data findings in visual form (i.e., charts and graphs).
This is a group assignment. You will complete it with your team (max 3 members
enrolled in the same laboratory). It is expected that each team member will contribute
equally to the project. Each team will turn in one joint document and give a joint
presentation in the timetabled laboratory class in Week 10. In addition, each individual
team member will write a short reflection as part of the report. You will receive feedback
on the draft about presentation choices, content, analysis, and style.
The Questions
Your job is to examine the dataset and present it in a set of informative graphs and text
by answering the following questions.
Guided Questions for Dataset
1. What is the total number of uploaded videos in this dataset?
2. How many different types of uploaded categories are there?
3. What is the number of countries in this dataset?
4. What is the number of (unique) channels in this dataset?
5. Which are the top three countries, according to number of channels, in this dataset?
6. What is the lowest number of channels by country?
7. How many different unique channels are there in the US?
8. Provide a list of the top 10 viewed video titles with respect to each country.
9. Provide a list of the 10 least-viewed video titles with respect to each country.
10. How many years of uploaded videos are there in the data file?
11. How many uploaded videos have there been in the last month? (Select the last month
of the year)
12. In which year were the most videos uploaded in GB?
13. Which hour had the most uploaded videos in this dataset? Are there any differences
between countries? (time_frame)
14. What are the top 3 viewed categories in terms of number of uploaded videos?
15. What are the 3 least-viewed categories in terms of number of uploaded videos?
16. Which video has the highest percentage of likes?
17. Which video has the highest percentage of dislikes?
18. Which day has the highest number of video uploads?
19. Which day has the lowest number of video uploads?
20. What is the monthly breakdown of published videos?
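
Answers produced in Watson Analytics can be sanity-checked with a few lines of code. The sketch below is one possible cross-check in Python (standard library only), not a required method. It runs on a tiny made-up sample rather than the real file, assumes the CSV column names match the Data Dictionary exactly (they may differ in the downloaded file), and assumes "percentage of likes" means likes / (likes + dislikes), which the questions do not define. The function names are hypothetical.

```python
import csv
import io
from collections import Counter, defaultdict

# Made-up sample rows standing in for the real dataset (assumption:
# column names follow the Data Dictionary in this brief).
SAMPLE = """Video_id,Channel_title,Publish_country,Publish_day_of_week,Likes,Dislikes
a1,ChanA,US,Monday,90,10
b2,ChanB,US,Friday,30,30
c3,ChanA,GB,Friday,5,0
d4,ChanC,GB,Friday,0,0
"""


def load_rows(text):
    """Parse CSV text into a list of dictionaries keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def channels_per_country(rows):
    """Count distinct channel titles for each country (questions 5 to 7)."""
    channels = defaultdict(set)
    for row in rows:
        channels[row["Publish_country"]].add(row["Channel_title"])
    return {country: len(names) for country, names in channels.items()}


def uploads_per_day(rows):
    """Count uploads for each day of the week (questions 18 and 19)."""
    return Counter(row["Publish_day_of_week"] for row in rows)


def like_percentage(row):
    """Likes as a percentage of all ratings; None when there are no ratings."""
    likes, dislikes = int(row["Likes"]), int(row["Dislikes"])
    total = likes + dislikes
    return 100.0 * likes / total if total else None


rows = load_rows(SAMPLE)
per_country = channels_per_country(rows)
per_day = uploads_per_day(rows)
rated = [row for row in rows if like_percentage(row) is not None]
best = max(rated, key=like_percentage)

print(per_country)
print(per_day.most_common(1))
print(best["Video_id"], like_percentage(best))
```

Videos with no likes and no dislikes are excluded from the percentage ranking to avoid dividing by zero. If your team defines the percentage differently (for example, likes divided by views), record that definition as an assumption in Task 4.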
Task 1 – Background information
Write a description of the selected dataset and project, and its importance for the firm.
Information must be appropriately referenced. [1 Page]
Task 2 – Reporting / Dashboards
For your project, perform the relevant data analysis tasks by answering the above
questions and, identify the visualization and dashboards you need to develop for the
Content Manager of the indicated firm. [2-3 Pages]
Task 3 – Advanced Insights: In addition to answering the guided questions, you are expected to
provide at least five (5) further insights into the data. These insights will be judged in terms of
quality and complexity.
Task 4 – Research
Justify why these BI reporting solutions/dashboards were chosen in Task 2 (Reporting /
Dashboards) and why those dataset attributes are present and laid out in the fashion you
proposed (feel free to include all other relevant justifications).
Note: To ensure that you discuss this task properly, you must include visual samples of
the reports you produce (i.e. the screenshots of the BI report/dashboard must be
presented and explained in the written report; use ‘Snipping tool’), and also include any
assumptions that you may have made about the analysis in your Task 2 (i.e. the report to
the content manager of the company). [1-2 Pages]
Task 5 – Recommendations for Content Manager
The Content Manager would like to improve the multimedia operations. Based on your
BI analysis and the insights gained from the dataset in light of the analysis performed
in previous tasks, make some logical recommendations to the Content Manager, and
justify why/how your proposal could enhance the company’s multimedia operations and
could assist in achieving operational/strategic objectives, with the help of appropriate
references from peer-reviewed sources. [1-2 Pages]
Task 6 – Cover letter
Write a cover letter to the Content Manager with the important data insights and
recommendations to achieve operational/strategic objectives. [1 Page]
Task 7 – The Reflection: Each Team member is expected to write a brief reflection about
this project in terms of challenges, learning and contribution.
Other Tasks –
Please refer to the marking scheme at the end of the assignment for other tasks and
expectations.
Report Submission:
• Hard copy to the tutor's/lecturer's assignment box in week 10. Double-sided
printing for the hard copy is encouraged in order to save paper.
• You will also submit a 7-8 page report (about 1500 words, not counting cover
page and references) on this project. At least 15 references in your report must
be from peer-reviewed sources. Include any and all sources of information,
including any person(s) you interviewed for this project.
• Please note that all references must adhere to APA style. See
http://owl.english.purdue.edu/owl/resource/560/01 and
http://www.apastyle.org/ for details on how to format a report and how to cite
references. Make sure you follow a formal report structure with cover page,
introduction, headings, subheadings, conclusion, and reference section.
• You are reminded to read the “Plagiarism” section of the course description. Your
essay should be a synthesis of ideas from a variety of sources expressed in your
own words. All reports must use the APA referencing style. University
Referencing/Citation Style Guide: The University has published a style guide to
help students correctly reference and cite information they use in assignments
(American Psychological Association (APA) citation style,
http://www.ballarat.edu.au/aasp/student/learning_support/generalguide/print/ch06s04.shtml
or Australian citation style).
• Reports are to be presented in hard copy in size 12 Arial font and double spaced.
Your report should include a list of references used in the essay and a
bibliography of the wider reading you have done to familiarize yourself with the
topic.
• A passing grade will be awarded to assignments adequately addressing all
assessment criteria. Higher grades require better quality and more effort. For
example, a minimum is set on the wider reading required. A student reading
vastly more than this minimum will be better prepared to discuss the issues in
depth and consequently their report is likely to be of a higher quality. So before
submitting, please read through the assessment criteria very carefully.
ITECH1103- Big Data and Analytics Assignment 1
Data Analysis Marking Scheme – Percentage 20%
Due Week 11 (Sunday 11:55pm) – Hard and Soft Copies

Tasks (columns: Max Marks, Marks Awarded, Comments)

1- Background of the Project: Description of the project, datasets and firm. The
importance of the project for the firm [1+1+1+2]
Max Marks: 5

2- Dashboard/Reports: What are the BI reporting solutions/dashboards you will
need to develop for the Content Manager of the chosen firm in light of the questions of your
data analysis? [Quality and complexity of the analysis]
Max Marks: 30

3- Advanced Work: The quality and complexity of the additional five (5) insights
provided other than the guided questions.
Max Marks: 10

4- Research – Justify why these BI reporting solutions/dashboards are chosen and why
those attributes are present and laid out in the fashion you proposed (feel free to
include all other relevant justifications).
Note: To ensure that you discuss this task properly, you must include visual samples
of the reports you produce (i.e. the screenshots of the BI report/dashboard
must be presented and explained in the written report; use ‘Snipping tool’), and also
include any assumptions that you may have made about the analysis in your assignment
report (i.e. the report to the operational team of the company).
[Each analysis/dashboard and report explanation with relevant research papers,
complexity and depth of the justification, use of peer-reviewed sources]
Max Marks: 15

5- Recommendations – The Content Manager would like to improve the advertising
operations. Based on your BI analysis and the insights gained from the dataset in
light of the analysis performed in previous tasks, make some logical recommendations
to the Content Manager, and justify why/how your proposal could assist in
achieving operational/strategic objectives.
Max Marks: 15

6- The Reflection: Each team member is expected to write a brief reflection about
this project in terms of challenges, learning and contribution.
Max Marks: 10

7- Cover letter (Format, key findings and recommendation) [1+1+3]
Max Marks: 5

8- Other Tasks: Report is well-written and presented professionally, containing:
• Title page
• Table of Contents
• Introduction
• Appropriate use of headings within report
• Appropriate use of figures (i.e. graphs, summary tables) and reference to
calculations and summaries to justify all observations and recommendations
• Overall structure, presentation and formatting.
Note that the report has to be presented formally. It must include discussion of
calculations, observations and recommendations with graphs and/or tables.
Max Marks (as listed): 1, 1, 2, 1, 1, 2, 2

Total Marks: 100
Total Marks out of 20: 20%

ITECH1103- Big Data and Analytics
Assignment 1- Data Analysis Presentation Marking Scheme – Worth 10%
Due Week 10 (Scheduled Tutorial) – Hard and Soft Copies
Students are expected to create and present a 10-15 minute overview of the findings from
their Team Report.
Student IDs:
Assessment Criteria:

Criteria                                          Marks
Introduction                                      /10
Content – Guided & Advanced Questions             /40
Conclusion                                        /10
Presentation Style e.g. clarity, engagement       /20
Team participation                                /10
Timing (10-15 minutes)                            /10
Subtotal                                          /100
General Comments: