This assignment is a prelude to the third assignment. It aims at providing you with an authentic experience in carrying a simple data science project that covers all essential stages in a data science lifecycle. Since most professional science projects are performed by teams, you are therefore required to complete this assignment in a team.
Pitch a public, open dataset of your choice.
Pitch 3 or 4 initial hypotheses to be pursued later in Assignment 3.
Profile the data using descriptive and/or inferential statistics techniques (which also requires that you demonstrate proficient data wrangling skills).
Present items 1, 2, and 3 above via a recorded presentation.
Your tasks are open-ended tasks, similar to most real data science projects. This means no two teams are likely to go to the same direction and produce similar results. You will find that your group will become experts in interpreting your own data and answering your own problems. Comparing performance across teams may not be meaningful and your team will be assessed solely against the rubric.
Your Python code base must be available on your Github repo. The extent of the group’s collaboration and individual contribution will be evaluated solely based on Github.
General advice:
Select an open (publicly available) data – data that can be freely downloaded, preferably with an open license, allowing you to share the data freely. Choosing non-public data is not advisable as your instructor may be restricted from accessing the data.
Choose data in the domain for which team member(s) has some background.
Formulate open-ended hypotheses.
Carry out fresh data and/or analysis.
Where possible, choose a dataset and formulate problems pertaining to practical Australian contexts.
It is fine to choose a dataset that has been analysed by others outside of the university. This is the natural consequence of selecting open data. However, you should either show that the analysis and exploration you plan has not been done before, or show that there is no code already available to do the analysis you intend. Your instructor is likely to view highly any original investigation.
Sources of open datasets include but are not limited to:
https://data.gov.au/
https://data.nt.gov.au/
https://data.worldbank.org/
https://www.data.gov/
https://datasetsearch.research.google.com/
https://www.kaggle.com/datasets – Be careful. Many Kaggle datasets have published analyses. Choose something that has not been done before.
Group work activities must be visible on Github Classroom.
The instructor will send an invitation to all students to join Github Classroom after all groups are formed. To accept this invitation, every student must have a free Github account. If you do not already have it, please sign up. This is compulsory.
You should refer to the detailed marking rubric that appears on the side panel of this window.
1 URL to a recorded presentation per team published privately on YouTube. Do not submit multiple recordings and do not submit recording file unless requested specifically.
(optional) supplementary information, where applicable.
Latest Python code base on GitHub repo must be accessible to your instructor. Snapshot of the repo will be taken at the time of submission.
The duration of the presentation is commensurate with the team size. Inline with the Unit Information, 2 to 3 minutes of presentation per team member is required. Not complying with this requirement may attract a mark penalty.
Example:
For a team of 3: the minimum duration is 6 minutes (2 minutes x 3 members) and the maximum duration is 9 minutes (3 minutes x 3 members).
For a team of 4: the minimum duration is 8 minutes (2 minutes x 4 members) and the maximum duration is 12 minutes (3 minutes x 4 members).
Other assessment irregularities are governed by CDU’s Higher Education Assessment Procedures.
When pitching your dataset, consider addressing the following concerns:
source of data
accesssibility of data
validity of data
why the dataset matters (in practical or academic terms)
domain knowledge
relevance to you
etc.
In profiling the data, consider addressing the following concerns:
dimensionality
data types
centrality
spread
shape of data
distributions
etc.
The last task is to pitch 3 to 4 initial hypotheses. Consider addressing the following concerns:
what might the data tells us
what would you like to explore first based on your initial data profiling
what would you like to predict
what existing assumption you want to test previous finding
what new idea you want to test
etc.
We provide professional writing services to help you score straight A’s by submitting custom written assignments that mirror your guidelines.
Get result-oriented writing and never worry about grades anymore. We follow the highest quality standards to make sure that you get perfect assignments.
Our writers have experience in dealing with papers of every educational level. You can surely rely on the expertise of our qualified professionals.
Your deadline is our threshold for success and we take it very seriously. We make sure you receive your papers before your predefined time.
Someone from our customer support team is always here to respond to your questions. So, hit us up if you have got any ambiguity or concern.
Sit back and relax while we help you out with writing your papers. We have an ultimate policy for keeping your personal and order-related details a secret.
We assure you that your document will be thoroughly checked for plagiarism and grammatical errors as we use highly authentic and licit sources.
Still reluctant about placing an order? Our 100% Moneyback Guarantee backs you up on rare occasions where you aren’t satisfied with the writing.
You don’t have to wait for an update for hours; you can track the progress of your order any time you want. We share the status after each step.
Although you can leverage our expertise for any writing task, we have a knack for creating flawless papers for the following document types.
Although you can leverage our expertise for any writing task, we have a knack for creating flawless papers for the following document types.
From brainstorming your paper's outline to perfecting its grammar, we perform every step carefully to make your paper worthy of A grade.
Hire your preferred writer anytime. Simply specify if you want your preferred expert to write your paper and we’ll make that happen.
Get an elaborate and authentic grammar check report with your work to have the grammar goodness sealed in your document.
You can purchase this feature if you want our writers to sum up your paper in the form of a concise and well-articulated summary.
You don’t have to worry about plagiarism anymore. Get a plagiarism report to certify the uniqueness of your work.
Join us for the best experience while seeking writing assistance in your college life. A good grade is all you need to boost up your academic excellence and we are all about it.
We create perfect papers according to the guidelines.
We seamlessly edit out errors from your papers.
We thoroughly read your final draft to identify errors.
Work with ultimate peace of mind because we ensure that your academic work is our responsibility and your grades are a top concern for us!
Dedication. Quality. Commitment. Punctuality
Here is what we have achieved so far. These numbers are evidence that we go the extra mile to make your college journey successful.
We have the most intuitive and minimalistic process so that you can easily place an order. Just follow a few steps to unlock success.
We understand your guidelines first before delivering any writing service. You can discuss your writing needs and we will have them evaluated by our dedicated team.
We write your papers in a standardized way. We complete your work in such a way that it turns out to be a perfect description of your guidelines.
We promise you excellent grades and academic excellence that you always longed for. Our writers stay in touch with you via email.