University of the Cumberlands Data Science and Big Data Analysis Discussion

User Generated

Xvena1076

Computer Science

University of the Cumberlands

Description

In previous weeks you discussed an industry and problems within that industry.

This is an opportunity to expand on that work within the context of the business analytics lifecycle and connect theory to practice.

Construct an essay specific to your industry and a potential, specific problem to be solved that outlines your exploratory data analytics approach.

(a) Review the Kaggle website (https://www.kaggle.com/datasets) or any public dataset such as those obtained from Google dataset research (https://datasetsearch.research.google.com/). Choose a dataset that closely aligns to the problem you wish to solve. Provide a link to the dataset.

  • (b) Identify five types of data that would be useful in solving the problem you identified.
  • (c) Discuss your exploratory data approach. In your discussion include mention of a least one alternative approach that you believe would be inappropriate.

Explanation & Answer:
1250 Words
User generated content is uploaded by users for the purposes of learning and should be used following Studypool's honor code & terms of service.

Explanation & Answer

Thanks for working with you

Running Head: EXPLORATORY DATA ANALYTICS

1

An Exploratory Data Analytics Approach for the Devices Lost in the Past six Days

Name

Institutional Affiliation

EXPLORATORY DATA ANALYTICS

2

Introduction
Exploratory Data Analytics (EDA) approach is a big topic incorporating a lot of
theoretical and practical content in the field of statistics. Many institutions/industries deploy
EDA for various purposes, such as checking assumptions and detecting mistakes in data from an
experiment/research. EDA knowledge is therefore, essential for industry because we do various
research and experiments. In our industry, for example, we collected data regarding the loss of
the company’s electric devices, and we shall use EDA to do the first analysis of the data. Despite
the availability of having many EDA approaches, multivariate graphical, univariate graphical,
multivariate non-graphical, and univariate non-graphical, the multivariate non-graphical
technique is the best to analyze the data we collected on the devices lost in our industry because
we have more than one parameter that we do not have to present in graphical forms.
Background Literature
EDA is a terminology that different people use different clauses to define. However,
according to 1.1.1. What is EDA? (n.d), EDA is a data analysis philosophy/approach employing
several techniques to test underlying assumptions, detect outliers and anomalies, extract
important variables, and uncover underlying structure. The definition mentioned above
incorporates what EDA does, thus implying its importance. Based on the type of data, there exist
several approaches that can be employed, as shown in the next paragraph.
Even though a person can do EDA in their way, there exist four standard techniques of
EDA. There are four standard techniques of EDA namely, multivariate graphical, univariate
graphical, multivariate non-graphical, and univariate non-graphical (Chapter 4, n.d., p.62).
Generally, univariate methods focus on one data colum...


Anonymous
Super useful! Studypool never disappoints.

Studypool
4.7
Trustpilot
4.5
Sitejabber
4.4
Similar Content
Related Tags