Question.4449 - Unit 3: Self-Check Assignment 2: Milligan, Chapter 9: Clusters and Distributions This assignment builds on our previous work through Milligan, Chapter 9, and takes a deeper dive into two key data visualization techniques: clustering and distribution analysis. Clustering can be a key technique for visualizing the relationships between two or more variables, moving from single dimensions into multidimensional analysis. Distribution analysis can highlight central tendencies in a dataset (such as mean or median) and allow visualization to show outliers clearly. In this assignment, you will apply clustering and distribution analysis to analyze the relationships between several factors in a healthcare dataset by following these steps: Go through this document and use Tableau to answer all the questions listed below. Where applicable, paste screenshots into the template below. When you are ready, complete the online quiz , which verifies your homework. Use the answers you found in this document to answer the questions. When you have completed the online quiz, submit the Word document. Remember, you can always ask your instructor for help if needed. If you need to adjust the size of your visualizations to match the options in the questions, use the “Format”-> “Cell size” options. For example, “ Ctrl+Shift+B ” on a Windows computer will make the visualization bigger, and “ Ctrl+Up ” will make it taller. Attachments: Diabetes2.csv ( dataset downloaded from Kaggle ) : Pima Indians Diabetes Database For this assignment, follow these steps: Download the Pima Indians Diabetes dataset Perform exploratory data analysis (EDA) of the data in Tableau Perform clustering analysis of the data in Tableau Perform distribution analysis of the data in Tableau Download the diabetes dataset Click to access the Pima Indians Diabetes Database . You may need to sign in or register if you don’t already have an account on Kaggle. Click the “ Download ” button on the linked page above to download the dataset. Alt text: dataset You can scroll down on this web page to learn more about this dataset, including its original publication date and what the fields mean. When you download the data, it will save it as a .zip file, likely named something like " archive.zip ” ( a standard download protocol for Kaggle ) . Alt text: standard download Double-click on the “ archive ” file . I t should open to a screen like the one below (this is from a Windows machine; if you are on a Mac , it may vary slightly) . You will see that there is one file there (named diabetes.csv, highlighted with a green arrow below). If you click on the “ Extract all ” option , it will extract all the files (in this case, just the one diabetes.csv file). Save the file in a location of your choice, where it will be easy to find in order to connect with Tableau. Alt text: Tableau Open the dataset on your computer. It’s a .csv file, so it should open in Microsoft Excel, and it should look like the table below . Verify that you have these exact numbers showing up in your downloaded file : Alt text: Microsoft Excel Perform Exploratory Data Analysis (EDA) with Tableau Open the dataset in Tableau. Since this is a .csv file, and not an Excel file, you need to connect to a “more” kind of data (see image below) : Alt text: Connect Alt text: Data file When you open the file in Tableau, y ou should see something like this: Alt text: Tableau file Before we do any multidimensional clustering, it’s always a good idea to get an overview of each of the data fields : There are 768 data rows here, each corresponding to a different member of the Pima Indian Tribe. The table contains each member ’s number of pregnancies, blood glucose, blood pressure, and other health measurements. All the way to the right, the table also contains an outcome variable : 0 if the tribe member was not diagnosed with diabetes 1 if the tribe member was diagnosed with diabetes For example, let’s look at the Outcome for the first member . This person has had six pregnancies with a glucose reading of 148 . I f we scroll all the way to the right, we see the age listed as 50 and the Outcome listed as 1. This means this tribe member was diagnosed with diabetes. Alt text: outcome Question 1: Understanding the Data In the diabetes dataset, Row 8 contains a tribe member who reported ten pregnancies. Which other data fields correspond with this person? Glucose of 115, BMI 35.3, Outcome : No Diabetes Glucose of 115, BMI 35.3, Outcome : Diabetes Glucose of 168, BMI 38.0, Outcome : Diabetes Glucose of 168, BMI 38.0, Outcome : No Diabetes Glucose of 139, BMI 27.1, Outcome : No Diabetes None of these Question 1 Answer: Let’s explore the Pima Indians Diabetes Database a little bit more . Anytime you get a new dataset, it’s a good idea to run some general descriptives to see what the data look like. One of the major tools we use is a histogram . Make a histogram of the age . On a new worksheet ( below , Sheet 1), we first pull the Age to the Rows, and then , in the “Show Me” tab , we select the histogram icon . Alt text: histogram Alt text: histogram Alt text: histogram Note that when it makes the histogram, under the Data tab, you can see a new variable under Tables called “Age (bin).” This is the histogram bin for your Age variable. If we look at Sheet 1, we can see that most of our data points report an Age of between 20 and 30. We also have a few people in their 40s and 50s, and many fewer older people. Tableau does the best it can to guess at “good” bin sizes for your dataset, but it doesn’t always guess exactly the way a human would like. Go to the right of the green oval under the Data -> Tables -> Age (bin), pull that menu down, and select Edit to see the exact bin sizes Tableau is suggesting here. Alt text: Data tables We can see here that Tableau is suggesting we group people in age groups of 4.69 years, and that our first bin is suggested to start at age 18.76. So the first bin will contain ages 21, 22, and 23. The second bin, due to rounding issues, contains ages 24, 25, 26, 27, and 28! Let’s update the bins so that they are of bin size 5, so that our yearly increments go in nice, round age buckets. Alt text: Edit bins If we update the bin size to 5, we can now see that the histogram has adjusted slightly. The first bin contains ages 20, 21, 22, 23 and 24. The second bin contains ages 25, 26, 27, 28, and 29. The histogram is “smoother” in the jump between the 35–40 and 40–45 age bins. Alt text: Bar graph Sometimes, the axis labels aren’t obvious. To be sure of which bin you are viewing, you can click on a bin, and it will display the information. Below, we have clicked on one of the bars and learned it is the bin for ages 35 (and contains ages 35, 36, 37, 38, and 39 but not age 40). If you are curious, you can right-click on the bin and choose “View Data” and then “Full Data” to see the exact data points which make up that bar. Alt text: bar graph Alt text: histogram On a new s heet in Tableau, make a histogram of the BMI variable. Adjust the bin sizes so that they are size 5. Question 2: Understanding the Bin Sizes In your BMI histogram, what values are in the most frequent bin? 30 – 34.9 30 – 35 30.40, 135 30.40 – 33.30 none of these Question 2 Answer: Question 3: Understanding Questionable Values In your BMI histogram, are there any values that make you question the data and wonder if those should be filtered out? Yes; the data looks normally distributed, like a bell curve, and this is not expected for these sorts of measurements Yes; there are too many counts of a BMI of 40 and higher, which is much larger than expected Yes; there are 11 counts of a BMI of 0, which is probably not an accurate measure No; the data looks normally distributed, like a bell curve, and this is expected for these sorts of measurements Question 3 Answer: On a new s heet in Tableau, make a histogram of the Glucose variable. Adjust the bin sizes so they are size 10. (Don’t filter out any Glucose measurements at this point.) Question 4: Understanding the most frequent bins In your Glucose histogram, what values are in the most frequent bin? 100 through 107, including 100 and 107 100 through 109, including 100 and 109 100 through 110, including 100 and 110 110 through 120, including 110 and 120 120 through 129, including 120 and 129 Question 4 Answer: Question 5 – Understanding the bin values and counts The most frequent Glucose measurement is about 100. What is the second most frequent Glucose measurement? bin 100, count 117 bin 110, count 94 bin 110, c ount 105 bin 120, count 102 Question 5 Answer: Question 6: Understanding the shape of the data In your Glucose histogram, how would you describe the overall shape of this data? The average/median is about 100, and symmetric. It’s not skewed at all. The average/median is about 100, and it’s skewed to the right . The average/median is about 100, and it’s skewed to the left . It is bimodal : there are two distinct centers of glucose measurements, probably one for diabetes diagnoses and one for non-diabetes diagnoses. N one of these Question 6 Answer: On a new Sheet in Tableau, make a histogram of the Insulin variable. Adjust the bin sizes so that they are size 50. Question 7: Understanding the Descriptives of the Data In your Insulin histogram, how would you describe the overall descriptives of this data? Zero insulin is meaningless, so there must be some mistakes . This data is normally distributed and follows a bell curve symmetric shape . This data is bimodal. We can see two distinct populations : those with diabetes and those without. The most frequent amount of insulin is between 0 and 49, but a few people report large levels of insulin, at 400 units or above. N one of these Question 7 Answer: In your Insulin histogram, drag Outcome to the Color under Marks. Remember that Outcome=0 means No Diabetes, and Outcome=1 means Diabetes. Now go to the leftmost bin, right-click on it to View Data, and then look at the Full Data. How would you describe the data points in this bin? Check all that apply. Question 8: Two-Dimensional Descriptives of the Data How would you describe the data points in the leftmost bin in terms of insulin and outcome? The most frequent amount of insulin is 0, but there are some values here between 0 and 49. All the Insulin values here are 0. All the Outcome values here are 0 (no Diabetes). The values here average to 25. This contains only values from people with No Diabetes as their Outcome. If you have d iabetes , you need insulin. N one of these Question 8 Answer: Now , let’s look at the Outcome variable. We could make a histogram of this binary variable, but that’s not very satisfying. Let’s recode it to be a text variable. Instead of having to remember “0 means No Diabetes,” wouldn’t it be easier to just have the words “No Diabetes” showing on the screen? First, make a new worksheet. Then, under Outcome, choose Create -> Calculated Field. Alt text: outcome text Fill it out as follows: You want to create a new field called “Outcome_Text,” and we want it to be “Diabetes” if the Outcome variable was a 1, 0 otherwise, and “NA” if, for some weird reason, the Outcome variable was neither a 1 nor a 0. If you want to copy and paste the formula with all the glorious brackets and parentheses, here it is: IF ([Outcome]=1) THEN "Diabetes" ELSEIF ([Outcome]=0) THEN "No Diabetes" ELSE "NA" END Alt text: copy and paste formula Let’s check that it coded correctly: How many Outcome=0 do we have? Let’s look at a histogram of the original Outcome variable. Looks like we have 500 in the Outcome=0 bin and 268 in the Outcome=1 bin. (Those of you who are following along at home on your calculators will notice this is about 65% No Diabetes, 35% Diabetes.) Alt text: outcome How does our Outcome_Text = “No Diabetes” or “Diabetes” stack up against our original binary variable? Let’s drag the Outcome_Text to the Color and also to the Rows. Alt text: outcome We can see success! The orange bar is marked “No Diabetes” in the legend, and it is showing on the left (per the binary Outcome variable = 0), while the blue bar is marked “Diabetes” in the legend and is showing on the right (per the binary Outcome variable = 1 ) . Question 9: Understanding the Questions to Ask We are studying this dataset to try to understand diabetes in the Pima Indian tribe. We have a dataset which contains about 35% diabetes diagnoses. Which research statement(s) does this data look like it might be able to answer? Check all that apply. Why do people with zero insulin recorded still have a diabetes diagnosis? What aspects of the modern diet cause diabetes? For the Pima Indian population listed in this dataset, are there any relationships between diabetes, age , and BMI which might be interesting? For the Pima Indian population listed in this dataset, are there any relationships between glucose, insulin, and diabetes diagnosis which might be interesting? For the Pima Indian population listed in this dataset, are there any relationships between household income, gender, and diabetes diagnosis which might be interesting? This is not enough data to ask anything; we really need to go to the CDC and get millions of rows of data. N one of these Question 9 Answer: Let’s go back to our Age histogram. Does d iabetes diagnosis change with age? Pull the Outcome_Text (not the Outcome binary variable, but the Outcome_Text) as a color and also as an additional row variable. We will see something like this: Alt text: histogram From this histogram, we can see that age starts off young on the left and goes to older on the right. The Diabetes population is on the top in blue, and the Non-Diabetes population is on the bottom in orange. We can see that while both groups cover most of the full age range, the Non-Diabetes population has a lot of young people in it while the Diabetes population has a larger percentage of its population in the older age brackets. This may give rise to a hypothesis: Does increasing age bring with it a likelihood of diabetes diagnosis among this population? Question 10: Understanding Stacked Histograms Go back to your BMI histogram. Be sure the bin sizes are still 5. (You can just ignore any BMI of 0; this is probably a data error.) Repeat steps similar to the Age histogram analysis we just did. Which statements do your stacked histograms support for the Pima Indian population from this dataset? Check all that apply. The BMI for the No Diabetes group appears to be lower than the BMI for the Diabetes group if we go by the histogram midpoint . If you look at the BMI bin which contains BMI measures from 40.0 through 44.9, there are about the same number (within 5 people) in the Diabetes and No Diabetes categories. The BMI for the No Diabetes group is heavily skewed in favor of a BMI below 20. T he BMI for the Diabetes group is heavily skewed in favor of a BMI of 45 or higher. If you look at the BMI bin which contains BMI measures from 20.0 through 24.9, there are about the same number (within 5 people) in the Diabetes and No Diabetes categories. N one of these Question 10 Answer: Perform Clustering of the Diabetes Data in Tableau We have completed our EDA (exploratory data analysis) on the diabetes data. We are beginning to understand the shape of individual variables such as Age and BMI, and also their relationship to a diabetes diagnosis. Our next step is to run two-dimensional xy scatterplots and then cluster them to see if we can uncover additional relationships. Let’s start investigating Age and BMI. Make a new s heet in Tableau. Drag Age (not the Age(bin) but the plain old Age from the Measures in the Data Values area) to the Columns, and drag BMI to the Rows. Tableau will give you SUM(Age) and SUM(BMI) and probably one single data point graphed. We circled in red for you below. Alt text: graph We want to see all the data points, so let’s Disaggregate Measures. You can do this under the Analysis Menu. (This is the fix anytime you expect lots of data points and you only have one.) Alt text: Analysis We can now see an XY scatterplot of BMI vs . Age , with Age on the x-axis. Apply a filter for BMI so that BMI is allowed to be between 1 and the highest value (this removes BMI values of 0) . Alt text: scatterplot We want to do two-dimensional clustering. Are there distinct groups, such as Younger people with lower BMI? Younger people with higher BMI? To do this in Tableau, we need to switch from the Data tab on the left to the Analytics tab. From there, under the Model options, we want “Cluster.” Alt text: cluster Drag the “Cluster” tool into the middle of your XY scatterplot: Alt text: cluster Tableau will automatically create some clusters for you. It groups similar data points together so that people of a similar age and BMI will be in the same cluster while people with different ages and different BMI will be in different clusters. Just looking visually at the four clusters Tableau automatically made, we can see the yellow one on the bottom left is younger people with lower BMI while the aqua one on the right-hand side is older people with lower-to-medium BMI. The red cluster is younger people with higher BMI. Alt text: cluster If you like, you can experiment and drag the Clusters Mark onto the Shape Mark , and then the clusters will be distinguished by multiple X, O, and + icons and others which do not require color to differentiate them. We can get numeric descriptives on our clusters to help us understand them. Go to Clusters -> Describe Clusters. Here, we see that there are four clusters. Cluster 1 has 197 people in it, the median age is about 41 years old, and the BMI in this cluster is centered around about 34. Alt text: cluster Alt text: clustering Question 11: Cluster Analysis for BMI vs. Age Look at the BMI vs. Age Clusters created above, with a model of 4 clusters. If you had a data point with an age of 27 and a BMI of 42, which cluster would you expect to be the best match? Cluster 1 Cluster 2 Cluster 3 Cluster 4 N one of these Question 11 Answer: Let’s stay with our BMI vs. Age Clusters. Let’s say we want more clusters in order to more finely analyze our data. Under the Marks menu, go to Clusters -> Edit Clusters , and change the number of clusters to 10. Alt text: Clusters Question 12: Changing the Numbers of Clusters Look at your new clusters, with 10 of them now on your XY scatterplot. What is the average age of those in the cluster with the highest BMI? About 26 About 29 About 51 About 52 Cannot determine from available information Question 12 Answer: Let’s make a new sheet and investigate Glucose vs. Insulin. Make an XY scatterplot with Insulin on the x-axis (because we can administer insulin) and Glucose on the y-axis (because that’s our outcome variable) Add filters to remove 0 values for Glucose (because a zero blood - glucose reading does not make sense ) . Do not add filters for insulin . I t’s OK if the amount of insulin administered is zero. Have Tableau make 3 clusters. Question 13: Mapping Clusters to Measurements Normal blood glucose levels are about 100 in a fasting non-diabetic adult. Which cluster best represents this? Cluster 1, with an average Glucose reading of about 150 and an average Insulin value of about 54 Cluster 2, with an average Glucose reading of about 100 and an average Insulin value of about 52 Cluster 2, with an average Glucose reading of about 100 and 220 data points in it Cluster 3, with an average Glucose reading of about 161 and 347 data points in it Question 13 Answer: Stay on your Glucose vs. Insulin sheet, but let’s add another piece of information. Drag you r Outcome_Text variable (the one which declares “Diabetes” or “No Diabetes”) to the Columns area. This should now give you two panes of graphs, one with Diabetes and one with No Diabetes. Alt text: Clusters Question 14: Adding Classification to Clusters You should have two XY scatterplots of your Glucose vs. Insulin clusters, one with Diabetes and one with No Diabetes. Which statements would you support after inspecting these visualizations? Choose all that apply: Cluster 1 has an average Glucose level of about 150 in both the Diabetes and No Diabetes classifications. Cluster 1, with an average Glucose reading of about 150 and an average Insulin value of about 54 . Cluster 2, which is generally lower insulin usage and lower Glucose levels, has some people with a Diabetes classification, but many more with a No Diabetes classification. Cluster 3 has very high insulin , very high Glucose levels, and only contains people with a Diabetes classification. Cluster 3, with an average Glucose reading of about 161 and 347 data points in it Question 14 Answer: Perform Distribution Analysis with Tableau We have now done EDA (exploratory data analysis) on this dataset, and we’ve also done some cluster analysis to look at relationships between two variables. Now we are going to look at distribution analysis. Sometimes it’s helpful to look for outliers, average levels, or other general distribution characteristics of a dataset. A Distribution Band can visually display that information. Let’s go back to our BMI vs. Age dataset and graph. Remove any clusters and keep a filter on so that Tableau only displays data where the BMI is > 0 (do not display data with a BMI = 0 ) . Go the Analytics tab . U nder Custom, choose Distribution Band. You want to drag this option to Table (Page) for this demonstration. Alt text: cluster table You will be given some options. For this one, you want the scope to be the entire table, and we want +/- 1 Standard Deviation. (You will recall from statistics that if your data are normally distributed, about two-thirds of it will be within +/-1 standard deviation. This means that about 1/6 is above +1 STDEV , and about 1/6 is below -1 STDEV . So if something is “outside” of those bounds, it’s “a little bit different from average.”) Alt text: Standard deviation That last step will put a reference band on the graph. It’s now easy to see which Age data points are “close to the average” (they are inside the grey band ) and which data points are “outside of the average” (they are outside of the grey band.) Alt text: Cluster Let’s go back and put on a reference band of +/-1 STDEV for the BMI as well. We get something like the following : Alt text: cluster Question 15: Understanding the Distribution Bands Look at the distribution band for the BMI vs. Age scatterplot. Match the area with its description. Four Quadrants: A, B, C, D Four Options: Lower age, Lower BMI Lower age, Higher BMI Higher age, Lower BMI Higher age, Higher BMI Question 15 Answer: Go make a new s heet, and this time , make a graph of Glucose vs. Insulin. Review these reminders: Insulin should be on the x-axis Glucose should be on the y-axis No clusters Filter so Glucose is 1 or higher Do not filter on Insulin (OK if Insulin values are 0) Add one +/- 1 STDEV Distribution band for the Insulin Add another +/- 1 STDEV Distribution band for the Glucose (you will do well to add them one at a time, at the Table level) You should get something that looks similar to this: Alt text: cluster Question 16: Understanding the Distribution Band Quadrants Look at your diagram. Look for the quadrant which encompasses “high Insulin, high Glucose” outliers. These data points will be outside both grey reference bands. How would you characterize the Diabetes/No Diabetes diagnosis of these data points? (Hint: consider using the “Outcome_Text” variable as a distinguishing Color Mark or Shape Mark.) There are both Diabetes and No Diabetes data points here, but most of them are Diabetes There are only Diabetes data points here There is a 50/50 mix of Diabetes and No Diabetes data points here There are not really very many data points here at all – maybe one or two Question 16 Answer:
Answer Below:
Unit xxxxxxxxxx Assignment xxxxxxxx Chapter xxxxxxxx and xxxxxxxxxxxxx This xxxxxxxxxx builds xx our xxxxxxxx work xxxxxxx Milligan xxxxxxx and xxxxx a xxxxxx dive xxxx two xxx data xxxxxxxxxxxxx techniques xxxxxxxxxx and xxxxxxxxxxxx analysis xxxxxxxxxx can xx a xxx technique xxx visualizing xxx relationships xxxxxxx two xx more xxxxxxxxx moving xxxx single xxxxxxxxxx into xxxxxxxxxxxxxxxx analysis xxxxxxxxxxxx analysis xxx highlight xxxxxxx tendencies xx a xxxxxxx such xx mean xx median xxx allow xxxxxxxxxxxxx to xxxx outliers xxxxxxx In xxxx assignment xxx will xxxxx clustering xxx distribution xxxxxxxx to xxxxxxx the xxxxxxxxxxxxx between xxxxxxx factors xx a xxxxxxxxxx dataset xx following xxxxx steps xx through xxxx document xxx use xxxxxxx to xxxxxx all xxx questions xxxxxx below xxxxx applicable xxxxx screenshots xxxx the xxxxxxxx below xxxx you xxx ready xxxxxxxx the xxxxxx quiz xxxxx verifies xxxx homework xxx the xxxxxxx you xxxxx in xxxx document xx answer xxx questions xxxx you xxxx completed xxx online xxxx submit xxx Word xxxxxxxx Remember xxx can xxxxxx ask xxxx instructor xxx help xx needed xx you xxxx to xxxxxx the xxxx of xxxx visualizations xx match xxx options xx the xxxxxxxxx use xxx Format x Cell xxxx options xxx example xxxx Shift x on x Windows xxxxxxxx will xxxx the xxxxxxxxxxxxx bigger xxx Ctrl xx will xxxx it xxxxxx Attachments xxxxxxxx csv xxxxxxx downloaded xxxx Kaggle xxxx Indians xxxxxxxx Database xxx this xxxxxxxxxx follow xxxxx steps xxxxxxxx the xxxx Indians xxxxxxxx dataset xxxxxxx exploratory xxxx analysis xxx of xxx data xx Tableau xxxxxxx clustering xxxxxxxx of xxx data xx Tableau xxxxxxx distribution xxxxxxxx of xxx data xx Tableau xxxxxxxx the xxxxxxxx dataset xxxxx to xxxxxx the xxxx Indians xxxxxxxx Database xxx may xxxx to xxxx in xx register xx you xxx t xxxxxxx have xx account xx Kaggle xxxxx the xxxxxxxx button xx the xxxxxx page xxxxx to xxxxxxxx the xxxxxxx Alt xxxx dataset xxx can xxxxxx down xx this xxx page xx learn xxxx about xxxx dataset xxxxxxxxx its xxxxxxxx publication xxxx and xxxx the xxxxxx mean xxxx you xxxxxxxx the xxxx it xxxx save xx as x zip xxxx likely xxxxx something xxxx archive xxx a xxxxxxxx download xxxxxxxx for xxxxxx Alt xxxx standard xxxxxxxx Double-click xx the xxxxxxx file x t xxxxxx open xx a xxxxxx like xxx one xxxxx this xx from x Windows xxxxxxx if xxx are xx a xxx it xxx vary xxxxxxxx You xxxx see xxxx there xx one xxxx there xxxxx diabetes xxx highlighted xxxx a xxxxx arrow xxxxx If xxx click xx the xxxxxxx all xxxxxx it xxxx extract xxx the xxxxx in xxxx case xxxx the xxx diabetes xxx file xxxx the xxxx in x location xx your xxxxxx where xx will xx easy xx find xx order xx connect xxxx Tableau xxx text xxxxxxx Open xxx dataset xx your xxxxxxxx It x a xxx file xx it xxxxxx open xx Microsoft xxxxx and xx should xxxx like xxx table xxxxx Verify xxxx you xxxx these xxxxx numbers xxxxxxx up xx your xxxxxxxxxx file xxx text xxxxxxxxx Excel xxxxxxx Exploratory xxxx Analysis xxx with xxxxxxx Open xxx dataset xx Tableau xxxxx this xx a xxx file xxx not xx Excel xxxx you xxxx to xxxxxxx to x more xxxx of xxxx see xxxxx below xxx text xxxxxxx Alt xxxx Data xxxx When xxx open xxx file xx Tableau x ou xxxxxx see xxxxxxxxx like xxxx Alt xxxx Tableau xxxx Before xx do xxx multidimensional xxxxxxxxxx it x always x good xxxx to xxx an xxxxxxxx of xxxx of xxx data xxxxxx There xxx data xxxx here xxxx corresponding xx a xxxxxxxxx member xx the xxxx Indian xxxxx The xxxxx contains xxxx member x number xx pregnancies xxxxx glucose xxxxx pressure xxx other xxxxxx measurements xxx the xxx to xxx right xxx table xxxx contains xx outcome xxxxxxxx if xxx tribe xxxxxx was xxx diagnosed xxxx diabetes xx the xxxxx member xxx diagnosed xxxx diabetes xxx example xxx s xxxx at xxx Outcome xxx the xxxxx member xxxx person xxx had xxx pregnancies xxxx a xxxxxxx reading xx I x we xxxxxx all xxx way xx the xxxxx we xxx the xxx listed xx and xxx Outcome xxxxxx as xxxx means xxxx tribe xxxxxx was xxxxxxxxx with xxxxxxxx Alt xxxx outcome xxxxxxxx Understanding xxx Data xx the xxxxxxxx dataset xxx contains x tribe xxxxxx who xxxxxxxx ten xxxxxxxxxxx Which xxxxx data xxxxxx correspond xxxx this xxxxxx Glucose xx BMI xxxxxxx No xxxxxxxx Glucose xx BMI xxxxxxx Diabetes xxxxxxx of xxx Outcome xxxxxxxx Glucose xx BMI xxxxxxx No xxxxxxxx Glucose xx BMI xxxxxxx No xxxxxxxx None xx these xxxxxxxx Answer x Glucose xx BMI xxxxxxx No xxxxxxxx Let x explore xxx Pima xxxxxxx Diabetes xxxxxxxx a xxxxxx bit xxxx Anytime xxx get x new xxxxxxx it x a xxxx idea xx run xxxx general xxxxxxxxxxxx to xxx what xxx data xxxx like xxx of xxx major xxxxx we xxx is x histogram xxxx a xxxxxxxxx of xxx age xx a xxx worksheet xxxxx Sheet xx first xxxx the xxx to xxx Rows xxx then xx the xxxx Me xxx we xxxxxx the xxxxxxxxx icon xxx text xxxxxxxxx Alt xxxx histogram xxx text xxxxxxxxx Note xxxx when xx makes xxx histogram xxxxx the xxxx tab xxx can xxx a xxx variable xxxxx Tables xxxxxx Age xxx This xx the xxxxxxxxx bin xxx your xxx variable xx we xxxx at xxxxx we xxx see xxxx most xx our xxxx points xxxxxx an xxx of xxxxxxx and xx also xxxx a xxx people xx their x and x and xxxx fewer xxxxx people xxxxxxx does xxx best xx can xx guess xx good xxx sizes xxx your xxxxxxx but xx doesn x always xxxxx exactly xxx way x human xxxxx like xx to xxx right xx the xxxxx oval xxxxx the xxxx - xxxxxx - xxx bin xxxx that xxxx down xxx select xxxx to xxx the xxxxx bin xxxxx Tableau xx suggesting xxxx Alt xxxx Data xxxxxx We xxx see xxxx that xxxxxxx is xxxxxxxxxx we xxxxx people xx age xxxxxx of xxxxx and xxxx our xxxxx bin xx suggested xx start xx age xx the xxxxx bin xxxx contain xxxx and xxx second xxx due xx rounding xxxxxx contains xxxx and xxx s xxxxxx the xxxx so xxxx they xxx of xxx size xx that xxx yearly xxxxxxxxxx go xx nice xxxxx age xxxxxxx Alt xxxx Edit xxxx If xx update xxx bin xxxx to xx can xxx see xxxx the xxxxxxxxx has xxxxxxxx slightly xxx first xxx contains xxxx and xxx second xxx contains xxxx and xxx histogram xx smoother xx the xxxx between xxx and xxx bins xxx text xxx graph xxxxxxxxx the xxxx labels xxxx t xxxxxxx To xx sure xx which xxx you xxx viewing xxx can xxxxx on x bin xxx it xxxx display xxx information xxxxx we xxxx clicked xx one xx the xxxx and xxxxxxx it xx the xxx for xxxx and xxxxxxxx ages xxx but xxx age xx you xxx curious xxx can xxxxxxxxxxx on xxx bin xxx choose xxxx Data xxx then xxxx Data xx see xxx exact xxxx points xxxxx make xx that xxx Alt xxxx bar xxxxx Alt xxxx histogram xx a xxx s xxxx in xxxxxxx make x histogram xx the xxx variable xxxxxx the xxx sizes xx that xxxx are xxxx Question xxxxxxxxxxxxx the xxx Sizes xx your xxx histogram xxxx values xxx in xxx most xxxxxxxx bin xxxx of xxxxx Question xxxxxx Question xxxxxxxxxxxxx Questionable xxxxxx In xxxx BMI xxxxxxxxx are xxxxx any xxxxxx that xxxx you xxxxxxxx the xxxx and xxxxxx if xxxxx should xx filtered xxx Yes xxx data xxxxx normally xxxxxxxxxxx like x bell xxxxx and xxxx is xxx expected xxx these xxxxx of xxxxxxxxxxxx Yes xxxxx are xxx many xxxxxx of x BMI xx and xxxxxx which xx much xxxxxx than xxxxxxxx Yes xxxxx are xxxxxx of x BMI xx which xx probably xxx an xxxxxxxx measure xx the xxxx looks xxxxxxxx distributed xxxx a xxxx curve xxx this xx expected xxx these xxxxx of xxxxxxxxxxxx Question xxxxxx C xxx there xxx counts xx a xxx of xxxxx is xxxxxxxx not xx accurate xxxxxxx On x new x heet xx Tableau xxxx a xxxxxxxxx of xxx Glucose xxxxxxxx Adjust xxx bin xxxxx so xxxx are xxxx Don x filter xxx any xxxxxxx measurements xx this xxxxx Question xxxxxxxxxxxxx the xxxx frequent xxxx In xxxx Glucose xxxxxxxxx what xxxxxx are xx the xxxx frequent xxx through xxxxxxxxx and xxxxxxx including xxx through xxxxxxxxx and xxxxxxx including xxx through xxxxxxxxx and xxxxxxxx Answer xxxxxxx including xxx Question xxxxxxxxxxxxx the xxx values xxx counts xxx most xxxxxxxx Glucose xxxxxxxxxxx is xxxxx What xx the xxxxxx most xxxxxxxx Glucose xxxxxxxxxxx bin xxxxx bin xxxxx bin x ount xxx count xxxxxxxx Answer x bin xxxxx Question xxxxxxxxxxxxx the xxxxx of xxx data xx your xxxxxxx histogram xxx would xxx describe xxx overall xxxxx of xxxx data xxx average xxxxxx is xxxxx and xxxxxxxxx It x not xxxxxx at xxx The xxxxxxx median xx about xxx it x skewed xx the xxxxx The xxxxxxx median xx about xxx it x skewed xx the xxxx It xx bimodal xxxxx are xxx distinct xxxxxxx of xxxxxxx measurements xxxxxxxx one xxx diabetes xxxxxxxxx and xxx for xxxxxxxxxxxx diagnoses x one xx these xxxxxxxx Answer xxx average xxxxxx is xxxxx and xx s xxxxxx to xxx right xx a xxx Sheet xx Tableau xxxx a xxxxxxxxx of xxx Insulin xxxxxxxx Adjust xxx bin xxxxx so xxxx they xxx size xxxxxxxx Understanding xxx Descriptives xx the xxxx In xxxx Insulin xxxxxxxxx how xxxxx you xxxxxxxx the xxxxxxx descriptives xx this xxxx Zero xxxxxxx is xxxxxxxxxxx so xxxxx must xx some xxxxxxxx This xxxx is xxxxxxxx distributed xxx follows x bell xxxxx symmetric xxxxx This xxxx is xxxxxxx We xxx see xxx distinct xxxxxxxxxxx those xxxx diabetes xxx those xxxxxxx The xxxx frequent xxxxxx of xxxxxxx is xxxxxxx and xxx a xxx people xxxxxx large xxxxxx of xxxxxxx at xxxxx or xxxxx N xxx of xxxxx Question xxxxxx D xxx most xxxxxxxx amount xx insulin xx between xxx but x few xxxxxx report xxxxx levels xx insulin xx units xx above xx your xxxxxxx histogram xxxx Outcome xx the xxxxx under xxxxx Remember xxxx Outcome xxxxx No xxxxxxxx and xxxxxxx mea xx Diabetes xxx go xx the xxxxxxxx bin xxxxxxxxxxx on xx to xxxx Data xxx then xxxx at xxx Full xxxx How xxxxx you xxxxxxxx the xxxx points xx this xxx Check xxx that xxxxx Question xxxxxxxxxxxxxxx Descriptives xx the xxxx How xxxxx you xxxxxxxx the xxxx points xx the xxxxxxxx bin xx terms xx insulin xxx outcome xxx most xxxxxxxx amount xx insulin xx but xxxxx are xxxx values xxxx between xxx All xxx Insulin xxxxxx here xxx All xxx Outcome xxxxxx here xxx no xxxxxxxx The xxxxxx here xxxxxxx to xxxx contains xxxx values xxxx people xxxx No xxxxxxxx as xxxxx Outcome xx you xxxx d xxxxxxx you xxxx insulin x one xx these xxxxxxxx Answer xxx most xxxxxxxx amount xx insulin xx but xxxxx are xxxx values xxxx between xxx Now xxx s xxxx at xxx Outcome xxxxxxxx We xxxxx make x histogram xx this xxxxxx variable xxx that x not xxxx satisfying xxx s xxxxxx it xx be x text xxxxxxxx Instead xx having xx remember xxxxx No xxxxxxxx wouldn x it xx easier xx just xxxx the xxxxx No xxxxxxxx showing xx the xxxxxx First xxxx a xxx worksheet xxxx under xxxxxxx choose xxxxxx - xxxxxxxxxx Field xxx text xxxxxxx text xxxx it xxx as xxxxxxx You xxxx to xxxxxx a xxx field xxxxxx Outcome xxxx and xx want xx to xx Diabetes xx the xxxxxxx variable xxx a xxxxxxxxx and xx if xxx some xxxxx reason xxx Outcome xxxxxxxx was xxxxxxx a xxx a xx you xxxx to xxxx and xxxxx the xxxxxxx with xxx the xxxxxxxx brackets xxx parentheses xxxx it xx IF xxxxxxx THEN xxxxxxxx ELSEIF xxxxxxx THEN xx Diabetes xxxx NA xxx Alt xxxx copy xxx paste xxxxxxx Let x check xxxx it xxxxx correctly xxx many xxxxxxx do xx have xxx s xxxx at x histogram xx the xxxxxxxx Outcome xxxxxxxx Looks xxxx we xxxx in xxx Outcome xxx and xx the xxxxxxx bin xxxxx of xxx who xxx following xxxxx at xxxx on xxxx calculators xxxx notice xxxx is xxxxx No xxxxxxxx Diabetes xxx text xxxxxxx How xxxx our xxxxxxx Text xx Diabetes xx Diabetes xxxxx up xxxxxxx our xxxxxxxx binary xxxxxxxx Let x drag xxx Outcome xxxx to xxx Color xxx also xx the xxxx Alt xxxx outcome xx can xxx success xxx orange xxx is xxxxxx No xxxxxxxx in xxx legend xxx it xx showing xx the xxxx per xxx binary xxxxxxx variable xxxxx the xxxx bar xx marked xxxxxxxx in xxx legend xxx is xxxxxxx on xxx right xxx the xxxxxx Outcome xxxxxxxx Question xxxxxxxxxxxxx the xxxxxxxxx to xxx We xxx studying xxxx dataset xx try xx understand xxxxxxxx in xxx Pima xxxxxx tribe xx have x dataset xxxxx contains xxxxx diabetes xxxxxxxxx Which xxxxxxxx statement x does xxxx data xxxx like xx might xx able xx answer xxxxx all xxxx apply xxx do xxxxxx with xxxx insulin xxxxxxxx still xxxx a xxxxxxxx diagnosis xxxx aspects xx the xxxxxx diet xxxxx diabetes xxx the xxxx Indian xxxxxxxxxx listed xx this xxxxxxx are xxxxx any xxxxxxxxxxxxx between xxxxxxxx age xxx BMI xxxxx might xx interesting xxx the xxxx Indian xxxxxxxxxx listed xx this xxxxxxx are xxxxx any xxxxxxxxxxxxx between xxxxxxx insulin xxx diabetes xxxxxxxxx which xxxxx be xxxxxxxxxxx For xxx Pima xxxxxx population xxxxxx in xxxx dataset xxx there xxx relationships xxxxxxx household xxxxxx gender xxx diabetes xxxxxxxxx which xxxxx be xxxxxxxxxxx This xx not xxxxxx data xx ask xxxxxxxx we xxxxxx need xx go xx the xxx and xxx millions xx rows xx data x one xx these xxxxxxxx Answer x D xxx s xx back xx our xxx histogram xxxx d xxxxxxx diagnosis xxxxxx with xxx Pull xxx Outcome xxxx not xxx Outcome xxxxxx variable xxx the xxxxxxx Text xx a xxxxx and xxxx as xx additional xxx variable xx will xxx something xxxx this xxx text xxxxxxxxx From xxxx histogram xx can xxx that xxx starts xxx young xx the xxxx and xxxx to xxxxx on xxx right xxx Diabetes xxxxxxxxxx is xx the xxx in xxxx and xxx Non-Diabetes xxxxxxxxxx is xx the xxxxxx in xxxxxx We xxx see xxxx while xxxx groups xxxxx most xx the xxxx age xxxxx the xxxxxxxxxxxx population xxx a xxx of xxxxx people xx it xxxxx the xxxxxxxx population xxx a xxxxxx percentage xx its xxxxxxxxxx in xxx older xxx brackets xxxx may xxxx rise xx a xxxxxxxxxx Does xxxxxxxxxx age xxxxx with xx a xxxxxxxxxx of xxxxxxxx diagnosis xxxxx this xxxxxxxxxx Question xxxxxxxxxxxxx Stacked xxxxxxxxxx Go xxxx to xxxx BMI xxxxxxxxx Be xxxx the xxx sizes xxx still xxx can xxxx ignore xxx BMI xx this xx probably x data xxxxx Repeat xxxxx similar xx the xxx histogram xxxxxxxx we xxxx did xxxxx statements xx your xxxxxxx histograms xxxxxxx for xxx Pima xxxxxx population xxxx this xxxxxxx Check xxx that xxxxx The xxx for xxx No xxxxxxxx group xxxxxxx to xx lower xxxx the xxx for xxx Diabetes xxxxx if xx go xx the xxxxxxxxx midpoint xx you xxxx at xxx BMI xxx which xxxxxxxx BMI xxxxxxxx from xxxxxxx there xxx about xxx same xxxxxx within xxxxxx in xxx Diabetes xxx No xxxxxxxx categories xxx BMI xxx the xx Diabetes xxxxx is xxxxxxx skewed xx favor xx a xxx below x he xxx for xxx Diabetes xxxxx is xxxxxxx skewed xx favor xx a xxx of xx higher xx you xxxx at xxx BMI xxx which xxxxxxxx BMI xxxxxxxx from xxxxxxx there xxx about xxx same xxxxxx within xxxxxx in xxx Diabetes xxx No xxxxxxxx categories x one xx these xxxxxxxx Answer x D xxxxxxx Clustering xx the xxxxxxxx Data xx Tableau xx have xxxxxxxxx our xxx exploratory xxxx analysis xx the xxxxxxxx data xx are xxxxxxxxx to xxxxxxxxxx the xxxxx of xxxxxxxxxx variables xxxx as xxx and xxx and xxxx their xxxxxxxxxxxx to x diabetes xxxxxxxxx Our xxxx step xx to xxx two-dimensional xx scatterplots xxx then xxxxxxx them xx see xx we xxx uncover xxxxxxxxxx relationships xxx s xxxxx investigating xxx and xxx Make x new x heet xx Tableau xxxx Age xxx the xxx bin xxx the xxxxx old xxx from xxx Measures xx the xxxx Values xxxx to xxx Columns xxx drag xxx to xxx Rows xxxxxxx will xxxx you xxx Age xxx SUM xxx and xxxxxxxx one xxxxxx data xxxxx graphed xx circled xx red xxx you xxxxx Alt xxxx graph xx want xx see xxx the xxxx points xx let x Disaggregate xxxxxxxx You xxx do xxxx under xxx Analysis xxxx This xx the xxx anytime xxx expect xxxx of xxxx points xxx you xxxx have xxx Alt xxxx Analysis xx can xxx see xx XY xxxxxxxxxxx of xxx vs xxx with xxx on xxx x-axis xxxxx a xxxxxx for xxx so xxxx BMI xx allowed xx be xxxxxxx and xxx highest xxxxx this xxxxxxx BMI xxxxxx of xxx text xxxxxxxxxxx We xxxx to xx two-dimensional xxxxxxxxxx Are xxxxx distinct xxxxxx such xx Younger xxxxxx with xxxxx BMI xxxxxxx people xxxx higher xxx To xx this xx Tableau xx need xx switch xxxx the xxxx tab xx the xxxx to xxx Analytics xxx From xxxxx under xxx Model xxxxxxx we xxxx Cluster xxx text xxxxxxx Drag xxx Cluster xxxx into xxx middle xx your xx scatterplot xxx text xxxxxxx Tableau xxxx automatically xxxxxx some xxxxxxxx for xxx It xxxxxx similar xxxx points xxxxxxxx so xxxx people xx a xxxxxxx age xxx BMI xxxx be xx the xxxx cluster xxxxx people xxxx different xxxx and xxxxxxxxx BMI xxxx be xx different xxxxxxxx Just xxxxxxx visually xx the xxxx clusters xxxxxxx automatically xxxx we xxx see xxx yellow xxx on xxx bottom xxxx is xxxxxxx people xxxx lower xxx while xxx aqua xxx on xxx right-hand xxxx is xxxxx people xxxx lower-to-medium xxx The xxx cluster xx younger xxxxxx with xxxxxx BMI xxx text xxxxxxx If xxx like xxx can xxxxxxxxxx and xxxx the xxxxxxxx Mark xxxx the xxxxx Mark xxx then xxx clusters xxxx be xxxxxxxxxxxxx by xxxxxxxx X x and xxxxx and xxxxxx which xx not xxxxxxx color xx differentiate xxxx We xxx get xxxxxxx descriptives xx our xxxxxxxx to xxxx us xxxxxxxxxx them xx to xxxxxxxx - xxxxxxxx Clusters xxxx we xxx that xxxxx are xxxx clusters xxxxxxx has xxxxxx in xx the xxxxxx age xx about xxxxx old xxx the xxx in xxxx cluster xx centered xxxxxx about xxx text xxxxxxx Alt xxxx clustering xxxxxxxx Cluster xxxxxxxx for xxx vs xxx Look xx the xxx vs xxx Clusters xxxxxxx above xxxx a xxxxx of xxxxxxxx If xxx had x data xxxxx with xx age xx and x BMI xx which xxxxxxx would xxx expect xx be xxx best xxxxx Cluster xxxxxxx Cluster xxxxxxx N xxx of xxxxx Question xxxxxx C xxx s xxxx with xxx BMI xx Age xxxxxxxx Let x say xx want xxxx clusters xx order xx more xxxxxx analyze xxx data xxxxx the xxxxx menu xx to xxxxxxxx - xxxx Clusters xxx change xxx number xx clusters xx Alt xxxx Clusters xxxxxxxx Changing xxx Numbers xx Clusters xxxx at xxxx new xxxxxxxx with xx them xxx on xxxx XY xxxxxxxxxxx What xx the xxxxxxx age xx those xx the xxxxxxx with xxx highest xxx About xxxxx About xxxxx Cannot xxxxxxxxx from xxxxxxxxx information xxxxxxxx Answer x Let x make x new xxxxx and xxxxxxxxxxx Glucose xx Insulin xxxx an xx scatterplot xxxx Insulin xx the xxxxxx because xx can xxxxxxxxxx insulin xxx Glucose xx the xxxxxx because xxxx s xxx outcome xxxxxxxx Add xxxxxxx to xxxxxx values xxx Glucose xxxxxxx a xxxx blood x glucose xxxxxxx does xxx make xxxxx Do xxx add xxxxxxx for xxxxxxx I x s xx if xxx amount xx insulin xxxxxxxxxxxx is xxxx Have xxxxxxx make xxxxxxxx Question xxxxxxx Clusters xx Measurements xxxxxx blood xxxxxxx levels xxx about xx a xxxxxxx non-diabetic xxxxx Which xxxxxxx best xxxxxxxxxx this xxxxxxx with xx average xxxxxxx reading xx about xxx an xxxxxxx Insulin xxxxx of xxxxx Cluster xxxx an xxxxxxx Glucose xxxxxxx of xxxxx and xx average xxxxxxx value xx about xxxxxxx with xx average xxxxxxx reading xx about xxx data xxxxxx in xx Cluster xxxx an xxxxxxx Glucose xxxxxxx of xxxxx and xxxx points xx it xxxxxxxx Answer x Stay xx your xxxxxxx vs xxxxxxx sheet xxx let x add xxxxxxx piece xx information xxxx you x Outcome xxxx variable xxx one xxxxx declares xxxxxxxx or xx Diabetes xx the xxxxxxx area xxxx should xxx give xxx two xxxxx of xxxxxx one xxxx Diabetes xxx one xxxx No xxxxxxxx Alt xxxx Clusters xxxxxxxx Adding xxxxxxxxxxxxxx to xxxxxxxx You xxxxxx have xxx XY xxxxxxxxxxxx of xxxx Glucose xx Insulin xxxxxxxx one xxxx Diabetes xxx one xxxx No xxxxxxxx Which xxxxxxxxxx would xxx support xxxxx inspecting xxxxx visualizations xxxxxx all xxxx apply xxxxxxx has xx average xxxxxxx level xx about xx both xxx Diabetes xxx No xxxxxxxx classifications xxxxxxx with xx average xxxxxxx reading xx about xxx an xxxxxxx Insulin xxxxx of xxxxx Cluster xxxxx is xxxxxxxxx lower xxxxxxx usage xxx lower xxxxxxx levels xxx some xxxxxx with x Diabetes xxxxxxxxxxxxxx but xxxx more xxxx a xx Diabetes xxxxxxxxxxxxxx Cluster xxx very xxxx insulin xxxx high xxxxxxx levels xxx only xxxxxxxx people xxxx a xxxxxxxx classification xxxxxxx with xx average xxxxxxx reading xx about xxx data xxxxxx in xx Question xxxxxx C x Perform xxxxxxxxxxxx Analysis xxxx Tableau xx have xxx done xxx exploratory xxxx analysis xx this xxxxxxx and xx ve xxxx done xxxx cluster xxxxxxxx to xxxx at xxxxxxxxxxxxx between xxx variables xxx we xxx going xx look xx distribution xxxxxxxx Sometimes xx s xxxxxxx to xxxx for xxxxxxxx average xxxxxx or xxxxx general xxxxxxxxxxxx characteristics xx a xxxxxxx A xxxxxxxxxxxx Band xxx visually xxxxxxx that xxxxxxxxxxx Let x go xxxx to xxx BMI xx Age xxxxxxx and xxxxx Remove xxx clusters xxx keep x filter xx so xxxx Tableau xxxx displays xxxx where xxx BMI xx do xxx display xxxx with x BMI xx the xxxxxxxxx tab x nder xxxxxx choose xxxxxxxxxxxx Band xxx want xx drag xxxx option xx Table xxxx for xxxx demonstration xxx text xxxxxxx table xxx will xx given xxxx options xxx this xxx you xxxx the xxxxx to xx the xxxxxx table xxx we xxxx - xxxxxxxx Deviation xxx will xxxxxx from xxxxxxxxxx that xx your xxxx are xxxxxxxx distributed xxxxx two-thirds xx it xxxx be xxxxxx - xxxxxxxx deviation xxxx means xxxx about xx above xxxxx and xxxxx is xxxxx - xxxxx So xx something xx outside xx those xxxxxx it x a xxxxxx bit xxxxxxxxx from xxxxxxx Alt xxxx Standard xxxxxxxxx That xxxx step xxxx put x reference xxxx on xxx graph xx s xxx easy xx see xxxxx Age xxxx points xxx close xx the xxxxxxx they xxx inside xxx grey xxxx and xxxxx data xxxxxx are xxxxxxx of xxx average xxxx are xxxxxxx of xxx grey xxxx Alt xxxx Cluster xxx s xx back xxx put xx a xxxxxxxxx band xx - xxxxx for xxx BMI xx well xx get xxxxxxxxx like xxx following xxx text xxxxxxx Question xxxxxxxxxxxxx the xxxxxxxxxxxx Bands xxxx at xxx distribution xxxx for xxx BMI xx Age xxxxxxxxxxx Match xxx area xxxx its xxxxxxxxxxx Four xxxxxxxxx A x C x Four xxxxxxx Lower xxx Lower xxx Lower xxx Higher xxx Higher xxx Lower xxx Higher xxx Higher xxx Question xxxxxx C x D x Go xxxx a xxx s xxxx and xxxx time xxxx a xxxxx of xxxxxxx vs xxxxxxx Review xxxxx reminders xxxxxxx should xx on xxx x-axis xxxxxxx should xx on xxx y-axis xx clusters xxxxxx so xxxxxxx is xx higher xx not xxxxxx on xxxxxxx OK xx Insulin xxxxxx are xxx one x STDEV xxxxxxxxxxxx band xxx the xxxxxxx Add xxxxxxx - xxxxx Distribution xxxx for xxx Glucose xxx will xx well xx add xxxx one xx a xxxx at xxx Table xxxxx You xxxxxx get xxxxxxxxx that xxxxx similar xx this xxx text xxxxxxx Question xxxxxxxxxxxxx the xxxxxxxxxxxx Band xxxxxxxxx Look xx your xxxxxxx Look xxx the xxxxxxxx which xxxxxxxxxxx high xxxxxxx high xxxxxxx outliers xxxxx data xxxxxx will xx outside xxxx grey xxxxxxxxx bands xxx would xxx characterize xxx Diabetes xx Diabetes xxxxxxxxx of xxxxx data xxxxxx Hint xxxxxxxx using xxx Outcome xxxx variable xx a xxxxxxxxxxxxxx Color xxxx or xxxxx Mark xxxxx are xxxx Diabetes xxx No xxxxxxxx data xxxxxx here xxx most xx them xxx Diabetes xxxxx are xxxx Diabetes xxxx points xxxx There xx a xxx of xxxxxxxx and xx Diabetes xxxx points xxxx There xxx not xxxxxx very xxxx data xxxxxx here xx all xxxxx one xx two xxxxxxxx Answer xMore Articles From Data visualization