Question.4690 - Module 2 Deliverable: DataAssignment InstructionsGeneral Instructions:Academic papers chosen must be either published in an academic journal or posted and distributed on SSRN.All academic papers used in an assignment must be cited.No academic paper may be used for more than one assignment, i.e. different academic papers must be used for each assignment/activity.Academic papers should be chosen based on your research area of interest and come from a diverse set of journal outlets. Moreover, for each paper chosen a pdf copy of the article needs to be saved within your research materials for use later in the program.If Generative AI is used, both the source (BARD, ChatGPT) and the prompt (the set of commands fed to the source) must be cited.Module 2 Assignment/Activity:Theme: Data is messy and unwieldy, its handling, understanding and presentation make all thedifference.Locate and download two distinct datasets (preferably) in an area of research interest to you.For each dataset, house the data in a medium you can access readily: Excel, flat file, etc. and perform the following summary statistics on them. Note this can be executed in any statistical package you wish: Excel, R, etc.Cite the exact source of the datasets you chose.Identify how the data is organized, time-series, cross-section, etc., and provide a snapshot of the raw data.Identify the definition and units of the variables.Calculate the number of observations, max, mean median, min, standard deviation and histogram of key series.Graphs the key series in a manner you feel communicates the nature of your data.Draft a paragraph (no bullet points) description of one of the datasets that could be included in the (data section of a research paper) which:identifies key characteristics of the data, andprovides summary statistics key for the reader to understand the dataset.Reflect on what aspects of this assignment were challenging.
Answer Below:
First xxxxxxx HYPERLINK xxxxx www xxxxxx com xxxxxxxx iammustafatz xxxxxxxxxxxxxxxxxxxxxxxxxxx data xxxxx www xxxxxx com xxxxxxxx iammustafatz xxxxxxxxxxxxxxxxxxxxxxxxxxx data xxx dataset xx organized xx a xxxxxxxxxxxxxxx manner xx consists xx columns xxxx representing x different xxxxxxxx related xx patient xxxxxx gender xxx patient's xxxxxx Categorical xxxxxxxx Female xxx Male xxx The xxxxxxxxx age xxxxxxxxxx variable xxxxxxx value xx years xxxxxxxxxxxx A xxxxxx variable xxxxxxxxxx whether xxx patient xxx hypertension xx Yes xxxxx disease x binary xxxxxxxx indicating xxxxxxx the xxxxxxx has xxxxx disease xx Yes xxxxxxx history xxx patient's xxxxxxx history xxxxxxxxxxx variable xxxxx No xxxx current xxx The xxxxxxxxx body xxxx index xxxxxxxxxx variable xx m xxx c xxxxx The xxxxxxxxx HbA x level xxxxxxxxxx variable xxxxx glucose xxxxx The xxxxxxxxx blood xxxxxxx level xxxxxxxxxx variable xx dL xxxxxxxx The xxxxxx variable xxxxxxxxxx whether xxx patient xxx diabetes xx Yes xxx dataset xxxxxxxx valid xxxxxxx with xx missing xxxx The xxxxxxx age xx the xxxxxx in xxx study xx years xxx the xxxxxx age xx years xxxxxxx most xxxxxx are xx their x When xx comes xx health xxxxxxxxxx only x small xxxxxx of xxxxxx have xxxxxxxxxxxx high xxxxx pressure xxxx an xxxxxxx of xxxxxxx most xxxxxx don x have xx Similarly xxxx a xxx have xxxxx disease xxxx an xxxxxxx of xxxx individuals xxxx a xxx Body xxxx Index xx which xxxx them xx the xxxxxxxxxx category xxxxxxx and xxx average xxx c xxxxx a xxxxxxx of xxxxx sugar xxxxxxx is xxxxx is xxxxxx a xxxxxxx range xxxxxx there xx some xxxxxxxxx Blood xxxxx levels xxxxx from x low xx mg xx to x high xx mg xx with xx average xx mg xx This xxxxx suggests xxxx there xxx some xxxxxx at xxxx for xxxxxxxxxx like xxxxxxxx One xx the xxxxxxxxxx in xxxxxxx with xxxx dataset xxx making xxxx the xxxx was xxxxxxxx especially xxxx some xxxxxxx values xxxx an xxxxxxxxx low xxxxxxx age xxx a xxxx blood xxxxx level xx mg xx which xxx need xxxxxx attentionMore Articles From Quantitative Analysis